pack and unpack in Python's struct module - 1032120121 - ChinaUnix Blog


pack and unpack in Python's struct module

Category: Python/Ruby

2010-05-19 13:13:30

Format	C Type	Python	Notes
`x`	pad byte	no value
`c`	`char`	string of length 1
`b`	`signed char`	integer
`B`	`unsigned char`	integer
`h`	`short`	integer
`H`	`unsigned short`	integer
`i`	`int`	integer
`I`	`unsigned int`	long
`l`	`long`	integer
`L`	`unsigned long`	long
`q`	`long long`	long	(1)
`Q`	`unsigned long long`	long	(1)
`f`	`float`	float
`d`	`double`	float
`s`	`char[]`	string
`p`	`char[]`	string
`P`	`void *`	integer

Note (1): the `q` and `Q` conversion codes are available in native mode only if the platform C compiler supports C `long long` (or `__int64` on Windows); they are always available in standard modes.

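The table above is copied verbatim from the docs in the Python installation directory; I searched online for ages before realizing it was there. (It reflects Python 2: in Python 3 the `c`, `s`, and `p` codes work with bytes, and every integer code maps to int.)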
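To make the table concrete, here is a small sketch for Python 3. The values are chosen purely for illustration; the `<` prefix selects standard sizes so the numbers are the same on every machine.

```python
import struct

# Standard sizes of a few format codes ('<' turns off native sizes and padding).
for code in "bhiqfd":
    print(code, struct.calcsize("<" + code))

# Pack a signed char, an unsigned short and a double, then read them back.
packed = struct.pack("<bHd", -1, 65535, 1.5)
values = struct.unpack("<bHd", packed)
print(len(packed), values)
```

Each format character consumes one argument in pack and produces one element of the tuple returned by unpack.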

Reposted from http://hi.baidu.com/pythond/blog/item/f66b49556d884d50d009067d.html

Using pack and unpack in struct

Reposted from http://wwty.javaeye.com/blog/401414

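For the past two days I have been working on a TCP protocol. All the data is transmitted in binary and has to be decoded, so I ended up using struct.
I came across this line of code: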

length = struct.unpack('>I', self.buffer[:4])[0]

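At first I did not understand what format=">I" meant. A Google search turned up a few mentions, but they were all too vague to make it click, so I gritted my teeth and read the API documentation: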
By default, C numbers are represented in the machine’s native format and byte order, and properly aligned by skipping pad bytes if necessary (according to the rules used by the C compiler).
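In other words, by default the values are laid out the way the C compiler on this machine would lay them out: native sizes, native byte order, and pad bytes inserted wherever alignment requires them.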
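The effect of native alignment can be seen with struct.calcsize. This is only a sketch: the native result depends on the platform and compiler, so the code prints it rather than assuming a value.

```python
import struct

# A char followed by an int, laid out with the C compiler's padding.
native = struct.calcsize("@ci")
# The same two fields in standard mode: no padding, 1 + 4 bytes.
standard = struct.calcsize("<ci")
print(native, standard)
```

On many common platforms the native size is larger because pad bytes are inserted so that the int starts on an aligned address.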

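Alternatively, the first character of the format string can be used to indicate the byte order, size and alignment of the packed data.
That is, the first character of the format string is an optional prefix that selects byte order, size, and alignment together: `@` is native (the default), `=` is native byte order with standard sizes, `<` is little-endian, `>` is big-endian, and `!` is network order (big-endian).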

Native byte order is big-endian or little-endian, depending on the host system. For example, Motorola and Sun processors are big-endian; Intel and DEC processors are little-endian.
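In short, the native byte order depends on the host CPU, so data that travels between machines should state its byte order explicitly rather than rely on the native default.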
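A short sketch of how the byte-order prefix changes the bytes produced for the same number (the value 1 is just an illustration):

```python
import struct
import sys

print(sys.byteorder)            # 'little' or 'big', depending on the host

big = struct.pack(">I", 1)      # most significant byte first
little = struct.pack("<I", 1)   # least significant byte first
native = struct.pack("=I", 1)   # host byte order, standard size
print(big, little, native)
```

Unpacking with the wrong prefix does not raise an error; it silently yields a different number, which is why the prefix matters for network data.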

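That explains format=">I": interpret the bytes in big-endian order as an unsigned int, which Python returns as an integer (an int or long in Python 2). But that raises the next question: how do you know that what you are reading is an int or a long?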

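The struct documentation defines the format rules in two tables. As I read them, one maps format characters to C types, and the other selects whether the bytes are interpreted as big-endian or little-endian. Byte order was covered above; in the C type table, `I` stands for `unsigned int`, which unpacks to a Python integer or long. See the Python API documentation for the details.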
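The line of code quoted earlier looks like the read side of a length-prefixed protocol. A minimal sketch of that idea follows, assuming that each message is preceded by its payload length as a 4-byte big-endian unsigned int; the function names frame and read_frame are hypothetical and not taken from the original code.

```python
import struct

def frame(payload):
    """Prefix payload with its length as a 4-byte big-endian unsigned int."""
    return struct.pack(">I", len(payload)) + payload

def read_frame(buffer):
    """Return (payload, rest), or (None, buffer) if the frame is incomplete."""
    if len(buffer) < 4:
        return None, buffer
    length = struct.unpack(">I", buffer[:4])[0]
    if len(buffer) < 4 + length:
        return None, buffer
    return buffer[4:4 + length], buffer[4 + length:]

buffer = frame(b"hello") + frame(b"hi")
first, buffer = read_frame(buffer)
second, buffer = read_frame(buffer)
print(first, second, buffer)
```

The trailing [0] is needed because unpack always returns a tuple, even when the format describes a single item.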

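Here are some usage examples for reference:
1. Set the format string:
# take the first 5 bytes, skip 4 bytes, then take 3 more
fmt = '5s 4x 3s'

2. Use struct.unpack to extract the substrings:
import struct
# in Python 3 the data must be bytes, hence the b'' literal
print(struct.unpack(fmt, b'Test astring'))
# (b'Test ', b'ing')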
Here is a simple example: take the string 'He is not very happy', drop the "not" in the middle, and print the result.
import struct
theString = b'He is not very happy'
fmt = '2s 1x 2s 5x 4s 1x 5s'
print(b' '.join(struct.unpack(fmt, theString)).decode())
Output:
He is very happy
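The same format strings also work in the other direction. The following sketch shows what pack does with the `s` and `x` codes; the byte strings are made up for illustration.

```python
import struct

# 'x' pad bytes are written as zero bytes when packing.
packed = struct.pack("5s 4x 3s", b"Test ", b"ing")
print(packed)

# An 's' field has a fixed width: short input is padded with zero bytes,
# long input is truncated to fit.
padded = struct.pack("5s", b"ab")
truncated = struct.pack("2s", b"abcdef")
print(padded, truncated)
```

Because the width is fixed, the total length passed to unpack must match struct.calcsize of the format exactly, otherwise struct.error is raised.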

Next, some notes on network byte order that I found online and thought were useful:

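Python's socket library sends and receives data as byte strings (str in Python 2, bytes in Python 3). So when we call
data = sock.recv(4)
on a socket object sock to receive a 4-byte integer, the integer is actually stored in binary form in the first 4 bytes of data. Most of the time what we want is a real integer/long, not an integer encoded in a string. This is where the struct library comes in ("Interpret strings as packed binary data"). For the case above we can write
t = struct.unpack("I", data)
The first argument is the format string. I says that the first item in data is an integer stored as a C unsigned int. A bare "I" uses the host's native byte order; for data sent in network byte order, write "!I" instead. Here data holds only one item, but the string being interpreted may contain several, as long as the format string gives a format for each one. Naturally, unpack returns a tuple, so the value itself is t[0].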
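A runnable sketch of receiving several items over a socket follows. It assumes network byte order ("!") and uses socket.socketpair() in place of a real connection; the helper recv_exact and the two values sent are hypothetical.

```python
import socket
import struct

def recv_exact(sock, n):
    """Read exactly n bytes; a single recv() may return fewer than requested."""
    chunks = []
    while n > 0:
        chunk = sock.recv(n)
        if not chunk:
            raise ConnectionError("socket closed before all bytes arrived")
        chunks.append(chunk)
        n -= len(chunk)
    return b"".join(chunks)

FORMAT = "!IH"  # an unsigned int followed by an unsigned short, network order

a, b = socket.socketpair()
a.sendall(struct.pack(FORMAT, 305419896, 8080))
count, port = struct.unpack(FORMAT, recv_exact(b, struct.calcsize(FORMAT)))
a.close()
b.close()
print(count, port)
```

Using struct.calcsize for the read length keeps the number of bytes requested in step with the format string if fields are added later.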
