Pitfalls of the Intel Shift Instructions

Category: C/C++

2011-02-18 15:01:06

Author: gfree.wind@gmail.com

Blog: linuxfocus.blog.chinaunix.net

Today I ran into what at first looked like a bug in shl, Intel's logical left shift instruction.

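A logical left shift moves the destination operand left COUNT times. On each step the most significant bit is moved into the CF flag and the least significant bit is filled with zero. In the form SHL OPRD1, COUNT, OPRD1 is the destination operand, which can be a general-purpose register or a memory operand.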
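The definition above can be written down as a small C model. This is only a sketch of the idealized, one-bit-at-a-time description; the names shl_result and shl_textbook are made up for illustration, and the model reports CF as 0 when the count is 0, which is a simplification.

```c
#include <stdint.h>

/* Result of a simulated shift: the shifted value plus the carry flag. */
struct shl_result {
    uint32_t value; /* destination operand after the shift */
    int cf;         /* last bit shifted out of bit 31 (0 in this model if count is 0) */
};

/* Idealized model: shift left one bit at a time, `count` times.
 * This follows the textbook definition only; it makes no claim about
 * how a real CPU treats large counts. */
static struct shl_result shl_textbook(uint32_t value, unsigned count)
{
    struct shl_result r = { value, 0 };
    for (unsigned i = 0; i < count; i++) {
        r.cf = (int)((r.value >> 31) & 1u); /* top bit goes to CF */
        r.value <<= 1;                      /* low bit is filled with 0 */
    }
    return r;
}
```

Under this model, shl_textbook(1u, 32) yields a value of 0 with CF set to 1, which is the result most readers would expect from a 32-bit shift.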

First, my environment: an Intel(R) Pentium(R) 4 CPU running Fedora 12, with gcc 4.4.2.

Here is the test program:

#include <stdio.h>

int main(void)
{
#define MOVE_CONSTANT_BITS 32
    unsigned int move_step = MOVE_CONSTANT_BITS;
    /* Constant shift count: gcc evaluates this at compile time. */
    unsigned int value1 = 1ul << MOVE_CONSTANT_BITS;
    printf("value1 is 0x%X\n", value1);
    /* Variable shift count: the shift is performed at run time. */
    unsigned int value2 = 1ul << move_step;
    printf("value2 is 0x%X\n", value2);
    return 0;
}

Compile:

[root@Lnx99 test]#gcc -g test.c -o test
test.c: In function ‘main’:
test.c:8: warning: left shift count >= width of type

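At this point, a question for the reader: what are these two values, and are they equal?

I expect a good many people would say they are the same, both 0. By the definition of a logical left shift, the 1 is shifted out and the low 32 bits are filled with zeros, so the result must be zero.

Let's run it and see.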

[root@Lnx99 test]#./test
value1 is 0x0
value2 is 0x1

That is odd. Why does this happen? Let's look at the assembly.

Dump of assembler code for function main:
0x080483c4 <main+0>: push %ebp
0x080483c5 <main+1>: mov %esp,%ebp
0x080483c7 <main+3>: and $0xfffffff0,%esp
0x080483ca <main+6>: push %ebx
0x080483cb <main+7>: sub $0x2c,%esp
0x080483ce <main+10>: movl $0x20,0x14(%esp)
0x080483d6 <main+18>: movl $0x0,0x18(%esp)
0x080483de <main+26>: mov $0x80484f4,%eax
0x080483e3 <main+31>: mov 0x18(%esp),%edx
0x080483e7 <main+35>: mov %edx,0x4(%esp)
0x080483eb <main+39>: mov %eax,(%esp)
0x080483ee <main+42>: call 0x80482f4
0x080483f3 <main+47>: mov 0x14(%esp),%eax
0x080483f7 <main+51>: mov $0x1,%edx
0x080483fc <main+56>: mov %edx,%ebx
0x080483fe <main+58>: mov %eax,%ecx
0x08048400 <main+60>: shl %cl,%ebx
0x08048402 <main+62>: mov %ebx,%eax
0x08048404 <main+64>: mov %eax,0x1c(%esp)
0x08048408 <main+68>: mov $0x8048504,%eax
0x0804840d <main+73>: mov 0x1c(%esp),%edx
0x08048411 <main+77>: mov %edx,0x4(%esp)
0x08048415 <main+81>: mov %eax,(%esp)
0x08048418 <main+84>: call 0x80482f4
0x0804841d <main+89>: mov $0x0,%eax
0x08048422 <main+94>: add $0x2c,%esp
0x08048425 <main+97>: pop %ebx
0x08048426 <main+98>: mov %ebp,%esp
0x08048428 <main+100>: pop %ebp
0x08048429 <main+101>: ret
End of assembler dump.

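In the dump, the single instruction at 0x080483d6 (movl $0x0,0x18(%esp)) corresponds to unsigned int value1 = 1ul << MOVE_CONSTANT_BITS; and the instructions from 0x080483f3 through 0x08048404 (loading move_step, the shl %cl,%ebx, and the store to 0x1c(%esp)) correspond to unsigned int value2 = 1ul << move_step;

From this we can see that for the first statement gcc computed the result itself at compile time and simply stored it into value1, while the second statement really does execute the logical left shift shl.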

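But why is the result of the logical left shift shl equal to 1 rather than 0? The result is, surprisingly, the same as what a rotate left (ROL) would give. At this point I began to suspect the compiler: did it mistakenly emit the machine code for ROL when generating the binary?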
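To make the comparison with ROL concrete, here is a minimal sketch of a 32-bit rotate left in portable C. The function name rotl32 is hypothetical, not something from the test program.

```c
#include <stdint.h>

/* Rotate left: bits that leave the top re-enter at the bottom. */
static uint32_t rotl32(uint32_t value, unsigned count)
{
    count &= 31u;                 /* keep the count in 0..31 */
    if (count == 0)
        return value;             /* avoids an undefined shift by 32 below */
    return (value << count) | (value >> (32u - count));
}
```

rotl32(1u, 32) returns 1, matching the value2 seen above. A count of 32 cannot tell a rotate from a shift whose count wrapped to 0, though; an input such as 0x80000000 with a count of 1 can, since a rotate gives 1 where a shift gives 0.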

Use objdump -d test to inspect the machine code of test.

The machine code for the logical left shift is d3 e3:

8048400: d3 e3 shl %cl,%ebx

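To get a rotate left (ROL) for comparison, the only option is to edit the assembly. First generate test.s with gcc -S test.c, then change the line sall %cl, %ebx to roll %cl, %ebx (sall is the spelling gcc emits; SAL and SHL are the same instruction), and finally rebuild test from the modified test.s with gcc -g test.s -o test.

Inspect the machine code of test again with objdump -d test.

The machine code for the rotate left is d3 c3: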

8048400: d3 c3 rol %cl,%ebx
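The two encodings differ only in their second byte. For opcode d3 that byte is a ModRM byte whose middle three bits (the reg field) select the operation, so a rough decoder sketch shows why e3 means shl and c3 means rol. The struct and function names below are illustrative only, and the table entry for reg value 6 is marked reserved here because the manual does not document it.

```c
#include <stdint.h>

/* Fields of an x86 ModRM byte: mod (2 bits), reg (3 bits), rm (3 bits). */
struct modrm {
    unsigned mod;
    unsigned reg; /* for opcode D3 this selects the shift/rotate operation */
    unsigned rm;  /* with mod == 3 this is a register number (3 = ebx) */
};

static struct modrm decode_modrm(uint8_t byte)
{
    struct modrm m;
    m.mod = (byte >> 6) & 0x3u;
    m.reg = (byte >> 3) & 0x7u;
    m.rm  = byte & 0x7u;
    return m;
}

/* Operation chosen by the reg field for opcode D3 (shift/rotate by CL). */
static const char *d3_group_name(unsigned reg)
{
    static const char *const names[8] = {
        "rol", "ror", "rcl", "rcr", "shl", "shr", "(reserved)", "sar"
    };
    return names[reg & 0x7u];
}
```

Decoding 0xe3 gives mod 3, reg 4, rm 3, that is shl on ebx; decoding 0xc3 gives mod 3, reg 0, rm 3, that is rol on ebx.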

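At this point we can be sure the compiler is not at fault: it uses exactly the logical left shift instruction Intel provides. So why does the final result differ from what we expected?

Could it be an Intel bug?!

We should not jump to that conclusion. Logical left shift is a very basic instruction; would Intel really ship such an obvious bug?

Let's look at Intel's instruction set reference.

SAL/SAR/SHL/SHR - Shift (Continued), for 32-bit machines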
Description
These instructions shift the bits in the first operand (destination operand) to the left or right by
the number of bits specified in the second operand (count operand). Bits shifted beyond the
destination operand boundary are first shifted into the CF flag, then discarded. At the end of the
shift operation, the CF flag contains the last bit shifted out of the destination operand.
The destination operand can be a register or a memory location. The count operand can be an
immediate value or register CL. The count is masked to five bits, which limits the count range
to 0 to 31. A special opcode encoding is provided for a count of 1.

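Now the mystery is solved. On a 32-bit machine the shift count is only 5 bits wide (it is masked to the range 0 to 31), so a left shift by 32 is really a left shift by 0.

That makes 1ul << move_step equivalent to 1ul << 0, and value2 is naturally 1.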
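The masking rule quoted from the manual can be modeled in one line of C. This is a sketch of the documented behavior for a 32-bit operand, with a made-up function name, not a description of how any compiler implements the C shift operator.

```c
#include <stdint.h>

/* Model of x86 SHL on a 32-bit operand: only the low 5 bits of the
 * count are used, so the effective count is always in 0..31. */
static uint32_t shl32_x86_model(uint32_t value, unsigned count)
{
    return value << (count & 31u);
}
```

With this model, shl32_x86_model(1u, 32) is 1 and shl32_x86_model(1u, 33) is 2, because 32 and 33 are reduced to 0 and 1 before the shift happens.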

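We now know the whole story, but it has to be said that Intel's shift instructions hide a trap: apart from Intel's own manual, none of the other assembly language material I have read mentions this behavior. Some readers may point out that gcc already warned us with "test.c:8: warning: left shift count >= width of type". That is true for this program, but in more complex code, where the shift count is no longer a constant, gcc has no way of detecting the problem. So whenever we shift, we must make sure the count stays below the width of the operand: less than 32 for a 32-bit value, less than 64 for a 64-bit value.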
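One way to follow that advice when the count comes from a variable is to check it before shifting. The helper below is a minimal sketch under the assumption that a count of 32 or more should produce 0, the result most people expect; the name shl32_checked is hypothetical.

```c
#include <stdint.h>

/* Left shift that is defined for any count: counts of 32 or more
 * return 0 instead of reaching the shift operator at all. */
static uint32_t shl32_checked(uint32_t value, unsigned count)
{
    return count < 32u ? value << count : 0u;
}
```

Whether out-of-range counts should give 0, saturate, or be treated as an error depends on the caller; the point is only that the decision is made explicitly in the code rather than left to the hardware.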

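I also have a small complaint about gcc. For 1ul << 32, the value gcc computes itself at compile time does not match the value produced when the shift is actually executed, even though the compile-time value is closer to what the user expects. Strictly speaking, the C standard leaves a shift by a count greater than or equal to the width of the left operand undefined, so gcc is allowed to do this. Still, the practical effect is that the code gives the expected result while the count is a constant and a different result as soon as it becomes a variable. In a large program, that makes the problem very hard to track down.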


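GFree_Wind 2011-03-02 22:06:34

caoxudong818: Great post, thorough and detailed. Respect to the author. .....

Heh. I also hit this at work: after a left shift by 32 the result was wrong. When I debugged it, the shift evaluated by gdb was correct, but the value actually stored in the variable was not. It took me quite a while of digging to find out where the problem was.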

caoxudong818 2011-03-02 22:03:34

Great post, thorough and detailed. Respect to the author.
