ISO C中的restrict关键字-conghonglei-ChinaUnix博客

conghongleihonglei.blog.chinaunix.net

首页　| 　博文目录　| 　关于我

conghonglei

博客访问： 479246
博文数量： 143
博客积分： 6159
博客等级：准将
技术积分： 1667
用户组：普通用户
注册时间： 2010-08-25 23:08

文章分类

全部博文（143）

杂记（4）
programming（8）

erlang（1）
日志计划（4）
心路（9）
system（12）
network（19）
总线接口（17）
Linux（14）

nptl（11）
Joke（41）
未分配的博文（15）

文章存档

2013年（1）

2012年（11）

2011年（55）

2010年（76）

我的朋友

相关博文

ISO C中的restrict关键字

分类： C/C++

2011-07-10 10:38:28

最近在作avs优化的时候，有同学问，好奇怪阿，他这个指针要了加个restrict？
把wiki中的restict的解释放到这里：）

In the , as of the , restrict is a that can be used in declarations. The restrict keyword is a declaration of intent given by the programmer to the . It says that for the lifetime of the pointer, only it or a value directly derived from it (such as pointer + 1) will be used to access the object to which it points. This limits the effects of , aiding caching optimizations. If the declaration of intent is not followed and the object is accessed by an independent pointer, this will result in . Optimization

If the compiler knows that there is only one pointer to a memory block, it can produce better code. The following hypothetical example makes it clearer:

void updatePtrs(size_t *ptrA, size_t *ptrB, size_t *val)
{
*ptrA += *val;
*ptrB += *val;
}

In the above code, the pointers ptrA, ptrB, and val might refer to the , so the compiler will generate a less optimal code :

load R1 ← *val ; Load the value of val pointer
load R2 ← *ptrA ; Load the value of ptrA pointer
add R2 += R1 ; Perform Addition
set R2 → *ptrA ; Update the value of ptrA pointer
; Similarly for ptrB, note that val is loaded twice,
; because ptrA may be equal to val.
load R1 ← *val
load R2 ← *ptrB
add R2 += R1
set R2 → *ptrB

However if the restrict keyword is used and the above function is declared as :

void updatePtrs(size_t *restrict ptrA, size_t *restrict ptrB, size_t *restrict val);

then the compiler is allowed to assume that ptrA, ptrB, and val point to different locations and updating one pointer will not affect the other pointers. The programmer, not the compiler, is responsible for ensuring that the pointers do not point to identical locations.

Now the compiler can generate better code as follows:

load R1 ← *val
load R2 ← *ptrA
add R2 += R1
set R2 → *ptrA
; Note that val is not reloaded,
; because the compiler knows it is unchanged
load R2 ← *ptrB
add R2 += R1
set R2 → *ptrB

Note that the above assembly code is shorter because val is loaded once.

阅读(1092) | 评论(0) | 转发(0) |

上一篇：http://blog.xiqiao.info/ 有点意思的blog

下一篇：[弯曲评论] 对大宋下一代高端通信系统设计的七个展望

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6