Linux 2.6 kernel per-CPU data analysis (repost) - compilehacker - ChinaUnix blog


Category: LINUX

2010-12-26 21:02:50

Analysis of per-CPU data in the Linux 2.6 kernel

--------------------------------------------------------------------------------

The linker script arch/i386/kernel/vmlinux.lds contains:
/* will be freed after init */
. = ALIGN(4096); /* Init code and data */
__init_begin = .;

/* several lines omitted here :) */

. = ALIGN(32);
__per_cpu_start = .;
.data.percpu : { *(.data.percpu) }
__per_cpu_end = .;
. = ALIGN(4096);
__init_end = .;
/* freed after init ends here */

This shows that __per_cpu_start and __per_cpu_end mark the beginning and end of
the .data.percpu section. The whole section also lies between __init_begin and
__init_end, so the memory it occupies is freed once system initialization
has finished.

Given the definition
#define DEFINE_PER_CPU(type, name) __attribute__((__section__(".data.percpu"))) __typeof__(type) per_cpu__##name

the declaration
static DEFINE_PER_CPU(struct runqueue, runqueues);
expands to
__attribute__((__section__(".data.percpu"))) __typeof__(struct runqueue) per_cpu__runqueues;
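That is, it defines a variable per_cpu__runqueues of type struct runqueue in the .data.percpu section.
In practice this variable is never used for storage after boot: what matters is its
address, which serves as an offset relative to __per_cpu_start.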
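
The token pasting in DEFINE_PER_CPU can be tried in user space. The following is a minimal sketch, not kernel code: every demo_ name and the struct demo_runqueue type are hypothetical, the section attribute is left out so that the snippet builds outside a kernel link, and __typeof__ assumes GCC or Clang.

```c
#include <assert.h>

/* Hypothetical stand-in for struct runqueue. */
struct demo_runqueue {
    unsigned long nr_running;
};

/* Same shape as DEFINE_PER_CPU, minus the section attribute:
 * the macro pastes a fixed prefix onto the variable name. */
#define DEMO_DEFINE_PER_CPU(type, name) \
    __typeof__(type) demo_per_cpu__##name

/* Expands to: static __typeof__(struct demo_runqueue) demo_per_cpu__runqueues; */
static DEMO_DEFINE_PER_CPU(struct demo_runqueue, runqueues);

/* Writes to the pasted name directly, which only compiles
 * if the macro really produced a variable with that name. */
unsigned long demo_touch_template(unsigned long n)
{
    assert(sizeof(demo_per_cpu__runqueues) == sizeof(struct demo_runqueue));
    demo_per_cpu__runqueues.nr_running = n;
    return demo_per_cpu__runqueues.nr_running;
}
```

Calling demo_touch_template(3) stores 3 in the generated variable and returns it, which confirms that the name after expansion is demo_per_cpu__runqueues.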

During boot, start_kernel() calls the following function:

unsigned long __per_cpu_offset[NR_CPUS];

static void __init setup_per_cpu_areas(void)
{
    unsigned long size, i;
    char *ptr;
    /* Created by linker magic */
    extern char __per_cpu_start[], __per_cpu_end[];

    /* Copy section for each CPU (we discard the original) */
    size = ALIGN(__per_cpu_end - __per_cpu_start, SMP_CACHE_BYTES);
#ifdef CONFIG_MODULES
    if (size < PERCPU_ENOUGH_ROOM)
        size = PERCPU_ENOUGH_ROOM;
#endif

    ptr = alloc_bootmem(size * NR_CPUS);

    for (i = 0; i < NR_CPUS; i++, ptr += size) {
        __per_cpu_offset[i] = ptr - __per_cpu_start;
        memcpy(ptr, __per_cpu_start, __per_cpu_end - __per_cpu_start);
    }
}

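This function allocates a block of memory for each CPU and copies the contents of
.data.percpu into it, so every CPU gets its own copy. __per_cpu_offset[n] records
the distance from __per_cpu_start to the start of CPU n's private data area, so that
area begins at __per_cpu_start + __per_cpu_offset[n]. The position of
per_cpu__runqueues relative to __per_cpu_start therefore carries over as the same
position inside each CPU's private area, and the original .data.percpu section can
be freed once system initialization is complete.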
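
The copy-and-record-offset step can be imitated in user space. This is a rough sketch under stated assumptions, not the kernel implementation: malloc stands in for alloc_bootmem, a small struct stands in for the .data.percpu section, and all demo_ names, DEMO_NR_CPUS and the initial values are made up for illustration. Offsets are computed through uintptr_t because subtracting pointers to unrelated objects is not defined in standard C.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

#define DEMO_NR_CPUS 4  /* hypothetical CPU count */

/* Stand-in for the .data.percpu template section. */
struct demo_percpu_template {
    int counter;
    long ticks;
};

static struct demo_percpu_template demo_template = { 7, 100 };

/* Distance from the template to each CPU's private copy,
 * playing the role of __per_cpu_offset[]. */
static uintptr_t demo_offset[DEMO_NR_CPUS];

/* One copy of the template per CPU. Returns 0 on success, -1 on failure. */
int demo_setup_per_cpu_areas(void)
{
    size_t size = sizeof(demo_template);
    char *ptr = malloc(size * DEMO_NR_CPUS);

    if (ptr == NULL)
        return -1;

    for (int i = 0; i < DEMO_NR_CPUS; i++, ptr += size) {
        demo_offset[i] = (uintptr_t)ptr - (uintptr_t)&demo_template;
        memcpy(ptr, &demo_template, size);
    }
    return 0;
}

/* Address of CPU cpu's copy of the counter field:
 * the field's address in the template plus that CPU's offset. */
int *demo_counter_of(unsigned cpu)
{
    assert(cpu < DEMO_NR_CPUS);
    return (int *)((uintptr_t)&demo_template.counter + demo_offset[cpu]);
}
```

After demo_setup_per_cpu_areas() succeeds, writing through demo_counter_of(1) changes only the second copy; the copy for CPU 0 and the template itself keep their initial value.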

Now look at how per-CPU variables are accessed:

/* This macro obfuscates arithmetic on a variable address so that gcc
shouldn't recognize the original var, and make assumptions about it */
#define RELOC_HIDE(ptr, off) ({ unsigned long __ptr; __asm__ ("" : "=g"(__ptr) : "0"(ptr)); (typeof(ptr)) (__ptr + (off)); })

/* var is in discarded region: offset to particular copy we want */
#define per_cpu(var, cpu) (*RELOC_HIDE(&per_cpu__##var, __per_cpu_offset[cpu]))
#define __get_cpu_var(var) per_cpu(var, smp_processor_id())

#define get_cpu_var(var) (*({ preempt_disable(); &__get_cpu_var(var); }))

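For __get_cpu_var(runqueues), the expansion is equivalent to
*(struct runqueue *)((unsigned long)&per_cpu__runqueues + __per_cpu_offset[smp_processor_id()])
and the result is an lvalue, so it can be assigned to.
Because __per_cpu_offset[n] is the distance from __per_cpu_start to CPU n's private
area, this is exactly the start address of the current CPU's private data area plus
the offset of per_cpu__runqueues within .data.percpu.

Different per-CPU variables have different offsets, and each CPU's private data area
starts at a different address, so __get_cpu_var() reaches a distinct copy for every
variable on every CPU.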
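
The address arithmetic behind per_cpu() can be mimicked with an ordinary macro. The sketch below is a user-space approximation only: demo_per_cpu, struct demo_vars and the other demo_ names are hypothetical, struct fields stand in for variables placed in .data.percpu, RELOC_HIDE is replaced by a plain cast through uintptr_t, and __typeof__ assumes GCC or Clang.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define DEMO_NR_CPUS 2  /* hypothetical CPU count */

/* Two "per-CPU variables" at different offsets in the template. */
struct demo_vars {
    int hits;
    long misses;
};

static struct demo_vars demo_template = { 1, 2 };
static struct demo_vars demo_copies[DEMO_NR_CPUS];
static uintptr_t demo_offset[DEMO_NR_CPUS];

/* Copy the template once per CPU and record each copy's distance from it. */
void demo_init(void)
{
    for (unsigned cpu = 0; cpu < DEMO_NR_CPUS; cpu++) {
        memcpy(&demo_copies[cpu], &demo_template, sizeof(demo_template));
        demo_offset[cpu] = (uintptr_t)&demo_copies[cpu]
                         - (uintptr_t)&demo_template;
    }
}

/* Template address of the variable plus the chosen CPU's offset,
 * dereferenced so that the whole expression is an lvalue. */
#define demo_per_cpu(var, cpu) \
    (*(__typeof__(&demo_template.var)) \
        ((uintptr_t)&demo_template.var + demo_offset[cpu]))

/* Modify one variable on each CPU, then fold the state into one number. */
long demo_run(void)
{
    demo_init();
    demo_per_cpu(hits, 0) += 10;   /* CPU 0: hits goes from 1 to 11 */
    demo_per_cpu(misses, 1) = 5;   /* CPU 1: misses goes from 2 to 5 */
    assert(demo_template.hits == 1);  /* the template is never written */
    return demo_per_cpu(hits, 0) * 100
         + demo_per_cpu(hits, 1) * 10
         + demo_per_cpu(misses, 1);
}
```

demo_run() returns 1115: hits is 11 on CPU 0 and still 1 on CPU 1, while misses on CPU 1 is 5. The same macro therefore reaches a different object for each (variable, CPU) pair, and both assignment forms work because the expansion is an lvalue.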
