<Bottleneck Identification and Scheduling in Multithreaded Applications>读书报告——郑怡，赵雨洁-xuyuanchao

XUYUANCHAO 教学博客xuyuanchao.blog.chinaunix.net

首页　| 　博文目录　| 　关于我

xuyuanchao_cnu

博客访问： 2241647
博文数量： 436
博客积分： 9833
博客等级：中将
技术积分： 5558
用户组：普通用户
注册时间： 2010-09-29 10:27

文章分类

全部博文（436）

10级实习与毕设（24）

bochs模拟器（6）

web远程管理（11）

cache模拟器（7）
南小院交流讨论区（99）
嵌入式操作系统20（0）
网络工程2010-201（39）
嵌入式操作系统与（108）
信息工程专业课程（12）
网络工程2011-201（28）
hadoop云计算专题（10）
Linux内核网络协（9）
谷歌云计算专题（2）
google Android （5）
google Android （13）
torque 3D游戏专（5）
torque 2D游戏专（8）
嵌入式网络协议专（8）
blog微博等专题（5）
恶意代码分析专题（10）
框计算与社会计算（3）
P2P专题（7）
未分配的博文（41）

文章存档

2013年（47）

2012年（79）

2011年（192）

2010年（118）

我的朋友

分类： LINUX

2012-12-24 20:01:49

1、引言

Our key idea is thus simple: measure the number of cycles spent by threads waiting for each bottleneck and accelerate the bottlenecks responsible for the highest thread waiting cycles.

我们主要的想法就是：测量线程等待每一个瓶颈的周期数，并加速负责最高线程等待周期的瓶颈。

This solution is too costly because (a) writing correct parallel programs is already a daunting task, and (b) serializing bottlenecks change with machine conﬁgu- ration, program input set, and program phase (as we show in Sec- tion 2.2), thus, what may seem like a bottleneck to the programmer may not be a bottleneck in the ﬁeld and vice versa.

这个解决方法代价很高因为（a）程序员写并行程序是一个艰巨的任务（b）一系列的瓶颈会随着机器配置、程序输入集、程序计划阶段改变而改变，所以是不是一个瓶颈不一定

The programmer, compiler or library delimits potential bot- tlenecks using BottleneckCall and BottleneckReturn instructions, and replaces the code that waits for bottlenecks with a Bottleneck- Wait instruction.

程序员利用BottleneckCall和BottleneckReturn指令分割潜在的瓶颈，用Bottleneck-Wait指令代替等待瓶颈的代码

The bottlenecks with the highest number of thread waiting cycles are selected for acceleration on one or more large cores. On executing a BottleneckCall instruction, the small core checks if the bottleneck has been selected for acceleration.

最高线程等待的周期的瓶颈被选择为加速在一个或多个大核上。在执行BC指令时小核检查瓶颈是否被选择为加速

How- ever, it only applies to barriers in statically scheduled workloads, where the work to be performed by each thread is known before runtime.

只适用于静态调度的障碍

3.3 加速瓶颈

BIS, consists of two parts: identiﬁcation of critical bottlenecks and acceleration of those bottlenecks.

BIS，包括两部分：识别临界瓶颈并且加速这些瓶颈。

Identiﬁcation of critical bottlenecks is done in hardware based on information provided by the software.

识别临界瓶颈是在软件提供的信息基础上在硬件上实现的。

There are multiple ways to accelerate a bottleneck, e.g. increasing core frequency, giving a thread higher priority in shared hard- ware resources, or migrating the bottleneck to a faster core with a more aggressive microarchitecture or higher frequency.

有许多方法加速瓶颈，例如提高核的频率，共享硬件资源给一个线程更高的优先权，或者把瓶颈移到有更积极的微体系结构建模或更高频率的核中。

问题：

1. However, these proposals lack generality and ﬁnegrained adaptivity.中的finegrained 怎么理解（细粒）

阅读(956) | 评论(0) | 转发(0) |

上一篇：深入理解Linux内核——孟倩倩

下一篇：小组问题探究--张津（徐加注）

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6