Overview of Packet Reception-tianyashuibin-ChinaUnix博客

tianyashuibin的LIBRARY

首页　| 　博文目录　| 　关于我

tianyashuibin

博客访问： 920819
博文数量： 299
博客积分： 0
博客等级：民兵
技术积分： 2493
用户组：普通用户
注册时间： 2014-03-21 10:07

个人简介

Linux后台服务器编程。

文章分类

全部博文（299）

Python（1）
BOOST（21）
libevent（1）
GCC,GDB,MKFILE（6）
Mmap（1）
Nagios（1）
Linux内核（4）
安全（2）
IT职场（3）
体系结构（1）
后台面试（3）
经典小游戏（2）
文件系统（6）
IO（20）
数据库（3）
网络编程（28）
其他（5）
设计模式（6）
进程，线程（18）
C/C++（67）
tcp/ip（31）
进程间通信（13）
linux（44）
数据结构、算法（12）
未分配的博文（0）

文章存档

2015年（2）

2014年（297）

推荐博文

相关博文

Overview of Packet Reception

分类： LINUX

2014-05-05 09:26:24

本文来源：红帽 Performance Tuning Guide

To better analyze network bottlenecks and performance issues, you need to understand how packet
reception works. Packet reception is important in network performance tuning because the receive
path is where frames are often lost. Lost frames in the receive path can cause a significant penalty to
network performance.

Figure 8.1. Network receive path diagram
The Linux kernel receives each frame and subjects it to a four?step process:
1. Hardware Reception: the network interface card (NIC) receives the frame on the wire.
Depending on its driver configuration, the NIC transfers the frame either to an internal hardware
buffer memory or to a specified ring buffer.
2. Hard IRQ: the NIC asserts the presence of a net frame by interrupting the CPU. This causes the
NIC driver to acknowledge the interrupt and schedule the soft IRQ operation.
3. Soft IRQ: this stage implements the actual frame?receiving process, and is run in softirq
context. This means that the stage pre?empts all applications running on the specified CPU, but
still allows hard IRQs to be asserted.
In this context (running on the same CPU as hard IRQ, thereby minimizing locking overhead), the
kernel actually removes the frame from the NIC hardware buffers and processes it through the
network stack. From there, the frame is either forwarded, discarded, or passed to a target
listening socket.
When passed to a socket, the frame is appended to the application that owns the socket. This
process is done iteratively until the NIC hardware buffer runs out of frames, or until the device
weight (dev_weight ). For more information about device weight, refer to Section 8.4.1, “NIC
Hardware Buffer” (s?network?common?queue?issues.html#s?network?commonque?nichwbuf)
4. Application receive: the application receives the frame and dequeues it from any owned
sockets via the standard POSIX calls (read , recv , recvfrom ). At this point, data received
over the network no longer exists on the network stack.
CPU/cache affinity
To maintain high throughput on the receive path, it is recommended that you keep the L2 cache hot.
As described earlier, network buffers are received on the same CPU as the IRQ that signaled their
presence. This means that buffer data will be on the L2 cache of that receiving CPU.
To take advantage of this, place process affinity on applications expected to receive the most data on
the NIC that shares the same core as the L2 cache. This will maximize the chances of a cache hit, and
thereby improve performance.

阅读(881) | 评论(0) | 转发(0) |

上一篇：http主要特点

下一篇：C++标准库和标准模板库

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6