分类: LINUX
2008-11-05 13:02:55
主要涉及到 setbuf, fflush, fsync,sync等函数。
首先来说输入输出库的缓冲。
The three types of buffering available are unbuffered, block buffered, and line buffered.
When an output stream is unbuffered, information appears on the
destination file or terminal as soon as written; when it is
block buffered many characters are saved up and written as a block; when it is line buffered characters are
saved up until a newline is output or input is read from any stream attached to a terminal device (typically
stdin). The function fflush(3) may be used to force the block out early. (See fclose(3).) Normally all files
are block buffered. When the first I/O operation occurs on a file, malloc(3) is called, and a buffer is
obtained. If a stream refers to a terminal (as stdout normally does) it is line buffered. The standard error
stream stderr is always unbuffered by default.
一般来说,block buffered的效率高些,将多次的操作合并成一次操作。现在标准库里缓存一部分,
直到该缓冲区满了,或者程序显示的调用fflush时,将进行更新操作。
而setbuf 则可以设置该缓冲区的大小。
#include
void setbuf(FILE *stream, char *buf);
这个函数应该必须在如何输出被写到该文件之前调用。一般放在main里靠前面的语句!
但是setbuf有个经典的错误,man手册上也提到了,c陷阱和缺陷上也提到了
You must make sure that both buf and the space it points to still exist by the time stream is closed, which also happens at program termination. For example, the following is illegal:
#include
int main()
{
char buf[BUFSIZ];
setbuf(stdin, buf);
printf("Hello, world!\n");
return 0;
}
之前C运行库所必须进行的清理工作的一部分,但是此时buf字符数组已经释放。
修改的方法是 将buf设置为static,或者全局变量; 或者调用malloc来动态申请内存。
char * malloc();
setbuf(stdout,malloc(BUFSIZE));
这里不需要判断malloc的返回值,如果malloc调用失败,将返回一个null指针,setbuf的第二个参数可以
是null,此时不进行缓冲!
对应的,fflush函数则刷新缓冲区,将缓冲区上的内容更新到文件里。
#include
int fflush(FILE *stream);
The function fflush forces a write of all user-space buffered
data for the given output or update stream via the stream underlying
write function. The open status of the stream is unaffected.
If the stream argument is NULL, fflush flushes all open output streams.
但是fflush仅仅刷新C库里的缓冲。
其他的一些数据的刷新需要调用fsync或者sync!!
Note that fflush() only flushes the user space buffers provided by the C library. To ensure that the data is physically stored on disk the kernel buffers must be flushed too, e.g. with sync(2) or fsync(2).
fsync和sync最终将缓冲的数据更新到文件里。
#include
int fsync(int fd);
fsync copies all in-core parts of a file to disk, and waits until the
device reports that all parts are on stable storage. It also updates
metadata stat information. It does not necessarily ensure that the
entry in the directory containing the file has also reached
disk. For that an explicit fsync on the file descriptor of the
directory is also needed.
NAME
sync - commit buffer cache to disk
SYNOPSIS
#include
void sync(void);
DESCRIPTION
sync first commits inodes to buffers, and then buffers to disk.
ERRORS
This function is always successful.
同步命令sync就直接调用了sync函数来更新磁盘上的缓冲!!
传统的UNIX实现在内核中设有缓冲区高速缓存或页面高速缓存,大多数磁盘I/O都通过缓冲进行。当将数据写 入文件时,内核通常先将该数据复制到其中一个缓冲区中,如果该缓冲区尚未写满,则并不将其排入输出队列,而是等待其写满或者当内核需要重用该缓冲区以便存 放其他磁盘块数据时,再将该缓冲排入输出队列,然后待其到达队首时,才进行实际的I/O操作。这种输出方式被称为延迟写(delayed write)(Bach [1986]第3章详细讨论了缓冲区高速缓存)。
延迟写减少了磁盘读写次数,但是却降低了文件内容的更新速度,使得欲写到文件中的数据在一段时间内并没有写到 磁盘上。当系统发生故障时,这种延迟可能造成文件更新内容的丢失。为了保证磁盘上实际文件系统与缓冲区高速缓存中内容的一致性,UNIX系统提供了 sync、fsync和fdatasync三个函数。
sync函数只是将所有修改过的块缓冲区排入写队列,然后就返回,它并不等待实际写磁盘操作结束。
通常称为update的系统守护进程会周期性地(一般每隔30秒)调用sync函数。这就保证了定期冲洗内核的块缓冲区。命令sync(1)也调用sync函数。
fsync函数只对由文件描述符filedes指定的单一文件起作用,并且等待写磁盘操作结束,然后返回。fsync可用于数据库这样的应用程序,这种应用程序需要确保将修改过的块立即写到磁盘上。
fdatasync函数类似于fsync,但它只影响文件的数据部分。而除数据外,fsync还会同步更新文件的属性。
本书说明的所有四种平台都支持sync和fsync函数。但是,FreeBSD 5.2.1和Mac OS X 10.3并不支持fdatasync。