The architecture of REDIS-zhm_sunboy-ChinaUnix博客

坚持到底max2008.blog.chinaunix.net

首页　| 　博文目录　| 　关于我

zhm_sunboy

博客访问： 3315045
博文数量： 346
博客积分： 10189
博客等级：上将
技术积分： 3125
用户组：普通用户
注册时间： 2008-08-05 19:46

文章分类

全部博文（346）

java（1）
flume（1）
zookeeper（0）
MQ（5）

Zeromq（0）

Metaq（0）

Jafka（0）

Kafka（5）
MongoDB（0）
Webserver（2）

LVS（0）

Tomcat（0）

Lighttpd（0）

Apache（1）

Nigix（1）
Hadoop（0）
Storm（17）
Libevent（1）
Memcached（0）
Redis（7）
Protocol（13）

SIP（12）
Algorithm（4）
Streaming（29）
网络技术（7）
Telecom（32）
计算机原理（0）
网络编程（26）
人生励志（4）
软件设计（1）
软件测试（2）
技术心得（0）
Web（1）
UML学习（2）
Wiki使用（2）
公司培训（0）
Windows编程（20）

GDIPlus使用（3）

MFC编程（4）

VC基础（9）

文件操作（1）

多线程（3）
Value-added（14）
Interview（1）
Programming（76）

Network pro（0）

Design patt（8）

Generic pro（2）

STL（4）

C++（49）

C（13）
DB（1）

Oracle（1）
Linux/Unix（77）

Unix下C/C++编程（13）

Shell脚本（14）

UNIX命令（14）

系统管理（11）

C/C++开发工具（25）
未分配的博文（0）

文章存档

2013年（35）

2011年（35）

2010年（76）

2009年（48）

2008年（152）

我的朋友

相关博文

The architecture of REDIS

分类： NOSQL

2013-06-18 13:25:28

is an open source, advanced key-value store. It is often referred to as a data structure server since keys can contain strings, hashes, lists, sets and sorted sets. Redis works with an in-memory dataset. Depending on your use case, you can persist it either by dumping the dataset to disk every once in a while, or by appending each command to a log. You can read more about its features on project’s . Redis’s original author is (awesome work Salvatore !) and on March 15 2010 started the project. Redis is used in several important products and services such as: gitHub, ,, and many more as you can read from page.

Redis is written in ANSI C and works in most POSIX systems like Linux, BSD, OS X and Solaris without external dependencies. From my limited point of view it’s code is simple and elegant to read. Codebase is not very big (about 20k LOC for 2.2 release) and its have been documented well on website. Before going on I invite you to read Redis which has been recently written by the author in which he explains philosophy of his development. I quote and I appreciate in particular the following:

We optimize for joy. We believe writing code is a lot of hard work, and the only way it can be worth is by enjoying it. When there is no longer joy in writing code, the best thing to do is stop. To prevent this, we’ll avoid taking paths that will make Redis less of a joy to develop.

This article tries to summarize what I have read from docs and code. Redis is a client/server system. A typicaldeployment diagram is the following:

Redis server is a process that talks a TCP protocol with clients.
redis-cli is the official client while other external projects have implemented bindings for several flavors of languages such as: C++, C#, Clojure, Common Lisp, Erlang, Haskell, Java, JavaScript, Lua, Objective-C, Perl, PHP, Python, R, Ruby, Scala, Go, and Tcl. is well documented on the website so you can write your own. A Master-Slave replication is also possible if you need.

I designed a conceptual modularization of the source code as the following:

redis. is the server daemon. It is made by a single redis.c file which is about 6K LOC.
networking. code for implementing networking. In particular the event-based logic which can be implemented by epoll, kqueue and select system calls
datastructure. represents important data structures used by server. Crucial is for example sds.c which represents a redis string upon which all the code is written.
redis-cli. is the command line client

How Redis works

At its roots, Redis is a single-threaded server. This means that a single thread reads incoming connections using an event-based paradigm such as epoll, kqueue and select. When a specific event occurs on a file descriptor, it processes them and write back responses. This UML sequence diagram shows how a command received by a client is processed internally by Redis:

Redis uses an home-made event library which abstracts low level socket management. Central object is theeventLoop which contains events that has been fired according socket I/O. aeApiPoll(eventLoop) polls all the socket descriptors to see if there is network activity. In the aeProcessEvents() all fired events are checked and the appropriate handler is invoked. Redis uses a command table to interpret every command from the protocol and to execute an appropriate action. A command table is defined in redis.c as the following:

static struct redisCommand cmdTable[] = {
   {"get",getCommand,2,REDIS_CMD_INLINE,NULL,1,1,1},
   {"set",setCommand,3,REDIS_CMD_BULK|REDIS_CMD_DENYOOM,NULL,0,0,0},
   . . . . . .
   {"ping",pingCommand,1,REDIS_CMD_INLINE,NULL,0,0,0},
   {"echo",echoCommand,2,REDIS_CMD_BULK,NULL,0,0,0},
   {"save",saveCommand,1,REDIS_CMD_INLINE,NULL,0,0,0},
   . . . . . .

second parameter of this structure is the name of the method to invoke. For example a saveCommand() is implemented as the following:

	static void saveCommand(redisClient *c) {
	    if (server.bgsavechildpid != -1) {
	        addReplySds(c,sdsnew("-ERR background save in progress\r\n"));
	        return;
	    }
	    if (rdbSave(server.dbfilename) == REDIS_OK) {
	        addReply(c,shared.ok);
	    } else {
	        addReply(c,shared.err);
	    }
	}

addReply() is used to push back responses to client.

Just a quick recap on what we have seen:

Redis uses a single thread that manages synchronously all network connection. A thin event library has been implemented to abstract several unix system calls (epoll, select, kqueue). There is an interesting discussion on about motivation to use a new event library instead relying on existing opensource such as .
Requests are managed with commands. Using a command table and according what is read from sockets a command handler is invoked to perform desired action.
Fundamental data structure is the SDS string. You can read more

This is the occasion (at least for me) to read about the C10K problem when you develop networked servers.

Data store management

Redis instance contains global variables the represents state of the server. For example in redis.c there is the following definition:

	/* Global server state structure */
	struct redisServer {
	    pthread_t mainthread;
	    int port;
	    int fd;
	    . . . . . .

one of the variables is redisDB which is defined as:

	typedef struct redisDb {
	    dict *dict;                 /* The keyspace for this DB */
	    dict *expires;              /* Timeout of keys with a timeout set */
	    dict *blockingkeys;         /* Keys with clients waiting for data (BLPOP) */
	    dict *io_keys;              /* Keys with clients waiting for VM I/O */
	    int id;
	} redisDb;

dict is the in-memory datastructure that is used to model Redis. It is defined in dict.c and contains a set of methods that you expect when you implement an hashtable such as:

.......
dict *dictCreate(dictType *type, void *privDataPtr);
int dictAdd(dict *d, void *key, void *val);
int dictReplace(dict *d, void *key, void *val);
int dictDelete(dict *d, const void *key);
........

and if we see an extract of one of the most important commands – set command – we see:

static void setGenericCommand(redisClient *c, int nx, robj *key, robj *val, robj *expire) {
. . . . . . . .
. . . . . . . . 
retval = dictAdd(c->db->dict,key,val);
 if (retval == DICT_ERR) {
     if (!nx) {
         /* If the key is about a swapped value, we want a new key object
          * to overwrite the old. So we delete the old key in the database.
          * This will also make sure that swap pages about the old object
          * will be marked as free. */
         if (server.vm_enabled && deleteIfSwapped(c->db,key))
             incrRefCount(key);
         dictReplace(c->db->dict,key,val);
         incrRefCount(val);
     } else {
         addReply(c,shared.czero);
         return;
     }
 } else {
     incrRefCount(key);
     incrRefCount(val);
 }
. . . . . . . .
. . . . . . . .

How Redis starts

Bootup process is very simple. As previously said redis instance is made by a set of global variables and methods that access them. main() of redis server is:

After reading a configuration file, initServer() is called. This method initializes all global variables of redis. In particular it creates a set of linked list useful for management:

......
server.clients = listCreate();
server.slaves = listCreate();
server.monitors = listCreate();
server.objfreelist = listCreate();
......

it creates the most important structure:

......
server.db[j].dict = dictCreate(&dbDictType,NULL);
server.db[j].expires = dictCreate(&keyptrDictType,NULL);
server.db[j].blockingkeys = dictCreate(&keylistDictType,NULL);
......

it initializes the event library creating the eventloop and the server socket. Eventually it enters into the main loop for managing I/O with clients:

	void aeMain(aeEventLoop *eventLoop) {
	    eventLoop->stop = 0;
	    while (!eventLoop->stop) {
	        if (eventLoop->beforesleep != NULL)
	            eventLoop->beforesleep(eventLoop);
	        aeProcessEvents(eventLoop, AE_ALL_EVENTS);
	    }
	}

aeProcessEvents is the method analyzed in previous sections.

Virtual Memory

Redis supports Virtual Memory: this means that when your dataset doesn’t fit in RAM, it can be backed on disk. In this of the author’s blog there is a discussion regarding design’s rational behind virtual memory. There are good sources of documentation on the official documentation page: how VM and how VM has been

转自：

阅读(3610) | 评论(0) | 转发(0) |

上一篇：redis设计思想

下一篇：Redis源码解析（1）—源码目录介绍

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6