Asynchronous programming with Tornado-laoliulaoliu-ChinaUnix博客

miraclemiracle.blog.chinaunix.net

首页　| 　博文目录　| 　关于我

laoliulaoliu

博客访问： 4663072
博文数量： 1214
博客积分： 13195
博客等级：上将
技术积分： 9105
用户组：普通用户
注册时间： 2007-01-19 14:41

个人简介

C++,python,热爱算法和机器学习

文章分类

全部博文（1214）

cloud（3）
operation（9）
tornado（4）
mac_os（1）
golang（4）
架构（13）
git（4）
security（29）
shell（1）
macbook（1）
ruby（13）
javascript（15）
design（3）
testing（1）
mac（1）
bigdata（69）
nosql（46）
R（9）
gcj/acm（6）
NLP（10）
小说（3）
matlab（4）
web（44）
java（66）
product（7）
c#（1）
language（4）
machine learning（76）
science（4）
opencourse（2）
windows（3）
search（33）
algorithm（65）
database（51）
compiler（11）
ACE（5）
poem（1）
programming（29）
python（140）
assembly（1）
linux（49）
C++（16）
book（2）
cate（1）
phliosophy（3）
mental（30）
Science fiction（1）
Software（5）
c（23）
network（65）
CS（15）
thinking（10）
BSD（13）
solaris10（2）
life（57）
Debian（16）
economy（7）
Mathematics（57）
OS（8）
ibm（2）
gentoo（32）
未分配的博文（8）

文章存档

2021年（13）

2020年（49）

2019年（14）

2018年（27）

2017年（69）

2016年（100）

2015年（106）

2014年（240）

2013年（5）

2012年（193）

2011年（155）

2010年（93）

2009年（62）

2008年（51）

2007年（37）

我的朋友

相关博文

Asynchronous programming with Tornado

分类： Python/Ruby

2016-05-20 12:39:21

原文地址：https://lbolla.info/blog/2012/10/03/asynchronous-programming-with-tornado

Asynchronous programming can be tricky for beginners, therefore I think it’s useful to iron some basic concepts to avoid common pitfalls. For an explanation about generic asynchronous programming, I recommend you one of the . I will focus solely on asynchronous programming in Tornado.

From Tornado’s homepage:

FriendFeed’s web server is a relatively simple, non-blocking web server written in Python. The FriendFeed application is written using a web framework that looks a bit like web.py or Google’s webapp, but with additional tools and optimizations to take advantage of the non-blocking web server and tools. Tornado is an open source version of this web server and some of the tools we use most often at FriendFeed. The framework is distinct from most mainstream web server frameworks (and certainly most Python frameworks) because it is non-blocking and reasonably fast. Because it is non-blocking and uses epoll or kqueue, it can handle thousands of simultaneous standing connections, which means the framework is ideal for real-time web services. We built the web server specifically to handle FriendFeed’s real-time features every active user of FriendFeed maintains an open connection to the FriendFeed servers. (For more information on scaling servers to support thousands of clients, see The C10K problem.)

The first step as a beginner is to figure out if you really need to go asynchronous. Asynchronous programming is more complicated that synchronous programming, because, as someone described, it does not fit human brain nicely.

You should use asynchronous programming when your application needs to monitor some resources and react to changes in their state. For example, a web server sitting idle until a request arrives through a socket is an ideal candidate. Or an application that has to execute tasks periodically or delay their execution after some time. The alternative is to use multiple threads (or processes) to control multiple tasks and this model becomes quickly complicated.

The second step is to figure out if you can go asynchronous. Unfortunately in Tornado, not all the tasks can be executed asynchronously.

Tornado is single threaded (in its common usage, although it supports multiple threads in advanced configurations), therefore any “blocking” task will block the whole server. This means that a blocking task will not allow the framework to pick the next task waiting to be processed. The selection of tasks is done by the IOLoop, which, as everything else, runs in the only available thread.

For example, this is a wrong way of using IOLoop:

						
										# Example of misuse of callback. DON'T DO THIS!
									
										import time
									
										from tornado.ioloop import IOLoop
									
										def blocking_func():
									
										print 'sleeping'
									
										time.sleep(1)
									
										print 'awake!'
									
										if __name__ == "__main__":
									
										# Note that code is executed sequantially!
									
										IOLoop.instance().add_callback(blocking_func)
									
										IOLoop.instance().add_callback(blocking_func)
									
										IOLoop.instance().start()
									
			 hosted with ? by

Note that blocking_func is called correctly, but, being blocking (time.sleep blocks!), it will prevent the execution of the following task (the second call to the same function). Only when the first call will end, the second will be called by IOLoop. Therefore, the output in console is sequential (“sleeping”, “awake!”, “sleeping”, “awake!”).

Compare the same “algorithm”, but using an “asynchronous version” of time.sleep, i.e. add_timeout:

						
										# Example of non-blocking sleep.
									
										import time
									
										from tornado.ioloop import IOLoop
									
										from tornado import gen
									
										@gen.engine
									
										def f():
									
										print 'sleeping'
									
										yield gen.Task(IOLoop.instance().add_timeout, time.time() + 1)
									
										print 'awake!'
									
										if __name__ == "__main__":
									
										# Note that now code is executed "concurrently"
									
										IOLoop.instance().add_callback(f)
									
										IOLoop.instance().add_callback(f)
									
										IOLoop.instance().start()
									
			 hosted with ? by

In this case, the first task will be called, it will print “sleeping” and then it will ask IOLoop to schedule the execution of the rest of the routine after 1 second. IOLoop, having the control again, will fire the second call the function, which will print “sleeping” again and return control to IOLoop. After 1 second IOLoop will carry on where he left with the first function and “awake” will be printed. Finally, the second “awake” will be printed, too. So, the sequence of prints will be: “sleeping”, “sleeping”, “awake!”, “awake!”. The two function calls have been executed concurrently (not in parallel, though!).

So, I hear you asking, “how do I create functions that can be executed asynchronously”? In Tornado, every function that has a “callback” argument can be used with gen.engine.Task. Beware though: being able to use Task does not make the execution asynchronous! There is no magic going on: the function is simply scheduled to execution, executed and whatever is passed tocallback will become the return value of Task. See below:

						
										import time
									
										from tornado.ioloop import IOLoop
									
										from tornado import gen
									
										def my_function(callback):
									
										print 'do some work'
									
										# Note: this line will block!
									
										time.sleep(1)
									
										callback(123)
									
										@gen.engine
									
										def f():
									
										print 'start'
									
										# Call my_function and return here as soon as "callback" is called.
									
										# "result" is whatever argument was passed to "callback" in "my_function".
									
										result = yield gen.Task(my_function)
									
										print 'result is', result
									
										IOLoop.instance().stop()
									
										if __name__ == "__main__":
									
										f()
									
										IOLoop.instance().start()
									
			 hosted with ? by

Most beginners expect to be able to just write: Task(my_func), and automagically execute my_func asynchronously. This is not how Tornado works. This is how works! And this is my last remark:

In a function that is going to be used “asynchronously”, only asynchronous libraries should be used.

By this, I mean that blocking calls like time.sleep or urllib2.urlopen or db.query will need to be substituted by their equivalent asynchronous version. For example, IOLoop.add_timeout instead of time.sleep, AsyncHTTPClient.fetchinstead of urllib2.urlopen etc. For DB queries, the situation is more complicated and specific asynchronous drivers to talk to the DB are needed. For example: Motor for MongoDB.

阅读(1016) | 评论(0) | 转发(0) |

上一篇：理解 tornado.gen

下一篇：Asynchronous programming with Tornado

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6

	# Example of misuse of callback. DON'T DO THIS!
	import time
	from tornado.ioloop import IOLoop


	def blocking_func():
	print 'sleeping'
	time.sleep(1)
	print 'awake!'


	if __name__ == "__main__":
	# Note that code is executed sequantially!
	IOLoop.instance().add_callback(blocking_func)
	IOLoop.instance().add_callback(blocking_func)
	IOLoop.instance().start()

	# Example of non-blocking sleep.
	import time
	from tornado.ioloop import IOLoop
	from tornado import gen


	@gen.engine
	def f():
	print 'sleeping'
	yield gen.Task(IOLoop.instance().add_timeout, time.time() + 1)
	print 'awake!'


	if __name__ == "__main__":
	# Note that now code is executed "concurrently"
	IOLoop.instance().add_callback(f)
	IOLoop.instance().add_callback(f)
	IOLoop.instance().start()