初識Python asynic異步編程

時間 2019-11-17

原文原文鏈接

什麼是異步編程？

同步代碼(synchrnous code)咱們都很熟悉，就是運行完一個步驟再運行下一個。要在同步代碼裏面實現"同時"運行多個任務，最簡單也是最直觀地方式就是運行多個 threads 或者多個 processes。這個層次的『同時運行』多個任務，是操做系統協助完成的。 也就是操做系統的任務調度系統來決定何時運行這個任務，何時切換任務，你本身，做爲一個應用層的程序員，是沒辦法進行干預的。程序員

我相信你也已經據說了什麼關於 thread 和 process 的抱怨：process 過重，thread 又要牽涉到不少頭條的鎖問題。尤爲是對於一個 Python 開發者來講，因爲GIL（全局解釋器鎖）的存在，多線程沒法真正使用多核，若是你用多線程來運行計算型任務，速度會更慢。編程

異步編程與之不一樣的是，值使用一個進程，不使用 threads，可是也能實現"同時"運行多個任務（這裏的任務其實就是函數）。flask

這些函數有一個很是 nice 的 feature：必要的能夠暫停，把運行的權利交給其餘函數。等到時機恰當，又能夠恢復以前的狀態繼續運行。這聽上去是否是有點像進程呢？能夠暫停，能夠恢復運行。只不過進程的調度是操做系統完成的，這些函數的調度是進程本身（或者說程序員你本身）完成的。這也就意味着這將省去了不少計算機的資源，由於進程的調度必然須要大量 syscall，而 syscall 是很昂貴的。服務器

一定義一個簡單的協程：

import asyncio
 
async def execute(x):
    print('Number:', x)
    return x
 
coroutine = execute(1)
print('Coroutine:', coroutine)
print('After calling execute')
 
loop = asyncio.get_event_loop()
task = loop.create_task(coroutine)
print('Task:', task)
loop.run_until_complete(task)
print('Task:', task)
print('After calling loop')

# print('Task Result:', task.result())  這樣也能查看task執行的結果

運行結果：網絡

Coroutine: <coroutine object execute at 0x10e0f7830>
After calling execute
Task: <Task pending coro=<execute() running at demo.py:4>>
Number: 1
Task: <Task finished coro=<execute() done, defined at demo.py:4> result=1>
After calling loop

咱們使用 async 定義了一個 execute() 方法，方法接收一個數字參數，方法執行以後會打印這個數字。
隨後咱們直接調用了這個方法，然而這個方法並無執行，而是返回了一個 coroutine 協程對象。session

隨後咱們使用 get_event_loop() 方法建立了一個事件循環 loop，並調用了 loop 對象的 run_until_complete() 方法將協程註冊到事件循環 loop 中，而後啓動。最後咱們纔看到了 execute() 方法打印了輸出結果。
可見，async 定義的方法就會變成一個沒法直接執行的 coroutine 對象，必須將其註冊到事件循環中才能夠執行。

多線程

咱們也能夠不使用task來運行，它裏面相比 coroutine 對象多了運行狀態，好比 running、finished 等，咱們能夠用這些狀態來獲取協程對象的執行狀況。app

將 coroutine 對象傳遞給 run_until_complete() 方法的時候，實際上它進行了一個操做就是將 coroutine 封裝成了 task 對象，如：異步

import asyncio

async def execute(x):
    print('Number:', x)

coroutine = execute(1)
print('Coroutine:', coroutine)
print('After calling execute')

loop = asyncio.get_event_loop()
loop.run_until_complete(coroutine)
print('After calling loop')

View Code

查看了源碼，正好能夠驗證上面這一觀點：async

run_until_complete()這個方法位於源碼中的base_events.py，函數有句註釋：
Run until the Future is done.If the argument is a coroutine, it is wrapped in a Task.

二發送網絡請求結合aiohttp實現異步：

咱們用一個網絡請求做爲示例，這就是一個耗時等待的操做，由於咱們請求網頁以後須要等待頁面響應並返回結果。耗時等待的操做通常都是 IO 操做，好比文件讀取、網絡請求等等。協程對於處理這種操做是有很大優點的，當遇到須要等待的狀況的時候，程序能夠暫時掛起，轉而去執行其餘的操做，從而避免一直等待一個程序而耗費過多的時間，充分利用資源。爲了測試，我本身先經過flask 建立一個實驗環境：

from flask import Flask
import time
 
app = Flask(__name__)
 
@app.route('/')
def index():
    time.sleep(3)
    return 'Hello!'
 
if __name__ == '__main__':
    app.run(threaded=True)

開始測試...

import asyncio
import aiohttp
import time
 
start = time.time()
 
async def get(url):
    session = aiohttp.ClientSession()
    response = await session.get(url)
    result = await response.text()
    session.close()
    return result
 
async def request():
    url = 'http://127.0.0.1:5000'          # 訪問flask搭建的服務器（睡眠3秒），模仿IO阻塞
    print('Waiting for', url)
    result = await get(url)
    print('Get response from', url, 'Result:', result)
 
tasks = [asyncio.ensure_future(request()) for _ in range(5)]
loop = asyncio.get_event_loop()
loop.run_until_complete(asyncio.wait(tasks))
 
end = time.time()
print('Cost time:', end - start)

運行結果:

Waiting for http://127.0.0.1:5000
Waiting for http://127.0.0.1:5000
Waiting for http://127.0.0.1:5000
Waiting for http://127.0.0.1:5000
Waiting for http://127.0.0.1:5000
Get response from http://127.0.0.1:5000 Result: Hello!
Get response from http://127.0.0.1:5000 Result: Hello!
Get response from http://127.0.0.1:5000 Result: Hello!
Get response from http://127.0.0.1:5000 Result: Hello!
Get response from http://127.0.0.1:5000 Result: Hello!
Cost time: 3.0199508666992188

咱們發現此次請求的耗時由 15 秒變成了 3 秒，耗時直接變成了原來的 1/5。

代碼裏面咱們使用了 await，後面跟了 get() 方法，在執行這五個協程的時候，若是遇到了 await，那麼就會將當前協程掛起，轉而去執行其餘的協程，直到其餘的協程也掛起或執行完畢，再進行下一個協程的執行。