NIO系列1：框架拆解

時間 2019-11-16

標籤 nio 系列框架拆解欄目 Netty 简体版

原文原文鏈接

最近一年用NIO寫了很多網絡程序，也研究了一些開源NIO網絡框架netty、mina等，總結了一下NIO的架構特色。java

不管是netty仍是mina它們都在java原生NIO的基礎上進行了完善的封裝，雖然細節有所不一樣，但整體架構思路一致，都大概劃分出了如下幾個組成部分：react

- - transport：傳輸層的抽象程序員

- - protocol：協議codec的抽象緩存

- - event model：統一事件模型安全

- - buffer：底層buffer封裝網絡

在徹底屏蔽底層API的同時，對上層應用提供了自身的統一API接口。session

框架進行黑盒封裝的同時，再進行通用化的接口開放，帶來的好處是統一化，但壞處是程序的透明度下降，抽象度提升，增長理解難度和實現難度。多線程

下面說說每一個部分的一些設計考量：架構

transport傳輸層抽象都是對java原生NIO API的封裝，在這一層封裝的程度在於框架的實現目標。例如mina立足於通用的網絡框架，所以徹底屏蔽了原生的API，提供了自身的統一接口，由於它不只須要封裝NIO的API還有一系列其餘類型的IO操做的API，提供統一API接口。爲了通用兼顧各種傳輸通道所以可能不得不暴露多餘的API接口，使用方需甄別傳輸通道的不一樣，增長了理解難度。併發

protocol封裝各種經常使用協議的codec操做，但目前這些網絡框架的codec實現都與自身的API緊密綁定，下降了可重用性。

event model 事件模型的設計一般不能徹底獨立，例如java NIO自己的模型是事件驅動的，但傳統阻塞型IO並不是事件驅動，要兼顧兩者一般要付出額外的代價和開銷。

有一種說法是讓異步IO同步化使用（由於同步化使用更簡單，異步致使了業務處理的碎片化）到底對不對值得商榷？模型阻抗致使的代價和開銷屏蔽在了黑盒中，也容易誤導應用程序員對本該採用同步化處理的業務卻濫用了異步化機制，並不會帶來什麼好處。

buffer 一般都用來配合底層IO數據流和協議codec使用，自己是否適合暴露給應用方取決於框架是否整合codec，由於codec自己帶有業務性質，而純粹的IO數據流處理使用的buffer則徹底無需暴露給應用方。

以上簡單說了下NIO框架各部分的設計考量，能夠看出目前流行的NIO框架（netty和mina）都在走一條相似「瑞士軍刀」的路線，集各類功能與一身（多種IO封裝、協議封裝），但你又很難把瑞士軍刀上的某個刀片拆下來單獨使用。

在實踐中感受，考慮從單一性、簡潔性、重用性、組合性、透明性幾個方面去設計原子化的IO組件也許更可取，更像是一種「工具箱」路線。

典型的事件驅動模型NIO框架組件交互圖以下：

Acceptor：負責監聽鏈接事件負責接入

Processor：負責IO讀寫事件處理

EventDispatcher：負責事件派發

Handler：業務處理器

後面將經過一個系列文章來討論一個原子化的NIO組件實現的細節及設計考量。

NIO系列2：TCP監聽綁定

分類：踏莎行·術2012-04-01 08:09 1241人閱讀評論(0) 收藏舉報

bind nio NIO tcp TCP

注：本文適合對象需對java NIO API的使用及異步事件模型（Reactor模式）有必定程度的瞭解，主要講述使用java原生NIO實現一個TCP監聽綁定的過程及細節設計。

咱們一開始設計了一個TCP接入服務類，這個類提供了一個API方法提供對本地一系列地址（端口）的監聽綁定，類初始化後完成Selector的open操做以下：

[java] view plain copy

selector = Selector.open();

提供的綁定API，其方法簽名以下：

[java] view plain copy

/**
* Binds to the specified local addresses and start to accept incoming connections. If any address binding failed then
* rollback the already binding addresses. Bind is fail fast, if encounter the first bind exception then throw it immediately.
*
* @param firstLocalAddress
* @param otherLocalAddresses
* @throws throw if bind failed.
*/
synchronized public void bind(SocketAddress firstLocalAddress, SocketAddress... otherLocalAddresses) throws IOException；

爲什麼須要同步？由於咱們不但願多個線程同時調用該方法，致使地址綁定異常。

參數中能夠傳遞多個本地地址（端口）同時進行監聽綁定。

在NIO的綁定過程當中需進行事件註冊（對OP_ACCEPT感興趣），以下：

[java] view plain copy

ServerSocketChannel ssc = ServerSocketChannel.open();
ssc.configureBlocking(false);
ServerSocket ss = ssc.socket();
ss.setReuseAddress(config.isReuseAddress());
ss.bind(address, config.getBacklog());
ssc.register(selector, SelectionKey.OP_ACCEPT);

因爲註冊過程當中除了涉及鎖競爭還可能產生死鎖，因此通常的作法都是將綁定地址放在隊列中進行異步註冊由reactor線程進行處理，例如：

[java] view plain copy

bindAddresses.addAll(localAddresses);
if (!bindAddresses.isEmpty()) {
synchronized (lock) {
// wake up for unblocking the select() to process binding addresses
selector.wakeup();
// wait for bind result
wait0();
}
}

從同步註冊變爲異步註冊後就存在一個問題，實際註冊綁定時可能存在端口已綁定的異常，在異步狀況下就須要線程間通訊來通知異常消息，並向調用方反饋。

如上面代碼片斷中的wait0()方法就是等待綁定結果，若出現綁定異常則拋出

[java] view plain copy

private void wait0() throws IOException {
while (!this.endFlag) {
try {
lock.wait();
} catch (InterruptedException e) {
throw new IOException(e);
}
}
// reset end flag
this.endFlag = false;
if (this.exception != null) {
IOException e = exception;
this.exception = null;
throw e;
}

以上代碼也說明了，NIO異步模型轉化爲同步API致使的模型阻抗付出了額外的代價和開銷 --- 線程間通訊。

至此，完成了TCP服務監聽過程，下文將進一步講述服務接入和數據傳輸相關設計細節。

NIO系列3：TCP服務接入

分類：踏莎行·術2012-04-01 08:10 1181人閱讀評論(0) 收藏舉報

nio NIO tcp TCP

注：本文適合對象需對java NIO API的使用及異步事件模型（Reactor模式）有必定程度的瞭解，主要講述使用java原生NIO實現一個TCP服務的過程及細節設計。

前文講述了NIO TCP服務綁定過程的實現機制，如今能夠開始講述服務監聽啓動後如何和處理接入和數據傳輸相關的細節設計。

在NIO的接入類中有一個Reactor線程，用於處理OP_ACCEPT事件通知，以下:

[java] view plain copy

private class AcceptThread extends Thread {
public void run() {
while (selectable) {
try {
int selected = selector.select();
if (selected > 0) {
accept();
}
// bind addresses to listen
bind0();
// unbind canceled addresses
unbind0();
} catch (Exception e) {
LOG.error("Unexpected exception caught while accept", e);
}
}
// if selectable == false, shutdown the acceptor
try {
shutdown0();
} catch (Exception e) {
LOG.error("Unexpected exception caught while shutdown", e);
}
}

當有客戶端接入時selector.select()方法返回大於0的整數，並進入accept()方法進行處理，具體以下：

[java] view plain copy

private void accept() {
Iterator<SelectionKey> it = selector.selectedKeys().iterator();
while (it.hasNext()) {
SelectionKey key = it.next();
it.remove();
AbstractSession session = (AbstractSession) acceptByProtocol(key);
Processor processor = pool.get(session);
session.setProcessor(processor);
processor.add(session);
}
}

[java] view plain copy

protected Session acceptByProtocol(SelectionKey key) {
if (key == null || !key.isValid() || !key.isAcceptable()) {
<span style="white-space:pre"> </span>return null;
<span style="white-space:pre"> </span>}
ServerSocketChannel ssc = (ServerSocketChannel) key.channel();
SocketChannel sc = null;
try {
sc = ssc.accept();
if(sc == null) {
return null;
}
sc.configureBlocking(false);
} catch (IOException e) {
LOG.warn(e.getMessage(), e);
if(sc != null) {
try {
sc.close();
} catch (IOException ex) {
LOG.warn(ex.getMessage(), ex);
}
}
}
Session session = new TcpSession(sc, config);
return session;
}

爲每個接入的客戶端經過調用NIO原生accept()方法返回一個SocketChannel的抽象，並封裝成一個session對象（session的概念來自mina框架）

注意：此時與客戶鏈接的通道還沒有註冊對讀/寫事件感興趣，由於它的註冊與前文綁定過程同樣須要異步進行。

所以將封裝通道的session轉交給一個processor對象（io讀寫處理器，該概念也是來自mina），processor內部維持了一個新建session的隊列，在其內部reactor線程循環中進行註冊處理。

有關processor處理讀寫事件的細節設計見下文。

NIO系列4：TCP服務數據讀寫

分類：踏莎行·術2012-04-01 08:10 1289人閱讀評論(0) 收藏舉報

nio NIO tcp TCP

上文講到當客戶端完成與服務端的鏈接創建後，爲其SocketChannel封裝了一個session對象表明這個鏈接，並交給processor處理。

processor的內部有3個重要的隊列，分別存放新建立的session、須要寫數據的session和準備關閉的session，以下：

[java] view plain copy

/** A Session queue containing the newly created sessions */
private final Queue<AbstractSession> newSessions = new ConcurrentLinkedQueue<AbstractSession>();
/** A queue used to store the sessions to be flushed */
private final Queue<AbstractSession> flushingSessions = new ConcurrentLinkedQueue<AbstractSession>();
/** A queue used to store the sessions to be closed */
private final Queue<AbstractSession> closingSessions = new ConcurrentLinkedQueue<AbstractSession>();

在processor的reactor循環處理線程中，每輪循環的處理包括以下步驟：

1. selector.select()，其中爲了處理鏈接超時的狀況，select方法中傳遞了超時參數以避免其永久阻塞，一般是1秒。該方法即時在沒有事件發生時每秒返回一次，進入循環檢測超時

[java] view plain copy

int selected = selector.select(SELECT_TIMEOUT);

2. 從select返回後，首先檢查newSessions隊列是否有新的session加入，併爲其註冊監聽事件（讀事件監聽）。session只有在註冊完事件後，咱們才認爲其狀態爲open並派發打開事件。（關於session狀態，有建立、打開、關閉中、已關閉幾種）

[java] view plain copy

for (AbstractSession session = newSessions.poll(); session != null; session = newSessions.poll()) {
SelectableChannel sc = session.getChannel();
SelectionKey key = sc.register(selector, SelectionKey.OP_READ, session);
session.setSelectionKey(key);
// set session state open, so we can read / write
session.setOpened();
// fire session opened event
eventDispatcher.dispatch(new Event(EventType.SESSION_OPENED, session, null, handler));
n++;
}

3. 有讀/寫事件時，進行相關處理，每次讀寫事件發生時更新一次最後的IO時間。

[java] view plain copy

// set last IO time
session.setLastIoTime(System.currentTimeMillis());
// Process reads
if (session.isOpened() && isReadable(session)) {
read(session);
}
// Process writes
if (session.isOpened() && isWritable(session)) {
asyWrite(session);
}

讀取數據時有一個小技巧在於靈活自適應buffer分配（來自mina的一個實現策略），每次判斷讀取到的字節數若乘以2依然小於buffer大小，則收縮buffer爲原來一半，若讀取的字節數已裝滿buffer則擴大一倍。

[java] view plain copy

int readBytes = 0;
int ret;
while ((ret = ((SocketChannel) session.getChannel()).read(buf)) > 0) {
readBytes += ret;
if (!buf.hasRemaining()) {
break;
}
}
if (readBytes > 0) {
if ((readBytes << 1) < session.getReadBufferSize()) {
shrinkReadBufferSize(session);
} else if (readBytes == session.getReadBufferSize()) {
extendReadBufferSize(session);
}
fireMessageReceived(session, buf, readBytes);
}
// read end-of-stream, remote peer may close channel so close session.
if (ret < 0) {
asyClose(session);
}

處理寫操做實際上是異步的，老是放入flushSessions中等待寫出。

[java] view plain copy

private void asyWrite(AbstractSession session) {
// Add session to flushing queue, soon after it will be flushed in the same select loop.
flushingSessions.add(session);

4. 如有須要寫數據的session，則進行flush操做。

寫事件通常默認都是不去關注的，由於在TCP緩衝區可寫或遠端斷開或IO錯誤發生時都會觸發該事件，容易誘發服務端忙循環從而CPU100%問題。爲了保證讀寫公平，寫buffer的大小設置爲讀buffer的1.5倍（來自mina的實現策略），每次寫數據前設置爲對寫事件再也不感興趣。限制每次寫出數據大小的緣由除了避免讀寫不公平，也避免某些鏈接有大量數據須要寫出時一次佔用了過多的網絡帶寬而其餘鏈接的數據寫出被延遲從而影響了公平性。

[java] view plain copy

// First set not be interested to write event
etInterestedInWrite(session, false);

首先向TCP緩衝區寫出數據（NIO的原生API操做都是不阻塞的）

[java] view plain copy

int qota = maxWrittenBytes - writtenBytes;
int localWrittenBytes = write(session, buf, qota);

寫完後根據返回的寫出數據字節數，可能存在如下多種狀況：

- - buffer一次寫完，則派發消息已經發送事件

[java] view plain copy

// The buffer is all flushed, remove it from write queue
if (!buf.hasRemaining()) {
if (LOG.isDebugEnabled()) {
LOG.debug("The buffer is all flushed, remove it from write queue");
}
writeQueue.remove();
// fire message sent event
eventDispatcher.dispatch(new Event(EventType.MESSAGE_SENT, session, buf.array(), handler));
}

- - 若返回的寫入字節數爲0，多是TCP緩存buffer已滿，則註冊對寫事件感興趣，稍待下次再寫。

[java] view plain copy

// 0 byte be written, maybe kernel buffer is full so we re-interest in writing and later flush it.
if (localWrittenBytes == 0) {
if (LOG.isDebugEnabled()) {
LOG.debug("0 byte be written, maybe kernel buffer is full so we re-interest in writing and later flush it");
}
setInterestedInWrite(session, true);
flushingSessions.add(session);
return;
}

- - 若一次寫入沒有寫完buffer中的數據，依然註冊對寫事件感興趣，稍待下次再寫。

[java] view plain copy

// The buffer isn't empty(bytes to flush more than max bytes), we re-interest in writing and later flush it.
if (localWrittenBytes > 0 && buf.hasRemaining()) {
if (LOG.isDebugEnabled()) {
LOG.debug("The buffer isn't empty(bytes to flush more than max bytes), we re-interest in writing and later flush it");
}
setInterestedInWrite(session, true);
flushingSessions.add(session);
return;
}

- - 一次寫入數據太多時，爲了保證公平性，依然下次再寫入

[java] view plain copy

// Wrote too much, so we re-interest in writing and later flush other bytes.
if (writtenBytes >= maxWrittenBytes && buf.hasRemaining()) {
if (LOG.isDebugEnabled()) {
LOG.debug("Wrote too much, so we re-interest in writing and later flush other bytes");
}
setInterestedInWrite(session, true);
flushingSessions.add(session);
return;
}

5. 有須要關閉的session，則進行關閉操做。引起關閉session的操做可能來自應用方主動關閉，也多是因爲IO異常後自動關閉。因爲關閉session可能存在多線程調用，爲了不鎖同步，咱們經過狀態檢測來規避用鎖機制提升效率。

關閉session的操做具體來講就是對channel.close()和key.cancel()，這2個操做後其實尚未徹底釋放socket佔用的文件描述符，需等到下次select()操做後，一些NIO框架會主動調用，因爲咱們這裏select(TIMEOUT)帶有超時參數會自動喚醒，所以不存在這個問題。

[java] view plain copy

private int close() throws IOException {
int n = 0;
for (AbstractSession session = closingSessions.poll(); session != null; session = closingSessions.poll()) {
if (LOG.isDebugEnabled()) { LOG.debug("Closing session: " + session); }
if (session.isClosed()) {
if (LOG.isDebugEnabled()) { LOG.debug("Escape close session, it has been closed: " + session); }
continue;
}
session.setClosing();
close(session);
n++;
session.setClosed();
// fire session closed event
eventDispatcher.dispatch(new Event(EventType.SESSION_CLOSED, session, null, handler));
if (LOG.isDebugEnabled()) { LOG.debug("Closed session: " + session); }
}
return n;

NIO系列5：事件模型

分類：踏莎行·術2012-04-01 08:10 1263人閱讀評論(0) 收藏舉報

event Event nio NIO

前文講述了NIO數據讀寫處理，那麼這些數據最終如何被遞交給上層業務程序進行處理的呢？

NIO框架通常都採用了事件派發模型來與業務處理器交互，它與原生NIO的事件機制是模型匹配的，缺點是帶來了業務處理的碎片化。須要業務程序開發者對事件的生命週期有一個清晰的瞭解，不像傳統方式那麼直觀。

事件派發器（EventDispatcher）就成爲了NIO框架中IO處理線程和業務處理回調接口（Handler）之間的橋樑。

因爲業務處理的時間長短是難以肯定的，因此通常事件處理器都會分離IO處理線程，使用新的業務處理線程池來進行事件派發，回調業務接口實現。

下面經過一段示例代碼來講明事件的派發過程：

這是processor從網絡中讀取到一段字節後發起的MESSAGE_RECEIVED事件，調用了eventDispatcher.dispatch(Event e)方法。

[java] view plain copy

private void fireMessageReceived(AbstractSession session, ByteBuffer buf, int length) {
// fire message received event, here we copy buffer bytes to a new byte array to avoid handler expose <code>ByteBuffer</code> to end user.
byte[] barr = new byte[length];
System.arraycopy(buf.array(), 0, barr, 0, length);
eventDispatcher.dispatch(new Event(EventType.MESSAGE_RECEIVED, session, barr, handler));
}

dispatch的方法實現有如下關鍵點須要考慮：

1. 事件派發是多線程的，派發線程最終會調用業務回調接口來進行事件處理，回調接口由業務方實現自身去保證線程併發性和安全性。

2. 對於TCP應用來講，由同一session（這裏可表明同一個鏈接）收到的數據必須保證有序派發，不一樣的session可無序。

3. 不一樣session的事件派發要儘量保證公平性，例如：session1有大量事件產生致使派發線程繁忙時，session2產生一個事件不會由於派發線程都在忙於處理session1的事件而被積壓，session2的事件也能儘快獲得及時派發。

下面是一個實現思路的代碼示例：

[java] view plain copy

public void dispatch(Event event) {
AbstractSession s = (AbstractSession) event.getSession();
s.add(event);
if (!s.isEventProcessing()) {
squeue.offer(s);
}
}

爲了保證每一個session的事件有序，咱們將事件存放在每一個session自身包含的隊列中，而後再將session放入一個公共的阻塞隊列中。

有一組worker線程在監聽阻塞隊列，一旦有session進入隊列，它們被激活對session進行事件派發，以下：

[java] view plain copy

public void run() {
try {
for (AbstractSession s = squeue.take(); s != null; s = squeue.take()) {
// first check any worker is processing this session? if any other worker thread is processing this event with same session, just ignore it.
synchronized (s) {
if (!s.isEventProcessing()) {
s.setEventProcessing(true);
} else {
continue;
}
}
// fire events with same session
fire(s);
// last reset processing flag and quit current thread processing
s.setEventProcessing(false);
// if remaining events, so re-insert to session queue
if (s.getEventQueue().size() > 0 && !s.isEventProcessing()) {
squeue.offer(s);
}
}
} catch (InterruptedException e) {
LOG.warn(e.getMessage(), e);
}
}