Golang標準庫之Buffer

時間 2019-11-10

標籤 golang 標準 buffer 欄目 Go 简体版

原文原文鏈接

Buffer
bootstrap

Go標準庫Buffer是一個可變大小的字節緩衝區，能夠用Wirte和Read方法操做它，在Go標準庫中,定義了以下關於Buffer的數據結構。數據結構

type Buffer struct {
    buf       []byte            // contents are the bytes buf[off : len(buf)]
    off       int               // read at &buf[off], write at &buf[len(buf)]
    runeBytes [utf8.UTFMax]byte // avoid allocation of slice on each WriteByte or Rune
    bootstrap [64]byte          // memory to hold first slice; helps small buffers (Printf) avoid allocation.
    lastRead  readOp            // last read operation, so that Unread* can work correctly.
}
// The readOp constants describe the last action performed on
// the buffer, so that UnreadRune and UnreadByte can
// check for invalid usage.
type readOp int
const (
    opInvalid  readOp = iota // Non-read operation.
    opReadRune               // Read rune.
    opRead                   // Any other read operation.
)

如上定義，Buffer存儲的數據是在off到len(buf)區域之間，其餘區域是沒有數據，並且只能從&buf[off]開始讀取數據和從&buf[len(buf)]寫數據，同時爲了不對內存的屢次操做，對於小的緩衝區，Buffer定義了bootstrap來避免屢次內存的操做，runeBytes的定義也是如此目的，還有一個表示對Buffer的操做標識符lastRead。app

Buffer的常見操做ide

初始化Buffer編碼
```
func NewBuffer(buf []byte) *Buffer { return &Buffer{buf: buf} }
func NewBufferString(s string) *Buffer {
    return &Buffer{buf: []byte(s)}
}
```
方法NewBuffer使用buf做爲參數初始化Buffer，Buffer既能夠被讀也能夠被寫，若是是讀Buffer，buf需填充必定的數據，若是是寫，buf需有必定的容量(capacity)，固然也能夠經過new(Buffer)來初始化Buffer。另一個方法NewBufferString用一個string來初始化可讀Buffer，並用string的內容填充Buffer.spa

讀寫操做code

func (b *Buffer) Read(p []byte) (n int, err error)
func (b *Buffer) Next(n int) []byte
func (b *Buffer) ReadByte() (c byte, err error)
func (b *Buffer) ReadRune() (r rune, size int, err error)
func (b *Buffer) ReadBytes(delim byte) (line []byte, err error)
func (b *Buffer) readSlice(delim byte) (line []byte, err error)
func (b *Buffer) ReadString(delim byte) (line string, err error)
func (b *Buffer) Write(p []byte) (n int, err error)
func (b *Buffer) WriteString(s string) (n int, err error)
func (b *Buffer) ReadFrom(r io.Reader) (n int64, err error)
func (b *Buffer) WriteTo(w io.Writer) (n int64, err error)
func (b *Buffer) WriteByte(c byte) error
func (b *Buffer) WriteRune(r rune) (n int, err error)

下面對Read，ReadRune，ReadBytes方法進行分析，對於方法Read, 其主要作三個步驟：第一，判斷Buffer是否爲空，若是是，則重置Buffer；第二，複製Buffer的buf的數據到p,並調整off的位置標識Buffer的可讀位置；第三，設置讀標識符爲opRead。orm

func (b *Buffer) Read(p []byte) (n int, err error) {
    b.lastRead = opInvalid
    if b.off >= len(b.buf) {
        // Buffer is empty, reset to recover space.
        b.Truncate(0)
        if len(p) == 0 {
            return
        }
        return 0, io.EOF
    }
    n = copy(p, b.buf[b.off:])
    b.off += n
    if n > 0 {
        b.lastRead = opRead
    }
    return
}

方法ReadRune()定義瞭如何讀取Buffer中UTF8編碼的rune數據，一樣也需三個步驟，第一，判斷Buffer是否爲空，如果，重置Buffer；第二，設置讀操做符爲opReadRune；第三，判斷可讀位置off處的byte是否小於utf8.Runeself，如果，調整off位置並返回。不然，將Buffer的數據解碼成rune，調整off位置，返回解碼後的rune及大小。接口

// ReadRune reads and returns the next UTF-8-encoded
// Unicode code point from the buffer.
// If no bytes are available, the error returned is io.EOF.
// If the bytes are an erroneous UTF-8 encoding, it
// consumes one byte and returns U+FFFD, 1.
func (b *Buffer) ReadRune() (r rune, size int, err error) {
    b.lastRead = opInvalid
    if b.off >= len(b.buf) {
        // Buffer is empty, reset to recover space.
        b.Truncate(0)
        return 0, 0, io.EOF
    }
    b.lastRead = opReadRune
    c := b.buf[b.off]
    if c < utf8.RuneSelf {
        b.off++
        return rune(c), 1, nil
    }
    r, n := utf8.DecodeRune(b.buf[b.off:])
    b.off += n
    return r, n, nil
}

方法ReadBytes(delim byte)讀取Buffer中從off到第一次delim之間的數據，而且包括delim，ReadBytes調用私有方法readSlice來實現，readSlice方法首先查找delim的位置，若是不存在，則返回從off到len(buf)之間的數據，若是存在，則返回off到off+location(delim)+1之間數據，其中加1是爲了包括delim，最後設置操做標識符爲opRead。內存

// ReadBytes reads until the first occurrence of delim in the input,
// returning a slice containing the data up to and including the delimiter.
// If ReadBytes encounters an error before finding a delimiter,
// it returns the data read before the error and the error itself (often io.EOF).
// ReadBytes returns err != nil if and only if the returned data does not end in
// delim.
func (b *Buffer) ReadBytes(delim byte) (line []byte, err error) {
    slice, err := b.readSlice(delim)
    // return a copy of slice. The buffer's backing array may
    // be overwritten by later calls.
    line = append(line, slice...)
    return
}
// readSlice is like ReadBytes but returns a reference to internal buffer data.
func (b *Buffer) readSlice(delim byte) (line []byte, err error) {
    i := IndexByte(b.buf[b.off:], delim)
    end := b.off + i + 1
    if i < 0 {
        end = len(b.buf)
        err = io.EOF
    }
    line = b.buf[b.off:end]
    b.off = end
    b.lastRead = opRead
    return line, err
}

一樣對相應的Write，WriteRune，ReadFrom， WriteTo寫方法進行分析, 對於方法Write，相對Read方法來講，要簡單些，主要是擴展Buffer空間，而後將p中的數據複製到Buffer。

// Write appends the contents of p to the buffer, growing the buffer as
// needed. The return value n is the length of p; err is always nil. If the
// buffer becomes too large, Write will panic with ErrTooLarge.
func (b *Buffer) Write(p []byte) (n int, err error) {
    b.lastRead = opInvalid
    m := b.grow(len(p))
    return copy(b.buf[m:], p), nil
}

對於方法WriteRune,首先判斷要寫的數據rune是否小於utf8.RuneSelf，如果，調用WriteByte將其寫入Buffer，若不是，則將要寫的數據rune編碼成utf8,並調用Write將其寫入Buffer。

// WriteRune appends the UTF-8 encoding of Unicode code point r to the
// buffer, returning its length and an error, which is always nil but is
// included to match bufio.Writer's WriteRune. The buffer is grown as needed;
// if it becomes too large, WriteRune will panic with ErrTooLarge.
func (b *Buffer) WriteRune(r rune) (n int, err error) {
    if r < utf8.RuneSelf {
        b.WriteByte(byte(r))
        return 1, nil
    }
    n = utf8.EncodeRune(b.runeBytes[0:], r)
    b.Write(b.runeBytes[0:n])
    return n, nil
}

ReadFrom方法從io.Reader或者實現io.Reader接口的實例中讀取全部數據到Buffer，默認狀況下最少讀取512字節，若是Buffer空間不足512，需增長Buffer空間，該方法返回讀取的字節數以及錯誤信息。從下面可知ReadFrom首先判斷Buffer是否爲空，若空，則重置Buffer；其次是判斷Buffer的free空間是否足夠，若小於512且off+free小於512，表示Buffer從0到off之間的空間不足以存放當前Buffer中未讀數據的大小，此時設置一臨時緩衝區並使其空間Buffer的2倍加上MinRead(512)的空間，將原來Buffer的數據複製到臨時緩衝區，而後再把臨時緩衝區的數據複製到源Buffer，最後使用io.Reader的Read方法從io.Reader中讀取數據，直到遇到io.EOF。

// ReadFrom reads data from r until EOF and appends it to the buffer, growing
// the buffer as needed. The return value n is the number of bytes read. Any
// error except io.EOF encountered during the read is also returned. If the
// buffer becomes too large, ReadFrom will panic with ErrTooLarge.
func (b *Buffer) ReadFrom(r io.Reader) (n int64, err error) {
    b.lastRead = opInvalid
    // If buffer is empty, reset to recover space.
    if b.off >= len(b.buf) {
        b.Truncate(0)
    }
    for {
        if free := cap(b.buf) - len(b.buf); free < MinRead {
            // not enough space at end
            newBuf := b.buf
            if b.off+free < MinRead {
                // not enough space using beginning of buffer;
                // double buffer capacity
                newBuf = makeSlice(2*cap(b.buf) + MinRead)
            }
            copy(newBuf, b.buf[b.off:])
            b.buf = newBuf[:len(b.buf)-b.off]
            b.off = 0
        }
        m, e := r.Read(b.buf[len(b.buf):cap(b.buf)])
        b.buf = b.buf[0 : len(b.buf)+m]
        n += int64(m)
        if e == io.EOF {
            break
        }
        if e != nil {
            return n, e
        }
    }
    return n, nil // err is EOF, so return nil explicitly
}

相對ReadFrom方法，WriteTo方法比較簡單，WriteTo將Buffer中的數據寫到io.Writer，直到Buffer中沒有數據，當Buffer爲空時，重置Buffer並返回。

// WriteTo writes data to w until the buffer is drained or an error occurs.
// The return value n is the number of bytes written; it always fits into an
// int, but it is int64 to match the io.WriterTo interface. Any error
// encountered during the write is also returned.
func (b *Buffer) WriteTo(w io.Writer) (n int64, err error) {
    b.lastRead = opInvalid
    if b.off < len(b.buf) {
        nBytes := b.Len()
        m, e := w.Write(b.buf[b.off:])
        if m > nBytes {
            panic("bytes.Buffer.WriteTo: invalid Write count")
        }
        b.off += m
        n = int64(m)
        if e != nil {
            return n, e
        }
        // all bytes should have been written, by definition of
        // Write method in io.Writer
        if m != nBytes {
            return n, io.ErrShortWrite
        }
    }
    // Buffer is now empty; reset.
    b.Truncate(0)
    return
}

3. 擴展空間和重置

Buffer的重置方法Reset()經過調用Truncate(n int)方法來實現清除Buffer的數據，Truncate丟棄除了從off開始的n個未讀數據以外的全部數據, 若是n爲0，那就重置Buffer。

// Truncate discards all but the first n unread bytes from the buffer.
// It panics if n is negative or greater than the length of the buffer.
func (b *Buffer) Truncate(n int) {
    b.lastRead = opInvalid
    switch {
    case n < 0 || n > b.Len():
        panic("bytes.Buffer: truncation out of range")
    case n == 0:
        // Reuse buffer space.
        b.off = 0
    }
    b.buf = b.buf[0 : b.off+n]
}
// Reset resets the buffer so it has no content.
// b.Reset() is the same as b.Truncate(0).
func (b *Buffer) Reset() { b.Truncate(0) }

在對Buffer進行寫數據時，一般須要擴展其空間來使全部的數據都能寫入Buffer，Buffer用Grow(n int)方法來實現擴展Buffer空間的功能，該方法調用私有方法grow(n int)。

// grow grows the buffer to guarantee space for n more bytes.
// It returns the index where bytes should be written.
// If the buffer can't grow it will panic with ErrTooLarge.
func (b *Buffer) grow(n int) int {
    m := b.Len()
    // 若是Buffer爲空，重置Buffer
    if m == 0 && b.off != 0 {
        b.Truncate(0)
    }
    //空間增長n後超過Buffer的容量
    if len(b.buf)+n > cap(b.buf) {
        //聲明一個臨時buf
        var buf []byte
        //Buffer的buf只是被聲明，尚未初始化，若是n小於bootstrap的空間，
        //直接將boostrap賦值給buf避免內存的操做而增長負載。
        if b.buf == nil && n <= len(b.bootstrap) {
            buf = b.bootstrap[0:]
        //若是知足此條件，滑動b.buf的數據而不是分配一個新的slice空間，而後將b.buf的數據複製給你buf。
        } else if m+n <= cap(b.buf)/2 {
            copy(b.buf[:], b.buf[b.off:])
            buf = b.buf[:m]
        } else {
            //空間不足，從新分配空間
            buf = makeSlice(2*cap(b.buf) + n)
            copy(buf, b.buf[b.off:])
        }
        b.buf = buf
        b.off = 0
    }
    //擴展n的空間，並返回能夠寫數據的位置
    b.buf = b.buf[0 : b.off+m+n]
    return b.off + m
}
// Grow grows the buffer's capacity, if necessary, to guarantee space for
// another n bytes. After Grow(n), at least n bytes can be written to the
// buffer without another allocation.
// If n is negative, Grow will panic.
// If the buffer can't grow it will panic with ErrTooLarge.
func (b *Buffer) Grow(n int) {
    if n < 0 {
        panic("bytes.Buffer.Grow: negative count")
    }
    m := b.grow(n)
    b.buf = b.buf[0:m]
}

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。