新手學習FFmpeg - 調用API完成錄屏

時間 2019-11-10

標籤新手學習 ffmpeg 調用 api 完成简体版

原文原文鏈接

調用FFMPEG Device API完成Mac錄屏功能。git

調用FFMPEG提供的API來完成錄屏功能，大體的思路是:github

打開輸入設備.
打開輸出設備.
從輸入設備讀取視頻流，而後通過解碼->編碼，寫入到輸出設備.

+--------------------------------------------------------------+
|   +---------+    decode               +------------+         |
|   | Input   | ----------read -------->|  Output    |         |
|   +---------+                 encode  +------------+         |
+--------------------------------------------------------------+

所以主要使用的API就是:框架

avformat_open_input
avcodec_find_decoder
av_read_frame
avcodec_send_packet/avcodec_receive_frame
avcodec_send_frame/avcodec_receive_packet

打開輸入設備

若是使用FFmpeg提供的-list_devices 命令能夠查詢到當前支持的設備，其中分爲兩類:ide

AVFoundation video devices
AVFoundation audio devices

AVFoundation 是Mac特有的基於時間的多媒體處理框架。本次是演示錄屏功能，所以忽略掉audio設備，只考慮video設備。在avfoundation.m文件中沒有發現能夠程序化讀取設備的API。FFmpeg官方也說明沒有程序化讀取設備的方式，通用方案是解析日誌來獲取設備(https://trac.ffmpeg.org/wiki/DirectShow#Howtoprogrammaticallyenumeratedevices)，下一篇再研究如何經過日誌獲取當前支持的設備，本次就直接寫死設備ID。函數

獲取指定格式的輸入設備

pAVInputFormat = av_find_input_format("avfoundation");

經過指定格式名稱獲取到AVInputFormat結構體。編碼

打開設備

value = avformat_open_input(&pAVFormatContext, "1", pAVInputFormat, &options);
    if (value != 0) {
        cout << "\nerror in opening input device";
        exit(1);
    }

"1"指代的是設備ID。 options是打開設備時輸入參數，日誌

// 記錄鼠標
    value = av_dict_set(&options, "capture_cursor", "1", 0);
    if (value < 0) {
        cout << "\nerror in setting capture_cursor values";
        exit(1);
    }

    // 記錄鼠標點擊事件
    value = av_dict_set(&options, "capture_mouse_clicks", "1", 0);
    if (value < 0) {
        cout << "\nerror in setting capture_mouse_clicks values";
        exit(1);
    }

    // 指定像素格式
    value = av_dict_set(&options, "pixel_format", "yuyv422", 0);
    if (value < 0) {
        cout << "\nerror in setting pixel_format values";
        exit(1);
    }

經過value值判斷設備是否正確打開。而後獲取設備視頻流ID(解碼數據包時須要判斷是否一致)，再獲取輸入編碼器(解碼時須要)。code

打開輸出設備

假設須要將從輸入設備讀取的數據保存成mp4格式的文件。orm

將視頻流保存到文件中，只須要一個合適的編碼器(用於生成符合MP4容器規範的幀)既可。獲取編碼器大體分爲兩個步驟:視頻

構建編碼器上下文(AVFormatContext)
匹配合適的編碼器(AVCodec)

構建編碼器:

// 根據output_file後綴名推測合適的編碼器
    avformat_alloc_output_context2(&outAVFormatContext, NULL, NULL, output_file);
    if (!outAVFormatContext) {
        cout << "\nerror in allocating av format output context";
        exit(1);
    }

匹配編碼器:

output_format = av_guess_format(NULL, output_file, NULL);
    if (!output_format) {
        cout << "\nerror in guessing the video format. try with correct format";
        exit(1);
    }

    video_st = avformat_new_stream(outAVFormatContext, NULL);
    if (!video_st) {
        cout << "\nerror in creating a av format new stream";
        exit(1);
    }

編解碼

從輸入設備讀取的是原生的數據流，也就是通過設備編碼以後的數據。須要先將原生數據進行解碼，變成程序可讀的數據，在編碼成輸出設備可識別的數據。因此這一步的流程是：

解碼輸入設備數據
轉碼
編碼寫入輸出設備

經過av_read_frame從輸入設備讀取數據:

while (av_read_frame(pAVFormatContext, pAVPacket) >= 0) {
    ...
}

對讀取後的數據進行拆包，找到咱們所感興趣的數據

// 最開始沒有作這種判斷，出現不可預期的錯誤。 在官網example中找到這句判斷，但還不是很清楚其意義。應該和packet封裝格式有關
    pAVPacket->stream_index == VideoStreamIndx

從FFmpeg 4.1開始，有了新的編解碼函數。爲了長遠考慮，直接使用新API。使用avcodec_send_packet將輸入設備的數據發往解碼器進行解碼，而後使用avcodec_receive_frame從解碼器接受解碼以後的數據幀。代碼大概是下面的樣子:

value = avcodec_send_packet(pAVCodecContext, pAVPacket);
            if (value < 0) {
                fprintf(stderr, "Error sending a packet for decoding\n");
                exit(1);
            }

            while(1){
                value = avcodec_receive_frame(pAVCodecContext, pAVFrame);
                if (value == AVERROR(EAGAIN) || value == AVERROR_EOF) {
                    break;

                } else if (value < 0) {
                    fprintf(stderr, "Error during decoding\n");
                    exit(1);
                }

                .... do something
            }

讀取到數據幀後，就能夠對每一幀進行轉碼:

sws_scale(swsCtx_, pAVFrame->data, pAVFrame->linesize, 0, pAVCodecContext->height, outFrame->data,outFrame->linesize);

最後將轉碼後的幀封裝成輸出設備可設別的數據包格式。也就是解碼的逆動做，使用avcodec_send_frame將每幀發往編碼器進行編碼，經過avcodec_receive_packet一直接受編碼以後的數據包。處理邏輯大體是:

value = avcodec_send_frame(outAVCodecContext, outFrame);
                if (value < 0) {
                    fprintf(stderr, "Error sending a frame for encoding\n");
                    exit(1);
                }

                while (value >= 0) {
                    value = avcodec_receive_packet(outAVCodecContext, &outPacket);
                    if (value == AVERROR(EAGAIN) || value == AVERROR_EOF) {
                        break;
                    } else if (value < 0) {
                        fprintf(stderr, "Error during encoding\n");
                        exit(1);
                    }

                    ... do something;

                    av_packet_unref(&outPacket);
                }

之後就按照這種的處理邏輯，不停的從輸入設備讀取數據，而後通過解碼->轉碼->編碼，最後發送到輸出設備。這樣就完成了錄屏功能。
上面是大體處理思路，完整源代碼能夠參考 (https://github.com/andy-zhangtao/ffmpeg-examples/tree/master/ScreenRecord) .

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。