最簡單的視頻編碼器:基於libx264(編碼YUV爲H.264)

=====================================================git

最簡單的視頻編碼器系列文章列表:github

最簡單的視頻編碼器:編譯

最簡單的視頻編碼器:基於libx264(編碼YUV爲H.264)
app

最簡單的視頻編碼器:基於libx265(編碼YUV爲H.265)
函數

最簡單的視頻編碼器:libvpx(編碼YUV爲VP8)
oop

=====================================================測試

本文記錄一個最簡單的基於libx264的H.264視頻編碼器。此前記錄的H.264編碼器都是基於FFmpeg調用libx264完成編碼的,例如:編碼

 《最簡單的基於FFMPEG的視頻編碼器(YUV編碼爲H.264)》
相比與上文中的編碼器,本文記錄的編碼器屬於「輕量級」的編碼器。由於它再也不包含FFmpeg的代碼,直接調用libx264完成編碼。所以項目的體積很是小巧。該編碼器能夠將輸入的YUV數據編碼爲H.264碼流文件。
 

流程圖

調用libx264進行視頻編碼的流程圖以下所示。spa


流程圖中主要的函數以下所示。
x264_param_default():設置參數集結構體x264_param_t的缺省值。
x264_picture_alloc():爲圖像結構體x264_picture_t分配內存。
x264_encoder_open():打開編碼器。
x264_encoder_encode():編碼一幀圖像。
x264_encoder_close():關閉編碼器。
x264_picture_clean():釋放x264_picture_alloc()申請的資源。
 
存儲數據的結構體以下所示。
x264_picture_t:存儲壓縮編碼前的像素數據。
x264_nal_t:存儲壓縮編碼後的碼流數據。.net

 
此外流程圖中還包括一個「flush_encoder」模塊,該模塊使用的函數和編碼模塊是同樣的。惟一的不一樣在於再也不輸入視頻像素數據。它的做用在於輸出編碼器中剩餘的碼流數據。
 

源代碼

/**
 * 最簡單的基於X264的視頻編碼器
 * Simplest X264 Encoder
 *
 * 雷霄驊 Lei Xiaohua
 * leixiaohua1020@126.com
 * 中國傳媒大學/數字電視技術
 * Communication University of China / Digital TV Technology
 * http://blog.csdn.net/leixiaohua1020
 *
 * 本程序能夠YUV格式的像素數據編碼爲H.264碼流,是最簡單的
 * 基於libx264的視頻編碼器
 *
 * This software encode YUV data to H.264 bitstream.
 * It's the simplest encoder example based on libx264.
 */
#include <stdio.h>
#include <stdlib.h>
 
#include "stdint.h"
 
#if defined ( __cplusplus)
extern "C"
{
#include "x264.h"
};
#else
#include "x264.h"
#endif
 
 
int main(int argc, char** argv)
{
 
         int ret;
         int y_size;
         int i,j;
 
         //FILE* fp_src  = fopen("../cuc_ieschool_640x360_yuv444p.yuv", "rb");
         FILE* fp_src  = fopen("../cuc_ieschool_640x360_yuv420p.yuv", "rb");
 
         FILE* fp_dst = fopen("cuc_ieschool.h264", "wb");
        
         //Encode 50 frame
         //if set 0, encode all frame
         int frame_num=50;
         int csp=X264_CSP_I420;
         int width=640,height=360;
 
         int iNal   = 0;
         x264_nal_t* pNals = NULL;
         x264_t* pHandle   = NULL;
         x264_picture_t* pPic_in = (x264_picture_t*)malloc(sizeof(x264_picture_t));
         x264_picture_t* pPic_out = (x264_picture_t*)malloc(sizeof(x264_picture_t));
         x264_param_t* pParam = (x264_param_t*)malloc(sizeof(x264_param_t));
        
         //Check
         if(fp_src==NULL||fp_dst==NULL){
                   printf("Error open files.\n");
                   return -1;
         }
 
         x264_param_default(pParam);
         pParam->i_width   = width;
         pParam->i_height  = height;
         /*
         //Param
         pParam->i_log_level  = X264_LOG_DEBUG;
         pParam->i_threads  = X264_SYNC_LOOKAHEAD_AUTO;
         pParam->i_frame_total = 0;
         pParam->i_keyint_max = 10;
         pParam->i_bframe  = 5;
         pParam->b_open_gop  = 0;
         pParam->i_bframe_pyramid = 0;
         pParam->rc.i_qp_constant=0;
         pParam->rc.i_qp_max=0;
         pParam->rc.i_qp_min=0;
         pParam->i_bframe_adaptive = X264_B_ADAPT_TRELLIS;
         pParam->i_fps_den  = 1;
         pParam->i_fps_num  = 25;
         pParam->i_timebase_den = pParam->i_fps_num;
         pParam->i_timebase_num = pParam->i_fps_den;
         */
         pParam->i_csp=csp;
         x264_param_apply_profile(pParam, x264_profile_names[5]);
        
         pHandle = x264_encoder_open(pParam);
   
         x264_picture_init(pPic_out);
         x264_picture_alloc(pPic_in, csp, pParam->i_width, pParam->i_height);
 
         //ret = x264_encoder_headers(pHandle, &pNals, &iNal);
 
         y_size = pParam->i_width * pParam->i_height;
         //detect frame number
         if(frame_num==0){
                   fseek(fp_src,0,SEEK_END);
                   switch(csp){
                   case X264_CSP_I444:frame_num=ftell(fp_src)/(y_size*3);break;
                   case X264_CSP_I420:frame_num=ftell(fp_src)/(y_size*3/2);break;
                   default:printf("Colorspace Not Support.\n");return -1;
                   }
                   fseek(fp_src,0,SEEK_SET);
         }
        
         //Loop to Encode
         for( i=0;i<frame_num;i++){
                   switch(csp){
                   case X264_CSP_I444:{
                            fread(pPic_in->img.plane[0],y_size,1,fp_src);         //Y
                            fread(pPic_in->img.plane[1],y_size,1,fp_src);         //U
                            fread(pPic_in->img.plane[2],y_size,1,fp_src);         //V
                            break;}
                   case X264_CSP_I420:{
                            fread(pPic_in->img.plane[0],y_size,1,fp_src);         //Y
                            fread(pPic_in->img.plane[1],y_size/4,1,fp_src);     //U
                            fread(pPic_in->img.plane[2],y_size/4,1,fp_src);     //V
                            break;}
                   default:{
                            printf("Colorspace Not Support.\n");
                            return -1;}
                   }
                   pPic_in->i_pts = i;
 
                   ret = x264_encoder_encode(pHandle, &pNals, &iNal, pPic_in, pPic_out);
                   if (ret< 0){
                            printf("Error.\n");
                            return -1;
                   }
 
                   printf("Succeed encode frame: %5d\n",i);
 
                   for ( j = 0; j < iNal; ++j){
                             fwrite(pNals[j].p_payload, 1, pNals[j].i_payload, fp_dst);
                   }
         }
         i=0;
         //flush encoder
         while(1){
                   ret = x264_encoder_encode(pHandle, &pNals, &iNal, NULL, pPic_out);
                   if(ret==0){
                            break;
                   }
                   printf("Flush 1 frame.\n");
                   for (j = 0; j < iNal; ++j){
                            fwrite(pNals[j].p_payload, 1, pNals[j].i_payload, fp_dst);
                   }
                   i++;
         }
         x264_picture_clean(pPic_in);
         x264_encoder_close(pHandle);
         pHandle = NULL;
 
         free(pPic_in);
         free(pPic_out);
         free(pParam);
 
         fclose(fp_src);
         fclose(fp_dst);
 
         return 0;
}

運行結果

程序的輸入爲一個YUV文件(已經測試過YUV444P和YUV420P兩種格式)。code


輸出爲H.264碼流文件。


H.264碼流文件的信息以下所示。


下載


Simplest Encoder

 

項目主頁

SourceForge:https://sourceforge.net/projects/simplestencoder/

Github:https://github.com/leixiaohua1020/simplest_encoder

開源中國:http://git.oschina.net/leixiaohua1020/simplest_encoder


CDSN下載地址:http://download.csdn.net/detail/leixiaohua1020/8284105

 
該解決方案包含了幾個常見的編碼器的使用示例:
simplest_vpx_encoder:最簡單的基於libvpx的視頻編碼器
simplest_x264_encoder:最簡單的基於libx264的視頻編碼器 simplest_x265_encoder:最簡單的基於libx265的視頻編碼器