最簡單的基於FFMPEG的視訊編碼器（YUV編碼為H.264）

阿新 • • 發佈：2018-12-19

=====================================================

最簡單的基於FFmpeg的視訊編碼器文章列表：

=====================================================

本文介紹一個最簡單的基於FFMPEG的視訊編碼器。該編碼器實現了YUV420P的畫素資料編碼為H.264的壓縮編碼資料。編碼器程式碼十分簡單，但是每一行程式碼都很重要，適合好好研究一下。弄清楚了本程式碼也就基本弄清楚了FFMPEG的編碼流程。目前我雖然已經調通了程式，但是還是有些地方沒有完全搞明白，需要下一步繼續探究然後補充內容。

本程式使用最新版的類庫（編譯時間為2014.5.6），開發平臺為VC2010。所有的配置都已經做好，只需要執行就可以了。

流程

下面附一張使用FFmpeg編碼視訊的流程圖。使用該流程，不僅可以編碼H.264的視訊，而且可以編碼MPEG4/MPEG2/VP8等等各種FFmpeg支援的視訊。圖中藍色背景的函式是實際輸出資料的函式。淺綠色的函式是視訊編碼的函式。

簡單介紹一下流程中各個函式的意義：

av_register_all()：註冊FFmpeg所有編解碼器。
avformat_alloc_output_context2()：初始化輸出碼流的AVFormatContext。
avio_open()：開啟輸出檔案。
av_new_stream()：建立輸出碼流的AVStream。
avcodec_find_encoder()：查詢編碼器。

avcodec_open2()：開啟編碼器。
avformat_write_header()：寫檔案頭（對於某些沒有檔案頭的封裝格式，不需要此函式。比如說MPEG2TS）。
avcodec_encode_video2()：編碼一幀視訊。即將AVFrame（儲存YUV畫素資料）編碼為AVPacket（儲存H.264等格式的碼流資料）。
av_write_frame()：將編碼後的視訊碼流寫入檔案。
flush_encoder()：輸入的畫素資料讀取完成後呼叫此函式。用於輸出編碼器中剩餘的AVPacket。
av_write_trailer()：寫檔案尾（對於某些沒有檔案頭的封裝格式，不需要此函式。比如說MPEG2TS）。

程式碼

/** * 最簡單的基於FFmpeg的視訊編碼器 * Simplest FFmpeg Video Encoder *  * 雷霄驊 Lei Xiaohua * [email protected] * 中國傳媒大學/數字電視技術 * Communication University of China / Digital TV Technology * http://blog.csdn.net/leixiaohua1020 *  * 本程式實現了YUV畫素資料編碼為視訊碼流（H264，MPEG2，VP8等等）。 * 是最簡單的FFmpeg視訊編碼方面的教程。 * 通過學習本例子可以瞭解FFmpeg的編碼流程。 * This software encode YUV420P data to H.264 bitstream. * It's the simplest video encoding software based on FFmpeg.  * Suitable for beginner of FFmpeg  */#include <stdio.h>#define __STDC_CONSTANT_MACROS#ifdef _WIN32//Windowsextern "C"{#include "libavutil/opt.h"#include "libavcodec/avcodec.h"#include "libavformat/avformat.h"};#else//Linux...#ifdef __cplusplusextern "C"{#endif#include <libavutil/opt.h>#include <libavcodec/avcodec.h>#include <libavformat/avformat.h>#ifdef __cplusplus};#endif#endifint flush_encoder(AVFormatContext *fmt_ctx,unsigned int stream_index){ int ret; int got_frame; AVPacket enc_pkt; if (!(fmt_ctx->streams[stream_index]->codec->codec->capabilities &  CODEC_CAP_DELAY))  return 0; while (1) {  enc_pkt.data = NULL;  enc_pkt.size = 0;  av_init_packet(&enc_pkt);  ret = avcodec_encode_video2 (fmt_ctx->streams[stream_index]->codec, &enc_pkt,   NULL, &got_frame);  av_frame_free(NULL);  if (ret < 0)   break;  if (!got_frame){   ret=0;   break;  }  printf("Flush Encoder: Succeed to encode 1 frame!\tsize:%5d\n",enc_pkt.size);  /* mux encoded frame */  ret = av_write_frame(fmt_ctx, &enc_pkt);  if (ret < 0)   break; } return ret;}int main(int argc, char* argv[]){ AVFormatContext* pFormatCtx; AVOutputFormat* fmt; AVStream* video_st; AVCodecContext* pCodecCtx; AVCodec* pCodec; AVPacket pkt; uint8_t* picture_buf; AVFrame* pFrame; int picture_size; int y_size; int framecnt=0; //FILE *in_file = fopen("src01_480x272.yuv", "rb"); //Input raw YUV data  FILE *in_file = fopen("../ds_480x272.yuv", "rb");   //Input raw YUV data int in_w=480,in_h=272;                              //Input data's width and height int framenum=100;                                   //Frames to encode //const char* out_file = "src01.h264";              //Output Filepath  //const char* out_file = "src01.ts"; //const char* out_file = "src01.hevc"; const char* out_file = "ds.h264"; av_register_all(); //Method1. pFormatCtx = avformat_alloc_context(); //Guess Format fmt = av_guess_format(NULL, out_file, NULL); pFormatCtx->oformat = fmt;  //Method 2. //avformat_alloc_output_context2(&pFormatCtx, NULL, NULL, out_file); //fmt = pFormatCtx->oformat; //Open output URL if (avio_open(&pFormatCtx->pb,out_file, AVIO_FLAG_READ_WRITE) < 0){  printf("Failed to open output file! \n");  return -1; } video_st = avformat_new_stream(pFormatCtx, 0); //video_st->time_base.num = 1;  //video_st->time_base.den = 25;   if (video_st==NULL){  return -1; } //Param that must set pCodecCtx = video_st->codec; //pCodecCtx->codec_id =AV_CODEC_ID_HEVC; pCodecCtx->codec_id = fmt->video_codec; pCodecCtx->codec_type = AVMEDIA_TYPE_VIDEO; pCodecCtx->pix_fmt = AV_PIX_FMT_YUV420P; pCodecCtx->width = in_w;   pCodecCtx->height = in_h; pCodecCtx->bit_rate = 400000;   pCodecCtx->gop_size=250; pCodecCtx->time_base.num = 1;   pCodecCtx->time_base.den = 25;   //H264 //pCodecCtx->me_range = 16; //pCodecCtx->max_qdiff = 4; //pCodecCtx->qcompress = 0.6; pCodecCtx->qmin = 10; pCodecCtx->qmax = 51; //Optional Param pCodecCtx->max_b_frames=3; // Set Option AVDictionary *param = 0; //H.264 if(pCodecCtx->codec_id == AV_CODEC_ID_H264) {  av_dict_set(¶m, "preset", "slow", 0);  av_dict_set(¶m, "tune", "zerolatency", 0);  //av_dict_set(¶m, "profile", "main", 0); } //H.265 if(pCodecCtx->codec_id == AV_CODEC_ID_H265){  av_dict_set(¶m, "preset", "ultrafast", 0);  av_dict_set(¶m, "tune", "zero-latency", 0); } //Show some Information av_dump_format(pFormatCtx, 0, out_file, 1); pCodec = avcodec_find_encoder(pCodecCtx->codec_id); if (!pCodec){  printf("Can not find encoder! \n");  return -1; } if (avcodec_open2(pCodecCtx, pCodec,¶m) < 0){  printf("Failed to open encoder! \n");  return -1; } pFrame = av_frame_alloc(); picture_size = avpicture_get_size(pCodecCtx->pix_fmt, pCodecCtx->width, pCodecCtx->height); picture_buf = (uint8_t *)av_malloc(picture_size); avpicture_fill((AVPicture *)pFrame, picture_buf, pCodecCtx->pix_fmt, pCodecCtx->width, pCodecCtx->height); //Write File Header avformat_write_header(pFormatCtx,NULL); av_new_packet(&pkt,picture_size); y_size = pCodecCtx->width * pCodecCtx->height; for (int i=0; i<framenum; i++){  //Read raw YUV data  if (fread(picture_buf, 1, y_size*3/2, in_file) <= 0){   printf("Failed to read raw data! \n");   return -1;  }else if(feof(in_file)){   break;  }  pFrame->data[0] = picture_buf;              // Y  pFrame->data[1] = picture_buf+ y_size;      // U   pFrame->data[2] = picture_buf+ y_size*5/4;  // V  //PTS  //pFrame->pts=i;  pFrame->pts=i*(video_st->time_base.den)/((video_st->time_base.num)*25);  int got_picture=0;  //Encode  int ret = avcodec_encode_video2(pCodecCtx, &pkt,pFrame, &got_picture);  if(ret < 0){   printf("Failed to encode! \n");   return -1;  }  if (got_picture==1){   printf("Succeed to encode frame: %5d\tsize:%5d\n",framecnt,pkt.size);   framecnt++;   pkt.stream_index = video_st->index;   ret = av_write_frame(pFormatCtx, &pkt);   av_free_packet(&pkt);  } } //Flush Encoder int ret = flush_encoder(pFormatCtx,0); if (ret < 0) {  printf("Flushing encoder failed\n");  return -1; } //Write file trailer av_write_trailer(pFormatCtx); //Clean if (video_st){  avcodec_close(video_st->codec);  av_free(pFrame);  av_free(picture_buf); } avio_close(pFormatCtx->pb); avformat_free_context(pFormatCtx); fclose(in_file); return 0;}

結果

軟體執行截圖（受限於檔案體積，原始YUV幀數很少）：

編碼前的YUV序列：

編碼後的H.264碼流：

下載

Simplest FFmpeg Video Encoder

專案主頁

下載地址：

【修正】之前發現編碼後的H.264碼流與YUV輸入的幀數不同。經過觀察對比其他程式後發現需要呼叫flush_encoder()將編碼器中剩餘的視訊幀輸出。已經將該問題修正。

CSDN下載地址（修正後）：

PUDN下載地址（修正後）：

SourceForge上已經更新。

更新-1.1 (2015.1.03)=========================================

CSDN下載地址：http://download.csdn.net/detail/leixiaohua1020/8322003

更新-1.2 (2015.2.13)=========================================

這次考慮到了跨平臺的要求，調整了原始碼。經過這次調整之後，原始碼可以在以下平臺編譯通過：

VC++：開啟sln檔案即可編譯，無需配置。

cl.exe：開啟compile_cl.bat即可命令列下使用cl.exe進行編譯，注意可能需要按照VC的安裝路徑調整腳本里面的引數。編譯命令如下。

::VS2010 Environmentcall "D:\Program Files\Microsoft Visual Studio 10.0\VC\vcvarsall.bat"::[email protected] INCLUDE=include;%INCLUDE%::[email protected] LIB=lib;%LIB%::compile and linkcl simplest_ffmpeg_video_encoder.cpp /link avcodec.lib avformat.lib avutil.lib ^avdevice.lib avfilter.lib postproc.lib swresample.lib swscale.lib /OPT:NOREF

MinGW：MinGW命令列下執行compile_mingw.sh即可使用MinGW的g++進行編譯。編譯命令如下。

g++ simplest_ffmpeg_video_encoder.cpp -g -o simplest_ffmpeg_video_encoder.exe \-I /usr/local/include -L /usr/local/lib \-lavformat -lavcodec -lavutil

GCC：Linux或者MacOS命令列下執行compile_gcc.sh即可使用GCC進行編譯。編譯命令如下。

gcc simplest_ffmpeg_video_encoder.cpp -g -o simplest_ffmpeg_video_encoder.out \-I /usr/local/include -L /usr/local/lib -lavformat -lavcodec -lavutil

SourceForge上已經更新。

最簡單的基於FFMPEG的視訊編碼器（YUV編碼為H.264）

流程

程式碼

結果

下載

最簡單的基於FFMPEG的視訊編碼器（YUV編碼為H.264）

最簡單的基於FFMPEG的影象編碼器（YUV編碼為JPEG）

最簡單的基於FFMPEG的音訊編碼器（PCM編碼為AAC

最簡單的VideoView視訊播放器

android 基於FFmpeg視訊解碼器

【Learning Notes】變分自編碼器（Variational Auto-Encoder，VAE）

超方便、最簡單版本：java 郵件傳送（半分鐘寫完程式碼）

V4L2採集+編碼壓縮（M-JPEG和H.264壓縮效能比較）

100行程式碼實現最簡單的基於FFMPEG+SDL的視訊播放器（SDL1.x）

最簡單的基於FFmpeg的解碼器-純淨版（不包含libavformat）

最簡單的基於libVLC的例子：最簡單的基於libVLC的視訊播放器（圖形介面版）

最簡單的基於Flash的流媒體示例：網頁播放器（HTTP，RTMP，HLS）

最簡單的基於FFmpeg的AVfilter例子（水印疊加

最簡單的基於FFmpeg的AVDevice例子（讀取攝像頭）解讀

1小時學會：最簡單的iOS直播推流（九）flv 編碼與音視訊時間戳同步

基於qml創建最簡單的圖像處理程序（1）-基於qml創建界面

基於qml創建最簡單的圖像處理程序（2）-使用c++&qml進行圖像處理

基於qml創建最簡單的圖像處理程序（3）-使用opencv&qml進行圖像處理

基於C語言的編碼器（光耦）程式設計之C程式碼解析（二）

MFC 基於VLC的視訊播放器（三）---介面設計以及對話方塊接受檔案的拖拽

最簡單的基於FFMPEG的視訊編碼器（YUV編碼為H.264）

流程

程式碼

結果

下載

相關推薦