FFmpeg 通過 showwavespic 獲取音訊的頻譜圖

阿新 • • 發佈：2018-12-02

FFmpeg 的 showwavespic 濾鏡如何得到頻譜圖

音訊資料通常由波形影象表示。

FFmpeg 通過使用 showwavespic 可以得到音訊資料的頻譜圖

ffmpeg -i input -filter_complex "showwavespic=s=640x120" -frames:v 1 output.png

執行上面一條命令之後，即可得到一張如下的圖片：
在這裡插入圖片描述

那麼 FFmpeg 是如何將音訊資料轉換為波形圖的呢？

首先通過命令我們知道使用了名為showwavespic 的濾鏡，根據名字大概猜想此濾鏡就是生成頻譜圖的關鍵所在。

所以，我麼直接定位到 showwavespic

的定義處：

// showwavespic 濾鏡的輸入
static const AVFilterPad showwavespic_inputs[] = {
    {
        .name         = "default",
        .type         = AVMEDIA_TYPE_AUDIO,
        .config_props = showwavespic_config_input, 
        .filter_frame = showwavespic_filter_frame, 
    },
    { NULL }
};

// showwavespic 濾鏡的輸出 

static const AVFilterPad showwavespic_outputs[] = {
    {
        .name          = "default",
        .type          = AVMEDIA_TYPE_VIDEO,
        .config_props  = config_output,  // 配置下一個濾鏡的相關引數（例如輸出frame 的寬、高）
        .request_frame = request_frame,
    },
    { NULL }
};

AVFilter ff_avf_showwavespic = 
 {
    .name          = "showwavespic", // 輸入的音訊轉換為頻譜圖輸出
    .description   = NULL_IF_CONFIG_SMALL("Convert input audio to a video output single picture."),
    .init          = init,          // 初始化方法
    .uninit        = uninit,        
    .query_formats = query_formats, // 濾鏡支援的格式
    .priv_size     = sizeof(ShowWavesContext),
    .inputs        = showwavespic_inputs,
    .outputs       = showwavespic_outputs,
    .priv_class    = &showwavespic_class,
};

通過參考其他資源，理清楚濾鏡的工作流程。花費幾天的時間閱讀 FFmpeg 的原始碼，生成波形圖的原理 – 解碼音訊檔案得到音訊裸資料 —> 通過 showwavespic 濾鏡處理PCM資料得到波形圖

showwavespic 濾鏡是如何處理 PCM 資料得到波形圖的呢？

PCM 資料

首先我們要了解什麼是 PCM 音訊資料：

PCM(Pulse Code Modulation)稱為脈衝編碼調製，PCM 音訊資料是未經壓縮的音訊取樣資料裸流，它是由模擬訊號經過取樣、量化、編碼轉換成的標準的數字音訊資料。

儲存格式

如果是單聲道的音訊檔案，取樣資料按時間的先後順序依次存入（有時也會採用LRLRLR方式儲存，只是另一個聲道的資料為0），如果是雙聲道的話就按照LRLRLR的方式儲存。

在這裡插入圖片描述

單聲道

+------+------+------+------+------+------+------+------+------+
|  500 |  300 | -100 | -20  | -300 |  900 | -200 |  -50 |  250 |      
+------+------+------+------+------+------+------+------+------+

每個取樣的整數的大小最小為 -32768，最大為 32768。根據取樣資料的位置和值畫一個圖的話，就會得到像播放器上那樣的波浪形圖。
在這裡插入圖片描述

立體聲的取樣是每一個 frame 是一個 16bit 的取樣點。左右聲道的資料交叉存放。

那麼取樣資料的絕對值按照生成圖片的高的比例即可得出振幅。頻率通過生成圖片的寬計算得到。

音訊檔案解碼得到 PCM（音訊裸資料), 統計音訊的取樣總數
以取樣總數 / 輸出圖片的寬度為波形圖統計頻率
取樣資料的絕對值 * 生成圖片的高度 / 32768 計算得出振幅大小

濾鏡處理流程

流程詳情

init – showwavespic 濾鏡的初始化

static av_cold int init(AVFilterContext *ctx)
{
    // showwaves 濾鏡的私有資料
    ShowWavesContext *showwaves = ctx->priv;
    if (!strcmp(ctx->filter->name, "showwavespic")) {
        // 如果是 showwavespic 濾鏡
        showwaves->single_pic = 1;
        // 使用 cline 的繪圖 mode
        showwaves->mode = MODE_CENTERED_LINE;
    }

    return 0;
}

showwavespic_config_input – 配置 showwavespic 相關屬性

static int showwavespic_config_input(AVFilterLink *inlink)
{
    // showwavespic 濾鏡
    AVFilterContext *ctx = inlink->dst;
    // 濾鏡私有引數
    ShowWavesContext *showwaves = ctx->priv;
    if (showwaves->single_pic) {
        // 聲道取樣資料的和(初始化陣列記憶體空間)
        showwaves->sum = av_mallocz_array(inlink->channels, sizeof(*showwaves->sum));
        if (!showwaves->sum)
            return AVERROR(ENOMEM);
    }
    return 0;
}

config_output – 配置輸出影象的引數 & showwavespic 濾鏡引數

static int config_output(AVFilterLink *outlink)
{
    // 程式碼較長，省略
    ...
    // 取樣的x、y座標    
    showwaves->buf_idx = 0;
    if (!(showwaves->buf_idy = av_mallocz_array(nb_channels, sizeof(*showwaves->buf_idy)))) {
        av_log(ctx, AV_LOG_ERROR, "Could not allocate showwaves buffer\n");
        return AVERROR(ENOMEM);
    }
    // 輸出圖片的寬高、寬高比、幀率
    outlink->w = showwaves->w;
    outlink->h = showwaves->h;
    outlink->sample_aspect_ratio = (AVRational){1,1}; // 1

    outlink->frame_rate = av_div_q((AVRational){inlink->sample_rate,showwaves->n},
                                   (AVRational){showwaves->w,1});
    
    // 設定 draw_sample & get_h 函式
    ...
        
     // 預設使用的顏色為： red|green|...
    colors = av_strdup(showwaves->colors);
    if (!colors)
        return AVERROR(ENOMEM);

    /* multiplication factor, pre-computed to avoid in-loop divisions */
    x = 255 / ((showwaves->split_channels ? 1 : nb_channels) * showwaves->n); // 255/2
    if (outlink->format == AV_PIX_FMT_RGBA) {
        uint8_t fg[4] = { 0xff, 0xff, 0xff, 0xff };

        // 左聲道為紅色，右聲道為綠色
        for (ch = 0; ch < nb_channels; ch++) {
            char *color;

            color = av_strtok(ch == 0 ? colors : NULL, " |", &saveptr);
            if (color)
                av_parse_color(fg, color, -1, ctx);
            showwaves->fg[4*ch + 0] = fg[0] * x / 255.;
            showwaves->fg[4*ch + 1] = fg[1] * x / 255.;
            showwaves->fg[4*ch + 2] = fg[2] * x / 255.;
            showwaves->fg[4*ch + 3] = fg[3] * x / 255.;
        }
    } else {
        for (ch = 0; ch < nb_channels; ch++)
            showwaves->fg[4 * ch + 0] = x;
    }
    av_free(colors);
}

showwavespic_filter_frame – 配置 showwavespic 濾鏡的引數（初始化輸出frame、音訊幀等）

static int showwavespic_filter_frame(AVFilterLink *inlink, AVFrame *insamples)
{
    // showwavespic 濾鏡
    AVFilterContext *ctx = inlink->dst;
    // showwavespic 濾鏡與其下一個濾鏡之間的聯絡
    AVFilterLink *outlink = ctx->outputs[0];
    // showwavespic 濾鏡的私有資料
    ShowWavesContext *showwaves = ctx->priv;
    // 輸入資料
    int16_t *p = (int16_t *)insamples->data[0];
    int ret = 0;

    if (showwaves->single_pic) {
        struct frame_node *f;
        // 給 showwaves 濾鏡的輸出圖片 frame 分配一個空的buffer
        ret = alloc_out_frame(showwaves, p, inlink, outlink, insamples);
        if (ret < 0)
            goto end;

        /* queue the audio frame （audio frame 佇列）*/
        f = av_malloc(sizeof(*f));
        if (!f) {
            ret = AVERROR(ENOMEM);
            goto end;
        }
        f->frame = insamples;
        f->next = NULL;
        // showwavespic 濾鏡的音訊佇列
        if (!showwaves->last_frame) {
            showwaves->audio_frames =
            showwaves->last_frame   = f;
        } else {
            showwaves->last_frame->next = f;
            showwaves->last_frame = f;
        }
        // 總音訊取樣數
        showwaves->total_samples += insamples->nb_samples;

        return 0;
    }

end:
    av_frame_free(&insamples);
    return ret;
}

request_frame – 請求濾鏡處理後的 frame

static int request_frame(AVFilterLink *outlink)
{
    ShowWavesContext *showwaves = outlink->src->priv;
    AVFilterLink *inlink = outlink->src->inputs[0];
    int ret;

    ret = ff_request_frame(inlink);
    if (ret == AVERROR_EOF && showwaves->outpicref) {
        // 讀取完所有的 frame
        if (showwaves->single_pic)
            push_single_pic(outlink); // 生成頻譜圖
        else
            push_frame(outlink);
    }

    return ret;
}

push_single_pic – 根據取樣資料生成頻譜圖，並傳給下一個濾鏡

static int push_single_pic(AVFilterLink *outlink)
{
    // showwavespic 濾鏡
    AVFilterContext *ctx = outlink->src;
    // showwavespic 與上一個濾鏡之間的聯絡
    AVFilterLink *inlink = ctx->inputs[0];
    // showwavespic 濾鏡的私有資料
    ShowWavesContext *showwaves = ctx->priv;
    // max_samples -- 音訊總取樣數 / 輸出圖片的寬（頻率）
    int64_t n = 0, max_samples = showwaves->total_samples / outlink->w;
    // 輸出 frame
    AVFrame *out = showwaves->outpicref;
    struct frame_node *node;
    // 聲道數
    const int nb_channels = inlink->channels;
    const int ch_height = showwaves->split_channels ? outlink->h / nb_channels : outlink->h; // h
    const int linesize = out->linesize[0];
    const int pixstep = showwaves->pixstep; // 4
    int col = 0;
    int64_t *sum = showwaves->sum;

    if (max_samples == 0) {
        av_log(ctx, AV_LOG_ERROR, "Too few samples\n");
        return AVERROR(EINVAL);
    }

    av_log(ctx, AV_LOG_DEBUG, "Create frame averaging %"PRId64" samples per column\n", max_samples);

    memset(sum, 0, nb_channels);

    // 迴圈從濾鏡 audio 佇列中取出 frame
    for (node = showwaves->audio_frames; node; node = node->next) {
        int i;
        const AVFrame *frame = node->frame;
        // 當前 frame 的資料
        const int16_t *p = (const int16_t *)frame->data[0];

        // 當前 frame 的取樣數
        for (i = 0; i < frame->nb_samples; i++) {
            int ch;

            for (ch = 0; ch < nb_channels; ch++)
                sum[ch] += abs(p[ch + i*nb_channels]) << 1;
            if (n++ == max_samples) {
                for (ch = 0; ch < nb_channels; ch++) {
                    int16_t sample = sum[ch] / max_samples;
                    uint8_t *buf = out->data[0] + col * pixstep;
                    int h;

                    if (showwaves->split_channels)
                        buf += ch*ch_height*linesize;
                    av_assert0(col < outlink->w);
                    h = showwaves->get_h(sample, ch_height);
                    showwaves->draw_sample(buf, ch_height, linesize, &showwaves->buf_idy[ch], &showwaves->fg[ch * 4], h);
                    sum[ch] = 0;
                }
                col++;
                n = 0;
            }
        }
    }

    return push_frame(outlink);
}

FFmpeg 通過 showwavespic 獲取音訊的頻譜圖

FFmpeg 的 showwavespic 濾鏡如何得到頻譜圖音訊資料通常由波形影象表示。 FFmpeg 通過使用 showwavespic 可以得到音訊資料的頻譜圖 ffmpeg -i input -filter_complex "showwavespic=s=640x120

[前端]利用WebAudioAPI獲取音訊頻譜（html5音訊視覺化）

專案希望可以把音訊視覺化，有條隨聲音波動的曲線或者是像唱吧那種。開始是搜到了騰訊大腿（TGideas）寫的audio視覺化元件，想著直接用，後來各種原因還是打算自己重新寫一個……雖然明顯寫得low了很多。騰訊大腿的audio元件地址http://www.3fwork.com/b403/001620

FFmpeg通過PTS獲取當前幀所在的毫秒時間

AVStream *stream=pFormatCtx->streams[packet.stream_index]; avcodec_decode_video2(pCodecCtx

Android之通過ContentResolver獲取手機圖片和視訊的路徑和生成縮圖和縮圖路徑

1 問題獲取手機所有圖片和視訊的路徑和生成圖片和視訊的縮圖和縮圖路徑生成縮圖我們用的系統函式 public static Bitmap getThumbnail(ContentResolver cr, long origId, int kind, Opti

ffmpeg處理視訊獲取第一幀截圖

<?php //使用PHP SDK，並且使用自定義配置檔案 require app_path().'/include/BaiduBce.phar'; require app_path().'/include/SampleConf.php'; require app_path().'

android 獲取視訊縮圖終極解決方案(ffmpeg)

前些天有個師弟(在做一個仿LinkInEyes行車記錄儀的app)問我怎麼獲取視訊縮圖，起初以為很簡單，就找了個常用的解決方案(使用者獲取正常的視訊檔案的縮圖)：方案1: private void initView() { imgPic = (

ContentProvider之通過ContentResolver獲取影象、視訊、音訊舉例

MediaStore中定義了一系列的資料表格，通過ContentResolver提供的查詢介面，我們可以得到各種需要的媒體資訊。通過以下兩個URI可以掃描裝置外部和內部的媒體檔案。Android系統提供了MediaProvider，MediaStore，MediaScanne

mt7628/mt7620實現alsa架構通過ffmpeg解碼並播放音訊

//by Sven之前在評估用MT7628做一個音樂播放器，最初使用ffmpeg+sdl但過程曲折離奇，費了一番折騰最後發現mt7628的效能根本無法支撐ffmpeg的資源訴求，播放出來的聲音一卡一卡的，解碼速度跟不上。無奈使用了另一替代方案libmad+libao，此方案

淺談highcharts（echarts）通過ajax獲取後臺資料從而改變資料圖

好久沒寫csdn部落格了，隨著工作專案的展開自己也越來懶了。。不過今天有點空餘的時間，所以來寫寫部落格。恰巧這次的專案有圖表這一塊，所以就用到了highcharts和echarts。我們都知道如果寫純靜態的圖表圖很簡單，那麼如果寫動態的圖表圖該如何寫呢？好了，不多BB

C# NAudio錄音和播放音訊檔案-實時繪製音訊波形圖（從音訊流資料獲取，而非裝置獲取）

　　NAudio的錄音和播放錄音都有對應的類，我在使用Wav格式進行錄音和播放錄音時使用的類時WaveIn和WaveOut，這兩個類是對功能的回撥和一些事件觸發。　　在WaveIn和WaveOut之外還有對音訊流讀寫使用的WaveFileWriter和WaveFileReader類，具體細節可檢視其原始碼進

SQL Server2008中通過SQL獲取表結構

nds 數據 join xtend isn val data 運行 order SQL Server2008中通過SQL獲取表結構新增數據用戶，角色為public。映射到待獲取表結構的數據庫上，授與用戶在該數據庫上的身份為db_owner 運行例如以下SQL語

通過js獲取class類名的函數封裝

clas ret -1 .class class urn getclass ++ 不同通過className獲取元素，不同的瀏覽器會有不同的支持情況，所以為了兼容各個瀏覽器在這裏，我寫了幾個函數獲取className的值 function byclass(classn){

百度接口通過ip獲取用戶所在地

tools nec return mage rate edr ram try arr /** * 百度接口 * 通過用戶ip獲取用戶所在地 * @param userIp * @return */ public static S

apiCloud通過ajax獲取數據

tel detect 獲取數據 url res eba ref text data <!doctype html> <html> <head> <meta charset="utf-8"> <meta

jQuery通過地址獲取經緯度demo

text console 服務器展示 content 百度地圖 index min 類型在開始之前，首先需要登錄百度地圖API控制臺申請密鑰ak。 1、登錄百度地圖開放平臺http://lbsyun.baidu.com 註冊賬號，完善信息，點擊網站右上角的“API控制臺

AOP通過反射獲取自定義註解

ram tar .get tty sig runt type log eth 自定義註解： @Target({ElementType.METHOD}) @Retention(RetentionPolicy.RUNTIME) @Documented @Component

C# 通過SendMessage獲取瀏覽器地址欄的地址

ntp bar pac login classname window edit and ces 1：通過SPY++獲得地址欄的層次結構，然後一層一層獲得 2：代碼 using System; using System.Collections.Generic; using

通過request獲取網頁資訊通過BeautifulSoup剖析網頁元素

獲取網頁 alink his odi res req 特定 bsp css屬性 import requests newsUrl =‘http://news.sina.com.cn/china/‘ res = requests.get(newsUrl) res.encod

前端通過jqplot繪制折線圖

一個 mark 分類 options poi [] 密碼 nec 需要首先需要下載jqplot需要的js與css文件，我已近打包好了，需要的可以下載接下來導入其中關鍵的js與css如下， <link href="css/jquery.jqplot.min.css

Safari通過JavaScript獲取系統語言

avi nav Language fire def browser ble clas 獲取 IE6 IE7 IE8Firefox Chrome SafariOpera navigator.language undefined zh-CN zh-CN navigato

FFmpeg 通過 showwavespic 獲取音訊的頻譜圖

FFmpeg 的 showwavespic 濾鏡如何得到頻譜圖

PCM 資料

儲存格式

單聲道

相關推薦