H.264 Video Encoding
(This article implements H.264 hardware encoding from Swift 3 by calling down into the underlying C APIs, so it assumes a working knowledge of Swift 3, Objective-C, and C. A walkthrough of how the code executes is attached at the end.)
First, create a class that holds the H.264 configuration (the parameters are passed as properties of an object of this class):
//
// TGVTSessionSetProperty.h
// videocapture
//
// Created by targetcloud on 2017/3/31.
// Copyright © 2017年 targetcloud. All rights reserved.
//
#import <UIKit/UIKit.h>
@interface TGVTSessionSetProperty : NSObject
@property(nonatomic,assign) int width;
@property(nonatomic,assign) int height;
@property(nonatomic,assign) int expectedFrameRate;
@property(nonatomic,assign) int averageBitRate;
@property(nonatomic,assign) int maxKeyFrameInterval;
@end
//
// TGVTSessionSetProperty.m
// videocapture
//
// Created by targetcloud on 2017/3/31.
// Copyright © 2017年 targetcloud. All rights reserved.
//
#import "TGVTSessionSetProperty.h"
@implementation TGVTSessionSetProperty
@end
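TGVTSessionSetProperty is nothing more than a value bag for the encoder settings. As a minimal Objective-C sketch (the values simply mirror the configuration the Swift capture class performs further below), it is filled in and handed to the TGH264Encoder introduced next:
TGVTSessionSetProperty *p = [[TGVTSessionSetProperty alloc] init];
p.width = 720; // encoded frame width, in pixels
p.height = 1280; // encoded frame height, in pixels
p.expectedFrameRate = 30; // target frames per second
p.averageBitRate = 1280 * 720; // average bit rate, in bits per second
p.maxKeyFrameInterval = 30; // GOP size: one keyframe every 30 frames
TGH264Encoder *encoder = [[TGH264Encoder alloc] initWithProperty:p];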
Mode 1: creating a new encoder for each capture (per-capture mode)
//
// TGH264Encoder.h
// videocapture
//
// Created by targetcloud on 2017/3/30.
// Copyright © 2017年 targetcloud. All rights reserved.
//
#import <UIKit/UIKit.h>
#import <VideoToolbox/VideoToolbox.h>
@class TGVTSessionSetProperty;
@interface TGH264Encoder : NSObject
- (instancetype)initWithProperty : (TGVTSessionSetProperty *) properties;
- (void)encodeSampleBuffer:(CMSampleBufferRef)sampleBuffer;
- (void)endEncode;
@end
//
// TGH264Encoder.m
// videocapture
//
// Created by targetcloud on 2017/3/30.
// Copyright © 2017年 targetcloud. All rights reserved.
//
#import "TGH264Encoder.h"
#import "TGVTSessionSetProperty.h"
@interface TGH264Encoder()
@property (nonatomic, assign) NSInteger frameID;
@property (nonatomic, assign) VTCompressionSessionRef compressionSession;
@property (nonatomic, strong) NSFileHandle *fileHandle;
@property(nonatomic, strong) TGVTSessionSetProperty * properties ;
@end
@implementation TGH264Encoder
- (instancetype)initWithProperty : (TGVTSessionSetProperty *) properties {
if (self = [super init]) {
self.properties = properties;
[self setupFileHandle];
[self setupVideoSession];
}
return self;
}
- (void)setupFileHandle {
NSString *file = [[NSSearchPathForDirectoriesInDomains(NSDocumentDirectory, NSUserDomainMask, YES) lastObject] stringByAppendingPathComponent:@"videoAudioCapture.h264"];
[[NSFileManager defaultManager] removeItemAtPath:file error:nil];
[[NSFileManager defaultManager] createFileAtPath:file contents:nil attributes:nil];
self.fileHandle = [NSFileHandle fileHandleForWritingAtPath:file];
}
- (void)setupVideoSession {
self.frameID = 0;
int width = self.properties.width;
int height = self.properties.height;
// Create the CompressionSession that encodes the frames. kCMVideoCodecType_H264 selects H.264 encoding; h264VTCompressionOutputCallback is called each time a frame finishes encoding, and is where the encoded data gets written to file.
VTCompressionSessionCreate(NULL, width, height, kCMVideoCodecType_H264, NULL, NULL, NULL, h264VTCompressionOutputCallback, (__bridge void *)(self), &_compressionSession);
// Real-time encoding output (needed for live streaming; otherwise output lags)
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_RealTime, (__bridge CFTypeRef _Nonnull)(@YES));//kCFBooleanTrue
// Expected frame rate (frames per second; too low a rate makes playback stutter)
int fps = self.properties.expectedFrameRate;
CFNumberRef fpsRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &fps);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_ExpectedFrameRate, fpsRef);
// Average bit rate (the higher the bit rate, the clearer the picture)
int bitRate = self.properties.averageBitRate;
CFNumberRef bitRateRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberSInt32Type, &bitRate);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_AverageBitRate, bitRateRef);// bits per second
NSArray *limit = @[@(bitRate * 1.5/8), @(1)];
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_DataRateLimits, (__bridge CFArrayRef)limit);// bytes per second
// Keyframe interval (GOP size)
int frameInterval = self.properties.maxKeyFrameInterval;
CFNumberRef frameIntervalRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &frameInterval);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_MaxKeyFrameInterval, frameIntervalRef);
// Configuration done; prepare to encode
VTCompressionSessionPrepareToEncodeFrames(self.compressionSession);
}
// Callback invoked when a frame has finished encoding
void h264VTCompressionOutputCallback(void *outputCallbackRefCon, void *sourceFrameRefCon, OSStatus status, VTEncodeInfoFlags infoFlags, CMSampleBufferRef sampleBuffer) {
if (status != noErr) {
return;
}
TGH264Encoder* encoder = (__bridge TGH264Encoder*)outputCallbackRefCon;
// Determine whether this is a keyframe
//bool isKeyframe = !CFDictionaryContainsKey( (CFArrayGetValueAtIndex(CMSampleBufferGetSampleAttachmentsArray(sampleBuffer, true), 0)), kCMSampleAttachmentKey_NotSync);
CFArrayRef attachments = CMSampleBufferGetSampleAttachmentsArray(sampleBuffer, true);
CFDictionaryRef dict = CFArrayGetValueAtIndex(attachments,0);
BOOL isKeyframe = !CFDictionaryContainsKey(dict,kCMSampleAttachmentKey_NotSync);
if (isKeyframe){// for a keyframe, also extract the SPS & PPS
// Get the format description of the encoded output
CMFormatDescriptionRef format = CMSampleBufferGetFormatDescription(sampleBuffer);
// Get the SPS
size_t sparameterSetSize, sparameterSetCount;
const uint8_t *sparameterSet;
CMVideoFormatDescriptionGetH264ParameterSetAtIndex(format, 0, &sparameterSet, &sparameterSetSize, &sparameterSetCount, NULL );
// Get the PPS
size_t pparameterSetSize, pparameterSetCount;
const uint8_t *pparameterSet;
CMVideoFormatDescriptionGetH264ParameterSetAtIndex(format, 1, &pparameterSet, &pparameterSetSize, &pparameterSetCount, NULL );
// Wrap the SPS/PPS in NSData so they can be written to file
NSData *sps = [NSData dataWithBytes:sparameterSet length:sparameterSetSize];
NSData *pps = [NSData dataWithBytes:pparameterSet length:pparameterSetSize];
// Write them to file
[encoder gotSpsPps:sps pps:pps];
}
// Get the block buffer holding the encoded data
CMBlockBufferRef dataBuffer = CMSampleBufferGetDataBuffer(sampleBuffer);
size_t length, totalLength;
char *dataPointer;
OSStatus statusCodeRet = CMBlockBufferGetDataPointer(dataBuffer, 0, &length, &totalLength, &dataPointer);
if (statusCodeRet == noErr) {
size_t bufferOffset = 0;
static const int h264AVCCHeaderLength = 4;
// Walk the NAL units one by one
while (bufferOffset < totalLength - h264AVCCHeaderLength) {// one frame may be written out as several NAL units (slices)
uint32_t NALUnitLength = 0;
memcpy(&NALUnitLength, dataPointer + bufferOffset, h264AVCCHeaderLength);//NALU length
NALUnitLength = CFSwapInt32BigToHost(NALUnitLength);// the length prefix in the encoded data is big-endian; convert to host byte order
NSData* data = [[NSData alloc] initWithBytes:(dataPointer + bufferOffset + h264AVCCHeaderLength) length:NALUnitLength];
[encoder gotEncodedData:data isKeyFrame:isKeyframe];
bufferOffset += h264AVCCHeaderLength + NALUnitLength;
}
}
}
- (void)gotSpsPps:(NSData*)sps pps:(NSData*)pps{
// NALU header (Annex B start code)
const char bytes[] = "\x00\x00\x00\x01";// the string literal carries a hidden trailing '\0', hence the -1 below
size_t length = (sizeof bytes) - 1;
NSData *ByteHeader = [NSData dataWithBytes:bytes length:length];
[self.fileHandle writeData:ByteHeader];
[self.fileHandle writeData:sps];
[self.fileHandle writeData:ByteHeader];
[self.fileHandle writeData:pps];
}
- (void)gotEncodedData:(NSData*)data isKeyFrame:(BOOL)isKeyFrame{
NSLog(@" --- gotEncodedData %d --- ", (int)[data length]);
if (self.fileHandle != NULL){
const char bytes[] = "\x00\x00\x00\x01";
size_t length = (sizeof bytes) - 1; //string literals have implicit trailing '\0'
NSData *ByteHeader = [NSData dataWithBytes:bytes length:length];
[self.fileHandle writeData:ByteHeader];
[self.fileHandle writeData:data];
}
}
// Entry point -> h264VTCompressionOutputCallback
- (void)encodeSampleBuffer:(CMSampleBufferRef)sampleBuffer {
CVImageBufferRef imageBuffer = (CVImageBufferRef)CMSampleBufferGetImageBuffer(sampleBuffer);// extract the image buffer from the sample buffer
CMTime presentationTimeStamp = CMTimeMake(self.frameID++, self.properties.expectedFrameRate);// PTS/DTS: build a CMTime from the current frame count
VTEncodeInfoFlags flag;
// Encode this frame
OSStatus statusCode = VTCompressionSessionEncodeFrame(self.compressionSession,
imageBuffer,
presentationTimeStamp,
kCMTimeInvalid,
NULL,
(__bridge void * _Nullable)(self),//h264VTCompressionOutputCallback sourceFrameRefCon
&flag);//h264VTCompressionOutputCallback infoFlags
if (statusCode == noErr) {
NSLog(@" --- H264: VTCompressionSessionEncodeFrame Success --- ");
}
}
- (void)endEncode {
VTCompressionSessionCompleteFrames(self.compressionSession, kCMTimeInvalid);
VTCompressionSessionInvalidate(self.compressionSession);
CFRelease(self.compressionSession);
self.compressionSession = NULL;
}
@end
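Everything gotSpsPps: and gotEncodedData: write out is a raw Annex B stream: each parameter set and NAL unit is preceded by the four-byte start code, so the file reads 00 00 00 01 [SPS] 00 00 00 01 [PPS] 00 00 00 01 [IDR slice] 00 00 00 01 [slice] and so on. Players that understand raw H.264, such as VLC or ffplay, should be able to open the resulting videoAudioCapture.h264 straight from the sandbox, which is a quick way to verify the encoder output.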
Usage
//
// TGVideoCapture.swift
// videocapture
//
// Created by targetcloud on 2017/3/30.
// Copyright © 2017年 targetcloud. All rights reserved.
//
import UIKit
import AVFoundation
class TGVideoCapture: NSObject {
fileprivate lazy var videoQueue = DispatchQueue.global()
fileprivate lazy var audioQueue = DispatchQueue.global()
fileprivate lazy var session : AVCaptureSession = {
let session = AVCaptureSession()
session.sessionPreset = AVCaptureSessionPreset1280x720;
return session
}()
//MARK:- per-capture mode, step 1
fileprivate var encoder : TGH264Encoder?
fileprivate lazy var previewLayer : AVCaptureVideoPreviewLayer = AVCaptureVideoPreviewLayer(session: self.session)
fileprivate var connection : AVCaptureConnection?
fileprivate var videoOutput : AVCaptureVideoDataOutput?
fileprivate var videoInput : AVCaptureDeviceInput?
fileprivate var view : UIView
init(_ view : UIView){
self.view = view
super.init()
setupVideo()
setupAudio()
}
func startCapture() {
//MARK:- per-capture mode, step 1 (every start gets a brand-new encoder)
encoder = { () -> TGH264Encoder! in
let p = TGVTSessionSetProperty()
p.height = 1280
p.width = 720
p.expectedFrameRate = 30
p.averageBitRate = 1280*720//1920*1080 1280*720 720*576 640*480 480*360
p.maxKeyFrameInterval = 30//GOP size: the larger the value, the smaller the compressed output
return TGH264Encoder(property: p)
}()
if connection?.isVideoOrientationSupported ?? false {
connection?.videoOrientation = .portrait
}
connection?.preferredVideoStabilizationMode = .auto
previewLayer.frame = view.bounds
view.layer.insertSublayer(previewLayer, at: 0)
session.startRunning()
}
func endCapture() {
session.stopRunning()
previewLayer.removeFromSuperlayer()
//MARK:- per-capture mode, step 3
encoder?.endEncode()
}
func switchFrontOrBack() {
// CATransition
let rotationAnim = CATransition()
rotationAnim.type = "oglFlip"
rotationAnim.subtype = "fromLeft"
rotationAnim.duration = 0.5
view.layer.add(rotationAnim, forKey: nil)
// Check Current videoInput
guard let videoInput = videoInput else { return }
// Change Position
let position : AVCaptureDevicePosition = videoInput.device.position == .front ? .back : .front
// New DeviceInput
guard let devices = AVCaptureDevice.devices(withMediaType: AVMediaTypeVideo) as? [AVCaptureDevice] else { return }
guard let newDevice = devices.filter({$0.position == position}).first else { return }
guard let newVideoInput = try? AVCaptureDeviceInput(device: newDevice) else { return }
// Remove videoInput & Add newVideoInput
session.beginConfiguration()
session.removeInput(videoInput)
session.addInput(newVideoInput)
session.commitConfiguration()
// Save Current videoInput
self.videoInput = newVideoInput
// portrait
connection = videoOutput?.connection(withMediaType: AVMediaTypeVideo)
if connection?.isVideoOrientationSupported ?? false {
connection?.videoOrientation = .portrait
}
connection?.preferredVideoStabilizationMode = .auto
}
}
extension TGVideoCapture {
fileprivate func setupVideo() {
//info.plist add Privacy - Camera Usage Description
guard let devices = AVCaptureDevice.devices(withMediaType: AVMediaTypeVideo) as? [AVCaptureDevice] else {return}
guard let device = devices.filter({$0.position == .back}).first else {return}
guard let videoInput = try? AVCaptureDeviceInput(device: device) else {return}
if session.canAddInput(videoInput){
session.addInput(videoInput)
}
self.videoInput = videoInput
let videoOutput = AVCaptureVideoDataOutput()
videoOutput.setSampleBufferDelegate(self, queue:videoQueue)
videoOutput.alwaysDiscardsLateVideoFrames = true
if session.canAddOutput(videoOutput){
session.addOutput(videoOutput)
}
connection = videoOutput.connection(withMediaType: AVMediaTypeVideo)
self.videoOutput = videoOutput
}
fileprivate func setupAudio() {
//info.plist add Privacy - Microphone Usage Description
guard let device = AVCaptureDevice.defaultDevice(withMediaType: AVMediaTypeAudio) else {return}
guard let audioInput = try? AVCaptureDeviceInput(device: device) else {return}
if session.canAddInput(audioInput){
session.addInput(audioInput)
}
let audioOutput = AVCaptureAudioDataOutput()
audioOutput.setSampleBufferDelegate(self, queue:audioQueue)
if session.canAddOutput(audioOutput){
session.addOutput(audioOutput)
}
}
}
extension TGVideoCapture : AVCaptureVideoDataOutputSampleBufferDelegate,AVCaptureAudioDataOutputSampleBufferDelegate{
func captureOutput(_ captureOutput: AVCaptureOutput!, didOutputSampleBuffer sampleBuffer: CMSampleBuffer!, from connection: AVCaptureConnection!) {
if connection == self.connection {
print("- captured a video frame");
//MARK:- per-capture mode, step 2 (only video frames go to the encoder)
encoder?.encode(sampleBuffer)
}else{
print("captured audio data -");
}
}
}
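Note what the per-capture mode implies: every startCapture builds a brand-new TGH264Encoder, whose setupFileHandle deletes and recreates videoAudioCapture.h264, so only the most recent capture survives in Documents; endCapture then tears the compression session down completely in endEncode.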
Mode 2: creating the encoder lazily (lazy-loading mode)
//
// TGH264Encoder.m
// videocapture
//
// Created by targetcloud on 2017/3/30.
// Copyright © 2017年 targetcloud. All rights reserved.
//
#import "TGH264Encoder.h"
#import "TGVTSessionSetProperty.h"
@interface TGH264Encoder()
@property (nonatomic, assign) NSInteger frameID;
@property (nonatomic, assign) VTCompressionSessionRef compressionSession;
@property (nonatomic, strong) NSFileHandle *fileHandle;
@property(nonatomic, strong) TGVTSessionSetProperty * properties ;
@end
@implementation TGH264Encoder
- (instancetype)initWithProperty : (TGVTSessionSetProperty *) properties {
if (self = [super init]) {
self.properties = properties;
[self setupFileHandle];
[self setupVideoSession];
}
return self;
}
- (void)setupFileHandle {
NSString *file = [[NSSearchPathForDirectoriesInDomains(NSDocumentDirectory, NSUserDomainMask, YES) lastObject]
stringByAppendingPathComponent:@"videoAudioCapture.h264"];
[[NSFileManager defaultManager] removeItemAtPath:file error:nil];
[[NSFileManager defaultManager] createFileAtPath:file contents:nil attributes:nil];
self.fileHandle = [NSFileHandle fileHandleForWritingAtPath:file];
}
- (void)setupVideoSession {
self.frameID = 0;
int width = self.properties.width;
int height = self.properties.height;
// Create the CompressionSession that encodes the frames. kCMVideoCodecType_H264 selects H.264 encoding; h264VTCompressionOutputCallback is called each time a frame finishes encoding, and is where the encoded data gets written to file.
VTCompressionSessionCreate(NULL,
width,
height,
kCMVideoCodecType_H264,
NULL,
NULL,
NULL,
h264VTCompressionOutputCallback,
(__bridge void *)(self),
&_compressionSession);
// Real-time encoding output (needed for live streaming; otherwise output lags)
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_RealTime, (__bridge CFTypeRef _Nonnull)(@YES));//kCFBooleanTrue
// Expected frame rate (frames per second; too low a rate makes playback stutter)
int fps = self.properties.expectedFrameRate;
CFNumberRef fpsRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &fps);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_ExpectedFrameRate, fpsRef);
// Average bit rate (the higher the bit rate, the clearer the picture)
int bitRate = self.properties.averageBitRate;
CFNumberRef bitRateRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberSInt32Type, &bitRate);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_AverageBitRate, bitRateRef);// bits per second
NSArray *limit = @[@(bitRate * 1.5/8), @(1)];
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_DataRateLimits, (__bridge CFArrayRef)limit);// bytes per second
// Keyframe interval (GOP size)
int frameInterval = self.properties.maxKeyFrameInterval;
CFNumberRef frameIntervalRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &frameInterval);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_MaxKeyFrameInterval, frameIntervalRef);
// Configuration done; prepare to encode
VTCompressionSessionPrepareToEncodeFrames(self.compressionSession);
}
// Callback invoked when a frame has finished encoding
void h264VTCompressionOutputCallback(void *outputCallbackRefCon, void *sourceFrameRefCon, OSStatus status, VTEncodeInfoFlags infoFlags, CMSampleBufferRef sampleBuffer) {
if (status != noErr) {
return;
}
TGH264Encoder* encoder = (__bridge TGH264Encoder*)outputCallbackRefCon;
// Determine whether this is a keyframe
//bool isKeyframe = !CFDictionaryContainsKey( (CFArrayGetValueAtIndex(CMSampleBufferGetSampleAttachmentsArray(sampleBuffer, true), 0)), kCMSampleAttachmentKey_NotSync);
CFArrayRef attachments = CMSampleBufferGetSampleAttachmentsArray(sampleBuffer, true);
CFDictionaryRef dict = CFArrayGetValueAtIndex(attachments,0);
BOOL isKeyframe = !CFDictionaryContainsKey(dict,kCMSampleAttachmentKey_NotSync);
if (isKeyframe){// for a keyframe, also extract the SPS & PPS
// Get the format description of the encoded output
CMFormatDescriptionRef format = CMSampleBufferGetFormatDescription(sampleBuffer);
// Get the SPS
size_t sparameterSetSize, sparameterSetCount;
const uint8_t *sparameterSet;
CMVideoFormatDescriptionGetH264ParameterSetAtIndex(format, 0, &sparameterSet, &sparameterSetSize, &sparameterSetCount, NULL );
// Get the PPS
size_t pparameterSetSize, pparameterSetCount;
const uint8_t *pparameterSet;
CMVideoFormatDescriptionGetH264ParameterSetAtIndex(format, 1, &pparameterSet, &pparameterSetSize, &pparameterSetCount, NULL );
// Wrap the SPS/PPS in NSData so they can be written to file
NSData *sps = [NSData dataWithBytes:sparameterSet length:sparameterSetSize];
NSData *pps = [NSData dataWithBytes:pparameterSet length:pparameterSetSize];
// Write them to file
[encoder gotSpsPps:sps pps:pps];
}
// Get the block buffer holding the encoded data
CMBlockBufferRef dataBuffer = CMSampleBufferGetDataBuffer(sampleBuffer);
size_t length, totalLength;
char *dataPointer;
OSStatus statusCodeRet = CMBlockBufferGetDataPointer(dataBuffer, 0, &length, &totalLength, &dataPointer);
if (statusCodeRet == noErr) {
size_t bufferOffset = 0;
static const int h264AVCCHeaderLength = 4;
// Walk the NAL units one by one
while (bufferOffset < totalLength - h264AVCCHeaderLength) {// one frame may be written out as several NAL units (slices)
uint32_t NALUnitLength = 0;
memcpy(&NALUnitLength, dataPointer + bufferOffset, h264AVCCHeaderLength);//NALU length
NALUnitLength = CFSwapInt32BigToHost(NALUnitLength);// the length prefix in the encoded data is big-endian; convert to host byte order
NSData* data = [[NSData alloc] initWithBytes:(dataPointer + bufferOffset + h264AVCCHeaderLength) length:NALUnitLength];
[encoder gotEncodedData:data isKeyFrame:isKeyframe];
bufferOffset += h264AVCCHeaderLength + NALUnitLength;
}
}
}
- (void)gotSpsPps:(NSData*)sps pps:(NSData*)pps{
// NALU header (Annex B start code)
const char bytes[] = "\x00\x00\x00\x01";// the string literal carries a hidden trailing '\0', hence the -1 below
size_t length = (sizeof bytes) - 1;
NSData *ByteHeader = [NSData dataWithBytes:bytes length:length];
[self.fileHandle writeData:ByteHeader];
[self.fileHandle writeData:sps];
[self.fileHandle writeData:ByteHeader];
[self.fileHandle writeData:pps];
}
- (void)gotEncodedData:(NSData*)data isKeyFrame:(BOOL)isKeyFrame{
NSLog(@" --- gotEncodedData %d --- ", (int)[data length]);
if (self.fileHandle != NULL){
const char bytes[] = "\x00\x00\x00\x01";
size_t length = (sizeof bytes) - 1; //string literals have implicit trailing '\0'
NSData *ByteHeader = [NSData dataWithBytes:bytes length:length];
[self.fileHandle writeData:ByteHeader];
[self.fileHandle writeData:data];
}
}
// Entry point -> h264VTCompressionOutputCallback
- (void)encodeSampleBuffer:(CMSampleBufferRef)sampleBuffer {
CVImageBufferRef imageBuffer = (CVImageBufferRef)CMSampleBufferGetImageBuffer(sampleBuffer);// extract the image buffer from the sample buffer
CMTime presentationTimeStamp = CMTimeMake(self.frameID++, self.properties.expectedFrameRate);// PTS/DTS: build a CMTime from the current frame count
VTEncodeInfoFlags flag;
// Encode this frame
OSStatus statusCode = VTCompressionSessionEncodeFrame(self.compressionSession,
imageBuffer,
presentationTimeStamp,
kCMTimeInvalid,
NULL,
(__bridge void * _Nullable)(self),//h264VTCompressionOutputCallback sourceFrameRefCon
&flag);//h264VTCompressionOutputCallback infoFlags
if (statusCode == noErr) {
NSLog(@" --- H264: VTCompressionSessionEncodeFrame Success --- ");
}
}
- (void)endEncode {
VTCompressionSessionCompleteFrames(self.compressionSession, kCMTimeInvalid);
// After encoding ends, archive this run's output under a timestamped name and reset videoAudioCapture.h264 to its initial state; this suits a lazily created encoder
NSString * path = [NSSearchPathForDirectoriesInDomains(NSDocumentDirectory, NSUserDomainMask, YES) lastObject];
NSDateFormatter *formatter = [[NSDateFormatter alloc] init];
[formatter setDateFormat:@"yyyy-MM-dd HH:mm:ss"];
NSString * dateStr = [formatter stringFromDate:[NSDate date]];
[[NSFileManager defaultManager] copyItemAtPath:[ path stringByAppendingPathComponent:@"videoAudioCapture.h264"]
toPath:[ path stringByAppendingPathComponent:[NSString stringWithFormat:@"%@.h264",dateStr]] error:NULL];
[self setupFileHandle];
// Because the outer layer creates TGH264Encoder lazily, the session is not released here; if the encoder is created per capture instead, release it by uncommenting the three lines below
//VTCompressionSessionInvalidate(self.compressionSession);
//CFRelease(self.compressionSession);
//self.compressionSession = NULL;
}
@end
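Unlike the per-capture variant, endEncode here keeps the compression session alive and instead archives the finished dump: videoAudioCapture.h264 is copied to a timestamped "yyyy-MM-dd HH:mm:ss.h264" file and a fresh dump file is opened, so each capture run leaves behind its own recording.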
Usage
//
// TGVideoCapture.swift
// videocapture
//
// Created by targetcloud on 2017/3/30.
// Copyright © 2017年 targetcloud. All rights reserved.
//
import UIKit
import AVFoundation
class TGVideoCapture: NSObject {
fileprivate lazy var videoQueue = DispatchQueue.global()
fileprivate lazy var audioQueue = DispatchQueue.global()
fileprivate lazy var session : AVCaptureSession = {
let session = AVCaptureSession()
session.sessionPreset = AVCaptureSessionPreset1280x720;
return session
}()
//MARK:- lazy mode, step 1
fileprivate lazy var encoder : TGH264Encoder = {
let p = TGVTSessionSetProperty()
p.height = 1280
p.width = 720
p.expectedFrameRate = 30
p.averageBitRate = 1280*720//1920*1080 1280*720 720*576 640*480 480*360
p.maxKeyFrameInterval = 30//GOP size: the larger the value, the smaller the compressed output
return TGH264Encoder(property: p)
}()
fileprivate lazy var previewLayer : AVCaptureVideoPreviewLayer = AVCaptureVideoPreviewLayer(session: self.session)
fileprivate var connection : AVCaptureConnection?
fileprivate var videoOutput : AVCaptureVideoDataOutput?
fileprivate var videoInput : AVCaptureDeviceInput?
fileprivate var view : UIView
init(_ view : UIView){
self.view = view
super.init()
setupVideo()
setupAudio()
}
func startCapture() {
if connection?.isVideoOrientationSupported ?? false {
connection?.videoOrientation = .portrait
}
connection?.preferredVideoStabilizationMode = .auto
previewLayer.frame = view.bounds
view.layer.insertSublayer(previewLayer, at: 0)
session.startRunning()
}
func endCapture() {
session.stopRunning()
previewLayer.removeFromSuperlayer()
//MARK:- lazy mode, step 3
encoder.endEncode()
}
func switchFrontOrBack() {
// CATransition
let rotationAnim = CATransition()
rotationAnim.type = "oglFlip"
rotationAnim.subtype = "fromLeft"
rotationAnim.duration = 0.5
view.layer.add(rotationAnim, forKey: nil)
// Check Current videoInput
guard let videoInput = videoInput else { return }
// Change Position
let position : AVCaptureDevicePosition = videoInput.device.position == .front ? .back : .front
// New DeviceInput
guard let devices = AVCaptureDevice.devices(withMediaType: AVMediaTypeVideo) as? [AVCaptureDevice] else { return }
guard let newDevice = devices.filter({$0.position == position}).first else { return }
guard let newVideoInput = try? AVCaptureDeviceInput(device: newDevice) else { return }
// Remove videoInput & Add newVideoInput
session.beginConfiguration()
session.removeInput(videoInput)
session.addInput(newVideoInput)
session.commitConfiguration()
// Save Current videoInput
self.videoInput = newVideoInput
// portrait
connection = videoOutput?.connection(withMediaType: AVMediaTypeVideo)
if connection?.isVideoOrientationSupported ?? false {
connection?.videoOrientation = .portrait
}
connection?.preferredVideoStabilizationMode = .auto
}
}
extension TGVideoCapture {
fileprivate func setupVideo() {
//info.plist add Privacy - Camera Usage Description
guard let devices = AVCaptureDevice.devices(withMediaType: AVMediaTypeVideo) as? [AVCaptureDevice] else {return}
guard let device = devices.filter({$0.position == .back}).first else {return}
guard let videoInput = try? AVCaptureDeviceInput(device: device) else {return}
if session.canAddInput(videoInput){
session.addInput(videoInput)
}
self.videoInput = videoInput
let videoOutput = AVCaptureVideoDataOutput()
videoOutput.setSampleBufferDelegate(self, queue:videoQueue)
videoOutput.alwaysDiscardsLateVideoFrames = true
if session.canAddOutput(videoOutput){
session.addOutput(videoOutput)
}
connection = videoOutput.connection(withMediaType: AVMediaTypeVideo)
self.videoOutput = videoOutput
}
fileprivate func setupAudio() {
//info.plist add Privacy - Microphone Usage Description
guard let device = AVCaptureDevice.defaultDevice(withMediaType: AVMediaTypeAudio) else {return}
guard let audioInput = try? AVCaptureDeviceInput(device: device) else {return}
if session.canAddInput(audioInput){
session.addInput(audioInput)
}
let audioOutput = AVCaptureAudioDataOutput()
audioOutput.setSampleBufferDelegate(self, queue:audioQueue)
if session.canAddOutput(audioOutput){
session.addOutput(audioOutput)
}
}
}
extension TGVideoCapture : AVCaptureVideoDataOutputSampleBufferDelegate,AVCaptureAudioDataOutputSampleBufferDelegate{
func captureOutput(_ captureOutput: AVCaptureOutput!, didOutputSampleBuffer sampleBuffer: CMSampleBuffer!, from connection: AVCaptureConnection!) {
if connection == self.connection {
print("- captured a video frame");
//MARK:- lazy mode, step 2 (only video frames go to the encoder)
encoder.encode(sampleBuffer)
}else{
print("captured audio data -");
}
}
}
Because the encoder is written in Objective-C while the outer layer is written in Swift 3, a bridging header is also required:
//
// Use this file to import your target's public headers that you would like to expose to Swift.
//
#import "TGH264Encoder.h"
#import "TGVTSessionSetProperty.h"
At the outermost layer, the UI (the view controller) calls the Swift 3 TGVideoCapture:
//
// ViewController.swift
// videocapture
//
// Created by targetcloud on 2016/11/12.
// Copyright © 2016年 targetcloud. All rights reserved.
//
import UIKit
class ViewController: UIViewController {
fileprivate lazy var videoCapture : TGVideoCapture = TGVideoCapture(self.view)
override func viewDidLoad() {
super.viewDidLoad()
}
@IBAction func startCapture(_ sender: Any) {
videoCapture.startCapture()
}
@IBAction func endCapture(_ sender: Any) {
videoCapture.endCapture()
}
@IBAction func switchFrontOrBack(_ sender: Any) {
videoCapture.switchFrontOrBack()
}
}
The overall execution flow is:
1. ViewController lazily creates a videoCapture; capture starts with videoCapture.startCapture(), stops with videoCapture.endCapture(), and videoCapture.switchFrontOrBack() toggles between the front and back cameras.
2. TGVideoCapture's initializer takes the view from step 1 for the preview layer and sets up capture at the same time (setupVideo/setupAudio).
3. For startCapture, two ways of wiring up the H.264 encoder were shown above; pick whichever fits. If a fresh encoder per capture is not needed, use the lazy-loading variant.
3.1. With lazy loading, all of the H.264 encoder's properties are configured in one place; adjust them here as needed:
fileprivate lazy var encoder : TGH264Encoder = {
let p = TGVTSessionSetProperty()
p.height = 1280
p.width = 720
p.expectedFrameRate = 30
p.averageBitRate = 1280*720//1920*1080 1280*720 720*576 640*480 480*360
p.maxKeyFrameInterval = 30//GOP size: the larger the value, the smaller the compressed output
return TGH264Encoder(property: p)//the key line
}()
3.2. Inside the lazy initializer, TGVTSessionSetProperty is where the settings are actually applied; the crucial part is registering the h264VTCompressionOutputCallback:
- (instancetype)initWithProperty : (TGVTSessionSetProperty *) properties;
- (instancetype)initWithProperty : (TGVTSessionSetProperty *) properties {
if (self = [super init]) {
self.properties = properties;
[self setupFileHandle];
[self setupVideoSession];
}
return self;
}
- (void)setupVideoSession {
self.frameID = 0;
int width = self.properties.width;
int height = self.properties.height;
// Create the CompressionSession that encodes the frames. kCMVideoCodecType_H264 selects H.264 encoding; h264VTCompressionOutputCallback is called each time a frame finishes encoding, and is where the encoded data gets written to file.
VTCompressionSessionCreate(NULL,
width,
height,
kCMVideoCodecType_H264,
NULL,
NULL,
NULL,
h264VTCompressionOutputCallback,
(__bridge void *)(self),
&_compressionSession);
// Real-time encoding output (needed for live streaming; otherwise output lags)
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_RealTime, (__bridge CFTypeRef _Nonnull)(@YES));//kCFBooleanTrue
// Expected frame rate (frames per second; too low a rate makes playback stutter)
int fps = self.properties.expectedFrameRate;
CFNumberRef fpsRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &fps);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_ExpectedFrameRate, fpsRef);
// Average bit rate (the higher the bit rate, the clearer the picture)
int bitRate = self.properties.averageBitRate;
CFNumberRef bitRateRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberSInt32Type, &bitRate);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_AverageBitRate, bitRateRef);// bits per second
NSArray *limit = @[@(bitRate * 1.5/8), @(1)];
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_DataRateLimits, (__bridge CFArrayRef)limit);// bytes per second
// Keyframe interval (GOP size)
int frameInterval = self.properties.maxKeyFrameInterval;
CFNumberRef frameIntervalRef = CFNumberCreate(kCFAllocatorDefault, kCFNumberIntType, &frameInterval);
VTSessionSetProperty(self.compressionSession, kVTCompressionPropertyKey_MaxKeyFrameInterval, frameIntervalRef);
// Configuration done; prepare to encode
VTCompressionSessionPrepareToEncodeFrames(self.compressionSession);
}
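A quick sanity check of the numbers used above: with averageBitRate = 1280*720 = 921,600 bit/s, the DataRateLimits pair becomes [921,600 * 1.5 / 8, 1] = [172,800, 1], i.e. a hard cap of 172,800 bytes in any 1-second window, 1.5x the average rate. Note the units: kVTCompressionPropertyKey_AverageBitRate is specified in bits per second, while kVTCompressionPropertyKey_DataRateLimits takes bytes, hence the division by 8.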
4. captureOutput(_:didOutputSampleBuffer:from:) is called once session.startRunning() is in effect; each video sample buffer then enters - (void)encodeSampleBuffer:(CMSampleBufferRef)sampleBuffer.
The triggering call is
encoder.encode(sampleBuffer)
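(encode(sampleBuffer) and encodeSampleBuffer: are the same method: Swift 3's importer drops the redundant type name from the Objective-C selector, so - (void)encodeSampleBuffer:(CMSampleBufferRef)sampleBuffer surfaces in Swift as encode(_:).)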
5. encodeSampleBuffer hands the frame to VTCompressionSessionEncodeFrame, which invokes the h264VTCompressionOutputCallback registered in step 3; the callback finishes the encoding and writes the output:
// Entry point -> h264VTCompressionOutputCallback
- (void)encodeSampleBuffer:(CMSampleBufferRef)sampleBuffer {
CVImageBufferRef imageBuffer = (CVImageBufferRef)CMSampleBufferGetImageBuffer(sampleBuffer);// extract the image buffer from the sample buffer
CMTime presentationTimeStamp = CMTimeMake(self.frameID++, self.properties.expectedFrameRate);// PTS/DTS: build a CMTime from the current frame count
VTEncodeInfoFlags flag;
// Encode this frame
OSStatus statusCode = VTCompressionSessionEncodeFrame(self.compressionSession,
imageBuffer,
presentationTimeStamp,
kCMTimeInvalid,
NULL,
(__bridge void * _Nullable)(self),//h264VTCompressionOutputCallback sourceFrameRefCon
&flag);//h264VTCompressionOutputCallback infoFlags
if (statusCode == noErr) {
NSLog(@" --- H264: VTCompressionSessionEncodeFrame Success --- ");
}
}
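6. Inside h264VTCompressionOutputCallback, a keyframe first has its SPS and PPS pulled out of the sample buffer's format description and written behind 00 00 00 01 start codes; the encoded payload is then walked NAL unit by NAL unit, each 4-byte big-endian AVCC length prefix being swapped for the same Annex B start code before the unit is appended to videoAudioCapture.h264. The result is a raw Annex B .h264 file that can be copied out of the sandbox and played back to verify the whole pipeline.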