車輛識別（特徵提取+svm分類器）

阿新 • • 發佈：2019-01-04

以下為udacity的SDCND的一個專案

ps：這裡使用的是用opencv進行特徵提取+svm分類器的方法實現物體檢測，是在深度學習流行前比較經典的實現方法

專案描述：

使用openCV提取圖片特徵，訓練svm分類器，分類車輛與非車輛。用訓練好的模型識別汽車前置攝像頭記錄視訊中的車輛。

實現步驟：

分析訓練資料，提取圖片HOG特徵。
訓練分類器
應用滑動視窗(sliding windows)實現車輛檢測
應用熱力圖(heatMap)過濾錯誤檢測(false positive)

分析訓練資料，提取圖片HOG特徵

訓練資料為64x64x3的RBG圖片，包含車輛與非車輛圖片兩類，車輛圖片8792張，非車輛圖片8968張。以下為車輛，非車輛圖片樣例：

提取HOG特徵，以下為實現方法：

# Define a function to return HOG features and visualization
def get_hog_features(img, orient, pix_per_cell, cell_per_block, vis=False, feature_vec=True):
    if vis == True:
        features, hog_image = hog(img, orientations=orient, pixels_per_cell=(pix_per_cell, pix_per_cell),
                                  cells_per_block=(cell_per_block, cell_per_block), transform_sqrt=False, 
                                  visualise=True, feature_vector=False)
        return features, hog_image
    else:      
        features = hog(img, orientations=orient, pixels_per_cell=(pix_per_cell, pix_per_cell),
                       cells_per_block=(cell_per_block, cell_per_block), transform_sqrt=False, 
                       visualise=False, feature_vector=feature_vec)
        return features

以下為原圖與提取的HOG特徵圖對比：

訓練分類器

這裡使用SVM分類器，以下為程式碼：

t = time.time()
car_features = utils.extract_features(cars, cspace=colorspace, orient=orient,
                        pix_per_cell=pix_per_cell, cell_per_block=cell_per_block,
                        hog_channel=hog_channel)
notcar_features = utils.extract_features(notcars, cspace=colorspace, orient=orient,
                        pix_per_cell=pix_per_cell, cell_per_block=cell_per_block,
                        hog_channel=hog_channel)

t2 = time.time()
print(round(t2-t, 2), 'Seconds to extract features...')

# Create an array stack of feature vectors
X = np.vstack((car_features, notcar_features))
X = X.astype(np.float64)                       
# Fit a per-column scaler
# X_scaler = StandardScaler().fit(X)
# Apply the scaler to X
# scaled_X = X_scaler.transform(X)

# Define the labels vector
y = np.hstack((np.ones(len(car_features)), np.zeros(len(notcar_features))))


# Split up data into randomized training and test sets
rand_state = np.random.randint(0, 100)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=rand_state)


print('Feature vector length:', len(X_train[0]))
# Use a linear SVC 
svc = LinearSVC()
# Check the training time for the SVC
t = time.time()
svc.fit(X_train, y_train)
t2 = time.time()
t2 = time.time()
print(round(t2-t, 2), 'Seconds to train classfier...')
# Check the score of the SVC
print('Test Accuracy of classfier = ', round(svc.score(X_test, y_test), 4))
# Check the prediction time for a single sample
t=time.time()
n_predict = 10
print('My classfier predicts: ', svc.predict(X_test[0:n_predict]))
print('For these',n_predict, 'labels: ', y_test[0:n_predict])
t2 = time.time()
print(round(t2-t, 5), 'Seconds to predict', n_predict,'labels with classfier')

最終訓練的分類器在測試資料集得到98.0%準確率

應用滑動視窗(sliding windows)實現車輛檢測

由於提取HOG特徵比較耗時，先直接提取整張圖片的HOG特徵，然後獲取每個視窗所屬的那部分HOG特徵，這樣效率會更高，以下為滑動視窗搜尋的程式碼實現：

# Define a single function that can extract features using hog sub-sampling and make predictions
def find_cars(img, ystart, ystop, scale, cspace, hog_channel, svc, X_scaler, orient,
              pix_per_cell, cell_per_block, spatial_size, hist_bins, show_all_rectangles=False):
    # array of rectangles where cars were detected
    windows = []

    img = img.astype(np.float32) / 255

    img_tosearch = img[ystart:ystop, :, :]

    # apply color conversion if other than 'RGB'
    if cspace != 'RGB':
        if cspace == 'HSV':
            ctrans_tosearch = cv2.cvtColor(img_tosearch, cv2.COLOR_RGB2HSV)
        elif cspace == 'LUV':
            ctrans_tosearch = cv2.cvtColor(img_tosearch, cv2.COLOR_RGB2LUV)
        elif cspace == 'HLS':
            ctrans_tosearch = cv2.cvtColor(img_tosearch, cv2.COLOR_RGB2HLS)
        elif cspace == 'YUV':
            ctrans_tosearch = cv2.cvtColor(img_tosearch, cv2.COLOR_RGB2YUV)
        elif cspace == 'YCrCb':
            ctrans_tosearch = cv2.cvtColor(img_tosearch, cv2.COLOR_RGB2YCrCb)
    else:
        ctrans_tosearch = np.copy(img)

    # rescale image if other than 1.0 scale
    if scale != 1:
        imshape = ctrans_tosearch.shape
        ctrans_tosearch = cv2.resize(ctrans_tosearch, (np.int(imshape[1] / scale), np.int(imshape[0] / scale)))

    # select colorspace channel for HOG
    if hog_channel == 'ALL':
        ch1 = ctrans_tosearch[:, :, 0]
        ch2 = ctrans_tosearch[:, :, 1]
        ch3 = ctrans_tosearch[:, :, 2]
    else:
        ch1 = ctrans_tosearch[:, :, hog_channel]

    # Define blocks and steps as above
    nxblocks = (ch1.shape[1] // pix_per_cell) + 1  # -1
    nyblocks = (ch1.shape[0] // pix_per_cell) + 1  # -1
    nfeat_per_block = orient * cell_per_block ** 2
    # 64 was the orginal sampling rate, with 8 cells and 8 pix per cell
    window = 64
    nblocks_per_window = (window // pix_per_cell) - 1
    cells_per_step = 2  # Instead of overlap, define how many cells to step
    nxsteps = (nxblocks - nblocks_per_window) // cells_per_step
    nysteps = (nyblocks - nblocks_per_window) // cells_per_step

    # Compute individual channel HOG features for the entire image
    hog1 = utils.get_hog_features(ch1, orient, pix_per_cell, cell_per_block, feature_vec=False)
    if hog_channel == 'ALL':
        hog2 = utils.get_hog_features(ch2, orient, pix_per_cell, cell_per_block, feature_vec=False)
        hog3 = utils.get_hog_features(ch3, orient, pix_per_cell, cell_per_block, feature_vec=False)

    for xb in range(nxsteps):
        for yb in range(nysteps):
            ypos = yb * cells_per_step
            xpos = xb * cells_per_step
            # Extract HOG for this patch
            hog_feat1 = hog1[ypos:ypos + nblocks_per_window, xpos:xpos + nblocks_per_window].ravel()
            if hog_channel == 'ALL':
                hog_feat2 = hog2[ypos:ypos + nblocks_per_window, xpos:xpos + nblocks_per_window].ravel()
                hog_feat3 = hog3[ypos:ypos + nblocks_per_window, xpos:xpos + nblocks_per_window].ravel()
                hog_features = np.hstack((hog_feat1, hog_feat2, hog_feat3))
            else:
                hog_features = hog_feat1

            xleft = xpos * pix_per_cell
            ytop = ypos * pix_per_cell



            test_prediction = svc.predict(hog_features)

            if test_prediction == 1 or show_all_rectangles:
                xbox_left = np.int(xleft * scale)
                ytop_draw = np.int(ytop * scale)
                win_draw = np.int(window * scale)
                windows.append(
                    ((xbox_left, ytop_draw + ystart), (xbox_left + win_draw, ytop_draw + win_draw + ystart)))

    return windows

車輛由於距離遠近不同會在視訊呈現的不一樣的大小且出現的位置也會各異，這裡使用4類不同大小的滑動視窗對圖片中的車輛進行搜尋：

第一類大小為64x64,重疊率(overlap)為0.75，用來檢測距離較遠的車輛：

第二類大小為96x96，重疊率(overlap)為0.75，用來檢測中距離車輛：

第三類大小為128x128,重疊率(overlap)為0.75，用來檢測近距離車輛:

第四類大小為224x224,重疊率(overlap)為0.75，用來檢測極近距離車輛:

應用在測試圖片得到的下列結果：

可以看到存在一些多視窗重合及錯誤檢測(false positive)現象

應用熱圖(heatMap)過濾錯誤檢測(false positive)

由於使用多個大小不一滑動視窗，且視窗存在重疊，單個車輛影象會被多個視窗捕捉檢測。使用這個現象可以過濾錯誤檢測。

記錄一張圖片上所有positive detections，使用記錄的positive detections形成一個檢測熱圖：

def add_heat(heatmap, bbox_list):
    # Iterate through list of bboxes
    for box in bbox_list:
        # Add += 1 for all pixels inside each bbox
        # Assuming each "box" takes the form ((x1, y1), (x2, y2))
        heatmap[box[0][1]:box[1][1], box[0][0]:box[1][0]] += 1

以下應用在測試圖片得到的檢測熱圖：

然後對熱圖進行閾值過濾,過濾錯誤檢測,以下為閾值過濾實現程式碼:

def apply_threshold(heatmap, threshold):
    # Zero out pixels below the threshold
    heatmap[heatmap <= threshold] = 0
    # Return thresholded map
    return heatmap

以下為整個pipeline應用在測試圖片的效果：

車輛識別（特徵提取+svm分類器）

以下為udacity的SDCND的一個專案 ps：這裡使用的是用opencv進行特徵提取+svm分類器的方法實現物體檢測，是在深度學習流行前比較經典的實現方法專案描述：使用openCV提取圖片特徵，訓練svm分類器，分類車輛與非車輛。用訓練好的模型識別汽車前置攝

提取HOG特徵訓練SVM分類器（一）HOG篇

利用hog特徵訓練svm分類器的總體思路：1、提取正負樣本hog特徵 2、投入svm分類器訓練，得到model3、由model生成檢測子4、利用檢測子檢測負樣本，得到hardexample 5、提取hardexample的hog特徵並結合第一步中的特徵一起投入訓練，得到最終

利用Hog特徵和SVM分類器進行行人檢測

https://blog.csdn.net/qianqing13579/article/details/46509037 梯度直方圖特徵(HOG) 是一種對影象區域性重疊區域的密集型描述符, 它通過計算區域性區域的梯度方向直方圖來構成特徵。Hog特徵結合SVM分類器已經被廣

tf.estimator API技術手冊（8）——DNNClassifier（深度神經網路分類器）

（一）簡介繼承自Estimator，定義在tensorflow/python/estimator/canned/dnn.py中，用來建立深度神經網路模型。示例如下： categorical_fe

周志華《機器學習》之第七章（貝葉斯分類器）概念總結

貝葉斯分類器是利用概率的知識完成資料的分類任務，在機器學習中使用貝葉斯決策論實施決策的基本方法也是在概率的框架下進行的，它是考慮如何基於這些概率和誤判損失來選擇最優的類別標記。 1、貝葉斯決策論條件風險：假設有N種可能的類別標記，Y={c1,c2,c3

OPENCV HOG特徵+SVM分類器行人識別（從訓練到識別）

想要訓練分類器，首先要有樣本，正樣本和負樣本，在這裡就是有人的樣本和沒有人的樣本，我的樣本來源於”INRIA Person Dataset”這個網站，連結為點選開啟連結，在下邊有個藍色here（970M），點選下載即可，也可以去我的網盤下載，地址點選開啟

OpenCV學習記錄（二）：自己訓練haar特徵的adaboost分類器進行人臉識別

上一篇文章中介紹瞭如何使用OpenCV自帶的haar分類器進行人臉識別（點我開啟）。這次我試著自己去訓練一個haar分類器，前後花了兩天，最後總算是訓練完了。不過效果並不是特別理想，由於我是在自己的筆記本上進行訓練，為減少訓練時間我的樣本量不是很大，最後也只是勉強看看效果了

Regularized least-squares classification（正則化最小二乘法分類器）取代SVM

得出 ack 提高 kernel sys 風險重要 ref height 在機器學習或者是模式識別其中有一種重要的分類器叫做：SVM 。這個被廣泛的應用於各個領域。可是其計算的復雜度以及訓練的速度是制約其在實時的計算機應用的主要原因。因此也非常非常多的算法

【ECG理論篇】（3）AI實現心律失常判別：心電訊號的波形識別與特徵提取

心電圖中的各個波形都包含了非常多的資訊，例如RR間期可以反映心動週期的時限；相鄰心動週期的 RR 間期的比值可以反映室性早搏；R 波和 S 波幅值的比值和 R 波和 S 波之間的時限可以反映房性早搏等異常情況，等等所以識別這些波形以及提取相應特徵對我們後續做心律失常的分類很重要。

OpenCV機器學習：SVM分類器實現MNIST手寫數字識別

0. 開發環境最近機器學習隨著AI人工智慧的興起越來越火，博主想找一些ML的庫來練手。突然想起之前在看Opencv的doc時發現有ML的component，於是心血來潮就開始寫程式碼試試。話不多說，直接進正題。以下我的開發環境配置： -Windows7

Python構建SVM分類器（線性）

1.SVM建立線性分類器SVM用來構建分類器和迴歸器的監督學習模型，SVM通過對數學方程組的求解，可以找出兩組資料之間的最佳分割邊界。2.準備工作我們首先對資料進行視覺化，使用的檔案來自學習書籍配套管網。首先增加以下程式碼：import numpy as np import

SVM分類器的實現（包括交叉驗證選擇引數，Dlib，視覺化）

慣例先放結果圖，左側為訓練樣本，右側為訓練完後的分類演示圖 Dlib的支援向量機用起來比Opencv的爽多了，支援交叉驗證，降低支援向量的個數以及兩種方式判別類別（正負以及可能性兩種）然後就是簡單粗暴的程式碼了： //需要配置Opencv以及Dlib的環境

Matlab自帶的分類學習工具箱（SVM、決策樹、Knn等分類器）

在matlab中，既有各種分類器的訓練函式，比如“fitcsvm”，也有圖形介面的分類學習工具箱，裡面包含SVM、決策樹、Knn等各類分類器，使用非常方便。接下來講講如何使用。啟動：點選“應用程式”，在面板中找到“Classification Lea

Regularized least-squares classification（正則化最小二乘法分類器）代替SVM

在機器學習或者是模式識別當中有一種重要的分類器叫做：SVM 。這個被廣泛的應用於各個領域。但是其計算的複雜度以及訓練的速度是制約其在實時的計算機應用的主要原因。因此也很很多的演算法被提出來，如SMO，Kernel的方法。但是這裡要提到的 Regularized le

人臉檢測（Haar特徵+Adaboost級聯分類器）

一、Haar分類器的前世今生人臉檢測屬於計算機視覺的範疇，早期人們的主要研究方向是人臉識別，即根據人臉來識別人物的身份，後來在複雜背景下的人臉檢測需求越來越大，人臉檢測也逐漸作為一個單獨的研究方向發展起來。目前的人臉檢測方法主要有兩大類：基於知識和基於統計。 “

斯坦福大學公開課機器學習：Neural Networks，representation: non-linear hypotheses（為什麽需要做非線性分類器）

繼續例子產生成本 log repr 概率 .cn 成了如上圖所示，如果用邏輯回歸來解決這個問題，首先需要構造一個包含很多非線性項的邏輯回歸函數g(x)。這裏g仍是s型函數（即）。我們能讓函數包含很多像這的多項式，當多項式足夠多時，那麽你也許能夠得到可以

『科學計算』從Logistic回歸到SVM分類器

zoom ram edi 情況下投影導出 bmp 幾何 sig 轉自：http://blog.csdn.net/v_july_v/article/details/7624837 前言動筆寫這個支持向量機(support vector machine)是費了不少

python調用百度語音（語音識別-鬥地主語音記牌器）

receive idt 本地文件 file post 最終 callback import pri 一、概述本篇簡要介紹百度語音語音識別的基本使用（其實是鬥地主時想弄個記牌器又沒money，抓包什麽的又不會，只好搞語音識別的了）二、創建應用打開百度語

大資料分析學習筆記（Z檢驗，分類器以及Association Rule）

大資料分析學習筆記（Z檢驗，分類器以及Association Rule） Task 1 – Hypothesis Testing To improve student learning performance, a teacher developed two new learning app

利用 sklearn SVM 分類器對 IRIS 資料集分類

利用 sklearn SVM 分類器對 IRIS 資料集分類支援向量機（SVM）是一種最大化分類間隔的線性分類器（如果不考慮核函式）。通過使用核函式可以用於非線性分類。SVM 是一種判別模型，既適用於分類也適用於迴歸問題，標準的 SVM 是二分類器，可以採用 “one vs one”

車輛識別（特徵提取+svm分類器）

以下為udacity的SDCND的一個專案

專案描述：

分析訓練資料，提取圖片HOG特徵

訓練分類器

應用滑動視窗(sliding windows)實現車輛檢測

應用熱圖(heatMap)過濾錯誤檢測(false positive)

相關推薦