基於TensorFlow的SSD車輛檢測-3

阿新 • • 發佈：2019-01-05

百度雲連結總是掛掉，大家實在有需要發我郵箱吧[email protected]

此係列部落格是用來學習Tensorflow和Python的，由於是新手上車，如有錯誤之處希望大家不吝指出。

谷歌雲盤：

三. label製備以及batch資料供給

本環節主要包含下面三塊內容：

一些關於anchor生成的常量**
介紹如何通過原始的標註框來生成計算Loss所需的label以及mask;
如果在訓練階段批量的提供訓練資料，幷包含shuffle等操作；

1.一些關於anchor生成的常量

在constants.py檔案中定義了一些關於anchor的常量：

# coding=utf-8 


# to pre-define some constant variables

# SSD網路中6個預測分支中feature map的大小
feature_size = [38, 19, 10, 5, 3, 1]

# 300 / feature_size：feature map中畫素在原圖中對應的感受野比例
anchor_steps = [8, 16, 30, 60, 100, 300]

# 6個預測分支分別對應的anchor類別數。注意：SSD原文中是[4 6 6 6 4 4 ]，但是由於KITTI中圖片縮放後導致存在更多的小目標，因此為了提高小目標的檢測率，將第一個分支的anchor的種類由4提高到6. 

anchors_num = [6, 6, 6, 6, 4, 4]

# 則anchor的總數量也由原文中的8732提高到11620
all_anchors_num = 11620

# 6個分支所使用的anchor的長寬比，注意長寬比1:1的anchor有兩種，但大小不一
anchors_ratio = [[1, 1, 2, 0.5, 3, 1./3],
                 [1, 1, 2, 0.5, 3, 1./3],
                 [1, 1, 2, 0.5, 3, 1./3],
                 [1, 1, 2, 0.5, 3, 1./3],
                 [1 
, 1, 2, 0.5],
                 [1, 1, 2, 0.5]]

# 按照論文規則設計的anchor大小：最小0.07，最大的0.87，然後等差分配，則6種anchor的大小佔原圖的百分比依次為[0.07 0.23 ... 0.87]
# 特別的，對於長寬比1:1的anchor，再增加一種稍大的尺寸
# the first: ratio=1, sqrt(S_k*S_(k+1))
# the second: 0.07+(k-1)*(0.87-0.1)/(6-1), k=1...6
"""anchors_scales = [[0.13, 0.07],
                  [0.30, 0.23],
                  [0.46, 0.39],
                  [0.62, 0.55],
                  [0.79, 0.71],
                  [0.95, 0.87]]"""

# 300*anchors_scales
anchors_size = [[39, 21],
                [90, 69],
                [138, 108],
                [186, 165],
                [237, 213],
                [285, 261]]

2.如何生成label以及mask

我生成label的方法比較呆板：
- （1）首先利用genBatch.py中的gen_anchors函式生成所有可能的anchors，維度為11620*4（座標格式為[x y w h]）;
- （2）然後利用genBatch.py中的gen_labels迴圈處理每一個標註的車輛的bounding box：每一個bounding box都去和所有anchors計算IOU，如果和某些anchor的IOU大於一定閾值，就將該anchor的屬性label置為1，並按照下式計算相應的bounding box offset:

這裡寫圖片描述

相應的計算函式如下：

# compute normalized offset between boxG(ground truth) and boxD(default anchor) [x,y,w,h]
def compute_offset(boxG, boxD):
    offset = np.zeros([1, 4])
    # offset_x, offset_dy
    offset[0, :2] = [(boxG[0] - boxD[0]) / boxD[2], (boxG[1] - boxD[1]) / boxD[3]]
    # offset_w, offset_h
    offset[0, 2:] = np.log([boxG[2] / boxD[2], boxG[3] / boxD[3]])
    return offset

mask的製作就顯得比較簡單了，具體定義已經在上一節中介紹過了，相應的程式碼如下:

# generate two masks to weights different parts in the final ssd loss
def gen_masks(cls_label, neg_weight=3.0, reg_weight=1.0):
    pos_mask = cls_label[:, 1]
    neg_mask = 1. - pos_mask
    pos_num = np.sum(pos_mask)
    neg_num = np.sum(neg_mask)

    if pos_num > 0:
        pos_mask = pos_mask / pos_num
    if neg_num > 0:
        neg_mask = neg_mask / neg_num * neg_weight

    return pos_mask + neg_mask, pos_mask * reg_weight

（3）需要注意的是：當有多個標註的boundingbox與同一個anchor的IOU大於一定閾值時，我們只選擇IOU最大的那個標註。

3.如何供給Batch資料

Batch的資料供給主要考慮到在訓練過程中，自動的為訓練提供正確的資料以及對應的label，主要考慮的因素有：batch_Szie，是否shuffle, 是否進行資料擴張以及各種資料擴張的比例等等。

為此，我們定義瞭如下類：

class GenBatch:
    def __init__(self, image_path, label_path,
                 batch_size, new_w, new_h, is_color=True, is_shuffle=True):
        self.image_path, self.label_path = image_path, label_path,
        self.batch_size, self.new_w, self.new_h, self.is_color, self.is_shuffle = \
            batch_size, new_w, new_h, is_color, is_shuffle

        self.readPos = 0

        # read KITTI
        self.image_list = readKITTI.get_filelist(image_path, '.png')
        self.bbox_list = readKITTI.get_bboxlist(label_path, self.image_list)
        if len(self.image_list) > 0 and len(self.image_list) == len(self.bbox_list):
            print("The amount of images is %d" % (len(self.image_list)))

            self.initOK = True
            self.all_anchors = gen_anchors()

            # init the outputs
            self.batch_image = np.zeros([batch_size, new_h, new_w, 3 if self.is_color else 1], dtype=np.float32)
            self.batch_cls_label = np.zeros([batch_size * all_anchors_num, 2], dtype=np.float32)
            self.batch_reg_label = np.zeros([batch_size * all_anchors_num, 4], dtype=np.float32)
            self.batch_cls_mask = np.zeros([batch_size * all_anchors_num], dtype=np.float32)
            self.batch_reg_mask = np.zeros([batch_size * all_anchors_num], dtype=np.float32)
        else:
            print("The amount of images is %d, while the amount of "
                  "corresponding label is %d" % (len(self.image_list), len(self.bbox_list)))
            self.initOK = False

    # generate a new batch
    # mirror_ratio and crop_ratio are used to control the image augmentation,
    # the default zeros means no images augmentation
    # cls_pos_weight and reg_weight are used to generate a mask to compute the final SSD loss
    def nextbatch(self, mirror_ratio=0.0, crop_ratio=0.0):
        if self.initOK is False:
            print("NO successful initiation!.")
            return []
        for i in range(self.batch_size):
            # if a epoch is completed
            if self.readPos >= len(self.image_list)-1:
                self.readPos = 0
                if self.is_shuffle is True:
                    r_seed = random.random()
                    random.seed(r_seed)
                    random.shuffle(self.image_list)
                    random.seed(r_seed)
                    random.shuffle(self.bbox_list)
                    print('Shuffle the data successfully.\n')

            img = cv2.imread(self.image_path + self.image_list[self.readPos])

            bbox = self.bbox_list[self.readPos]

            self.readPos += 1

            # randomly crop under a specified probability
            if crop_ratio > 0 and random.random() < crop_ratio:
                img, bbox = imAugment.imcrop(img, bbox, min(self.new_w, self.new_h))

            # check the input image's size and color
            img, bbox = imAugment.imresize(img, bbox, self.new_w, self.new_h, self.is_color)

            # horizontally flip the input image under a specified probability
            if mirror_ratio > 0 and random.random() < mirror_ratio:
                img, bbox = imAugment.immirror(img, bbox)

            # generate processed labels
            cls_label, reg_label = gen_labels(bbox, self.all_anchors)

            # generate masks
            cls_mask, reg_mask = gen_masks(cls_label)

            self.batch_image[i, :, :, :] = img.astype(np.float32)
            self.batch_cls_label[i*all_anchors_num:(i+1)*all_anchors_num, :] = cls_label
            self.batch_reg_label[i*all_anchors_num:(i+1)*all_anchors_num, :] = reg_label
            self.batch_cls_mask[i*all_anchors_num:(i+1)*all_anchors_num] = cls_mask
            self.batch_reg_mask[i*all_anchors_num:(i+1)*all_anchors_num] = reg_mask

        return self.batch_image, self.batch_cls_label, self.batch_reg_label, self.batch_cls_mask, self.batch_reg_mask

基於TensorFlow的SSD車輛檢測-3

百度雲連結總是掛掉，大家實在有需要發我郵箱吧[email protected] 此係列部落格是用來學習Tensorflow和Python的，由於是新手上車，如有錯誤之處希望大家不吝指出。谷歌雲盤：三. label製備以及batch資料

基於BP演算法的3維馬爾可夫隨機場運動目標檢測

介紹這篇論文主要介紹一種基於BP（Beilef propagation）演算法在3維空間-時間馬爾可夫隨機場的運用來進行運動目標檢測。對於目標檢測，有兩種主要的方法即提取背景和幀差法，提取背景的方法顧名思義就是需要將前景與背景分開來達到檢測運動目標的目的，

基於TensorFlow的SSD車輛檢測-2

此係列部落格是用來學習Tensorflow和Python的，由於是新手上車，如有錯誤之處希望大家不吝指出。二. SSD網路構建在網路模型構建環節，主要包含下面三塊內容：構建網路的基礎部分：VGG_base 構建網路的分支部分：SSD的6個預測分

基於OMAPL：Linux3.3內核的編譯

手冊可能會有 exit 裏的 UC 成功代碼 sta 基於OMAPL：Linux3.3內核的編譯 OMAPL對應3個版本的linux源代碼，分別是：Linux-3.3、Linux-2.6.37、Linux2.6.33，這裏的差距在於Linux2，缺少SYSLINK支持

opencv +Hog + SVM 車輛檢測

最近嘗試了一下用opencv做了一下車輛檢測其中hog特徵使用opencv自帶函式庫進行提取描述如下： HOGDescriptor *hog = new HOGDescriptor(Size(64, 64), Size(16, 16), Size(8, 8), Size(8, 8)

【機器學習】HOG+SVM進行車輛檢測的流程及原始碼

在進行機器學習檢測車道線時，參考了這篇博文，基於LBP+SVM實現了車道線檢測的初步效果。覺得講解很到位，程式碼也容易理解和修改，故在此分享，供更多人學習。原地址：https://www.cnblogs.com/louyihang-loves-baiyan/p/4658478.html HOG

車輛檢測和車道檢測

車輛檢測和車道檢測 NKU計算機視覺期末大作業目錄車輛檢測和車道檢測軟體要求車輛檢測根據hog特

Mybatis generator生成Service，Controller,新增批量新增資料介面(基於mybatis-generator-1.3.5原始碼修改)

　　　　好久記錄筆記，這段時間做政府的專案，資料錄入系統基本都是通過excel匯入，且資料量大，許多也是單表的錄入，這就有很多可以通用的程式碼，如controller，service層的那一套都是可以程式碼生成，添加了一個數據庫批量新增介面(目前只支援oracle)，程式碼是基於mybatis-gener

Mybatis generator生成Service，Controller,添加批量新增數據接口(基於mybatis-generator-1.3.5源碼修改)

src value new lse 項目上線 uuid err opera auth 　　好久記錄筆記，這段時間做政府的項目,數據錄入系統基本都是通過excel導入,且數據量大,許多也是單表的錄入,這就有很多可以通用的代碼,如controller,service層的那一套都

基於視覺化檢測的文件質量提升

cvpr2018論文。論文主要基於3d點雲的方法，對文件圖片進行去陰影操作，進而提升檢測和識別。整體流程：首先作者將一副影象想象成具有哦3d資訊的點雲。畫素值的大小表示3d表面的凹凸。白色背景表示為高原，汙跡，陰影表示為火山地帶，黑色的字表示為峽谷。由b中

基於opencv的檢測到人臉，便將人臉用骷髏頭代替。

工具： /*Result window title*/ #define WND_RESULT "result" static CvMemStorage* storage = 0; static CvHaarClassifierCascade* cascade ; con

AI世界-車輛檢測

AI世界花哨技術展示 AI世界-跨鏡追蹤 AI世界-行人屬性分析 AI世界- 客流統計 AI世界-人體關鍵點 AI世界-熱力圖 AI世界-換臉 AI世界-人臉PS AI世界-三維人臉重建 AI世界-人臉密集關鍵點 AI世界-車輛檢測

使用caffe訓練的深度學習做目標檢測(車輛檢測)

#include "opencv2/core/core.hpp" #include "opencv2/imgproc/imgproc.hpp" #include "opencv2/highgui/highgui.hpp" #include "opencv2/dnn/dnn.

Jexus Web Server 完全傻瓜化圖文配置教程（基於Ubuntu 12.04.3 64位）[內含Hyper-v 2012虛擬機器映象下載地址]

1. 前言近日有感許多新朋友想嘗試使用Jexus，不過絕大多數都困惑徘徊在Linux如何安裝啊，如何編譯Mono啊，如何配置Jexus啊。。。等等基礎問題，於是昨日向宇內流雲兄提議，不如搞幾個配置好的虛擬機器映象讓新朋友先嚐嘗Jexus，感受Jexus的效能再慢慢學配置，何不更好？今日小弟決定坐言起行

第10章－基於樹的方法(3)-樹的改進-整合方法

參考： https://homes.cs.washington.edu/~tqchen/pdf/BoostedTree.pdf rob.schapire.net/papers/explaining-adaboost.pdf *https://statweb.stanford.edu/~

UTM篇(6.0) 01. 基於代理與基於流的檢測模式 ❀ 飛塔 (Fortinet) 防火牆

　　【簡介】FortiGate防火牆可以在基於代理與基於流中選擇兩種檢查模式之一，以控制你的FortiGate或VDOM的安全配置檔案檢查模式。基於代理的模式提供了更多的功能，基於流的設計是為了優化效能。　基於代理檢測　　如果一個FortiGate或VDOM配置了基於

無人駕駛之車輛檢測與跟蹤

整個專案原始碼：GitHub 整個專案資料集：車輛資料集、無車輛資料集引言本次分享主要介紹，如何對道路上的汽車進行識別與跟蹤。這裡我們實現一個簡單的demo。後續我們還會對前面的程式碼及功能進行重構，從而進一步豐富我們的功能。專案軟體框

Hadoop入門-3.HDFS的簡單API（demo）（基於hadoop-2.7.3）

條件準備下載部署下載Hadoop-2.7.3.tar.gz包，可以去官網下載。也可以下載原始碼編譯：點選開啟連結然後部署在Linux上，可以參考點選開啟連結 win下eclipse開發配置通常習慣，

基於S3C2440的Linux-3.6.6移植——音效卡驅動

Linux的ALSA音效卡驅動較為複雜，它需要註冊多個平臺裝置。在mach-zhaocj2440.c檔案中的平臺裝置陣列內一共有四個與ALSA相關的平臺裝置： &s3c_device_iis, &uda1340_codec, &mini2440_au

海思移植opencv+車輛檢測

1.確保ubuntu能上網2.安裝cmake程式碼: 全選sudo apt-get install cmake-gui3.下載opencv2.4.9 Linux版原始碼，不要用最新的3.0.0http://opencv.org/downloads.html4.解壓open

基於TensorFlow的SSD車輛檢測-3

三. label製備以及batch資料供給

1.一些關於anchor生成的常量

2.如何生成label以及mask

3.如何供給Batch資料

相關推薦