【Object Detection】The SSD Algorithm: A Detailed Look at the Loss Function (TensorFlow Implementation)
SSD's loss function consists of a log loss for classification and a smooth L1 loss for regression, and it controls the ratio of positive to negative samples, which speeds up optimization and stabilizes training.
The total loss is a weighted sum of the classification error and the regression error, where α weights the localization term and N is the number of default boxes matched to a ground-truth box.
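Written out (this is the objective from the original SSD paper, added here for reference; it is not spelled out in the post itself):

$$
L(x, c, l, g) = \frac{1}{N}\Bigl(L_{conf}(x, c) + \alpha\, L_{loc}(x, l, g)\Bigr)
$$

When N = 0 the loss is simply defined to be 0; the tf.maximum(1.0, n_positive) guard in the code below serves the same purpose.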
1 The localization (loc) loss: smooth L1
y_true shape: (batch_size, n_boxes, 4), where the last dimension holds (xmin, xmax, ymin, ymax).
y_pred must have the same shape as y_true.
But an image only has a handful to a few dozen ground-truth boxes, so how can y_true be brought to the same shape as y_pred? As the comments in compute_loss below note, y_true has to be encoded in advance: every default box that matches a ground-truth box receives that box's coordinates and class, and every unmatched box is labeled as background, so y_true ends up with one entry per default box.
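For reference, the smooth L1 penalty applied to each coordinate difference x is the usual piecewise definition (added here; it matches the code below):

$$
\mathrm{smooth}_{L1}(x) =
\begin{cases}
0.5\,x^{2} & \text{if } |x| < 1 \\
|x| - 0.5 & \text{otherwise}
\end{cases}
$$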
```python
def smooth_L1_loss(self, y_true, y_pred):
    absolute_loss = tf.abs(y_true - y_pred)
    square_loss = 0.5 * (y_true - y_pred)**2
    l1_loss = tf.where(tf.less(absolute_loss, 1.0), square_loss, absolute_loss - 0.5)
    return tf.reduce_sum(l1_loss, axis=-1)
```
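As a quick sanity check, here is a plain NumPy sketch (my own addition, not part of the original code) that evaluates the same piecewise rule on a few hand-picked errors:

```python
import numpy as np

def smooth_l1(x):
    # 0.5 * x^2 for |x| < 1, |x| - 0.5 otherwise -- the same rule tf.where implements above
    x = np.abs(x)
    return np.where(x < 1.0, 0.5 * x**2, x - 0.5)

print(smooth_l1(np.array([-2.0, -0.5, 0.0, 0.5, 2.0])))
# -> [1.5   0.125 0.    0.125 1.5  ]
```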
2 The confidence (conf) loss: log loss
y_true shape: (batch_size, n_boxes, n_classes), with one-hot class labels; y_pred has the same shape and holds the predicted class probabilities.
```python
def log_loss(self, y_true, y_pred):
    # Make sure y_pred contains no zeros, otherwise the log would blow up
    y_pred = tf.maximum(y_pred, 1e-15)
    # Compute the log loss
    log_loss = -tf.reduce_sum(y_true * tf.log(y_pred), axis=-1)
    return log_loss
```
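A small numeric illustration (plain NumPy, my own addition): because y_true is one-hot, the sum collapses to minus the log of the probability the network assigns to the true class.

```python
import numpy as np

y_true = np.array([0.0, 1.0, 0.0])   # one-hot: the true class is index 1
y_pred = np.array([0.2, 0.7, 0.1])   # predicted (softmax) class probabilities

loss = -np.sum(y_true * np.log(np.maximum(y_pred, 1e-15)))
print(loss)  # -> 0.3567  (= -log(0.7))
```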
3 Hard negative mining
The main idea:
1. From the number of positive samples and the negative-to-positive ratio, determine how many negatives to keep (n_negative_keep).
2. Find the n_negative_keep negatives with the highest confidence loss (the hard negatives) and sum their classification loss; a toy sketch of this selection follows this list.
3. Sum the classification loss of the positive samples; the total classification loss is the sum of the positive losses and the kept negative losses.
4. Compute the localization loss for the positive samples only. It cannot be computed for negatives, because a negative box has no ground-truth coordinates to regress against.
5. Take the weighted sum of the classification loss and the localization loss, and normalize it by the number of positives.
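Before the full TensorFlow implementation, here is a toy NumPy stand-in (my own illustration, not from the original post) for step 2: it mimics what the tf.nn.top_k / tf.scatter_nd combination in the code below does, namely building a 0/1 mask over the k negatives with the largest confidence loss.

```python
import numpy as np

# Per-box confidence losses of the negative boxes, flattened to 1D
neg_class_loss_all_1D = np.array([0.1, 2.3, 0.0, 0.7, 1.5, 0.05])
n_negative_keep = 2

# Indices of the k hardest negatives (largest loss first)
indices = np.argsort(neg_class_loss_all_1D)[::-1][:n_negative_keep]

# 0/1 mask over the kept negatives
negatives_keep = np.zeros_like(neg_class_loss_all_1D)
negatives_keep[indices] = 1.0

print(negatives_keep)  # -> [0. 1. 0. 0. 1. 0.]  (the boxes with loss 2.3 and 1.5 are kept)
```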
```python
def compute_loss(self, y_true, y_pred):
    self.neg_pos_ratio = tf.constant(self.neg_pos_ratio)
    self.n_neg_min = tf.constant(self.n_neg_min)
    self.alpha = tf.constant(self.alpha)

    batch_size = tf.shape(y_pred)[0]  # Output dtype: tf.int32
    n_boxes = tf.shape(y_pred)[1]     # Output dtype: tf.int32; `n_boxes` here is the total number of boxes per image, not the number of boxes per cell

    # 1: Compute the classification and localization loss for every box
    classification_loss = tf.to_float(self.log_loss(y_true[:,:,:-12], y_pred[:,:,:-12]))         # Output shape: (batch_size, n_boxes)
    localization_loss = tf.to_float(self.smooth_L1_loss(y_true[:,:,-12:-8], y_pred[:,:,-12:-8])) # Output shape: (batch_size, n_boxes)

    # 2: Build masks for the positive and negative ground-truth boxes.
    # y_true must have been encoded beforehand: for each box, only the matched class is 1 and all others are 0;
    # boxes not matched to any ground truth have the background class (index 0) set to 1 and everything else set to 0.
    negatives = y_true[:,:,0]                                           # Tensor of shape (batch_size, n_boxes)
    positives = tf.to_float(tf.reduce_max(y_true[:,:,1:-12], axis=-1))  # Tensor of shape (batch_size, n_boxes)

    # Count the positive boxes
    n_positive = tf.reduce_sum(positives)

    # Mask out the negatives and sum the classification loss over the positive boxes
    pos_class_loss = tf.reduce_sum(classification_loss * positives, axis=-1)  # Tensor of shape (batch_size,)

    # Classification loss of every negative box
    neg_class_loss_all = classification_loss * negatives  # Tensor of shape (batch_size, n_boxes)

    # Count the negative boxes with non-zero loss
    n_neg_losses = tf.count_nonzero(neg_class_loss_all, dtype=tf.int32)  # The number of non-zero loss entries in `neg_class_loss_all`

    # Compute the number of negative examples we want to account for in the loss:
    # at most `self.neg_pos_ratio` times the number of positives in y_true, but at least `self.n_neg_min` per batch.
    n_negative_keep = tf.minimum(tf.maximum(self.neg_pos_ratio * tf.to_int32(n_positive), self.n_neg_min), n_neg_losses)

    def f1():
        '''If there are no negatives with non-zero loss, simply return zero.'''
        return tf.zeros([batch_size])

    def f2():
        '''
        Keep the k (= n_negative_keep) negatives with the highest confidence loss.
        The larger the loss, the harder the example to train on: these are the hard negatives.
        '''
        # To do this, we reshape `neg_class_loss_all` to 1D...
        neg_class_loss_all_1D = tf.reshape(neg_class_loss_all, [-1])  # Tensor of shape (batch_size * n_boxes,)
        # ...and then we get the indices for the `n_negative_keep` boxes with the highest loss out of those...
        values, indices = tf.nn.top_k(neg_class_loss_all_1D,
                                      k=n_negative_keep,
                                      sorted=False)  # We don't need them sorted.

        # ...and build a mask over the kept negatives.
        negatives_keep = tf.scatter_nd(indices=tf.expand_dims(indices, axis=1),
                                       updates=tf.ones_like(indices, dtype=tf.int32),
                                       shape=tf.shape(neg_class_loss_all_1D))              # Tensor of shape (batch_size * n_boxes,)
        negatives_keep = tf.to_float(tf.reshape(negatives_keep, [batch_size, n_boxes]))    # Tensor of shape (batch_size, n_boxes)

        # Sum the classification loss over the kept negatives
        neg_class_loss = tf.reduce_sum(classification_loss * negatives_keep, axis=-1)      # Tensor of shape (batch_size,)
        return neg_class_loss

    neg_class_loss = tf.cond(tf.equal(n_neg_losses, tf.constant(0)), f1, f2)

    class_loss = pos_class_loss + neg_class_loss  # Tensor of shape (batch_size,)

    # 3: Sum the localization loss over the positive boxes only.
    # We cannot compute a coordinate loss for boxes predicted as negatives. Why?
    # Because a negative box simply has no ground-truth coordinates; for positives there is a matched ground truth to regress against.
    loc_loss = tf.reduce_sum(localization_loss * positives, axis=-1)  # Tensor of shape (batch_size,)

    # 4: Total loss, normalized by the number of positives
    total_loss = (class_loss + self.alpha * loc_loss) / tf.maximum(1.0, n_positive)  # In case `n_positive == 0`
    total_loss = total_loss * tf.to_float(batch_size)

    return total_loss
```
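The self. references above imply that these three methods live on a small wrapper class that stores neg_pos_ratio, n_neg_min and alpha. The post does not show that class, so the following is only a sketch of how it might look and how the loss is handed to Keras; the class name SSDLoss and the constructor defaults are my own assumptions (TF 1.x-era API, to match the tf.to_float / tf.log calls used above).

```python
class SSDLoss:
    '''Hypothetical wrapper class -- name and defaults are assumptions, not from the original post.'''

    def __init__(self, neg_pos_ratio=3, n_neg_min=0, alpha=1.0):
        self.neg_pos_ratio = neg_pos_ratio  # keep at most this many negatives per positive
        self.n_neg_min = n_neg_min          # keep at least this many negatives per batch
        self.alpha = alpha                  # weight of the localization loss

    # smooth_L1_loss, log_loss and compute_loss as defined above go here.

# compute_loss has the (y_true, y_pred) signature Keras expects, so it can be
# passed to model.compile directly, e.g.:
#     ssd_loss = SSDLoss(neg_pos_ratio=3, alpha=1.0)
#     model.compile(optimizer='adam', loss=ssd_loss.compute_loss)
```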
The complete code is here.
I have only just started working through this topic, so if I have misunderstood anything, corrections are welcome.