Object detection at 200 Frames Per Second

阿新 • • 發佈：2020-12-27

論文連結：

Object detection at 200 Frames Per Secondarxiv.org

本文記錄該篇文章中蒸餾策略：

針對yolo模型，COCO資料集作為目標檢測任務的訓練目標難度大，意味著teacher network會預測出更多的背景bbox，如果直接用teacher network的預測輸出作為student network學習的soft label會有嚴重的類別不均衡問題。解決這個問題需要引入新的方法，以論文中的reg loss為例：

paddlepaddle中的蒸餾程式碼為例：

https://github.com/PaddlePaddle/PaddleDetection/blob/release/0.2/slim/distillation/distill.py

def obj_weighted_reg(sx, sy, sw, sh, tx, ty, tw, th, tobj):
     loss_x = fluid.layers.sigmoid_cross_entropy_with_logits(a sx, fluid.layers.sigmoid(tx))
     loss_y = fluid.layers.sigmoid_cross_entropy_with_logits(a sy, fluid.layers.sigmoid(ty))
     loss_w = fluid.layers.abs(sw - tw) 
     loss_h = fluid.layers.abs(sh - th)
     loss = fluid.layers.sum([loss_x, loss_y, loss_w, loss_h])
     weighted_loss = fluid.layers.reduce_mean(loss * fluid.layers.sigmoid(tobj))
     return weighted_loss

從上述程式碼可以看出是在teacher netowork和student network的loss前乘以objectness的得分作為權重來抑制背景框

網路大量的網格和anchor都會預測同一個物體，因此在利用知識蒸餾訓練時，當teacher network將資訊遷移到student network時，高度重合的檢測區域對應的feature map會使得反向傳播時，對應於同一個目標類別，梯度會變得很大，從而導致網路過擬合。為了解決這個問題，論文作者提出 Feature Map-NMS (FM-NMS)。具體思想是如果在3 * 3鄰域的cell中，多個候選框都對應同一個類別，那麼這很可能是同一個物體。因此只選擇objectness值最高的那個候選框。另外會將對應同一類別的其他候選框在最後一層的feature map中的 class probabilities置為0。因此只有objectness值最大的那個才會對student network產生影響。我認為在具體實現的時候應當採用siliding window的方式去掃描來計算。

Object detection at 200 Frames Per Second

Object detection at 200 Frames Per Second

object detection api調參詳解（兼SSD演演算法引數詳解）

深度學習論文翻譯解析（八）：Rich feature hierarchies for accurate object detection and semantic segmentation

yolo-v4：Optimal Speed and Accuracy of Object Detection解析

Anchor Boxes for Object Detection

Object-Detection-Loss

RGB-D Salient Object Detection:綜述論文筆記

目標檢測論文筆記一：RefineDet《Single-Shot Refinement Neural Network for Object Detection》

PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments論文閱讀翻譯 - 2020ECCV

基於TensorFlow Object Detection API 實現利用雙層模型進行（人體識別+其他）安全帽與口罩的檢測與判定

文獻翻譯——YOLOv4:Optimal Speed and Accuracy of Object Detection

YOLO1學習筆記:You Only Look Once:Unified,Real-Time Object Detection

Feature Selective Anchor-Free Module for Single-Shot Object Detection

Tensorflow Object Detection API 從無到有

使用TensorFlow Object Detection Api 進行環境搭建、訓練自定義的資料集、輸出模型、Android端使用模型目標檢測

論文筆記（七）【yolo v3】You Only Look Once: Unified, Real-Time Object Detection

使用Tensorflow object detection API訓練自己的資料教程

[論文翻譯] RGBD Salient Object Detection via Deep Fusion

加權框融合 WBF（Weighted Boxes Fusion: combining boxes for object detection models）

[R-CNN]Rich feature hierarchies for accurate object detection and semantic segmentation

Object detection at 200 Frames Per Second

相關推薦