Small U-Net for vehicle detection

阿新 • • 發佈：2018-12-29

Model:

The model we chose is is a scaled down version of a deep learning architecture called U-net. U-net is a encoder-decoder type network architecture for image segmentation. The name of the architecture comes from its unique shape, where the feature maps from convolution part in downsampling step are fed to the up-convolution part in up-sampling step. U-net has been used extensively for biomedical applications to detect cancer, kidney pathologies and tracking cells etc. U-net has proven to be very powerful segmentation tool in scenarios with limited data (less than 50 training samples in some cases). Another advantage of using a U-net is that it does not have any fully connected layers, therefore has no restriction on the size of the input image. This feature allows us to extract features from images of different sizes, which is an attractive attribute for applying deep learning to high fidelity biomedical imaging data. The ability of U-net to work with very little data and no specific requirement on input image size make it a strong candidate for image segmentation tasks.

Another reason to choose the U-net architecture is the letter U. As the data set was provided by Udacity and as am currently enrolled in Udacity’s self-driving car, choice of U-net was a fitting tribute to Udacity.

The input to U-net is a resized 960X640 3-channel RGB image and output is 960X640 1-channel mask of predictions. We wanted the predictions to reflect probability of a pixel being a vehicle or not, so we used an activation function of sigmoid on the last layer.

Training:

As with any segmentation deep learning neural network, training took long time. We were unable to fit data set with batch size more than 1 on a titan X gpu with the full U-net, we therefore decided to choose a batch size of 1 for all architectures. This 1 image was randomly samples and augmented from all training images. As we chose a batch size of 1, we chose adam optimizer with a learning rate of 0.0001. Setting up the training itself was straight forward, but training the segmentation model made my Titan X gpu cringe. To perform 10000 iterations, my titan X machine took about 20 minutes.

Objective:

We defined a custom objective function in keras to compute approximate Intersection over Union (IoU) between the network output and target mask. IoU is a popular metric of choice for tasks involving bounding boxes. The objective was to maximize IoU, as IoU always varies between 0 and 1, we simply chose to minimize the negative of IoU.

Intersection over Union (IoU) metric for bounding boxes

Instead of implementing a direct computation for intersection over union or cross entropy, we used a much simpler metric for area where we multiply two times the network’s output with the target mask, and divide it by the sum of all values in the predicted output and the true mask. This trick helped us avoid computationally involved area calculations, which resulted in lower training times.

Results:

We stopped the training after 2 hours, and decided to use the network to make predictions. In test time, no augmentation was applied for prediction. The algorithm was surprisingly fast. It took 200ms to make 10 predictions (average of 20ms per image), this included reading file off of disk, and drawing the blobs.

Figures below present performance of the model for vehicle detection. It was surprising that the neural network was able to identify cars correctly in the driving frames it did not see before. Figures below present result of segmentation algorithm applied for vehicle predictions. The panels are organized as original image, predicted mask and ground truth boxes.

Small U-Net for vehicle detection

Model:The model we chose is is a scaled down version of a deep learning architecture called U-net. U-net is a encoder-decoder type network architecture for

「Medical Image Analysis」Note on 3D U-JAPA-Net for Abdominal Multi-organ CT Segmentation

[1] 3D U-JAPA-Net Mixture of Convolutional Networks for Abdominal Multi-organ CT Segmentation MICCAI

Receptive Field Block Net for Accurate and Fast Object Detection

高效 splay 兩個 spp 位置 ont 由於通用性能 Receptive Field Block Net for Accurate and Fast Object Detection 作者：Songtao Liu, Di Huang*, and Yunhong W

《U-Net: Convolutional Networks for Biomedical Image Segmentation》學習筆記

1. 總述在15年的文章：《U-Net: Convolutional Networks for Biomedical Image Segmentation》中提出了一種基於少量資料進行訓練的網路的模型，得到了不錯的分割精度，並且網路的速度很快。對於分割一副5

醫學影象分割--U-Net: Convolutional Networks for Biomedical Image Segmentation

這裡我們將 FCN 修改為 U-Net，主要是上取樣階段，我們同樣也有許多特徵通道，這樣網路可以傳遞更多的 context 資訊到 higher resolution 網路層 in the upsampling part we have also a

[論文閱讀筆記]U-Net: Convolutional Networks for Biomedical Image Segmentation

摘要大意是說，普遍認為深度網路需要大量已標籤資料集，這個網路(U-Net)可以依靠資料增強來事先少量資料集訓練網路。而且，這個網路訓練得很快，運用GPU執行，512*512的圖片只需要不

LabelRank（A Stabilized Label Propagation Algorithm for Community Detection in Networks）非重疊社區發現

date nal zed con ati rop target lan detect 最近在研究基於標簽傳播的社區分類，LabelRank算法基於標簽傳播和馬爾科夫隨機遊走思路上改裝的算法，引用率較高，打算將代碼實現，便於加深理解。一、概念相關概念不再累述，詳情見前兩篇

Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding

line evel linux 程序 providing form ram -s visio Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Sce

Two-phase clustering process for outliers detection 文章翻譯

存儲器圖像必須傳統生成樹 dia var oda 不同的基於二階段聚集模式的異常探測 M.F .Jiang, S.S. Tseng *, C.M. Su 國立交通大學計算機與信息科學系，中國臺北市新竹路100150號 1999年11月17日; 2000年4月

論文翻譯 DOTA:A Large-scale Dataset for Object Detection in Aerial Images

網絡操作邊框允許官方靈活數量級 image 轉化簡介：武大遙感國重實驗室-夏桂松和華科電信學院-白翔等合作做的一個航拍圖像數據集摘要：目標檢測是計算機視覺領域一個重要且有挑戰性的問題。雖然過去的十幾年中目標檢測在自然場景已經有了較重要的成就

語義分割(semantic segmentation) 常用神經網絡介紹對比-FCN SegNet U-net DeconvNet，語義分割,簡單來說就是給定一張圖片,對圖片中的每一個像素點進行分類；目標檢測只有兩類,目標和非目標，就是在一張圖片中找到並用box標註出所有的目標.

avi projects div 般的 ict 中間接受 img dense from：https://blog.csdn.net/u012931582/article/details/70314859 2017年04月21日 14:54:10 閱讀數：4369

Small U-Net for vehicle detection

Small U-Net for vehicle detection

「Medical Image Analysis」Note on 3D U-JAPA-Net for Abdominal Multi-organ CT Segmentation

Receptive Field Block Net for Accurate and Fast Object Detection

《U-Net: Convolutional Networks for Biomedical Image Segmentation》學習筆記

醫學影象分割--U-Net: Convolutional Networks for Biomedical Image Segmentation

[論文閱讀筆記]U-Net: Convolutional Networks for Biomedical Image Segmentation

LabelRank（A Stabilized Label Propagation Algorithm for Community Detection in Networks）非重疊社區發現

Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding

Two-phase clustering process for outliers detection 文章翻譯

論文翻譯 DOTA:A Large-scale Dataset for Object Detection in Aerial Images

深度學習圖像分割——U-net網絡

U-NET語義分割方法解讀

《Randomized Low-Rank Dynamic Mode Decomposition for Motion Detection》讀書筆記（下）

《Randomized Low-Rank Dynamic Mode Decomposition for Motion Detection》讀書筆記（中）

《Randomized Low-Rank Dynamic Mode Decomposition for Motion Detection》讀書筆記（上）

Parallel Feature Pyramid Network for Object Detection

從零開始的無人駕駛 02：Vehicle Detection

【Network Architecture】Feature Pyramid Networks for Object Detection(FPN)論文解析（轉）

Feature Pyramid Networks for Object Detection 總結

Small U-Net for vehicle detection

相關推薦