[USE] An End-to-End System for Automatic Urinary Particle Recognition with CNN
Urine Sediment Examination (USE)
JMOS-2018
Contents
- Contents
- 1 Background and Motivation
- 2 Innovation
- 3 Advantages
- 4 Methods(Meta-architectures)
- 5 Experiments
- 6 Conclusion
1 Background and Motivation
The urine sediment analysis of particles in microscopic images can assist physicians in evaluating patients with renal
and urinary tract diseases. Manual urine sediment examination, however, is time-consuming and subjective, which motivates automatic recognition.
OverFeat made one of the earliest efforts to apply deep CNNs to learn highly discriminative yet invariant features for object detection. Likewise, the authors adopt deep-learning-based CNN feature extraction in place of hand-crafted features.
Traditional multi-stage methods depend heavily on the accuracy of the segmentation and the effectiveness of the hand-crafted features.
CNN-based methods, in contrast, can be trained end-to-end, are segmentation-free, and extract more discriminative features.
2 Innovation
- Exploit Faster R-CNN and SSD for urine particle recognition
- Investigate various factors to improve the performance of Faster R-CNN and its variants
- A trimmed SSD that achieves better performance
3 Advantages
- Best mAP of 84.1% (accurate), with a best AP of 77.2% for cast particles
- Only 72 ms per image for 7 categories (fast)
4 Methods(Meta-architectures)
- MS-FRCNN (multi-scale Faster R-CNN)
- OHEM-FRCNN(Faster R-CNN with online hard example mining)
- Trimmed SSD
Details of each meta-architecture
Faster R-CNN: shareable CNN feature extraction + region proposal generation + region classification and regression; it uses a pyramid of anchors.
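The anchor pyramid places one box per (ratio, scale) pair at every sliding position. A minimal NumPy sketch of this idea, assuming the common 3 ratios × 3 scales configuration and base size 16 from the public Faster R-CNN reference code (the paper itself adds more scales for small objects); `make_anchors` is an illustrative helper, not from the paper:

```python
import numpy as np

def make_anchors(base_size=16, ratios=(0.5, 1.0, 2.0), scales=(8, 16, 32)):
    """Generate the anchor pyramid used at each sliding-window position:
    one anchor per (ratio, scale) pair, centered on the base box."""
    anchors = []
    cx = cy = (base_size - 1) / 2.0
    area = base_size * base_size
    for r in ratios:
        # width/height for this aspect ratio, preserving the base area
        w = np.sqrt(area / r)
        h = w * r
        for s in scales:
            ws, hs = w * s, h * s
            anchors.append([cx - (ws - 1) / 2, cy - (hs - 1) / 2,
                            cx + (ws - 1) / 2, cy + (hs - 1) / 2])
    return np.array(anchors)

anchors = make_anchors()
print(anchors.shape)  # (9, 4): 3 ratios x 3 scales
```

Adding more entries to `scales` (as the paper does for small urinary particles) simply grows the pyramid without changing the rest of the network.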
MS-Faster R-CNN: builds a more sophisticated network for the Fast R-CNN detector by combining global context and local appearance features.
OHEM-Faster R-CNN: instead of a randomly sampled mini-batch, it eliminates several heuristics and hyperparameters in common use and automatically selects hard examples by their loss.
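The OHEM selection step amounts to ranking all candidate RoIs by their current loss and backpropagating only the hardest ones. A minimal sketch under that reading; `ohem_select`, the batch size of 128, and the random losses are all illustrative assumptions, not values from the paper:

```python
import numpy as np

def ohem_select(losses, batch_size=128):
    """Online hard example mining: rank candidate RoIs by their loss from
    the forward pass and keep only the hardest `batch_size` for backprop,
    replacing random sampling and its heuristics (e.g. a fixed
    foreground/background ratio)."""
    order = np.argsort(losses)[::-1]    # highest loss first
    return np.sort(order[:batch_size])  # indices of the hard examples

rng = np.random.default_rng(0)
losses = rng.random(2000)               # per-RoI losses from one forward pass
hard = ohem_select(losses, batch_size=128)
print(len(hard))  # 128
```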
SSD: unlike YOLO, it improves detection quality by applying a set of small convolutional filters to multiple feature maps to predict confidences and box offsets for default boxes of various sizes.
Trimmed SSD: the authors' dataset has relatively few categories, so applying SSD directly produces a large number of redundant predictions that interfere with the final detection performance. For simplification, they remove several top convolutional layers from the auxiliary network of SSD, which yields the trimmed SSD.
Specifically, the conv7, conv8, and conv9 layers are removed.
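The effect of trimming can be illustrated by counting default boxes per feature map, since each removed layer drops its grid of predictions. The grid sizes and boxes-per-location below are hypothetical SSD300-style values for illustration, not numbers reported in the paper:

```python
# Hypothetical SSD300-style feature maps: layer -> (grid size, boxes per cell).
feature_maps = {
    "conv4_3": (38, 4),
    "fc7":     (19, 6),
    "conv6":   (10, 6),
    "conv7":   (5, 6),
    "conv8":   (3, 4),
    "conv9":   (1, 4),
}

def num_predictions(layers):
    """Total default boxes produced by the given prediction layers."""
    return sum(g * g * b for g, b in (feature_maps[l] for l in layers))

full = num_predictions(feature_maps)                     # all six layers
trimmed = num_predictions(["conv4_3", "fc7", "conv6"])   # conv7/8/9 removed
print(full, trimmed)  # 8732 8542
```

The coarsest maps contribute few boxes, but those boxes cover the largest objects; removing them suits a dataset of small particles with few categories.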
5 Experiments
5.1 Datasets
Dataset consisting of 5,376 annotated images corresponding to 7 categories:
- erythrocyte (red blood cell): 21,815 instances
- leukocyte (white blood cell): 6,169 instances
- epithelial cell: 6,175 instances
- crystal: 1,644 instances
- cast: 3,663 instances
- mycete (mold): 2,083 instances
- epithelial nuclei: 687 instances
Dataset distribution
5.2 Training
5.2.1 Feature extractors
(ZF, VGG, ResNet-50, ResNet-101, PVANet)
5.2.2 Training strategies
- 4-step alternating training, as in the original Faster R-CNN
- approximate joint training (end-to-end training)
End-to-end training works better.
5.3 Comparison of different scales and backbones
The results for different backbones and different anchor scales (the ratios are always 1:1, 1:2, 2:1; because the objects in this dataset are relatively small, more scale settings are added) are shown below; PVANet performs best.
5.4 Data augmentation
A horizontal flip is used to augment the training set.
The figure below compares horizontal and vertical flips: each alone brings an improvement, but combining them does not. Horizontal flip is the usual choice, so why does vertical flip also help? Presumably because the dataset consists of cells, whose shapes change little under a vertical flip.
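Flip augmentation for detection must transform the ground-truth boxes together with the image. A minimal NumPy sketch of the horizontal case; `hflip` is an illustrative helper, not the paper's code:

```python
import numpy as np

def hflip(image, boxes):
    """Horizontally flip an image and its bounding boxes.
    boxes: (N, 4) array of [x1, y1, x2, y2] in pixel coordinates."""
    h, w = image.shape[:2]
    flipped = image[:, ::-1].copy()
    fb = np.asarray(boxes, dtype=float).copy()
    fb[:, [0, 2]] = w - 1 - np.asarray(boxes)[:, [2, 0]]  # mirror x, swap x1/x2
    return flipped, fb

img = np.arange(25, dtype=float).reshape(5, 5)
boxes = np.array([[0, 1, 2, 3]])
fimg, fboxes = hflip(img, boxes)
print(fboxes)  # [[2. 1. 4. 3.]]
```

A vertical flip is the same operation on the y coordinates, which is why it is equally cheap to try.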
5.5 Faster RCNN vs MS-Faster RCNN
From the table, MS-Faster R-CNN actually performs worse than Faster R-CNN, but as the diversity of anchor scales increases, the gap between them narrows, and MS-Faster R-CNN performs better on small objects.
5.6 Faster RCNN vs Faster RCNN+OHEM
Adding OHEM improves results, and the larger the dataset, the greater the benefit.
5.7 SSD vs Trimmed SSD
To fit small objects, smaller is better: the trimmed SSD outperforms the full SSD.
5.8 Adding bells & whistles
5.8.1 anchor scales
- the more, the better
- the smaller, the better
The figure below shows (a) proposal recall for different anchor scales, with VGG-16 as the example; (b) proposal recall for different backbones; and (c) mAP for different backbones.
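Proposal recall, as plotted in these figures, is the fraction of ground-truth boxes covered by at least one proposal above an IoU threshold. A small sketch with hypothetical boxes (the helpers and the 0.5 threshold are illustrative, though 0.5 is the conventional choice):

```python
import numpy as np

def iou(box, boxes):
    """IoU between one box and an array of boxes, all [x1, y1, x2, y2]."""
    x1 = np.maximum(box[0], boxes[:, 0]); y1 = np.maximum(box[1], boxes[:, 1])
    x2 = np.minimum(box[2], boxes[:, 2]); y2 = np.minimum(box[3], boxes[:, 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    a = (box[2] - box[0]) * (box[3] - box[1])
    b = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    return inter / (a + b - inter)

def proposal_recall(gt_boxes, proposals, thr=0.5):
    """Fraction of ground-truth boxes matched by >= 1 proposal at IoU >= thr."""
    hits = sum(iou(g, proposals).max() >= thr for g in gt_boxes)
    return hits / len(gt_boxes)

gt = np.array([[0, 0, 10, 10], [20, 20, 30, 30]], dtype=float)
props = np.array([[1, 1, 11, 11], [50, 50, 60, 60]], dtype=float)
print(proposal_recall(gt, props))  # 0.5
```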
5.8.2 Feature extractors
Figure 6(b): results with different backbones.
5.8.3 PVANet vs. VGG-16
As Figure 6(c) shows, PVANet's proposal quality is worse (its curve drops faster), but Table 2 shows that its final result is the best. The figure below plots precision against recall at detection time: as recall increases, PVANet's precision falls more slowly than VGG-16's and stays higher.
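Precision-recall curves like these are computed by sorting detections by confidence and accumulating true/false positives. A minimal sketch on toy data (the function name and the toy numbers are illustrative):

```python
import numpy as np

def precision_recall(scores, is_tp, num_gt):
    """Precision/recall curve for one class: sort detections by score,
    then accumulate true positives (tp) and false positives (fp)."""
    flags = np.asarray(is_tp, dtype=float)[np.argsort(scores)[::-1]]
    tp = np.cumsum(flags)
    fp = np.cumsum(1.0 - flags)
    recall = tp / num_gt
    precision = tp / (tp + fp)
    return precision, recall

# toy example: 5 detections, 4 ground-truth objects
p, r = precision_recall(scores=[0.9, 0.8, 0.7, 0.6, 0.5],
                        is_tp=[1, 1, 0, 1, 0], num_gt=4)
print(r[-1], p[-1])  # 0.75 0.6
```

AP is the area under this curve, and mAP averages AP over the 7 particle categories.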
6 Conclusion
Building on Faster R-CNN and SSD and adapting them to their own dataset, the authors use different backbones, anchor scales, and training strategies to improve mAP.
- MS-Faster R-CNN
- Trimmed SSD (with several top layers removed)