Reading Notes: RGB-D object recognition and pose estimation based on pre-trained convolutional neural network

阿新 • Published: 2019-02-08

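I recently realized that writing down my impressions of the papers I read is well worth doing. On the one hand, putting my thoughts into words trains my academic writing and makes later paper writing easier. On the other hand, it makes it easier to look back on my own work.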

These notes represent only my own views. If I have misunderstood the paper anywhere, corrections are welcome.

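As the title indicates, this paper uses transfer learning to apply a pre-trained CNN to the recognition and pose estimation of household objects. To obtain the object pose and improve recognition accuracy, the paper uses RGB-D information to train the neural network. Convolutional neural networks (the paper uses the model that A. Krizhevsky used on ImageNet ILSVRC 2011; A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," in Advances in Neural Information Processing Systems (NIPS), 2012, pp. 1097–1105) are mostly trained on RGB images, whereas a depth map is represented as a grayscale image. To make the depth map usable as network input, the authors use a trick: they first extract the target object from the depth map and then colorize it, which yields a colorized image (illustrated by a figure in the original post).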
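The colorization step described above can be sketched as follows. This is a minimal illustration and not the paper's exact scheme: the function name `colorize_depth`, the object mask input, and the red-green-blue palette are all assumptions made for the example. It only shows the general idea of turning a single-channel depth crop into a three-channel image that an RGB-trained CNN can accept.

```python
import numpy as np

def colorize_depth(depth, mask):
    """Map the masked depth values to an RGB image (uint8, H x W x 3).

    depth: 2-D array of depth values.
    mask:  2-D boolean array, True where the target object is.
    The palette (near = red, middle = green, far = blue) is an
    illustrative assumption, not the palette used in the paper.
    """
    depth = np.asarray(depth, dtype=np.float64)
    mask = np.asarray(mask, dtype=bool)
    out = np.zeros(depth.shape + (3,), dtype=np.uint8)  # background stays black
    if not mask.any():
        return out

    d = depth[mask]
    span = d.max() - d.min()
    # Normalize object depth to [0, 1]; a flat object maps to 0 everywhere.
    t = (d - d.min()) / span if span > 0 else np.zeros_like(d)

    r = np.clip(1.0 - 2.0 * t, 0.0, 1.0)
    g = 1.0 - np.abs(2.0 * t - 1.0)
    b = np.clip(2.0 * t - 1.0, 0.0, 1.0)
    out[mask] = np.round(np.stack([r, g, b], axis=1) * 255).astype(np.uint8)
    return out

# Tiny example: three object pixels at depths 1, 2, 3 and one background pixel.
depth = np.array([[1.0, 2.0],
                  [3.0, 0.0]])
mask = depth > 0
colored = colorize_depth(depth, mask)
```

Normalizing within the object's own depth range, rather than the whole scene, keeps the coloring independent of how far the object is from the camera.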

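The authors then feed the output of the convolutional neural network to an SVM (support vector machine) to obtain the object category and pose (the paper does not describe this part in detail, so I do not fully understand how it is done).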
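To make the "CNN features, then SVM" idea concrete, here is a rough sketch of a linear SVM trained on fixed feature vectors. Everything in it is an assumption for illustration: the features are random stand-ins for CNN activations, the problem is binary rather than multi-class, and the small subgradient trainer `train_linear_svm` is hand-written for the example (in practice one would likely use an existing SVM library). It covers only the classification side; pose estimation is left out because the details are not given in these notes.

```python
import numpy as np

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Binary linear SVM via batch subgradient descent on the hinge loss.

    X: (n, d) feature matrix; y: (n,) labels in {-1, +1}.
    lam, lr and epochs are illustrative defaults, not tuned values.
    """
    n, d = X.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(epochs):
        margins = y * (X @ w + b)
        active = margins < 1.0  # samples inside or beyond the margin
        y_act = y[active][:, None]
        grad_w = lam * w - (y_act * X[active]).sum(axis=0) / n
        grad_b = -y[active].sum() / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

def predict(X, w, b):
    """Return +1 or -1 for each row of X."""
    return np.where(X @ w + b >= 0.0, 1, -1)

# Stand-in "CNN features": two well-separated clusters of 8-D vectors.
rng = np.random.default_rng(0)
feats_a = rng.normal(loc=2.0, scale=0.5, size=(50, 8))
feats_b = rng.normal(loc=-2.0, scale=0.5, size=(50, 8))
X = np.vstack([feats_a, feats_b])
y = np.concatenate([np.ones(50), -np.ones(50)])

w, b = train_linear_svm(X, y)
accuracy = (predict(X, w, b) == y).mean()
```

The point of this design is that the CNN stays frozen and acts only as a feature extractor, so the only part trained on the new object data is the lightweight SVM on top.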
