Three models for Kaggle’s “Flowers Recognition” Dataset

阿新 • • 發佈：2018-12-28

After resizing, samples were divided into two parts for training and validation. I didn’t use test dataset because 1) the experiment was not for competition purpose that I didn’t need a precise accuracy; 2) the dataset was already very small for training.

The train_images was added by sequence from sub-folders so we need to shuffle the dataset. Otherwise, the model can only learn what is “daisy” from the first 800 images, which wouldn't optimise the parameters of the model. Note that the seed need to be set and applied to both train_images and train_labels so that each image can match the right label.

Build The Model

The first model I used was a model built from scratch. I had three hidden layers and two FN. The convolution shape for each layer was 32, 64 and 128, the most common setting for image classification tasks. The activation I used was ‘ReLU’. The pooling size I used was 2x2. Two dense function with size 512 and 128 accompanying ‘ReLU’ activation function. ‘softmax’ function was used at last dense

function with size 5 as the output layer. The loss function I used was ‘categorical_crossentropy’. The optimiser I used here was ‘adam’ which can automatically change learning rate during training process.

The second model I tried was customised pre-trained model VGG19. Here I froze the first 5 layers with untrainable parameters and the customised layers were two dense functions with size 1028 accompanying ‘ReLU’ activation function. The last layer and corresponding parameters I chose were the same as the first model.

The third model I tried was customised pre-trained model ResNet-50. All settings were the same as the second model except the first layer was untrainable, which meant my model would fit more to the new dataset.

Summary for model built from pre-trained model ResNet-50

Input Data

Considering the dataset was small, data augmentation might be a useful way to improve the accuracy. Except for normalising pixel value with 255, rotation, shift, shear, zoom and horizontal flip were applied to input data by ImageDataGenerator in Keras.

Train and Evaluation the Model

The batch size I used was 32, which was quite common and friendly to GPU’s parallel computing.

The epochs I chose for these three models were different.

I used 50 for the first model because the model was built from scratch and took longer time to learn.
I used 10 for the pre-trained VGG19 model because I found that the accuracies for train and validation were both low (only 0.24). Even though I tried to froze the first layer only, the result was similar. ResNet-50 has more complex layer and larger number of params than VGG19. Maybe the information preserved in model VGG19 is not big enough for this dataset, or unsuitable for this dataset. I might try to add more convolutional layer to VGG19 for training in the future.
The third (with pre-trained ResNet-50 model) I used was 30. I didn’t imagine that it could fit so well in just around 12 epochs. Considering VGG19 and ResNet-50 used the same dataset ‘ImageNet’, the features they catch should be similar. It was a little suprise that I got totally different results.

As we can see, the accuracy is very good for customised ResNet-50 model. But I guess these five flowers classes were contained in ImageNet Dataset, which was a little cheating for my experiment.

Finally, I randomly downloaded flowers pictures online and tested with my model. Customised model with ResNet-50 worked perfectly well and model built from scratch also made resonable predictions.

Prediction for images downloaded from Baidu

So, this is for now. I might update my model in the future and hope I can find out why the second and third results are that different.

Again, the code and details can be viewed on my github.

Thanks for your watching!

Three models for Kaggle’s “Flowers Recognition” Dataset

After resizing, samples were divided into two parts for training and validation. I didn’t use test dataset because 1) the experiment was not for competitio

論文筆記：雙線性模型《Bilinear CNN Models for Fine-Grained Visual Recognition》

雙線性模型是2015年提出的一種細粒度影象分類模型。該模型使用的是兩個並列的CNN模型，這種CNN模型使用的是AlexNet或VGGNet去掉最後的全連線層和softmax層，這個作為特徵提取器，然後使用SVM作為最後的線性分類器。當然，作者還在實驗中嘗試了多種方法，比如最後使用softmax但

Three robot advances that'll be needed for DARPA's new underground challenge

This week, the US Defense Advanced Research Projects Agency announced a challenge to push the limits of robotic design and control. DARPA's Subterranean Ch

VGGnet論文總結（VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION）

lrn cli 共享融合 loss sca 得到同時 works VGGNet的主要貢獻：　　1、增加了網絡結構的深度　　2、使用了更小的filter（3*3） 1 introduction 這部分主要說明了，由於在所有的卷積網絡上使用了3*3的filter，所以使

論文閱讀：A Primer on Neural Network Models for Natural Language Processing（1）

選擇 works embed 負責距離 feature 結構 tran put 前言 2017.10.2博客園的第一篇文章，Mark。由於實驗室做的是NLP和醫療相關的內容，因此開始啃NLP這個硬骨頭，希望能學有所成。後續將關註知識圖譜，深度強化學習等內

Chapter3_Linear Models for Regression(討論課)

對數公式推導 ace 最小化 font 分布推導 image 關於討論課提綱：自我介紹簡單說一下回歸的主要問題，給定數據集，找出輸入和輸出之間的關系，對於一個新的輸入可以預測其輸出我們將從兩個角度來討論這個問題，一個是傳統的頻率學派，

Utterance-Wise Recurrent Dropout And Iterative Speaker Adaptation For Robust Monaural Speech Recognition

back hid eve 以及 pre learn line sig ann 單聲道語音識別的逐句循環Dropout叠代說話人自適應 WRBN（wide residual BLSTM network，寬殘差雙向長短時記憶網絡） [2] J. Heymann

MTK功能機編譯錯誤ToolsMSYSinmake.exe: *** Couldn’t reserve spac e for cygwin’s heap, Win32 error

批處理文件方法 ould please parser build 功能機 mtk bin MTK功能機編譯錯誤 E:\workspace\project\XIN03D_11C\Tools\MSYS\bin\make.exe: *** Couldn‘t reserve s

Password authencated key exchange based on lattice for C/S model&&Resistance to quantum computers

sed concise ech show real public 技術分享 rime 分享 Password authented key exchange based on lattice for C/S model l&& Resistance to qu

【USE】《An End-to-End System for Automatic Urinary Particle Recognition with CNN》

Urine Sediment Examination（USE） JMOS-2018 目錄目錄 1 Background and Motivation 2 Innovation

Deep Mixture of Diverse Experts for Large-Scale Visual Recognition 閱讀及相關疑問

1. 背景大規模視覺識別有三大方向：1）對網路結構改造，加深網路，增加每層網路的神經元數量。 2）做遷移學習：例如學習到的1000類分類器用在500類（大用在小）。 3）多個CNN結合：多個1000類分類器來識別10000類（小用在大）。——本文的方向 Deep Mixture ：深度混合

【論文閱讀】Siamese Neural Networks for One-shot Image Recognition

關鍵詞： one-short learning : 待解決的問題只有少量的標註資料，先驗知識很匱乏，遷移學習就屬於one-short learning的一種 zero-short learning: 這個種情況下完全沒有

《An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its...》論文閱讀之CRNN

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition paper: CRNN 翻譯：CRNN

Learning Invariant Deep Representation for NIR-VIS Face Recognition

查詢異質影象匹配的過程中，發現幾篇某組的論文，都是關於NIR-VIS的識別問題，提到了許多處理異質影象的處理方法，網路結構和idea都很不錯，記錄其中一篇。摘要 VIS-NIR（可見光與近紅外）面部識別仍然是異質影象識別中的挑戰。本文只用一個網路來對映NIR和VIS影象至一個緊湊的歐式空間。網路的低階層

《2018-Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition》

動機這篇文章開篇就指出，我們的模型是要從人體動作的序列中選取出最informative的那些幀，而丟棄掉用處不大的部分。但是由於對於不同的視訊序列，挑出最有代表性的幀的方法是不同的，因此，本文提出用深度增強學習來將幀的選擇模擬為一個不斷進步的progressive proces

深度學習論文翻譯解析（二）：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

論文標題：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition 論文作者： Baoguang Shi, Xiang B

Three models for Kaggle’s “Flowers Recognition” Dataset

Build The Model

Input Data

Train and Evaluation the Model

Three models for Kaggle’s “Flowers Recognition” Dataset

論文筆記：雙線性模型《Bilinear CNN Models for Fine-Grained Visual Recognition》

Three robot advances that'll be needed for DARPA's new underground challenge

VGGnet論文總結（VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION）

論文閱讀：A Primer on Neural Network Models for Natural Language Processing（1）

Chapter3_Linear Models for Regression(討論課)

Utterance-Wise Recurrent Dropout And Iterative Speaker Adaptation For Robust Monaural Speech Recognition

MTK功能機編譯錯誤ToolsMSYSinmake.exe: *** Couldn’t reserve spac e for cygwin’s heap, Win32 error

Password authencated key exchange based on lattice for C/S model&&Resistance to quantum computers

【USE】《An End-to-End System for Automatic Urinary Particle Recognition with CNN》

Deep Mixture of Diverse Experts for Large-Scale Visual Recognition 閱讀及相關疑問

【論文閱讀】Siamese Neural Networks for One-shot Image Recognition

《An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its...》論文閱讀之CRNN

Learning Invariant Deep Representation for NIR-VIS Face Recognition

《2018-Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition》

深度學習論文翻譯解析（二）：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition

The reasons for bitcoin’s falling

Constructing Category-Specific Models for Monocular Object-SLAM（閱讀筆記)

MACNN-Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition

Three models for Kaggle’s “Flowers Recognition” Dataset

Build The Model

Input Data

Train and Evaluation the Model

相關推薦