深度學習Caffe實戰筆記（3）用AlexNet跑自己的資料

阿新 • • 發佈：2019-01-18

上一篇部落格介紹瞭如何在caffe框架平臺下，用LeNet網路訓練車牌識別資料，今天介紹用AlexNet跑自己的資料，同樣基於windows平臺下，會比基於Ubuntu平臺下麻煩一些，特別是後面的Siamese網路，說起Siamese網路真是一把辛酸一把淚啊，先讓我哭一會，，，，，哭了5分鐘，算了，Siamese網路的苦水等以後再倒吧，言歸正傳，開始train。
在caffe平臺下，實現用Alexnet跑自己的資料步驟和上一篇的步驟差不多，可以說幾乎一樣。。。。。
1、準備資料
在caffe根目錄下data資料夾新建一個資料夾，名字自己起一個就行了，我起的名字是cloth，在cloth資料夾下新建兩個資料夾，分別存放train和val資料，在train資料夾下存放要分類的資料，要分幾類就建立幾個資料夾，分別把對應的影象放進去。（當然，也可以把所有的影象都放在一個資料夾下，只是在標籤檔案中標明就行）。
這裡寫圖片描述

然後建立train資料集對應的標籤txt檔案。同樣，在val資料夾下存放驗證資料，並建立驗證影象對應的txt標籤檔案。
這裡寫圖片描述

2、轉換資料
編譯成功的caffe根目錄下bin資料夾下有一個convert_imageset.exe檔案，用來轉換資料，在cloth資料夾下新建一個指令碼檔案，內容：
這裡寫圖片描述
mtrainldb和mvalldb分別是轉化好的資料集檔案，既caffe需要的檔案。這樣在cloth資料夾下會生成兩個資料夾：

這裡面儲存的就是生成的資料檔案。
3、計算均值
在cloth資料夾下新建一個計算均值的指令碼檔案，內容如下：
這裡寫圖片描述
用到的computer_image_mean也是bin目錄下生成的一個可執行檔案，用來計算均值，mtrainldb是存放訓練資料的資料夾，mimg_mean_binaryproto就是要生成的均值檔案。雙擊執行後會生成mimg_mean_binaryproto檔案，這個檔案就是計算出來的均值檔案。
4、開始訓練

同樣，在cloth資料夾下，新建一個train指令碼檔案，檔案內容如下：
這裡寫圖片描述

這個就不過多解釋了吧，solver就是Alexnet的超參檔案，開啟後如下：

net: "train_val.prototxt"   #需要用哪個網路訓練
test_iter: 1000             
test_interval: 1000
base_lr: 0.01              #初始化學習率
lr_policy: "step"          #學習策略，每stepsize之後，將學習率乘以gamma
gamma: 0.1   #學習率變化因子
stepsize: 100000  #每stpesize之後降低學習率 

display: 20
max_iter: 450000  #最大迭代次數
momentum: 0.9    #動量，上次引數更新的權重
weight_decay: 0.0005   #權重衰減量
snapshot: 10000    #每10000次儲存一次模型結果
snapshot_prefix: "cloth"  #模型儲存路徑
solver_mode: GPU   #CPU或者GPU訓練，這裡使用CPU，所以需要把GPU改成CPU

開啟train_val.prototxt，內容如下：

name: "AlexNet"
layer {
  name: "data"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TRAIN
  }
  transform_param {
    mirror: true
    crop_size: 227
    mean_file: "mimg_mean.binaryproto" #均值檔案
  }
  data_param {
    source: "mtrainldb"  #訓練資料
    batch_size: 256
    backend: LMDB
  }
}
layer {
  name: "data"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TEST
  }
  transform_param {
    mirror: false
    crop_size: 227
    mean_file: "mimg_mean.binaryproto"  #均值檔案
  }
  data_param {
    source: "mvaldb"   #驗證資料
    batch_size: 50
    backend: LMDB
  }
}
layer {
  name: "conv1"
  type: "Convolution"
  bottom: "data"
  top: "conv1"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 96
    kernel_size: 11
    stride: 4
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0
    }
  }
}
layer {
  name: "relu1"
  type: "ReLU"
  bottom: "conv1"
  top: "conv1"
}
layer {
  name: "norm1"
  type: "LRN"
  bottom: "conv1"
  top: "norm1"
  lrn_param {
    local_size: 5
    alpha: 0.0001
    beta: 0.75
  }
}
layer {
  name: "pool1"
  type: "Pooling"
  bottom: "norm1"
  top: "pool1"
  pooling_param {
    pool: MAX
    kernel_size: 3
    stride: 2
  }
}
layer {
  name: "conv2"
  type: "Convolution"
  bottom: "pool1"
  top: "conv2"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 256
    pad: 2
    kernel_size: 5
    group: 2
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0.1
    }
  }
}
layer {
  name: "relu2"
  type: "ReLU"
  bottom: "conv2"
  top: "conv2"
}
layer {
  name: "norm2"
  type: "LRN"
  bottom: "conv2"
  top: "norm2"
  lrn_param {
    local_size: 5
    alpha: 0.0001
    beta: 0.75
  }
}
layer {
  name: "pool2"
  type: "Pooling"
  bottom: "norm2"
  top: "pool2"
  pooling_param {
    pool: MAX
    kernel_size: 3
    stride: 2
  }
}
layer {
  name: "conv3"
  type: "Convolution"
  bottom: "pool2"
  top: "conv3"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 384
    pad: 1
    kernel_size: 3
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0
    }
  }
}
layer {
  name: "relu3"
  type: "ReLU"
  bottom: "conv3"
  top: "conv3"
}
layer {
  name: "conv4"
  type: "Convolution"
  bottom: "conv3"
  top: "conv4"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 384
    pad: 1
    kernel_size: 3
    group: 2
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0.1
    }
  }
}
layer {
  name: "relu4"
  type: "ReLU"
  bottom: "conv4"
  top: "conv4"
}
layer {
  name: "conv5"
  type: "Convolution"
  bottom: "conv4"
  top: "conv5"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 256
    pad: 1
    kernel_size: 3
    group: 2
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0.1
    }
  }
}
layer {
  name: "relu5"
  type: "ReLU"
  bottom: "conv5"
  top: "conv5"
}
layer {
  name: "pool5"
  type: "Pooling"
  bottom: "conv5"
  top: "pool5"
  pooling_param {
    pool: MAX
    kernel_size: 3
    stride: 2
  }
}
layer {
  name: "fc6"
  type: "InnerProduct"
  bottom: "pool5"
  top: "fc6"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  inner_product_param {
    num_output: 4096
    weight_filler {
      type: "gaussian"
      std: 0.005
    }
    bias_filler {
      type: "constant"
      value: 0.1
    }
  }
}
layer {
  name: "relu6"
  type: "ReLU"
  bottom: "fc6"
  top: "fc6"
}
layer {
  name: "drop6"
  type: "Dropout"
  bottom: "fc6"
  top: "fc6"
  dropout_param {
    dropout_ratio: 0.5
  }
}
layer {
  name: "fc7"
  type: "InnerProduct"
  bottom: "fc6"
  top: "fc7"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  inner_product_param {
    num_output: 4096
    weight_filler {
      type: "gaussian"
      std: 0.005
    }
    bias_filler {
      type: "constant"
      value: 0.1
    }
  }
}
layer {
  name: "relu7"
  type: "ReLU"
  bottom: "fc7"
  top: "fc7"
}
layer {
  name: "drop7"
  type: "Dropout"
  bottom: "fc7"
  top: "fc7"
  dropout_param {
    dropout_ratio: 0.5
  }
}
layer {
  name: "fc8"
  type: "InnerProduct"
  bottom: "fc7"
  top: "fc8"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  inner_product_param {
    num_output: 2       #注意：這裡需要改成你要分成的類的個數
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0
    }
  }
}
layer {
  name: "accuracy"
  type: "Accuracy"
  bottom: "fc8"
  bottom: "label"
  top: "accuracy"
  include {
    phase: TEST
  }
}
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "fc8"
  bottom: "label"
  top: "loss"
}

網路協議中需要注意的地方我都做了說明，其它的一些內容就不多做介紹了，想要了解的學友自行查資料吧。。。。
雙擊train.bat開始訓練吧。
這裡寫圖片描述

5、測試
同樣，測試需要一個類別標籤檔案，category.txt，
這裡寫圖片描述
在這裡介紹一下類別檔案的作用，第一個數就是訓練時打的標籤，後面那個是標籤對應的類別名稱，我這裡為了方便寫了0和1，其實應該把後面的0和1改成long sleve和short sleve，以便於測試的時候能直接顯示出是長袖或者短袖。
寫一個test指令碼，
這裡寫圖片描述
classification是bin目錄下一個可執行檔案，用於caffe中的分類。第二個是測試需要的協議，第三個我們訓練好的模型，第四個是均值檔案，第五個是類別標籤，第六個是需要測試的影象。開啟測試需要的deploy.prororxt，內容如下：

name: "AlexNet"
input: "data"
input_dim: 10          
input_dim: 3         #資料
input_dim: 227
input_dim: 227
layer {
  name: "conv1"
  type: "Convolution"
  bottom: "data"
  top: "conv1"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 96
    kernel_size: 11
    stride: 4
  }
}
layer {
  name: "relu1"
  type: "ReLU"
  bottom: "conv1"
  top: "conv1"
}
layer {
  name: "norm1"
  type: "LRN"
  bottom: "conv1"
  top: "norm1"
  lrn_param {
    local_size: 5
    alpha: 0.0001
    beta: 0.75
  }
}
layer {
  name: "pool1"
  type: "Pooling"
  bottom: "norm1"
  top: "pool1"
  pooling_param {
    pool: MAX
    kernel_size: 3
    stride: 2
  }
}
layer {
  name: "conv2"
  type: "Convolution"
  bottom: "pool1"
  top: "conv2"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 256
    pad: 2
    kernel_size: 5
    group: 2
  }
}
layer {
  name: "relu2"
  type: "ReLU"
  bottom: "conv2"
  top: "conv2"
}
layer {
  name: "norm2"
  type: "LRN"
  bottom: "conv2"
  top: "norm2"
  lrn_param {
    local_size: 5
    alpha: 0.0001
    beta: 0.75
  }
}
layer {
  name: "pool2"
  type: "Pooling"
  bottom: "norm2"
  top: "pool2"
  pooling_param {
    pool: MAX
    kernel_size: 3
    stride: 2
  }
}
layer {
  name: "conv3"
  type: "Convolution"
  bottom: "pool2"
  top: "conv3"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 384
    pad: 1
    kernel_size: 3
  }
}
layer {
  name: "relu3"
  type: "ReLU"
  bottom: "conv3"
  top: "conv3"
}
layer {
  name: "conv4"
  type: "Convolution"
  bottom: "conv3"
  top: "conv4"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 384
    pad: 1
    kernel_size: 3
    group: 2
  }
}
layer {
  name: "relu4"
  type: "ReLU"
  bottom: "conv4"
  top: "conv4"
}
layer {
  name: "conv5"
  type: "Convolution"
  bottom: "conv4"
  top: "conv5"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  convolution_param {
    num_output: 256
    pad: 1
    kernel_size: 3
    group: 2
  }
}
layer {
  name: "relu5"
  type: "ReLU"
  bottom: "conv5"
  top: "conv5"
}
layer {
  name: "pool5"
  type: "Pooling"
  bottom: "conv5"
  top: "pool5"
  pooling_param {
    pool: MAX
    kernel_size: 3
    stride: 2
  }
}
layer {
  name: "fc6"
  type: "InnerProduct"
  bottom: "pool5"
  top: "fc6"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  inner_product_param {
    num_output: 4096
  }
}
layer {
  name: "relu6"
  type: "ReLU"
  bottom: "fc6"
  top: "fc6"
}
layer {
  name: "drop6"
  type: "Dropout"
  bottom: "fc6"
  top: "fc6"
  dropout_param {
    dropout_ratio: 0.5
  }
}
layer {
  name: "fc7"
  type: "InnerProduct"
  bottom: "fc6"
  top: "fc7"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  inner_product_param {
    num_output: 4096    #改成和訓練網路一致
  }
}
layer {
  name: "relu7"
  type: "ReLU"
  bottom: "fc7"
  top: "fc7"
}
layer {
  name: "drop7"
  type: "Dropout"
  bottom: "fc7"
  top: "fc7"
  dropout_param {
    dropout_ratio: 0.5
  }
}
layer {
  name: "fc8"
  type: "InnerProduct"
  bottom: "fc7"
  top: "fc8"
  param {
    lr_mult: 1
    decay_mult: 1
  }
  param {
    lr_mult: 2
    decay_mult: 0
  }
  inner_product_param {
    num_output: 2       #輸出的類別個數
  }
}
layer {
  name: "prob"
  type: "Softmax"
  bottom: "fc8"
  top: "prob"
}

雙擊執行，測試結果：
這裡寫圖片描述
表示為短袖的概率為100%.測試圖片為：

這兩個部落格一個是完成lenet網路訓練，一個是完成Alexnet網路訓練，那麼舉二反無窮，不管用什麼網路，用自己的資料來跑基本上就是這個步驟了對不對，即使有不同，也是大同小異，只需要做小量修改即可，到時候需要具體情況具體分析。。。。。。。

寫在後面的話：
話說寫部落格也不是一件輕鬆的事情啊，好累，好費時間，在我學習的過程中有好多都是從別的同學的部落格裡學習到的，所以我很願意把我學習到的東西分享給別人，因為我也是從一頭霧水，問誰誰不理的時候一步步走來的。說到分享，昨天看了一個上本科時學院的書記轉發的一個朋友圈文章，叫“為什麼層次越高的人，計較的越少”，就談到了願不願分享的問題，有的人覺得自己辛苦得來的東西，害怕別人知道了，不願意告訴別人怎麼做的，其實這樣的人格局實在太小，殊不知你告訴別人的只不過是一個結果，在你探索過程中得到的分析能力，思維方式，你的知識結構別人是拿不去的！你在乎的，往往能反映出你的水準，為什麼層次越高的人，反而計較的越少呢？不是說他有多麼寬容，而是有些事根本入不了他的眼。願不願意分享，就能看出一個人的眼界和格局，而眼界和格局往往才是決定一個人盛衰成敗的關鍵！共勉。。。。。。

深度學習Caffe實戰筆記（3）用AlexNet跑自己的資料

深度學習Caffe實戰筆記（3）用AlexNet跑自己的資料

深度學習Caffe實戰筆記（2）用LeNet跑車牌識別資料

深度學習tensorflow實戰筆記（3）VGG-16訓練自己的資料並測試和儲存模型

深度學習Caffe實戰筆記（10）Windows Caffe使用MATLAB介面提取和視覺化特徵

深度學習Caffe實戰筆記（21）Windows平臺 Faster-RCNN 訓練好的模型測試資料

深度學習caffe實戰筆記（4）Windows caffe平臺下跑cifar10

深度學習Caffe實戰筆記（20）Windows平臺 Faster-RCNN 訓練自己的資料集

深度學習caffe實戰筆記（13）利用MATLAB視覺化mnist資料集

深度學習Caffe實戰筆記（5）Windows caffe平臺跑Siamese mnist資料集

深度學習Caffe實戰筆記（18）Windows平臺 Faster-RCNN 環境配置

深度學習Caffe實戰筆記（7）Caffe平臺下，如何調整卷積神經網路結構

深度學習Caffe實戰筆記（15）CNN網路結構視覺化

深度學習tensorflow實戰筆記（5）用預訓練好的VGG-16模型提取影象特徵

深度學習tensorflow實戰筆記（1）全連線神經網路（FCN）訓練自己的資料（從txt檔案中讀取）

深度學習tensorflow實戰筆記（4）利用儲存的VGG-16CNN網路模型提取特徵

20180813視頻筆記深度學習基礎上篇（1）之必備基礎知識點深度學習基礎上篇（2）神經網絡模型視頻筆記：深度學習基礎上篇（3）神經網絡案例實戰和深度學習基礎下篇

redis實戰筆記（3）-第3章 Redis命令

斯坦福CS224N_自然語言處理NLP深度學習DL課程筆記（一）

java面向物件學習相關簡要筆記（3）

Coursera吳恩達《神經網路與深度學習》課程筆記（2）-- 神經網路基礎之邏輯迴歸

深度學習Caffe實戰筆記（3）用AlexNet跑自己的資料

相關推薦