
Using Caffe to classify your own data


This post walks through a single example: training the AlexNet network on your own data.

Running AlexNet on your own data
Reference 1: http://blog.csdn.net/gybheroin/article/details/54095399
Reference 2: http://www.cnblogs.com/alexcai/p/5469436.html
1. Prepare the data

Create a new folder under data/ in the Caffe root directory; any name will do (I used food). Inside food, create two folders holding the train and val data.
Under train, create one folder per class to be recognized (toast, pizza, and so on; as many folders as there are classes) and put the corresponding images into each. (All images can also go into a single folder, as long as the label file records each image's class.)
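The per-class folders above can be turned into the train.txt, val.txt, and category.txt label files used later with a small script. This is only a sketch: the function name is mine, and it assumes the images are .jpg files sitting in one subfolder per class.

```shell
#!/usr/bin/env sh
# Sketch: build "relative/path label" listings plus a category file from a
# directory holding one subfolder per class (assumed layout; adjust paths).
make_labels() {
    train_dir=$1; out=$2; cat_out=$3
    : > "$out"
    : > "$cat_out"
    label=0
    for class_dir in "$train_dir"/*/; do
        [ -d "$class_dir" ] || continue
        class=$(basename "$class_dir")
        for img in "$class_dir"*.jpg; do
            [ -f "$img" ] || continue
            echo "$class/$(basename "$img") $label" >> "$out"
        done
        echo "$label $class" >> "$cat_out"
        label=$((label + 1))
    done
}

# Example (run from the caffe root):
# make_labels data/food/train data/food/train.txt data/food/category.txt
```

Classes are numbered in the order the shell expands the glob (alphabetical), so run the same script over train and val to keep the label numbering consistent.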
The resulting layout:

./data/food
    train/    (pizza, sandwich, ...)
    val/      (pizza, sandwich, ...)

Then create train.txt, val.txt, and category.txt in the food directory.
train.txt and val.txt contain lines like:

toast/62.jpg 0
toast/107.jpg 0
toast/172.jpg 0
pizza/62.jpg 1
pizza/107.jpg 1
pizza/172.jpg 1

category.txt contains lines like:

0 toast
1 pizza

Note: the images are split into a training set (train) and a test set (test). The train:test ratio is usually at least 5:1, and no class should have too few images; here each class has roughly 5000 training images plus 1000 test images.

2. Build the lmdb

(The lmdb conversion is optional: setting the data layer's type to "ImageData" in the train prototxt lets you train directly from image files.)
A successful Caffe build leaves a convert_imageset(.exe) tool under the bin folder for converting the data. In the food folder, create a script create_foodnet.sh modeled on examples/imagenet/create_imagenet.sh:

#!/usr/bin/env sh
# Create the imagenet lmdb inputs
# N.B. set the path to the imagenet train + val data dirs
set -e

EXAMPLE=data/food   # where the generated lmdb data goes
DATA=data/food      # where train.txt and val.txt live
TOOLS=build/tools

TRAIN_DATA_ROOT=/path/to/imagenet/train/
VAL_DATA_ROOT=/path/to/imagenet/val/

# Set RESIZE=true to resize the images to 256x256. Leave as false if images
# have already been resized using another tool.
RESIZE=false
if $RESIZE; then
  RESIZE_HEIGHT=256
  RESIZE_WIDTH=256
else
  RESIZE_HEIGHT=0
  RESIZE_WIDTH=0
fi

if [ ! -d "$TRAIN_DATA_ROOT" ]; then
  echo "Error: TRAIN_DATA_ROOT is not a path to a directory: $TRAIN_DATA_ROOT"
  echo "Set the TRAIN_DATA_ROOT variable in create_imagenet.sh to the path" \
       "where the ImageNet training data is stored."
  exit 1
fi

if [ ! -d "$VAL_DATA_ROOT" ]; then
  echo "Error: VAL_DATA_ROOT is not a path to a directory: $VAL_DATA_ROOT"
  echo "Set the VAL_DATA_ROOT variable in create_imagenet.sh to the path" \
       "where the ImageNet validation data is stored."
  exit 1
fi

echo "Creating train lmdb..."
GLOG_logtostderr=1 $TOOLS/convert_imageset \
    --resize_height=$RESIZE_HEIGHT \
    --resize_width=$RESIZE_WIDTH \
    --shuffle \
    $TRAIN_DATA_ROOT \
    $DATA/train.txt \
    $EXAMPLE/food_train_lmdb    # path of the generated lmdb

echo "Creating val lmdb..."
GLOG_logtostderr=1 $TOOLS/convert_imageset \
    --resize_height=$RESIZE_HEIGHT \
    --resize_width=$RESIZE_WIDTH \
    --shuffle \
    $VAL_DATA_ROOT \
    $DATA/val.txt \
    $EXAMPLE/food_val_lmdb      # path of the generated lmdb

echo "Done."
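A common failure mode at this step is a listing file that points at images which do not exist under the data root. The sketch below checks every path in a listing before conversion; the function name and paths are mine, assuming the "path label" format shown earlier.

```shell
#!/usr/bin/env sh
# Sketch: verify that every image named in a listing file ("path label" per
# line) exists under the given root before running convert_imageset.
check_listing() {
    root=$1; listing=$2; missing=0
    while read -r path label; do
        [ -f "$root/$path" ] || { echo "missing: $root/$path"; missing=$((missing + 1)); }
    done < "$listing"
    echo "$missing missing files"
}

# Example:
# check_listing data/food/train data/food/train.txt
```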
3. Generate the mean file

Next, use the lmdb to compute the image mean, which is used during training:

EXAMPLE=data/food
DATA=data/food
TOOLS=build/tools
$TOOLS/compute_image_mean $EXAMPLE/food_train_lmdb $DATA/foodnet_mean.binaryproto

4. Modify the solver and the train net

solver.prototxt, annotated:

# Number of test iterations. One iteration pushes one batch of images
# through the network, so to test every image in the validation set this
# value times the TEST batch_size should equal the number of validation
# images: test_iter * batch_size = val_num.
test_iter: 299
# How many training iterations between two tests. One iteration is the
# full forward and backward pass over one batch. With 224, accuracy is
# validated every 224 iterations. Generally the whole training set should
# pass through the network once between tests, so this value times the
# TRAIN data layer's batch_size should equal the number of training
# images: test_interval * batch_size = train_num.
test_interval: 224
# Base learning rate. Too high and the loss may get stuck at a constant
# value (e.g. 86.33333) or fail to converge; too low and the network
# converges slowly and gradients may vanish. 0.01 is a common default.
base_lr: 0.01
display: 20
max_iter: 6720
lr_policy: "step"
gamma: 0.1
momentum: 0.9            # weight given to the previous parameter update
weight_decay: 0.0001
stepsize: 2218           # lower the learning rate every stepsize iterations
snapshot: 224            # save a snapshot (.caffemodel) every 224 iterations
snapshot_prefix: "food/food_net/food_alex_snapshot"  # snapshot path and prefix
solver_mode: GPU
net: "train_val.prototxt"   # path to the network definition file
solver_type: SGD

train_val.prototxt changes. The two variants differ mainly in the data layer. To feed raw images, it uses type "ImageData":

layer {
  name: "data"
  type: "ImageData"   # ImageData: train directly from image files
  top: "data"
  top: "label"
  include { phase: TRAIN }
  image_data_param {
    source: "examples/finetune_myself/train.txt"
    batch_size: 50
    new_height: 256
    new_width: 256
  }
}

To feed lmdb data, it uses type "Data":

layer {
  name: "data"
  type: "Data"        # Data: train from the images converted to lmdb
  top: "data"
  top: "label"
  include { phase: TRAIN }
  data_param {
    source: "examples/imagenet/car_train_lmdb"
    batch_size: 256
    backend: LMDB
  }
}

The full network definition:

name: "AlexNet"
layer {
  name: "data"
  type: "Data"
  top: "data"
  top: "label"
  include { phase: TRAIN }
  transform_param {
    mirror: true
    crop_size: 227
    mean_file: "mimg_mean.binaryproto"   # mean file
  }
  data_param {
    source: "mtrainldb"                  # training data
    batch_size: 256
    backend: LMDB
  }
}
layer {
  name: "data"
  type: "Data"
  top: "data"
  top: "label"
  include { phase: TEST }
  transform_param {
    mirror: false
    crop_size: 227
    mean_file: "mimg_mean.binaryproto"   # mean file
  }
  data_param {
    source: "mvaldb"                     # validation data
    batch_size: 50
    backend: LMDB
  }
}
layer {
  name: "conv1"
  type: "Convolution"
  bottom: "data"
  top: "conv1"
  param { lr_mult: 1 decay_mult: 1 }
  param { lr_mult: 2 decay_mult: 0 }
  convolution_param {
    num_output: 96
    kernel_size: 11
    stride: 4
    weight_filler { type: "gaussian" std: 0.01 }
    bias_filler { type: "constant" value: 0 }
  }
}
layer { name: "relu1" type: "ReLU" bottom: "conv1" top: "conv1" }
layer {
  name: "norm1"
  type: "LRN"
  bottom: "conv1"
  top: "norm1"
  lrn_param { local_size: 5 alpha: 0.0001 beta: 0.75 }
}
layer {
  name: "pool1"
  type: "Pooling"
  bottom: "norm1"
  top: "pool1"
  pooling_param { pool: MAX kernel_size: 3 stride: 2 }
}
layer {
  name: "conv2"
  type: "Convolution"
  bottom: "pool1"
  top: "conv2"
  param { lr_mult: 1 decay_mult: 1 }
  param { lr_mult: 2 decay_mult: 0 }
  convolution_param {
    num_output: 256
    pad: 2
    kernel_size: 5
    group: 2
    weight_filler { type: "gaussian" std: 0.01 }
    bias_filler { type: "constant" value: 0.1 }
  }
}
layer { name: "relu2" type: "ReLU" bottom: "conv2" top: "conv2" }
layer {
  name: "norm2"
  type: "LRN"
  bottom: "conv2"
  top: "norm2"
  lrn_param { local_size: 5 alpha: 0.0001 beta: 0.75 }
}
layer {
  name: "pool2"
  type: "Pooling"
  bottom: "norm2"
  top: "pool2"
  pooling_param { pool: MAX kernel_size: 3 stride: 2 }
}
layer {
  name: "conv3"
  type: "Convolution"
  bottom: "pool2"
  top: "conv3"
  param { lr_mult: 1 decay_mult: 1 }
  param { lr_mult: 2 decay_mult: 0 }
  convolution_param {
    num_output: 384
    pad: 1
    kernel_size: 3
    weight_filler { type: "gaussian" std: 0.01 }
    bias_filler { type: "constant" value: 0 }
  }
}
layer { name: "relu3" type: "ReLU" bottom: "conv3" top: "conv3" }
layer {
  name: "conv4"
  type: "Convolution"
  bottom: "conv3"
  top: "conv4"
  param { lr_mult: 1 decay_mult: 1 }
  param { lr_mult: 2 decay_mult: 0 }
  convolution_param {
    num_output: 384
    pad: 1
    kernel_size: 3
    group: 2
    weight_filler { type: "gaussian" std: 0.01 }
    bias_filler { type: "constant" value: 0.1 }
  }
}
layer {
  name: "relu4"
  type: "ReLU"
  bottom: "conv4"
  top: "conv4"
}
layer {
  name: "conv5"
  type: "Convolution"
  bottom: "conv4"
  top: "conv5"
  param { lr_mult: 1 decay_mult: 1 }
  param { lr_mult: 2 decay_mult: 0 }
  convolution_param {
    num_output: 256
    pad: 1
    kernel_size: 3
    group: 2
    weight_filler { type: "gaussian" std: 0.01 }
    bias_filler { type: "constant" value: 0.1 }
  }
}
layer { name: "relu5" type: "ReLU" bottom: "conv5" top: "conv5" }
layer {
  name: "pool5"
  type: "Pooling"
  bottom: "conv5"
  top: "pool5"
  pooling_param { pool: MAX kernel_size: 3 stride: 2 }
}
layer {
  name: "fc6"
  type: "InnerProduct"
  bottom: "pool5"
  top: "fc6"
  param { lr_mult: 1 decay_mult: 1 }
  param { lr_mult: 2 decay_mult: 0 }
  inner_product_param {
    num_output: 4096
    weight_filler { type: "gaussian" std: 0.005 }
    bias_filler { type: "constant" value: 0.1 }
  }
}
layer { name: "relu6" type: "ReLU" bottom: "fc6" top: "fc6" }
layer {
  name: "drop6"
  type: "Dropout"
  bottom: "fc6"
  top: "fc6"
  dropout_param { dropout_ratio: 0.5 }
}
layer {
  name: "fc7"
  type: "InnerProduct"
  bottom: "fc6"
  top: "fc7"
  param { lr_mult: 1 decay_mult: 1 }
  param { lr_mult: 2 decay_mult: 0 }
  inner_product_param {
    num_output: 4096
    weight_filler { type: "gaussian" std: 0.005 }
    bias_filler { type: "constant" value: 0.1 }
  }
}
layer { name: "relu7" type: "ReLU" bottom: "fc7" top: "fc7" }
layer {
  name: "drop7"
  type: "Dropout"
  bottom: "fc7"
  top: "fc7"
  dropout_param { dropout_ratio: 0.5 }
}
layer {
  name: "fc8"
  type: "InnerProduct"
  bottom: "fc7"
  top: "fc8"
  param { lr_mult: 1 decay_mult: 1 }
  param { lr_mult: 2 decay_mult: 0 }
  inner_product_param {
    num_output: 2   # note: change this to the number of classes you want
    weight_filler { type: "gaussian" std: 0.01 }
    bias_filler { type: "constant" value: 0 }
  }
}
layer {
  name: "accuracy"
  type: "Accuracy"
  bottom: "fc8"
  bottom: "label"
  top: "accuracy"
  include { phase: TEST }
}
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "fc8"
  bottom: "label"
  top: "loss"
}

Run the following script to start training:

#!/usr/bin/env sh
set -e
./build/tools/caffe train \
    --solver=food/food_alexnet/solver.prototxt

5. Testing

Testing likewise needs a class-label file, category.txt, with the same content as above. Modify deploy.prototxt accordingly, then start testing:

./bin/classification "food/foodnet/deploy.prototxt" "food/foodnet/food_iter_100000.caffemodel" "ming_mean.binaryproto" "test001.jpg"

Fine-tuning

http://www.cnblogs.com/denny402/p/5074212.html
http://www.cnblogs.com/alexcai/p/5469478.html

1. When fine-tuning, rename the last fully connected layer, change its number of outputs to your class count, and give it a comparatively large learning rate: only this layer's weights are retrained, while all the other layers are already trained.
2. When starting training, pass the model to be fine-tuned as the initial weights:

./build/tools/caffe train -solver examples/money_test/fine_tune/solver.prototxt -weights models/bvlc_reference_caffenet/bvlc_reference_caffenet.caffemodel

Here -weights points to the pretrained CaffeNet model.
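Besides the classification binary, validation-set accuracy can also be checked with the stock `caffe test` tool. The sketch below only assembles and prints the command: the snapshot filename is hypothetical (it depends on snapshot_prefix and the iteration reached), and -iterations should match test_iter from the solver so the whole validation set is covered.

```shell
#!/usr/bin/env sh
# Sketch: build a `caffe test` command for a trained snapshot. Substitute
# the snapshot actually written during training before running it.
MODEL=food/food_net/train_val.prototxt
WEIGHTS=food/food_net/food_alex_snapshot_iter_6720.caffemodel
ITERS=299   # should equal test_iter in solver.prototxt

CMD="./build/tools/caffe test -model $MODEL -weights $WEIGHTS -iterations $ITERS -gpu 0"
# Print the command; run it once training has produced the snapshot.
echo "$CMD"
```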
