
Continuing training from an existing model in Caffe

1. Resuming training from a snapshot

Caffe supports continuing training on top of an existing model (for example, someone else's pretrained model). Below is the example that ships with Caffe:

caffe-master0818\examples\imagenet\resume_training.sh

#!/usr/bin/env sh

./build/tools/caffe train \
    --solver=models/bvlc_reference_caffenet/solver.prototxt \
    --snapshot=models/bvlc_reference_caffenet/caffenet_train_10000.solverstate.h5
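
The same resume pattern works for your own training runs. A minimal sketch with placeholder paths (my_solver.prototxt and my_net_iter_10000.* are just stand-ins for your own files): --snapshot restores the full solver state, while --weights only copies trained weights for fine-tuning.

#!/usr/bin/env sh
# Resume an interrupted run: --snapshot restores the solver state
# (weights, momentum history and current iteration) from a .solverstate file.
./build/tools/caffe train \
    --solver=path/to/my_solver.prototxt \
    --snapshot=path/to/my_net_iter_10000.solverstate

# To fine-tune from trained weights instead (fresh solver state, iteration 0),
# pass the .caffemodel via --weights rather than --snapshot.
./build/tools/caffe train \
    --solver=path/to/my_solver.prototxt \
    --weights=path/to/my_net_iter_10000.caffemodel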

2. Caffe also supports training with several successive learning-rate drops

For example, caffe-master0818\examples\cifar10\train_full.sh:

#!/usr/bin/env sh

TOOLS=./build/tools

$TOOLS/caffe train \
    --solver=examples/cifar10/cifar10_full_solver.prototxt

# reduce learning rate by factor of 10 (the lr1 solver config carries the lowered rate)
$TOOLS/caffe train \
    --solver=examples/cifar10/cifar10_full_solver_lr1.prototxt \
    --snapshot=examples/cifar10/cifar10_full_iter_60000.solverstate.h5

# reduce learning rate by factor of 10 again (the lr2 solver config carries the lowered rate)
$TOOLS/caffe train \
    --solver=examples/cifar10/cifar10_full_solver_lr2.prototxt \
    --snapshot=examples/cifar10/cifar10_full_iter_65000.solverstate.h5

Another model trained the same way:
#!/usr/bin/env sh

TOOLS=./build/tools

$TOOLS/caffe train \
  --solver=examples/cifar10/cifar10_quick_solver.prototxt

# reduce learning rate by factor of 10 after 8 epochs
$TOOLS/caffe train \
  --solver=examples/cifar10/cifar10_quick_solver_lr1.prototxt \
  --snapshot=examples/cifar10/cifar10_quick_iter_4000.solverstate.h5

This way you do not have to stop and lower the learning rate by hand every time.

For large models, dropping the learning rate more than once matters. Experiments show that once the loss stops decreasing at the current learning rate, dropping it again can push the loss down further.

If the learning rate is too large, the optimizer overshoots and never settles at the minimum; if it is too small, it cannot escape local optima. So start with a relatively large learning rate to avoid getting stuck in a local optimum early on.

-------------------------------------------------------------------------------------------------------------------------------------------

3. Configuring the drops in the solver file

Of course, the learning-rate drops can also be set up in the solver configuration file.

http://caffe.berkeleyvision.org/tutorial/solver.html (the official Caffe solver tutorial):

To use a learning rate policy like this, you can put the following lines somewhere in your solver prototxt file:

base_lr: 0.01     # begin training at a learning rate of 0.01 = 1e-2

lr_policy: "step" # learning rate policy: drop the learning rate in "steps"
                  # by a factor of gamma every stepsize iterations

gamma: 0.1        # drop the learning rate by a factor of 10
                  # (i.e., multiply it by a factor of gamma = 0.1)

stepsize: 100000  # drop the learning rate every 100K iterations

max_iter: 350000  # train for 350K iterations total

momentum: 0.9

Under the above settings, we'll always use momentum μ = 0.9. We'll begin training at a base_lr of α = 0.01 = 10⁻² for the first 100,000 iterations, then multiply the learning rate by gamma (γ) and train at α′ = αγ = (0.01)(0.1) = 0.001 = 10⁻³ for iterations 100K–200K, then at α′′ = 10⁻⁴ for iterations 200K–300K, and finally train until iteration 350K (since we have max_iter: 350000) at α′′′ = 10⁻⁵.

Note that the momentum setting μ effectively multiplies the size of your updates by a factor of 1/(1 − μ) after many iterations of training, so if you increase μ, it may be a good idea to decrease α accordingly (and vice versa).

For example, with μ = 0.9, we have an effective update size multiplier of 1/(1 − 0.9) = 10. If we increased the momentum to μ = 0.99, we've increased our update size multiplier to 100, so we should drop α (base_lr) by a factor of 10.
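
The 1/(1 − μ) factor comes from summing the geometric series of momentum contributions; a quick sketch, assuming the gradient g stays roughly constant over many iterations:

% SGD with momentum accumulates updates V_t = \mu V_{t-1} + \alpha g_t (g_t: gradient).
% If g_t \approx g is roughly constant, the update size approaches a fixed point:
\[
  V_\infty \approx \alpha g \sum_{k=0}^{\infty} \mu^{k}
           = \frac{\alpha g}{1 - \mu},
  \qquad
  \mu = 0.9 \Rightarrow \frac{1}{1 - 0.9} = 10,
  \quad
  \mu = 0.99 \Rightarrow \frac{1}{1 - 0.99} = 100 .
\]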

Note also that the above settings are merely guidelines, and they’re definitely not guaranteed to be optimal (or even work at all!) in every situation. If learning diverges (e.g., you start to see very large or NaN or inf loss values or outputs), try dropping the base_lr (e.g., base_lr: 0.001) and re-training, repeating this until you find a base_lr value that works.
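
With a step policy like the one above, a single training command covers all of the drops, and a snapshot is only needed if the run gets interrupted: the restored solver state includes the current iteration, so the schedule continues where it left off. A minimal sketch, again with placeholder paths (step_solver.prototxt and the snapshot name are just examples):

#!/usr/bin/env sh
# Single run: the solver's lr_policy "step" lowers the learning rate
# automatically every `stepsize` iterations, with no extra solver files needed.
./build/tools/caffe train \
    --solver=path/to/step_solver.prototxt

# If the run is interrupted, resume from the most recent snapshot; the
# restored iteration count keeps the learning-rate schedule consistent.
./build/tools/caffe train \
    --solver=path/to/step_solver.prototxt \
    --snapshot=path/to/my_net_iter_150000.solverstate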
