用pycaffe訓練影象

阿新 • • 發佈：2018-11-27

廢話不多說，本文在python下呼叫caffe來訓練，由於python下圖片轉lmdb比較複雜，所以就直接使用了windows下的介面。如果不會搭建caffe包的，移步這https://blog.csdn.net/zb1165048017/article/details/52980102

資料集是一個二分類的資料集，主要是人臉和非人臉，連結：https://pan.baidu.com/s/1WCErudFafJjP2V1edpV5_g 密碼：q85k

要跑網路，我們要先構建自己的網路，由於資料集圖片是60*60，所以我們沒必要跑太複雜的網路，所以我寫了個比較簡單的網路，最後準確率能達到98%

import caffe
from caffe import layers as L,params as P
#定義你的網路層
def myLayer(lmdb,batch_size,is_deploy):
    n=caffe.NetSpec()
#這裡的source填入你的資料來源lmdb
    n.data,n.label=L.Data(batch_size=batch_size,backend=P.Data.LMDB,source=lmdb,
                          transform_param=dict(scale=1./255),ntop=2)
    n.conv1 = L.Convolution(n.data, kernel_size=3,stride=2, num_output=20, 
                            weight_filler=dict(type='xavier'))
    #特徵圖變為 30*30 
    n.relu1=L.ReLU(n.conv1,in_place=True)
    n.conv2 = L.Convolution(n.relu1, kernel_size=3,stride=2, num_output=60, 
                            weight_filler=dict(type='xavier'))
    #15*15
    n.relu2=L.ReLU(n.conv2,in_place=True)
    n.conv3 = L.Convolution(n.relu1, kernel_size=3,stride=2, num_output=90, 
                            weight_filler=dict(type='xavier'))
    n.relu3=L.ReLU(n.conv3,in_place=True)
    n.score=L.InnerProduct(n.relu3,num_output=2,weight_filler=dict(type='xavier'))
    n.loss=L.SoftmaxWithLoss(n.score,n.label)
    return n.to_proto()
#上面的僅僅是定義，下面的是把網路寫入本地的prototxt檔案中
def writeLayer():
    with open('train.prototxt', 'w') as f:
        f.write(str(myLayer('train_lmdb', 100,0)))
    with open('test.prototxt', 'w') as f:
        f.write(str(myLayer('test_lmdb', 10,0)))

寫完網路層，就是寫solver，solver有兩種方法來定義，我一開始用的第一種writeSolver(),可老是報錯，報錯的原因貌似是pycaffe識別不了“snapshot_prefix”這個屬性，於是我後來採用了第二張寫法writeSolver_2()

def writeSolver():
    solverprototxt=tools.CaffeSolver(trainnet_prototxt_path='train.prototxt',
                                    testnet_prototxt_path='test.prototxt')
#在python下，你可以沒必要每個都去定義，它會自己初始化
    solverprototxt.sp['base_lr']='0.001'
    solverprototxt.sp['weight_decay']='0.001'
    solverprototxt.sp['gamma']='0.0001'
    solverprototxt.sp['power']='0.001'
    solverprototxt.sp['display']='1000'
    solverprototxt.sp['test_iter']='100'
    solverprototxt.sp['max_iter']='5000'
    solverprototxt.sp['lr_policy']="step"
    solverprototxt.sp['snapshot']="1000"
#下面這個snapshot_prefix我把他去掉就能正常跑，加上就一直報錯
#    solverprototxt.sp['snapshot_prefix']="rr"
    solverprototxt.sp['display'] = "1"
    solverprototxt.sp['max_iter'] = "1000"
    solverprototxt.write('solver.prototxt')

from caffe.proto import caffe_pb2
def writeSolver_2():
    s=caffe_pb2.SolverParameter()
    s.train_net = 'train.prototxt'     # 訓練配置檔案
    s.test_net.append('test.prototxt')  # 測試配置檔案
    s.test_interval = 200                   # 測試間隔
    s.test_iter.append(1)                 # 測試迭代次數
    s.max_iter = 78200                      # 最大迭代次數
    s.base_lr = 0.001                       # 基礎學習率
    s.momentum = 0.9                        # momentum係數
    s.weight_decay = 5e-4                   # 權值衰減係數
    s.lr_policy = 'step'                    # 學習率衰減方法
    s.stepsize=26067                        # 此值僅對step方法有效
    s.gamma = 0.1                           # 學習率衰減指數
    s.display = 782                         # 螢幕日誌顯示間隔
    s.snapshot = 2000
    s.snapshot_prefix = 'shapshot'
    s.type = "SGD"                         # 優化演算法
    s.solver_mode = caffe_pb2.SolverParameter.GPU
    with open("solver.prototxt","w") as f:
        f.write(str(s))

現在如果分別呼叫上面兩個函式，就可以在本地生成三個檔案（1個solver，另外兩個分別為訓練階段和測試階段的），如圖

好，一切就緒，下面就是跑網路了，如果單純地想讓網路跑起來的話，下面兩句就行了，step就是讓caffe跑的步數

solver=caffe.get_solver('solver.prototxt')
solver.step(200)

下面我給的訓練包括輸出準確率以及視覺化

def train_layer():
    caffe.set_device(0)
    caffe.set_mode_gpu()
    solver=caffe.get_solver('solver.prototxt')
    niter = 3000
    x_label=[0]
    y_acc=[0]
    acc=0
    for it in range(niter):
        solver.step(1)  # SGD by Caffe   
        solver.test_nets[0].forward()    
#a為最後的score，由於測試批次是10，所以這個a是個1*10的陣列，對應的b也是一個1*10的陣列
        a=solver.test_nets[0].blobs['score'].data.argmax(1)
        b=solver.test_nets[0].blobs['label'].data   
        for j in range(10):
            if(a[j]==b[j]):
                acc+=1                 
        if(it%40==0):
            print '第',it,'次迭代，準確率為：',Decimal(float(acc)/float(10*(it+1))).quantize(Decimal('0.000'))
            x_label.append(it)
            y_acc.append(Decimal(float(acc)/float(10*(it+1))).quantize(Decimal('0.000')))
    plt.plot(x_label, y_acc)
    plt.show()

ok,下面上一段完整的程式碼

import caffe
from caffe import layers as L,params as P
import sys
sys.path.append('F:/caffe/caffe-master/examples/pycaffe')
import tools
def writeSolver():
    solverprototxt=tools.CaffeSolver(trainnet_prototxt_path='train.prototxt',
                                    testnet_prototxt_path='test.prototxt')
#    solverprototxt.sp['base_lr']='0.001'
#    solverprototxt.sp['weight_decay']='0.001'
#    solverprototxt.sp['gamma']='0.0001'
#    solverprototxt.sp['power']='0.001'
#    solverprototxt.sp['display']='1000'
#    solverprototxt.sp['test_iter']='100'
#    solverprototxt.sp['max_iter']='5000'
#    solverprototxt.sp['lr_policy']="step"
#    solverprototxt.sp['snapshot']="1000"
##    solverprototxt.sp['snapshot_prefix']="rr"
#    solverprototxt.sp['display'] = "1"
#    solverprototxt.sp['max_iter'] = "1000"
    solverprototxt.write('solver.prototxt')

from caffe.proto import caffe_pb2
def writeSolver_2():
    s=caffe_pb2.SolverParameter()
    s.train_net = 'train.prototxt'     # 訓練配置檔案
    s.test_net.append('test.prototxt')  # 測試配置檔案
    s.test_interval = 200                   # 測試間隔
    s.test_iter.append(1)                 # 測試迭代次數
    s.max_iter = 78200                      # 最大迭代次數
    s.base_lr = 0.001                       # 基礎學習率
    s.momentum = 0.9                        # momentum係數
    s.weight_decay = 5e-4                   # 權值衰減係數
    s.lr_policy = 'step'                    # 學習率衰減方法
    s.stepsize=26067                        # 此值僅對step方法有效
    s.gamma = 0.1                           # 學習率衰減指數
    s.display = 782                         # 螢幕日誌顯示間隔
    s.snapshot = 2000
    s.snapshot_prefix = 'shapshot'
    s.type = "SGD"                         # 優化演算法
    s.solver_mode = caffe_pb2.SolverParameter.GPU
    with open("solver.prototxt","w") as f:
        f.write(str(s))
    




#original img is 60*60
def myLayer(lmdb,batch_size,is_deploy):
    n=caffe.NetSpec()
    n.data,n.label=L.Data(batch_size=batch_size,backend=P.Data.LMDB,source=lmdb,
                          transform_param=dict(scale=1./255),ntop=2)
    n.conv1 = L.Convolution(n.data, kernel_size=3,stride=2, num_output=20, 
                            weight_filler=dict(type='xavier'))
    #特徵圖變為 30*30 
    n.relu1=L.ReLU(n.conv1,in_place=True)
    n.conv2 = L.Convolution(n.relu1, kernel_size=3,stride=2, num_output=60, 
                            weight_filler=dict(type='xavier'))
    #15*15
    n.relu2=L.ReLU(n.conv2,in_place=True)
    n.conv3 = L.Convolution(n.relu1, kernel_size=3,stride=2, num_output=90, 
                            weight_filler=dict(type='xavier'))
    n.relu3=L.ReLU(n.conv3,in_place=True)
    n.score=L.InnerProduct(n.relu3,num_output=2,weight_filler=dict(type='xavier'))
    n.loss=L.SoftmaxWithLoss(n.score,n.label)
    return n.to_proto()

def writeLayer():
    with open('train.prototxt', 'w') as f:
        f.write(str(myLayer('train_lmdb', 100,0)))
    with open('test.prototxt', 'w') as f:
        f.write(str(myLayer('test_lmdb', 10,0)))
import matplotlib.pyplot as plt
import numpy as np
from decimal import Decimal
def train_layer():
    caffe.set_device(0)
    caffe.set_mode_gpu()
    solver=caffe.get_solver('F:/python_project/8_2/solver.prototxt')
    niter = 3000
    x_label=[0]
    y_acc=[0]
    acc=0
    for it in range(niter):
        solver.step(1)  # SGD by Caffe   
        solver.test_nets[0].forward()    
        a=solver.test_nets[0].blobs['score'].data.argmax(1)
        b=solver.test_nets[0].blobs['label'].data   
        for j in range(10):
            if(a[j]==b[j]):
                acc+=1                 
        if(it%40==0):
            print '第',it,'次迭代，準確率為：',Decimal(float(acc)/float(10*(it+1))).quantize(Decimal('0.000'))
            x_label.append(it)
            y_acc.append(Decimal(float(acc)/float(10*(it+1))).quantize(Decimal('0.000')))
    plt.plot(x_label, y_acc)
    plt.show()
     

writeSolver()
writeLayer()
#solver=caffe.get_solver('F:/python_project/8_2/solver.prototxt')
#solver.step(200) 
train_layer()

下面是我的結果

用pycaffe訓練影象

廢話不多說，本文在python下呼叫caffe來訓練，由於python下圖片轉lmdb比較複雜，所以就直接使用了windows下的介面。如果不會搭建caffe包的，移步這https://blog.csdn.net/zb1165048017/article/details/52980102 資料集

Tensorflow 的安裝和用InceptionV3訓練新的影象分類模型

Tensorflow的安裝 1.Tensorflow簡介 Tensorflow是一個谷歌釋出的人工智慧開發工具，於2015年年底開源。在開源之前一直是在谷歌內部使用，維護性比較好，裡面的很多工具也比較新。Tensorflow是採用C++和python寫成的，給的介面也是C+

tensorflow 1.0 學習：用Google訓練好的模型來進行影象分類

谷歌在大型影象資料庫ImageNet上訓練好了一個Inception-v3模型，這個模型我們可以直接用來進來影象分類。下載地址：github：https://github.com/taey16/tf/tree/master/imagenet下載完解壓後，得到幾個檔案：其中的c

tensorflow 1.0 學習：用別人訓練好的模型來進行影象分類

谷歌在大型影象資料庫ImageNet上訓練好了一個Inception-v3模型，這個模型我們可以直接用來進來影象分類。下載完解壓後，得到幾個檔案：其中的classify_image_graph_def.pb 檔案就是訓練好的Inception-v3模型。 imagenet_synset_to_h

Tensorflow用別人訓練好的模型進行影象分類（可執行）

【先說一下自己想說的】：昨晚上找了很久才搞定，程式碼和給的檔案根本不匹配，轉載也不驗證一下就轉。弄得我花了一整天！（我就為了加個單擊圖片顯示可能的標籤這麼個功能我……我容易嗎……555）原帖：http://www.cnblogs.com/denny402/p/694258

Tensorflow學習（7）用別人訓練好的模型進行影象分類

其中的classify_image_graph_def.pb 檔案就是訓練好的Inception-v3模型。 imagenet_synset_to_human_label_map.txt是類別檔案。隨機找一張圖片：如對這張圖片進行識別，看它屬於

tensorflow學習筆記十一：用別人訓練好的模型來進行影象分類

谷歌在大型影象資料庫ImageNet上訓練好了一個Inception-v3模型，這個模型我們可以直接用來進來影象分類。下載完解壓後，得到幾個檔案：其中的classify_image_graph_def.pb 檔案就是訓練好的Inception-v3模型。imagenet_sy

深度學習tensorflow實戰筆記（5）用預訓練好的VGG-16模型提取影象特徵

上一篇部落格介紹瞭如果使用自己訓練好的模型用於影象分類和特徵提取，但是有時候自己的資料集大小有限，所以更多的時候我們需要用VGG-16預訓練好的模型提取特徵，相關學者預訓練好的模型使用的都是公開的標準資料集，所以我們直接用預訓練的模型提取我們自己影象的特徵，可以用於

Caffe上用SSD訓練和測試自己的數據

輸出 makefile b數 text play cal 上下 lba san 學習caffe第一天，用SSD上上手。我的根目錄$caffe_root為/home/gpu/ljy/caffe 一、運行SSD示例代碼 1.到https://github.com

tensorflow 1.0 學習：用別人訓練好的模型來進行圖像分類

ima ppi gin 什麽 dir targe spl flow blog 谷歌在大型圖像數據庫ImageNet上訓練好了一個Inception-v3模型，這個模型我們可以直接用來進來圖像分類。下載地址：https://storage.googleapis.com/d

keras調用預訓練模型分類

dict 拓展 span 類別就是 num pan 維度上下在網上看到一篇博客，地址https://www.pyimagesearch.com/2017/03/20/imagenet-vggnet-resnet-inception-xception-keras/，是關

用TensorFlow訓練卷積神經網路——識別驗證碼

需要用到的包：numpy、tensorflow、captcha、matplotlib、PIL、random import numpy as np import tensorflow as tf # 深度學習庫 from captcha.image import ImageCaptcha

用Python做影象處理

用Python做影象處理用Python做影象處理最近在做一件比較 evil 的事情——驗證碼識別，以此來學習一些新的技能。因為我是初學，對影象處理方面就不太瞭解了，欲要利吾事，必先利吾器

用caffe訓練自己的資料集(三)

本文主要參考了：https://blog.csdn.net/heimu24/article/details/53581362 https://blog.csd

用caffe訓練自己的資料集(二)

本文主要參考了：https://blog.csdn.net/heimu24/article/details/53581362 https://blog.c

用caffe訓練自己的資料集(一)

本文主要參考了：https://blog.csdn.net/heimu24/article/details/53581362 https://blog.csd

Spark Mlib(三)用spark訓練詞向量

自然語言處理中，在詞的表示上，向量的方式無疑是最流行的一種。它可以作為神經網路的輸入，也可直接用來計算。比如計算兩個詞的相似度時，就可以用這兩個詞向量的距離來衡量。詞向量的訓練需要大規模的語料，從而帶來的是比較長的訓練時間。spark框架基於記憶體計算，有忘加快詞向量的訓練速度。以下是sp

0004-用OpenCV實現影象平移的程式碼(分影象尺寸不變和變兩種情況)

影象平移是啥東西就不用講了吧!需要注意的是影象平移有兩種，第一種是平移後圖像大小不變，這樣會損失影象的部分；第二種是平移後圖像大小變化，這樣原影象不會有損失。直接上程式碼，大家看效果吧！程式碼流程如下：讀取影象→顯示原影象→呼叫自定義的函式translateTransform，作平移後

Win10用yolov3訓練自己的資料

哈哈，我們的效率還是很棒的，先自誇一下~廢話不多說，下面就是正宮娘娘：接上次的部落格（yolo環境配好以後）製作自己的資料集首先就是製作資料集啦，我們是自己在校園裡面拍的共享單車，訓練集大概有兩三百張的樣子，還留了一小部分估計也有一百張的樣子做測試集。當然也有SAMA的部落格直

用Python將影象裁剪

用Python將影象裁剪 # -*- coding: utf-8 -*- """ Created on Tue May 15 19:08:03 2018 @author: win7 """ import matplotlib.pyplot as plt from PIL import

用pycaffe訓練影象

相關推薦