使用訓練好的caffe模型分類圖片(python版)

阿新 • • 發佈：2018-11-10

英文官方文件：http://nbviewer.jupyter.org/github/BVLC/caffe/blob/master/examples/00-classification.ipynb

匯入python caffe包

import numpy as np
import matplotlib.pyplot as plt
# display plots in this notebook
%matplotlib inline

# set display defaults
plt.rcParams['figure.figsize'] = (10, 10)        # large images
plt.rcParams['image.interpolation'] = 'nearest'  # don't interpolate: show square pixels
plt.rcParams['image.cmap'] = 'gray'  # use grayscale output rather than a (potentially misleading) color heatmap


import sys
import os
caffe_root = './'  #指定caffe的根目錄 
sys.path.insert(0, caffe_root + 'python')    #將caffe python介面檔案路徑新增到python path中
import caffe

# 判斷model檔案是否存在
if os.path.isfile(caffe_root + 'models/bvlc_reference_caffenet/bvlc_reference_caffenet.caffemodel'):
    print 'CaffeNet found.'
else:
    print 'Downloading pre-trained CaffeNet model...'

載入網路，建立輸入處理

使用python caffe.io.loadImage介面讀取圖片，返回的是[0-1]返回的np.float32陣列

def load_image(filename, color=True):
    """
    Load an image converting from grayscale or alpha as needed.

    Parameters
    ----------
    filename : string
    color : boolean
        flag for color format. True (default) loads as RGB while False
        loads as intensity (if image is already grayscale).

    Returns
    -------
    image : an image with type np.float32 in range [0, 1]
        of size (H x W x 3) in RGB or
        of size (H x W x 1) in grayscale.
    """
    img = skimage.img_as_float(skimage.io.imread(filename, as_grey=not color)).astype(np.float32)
    if img.ndim == 2:
        img = img[:, :, np.newaxis]
        if color:
            img = np.tile(img, (1, 1, 3))
    elif img.shape[2] == 4:
        img = img[:, :, :3]
    return img

python Transformer介面會對load_image讀取的圖片做處理，注意raw_scale實在減去均值和其他處理之前，而input_scale實在這些操作之後

    def preprocess(self, in_, data):
        """
        Format input for Caffe:
        - convert to single
        - resize to input dimensions (preserving number of channels)
        - transpose dimensions to K x H x W
        - reorder channels (for instance color to BGR)
        - scale raw input (e.g. from [0, 1] to [0, 255] for ImageNet models)
        - subtract mean
        - scale feature

        Parameters
        ----------
        in_ : name of input blob to preprocess for
        data : (H' x W' x K) ndarray

        Returns
        -------
        caffe_in : (K x H x W) ndarray for input to a Net
        """
        self.__check_input(in_)
        caffe_in = data.astype(np.float32, copy=False)
        transpose = self.transpose.get(in_)
        channel_swap = self.channel_swap.get(in_)
        raw_scale = self.raw_scale.get(in_)
        mean = self.mean.get(in_)
        input_scale = self.input_scale.get(in_)
        in_dims = self.inputs[in_][2:]

        #1 resize大小
        if caffe_in.shape[:2] != in_dims:   
            caffe_in = resize_image(caffe_in, in_dims)
        
        #2 維度變換，H*W*C轉換成  C*H*W
        if transpose is not None:     
            caffe_in = caffe_in.transpose(transpose)
        
        #3 通道變換
        if channel_swap is not None: #RGB 
            caffe_in = caffe_in[channel_swap, :, :]
       
        #4 raw_scale 讀取的圖片數值範圍在[0,1]時，raw_scale = 255,轉換成[0,255]
        if raw_scale is not None:
            caffe_in *= raw_scale
        
        #5 減去均值
        if mean is not None:   
            caffe_in -= mean

        # input_scale = 0.00390625時， 圖片資料轉換成[0,1] 
        if input_scale is not None:
            caffe_in *= input_scale
        return caffe_in

# 使用cpu計算
caffe.set_mode_cpu()

model_def = caffe_root + 'models/bvlc_reference_caffenet/deploy.prototxt'
model_weights = caffe_root + 'models/bvlc_reference_caffenet/bvlc_reference_caffenet.caffemodel'

# 載入網路
net = caffe.Net(model_def,      # 模型定義檔案
                model_weights,  # 模型引數檔案
                caffe.TEST)     # 啟用測試模式 (e.g., don't perform dropout)

# 載入均值檔案,mu的shape是(3,256,256), mean(1)實在第一個維度上做均值，返回shape為(3,256)
# 再mean(1)後，返回形狀是(3),分別是rgb三個通道上均值
mu = np.load(caffe_root + 'python/caffe/imagenet/ilsvrc_2012_mean.npy')
mu = mu.mean(1).mean(1) 
print 'mean-subtracted values:', zip('BGR', mu)
#mean-subtracted values: [('B', 104.0069879317889), ('G', 116.66876761696767), ('R', 122.6789143406786)]

# create transformer for the input called 'data'
# 建立一個轉換器，名字叫‘data’
transformer = caffe.io.Transformer({'data': net.blobs['data'].data.shape})

# transformer會將channels變成最外面的維度， 即 (H，W，C) 變成(C, W, C)
transformer.set_transpose('data', (2,0,1))  
transformer.set_mean('data', mu)            # 每個通道上減去均值
transformer.set_raw_scale('data', 255)      # 從[0, 1]的範圍放大到[0, 255]
transformer.set_channel_swap('data', (2,1,0))  #修改通道順序，從RGB變成BGR

使用CPU分類

# 為了演示批處理，將輸入的batch size修改成50
net.blobs['data'].reshape(50,        # batch size
                          3,         # 3通道
                          227, 227)  # 圖片大小為 227x227

# caffe.io.load_image讀取圖片值的範圍是0-1，cv2.imread讀取圖片值的範圍是0-255
image = caffe.io.load_image(caffe_root + 'examples/images/cat.jpg')
# transformer進行圖片預處理，包括圖片值轉換到0-255
transformed_image = transformer.preprocess('data', image)
plt.imshow(image)



# 圖片資料拷貝到net申請記憶體中
net.blobs['data'].data[...] = transformed_image

### 前向傳播，執行圖片分類。
output = net.forward()
# top blob可能有多個，使用'prob'索引，後面的0表示第一張圖片的輸出
output_prob = output['prob'][0]  
# 獲取分類編號
print 'predicted class is:', output_prob.argmax()
# 輸出predicted class is: 281

驗證分裂是否正確是否正確

# 載入imageNet的label檔案
labels_file = caffe_root + 'data/ilsvrc12/synset_words.txt'
if not os.path.exists(labels_file):
    !../data/ilsvrc12/get_ilsvrc_aux.sh
    
labels = np.loadtxt(labels_file, str, delimiter='\t')

print 'output label:', labels[output_prob.argmax()]
# 輸出內容   output label: n02123045 tabby, tabby cat


# sort預設升序排列，反轉後全最大前五個
top_inds = output_prob.argsort()[::-1][:5]  # reverse sort and take five largest items

print 'probabilities and labels:'
zip(output_prob[top_inds], labels[top_inds])

'''[(0.31243637, 'n02123045 tabby, tabby cat'),
 (0.2379719, 'n02123159 tiger cat'),
 (0.12387239, 'n02124075 Egyptian cat'),
 (0.10075711, 'n02119022 red fox, Vulpes vulpes'),
 (0.070957087, 'n02127052 lynx, catamount')]  
'''

使用GPU模式

# CPU計算耗時
%timeit net.forward()
# 1 loop, best of 3: 1.42 s per loop


# 設定使用gpu，有多個gpu時使用編號的gpu
caffe.set_device(0)  # if we have multiple GPUs, pick the first one
caffe.set_mode_gpu()
net.forward()  # run once before timing to set up memory
%timeit net.forward()
# 10 loops, best of 3: 70.2 ms per loop

使用訓練好的caffe模型分類圖片(python版)

英文官方文件：http://nbviewer.jupyter.org/github/BVLC/caffe/blob/master/examples/00-classification.ipynb 匯入python caffe包 import numpy as np im

用caffe自帶的訓練好的模型測試圖片的分類結果，實現啦啦啦

1、caffemodel檔案下載可以直接在瀏覽器裡輸入地址下載，也可以執行指令碼檔案下載。下載地址為：http://dl.caffe.berkeleyvision.org/bvlc_reference_caffenet.caffemodel 檔名稱為：b

Caffe用訓練好的模型測試圖片

這是一個python指令碼，用訓練好的caffemodel來測試圖片，接下來直接上程式碼，裡面有詳細解釋，大部分你要修改的只是路徑，另外在這個指令碼的基礎上你可以根據自己的需要進行改動。需要的東西：訓練好的caffemodel，deploy.prototxt

caffe 用訓練好的模型提取圖片特徵（使用自帶classify.py和classifier.py）

原材料： 1）訓練好的caffemodel 2) 定義網路結構的deploy.prototxt配置檔案 3）訓練時使用的mean檔案，在/cafferoot/python/classify.py的demo中，要求使用的是.npy格式的meanfile，如果我們手上有的是

caffe的python介面學習（6）：用訓練好的模型（caffemodel）來分類新的圖片

#coding=utf-8import caffeimport numpy as nproot='/home/xxx/' #根目錄deploy=root + 'mnist/deploy.prototxt' #deploy檔案caffe_model=root + 'mnist/lenet_iter

有關Caffe訓練好的模型在Python介面下使用分類不準確的問題解決

之前使用caffe訓練了1k個自己的資料,有3個分類,在consol下面訓練加驗證的結果是85%左右的準確率,還是可以的. 但是問題是,當使用了Python介面,匯入caffemodel檔案和npy均值檔案後,分類結果完全慘不忍睹,全部都偏向第一分類. 經過不懈的googl

使用訓練好的caffe模型識別圖片

這裡記錄如何用訓練好的caffe模型來對測試圖片進行識別。下載訓練好的caffemodel 首先需要一個訓練好的caffemodel，這裡我選用的是caffe官方提供的caffemodel，該模型擁有較多標籤，經過大量的資料訓練得到的。下載地址：http:/

利用caffe訓練好的模型測試自己的手寫字型圖片

轉載地址： http://blog.csdn.net/xunan003/article/details/73126425 一、前沿寫這篇博文，是因為一開始在做《21天學習caffe》第6天6.4練習題1的時候看著自己搜尋的博文，在不理解其根本的情況下做的

Caffe：利用訓練好的模型進行分類

以大神訓練好的模型為基礎，利用自己的資料進行了finetune之後，下一步就可以真正使用模型來進行分類操作了。具體步驟如下： 1. 編輯分類網路的配置檔案deploy.prototxt deploy檔案是真正使用模型時候用的，其結構與train_v

tensorflow 1.0 學習：用別人訓練好的模型來進行圖像分類

ima ppi gin 什麽 dir targe spl flow blog 谷歌在大型圖像數據庫ImageNet上訓練好了一個Inception-v3模型，這個模型我們可以直接用來進來圖像分類。下載地址：https://storage.googleapis.com/d

Python Word2Vec使用訓練好的模型生成詞向量

https 一起失效 com mode 密碼 pytho ID list # 文本文件必須是utf-8無bom格式 from gensim.models.deprecated.word2vec import Word2Vec model = Word2Vec.lo

Tensorflow學習教程------利用卷積神經網路對mnist資料集進行分類_利用訓練好的模型進行分類

#coding:utf-8 import tensorflow as tf from PIL import Image,ImageFilter from tensorflow.examples.tutorials.mnist import input_data def imageprepare(ar

TensorFlow 呼叫預訓練好的模型—— Python 實現

1. 準備預訓練好的模型 TensorFlow 預訓練好的模型被儲存為以下四個檔案 data 檔案是訓練好的引數值，meta 檔案是定義的神經網路圖，checkpoint 檔案是所有模型的儲存路

將python訓練好的模型儲存為pmml檔案供java呼叫

1、PMLL概述用python訓練好的機器學習模型如果上線部署，被java呼叫，可以將模型儲存為pmml檔案，那麼什麼是pmml呢？PMML是資料探勘的一種通用的規範，它用統一的XML格式來描述我們生成的機器學習模型。這樣無論你的模型是sklearn,R還是Sp

PyTorch(三)——使用訓練好的模型測試自己圖片

PyTorch的學習和使用（三）在上一篇文章中實現瞭如何增加一個自定義的Loss，以Siamese network為例。現在實現使用訓練好的該網路對自己手寫的數字圖片進行測試。首先需要對訓練時的權

深度學習Caffe實戰筆記（21）Windows平臺 Faster-RCNN 訓練好的模型測試資料

前一篇部落格介紹瞭如何利用Faster-RCNN訓練自己的資料集，訓練好會得到一個模型，這篇部落格介紹如何利用訓練好的模型進行測試資料。 1、訓練好的模型存放位置訓練好的模型存放在faster_rcnn-master\output\faster_rcnn_

DL開源框架Caffe | 用訓練好的模型對資料進行預測

一句話理解Caffe：　　Caffe的萬丈高樓（Net）是按照我們設計的圖紙（prototxt），用很多磚塊（Blob）築成一層層（Layer）樓房，最後通過某些手段（Solver）進行簡裝修（Train）/精裝修（Finetune）實現的，另外每個樓層都可

tensorflow 1.0 學習：用Google訓練好的模型來進行影象分類

谷歌在大型影象資料庫ImageNet上訓練好了一個Inception-v3模型，這個模型我們可以直接用來進來影象分類。下載地址：github：https://github.com/taey16/tf/tree/master/imagenet下載完解壓後，得到幾個檔案：其中的c

caffe練習例項（3）——使用訓練好的模型

本例項是使用opencv編寫程式碼，使用修改後的mnist的deploy檔案並且呼叫訓練好的模型，輸入一張圖片，輸出分類結果。本工程的所有檔案我都上傳到了github上面，需要的可以下載。具體步驟如下：改寫deploy檔案：把資料層和（Data

tensorflow 1.0 學習：用別人訓練好的模型來進行影象分類

谷歌在大型影象資料庫ImageNet上訓練好了一個Inception-v3模型，這個模型我們可以直接用來進來影象分類。下載完解壓後，得到幾個檔案：其中的classify_image_graph_def.pb 檔案就是訓練好的Inception-v3模型。 imagenet_synset_to_h

使用訓練好的caffe模型分類圖片(python版)

相關推薦