TensorFlow: Generating a .pb File and Loading a .pb File (Transfer Learning)

阿新 • Published: 2019-01-09

This blog post explains how to train a model with TensorFlow and export it as a .pb file, and then how to load that .pb file again.

train
Let's start with training. The first step, of course, is reading the images.

Use io.imread to read each image, resize it to the VGG input size (224, 224, 3), and append the image and its label to data and label respectively.

def read_img(path):
    # Every sub-folder of path is one class; its position in cate is the label.
    cate   = [path + x for x in os.listdir(path) if os.path.isdir(path + x)]
    imgs   = []
    labels = []
    for idx, folder in enumerate(cate):
        for im in glob.glob(folder + '/*.jpg'):
            print('reading the image: %s' % (im))
            img = io.imread(im)
            img = transform.resize(img, (w, h, c))
            imgs.append(img)
            labels.append(idx)
    return np.asarray(imgs, np.float32), np.asarray(labels, np.int32)
data, label = read_img(path)
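The folder-to-label convention used by read_img can be tried without any image library. The sketch below is only an illustration: the helper name list_labeled_images and the cat/dog folder names are assumptions, and the folders are sorted so that the label indices stay the same between runs (os.listdir does not guarantee an order).

```python
import glob
import os
import tempfile


def list_labeled_images(root, pattern="*.jpg"):
    """Return (paths, labels, class_names) for images stored one folder per class."""
    # Sort the folder names so that label indices are stable between runs.
    class_names = sorted(
        d for d in os.listdir(root) if os.path.isdir(os.path.join(root, d))
    )
    paths, labels = [], []
    for idx, name in enumerate(class_names):
        for p in sorted(glob.glob(os.path.join(root, name, pattern))):
            paths.append(p)
            labels.append(idx)
    return paths, labels, class_names


# Tiny demonstration on a throwaway directory tree with empty .jpg files.
with tempfile.TemporaryDirectory() as root:
    for cls, n in (("cat", 2), ("dog", 3)):
        os.makedirs(os.path.join(root, cls))
        for i in range(n):
            open(os.path.join(root, cls, "%s%d.jpg" % (cls, i)), "wb").close()
    paths, labels, class_names = list_labeled_images(root)

print(class_names, labels)
```

Keeping class_names around is handy later, when a predicted index has to be turned back into a readable label.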

Next, shuffle the images: generate an index sequence, shuffle it, and use it to reorder both data and label.

num_example = data.shape[0]
arr = np.arange(num_example)
np.random.shuffle(arr)
data = data[arr]
label = label[arr]
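The point of indexing both arrays with the same shuffled index is that every image keeps its own label. Here is a small self-contained check of that idea on fake data; the seed and the toy arrays are assumptions made for the demonstration.

```python
import numpy as np

rng = np.random.RandomState(0)                         # fixed seed so the run is repeatable
data = np.arange(10, dtype=np.float32).reshape(5, 2)   # 5 fake "images" of 2 values each
label = data[:, 0].astype(np.int32) // 2               # row i gets label i

perm = rng.permutation(len(data))                      # one shared permutation of row indices
data_shuffled, label_shuffled = data[perm], label[perm]

# Every row still carries its own label after shuffling.
still_aligned = np.array_equal(
    data_shuffled[:, 0].astype(np.int32) // 2, label_shuffled
)
print(still_aligned)
```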

80% of the data is used for training and the remaining 20% for testing (even though there are only 30 images in total...).

ratio = 0.8
s = int(num_example * ratio)  # np.int was removed from recent NumPy releases
x_train = data[:s]
y_train = label[:s]
x_val   = data[s:]
y_val   = label[s:]
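The same split can be wrapped in a small function, which makes it easy to check the sizes. This is just a sketch; the name split_train_val and the dummy arrays are assumptions.

```python
import numpy as np


def split_train_val(data, label, ratio=0.8):
    """Split already-shuffled arrays into a training part and a validation part."""
    s = int(len(data) * ratio)  # built-in int() instead of the removed np.int
    return (data[:s], label[:s]), (data[s:], label[s:])


data = np.zeros((30, 4), dtype=np.float32)   # 30 dummy samples
label = np.arange(30, dtype=np.int32)
(x_train, y_train), (x_val, y_val) = split_train_val(data, label)
print(len(x_train), len(x_val))
```

With 30 samples and a ratio of 0.8, this gives 24 training samples and 6 validation samples.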

Now build the VGG model. This step is not hard, but it is best to give every layer a name. The x and y placeholders are the input and the corresponding labels.

def build_network(height, width, channel):
    x = tf.placeholder(tf.float32, shape=[None, height, width, channel], name='input')
    y = tf.placeholder(tf.int64, shape=[None, 2], name='labels_placeholder')

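At the end of the build function comes the loss computation. finaloutput is the final softmax output, cost is the loss, and optimize defines how training is carried out (here, the Adam optimizer). Also note what is returned at the end.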

    finaloutput = tf.nn.softmax(output_fc8, name="softmax")

    # softmax_cross_entropy_with_logits applies softmax itself, so pass the raw fc8 output.
    cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=output_fc8, labels=y))
    optimize = tf.train.AdamOptimizer(learning_rate=1e-4).minimize(cost)

    prediction_labels = tf.argmax(finaloutput, axis=1, name="output")
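    # y is one-hot with shape [None, 2]; convert it to class indices before comparing.
    read_labels = tf.argmax(y, axis=1)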

    correct_prediction = tf.equal(prediction_labels, read_labels)
    accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

    correct_times_in_batch = tf.reduce_sum(tf.cast(correct_prediction, tf.int32))

    return dict(
        x=x,
        y=y,
        optimize=optimize,
        correct_prediction=correct_prediction,
        correct_times_in_batch=correct_times_in_batch,
        cost=cost,
    )
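To make the loss and the accuracy concrete, here is a NumPy-only sketch of the same quantities. It is not the TensorFlow implementation, just the math on two made-up samples. It also shows why a softmax cross-entropy should receive raw logits: feeding it values that have already been through softmax squashes them a second time, and the loss can no longer get close to zero.

```python
import numpy as np


def softmax(z):
    z = z - z.max(axis=1, keepdims=True)   # subtract the row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def cross_entropy(logits, onehot):
    """Mean cross-entropy, applying softmax to the given logits exactly once."""
    log_p = np.log(softmax(logits))
    return float(-(onehot * log_p).sum(axis=1).mean())


logits = np.array([[4.0, 0.0], [0.0, 4.0]])   # confident and correct on both rows
onehot = np.array([[1.0, 0.0], [0.0, 1.0]])

loss_once = cross_entropy(logits, onehot)            # raw logits fed directly
loss_twice = cross_entropy(softmax(logits), onehot)  # probabilities fed as if they were logits

pred = logits.argmax(axis=1)
accuracy = float((pred == onehot.argmax(axis=1)).mean())
print(loss_once, loss_twice, accuracy)
```

Note that the accuracy compares the argmax of the prediction with the argmax of the one-hot label, so both sides are class indices.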

Next comes the training process.

def train_network(graph, batch_size, num_epochs, pb_file_path):
    init = tf.global_variables_initializer()
    with tf.Session() as sess:
        sess.run(init)
        epoch_delta = 2
        for epoch_index in range(num_epochs):
            for i in range(12):
                sess.run([graph['optimize']], feed_dict={
                    graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
                    graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
                })
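The feed for graph['y'] above builds a one-hot row by hand ([[1, 0]] or [[0, 1]]). For more than two classes, or for several labels at once, a small helper does the same job. The function name one_hot is an assumption for this sketch.

```python
import numpy as np


def one_hot(labels, num_classes):
    """Turn integer labels of shape (N,) into a one-hot matrix of shape (N, num_classes)."""
    labels = np.asarray(labels, dtype=np.int64)
    # Row k of the identity matrix is the one-hot vector for class k.
    return np.eye(num_classes, dtype=np.int64)[labels]


print(one_hot([0, 1, 1], 2))
```

A single label then becomes one_hot([y_train[i]], 2), which has the (1, 2) shape the placeholder expects.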

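That is essentially all there is to the training code: set batch_size and num_epochs and train. The rest of the code mainly reports the accuracy every few epochs.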
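Note that the loop above feeds one image per step, so batch_size does not actually control how many images go into each update. If you would rather feed whole batches, slicing the arrays could look like the sketch below. The helper iterate_minibatches is an assumption, not part of the original code.

```python
import numpy as np


def iterate_minibatches(x, y, batch_size):
    """Yield consecutive (x_batch, y_batch) slices; the last batch may be smaller."""
    for start in range(0, len(x), batch_size):
        yield x[start:start + batch_size], y[start:start + batch_size]


x = np.zeros((12, 224, 224, 3), dtype=np.float32)   # 12 dummy images
y = np.arange(12) % 2                               # dummy labels 0/1
sizes = [len(xb) for xb, _ in iterate_minibatches(x, y, batch_size=5)]
print(sizes)
```

Each x_batch already has the [None, 224, 224, 3] layout of the input placeholder, so it could be fed without np.reshape.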

Save the trained model as a .pb file. After running this, a new .pb file appears in the target folder.

constant_graph = graph_util.convert_variables_to_constants(sess, sess.graph_def, ["output"])
with tf.gfile.FastGFile(pb_file_path, mode='wb') as f:
    f.write(constant_graph.SerializeToString())

test

Open the .pb file.

with tf.Graph().as_default():
    output_graph_def = tf.GraphDef()

    with open(pb_file_path, "rb") as f:
        output_graph_def.ParseFromString(f.read())
        _ = tf.import_graph_def(output_graph_def, name="")

Read the image file, resize it, and feed it to the model's input; img_out_softmax is then the corresponding output.

img = io.imread(jpg_path)
img = transform.resize(img, (224, 224, 3))
img_out_softmax = sess.run(out_softmax, feed_dict={input_x:np.reshape(img, [-1, 224, 224, 3])})
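The softmax output is just a row of probabilities, so one more step is needed to get a readable answer. A minimal sketch follows. The class names are hypothetical (they should be listed in the same order as the labels used during training), and the input row is made up rather than produced by the model.

```python
import numpy as np

# Hypothetical class names, listed in label order.
CLASS_NAMES = ["cat", "dog"]


def describe_prediction(softmax_out, class_names=CLASS_NAMES):
    """Map a (1, num_classes) softmax row to (class name, probability)."""
    probs = np.asarray(softmax_out).reshape(-1, len(class_names))[0]
    idx = int(np.argmax(probs))
    return class_names[idx], float(probs[idx])


name, prob = describe_prediction([[0.1, 0.9]])
print(name, prob)
```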

Finally, here is the complete code for train and test:
train

from PIL import Image
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import tensorflow as tf
import os
import glob
from skimage import io, transform
from tensorflow.python.framework import graph_util
import collections

path = '/home/zhoupeilin/vgg16/picture/'
w = 224
h = 224
c = 3

def read_img(path):
    cate   = [path + x for x in os.listdir(path) if os.path.isdir(path + x)]
    imgs   = []
    labels = []
    for idx, folder in enumerate(cate):
        for im in glob.glob(folder + '/*.jpg'):
            print('reading the image: %s' % (im))
            img = io.imread(im)
            img = transform.resize(img, (w, h, c))
            imgs.append(img)
            labels.append(idx)
    return np.asarray(imgs, np.float32), np.asarray(labels, np.int32)
data, label = read_img(path)

num_example = data.shape[0]
arr = np.arange(num_example)
np.random.shuffle(arr)
data = data[arr]
label = label[arr]

ratio = 0.8
s = int(num_example * ratio)  # np.int was removed from recent NumPy releases
x_train = data[:s]
y_train = label[:s]
x_val   = data[s:]
y_val   = label[s:]

def build_network(height, width, channel):
    x = tf.placeholder(tf.float32, shape=[None, height, width, channel], name='input')
    y = tf.placeholder(tf.int64, shape=[None, 2], name='labels_placeholder')

    def weight_variable(shape, name="weights"):
        initial = tf.truncated_normal(shape, dtype=tf.float32, stddev=0.1)
        return tf.Variable(initial, name=name)

    def bias_variable(shape, name="biases"):
        initial = tf.constant(0.1, dtype=tf.float32, shape=shape)
        return tf.Variable(initial, name=name)

    def conv2d(input, w):
        return tf.nn.conv2d(input, w, [1, 1, 1, 1], padding='SAME')

    def pool_max(input):
        return tf.nn.max_pool(input,
                               ksize=[1, 2, 2, 1],
                               strides=[1, 2, 2, 1],
                               padding='SAME',
                               name='pool1')

    def fc(input, w, b):
        return tf.matmul(input, w) + b

    # conv1
    with tf.name_scope('conv1_1') as scope:
        kernel = weight_variable([3, 3, 3, 64])
        biases = bias_variable([64])
        output_conv1_1 = tf.nn.relu(conv2d(x, kernel) + biases, name=scope)

    with tf.name_scope('conv1_2') as scope:
        kernel = weight_variable([3, 3, 64, 64])
        biases = bias_variable([64])
        output_conv1_2 = tf.nn.relu(conv2d(output_conv1_1, kernel) + biases, name=scope)

    pool1 = pool_max(output_conv1_2)

    # conv2
    with tf.name_scope('conv2_1') as scope:
        kernel = weight_variable([3, 3, 64, 128])
        biases = bias_variable([128])
        output_conv2_1 = tf.nn.relu(conv2d(pool1, kernel) + biases, name=scope)

    with tf.name_scope('conv2_2') as scope:
        kernel = weight_variable([3, 3, 128, 128])
        biases = bias_variable([128])
        output_conv2_2 = tf.nn.relu(conv2d(output_conv2_1, kernel) + biases, name=scope)

    pool2 = pool_max(output_conv2_2)

    # conv3
    with tf.name_scope('conv3_1') as scope:
        kernel = weight_variable([3, 3, 128, 256])
        biases = bias_variable([256])
        output_conv3_1 = tf.nn.relu(conv2d(pool2, kernel) + biases, name=scope)

    with tf.name_scope('conv3_2') as scope:
        kernel = weight_variable([3, 3, 256, 256])
        biases = bias_variable([256])
        output_conv3_2 = tf.nn.relu(conv2d(output_conv3_1, kernel) + biases, name=scope)

    with tf.name_scope('conv3_3') as scope:
        kernel = weight_variable([3, 3, 256, 256])
        biases = bias_variable([256])
        output_conv3_3 = tf.nn.relu(conv2d(output_conv3_2, kernel) + biases, name=scope)

    pool3 = pool_max(output_conv3_3)

    # conv4
    with tf.name_scope('conv4_1') as scope:
        kernel = weight_variable([3, 3, 256, 512])
        biases = bias_variable([512])
        output_conv4_1 = tf.nn.relu(conv2d(pool3, kernel) + biases, name=scope)

    with tf.name_scope('conv4_2') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv4_2 = tf.nn.relu(conv2d(output_conv4_1, kernel) + biases, name=scope)

    with tf.name_scope('conv4_3') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv4_3 = tf.nn.relu(conv2d(output_conv4_2, kernel) + biases, name=scope)

    pool4 = pool_max(output_conv4_3)

    # conv5
    with tf.name_scope('conv5_1') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv5_1 = tf.nn.relu(conv2d(pool4, kernel) + biases, name=scope)

    with tf.name_scope('conv5_2') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv5_2 = tf.nn.relu(conv2d(output_conv5_1, kernel) + biases, name=scope)

    with tf.name_scope('conv5_3') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv5_3 = tf.nn.relu(conv2d(output_conv5_2, kernel) + biases, name=scope)

    pool5 = pool_max(output_conv5_3)

    #fc6
    with tf.name_scope('fc6') as scope:
        shape = int(np.prod(pool5.get_shape()[1:]))
        kernel = weight_variable([shape, 4096])
        biases = bias_variable([4096])
        pool5_flat = tf.reshape(pool5, [-1, shape])
        output_fc6 = tf.nn.relu(fc(pool5_flat, kernel, biases), name=scope)

    #fc7
    with tf.name_scope('fc7') as scope:
        kernel = weight_variable([4096, 4096])
        biases = bias_variable([4096])
        output_fc7 = tf.nn.relu(fc(output_fc6, kernel, biases), name=scope)

    #fc8
    with tf.name_scope('fc8') as scope:
        kernel = weight_variable([4096, 2])
        biases = bias_variable([2])
        output_fc8 = tf.nn.relu(fc(output_fc7, kernel, biases), name=scope)

    finaloutput = tf.nn.softmax(output_fc8, name="softmax")

    # softmax_cross_entropy_with_logits applies softmax itself, so pass the raw fc8 output.
    cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=output_fc8, labels=y))
    optimize = tf.train.AdamOptimizer(learning_rate=1e-4).minimize(cost)

    prediction_labels = tf.argmax(finaloutput, axis=1, name="output")
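    # y is one-hot with shape [None, 2]; convert it to class indices before comparing.
    read_labels = tf.argmax(y, axis=1)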

    correct_prediction = tf.equal(prediction_labels, read_labels)
    accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

    correct_times_in_batch = tf.reduce_sum(tf.cast(correct_prediction, tf.int32))

    return dict(
        x=x,
        y=y,
        optimize=optimize,
        correct_prediction=correct_prediction,
        correct_times_in_batch=correct_times_in_batch,
        cost=cost,
    )


def train_network(graph, batch_size, num_epochs, pb_file_path):
    init = tf.global_variables_initializer()
    with tf.Session() as sess:
        sess.run(init)
        epoch_delta = 2
        for epoch_index in range(num_epochs):
            for i in range(12):
                sess.run([graph['optimize']], feed_dict={
                    graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
                    graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
                })
            if epoch_index % epoch_delta == 0:
                total_batches_in_train_set = 0
                total_correct_times_in_train_set = 0
                total_cost_in_train_set = 0.
                for i in range(12):
                    return_correct_times_in_batch = sess.run(graph['correct_times_in_batch'], feed_dict={
                        graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
                        graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
                    })
                    mean_cost_in_batch = sess.run(graph['cost'], feed_dict={
                        graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
                        graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
                    })
                    total_batches_in_train_set += 1
                    total_correct_times_in_train_set += return_correct_times_in_batch
                    total_cost_in_train_set += (mean_cost_in_batch * batch_size)


                total_batches_in_test_set = 0
                total_correct_times_in_test_set = 0
                total_cost_in_test_set = 0.
                for i in range(3):
                    return_correct_times_in_batch = sess.run(graph['correct_times_in_batch'], feed_dict={
                        graph['x']: np.reshape(x_val[i], (1, 224, 224, 3)),
                        graph['y']: ([[1, 0]] if y_val[i] == 0 else [[0, 1]])
                    })
                    mean_cost_in_batch = sess.run(graph['cost'], feed_dict={
                        graph['x']: np.reshape(x_val[i], (1, 224, 224, 3)),
                        graph['y']: ([[1, 0]] if y_val[i] == 0 else [[0, 1]])
                    })
                    total_batches_in_test_set += 1
                    total_correct_times_in_test_set += return_correct_times_in_batch
                    total_cost_in_test_set += (mean_cost_in_batch * batch_size)

                acy_on_test  = total_correct_times_in_test_set / float(total_batches_in_test_set * batch_size)
                acy_on_train = total_correct_times_in_train_set / float(total_batches_in_train_set * batch_size)
                print('Epoch - {:2d}, acy_on_test:{:6.2f}%({}/{}),loss_on_test:{:6.2f}, '
                      'acy_on_train:{:6.2f}%({}/{}),loss_on_train:{:6.2f}'.format(
                          epoch_index,
                          acy_on_test * 100.0,
                          total_correct_times_in_test_set,
                          total_batches_in_test_set * batch_size,
                          total_cost_in_test_set,
                          acy_on_train * 100.0,
                          total_correct_times_in_train_set,
                          total_batches_in_train_set * batch_size,
                          total_cost_in_train_set))
            constant_graph = graph_util.convert_variables_to_constants(sess, sess.graph_def, ["output"])
            with tf.gfile.FastGFile(pb_file_path, mode='wb') as f:
                f.write(constant_graph.SerializeToString())


def main():
    batch_size = 12
    num_epochs = 50

    pb_file_path = "vggs.pb"

    g = build_network(height=224, width=224, channel=3)
    train_network(g, batch_size, num_epochs, pb_file_path)

main()

test

import tensorflow as tf
import numpy as np
import PIL.Image as Image
from skimage import io, transform

def recognize(jpg_path, pb_file_path):
    with tf.Graph().as_default():
        output_graph_def = tf.GraphDef()

        with open(pb_file_path, "rb") as f:
            output_graph_def.ParseFromString(f.read())
            _ = tf.import_graph_def(output_graph_def, name="")

        with tf.Session() as sess:
            init = tf.global_variables_initializer()
            sess.run(init)

            # Look up the tensors by the names given when the graph was built.
            input_x = sess.graph.get_tensor_by_name("input:0")
            print(input_x)
            out_softmax = sess.graph.get_tensor_by_name("softmax:0")
            print(out_softmax)
            out_label = sess.graph.get_tensor_by_name("output:0")
            print(out_label)

            img = io.imread(jpg_path)
            img = transform.resize(img, (224, 224, 3))
            img_out_softmax = sess.run(out_softmax, feed_dict={input_x:np.reshape(img, [-1, 224, 224, 3])})

            print("img_out_softmax:", img_out_softmax)
            prediction_labels = np.argmax(img_out_softmax, axis=1)
            print("label:", prediction_labels)

recognize("vgg16/picture/dog/dog3.jpg", "vgg16/vggs.pb")
