Deep Learning Format Conversion Notes (2): Converting CKPT to a PB File

阿新 • Published: 2021-02-01

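When we save a model with tf.train.Saver(), several files are produced: the structure of the computation graph and the values of its variables are stored in separate files. This is the usual way of saving models in TensorFlow.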

Code for saving the model:

import tensorflow as tf  # TensorFlow 1.x API

# Declare two variables
v1 = tf.Variable(tf.random_normal([1, 2]), name="v1")
v2 = tf.Variable(tf.random_normal([2, 3]), name="v2")
init_op = tf.global_variables_initializer()  # initialize all variables
saver = tf.train.Saver()  # tf.train.Saver is the class used to save the model
with tf.Session() as sess:
    sess.run(init_op)
    print("v1:", sess.run(v1))  # print v1 and v2 so they can be compared after restoring
    print("v2:", sess.run(v2))
    saver_path = saver.save(sess, "save/model.ckpt-510")  # save the model under the prefix save/model.ckpt-510
    print("Model saved in file:", saver_path)

Running this produces the following files in the save directory, where:

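checkpoint: the checkpoint state file; it lists the model checkpoints saved in the directory and records which one is the latest.
model.ckpt-510.meta: stores the structure of the TensorFlow computation graph, which can be thought of as the network architecture. It can be loaded into the current default graph with tf.train.import_meta_graph.
model.ckpt-510.data-00000-of-00001: stores the value of every variable in the model.
model.ckpt-510.index: an index that maps each variable name to the location of its value in the .data file. It is required, together with the .data file, when restoring the checkpoint.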
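
As a rough illustration of the file layout described above, the sketch below sorts checkpoint file names into the roles just listed and reads the latest checkpoint prefix out of the text of the `checkpoint` file. The helper names `classify_checkpoint_file` and `latest_checkpoint_prefix` are made up for this example and only the Python standard library is used; in real code `tf.train.latest_checkpoint` does the second job.

```python
import re


def classify_checkpoint_file(filename):
    """Return the role of a file written by tf.train.Saver (hypothetical helper)."""
    if filename == "checkpoint":
        return "checkpoint list"
    if filename.endswith(".meta"):
        return "graph structure"
    if filename.endswith(".index"):
        return "tensor index"
    if re.search(r"\.data-\d{5}-of-\d{5}$", filename):
        return "variable values"
    return "unknown"


def latest_checkpoint_prefix(checkpoint_text):
    """Extract model_checkpoint_path from the text of the `checkpoint` file."""
    match = re.search(r'^model_checkpoint_path:\s*"([^"]+)"',
                      checkpoint_text, re.MULTILINE)
    return match.group(1) if match else None


files = [
    "checkpoint",
    "model.ckpt-510.meta",
    "model.ckpt-510.index",
    "model.ckpt-510.data-00000-of-00001",
]
for name in files:
    print(name, "->", classify_checkpoint_file(name))

# Sample content in the text format that the `checkpoint` file uses
sample = ('model_checkpoint_path: "model.ckpt-510"\n'
          'all_model_checkpoint_paths: "model.ckpt-510"\n')
print(latest_checkpoint_prefix(sample))
```

The prefix printed at the end (without any file extension) is what gets passed to `saver.restore` and to the `freeze_graph` function later in this post.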

In real deployments you are rarely handed a ckpt model; it is usually frozen into a pb model first. Let's see how to freeze a ckpt into a pb model.

Complete code:

# -*-coding: utf-8 -*-
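import os
import numpy as np
import tensorflow as tf
from create_tf_record import *  # local helper module (not shown); read_image presumably comes from here
from tensorflow.python.framework import graph_util

resize_height = 299  # image height
resize_width = 299  # image width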
depths = 3


def freeze_graph_test(pb_path, image_path):
    '''
    :param pb_path: path to the pb file
    :param image_path: path to the test image
    :return:
    '''
    with tf.Graph().as_default():
        output_graph_def = tf.GraphDef()
        with open(pb_path, "rb") as f:
            output_graph_def.ParseFromString(f.read())
            tf.import_graph_def(output_graph_def, name="")
        with tf.Session() as sess:
            # A frozen graph contains no variables, so this call is effectively a no-op
            sess.run(tf.global_variables_initializer())

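            # Look up the input tensors; they correspond to the network's inputs, usually created with tf.placeholder.
            # input:0 is the input image, keep_prob:0 is the dropout keep probability (1.0 at test time),
            # is_training:0 is the training flag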
            input_image_tensor = sess.graph.get_tensor_by_name("input:0")
            input_keep_prob_tensor = sess.graph.get_tensor_by_name("keep_prob:0")
            input_is_training_tensor = sess.graph.get_tensor_by_name("is_training:0")

            # Look up the output tensor
            output_tensor_name = sess.graph.get_tensor_by_name("InceptionV3/Logits/SpatialSqueeze:0")

            # Read the test image
            im = read_image(image_path, resize_height, resize_width, normalization=True)
            im = im[np.newaxis, :]
            # Check that the loaded model works. Note that tensor names (e.g. "input:0") are passed here, not operation names
            # out=sess.run("InceptionV3/Logits/SpatialSqueeze:0", feed_dict={'input:0': im,'keep_prob:0':1.0,'is_training:0':False})
            out = sess.run(output_tensor_name, feed_dict={input_image_tensor: im,
                                                          input_keep_prob_tensor: 1.0,
                                                          input_is_training_tensor: False})
            print("out:{}".format(out))
            score = tf.nn.softmax(out, name='pre')
            class_id = tf.argmax(score, 1)
            print("pre class_id:{}".format(sess.run(class_id)))


def freeze_graph(input_checkpoint, output_graph):
    '''
    :param input_checkpoint: path prefix of the ckpt model (without file extension)
    :param output_graph: path where the PB model is saved
    :return:
    '''
    # checkpoint = tf.train.get_checkpoint_state(model_folder)  # check whether a usable ckpt exists in the directory
    # input_checkpoint = checkpoint.model_checkpoint_path  # get the ckpt path

    # Name of the output node; it must be a node that exists in the original model
    output_node_names = "InceptionV3/Logits/SpatialSqueeze"
    saver = tf.train.import_meta_graph(input_checkpoint + '.meta', clear_devices=True)

    with tf.Session() as sess:
        saver.restore(sess, input_checkpoint)  # restore the variable values into the imported graph
        output_graph_def = graph_util.convert_variables_to_constants(  # freeze the model: turn variables into constants
            sess=sess,
            input_graph_def=sess.graph_def,  # the graph definition held by the session
            output_node_names=output_node_names.split(","))  # separate multiple output nodes with commas

        with tf.gfile.GFile(output_graph, "wb") as f:  # save the model
            f.write(output_graph_def.SerializeToString())  # write the serialized graph
        print("%d ops in the final graph." % len(output_graph_def.node))  # number of operation nodes in the frozen graph

        # for op in sess.graph.get_operations():
        #     print(op.name, op.values())


def freeze_graph2(input_checkpoint, output_graph):
    '''
    :param input_checkpoint: path prefix of the ckpt model (without file extension)
    :param output_graph: path where the PB model is saved
    :return:
    '''
    # checkpoint = tf.train.get_checkpoint_state(model_folder)  # check whether a usable ckpt exists in the directory
    # input_checkpoint = checkpoint.model_checkpoint_path  # get the ckpt path

    # Name of the output node; it must be a node that exists in the original model
    output_node_names = "InceptionV3/Logits/SpatialSqueeze"
    saver = tf.train.import_meta_graph(input_checkpoint + '.meta', clear_devices=True)
    graph = tf.get_default_graph()  # get the default graph
    input_graph_def = graph.as_graph_def()  # serialized GraphDef representing the current graph

    with tf.Session() as sess:
        saver.restore(sess, input_checkpoint)  # restore the variable values into the imported graph
        output_graph_def = graph_util.convert_variables_to_constants(  # freeze the model: turn variables into constants
            sess=sess,
            input_graph_def=input_graph_def,  # equivalent to sess.graph_def
            output_node_names=output_node_names.split(","))  # separate multiple output nodes with commas

        with tf.gfile.GFile(output_graph, "wb") as f:  # save the model
            f.write(output_graph_def.SerializeToString())  # write the serialized graph
        print("%d ops in the final graph." % len(output_graph_def.node))  # number of operation nodes in the frozen graph

        # for op in graph.get_operations():
        #     print(op.name, op.values())


if __name__ == '__main__':
    # Path prefix of the input ckpt model
    input_checkpoint = 'D:/pycharm/CarPlateIdentity-master/carIdentityData/model1/char_recongnize/model.ckpt-510'
    # Directory for the output pb model
    out_dirpath = 'D:/pycharm/CarPlateIdentity-master/carIdentityData/model1/char_recongnize/pb/'
    os.makedirs(os.path.dirname(out_dirpath), exist_ok=True)
    out_pb_path = out_dirpath + "frozen_model.pb"
    # Call freeze_graph to convert the ckpt to pb
    freeze_graph(input_checkpoint, out_pb_path)
    print("Conversion succeeded")
    # Test the pb model
    # image_path = 'test_image/animal.jpg'
    # freeze_graph_test(pb_path=out_pb_path, image_path=image_path)
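
The script above uses two kinds of names: output_node_names holds operation names such as InceptionV3/Logits/SpatialSqueeze, while get_tensor_by_name and feed_dict expect tensor names of the form op_name:output_index. A minimal sketch of that convention in plain Python follows; the helper names `split_tensor_name` and `parse_output_node_names` are hypothetical and not part of TensorFlow.

```python
def split_tensor_name(tensor_name):
    """Split '<op_name>:<output_index>' into (op_name, index). Hypothetical helper."""
    op_name, sep, index = tensor_name.rpartition(":")
    if not sep or not index.isdigit():
        raise ValueError("expected '<op_name>:<output_index>', got %r" % tensor_name)
    return op_name, int(index)


def parse_output_node_names(names):
    """Turn a comma-separated string of operation names into a clean list."""
    return [name.strip() for name in names.split(",") if name.strip()]


# Tensor name used when running the frozen graph
print(split_tensor_name("InceptionV3/Logits/SpatialSqueeze:0"))

# Operation names used when freezing the graph
print(parse_output_node_names("InceptionV3/Logits/SpatialSqueeze"))
```

Passing a bare operation name such as "input" to `split_tensor_name` raises a ValueError, which mirrors the kind of mistake the comment in freeze_graph_test warns about.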

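When converting ckpt to pb, only the nodes that the specified output nodes depend on are kept; everything unrelated to those outputs (for example, nodes used only for training) is discarded, and the remaining variables are stored as constants. This is the point of generating a pb file: a smaller, self-contained model. It is also why the output nodes must be named explicitly when generating the pb, since they determine which upstream parameters have to be frozen.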
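
To make the pruning step concrete, here is a toy sketch in plain Python. It is not TensorFlow's implementation: the graph is a hypothetical dictionary mapping each node name to the names of its inputs, and the made-up function `nodes_needed_for` simply walks backwards from the output nodes.

```python
def nodes_needed_for(graph, output_nodes):
    """Return the set of nodes that the given outputs depend on.

    graph: dict mapping node name -> list of input node names (toy model).
    """
    needed = set()
    stack = list(output_nodes)
    while stack:
        node = stack.pop()
        if node in needed:
            continue
        if node not in graph:
            # Mirrors the rule that an output node must exist in the original model
            raise KeyError("node %r is not in the graph" % node)
        needed.add(node)
        stack.extend(graph[node])
    return needed


# A made-up training graph: only part of it is needed for inference
toy_graph = {
    "input": [],
    "weights": [],
    "logits": ["input", "weights"],
    "labels": [],
    "loss": ["logits", "labels"],
    "train_op": ["loss", "weights"],
}

kept = nodes_needed_for(toy_graph, ["logits"])
print(sorted(kept))  # ['input', 'logits', 'weights']
```

In this toy graph, labels, loss and train_op are dropped because logits does not depend on them. Choosing a different output node changes what survives.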
