Machine Learning Experiments (9): Running a General Incremental Learning Workflow with TensorFlow - Testing and Application

阿新 • Published: 2019-01-23

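(Continued from the previous post)
To test how well the neural network classifies (or fits), we can run it on a second set of labeled samples and check how far the cost function has converged on them.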

1. Model Testing

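The test program reads the test data, runs it through the currently trained model, and computes the cost function. If the model is ill-behaved (for example, overfitted to the training set), the cost on the test set will be noticeably higher than on the training set; otherwise it will be low and consistent with the training cost.
Output: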

Testing...
1024 0.0035852
2048 0.00231017
3072 0.00157589
4096 0.00172059
5120 0.00321012
6144 0.00346273
7168 0.00267906
8192 0.00247223
9216 0.00233935
10240 0.00288214
11264 0.002231
12288 0.00120241
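
To compare this run against the training run, it helps to reduce the log to a few numbers. The following is a minimal sketch and not part of the original program: `parse_loss_log` and `summarize` are hypothetical helper names, and the log text is simply the output printed above.

```python
import numpy as np

# Output of the test run, copied from the listing above.
TEST_LOG = """\
Testing...
1024 0.0035852
2048 0.00231017
3072 0.00157589
4096 0.00172059
5120 0.00321012
6144 0.00346273
7168 0.00267906
8192 0.00247223
9216 0.00233935
10240 0.00288214
11264 0.002231
12288 0.00120241
"""

def parse_loss_log(text):
    """Return (sample_counts, losses) parsed from '<count> <loss>' lines."""
    counts, losses = [], []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 2:
            continue  # skip headers such as 'Testing...'
        try:
            count, loss = int(parts[0]), float(parts[1])
        except ValueError:
            continue  # skip anything that is not '<int> <float>'
        counts.append(count)
        losses.append(loss)
    return np.array(counts), np.array(losses)

def summarize(losses):
    """Mean, minimum and maximum of the reported losses."""
    return {
        "mean": float(np.mean(losses)),
        "min": float(np.min(losses)),
        "max": float(np.max(losses)),
    }

counts, losses = parse_loss_log(TEST_LOG)
stats = summarize(losses)
print("samples:", int(counts[-1]), "mean loss:", stats["mean"],
      "range:", stats["min"], "to", stats["max"])
```

If the training loss from the previous post settles in the same range, the model is probably generalizing; a test loss several times higher would suggest the ill-behaved case described above.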

Complete code of the test program:

# -*- coding: utf-8 -*-
"""
Created on Sun Nov 26 15:24:50 2017
gn_test_model.py
@author: goldenhawking
"""
from __future__ import print_function
import tensorflow as tf
import numpy as np
import configparser
import re
import matplotlib.pyplot as mpl
trainning_task_file         = 'train_task.cfg' 

testing_file                = 'test_set.txt'
model_path                  = './saved_model/'
# Read the configuration
config = configparser.ConfigParser()
config.read(trainning_task_file)
n               = int(config['network']['input_nodes'])     # input vector size
K               = int(config['network']['output_nodes'])    # output vector size
lam             = float(config['network']['lambda'])
# Hidden layer sizes, comma-separated, e.g. "16,16,13"
hidden_layer_size = config['network']['hidden_layer_size']
# Separator characters: whitespace, commas and double quotes
reobj = re.compile(r'[\s,"]')
ls_array        = reobj.split(hidden_layer_size)
ls_array        = [item for item in ls_array if item != '']  # drop empty strings
# Number of hidden layers
hidden_layer_elems = len(ls_array)

# Convert to integers and append the output layer
ns_array = []
for idx in range(0,hidden_layer_elems):
    ns_array.append(int(ls_array[idx]))
# The output layer is the last layer
ns_array.append(K)
# Total number of layers (including the output layer)
total_layer_size = len(ns_array)
#--------------------------------------------------------------
#create graph
graph = tf.Graph()
with graph.as_default():
    with tf.name_scope('network'):
        with tf.name_scope('input'):
            s = [n]
            a = [tf.placeholder(tf.float32,[None,s[0]],name="in")]
            W = []
            b = []
            z = []
            punish = tf.constant(0.0)
            for idx in range(0,total_layer_size)    :
                with tf.name_scope('layer'+str(idx+1)):
                    s.append(int(ns_array[idx]))
                    W.append(tf.Variable(tf.random_uniform([s[idx],s[idx+1]],0,1),name='W'+str(idx+1)))
                    b.append(tf.Variable(tf.random_uniform([1],0,1),name='b'+str(idx+1)))
                    z.append(tf.matmul(a[idx],W[idx]) + b[idx]*tf.ones([1,s[idx+1]],name='z'+str(idx+1)))
                    a.append(tf.nn.tanh(z[idx],name='a'+str(idx+1)))
                with tf.name_scope('regular'):
                    punish = punish + tf.reduce_sum(W[idx]**2) * lam

    #--------------------------------------------------------------
    with tf.name_scope('loss'):
        y_ = tf.placeholder(tf.float32,[None,K],name="tr_out")
        loss = tf.reduce_mean(tf.square(a[total_layer_size]-y_),name="loss") + punish
    with tf.name_scope('trainning'):
        optimizer = tf.train.AdamOptimizer(name="opt")
        train = optimizer.minimize(loss,name="train")

    init = tf.global_variables_initializer()
    #save graph to Disk
    saver = tf.train.Saver()
#--------------------------------------------------------------
### create tensorflow structure end ###
sess = tf.Session(graph=graph)
check_point_path = model_path # directory that holds the saved model checkpoint
ckpt = tf.train.get_checkpoint_state(checkpoint_dir=check_point_path)
saver.restore(sess,ckpt.model_checkpoint_path)

#--------------------------------------------------------------
file_deal_times = int(config['performance']['file_deal_times'])
trunk           = int(config['performance']['trunk'])
train_step      = int(config['performance']['train_step'])
iterate_times   = int(config['performance']['iterate_times'])
print ("Testing...")
#testing
x_test = np.zeros([trunk,n]).astype(np.float32)
#read n features and K outputs
y_test = np.zeros([trunk,K]).astype(np.float32)
total_red = 0

plot_x = []
plot_y = []

with open(testing_file, 'rt') as testfile:
    while 1:
        lines = testfile.readlines()
        if not lines:
            break
        line_count = len(lines)
        for lct in range(line_count):
            x_arr = reobj.split(lines[lct]);
            x_arr = [item for item in filter(lambda x:x != '', x_arr)] #remove null strings
            for idx in range(n)    :
                x_test[total_red % trunk,idx] = float(x_arr[idx])
            for idx in range(K)    :    
                y_test[total_red % trunk,idx] = float(x_arr[idx+n])           
            total_red = total_red + 1
            # every train_step samples, evaluate the loss on the buffered test rows
            if (total_red % train_step == 0):
                # only the first min(total_red, trunk) rows of the buffer hold data
                filled = min(total_red,trunk)
                lss = sess.run(loss,feed_dict={a[0]:x_test[0:filled],y_:y_test[0:filled]})
                print(total_red,lss)
                plot_x.append(total_red)
                plot_y.append(lss)

mpl.plot(plot_x,plot_y)
mpl.xlabel('samples read')
mpl.ylabel('loss')
mpl.show()
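
Both programs take the network shape from `train_task.cfg`. The sketch below shows what such a file might look like and how the layer sizes are derived from it. The section and key names are the ones the listing reads; every value is an assumption chosen for illustration (2 inputs and 3 outputs match the sample results later in this post, `16,16,13` comes from the code comment, and `train_step = 1024` matches the printing interval in the output above). `layer_sizes` is a hypothetical helper.

```python
import configparser
import re

# Hypothetical contents of train_task.cfg; all values are assumptions.
SAMPLE_CFG = """
[network]
input_nodes = 2
output_nodes = 3
lambda = 1e-6
hidden_layer_size = 16,16,13

[performance]
file_deal_times = 1
trunk = 4096
train_step = 1024
iterate_times = 10
"""

def layer_sizes(cfg_text):
    """Return [inputs, hidden..., outputs] as the listing derives them."""
    config = configparser.ConfigParser()
    config.read_string(cfg_text)
    net = config['network']
    n = int(net['input_nodes'])
    K = int(net['output_nodes'])
    # Same separators as the listing: whitespace, commas, double quotes.
    tokens = re.split(r'[\s,"]+', net['hidden_layer_size'])
    hidden = [int(t) for t in tokens if t]
    return [n] + hidden + [K]

print(layer_sizes(SAMPLE_CFG))
```

The test program must be run with the same `[network]` values that were used for training; otherwise the variable shapes in the rebuilt graph will not match the checkpoint and `saver.restore` will fail.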

2. Model Application

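The following program reads the given features and produces classification results. The classifier output is saved to a text file.
Each line of this file is one result and has two parts: the input features, followed by the classification (or fitting) output.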

[-0.24751600623130798, -0.9268109798431396] [0.9986907243728638, -0.000654876115731895, -0.00044381615589372814]
[0.045763999223709106, 0.5164780020713806] [0.9986994862556458, -0.0026147901080548763, -0.001965639414265752]
[-0.6250460147857666, -0.8338379859924316] [-0.00046735999058000743, -0.0015115130227059126, 0.9921404719352722]
[0.6993309855461121, -0.042775001376867294] [0.9986986517906189, -0.0005539059056900442, -0.00046229359577409923]
[0.9839800000190735, 0.19465599954128265] [0.9986998438835144, -0.0009445545147173107, -0.0008026955765672028]
[-0.12072400003671646, 0.5291630029678345] [0.9986990690231323, 6.365776062011719e-05, -4.45246696472168e-05]
[0.11185800284147263, 0.20474199950695038] [0.9986990690231323, -0.00044244524906389415, -0.0004038810438942164]
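
Each line is two Python-style lists printed side by side, so it can be read back with a small regular expression. This is a minimal sketch; `parse_result_line` is a hypothetical helper, and the sample line is the first one shown above.

```python
import re

# Matches the contents of each bracketed list on a result line.
_LIST_RE = re.compile(r'\[([^\]]*)\]')

def parse_result_line(line):
    """Split '[f1, f2] [o1, o2, o3]' into (features, outputs) as float lists."""
    groups = _LIST_RE.findall(line)
    if len(groups) != 2:
        raise ValueError("expected two bracketed lists, got %d" % len(groups))
    features = [float(v) for v in groups[0].split(',')]
    outputs = [float(v) for v in groups[1].split(',')]
    return features, outputs

sample = ("[-0.24751600623130798, -0.9268109798431396] "
          "[0.9986907243728638, -0.000654876115731895, -0.00044381615589372814]")
features, outputs = parse_result_line(sample)
print(features, outputs)
```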

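A maximum-value decision can be applied to the floating-point outputs to obtain the class. The ratio between the outputs also shows how well separated the classes are.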
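
The maximum-value decision can be sketched with NumPy as follows. One deliberate substitution: the text speaks of a ratio, but the tanh outputs can be negative or close to zero, so this sketch reports the gap between the largest and second-largest output instead. `decide` is a hypothetical helper, and the two rows are taken from the sample results above.

```python
import numpy as np

# Two output rows copied from the sample results above.
outputs = np.array([
    [0.9986907243728638, -0.000654876115731895, -0.00044381615589372814],
    [-0.00046735999058000743, -0.0015115130227059126, 0.9921404719352722],
])

def decide(outputs):
    """Return (labels, margins) for a batch of network outputs.

    labels  : index of the largest output in each row
    margins : largest output minus the second largest (assumption: used
              here in place of the ratio mentioned in the text)
    """
    labels = np.argmax(outputs, axis=1)
    ordered = np.sort(outputs, axis=1)        # ascending within each row
    margins = ordered[:, -1] - ordered[:, -2]
    return labels, margins

labels, margins = decide(outputs)
for lab, mar in zip(labels, margins):
    print("class", int(lab), "margin", float(mar))
```

A margin close to 1 means the winning output clearly dominates; a margin close to 0 marks a sample near a decision boundary.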

Source code:

# -*- coding: utf-8 -*-
"""
Created on Sun Nov 26 15:24:50 2017
gn_run_model.py
@author: goldenhawking
"""
from __future__ import print_function
import tensorflow as tf
import numpy as np
import configparser
import re
import matplotlib.pyplot as mpl
trainning_task_file         = 'train_task.cfg'
input_file                  = 'test_set.txt'
output_file                 = 'result.txt'
model_path                  = './saved_model/'
# Read the configuration
config = configparser.ConfigParser()
config.read(trainning_task_file)
n               = int(config['network']['input_nodes'])     # input vector size
K               = int(config['network']['output_nodes'])     # output vector size
lam             = float(config['network']['lambda'])
# Hidden layer sizes, comma-separated, e.g. "16,16,13"
hidden_layer_size = config['network']['hidden_layer_size']
# Separator characters: whitespace, commas and double quotes
reobj = re.compile(r'[\s,"]')
ls_array        = reobj.split(hidden_layer_size)
ls_array        = [item for item in ls_array if item != '']  # drop empty strings
# Number of hidden layers
hidden_layer_elems = len(ls_array)

# Convert to integers and append the output layer
ns_array = []
for idx in range(0,hidden_layer_elems):
    ns_array.append(int(ls_array[idx]))
# The output layer is the last layer
ns_array.append(K)
# Total number of layers (including the output layer)
total_layer_size = len(ns_array)
#--------------------------------------------------------------
#create graph
graph = tf.Graph()
with graph.as_default():
    with tf.name_scope('network'):
        with tf.name_scope('input'):
            s = [n]
            a = [tf.placeholder(tf.float32,[None,s[0]],name="in")]
            W = []
            b = []
            z = []
            punish = tf.constant(0.0)
            for idx in range(0,total_layer_size)    :
                with tf.name_scope('layer'+str(idx+1)):
                    s.append(int(ns_array[idx]))
                    W.append(tf.Variable(tf.random_uniform([s[idx],s[idx+1]],0,1),name='W'+str(idx+1)))
                    b.append(tf.Variable(tf.random_uniform([1],0,1),name='b'+str(idx+1)))
                    z.append(tf.matmul(a[idx],W[idx]) + b[idx]*tf.ones([1,s[idx+1]],name='z'+str(idx+1)))
                    a.append(tf.nn.tanh(z[idx],name='a'+str(idx+1)))
                with tf.name_scope('regular'):
                    punish = punish + tf.reduce_sum(W[idx]**2) * lam

    #--------------------------------------------------------------
    with tf.name_scope('loss'):
        y_ = tf.placeholder(tf.float32,[None,K],name="tr_out")
        loss = tf.reduce_mean(tf.square(a[total_layer_size]-y_),name="loss") + punish
    with tf.name_scope('trainning'):
        optimizer = tf.train.AdamOptimizer(name="opt")
        train = optimizer.minimize(loss,name="train")

    init = tf.global_variables_initializer()
    #save graph to Disk
    saver = tf.train.Saver()
#--------------------------------------------------------------
### create tensorflow structure end ###
sess = tf.Session(graph=graph)
check_point_path = model_path # directory that holds the saved model checkpoint
ckpt = tf.train.get_checkpoint_state(checkpoint_dir=check_point_path)
saver.restore(sess,ckpt.model_checkpoint_path)

#--------------------------------------------------------------
print ("Running...")
with open(input_file, 'rt') as testfile:
    with open(output_file, 'wt') as resultfile:    
        while 1:
            lines = testfile.readlines()
            if not lines:
                break
            line_count = len(lines)
            x_test = np.zeros([line_count,n]).astype(np.float32)
            for lct in range(line_count):
                x_arr = reobj.split(lines[lct]);
                x_arr = [item for item in filter(lambda x:x != '', x_arr)] #remove null strings
                for idx in range(n)    :
                    x_test[lct,idx] = float(x_arr[idx])
            # run the network forward on the whole batch
            result = sess.run(a[total_layer_size],feed_dict={a[0]:x_test})
            for idx in range(line_count):
                print(x_test[idx].tolist(),result[idx].tolist(),file = resultfile)

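# Plot the samples whose output for a class is at least 0.9, one colour per class
# (assumes 2 input features and 3 output classes, as in the sample results)
mpl.plot(x_test[result[:,1]>=0.9,0],x_test[result[:,1]>=0.9,1],'b.')
mpl.plot(x_test[result[:,2]>=0.9,0],x_test[result[:,2]>=0.9,1],'r.')
mpl.plot(x_test[result[:,0]>=0.9,0],x_test[result[:,0]>=0.9,1],'g.')
mpl.show()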
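
For readers without a TensorFlow 1.x environment, the computation that the graph performs can be mirrored in plain NumPy. This is a sketch under stated assumptions, not the author's code: the weights here are random (the real program restores them from the checkpoint, which is not shown), the layer sizes `[2, 16, 16, 13, 3]` are illustrative, and `forward` and `loss_fn` are hypothetical names. As in the listing, each layer uses a single scalar bias and a tanh activation, and the loss is the mean squared error plus `lam` times the sum of squared weights.

```python
import numpy as np

def forward(x, weights, biases):
    """Apply a = tanh(a @ W + b) layer by layer; b is one scalar per layer."""
    a = x
    for W, b in zip(weights, biases):
        a = np.tanh(a @ W + b)
    return a

def loss_fn(x, y, weights, biases, lam):
    """Mean squared error plus the L2 penalty lam * sum(W ** 2) over all layers."""
    out = forward(x, weights, biases)
    penalty = lam * sum(float(np.sum(W ** 2)) for W in weights)
    return float(np.mean((out - y) ** 2)) + penalty

rng = np.random.default_rng(0)
sizes = [2, 16, 16, 13, 3]          # assumed: inputs, hidden layers, outputs
weights = [rng.uniform(0, 1, size=(sizes[i], sizes[i + 1]))
           for i in range(len(sizes) - 1)]
biases = [float(rng.uniform(0, 1)) for _ in weights]

x = rng.uniform(-1, 1, size=(5, sizes[0]))   # five made-up input rows
out = forward(x, weights, biases)
print(out.shape)
```

Because the last layer is also a tanh, every output lies between -1 and 1, which is why the sample results cluster near 1 for the chosen class and near 0 for the others.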
