Python使用tensorflow實現影象識別（貓狗大戰）-02

阿新 • • 發佈：2018-12-19

import tensorflow as tf

def inference(images, batch_size, n_classes):
    # cov1, shape = [kernel size, kernel size, channels, kernel numbers]
    with tf.variable_scope('conv1') as scope:
    #使用 tf.variable_scope() 讓定義的變數可以有相同的名字
        weights = tf.get_variable('weights', 
                                  shape = 
 [3, 3, 3, 16],   # 16[kernel numbers]：跟計算的精度有關
                                  dtype = tf.float32,
                                  initializer = tf.truncated_normal_initializer(stddev = 0.1,
                                                                                dtype = tf.float32)) 
                        #擷取的正態分佈  tf.truncated_normal_initializer() 

                 '''
                mean：一個python標量或一個標量張量。要生成的隨機值的均值。
                stddev：一個python標量或一個標量張量。要生成的隨機值的標準偏差。
				seed：一個Python整數。用於建立隨機種子。檢視 tf.set_random_seed 行為。
				dtype：資料型別。只支援浮點型別。
                '''       
        biases = tf.get_variable('biases', 
                                 shape = 
 [16], 
                                 dtype = tf.float32,
                                 initializer = tf.constant_initializer(0.1)) # 初始化0.1
        # tf.get_variable()函式的用法----------------------------------------------------------------(1)
        
        conv = tf.nn.conv2d(images, weights, strides = [1, 1, 1, 1], padding = 'SAME')
        #--------------------------------------------------------------------------------------------(2)
        pre_activation = tf.nn.bias_add(conv, biases)
        conv1 = tf.nn.relu(pre_activation, name = scope.name)
        
    #pool1 and norm1
    with tf.variable_scope('poling_lrn') as scope:
        pool1 = tf.nn.max_pool(conv1, ksize = [1, 3, 3, 1], strides = [1, 2, 2, 1], # 標準數值
                               padding = 'SAME', name = 'pooling1')
        norm1 = tf.nn.lrn(pool1, depth_radius = 4, bias = 1.0, alpha = 0.001/9.0,
                          beta = 0.75, name = 'norm1')
 
    # conv2 
    with tf.variable_scope('conv2') as scope:
        weights = tf.get_variable('weights', shape = [3, 3, 16, 16],
                                  dtype = tf.float32,
                                  initializer = tf.truncated_normal_initializer(stddev = 0.1,
                                                                                dtype = tf.float32))
        biases = tf.get_variable('biases', 
                                 shape = [16], 
                                 dtype = tf.float32,
                                 initializer = tf.constant_initializer(0.1))
        conv = tf.nn.conv2d(norm1,weights, strides = [1, 1, 1, 1], padding = 'SAME')
        pre_activation = tf.nn.bias_add(conv, biases)
        conv2 = tf.nn.relu(pre_activation, name = 'conv2')
        
    # pool2 and norm2
    with tf.variable_scope('pooling2_lrn') as scope:
        norm2 = tf.nn.lrn(conv2, depth_radius = 4, bias = 1.0, alpha = 0.001/9.0,
                          beta = 0.75, name = 'norm2')
        pool2 = tf.nn.max_pool(norm2, ksize = [1, 3, 3, 1], strides = [1, 1, 1, 1],
                               padding = 'SAME', name = 'pooling2')
        
    # local3
    with tf.variable_scope('local3') as scope:
        reshape = tf.reshape(pool2, shape = [batch_size, -1]) # 變換成向量
        dim = reshape.get_shape()[1].value
        weights = tf.get_variable('weights',
                                  shape = [dim, 128],  # 全連線個數，128
                                  dtype = tf.float32,
                                  initializer = tf.truncated_normal_initializer(stddev = 0.005,
                                                                                dtype = tf.float32))

        biases = tf.get_variable('biases',
                                shape = [128],
                                dtype = tf.float32,
                                initializer = tf.constant_initializer(0.1))  
        local3 = tf.nn.relu(tf.matmul(reshape, weights) + biases, name = scope.name)
        
    # local4
    with tf.variable_scope('local4') as scope:
        weights = tf.get_variable('weights',
                                  shape = [128, 128], 
                                  dtype = tf.float32,
                                  initializer = tf.truncated_normal_initializer(stddev = 0.005,
                                                                                dtype = tf.float32))
        biases = tf.get_variable('biases',
                                 shape = [128], 
                                 dtype = tf.float32,
                                 initializer = tf.constant_initializer(0.1))
        local4 = tf.nn.relu(tf.matmul(local3, weights) + biases, name = 'local4')
        
    # softmax
    with tf.variable_scope('softmax_linear') as scope:
        weights = tf.get_variable('softmax_linear',
                                  shape = [128, n_classes],  # n_classes:2，表示二分類
                                  dtype = tf.float32,
                                  initializer = tf.truncated_normal_initializer(stddev = 0.005,
                                                                                dtype = tf.float32))
        biases = tf.get_variable('biases',
                                 shape = [n_classes], 
                                 dtype = tf.float32,
                                 initializer = tf.constant_initializer(0.1))
        softmax_linear = tf.add(tf.matmul(local4, weights), biases, name = 'softmax_linear')
        
    return softmax_linear

def losses(logits, labels):
    with tf.variable_scope('lose') as scope:
        cross_entropy = tf.nn.sparse_softmax_cross_entropy_with_logits\
                (logits = logits, labels = labels, name = 'xentropy_per_example')
        loss = tf.reduce_mean(cross_entropy, name = 'loss')
        tf.summary.scalar(scope.name+'/loss', loss)
    
    return loss

#訓練優化
def training(loss, learning_rate):
    with tf.name_scope('optimizer'):
        optimizer = tf.train.AdamOptimizer(learning_rate = learning_rate)
        global_step = tf.Variable(0, name = 'global_step', trainable = False)
        train_op = optimizer.minimize(loss, global_step = global_step)
        
    return train_op

def evalution(logits, labels):
    with tf.variable_scope('accuracy') as scope:
        correct = tf.nn.in_top_k(logits, labels, 1) # 取最大值
        correct = tf.cast(correct, tf.float16)
        accuracy = tf.reduce_mean(correct)
        tf.summary.scalar(scope.name+'/accuracy', accuracy)
    return accuracy

tf.get_variable(name,  shape, initializer): 
#name就是變數的名稱，shape是變數的維度，initializer是變數初始化的方式

import tnsorflow as tf
tf.nn.conv2d(input,filter,strides=[1,1,1,1],padding='SAME')
#input是輸入做卷積的圖片，是一個張量；filter卷積核；strides步長；padding是否考慮邊界，SAME用0填充，VALID不考慮

Python使用tensorflow實現影象識別（貓狗大戰）-02

import tensorflow as tf def inference(images, batch_size, n_classes): # cov1, shape = [kernel size, kernel size, channels, ke

Python使用tensorflow實現影象識別（貓狗大戰）-01

Python使用tensorflow實現影象識別（貓狗大戰）-01 import_data.py import tensorflow as tf import numpy as np import os #引入tensorflow、numpy、os 三個第三方模組 img_widt

基於TensorFlow的Cats vs. Dogs（貓狗大戰）實現和詳解（2）

2. 卷積神經網路模型的構造——model.py 　　關於神經網路模型不想說太多，視訊中使用的模型是仿照TensorFlow的官方例程cifar-10的網路結構來寫的。就是兩個卷積層（每個卷積層後加一個池化層），兩個全連線層，最後一個softmax

基於TensorFlow的Cats vs. Dogs（貓狗大戰）實現和詳解（1）

2017.5.29 　　官方的MNIST例子裡面訓練資料的下載和匯入都是用已經寫好的指令碼完成的，至於裡面實現細節也沒高興去看原始碼，感覺寫得太正式，我這個初學者不好理解。於是在優酷上找到了KevinRush這麼一個播主，裡面的視訊教程講得挺清晰的，於是跟著視

從0到1：神經網路實現影象識別（中）

”. . . we may have knowledge of the past and cannot control it; we may control the future but have no knowledge of it.” — Claude Shannon 1959

從0到1：神經網路實現影象識別（上）

紙上得來終覺淺，絕知此事要躬行。 “神經網路”是“機器學習”的利器之一，常用演算法在TensorFlow、MXNet計算框架上，有很好的支援。為了更好的理解與使用這件利器，我們可以不借助計算框架，從零開始，一步步構建模型，實現學習演算法，並在一個影象識別資料集上，訓練這個模型，再驗證模型預

用101000張食物圖片實現影象識別（資料的獲取與處理）-python-tensorflow框架

　　　　前段時間，日劇《輪到你了》大火，作為程式設計師的我，看到了另外一個程式設計師—二階堂，他的生活作息，以及飲食規律，讓我感同身受，最讓我感觸的是他做的AI聊天機器人，AI菜品分析機器人，AI罪犯分析。　　　　　這讓作為程式設計師的我突然萌生了一股攀比和一種激情，我也得做一個出來（小聲bb，都得嘗試下）

深度學習（四）Keras利用CNN實現圖片識別（Mnist、Cifar10）

視覺集視覺資料庫是用來提供給圖片識別領域用素材，目前各個教材常用的主要有手寫數字識別庫、10中小圖片分類庫，詳細介紹如下： Mnist &

tensorflow實現貓狗大戰（分類算法）

sse sin output 行操作 ogr cast bytes 序列 raw 本次使用了tensorflow高級API在規範化網絡編程做出了嘗試。第一步：準備好需要的庫 tensorflow-gpu 1.8.0 opencv-python 3.3.1 nu

利用resnet 做kaggle貓狗大戰影象識別，秒上98準確率

1、資料介紹這份資料集來源於Kaggle，資料集有12500只貓和12500只狗。在這裡簡單介紹下整體思路 1、1從圖片中直接訓練一個小網路（作為基準方法），也就是普通的cnn方法 2、2後面我會用到最新的預訓練好的resnet等方法進行訓練 2

opencv3實現簡單的數字影象識別（KNN）

正在用opencv3做一個數字影象識別的小專案，要用到KNN，但是不熟悉它的介面，因此，借鑑了大佬的部落格，基本照搬了程式碼，程式碼如下：大佬的連結如下：http://www.cnblogs.com/denny402/p/5033898.html // knnrecognizenum

從零開始學caffe（七）：利用GoogleNet實現影象識別

一、準備模型在這裡，我們利用已經訓練好的Googlenet進行物體影象的識別，進入Googlenet的GitHub地址，進入models資料夾，選擇Googlenet 點選Googlenet的模型下載地址下載該模型到電腦中。模型結構在這裡，我們利用之前講

從零開始學caffe（九）：在Windows下實現影象識別

本系列文章主要介紹了在win10系統下caffe的安裝編譯，運用CPU和GPU完成簡單的小專案，文章之間具有一定延續性。 step1:準備資料集資料集是進行深度學習的第一步，在這裡我們從以下五個連結中下載所需要的資料集： animal flower plane hou

貓狗大戰-caffe模型訓練例項&NSDK識別執行

首先【caffe-Windows】安裝，請參考https://blog.csdn.net/zb1165048017/article/details/51355143（博主寫的非常詳細，避免了很多坑）注：如果出現libcaffe.lib無法開啟的錯誤且按照博主寫的策略無法解決，

如何用keras cnn 做kaggle貓狗大戰圖片識別

1、資料簡介這份資料集來源於Kaggle，原資料集有12500只貓和12500只狗，因單機cpu跑，這裡我只選取了2000張圖片，工具用的是基於tensorflow的keras。資料如下所示： 2、資料預處理這部分程式碼如下： impor

基於 TensorFlow 的影象識別（R實現）

提到機器學習，深度學習這些，大家都會立馬想起Python。但R的實力也不容小覷。今天就用R來演示一個基於TensorFlow的影象識別的例子。如果你想執行這些程式碼，就必須先安裝配置好TensorFlow，我是在Linux系統上面執行的。如何配置TensorFlow儘量看看官

Python實現對12500張貓狗影象的精準分類

微信公眾號關鍵字全網搜尋最新排名【機器學習演算法】：排名第一【機器學習】：排名第一【Python

luogu P1489 貓狗大戰

經典 main while 輸出格式 badge 輸入格式 pan for getch 題目描述新一年度的貓狗大戰通過SC(星際爭霸)這款經典的遊戲來較量，野貓和飛狗這對冤家為此已經準備好久了，為了使戰爭更有難度和戲劇性，雙方約定只能選擇Terran(人族)並且只能造機

貓狗大戰的TFrecord數據集制作

AD load example std contest from string listdir label import tensorflow as tfimport numpy as npimport osfrom PIL import Image#沒有下面兩句德華會出現

貓狗大戰

IT img 以及給定 || 足夠 span 星際技術分享新一年度的貓狗大戰通過SC(星際爭霸)這款經典的遊戲來較量，野貓和飛狗這對冤家為此已經準備好久了，為了使戰爭更有難度和戲劇性，雙方約定只能選擇Terran(人族)並且只能造機槍兵。比賽開始了，很快，野貓已

Python使用tensorflow實現影象識別（貓狗大戰）-02

相關推薦