[TensorFlow] Cats vs. Dogs: Binary Classification

阿新 • Published: 2019-01-06

https://blog.csdn.net/caicai2526/article/details/75329812

https://blog.csdn.net/caicai2526/article/details/75330192

https://blog.csdn.net/wsLJQian/article/details/78091425

Implementing binary classification of cats and dogs:

input_data.py

# coding=utf-8

#%%

import tensorflow as tf
import numpy as np
import os

#%%

# you need to change this to your data directory
#train_dir = '/home/kevin/tensorflow/cats_vs_dogs/data/train/'
#train_dir = '/home/twinkle/PycharmProjects/AlexNet_CatVSDog/01 cats vs dogs/data/train/'

def get_files(file_dir):
    '''
    Args:
        file_dir: file directory
    Returns:
        list of images and labels
    '''
    cats = []
    label_cats = []
    dogs = []
    label_dogs = []
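    for file in os.listdir(file_dir):
        # Kaggle file names look like 'cat.0.jpg' / 'dog.0.jpg'
        name = file.split('.')
        if name[0] == 'cat':
            cats.append(os.path.join(file_dir, file))
            label_cats.append(0)
        elif name[0] == 'dog':
            # any other file (e.g. hidden system files) is skipped
            dogs.append(os.path.join(file_dir, file))
            label_dogs.append(1)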
    print('There are %d cats\nThere are %d dogs' %(len(cats), len(dogs)))
    
    image_list = np.hstack((cats, dogs))
    label_list = np.hstack((label_cats, label_dogs))
    
    temp = np.array([image_list, label_list])
    temp = temp.transpose()
    np.random.shuffle(temp)
    
    image_list = list(temp[:, 0])
    label_list = list(temp[:, 1])
    label_list = [int(i) for i in label_list]
    
    
    return image_list, label_list


#%%

def get_batch(image, label, image_W, image_H, batch_size, capacity):
    '''
    Args:
        image: list type
        label: list type
        image_W: image width
        image_H: image height
        batch_size: batch size
        capacity: the maximum elements in queue
    Returns:
        image_batch: 4D tensor [batch_size, height, width, 3], dtype=tf.float32
        label_batch: 1D tensor [batch_size], dtype=tf.int32
    '''
    
    image = tf.cast(image, tf.string)
    label = tf.cast(label, tf.int32)

    # make an input queue
    input_queue = tf.train.slice_input_producer([image, label])
    
    label = input_queue[1]
    image_contents = tf.read_file(input_queue[0])
    image = tf.image.decode_jpeg(image_contents, channels=3)
    
    ######################################
    # data augmentation should go here
    ######################################
    
    # resize_image_with_crop_or_pad takes (image, target_height, target_width)
    image = tf.image.resize_image_with_crop_or_pad(image, image_H, image_W)
    
    # If you want to inspect the generated batches as normal-looking images,
    # comment out the standardization line below and the final cast
    # (image_batch = tf.cast(image_batch, tf.float32)).
    # Do not comment them out when training!
    image = tf.image.per_image_standardization(image)
    
    image_batch, label_batch = tf.train.batch([image, label],
                                                batch_size= batch_size,
                                                num_threads= 64, 
                                                capacity = capacity)
    
    # you can also use shuffle_batch
#    image_batch, label_batch = tf.train.shuffle_batch([image, label],
#                                                      batch_size=batch_size,
#                                                      num_threads=64,
#                                                      capacity=capacity,
#                                                      min_after_dequeue=capacity-1)
    
    label_batch = tf.reshape(label_batch, [batch_size])
    image_batch = tf.cast(image_batch, tf.float32)
    
    return image_batch, label_batch


 
#%% TEST
# To test the generated batches of images
# When training the model, keep the following code commented out




#import matplotlib.pyplot as plt
#
#BATCH_SIZE = 2
#CAPACITY = 256
#IMG_W = 208
#IMG_H = 208
#
#train_dir = '/home/kevin/tensorflow/cats_vs_dogs/data/train/'
#
#image_list, label_list = get_files(train_dir)
#image_batch, label_batch = get_batch(image_list, label_list, IMG_W, IMG_H, BATCH_SIZE, CAPACITY)
#
#with tf.Session() as sess:
#    i = 0
#    coord = tf.train.Coordinator()
#    threads = tf.train.start_queue_runners(coord=coord)
#    
#    try:
#        while not coord.should_stop() and i<1:
#            
#            img, label = sess.run([image_batch, label_batch])
#            
#            # just test one batch
#            for j in np.arange(BATCH_SIZE):
#                print('label: %d' %label[j])
#                plt.imshow(img[j,:,:,:])
#                plt.show()
#            i+=1
#            
#    except tf.errors.OutOfRangeError:
#        print('done!')
#    finally:
#        coord.request_stop()
#    coord.join(threads)


#%%

#############################################################
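
The labelling rule in get_files can be tried out without TensorFlow. The sketch below is an illustration only, not part of the original input_data.py: the helper names `label_from_filename` and `build_lists` and the sample file names are assumptions, and it presumes the Kaggle naming scheme (`cat.0.jpg`, `dog.0.jpg`). It also shows that shuffling (path, label) pairs together keeps every image aligned with its label.

```python
import random


def label_from_filename(filename):
    """Return 0 for cat.*, 1 for dog.*, None for anything else (hypothetical helper)."""
    prefix = filename.split('.')[0]
    if prefix == 'cat':
        return 0
    if prefix == 'dog':
        return 1
    return None


def build_lists(filenames, seed=None):
    """Label each file, drop unknown files, and shuffle paths and labels in unison."""
    pairs = [(name, label_from_filename(name)) for name in filenames]
    pairs = [(name, label) for name, label in pairs if label is not None]
    random.Random(seed).shuffle(pairs)
    image_list = [name for name, _ in pairs]
    label_list = [label for _, label in pairs]
    return image_list, label_list


# Made-up directory listing; '.DS_Store' stands in for any stray file.
sample = ['cat.0.jpg', 'dog.0.jpg', 'cat.1.jpg', '.DS_Store', 'dog.1.jpg']
images, labels = build_lists(sample, seed=0)
```

Shuffling pairs rather than two separate lists serves the same purpose as the transpose-and-shuffle step in get_files, without converting the integer labels to strings and back.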

model.py

# coding=utf-8


#%%

import tensorflow as tf

#%%
def inference(images, batch_size, n_classes):
    '''Build the model
    Args:
        images: image batch, 4D tensor, tf.float32, [batch_size, height, width, channels]
    Returns:
        output tensor with the computed logits, float, [batch_size, n_classes]
    '''
    #conv1, shape = [kernel size, kernel size, channels, kernel numbers]
    
    with tf.variable_scope('conv1') as scope:
        weights = tf.get_variable('weights', 
                                  shape = [3,3,3, 16],
                                  dtype = tf.float32, 
                                  initializer=tf.truncated_normal_initializer(stddev=0.1,dtype=tf.float32))
        biases = tf.get_variable('biases', 
                                 shape=[16],
                                 dtype=tf.float32,
                                 initializer=tf.constant_initializer(0.1))
        conv = tf.nn.conv2d(images, weights, strides=[1,1,1,1], padding='SAME')
        pre_activation = tf.nn.bias_add(conv, biases)
        conv1 = tf.nn.relu(pre_activation, name= scope.name)
    
    #pool1 and norm1   
    with tf.variable_scope('pooling1_lrn') as scope:
        pool1 = tf.nn.max_pool(conv1, ksize=[1,3,3,1],strides=[1,2,2,1],
                               padding='SAME', name='pooling1')
        norm1 = tf.nn.lrn(pool1, depth_radius=4, bias=1.0, alpha=0.001/9.0,
                          beta=0.75,name='norm1')
    
    #conv2
    with tf.variable_scope('conv2') as scope:
        weights = tf.get_variable('weights',
                                  shape=[3,3,16,16],
                                  dtype=tf.float32,
                                  initializer=tf.truncated_normal_initializer(stddev=0.1,dtype=tf.float32))
        biases = tf.get_variable('biases',
                                 shape=[16], 
                                 dtype=tf.float32,
                                 initializer=tf.constant_initializer(0.1))
        conv = tf.nn.conv2d(norm1, weights, strides=[1,1,1,1],padding='SAME')
        pre_activation = tf.nn.bias_add(conv, biases)
        conv2 = tf.nn.relu(pre_activation, name='conv2')
    
    
    #pool2 and norm2
    with tf.variable_scope('pooling2_lrn') as scope:
        norm2 = tf.nn.lrn(conv2, depth_radius=4, bias=1.0, alpha=0.001/9.0,
                          beta=0.75,name='norm2')
        pool2 = tf.nn.max_pool(norm2, ksize=[1,3,3,1], strides=[1,1,1,1],
                               padding='SAME',name='pooling2')
    
    
    #local3
    with tf.variable_scope('local3') as scope:
        reshape = tf.reshape(pool2, shape=[batch_size, -1])
        dim = reshape.get_shape()[1].value
        weights = tf.get_variable('weights',
                                  shape=[dim,128],
                                  dtype=tf.float32,
                                  initializer=tf.truncated_normal_initializer(stddev=0.005,dtype=tf.float32))
        biases = tf.get_variable('biases',
                                 shape=[128],
                                 dtype=tf.float32, 
                                 initializer=tf.constant_initializer(0.1))
        local3 = tf.nn.relu(tf.matmul(reshape, weights) + biases, name=scope.name)    
    
    #local4
    with tf.variable_scope('local4') as scope:
        weights = tf.get_variable('weights',
                                  shape=[128,128],
                                  dtype=tf.float32, 
                                  initializer=tf.truncated_normal_initializer(stddev=0.005,dtype=tf.float32))
        biases = tf.get_variable('biases',
                                 shape=[128],
                                 dtype=tf.float32,
                                 initializer=tf.constant_initializer(0.1))
        local4 = tf.nn.relu(tf.matmul(local3, weights) + biases, name='local4')
     
        
    # softmax
    with tf.variable_scope('softmax_linear') as scope:
        weights = tf.get_variable('softmax_linear',
                                  shape=[128, n_classes],
                                  dtype=tf.float32,
                                  initializer=tf.truncated_normal_initializer(stddev=0.005,dtype=tf.float32))
        biases = tf.get_variable('biases', 
                                 shape=[n_classes],
                                 dtype=tf.float32, 
                                 initializer=tf.constant_initializer(0.1))
        softmax_linear = tf.add(tf.matmul(local4, weights), biases, name='softmax_linear')
    
    return softmax_linear

#%%
def losses(logits, labels):
    '''Compute loss from logits and labels
    Args:
        logits: logits tensor, float, [batch_size, n_classes]
        labels: label tensor, tf.int32, [batch_size]
        
    Returns:
        loss tensor of float type
    '''
    with tf.variable_scope('loss') as scope:
        cross_entropy = tf.nn.sparse_softmax_cross_entropy_with_logits\
                        (logits=logits, labels=labels, name='xentropy_per_example')
        loss = tf.reduce_mean(cross_entropy, name='loss')
        tf.summary.scalar(scope.name+'/loss', loss)
    return loss

#%%
def trainning(loss, learning_rate):
    '''Training ops, the Op returned by this function is what must be passed to 
        'sess.run()' call to cause the model to train.
        
    Args:
        loss: loss tensor, from losses()
        
    Returns:
        train_op: The op for training
    '''
    with tf.name_scope('optimizer'):
        optimizer = tf.train.AdamOptimizer(learning_rate= learning_rate)
        global_step = tf.Variable(0, name='global_step', trainable=False)
        train_op = optimizer.minimize(loss, global_step= global_step)
    return train_op

#%%
def evaluation(logits, labels):
  """Evaluate the quality of the logits at predicting the label.
  Args:
    logits: Logits tensor, float - [batch_size, NUM_CLASSES].
    labels: Labels tensor, int32 - [batch_size], with values in the
      range [0, NUM_CLASSES).
  Returns:
    A scalar float16 tensor with the fraction of examples in the batch
    that were predicted correctly.
  """
  with tf.variable_scope('accuracy') as scope:
      correct = tf.nn.in_top_k(logits, labels, 1)
      correct = tf.cast(correct, tf.float16)
      accuracy = tf.reduce_mean(correct)
      tf.summary.scalar(scope.name+'/accuracy', accuracy)
  return accuracy

#%%

##############################################
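
The size of the first fully connected layer (local3) follows from the input size and the strides used in inference. The sketch below is a rough check under stated assumptions: it relies on the rule that a layer with padding='SAME' produces ceil(input / stride) outputs per spatial dimension, and the helper names `same_padding_out` and `flattened_dim` are made up for illustration.

```python
import math


def same_padding_out(size, stride):
    """Spatial output size of a conv or pool layer that uses padding='SAME'."""
    return math.ceil(size / stride)


def flattened_dim(height, width, channels=16):
    """Length of one flattened example after pool2 in the model above."""
    # Strides in order: conv1 (1), pool1 (2), conv2 (1), pool2 (1)
    for stride in (1, 2, 1, 1):
        height = same_padding_out(height, stride)
        width = same_padding_out(width, stride)
    return height * width * channels


dim = flattened_dim(208, 208)      # 104 * 104 * 16
local3_weights = dim * 128         # weight matrix of local3 is [dim, 128]
print(dim, local3_weights)
```

For 208x208 inputs this gives 173056 features per image and about 22 million weights in local3 alone, because pool2 uses stride 1 and does not shrink the feature map.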

training.py

# coding=utf-8
#%%

import os
import numpy as np
import tensorflow as tf
import input_data
import model

#%%

N_CLASSES = 2
IMG_W = 208  # resize the image, if the input image is too large, training will be very slow.
IMG_H = 208
#BATCH_SIZE = 16
BATCH_SIZE = 16
CAPACITY = 2000
#MAX_STEP = 10000 # with current parameters, it is suggested to use MAX_STEP>10k
MAX_STEP = 1000000 # with current parameters, it is suggested to use MAX_STEP>10k
learning_rate = 0.0001 # with current parameters, it is suggested to use learning rate<0.0001


#%%
def run_training():

    # you need to change the directories to yours.
    train_dir = '/home/twinkle/PycharmProjects/AlexNet_CatVSDog/01 cats vs dogs/data/train/'
    logs_train_dir = '/home/twinkle/PycharmProjects/AlexNet_CatVSDog/01 cats vs dogs/logs/train/'

    train, train_label = input_data.get_files(train_dir)

    train_batch, train_label_batch = input_data.get_batch(train,
                                                          train_label,
                                                          IMG_W,
                                                          IMG_H,
                                                          BATCH_SIZE,
                                                          CAPACITY)
    train_logits = model.inference(train_batch, BATCH_SIZE, N_CLASSES)
    train_loss = model.losses(train_logits, train_label_batch)
    train_op = model.trainning(train_loss, learning_rate)
    train__acc = model.evaluation(train_logits, train_label_batch)

    summary_op = tf.summary.merge_all()
    sess = tf.Session()
    train_writer = tf.summary.FileWriter(logs_train_dir, sess.graph)
    saver = tf.train.Saver()

    sess.run(tf.global_variables_initializer())
    coord = tf.train.Coordinator()
    threads = tf.train.start_queue_runners(sess=sess, coord=coord)

    try:
        for step in np.arange(MAX_STEP):
            if coord.should_stop():
                break
            _, tra_loss, tra_acc = sess.run([train_op, train_loss, train__acc])

            if step % 50 == 0:
                print('Step %d, train loss = %.2f, train accuracy = %.2f%%' %(step, tra_loss, tra_acc*100.0))
                summary_str = sess.run(summary_op)
                train_writer.add_summary(summary_str, step)

            if step % 2000 == 0 or (step + 1) == MAX_STEP:
                checkpoint_path = os.path.join(logs_train_dir, 'model.ckpt')
                saver.save(sess, checkpoint_path, global_step=step)

    except tf.errors.OutOfRangeError:
        print('Done training -- epoch limit reached')
    finally:
        coord.request_stop()

    coord.join(threads)
    sess.close()


#%% Evaluate one image
# when training, comment out the following code.


from PIL import Image
import matplotlib.pyplot as plt

def get_one_image(train):
   '''Randomly pick one image from training data
   Return: ndarray
   '''
   n = len(train)
   ind = np.random.randint(0, n)
   img_dir = train[ind]

   image = Image.open(img_dir)
   #plt.imshow(image)
   image.show()
   print('show %d picture' %(ind))

   image = image.resize([208, 208])
   image = np.array(image)
   return image

def evaluate_one_image():
   '''Test one image against the saved models and parameters
   '''

   # you need to change the directories to yours.
   train_dir = '/home/twinkle/PycharmProjects/AlexNet_CatVSDog/01 cats vs dogs/data/train/'
   train, train_label = input_data.get_files(train_dir)
   image_array = get_one_image(train)

   with tf.Graph().as_default():
       BATCH_SIZE = 1
       N_CLASSES = 2

       # Feed the image through a placeholder so that the value passed in
       # feed_dict is what the network actually sees.
       x = tf.placeholder(tf.float32, shape=[208, 208, 3])
       image = tf.image.per_image_standardization(x)
       image = tf.reshape(image, [1, 208, 208, 3])
       logit = model.inference(image, BATCH_SIZE, N_CLASSES)

       logit = tf.nn.softmax(logit)

       # you need to change the directories to yours.
       logs_train_dir = '/home/twinkle/PycharmProjects/AlexNet_CatVSDog/01 cats vs dogs/logs/train/'

       saver = tf.train.Saver()

       with tf.Session() as sess:

           print("Reading checkpoints...")
           ckpt = tf.train.get_checkpoint_state(logs_train_dir)
           if ckpt and ckpt.model_checkpoint_path:
               global_step = ckpt.model_checkpoint_path.split('/')[-1].split('-')[-1]
               saver.restore(sess, ckpt.model_checkpoint_path)
               print('Loading success, global_step is %s' % global_step)
           else:
               print('No checkpoint file found')
               # without restored weights the variables are uninitialized
               return

           prediction = sess.run(logit, feed_dict={x: image_array})
           max_index = np.argmax(prediction)
           if max_index==0:
               print('This is a cat with possibility %.6f' % prediction[0, 0])
           else:
               print('This is a dog with possibility %.6f' % prediction[0, 1])


#%%

evaluate_one_image()
#run_training()
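
evaluate_one_image recovers the training step from the checkpoint file name, which has the form `model.ckpt-<step>` because run_training passes global_step to saver.save. A minimal sketch of that parsing, assuming this naming pattern; the helper name `step_from_checkpoint_path` and the sample path are hypothetical:

```python
import os


def step_from_checkpoint_path(path):
    """Extract the integer step from a path such as '.../model.ckpt-2000'."""
    basename = os.path.basename(path)        # 'model.ckpt-2000'
    return int(basename.split('-')[-1])      # text after the last hyphen


example = 'logs/train/model.ckpt-2000'       # made-up path for illustration
step = step_from_checkpoint_path(example)
```

Taking the base name first keeps hyphens in directory names from interfering with the split.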

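File tree: the training data must be downloaded separately.

Open a terminal in the directory that contains training.py and run training.py.

The last two lines of the file are:

#evaluate_one_image()
#run_training()

Uncomment the one you need to run training or evaluation; the model is saved under logs.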
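
As an alternative to editing comments each time, the choice could be made on the command line. The sketch below is one possible arrangement, not the original author's code: the `--mode` flag is an assumption, and the two functions are placeholders standing in for run_training and evaluate_one_image from training.py.

```python
import argparse


def run_training():
    # Placeholder for run_training() in training.py
    return 'training'


def evaluate_one_image():
    # Placeholder for evaluate_one_image() in training.py
    return 'evaluation'


def main(argv=None):
    """Dispatch to training or evaluation based on a hypothetical --mode flag."""
    parser = argparse.ArgumentParser(description='Cats vs. dogs runner (sketch)')
    parser.add_argument('--mode', choices=['train', 'eval'], default='eval')
    args = parser.parse_args(argv)
    if args.mode == 'train':
        return run_training()
    return evaluate_one_image()


result = main(['--mode', 'train'])
```

With such a dispatcher in place, training would be started with `python training.py --mode train` and evaluation with `python training.py --mode eval`.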
