貓狗大戰2.0 使用tensorflow和tfrecord

阿新 • • 發佈：2018-12-19

距離上次的部落格已經過去了半個月兩週左右的時間，自己在b站和部落格上學習了很多相關的知識，自我感覺自己的tensorflow的水平已經算是到了入門的水平，在部落格上有相關的非tensorboard匯入資料，實測有效（傳送門由於時間久了，暫時找不到了，自己找一下吧）。不過通過tensorboard的可執行資料卻十分少，所以自己在此記錄一下自己的程式，也算是自己整理一下。

如果感覺有用，可以點一下關注喲！我會不定期更新一些自己學習的東西。

使用的IDE是vscode（python3.5）,資料下載可以在其他部落格中找一下。

首先工程圖如下：

Data資料夾下有train和test兩個資料夾，就是下載的資料集。Logs存放咱們所有的程式。

首先我們要建立建立tfrecord的檔案：

首先得到資料夾train下的所有檔名稱，貼出程式碼如下：

import tensorflow as tf
import numpy as np
import os

NumClass = 2
ImageWidth = 208
ImageHeight = 208
ImageChannel = 3

def get_files(file_dic):

    cats = []
    dogs = []
    cats_labels = []
    dogs_labels = []

    for file in os.listdir(file_dic):
        #得到檔案下所有的檔名稱，也就是cat.0jpg....

        name = file.split(sep='.')
        if name[0] == 'cat':
            cats.append(file_dic+'/'+file)
            cats_labels.append(0)
        if name[0] == 'dog':
            dogs.append(file_dic+'/'+file)
            dogs_labels.append(1)

    return cats, cats_labels, dogs, dogs_labels

cats, cats_labels, dogs, dogs_labels = get_files('G:/CatsAndDogs/data/train')
print(cats)

得到的輸出為：

['G:/CatsAndDogs/data/train/cat.0.jpg', 'G:/CatsAndDogs/data/train/cat.1.jpg'...

可以看到檔名稱都已匯入cats等列表當中，那麼接下來的操作就是把image和label的列表連線起來並打亂資料。程式碼貼出如下。

    #將檔案打亂順序
    image_list = np.hstack((cats, dogs)) #將兩個列表連線起來
    labels_list = np.hstack((cats_labels, dogs_labels))
    temp = np.array([image_list, labels_list])
    temp = temp.transpose() #轉置矩陣
    np.random.shuffle(temp) #打亂資料，下面的圖片是temp現在的資料


    ##從打亂的temp中再取出list，相當於洗牌之後的重新摸牌
    image_list = list(temp[:, 0])
    label_list = list(temp[:, 1])
    label_list = [int(i) for i in label_list]  # 字串型別轉換為int型別

現在temp程式跑到這的資料結果如下圖所示：

嗯，到現在為止也很成功，那我們接下來繼續操作，這個時候又變成了需要對整個列表的操作，所以我們需要整個列表的長度，再把相片的解碼出來。

from scipy.misc import imread,imresize #注意是得重新載入一個model

    #首先要確定整個列表的長度並且初始化相片的格式
    num_file = len(cats_labels) + len(dogs_labels)
    images = np.zeros((num_file, ImageHeight, ImageWidth, ImageChannel), dtype = np.uint8)

    for index in range(num_file):
        img = imread(image_list[index])
        img = imresize(img, (ImageWidth, ImageHeight))
        images[index] = img

接下來我們可以把這些相片，labels等資訊傳入一個類中，用來建立接下來的tdrecord檔案。

    class ImgData(object):
        pass
    
    result = ImgData()
    result.images = images
    result.labels = label_list
    result.num = num_file

到此我們就完成了第一個函式。此時得到了資料夾中亂序的所有照片及相應的標籤，暫時完成了第一步的工作。接下來我們就開始第二部的工作，完成tfrecord檔案的完成。

def convert(data, destination, destination1):
    """將圖片儲存為.tfrecords檔案
    引數:
        data: 上述函式返回的ImageData物件
        destination: 目標檔名
    """
 
    images = data.images
    labels = data.labels
    num_examples = data.num - 3000
                #使用上面使用的類進行下面的tfreord建立,照片標籤以及資料
 
    # 儲存的檔名
    filename = destination
                #儲存時的檔名（帶路徑的）
    
    # 使用TFRecordWriter來寫入資料
    writer = tf.python_io.TFRecordWriter(filename)
                #使用tfrecord進行寫入
    # 遍歷圖片
    for index in range(num_examples):
        # 轉為二進位制
        image = images[index].tostring()
        label = labels[index]
                #直接使用上面所建立的類進行輸入
                
        # tf.train下有Feature和Features，需要注意其區別
        # 層級關係為Example->Features->Feature（很重要）
        #注意圖片一般的型別為ByteList，其他的有Int64List和FloatList型
        example = tf.train.Example(features=tf.train.Features(feature={
            'image': tf.train.Feature(bytes_list=tf.train.BytesList(value=[image])),
            'label': tf.train.Feature(int64_list=tf.train.Int64List(value=[label]))
        }))
        # 寫入建立的example
        writer.write(example.SerializeToString())
    writer.close()
            
    filename1 = destination1
    writer = tf.python_io.TFRecordWriter(filename1)
    for index in range(3000):
        # 轉為二進位制
        image = images[index+num_examples].tostring()
        label = labels[index+num_examples]
        example = tf.train.Example(features=tf.train.Features(feature={
            'image': tf.train.Feature(bytes_list=tf.train.BytesList(value=[image])),
            'label': tf.train.Feature(int64_list=tf.train.Int64List(value=[label]))
        }))
        writer.write(example.SerializeToString())
    writer.close()    

convert(result, 'G:/CatsAndDogs/logs/traintfrecord', 'G:/CatsAndDogs/logs/testtfrecord')

上面的註釋很清楚，應該可以看懂，主要是有train和test兩個部分，不太懂的也可以百度查一下，應該可以找到相關的資料。接下來的就是解碼一下資料。

def read_and_decode(filename_queue, batch_size, capacity):
    """讀取.tfrecords檔案
    引數:
        filename_queue: 檔名, 一個列表
    返回:
        img, label: **單張圖片和對應標籤**
    """
    # 建立一個圖節點，該節點負責資料輸入
    filename_queue = tf.train.string_input_producer([filename_queue])
                # tf.train.string_input_producer函式把我們需要的全部檔案打包為一個tf內部的queue型別
                # 之後tf開檔案就從這個queue中取目錄了,要注意一點的是這個函式的shuffle引數預設是True
                # 所以讀取的順序可能不一樣
    reader = tf.TFRecordReader()
    _, serialized_example = reader.read(filename_queue)
                # 讀取前面所建立的資料目錄
    
    # 解析單個example
            # 暫時不大清楚下面的操作，不過感覺應該是得到各個features
    features = tf.parse_single_example(serialized_example, features={
        'image': tf.FixedLenFeature([], tf.string),
        'label': tf.FixedLenFeature([], tf.int64)
    })
            # tf.decode_raw函式的意思是將原來編碼為字串型別的變數重新變回來
            # 這個方法在資料集dataset中很常用，因為製作圖片源資料一般寫進tfrecord裡用to_bytes的形式，也就是字串
            # 這裡將原始資料取出來，必須制定原始資料的格式，原始資料是什麼格式這裡解析必須是什麼格式！！
            # tf.cast這個函式主要用於資料型別的轉變，不會改變原始資料的值還有形狀的
    image = tf.decode_raw(features['image'], tf.uint8)
    image = tf.reshape(image, [ImageHeight, ImageWidth, ImageChannel])
    image = tf.cast(image, tf.float32)
    label = tf.cast(features['label'], tf.int32)
    
    image_batch, label_batch = tf.train.batch([image, label],
                                              batch_size=batch_size,
                                              num_threads=64,  # 執行緒
                                              capacity=capacity)
    
    return image, label

上面是解析tfrecord的函式，那麼可以利用下面的程式進行train和test資料的獲得。

image, label = read_and_decode('G:/CatsAndDogs/logs/traintfrecord', 16, 2000) 
image1, label1 = read_and_decode('G:/CatsAndDogs/logs/testtfrecord', 16, 2000)

整體的函式如下：

import tensorflow as tf
import numpy as np
import os
from scipy.misc import imread,imresize

NumClass = 2
ImageWidth = 208
ImageHeight = 208
ImageChannel = 3

def get_files(file_dic):

    cats = []
    dogs = []
    cats_labels = []
    dogs_labels = []

    #得到檔案下所有的檔名稱，也就是cat.0jpg....
    for file in os.listdir(file_dic):   

        name = file.split(sep='.')
        if name[0] == 'cat':
            cats.append(file_dic+'/'+file)
            cats_labels.append(0)
        if name[0] == 'dog':
            dogs.append(file_dic+'/'+file)
            dogs_labels.append(1)
            
    num_file = len(cats_labels) + len(dogs_labels)
    images = np.zeros((num_file, ImageHeight, ImageWidth, ImageChannel), dtype = np.uint8)
    print("There are %d cats\nThere are %d dogs" % (len(cats), len(dogs)))

    #將檔案打亂順序
    image_list = np.hstack((cats, dogs))
    labels_list = np.hstack((cats_labels, dogs_labels))
    temp = np.array([image_list, labels_list])
    temp = temp.transpose()
    np.random.shuffle(temp)

    ##從打亂的temp中再取出list（img和lab）
    image_list = list(temp[:, 0])
    label_list = list(temp[:, 1])
    label_list = [int(i) for i in label_list]  # 字串型別轉換為int型別
    
    for index in range(num_file):
        img = imread(image_list[index])
        img = imresize(img, (ImageWidth, ImageHeight))
        images[index] = img
        
    class ImgData(object):
        pass
    
    result = ImgData()
    result.images = images
    result.labels = label_list
    result.num = num_file
    
    return result

def convert(data, destination, destination1):
    """將圖片儲存為.tfrecords檔案
    引數:
        data: 上述函式返回的ImageData物件
        destination: 目標檔名
    """
 
    images = data.images
    labels = data.labels
    num_examples = data.num - 3000
                #使用上面使用的類進行下面的tfreord建立,照片標籤以及資料
 
    # 儲存的檔名
    filename = destination
                #儲存時的檔名（帶路徑的）
    
    # 使用TFRecordWriter來寫入資料
    writer = tf.python_io.TFRecordWriter(filename)
                #使用tfrecord進行寫入
    # 遍歷圖片
    for index in range(num_examples):
        # 轉為二進位制
        image = images[index].tostring()
        label = labels[index]
                #直接使用上面所建立的類進行輸入
                
        # tf.train下有Feature和Features，需要注意其區別
        # 層級關係為Example->Features->Feature（很重要）
        #注意圖片一般的型別為ByteList，其他的有Int64List和FloatList型
        example = tf.train.Example(features=tf.train.Features(feature={
            'image': tf.train.Feature(bytes_list=tf.train.BytesList(value=[image])),
            'label': tf.train.Feature(int64_list=tf.train.Int64List(value=[label]))
        }))
        # 寫入建立的example
        writer.write(example.SerializeToString())
    writer.close()
            
    filename1 = destination1
    writer = tf.python_io.TFRecordWriter(filename1)
    for index in range(3000):
        # 轉為二進位制
        image = images[index+num_examples].tostring()
        label = labels[index+num_examples]
        example = tf.train.Example(features=tf.train.Features(feature={
            'image': tf.train.Feature(bytes_list=tf.train.BytesList(value=[image])),
            'label': tf.train.Feature(int64_list=tf.train.Int64List(value=[label]))
        }))
        writer.write(example.SerializeToString())
    writer.close()    

convert(result, 'G:/CatsAndDogs/logs/traintfrecord', 'G:/CatsAndDogs/logs/testtfrecord')


def read_and_decode(filename_queue, batch_size, capacity):
    """讀取.tfrecords檔案
    引數:
        filename_queue: 檔名, 一個列表
    返回:
        img, label: **單張圖片和對應標籤**
    """
    # 建立一個圖節點，該節點負責資料輸入
    filename_queue = tf.train.string_input_producer([filename_queue])
                # tf.train.string_input_producer函式把我們需要的全部檔案打包為一個tf內部的queue型別
                # 之後tf開檔案就從這個queue中取目錄了,要注意一點的是這個函式的shuffle引數預設是True
                # 所以讀取的順序可能不一樣
    reader = tf.TFRecordReader()
    _, serialized_example = reader.read(filename_queue)
                # 讀取前面所建立的資料目錄
    
    # 解析單個example
            # 暫時不大清楚下面的操作，不過感覺應該是得到各個features
    features = tf.parse_single_example(serialized_example, features={
        'image': tf.FixedLenFeature([], tf.string),
        'label': tf.FixedLenFeature([], tf.int64)
    })
            # tf.decode_raw函式的意思是將原來編碼為字串型別的變數重新變回來
            # 這個方法在資料集dataset中很常用，因為製作圖片源資料一般寫進tfrecord裡用to_bytes的形式，也就是字串
            # 這裡將原始資料取出來，必須制定原始資料的格式，原始資料是什麼格式這裡解析必須是什麼格式！！
            # tf.cast這個函式主要用於資料型別的轉變，不會改變原始資料的值還有形狀的
    image = tf.decode_raw(features['image'], tf.uint8)
    image = tf.reshape(image, [ImageHeight, ImageWidth, ImageChannel])
    image = tf.cast(image, tf.float32)
    label = tf.cast(features['label'], tf.int32)
    
    image_batch, label_batch = tf.train.batch([image, label],
                                              batch_size=batch_size,
                                              num_threads=64,  # 執行緒
                                              capacity=capacity)
    
    return image, label

image, label = read_and_decode('G:/CatsAndDogs/logs/traintfrecord', 16, 2000) 
image1, label1 = read_and_decode('G:/CatsAndDogs/logs/testtfrecord', 16, 2000)

不過要注意先用前兩個函式生成所需的tfrecord檔案才能跑第三個函式。

明天繼續剩下的部分。

貓狗大戰2.0 使用tensorflow和tfrecord

距離上次的部落格已經過去了半個月兩週左右的時間，自己在b站和部落格上學習了很多相關的知識，自我感覺自己的tensorflow的水平已經算是到了入門的水平，在部落格上有相關的非tensorboard匯入資料，實測有效（傳送門由於時間久了，暫時找不到了，自己找一下吧）

基於TensorFlow的Cats vs. Dogs（貓狗大戰）實現和詳解（2）

2. 卷積神經網路模型的構造——model.py 　　關於神經網路模型不想說太多，視訊中使用的模型是仿照TensorFlow的官方例程cifar-10的網路結構來寫的。就是兩個卷積層（每個卷積層後加一個池化層），兩個全連線層，最後一個softmax

Tensorflow學習筆記：資料集加工和轉化為TensorFlow專用格式——Finetuning，貓狗大戰，VGGNet的重新針對訓練

Kaggle 貓狗大戰貓狗大戰的資料集來源於Kaggle上的一個競賽：Dogs vs. Cats 貓狗大戰的資料集下載地址http://www.kaggle.com/c/dogs-vs-cats，其中資料集有12500只貓和12500只狗 ,官方資料集下載需要帳號，大

基於TensorFlow的Cats vs. Dogs（貓狗大戰）實現和詳解（1）

2017.5.29 　　官方的MNIST例子裡面訓練資料的下載和匯入都是用已經寫好的指令碼完成的，至於裡面實現細節也沒高興去看原始碼，感覺寫得太正式，我這個初學者不好理解。於是在優酷上找到了KevinRush這麼一個播主，裡面的視訊教程講得挺清晰的，於是跟著視

貓狗大戰的TFrecord數據集制作

AD load example std contest from string listdir label import tensorflow as tfimport numpy as npimport osfrom PIL import Image#沒有下面兩句德華會出現

tensorflow實現貓狗大戰（分類算法）

sse sin output 行操作 ogr cast bytes 序列 raw 本次使用了tensorflow高級API在規範化網絡編程做出了嘗試。第一步：準備好需要的庫 tensorflow-gpu 1.8.0 opencv-python 3.3.1 nu

tfrecord數據集訓練驗證-貓狗大戰

圖片大小 cat rac exc 兩個 bin span loss error: #!/usr/bin/env python # -*- coding:utf-8 -*- from mk_tfrecord import * #from model import * fr

Python使用tensorflow實現影象識別（貓狗大戰）-01

Python使用tensorflow實現影象識別（貓狗大戰）-01 import_data.py import tensorflow as tf import numpy as np import os #引入tensorflow、numpy、os 三個第三方模組 img_widt

Tensorflow學習筆記：VGG16模型——Finetuning，貓狗大戰，VGGNet的重新針對訓練

這一篇介紹一下VGG16模型的修改 Step 1: 對模型的修改首先是對模型的修改（VGG16_model.py檔案），在這裡原先的輸出結果是對1000個不同的類別進行判定，而在此是對2個影象，也就是貓和狗的判斷，因此首先第一步就是修改輸出層的全連線資料。

Tensorflow學習筆記：VGG16訓練——Finetuning，貓狗大戰，VGGNet的重新針對訓練

這篇介紹如何用資料對vgg16進行訓練 Finetuning最重要的一個步驟就是模型的重新訓練與儲存。首先對於模型的值的輸出，在類中已經做了定義，因此只需要將定義的模型類初始化後輸出賦予一個特定的變數即可。 vgg = model.vgg16(x_imgs)

Python使用tensorflow實現影象識別（貓狗大戰）-02

import tensorflow as tf def inference(images, batch_size, n_classes): # cov1, shape = [kernel size, kernel size, channels, ke

【TensorFlow】貓狗大戰——二分類

https://blog.csdn.net/caicai2526/article/details/75329812https://blog.csdn.net/caicai2526/article/details/75330192https://blog.csdn.net/ws

luogu P1489 貓狗大戰

經典 main while 輸出格式 badge 輸入格式 pan for getch 題目描述新一年度的貓狗大戰通過SC(星際爭霸)這款經典的遊戲來較量，野貓和飛狗這對冤家為此已經準備好久了，為了使戰爭更有難度和戲劇性，雙方約定只能選擇Terran(人族)並且只能造機

貓狗大戰

IT img 以及給定 || 足夠 span 星際技術分享新一年度的貓狗大戰通過SC(星際爭霸)這款經典的遊戲來較量，野貓和飛狗這對冤家為此已經準備好久了，為了使戰爭更有難度和戲劇性，雙方約定只能選擇Terran(人族)並且只能造機槍兵。比賽開始了，很快，野貓已

ASP.NET Core 2.0 IHostEnvironment和IApplicationLifetime介紹

pat onstop cat clas alt 監控 gis 開發 class IHostEnvironment獲取程序信息 public void Configure(IApplicationBuilder app, IHostingEnvironment env)

kaggle貓狗大戰之AlexNet(一)

這篇文章主要介紹如何利用AlexNet預訓練模型來訓練一個貓狗分類器，主要內容包括：專案結構介紹資料探索資料的準備 AlexNet模型的構建模型的訓練和效能評估結果的提交一、專案結構介紹 1、相關資料下載地址專案地址:http

我的貓狗大戰資料集圖片缺失處理

前面找了一份540M的貓狗大戰的資料集，想使用這個資料集在小型資料集上從頭開始訓練一個卷積神經網路，使用了其中的2500個樣本，這個貓狗大戰的資料集總的是25000張圖片，所以在前面2500張圖片缺失的時候我就自己從後面的資料集中拷貝圖片補齊前面的，但是發現缺失圖片比較多，手動去查詢太麻煩，所以乾

CentOS6.8下Nagios-4.2.0安裝和配置

因此 figure 問題 usermod linux文件 httpd的配置 pen kconfig etc 1實驗目標掌握Nagios的安裝 2實驗環境主機名：Nagios-Server 操作系統：CentOS release 6.8 (Final) IP地址：19

ASP.NET Core 2.0身份和角色管理入門

目錄介紹身份驗證和授權身份驗證授權背景先決條件使用程式碼第1步：建立資料庫第2步：建立ASP.NET Core 更新appsettings.json 步驟3：在Startup.cs檔案中新增Identity Service

貓狗大戰：融合了三種模型的Keras程式碼，準確率直升到99%

使用keras的resnet，inceptionV3，xception模型，首先載入預訓練模型的權重，通過預訓練權重生成對貓狗的訓練值和測試值的特徵向量預訓練模型下載地址：http://pan.baidu.com/s/1geHmOpH from ker

貓狗大戰2.0 使用tensorflow和tfrecord

相關推薦