TensorFlow 2.0 - TFRecord儲存資料集、@tf.function圖執行模式、tf.TensorArray、tf.config分配GPU

阿新 • • 發佈：2021-02-02

技術標籤：TensorFlow

文章目錄

1. TFRecord 格式儲存

使用該種格式，更高效地進行大規模的模型訓練

import random
import os
import tensorflow as tf

# 使用前一節 kaggle 上的 貓狗資料集
train_data_dir = "./dogs-vs-cats/train/"
test_data_dir = 
 "./dogs-vs-cats/test/"

# 訓練檔案路徑
file_dir = [train_data_dir + filename for filename in os.listdir(train_data_dir)]
labels = [0 if filename[0] == 'c' else 1
          for filename in os.listdir(train_data_dir)]

# 打包並打亂
f_l = list(zip(file_dir, labels))
random.shuffle(f_l)
file_dir, labels = zip 
(*f_l)

# 切分訓練集，驗證集
valid_ratio = 0.1
idx = int((1 - valid_ratio) * len(file_dir))
train_files, valid_files = file_dir[:idx], file_dir[idx:]
train_labels, valid_labels = labels[:idx], labels[idx:]

# tfrecord 格式資料儲存路徑
train_tfrecord_file = "./dogs-vs-cats/train.tfrecords"
valid_tfrecord_file = 
 "./dogs-vs-cats/valid.tfrecords"

# -------------------看下面程式碼-----------------------------
# 儲存過程
# 預先定義一個寫入器
with tf.io.TFRecordWriter(path=train_tfrecord_file) as writer:
    # 遍歷原始資料
    for filename, label in zip(train_files, train_labels):
        img = open(filename, 'rb').read()  # 讀取圖片，img 是 Byte 型別的字串
        # 建立 feature 的 字典 k : v
        feature = {
            'image': tf.train.Feature(bytes_list=tf.train.BytesList(value=[img])),
            'label': tf.train.Feature(int64_list=tf.train.Int64List(value=[label]))
        }
        # feature 包裹成 example
        example = tf.train.Example(features=tf.train.Features(feature=feature))
        # example 序列化為字串，寫入
        writer.write(example.SerializeToString())

# -------------------看下面程式碼-----------------------------
# 讀取過程
# 讀取 tfrecord 資料，得到 tf.data.Dataset 物件
raw_train_dataset = tf.data.TFRecordDataset(train_tfrecord_file)
# 特徵的格式、資料型別
feature_description = {
    'image': tf.io.FixedLenFeature(shape=[], dtype=tf.string),
    'label': tf.io.FixedLenFeature([], tf.int64),
}


def _parse_example(example_string): # 解碼每個example
    # tf.io.parse_single_example 反序列化
    feature_dict = tf.io.parse_single_example(example_string, feature_description)
    # 影象解碼
    feature_dict['image'] = tf.io.decode_jpeg(feature_dict['image'])
    # 返回資料 X, y
    return feature_dict['image'], feature_dict['label']

# 處理資料集
train_dataset = raw_train_dataset.map(_parse_example)

import matplotlib.pyplot as plt
for img, label in train_dataset:
    plt.title('cat' if label==0 else 'dog')
    plt.imshow(img.numpy())
    plt.show()

2. tf.function 高效能

TF 2.0 預設 即時執行模式（Eager Execution），靈活、易除錯
追求高效能、部署模型時，使用圖執行模式（Graph Execution）
TF 2.0 的 tf.function 模組 + AutoGraph 機制，使用 @tf.function 修飾符，就可以將模型以圖執行模式執行

注意：@tf.function修飾的函式內，儘量只用 tf 的內建函式，變數只用 tensor、numpy 陣列

被修飾的函式 F(X, y) 可以呼叫get_concrete_function 方法，獲得計算圖

graph = F.get_concrete_function(X, y)

3. tf.TensorArray 支援計算圖特性

tf.TensorArray 支援計算圖模式的動態陣列

arr = tf.TensorArray(dtype=tf.int64, size=1, dynamic_size=True)
arr = arr.write(index=1, value=512)
# arr.write(index=0, value=512) # 沒有左值接受，會丟失
for i in range(arr.size()):
    print(arr.read(i))

4. tf.config 分配GPU

列出裝置 list_physical_devices

print('---device----')
gpus = tf.config.list_physical_devices(device_type='GPU')
cpus = tf.config.list_physical_devices(device_type='CPU')
print(gpus, "\n", cpus)

# 單個的 GPU, CPU
[PhysicalDevice(name='/physical_device:GPU:0', device_type='GPU')] 
 [PhysicalDevice(name='/physical_device:CPU:0', device_type='CPU')]

設定哪些可見 set_visible_devices

tf.config.set_visible_devices(devices=gpus[0:2], device_type='GPU')

或者

終端輸入 export CUDA_VISIBLE_DEVICES=2,3
or 程式碼中加入

import os
os.environ['CUDA_VISIBLE_DEVICES'] = "2,3"

指定程式只在顯示卡 2, 3 上執行

視訊記憶體使用策略：

gpus = tf.config.list_physical_devices(device_type='GPU')
for gpu in gpus:
	# 僅在需要時申請視訊記憶體
    tf.config.experimental.set_memory_growth(device=gpu, enable=True)

gpus = tf.config.list_physical_devices(device_type='GPU')
# 固定視訊記憶體使用上限，超出報錯
tf.config.set_logical_device_configuration(
    gpus[0],
    [tf.config.LogicalDeviceConfiguration(memory_limit=1024)])

單 GPU 模擬多 GPU 環境

在單GPU電腦上，寫多GPU 程式碼，可以模擬實現

gpus = tf.config.list_physical_devices('GPU')
tf.config.set_logical_device_configuration(
    gpus[0],
    [tf.config.LogicalDeviceConfiguration(memory_limit=2048),
     tf.config.LogicalDeviceConfiguration(memory_limit=2048)])
gpus = tf.config.list_logical_devices(device_type='GPU')
print(gpus)

輸出：2個虛擬的GPU

[LogicalDevice(name='/device:GPU:0', device_type='GPU'), 
 LogicalDevice(name='/device:GPU:1', device_type='GPU')]

TensorFlow 2.0 - TFRecord儲存資料集、@tf.function圖執行模式、tf.TensorArray、tf.config分配GPU

技術標籤：TensorFlow 文章目錄 1. TFRecord 格式儲存2. tf.function 高效能3. tf.TensorArray 支援計算圖特性4. tf.config 分配GPU

C#使用TensorFlow.NET訓練自己的資料集的方法

今天，我結合程式碼來詳細介紹如何使用 SciSharp STACK 的 TensorFlow.NET 來訓練CNN模型，該模型主要實現影象的分類，可以直接移植該程式碼在 CPU 或 GPU 下使用，並針對你們自己本地的影象資料集進行訓練和推理。

tensorflow 2.0模式下訓練的模型轉成 tf1.x 版本的pb模型例項

升級到tf 2.0後,訓練的模型想轉成1.x版本的.pb模型,但之前提供的通過ckpt轉pb模型的方法都不可用(因為儲存的ckpt不再有.meta)檔案,嘗試了好久,終於找到了一個方法可以迂迴轉到1.x版本的pb模型.

TensorFlow 2.0 快速搭建神經網路

tf.keras 是 TensorFlow2 引入的高度封裝框架，可以快速搭建神經網路模型。下面介紹一些常用API，更多內容可以參考官方文件：tensorflow

tensorflow2.0——手寫資料集預測

import tensorflow as tf import numpy as np import matplotlib.pylab as plt plt.rcParams[\"font.family\"] = \'SimHei\'# 將字型改為中文

tensorflow2.0——手寫資料集預測（全連線神經3層網路）

import tensorflow as tf import numpy as np from tensorflow.keras import datasets, layers, optimizers # 載入手寫數字資料

tensorflow2.0——手寫資料集預測完整版

import tensorflow as tf def preporocess(x,y): x = tf.cast(x,dtype=tf.float32) / 255 x = tf.reshape(x,(-1,28 *28))#鋪平

TensorFlow 2.0 快速入門指南 | iBooker·ApacheCN

原文：TensorFlow 2.0 Quick Start Guide 協議：CC BY-NC-SA 4.0 自豪地採用谷歌翻譯不要擔心自己的形象，只關心如何實現目標。——《原則》，生活原則 2.3.c

tensorflow yolov3訓練自己的資料集，詳細教程

這個教程是我在自己學習的過程中寫的，當作一個筆記，寫的比較詳細在github上下載yolov3的tensorflow1.0版本：https://github.com/YunYang1994/tensorflow-yolov3在19年12月，發現網上訓練的教程大部分似乎已經過時了

Tensorflow 2.0 mnist

# -- coding: utf-8 -- from __future__ import absolute_import from __future__ import division from __future__ import print_function

【吳恩達Tensorflow 2.0實踐課】2.2 Transfer learning

技術標籤：TensorFlow卷積深度學習tensorflow 2.2.1 Transfer learning - the concepts & coding

tf計算矩陣維度_【tf.matmul 致命錯誤】請謹慎使用tensorflow 2.0

技術標籤：tf計算矩陣維度 2020/1/11更新：在 tensorflow 2.1 及以上版本中，該bug已解決。

《利用Python進行資料分析》筆記---第2章--MovieLens 1M資料集

寫在前面的話：例項中的所有資料都是在GitHub上下載的，打包下載即可。地址是： [ http://github.com/pydata/pydata-book ](http://github.com/pydata/pydata-

《原神攻略》2.0版祕寶迷蹤活動詳情祕寶迷蹤活動時間、獎勵說明

《原神》的祕寶迷蹤活動即將開始，玩家可以通過參與活動兌換各種獎勵。下面請看由“西風快報員”帶來的《原神》2.0版祕寶迷蹤活動詳情，一起來看看吧。

【教程】使用TensorFlow物件檢測介面標註資料集

當為機器學習物件檢測和識別模型構建資料集時，為資料集中的所有影象生成標註非常耗時。而這些標註是訓練和測試模型所必需的，並且標註必須是準確的。因此，資料集中的所有影象都需要人為監督。不過，這並不意味著機

《聖歌》2.0 宣佈終止開發，遊戲伺服器將正常執行

2月25日訊息遊戲公司 BioWare 昨日在官網發文宣佈，將永久停止《聖歌》2.0 的開發，但《聖歌》的遊戲伺服器將繼續正常執行。

微信紅包封面元件 2.0 內測上線：首次支援背景圖自定義，5 分鐘倒計時

12 月 24 日訊息，據微信紅包封面釋出，在 2022 年來臨之際，品牌官方區正式推出紅包封面元件 2.0，圍繞節日氛圍，社交傳播和導流沉澱三大方面進行全面升級。跨年期間，搜一搜紅包封面 2.0 有 17 家品牌參與首發內測

華為 P50E 手機推送鴻蒙 HarmonyOS 2.0.1.130 更新：相機新增流光快門模式

感謝網友肖戰割割的線索投遞！

【2】TensorFlow光速入門-資料預處理（得到資料集）

本文地址：https://www.cnblogs.com/tujia/p/13862351.html 系列文章：【0】TensorFlow光速入門-序

使用TensorFlow Object Detection Api 進行環境搭建、訓練自定義的資料集、輸出模型、Android端使用模型目標檢測

技術標籤：機器學習計算機視覺移動端tensorflow神經網路機器學習深度學習一、環境搭建

TensorFlow 2.0 - TFRecord儲存資料集、@tf.function圖執行模式、tf.TensorArray、tf.config分配GPU

文章目錄

1. TFRecord 格式儲存

2. tf.function 高效能

3. tf.TensorArray 支援計算圖特性

4. tf.config 分配GPU

相關推薦