
Fixing trainable=False having no effect when mixing Keras and TensorFlow

This is a problem I ran into recently. Let me describe it first:

I have a pretrained model (VGG16, for example) that I want to modify, e.g. by adding a fully connected layer on top. For various reasons I can only optimize the model with TensorFlow directly, and a TF optimizer by default updates every variable in tf.trainable_variables(). That is exactly where the problem lies: even though the VGG16 layers are set to trainable=False, the TF optimizer still updates the VGG16 weights.
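Before reproducing it with VGG16, here is a minimal TF 1.x sketch (two toy variables, nothing from the model above) that shows the default behaviour in question: when minimize() is called without var_list, it builds update ops for everything in the graph's TRAINABLE_VARIABLES collection.

import tensorflow as tf

# one variable we intend to freeze, one we intend to train
frozen = tf.Variable(1.0, name='frozen')
head = tf.Variable(1.0, name='head')
loss = tf.square(frozen * head - 2.0)

# no var_list given: the optimizer defaults to tf.trainable_variables(),
# so it creates update ops for BOTH variables
train_step = tf.train.AdamOptimizer().minimize(loss)

with tf.Session() as sess:
  sess.run(tf.global_variables_initializer())
  before = sess.run(frozen)
  sess.run(train_step)
  print(before, sess.run(frozen))  # 'frozen' has moved as well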

That is the problem. After some searching on Google and Baidu I finally found a solution; let's reconstruct the whole problem step by step.

trainable=False has no effect

First, load the pretrained VGG16 model and set trainable=False on it:

from keras.applications import VGG16
import tensorflow as tf
from keras import layers
# load the pretrained model
base_model = VGG16(include_top=False)
# list the trainable variables
tf.trainable_variables()
[<tf.Variable 'block1_conv1/kernel:0' shape=(3, 3, 3, 64) dtype=float32_ref>,
 <tf.Variable 'block1_conv1/bias:0' shape=(64,) dtype=float32_ref>,
 <tf.Variable 'block1_conv2/kernel:0' shape=(3, 3, 64, 64) dtype=float32_ref>,
 ...
 <tf.Variable 'block5_conv3_1/kernel:0' shape=(3, 3, 512, 512) dtype=float32_ref>,
 <tf.Variable 'block5_conv3_1/bias:0' shape=(512,) dtype=float32_ref>]
# set trainable=False on every layer
# (base_model.trainable = False also seems to work)
for layer in base_model.layers:
  layer.trainable = False

After setting trainable=False, list the trainable variables again: nothing has changed, i.e. the setting has no effect.

# list the trainable variables again
tf.trainable_variables()

[<tf.Variable 'block1_conv1/kernel:0' shape=(3, 3, 3, 64) dtype=float32_ref>,
 <tf.Variable 'block1_conv1/bias:0' shape=(64,) dtype=float32_ref>,
 ...
 <tf.Variable 'block5_conv3_1/bias:0' shape=(512,) dtype=float32_ref>]
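To be precise about what the flag does and does not do (a small check, assuming the base_model from above is still in scope): Keras itself honours layer.trainable and moves the weights to non_trainable_weights, but the variables were registered in the graph's TRAINABLE_VARIABLES collection when the layers were built, Keras never removes them from that collection, and that collection is the only thing tf.trainable_variables() reads.

# Keras's own bookkeeping does reflect the flag ...
print(len(base_model.trainable_weights))      # 0 after the loop above
print(len(base_model.non_trainable_weights))  # 26 kernels/biases in the conv base
# ... but the graph collection read by TF optimizers is untouched
print(len(tf.trainable_variables()))          # still lists every VGG16 variable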

The solution

The solution is to create a variable_scope when loading the pretrained model, put the variables that do need training under a different variable_scope, fetch those variables with tf.get_collection, and finally tell the TF optimizer which variables to train via its var_list argument.

from keras import models
with tf.variable_scope('base_model'):
  base_model = VGG16(include_top=False,input_shape=(224,224,3))
with tf.variable_scope('xxx'):
  model = models.Sequential()
  model.add(base_model)
  model.add(layers.Flatten())
  model.add(layers.Dense(10))

# fetch the variables that should be trained
trainable_var = tf.get_collection(tf.GraphKeys.TRAINABLE_VARIABLES,'xxx')
trainable_var

[<tf.Variable 'xxx_2/dense_1/kernel:0' shape=(25088,10) dtype=float32_ref>,
<tf.Variable 'xxx_2/dense_1/bias:0' shape=(10,) dtype=float32_ref>]

# define a TF optimizer step; assume we already have a loss
loss = model.output / 2  # arbitrary, just for demonstration
train_step = tf.train.AdamOptimizer().minimize(loss, var_list=trainable_var)
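To actually run this step, one way (a sketch under the assumption that you reuse the session Keras created, so the pretrained weights are not wiped; x_batch is a made-up random batch, not part of the original article) looks like this:

import numpy as np
from keras import backend as K

# reuse the session Keras is already using, so the VGG16 weights
# that Keras loaded are kept
sess = K.get_session()

# initialize only what is still uninitialized (mainly the Adam slot variables);
# a plain global_variables_initializer() here would wipe the VGG16 weights
uninitialized = [v for v in tf.global_variables()
                 if not sess.run(tf.is_variable_initialized(v))]
sess.run(tf.variables_initializer(uninitialized))

# x_batch is just a dummy batch for demonstration
x_batch = np.random.rand(2, 224, 224, 3).astype('float32')
sess.run(train_step, feed_dict={model.input: x_batch})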

Summary

When mixing Keras and TensorFlow, setting trainable=False on the Keras side has no effect as far as TensorFlow is concerned.

The fix is to separate the variables with variable_scope, then fetch the ones that should be trained with tf.get_collection, and finally pass them to the TF optimizer through var_list.
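As a final sanity check (continuing the sketch above, with the same sess, model and x_batch), one can verify that a training step leaves the frozen VGG16 kernels bit-for-bit unchanged while the new Dense kernel moves:

frozen_kernel = base_model.get_layer('block1_conv1').kernel  # a VGG16 weight
dense_kernel = model.layers[-1].kernel                       # the new Dense(10) weight

before_frozen, before_dense = sess.run([frozen_kernel, dense_kernel])
sess.run(train_step, feed_dict={model.input: x_batch})
after_frozen, after_dense = sess.run([frozen_kernel, dense_kernel])

print((before_frozen == after_frozen).all())  # expected: True, VGG16 untouched
print((before_dense == after_dense).all())    # expected: False, only the head updated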
