Multi-layer LSTM in TensorFlow for MNIST handwritten digit recognition: how state and output relate in a multi-layer LSTM

阿新 • Published: 2019-01-12


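Input format: reshape batch_size*784 to batch_size*28*28. Each image becomes a sequence of 28 time steps, and each step carries the 28 grayscale values of one image row.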

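The network scans the handwritten digit row by row, extracts features from each row, links them together as a time series, and produces its classification at the end.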
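To make the reshape concrete, here is a minimal pure-Python sketch of what happens to one flattened image. The names `TIME_STEPS`, `INPUT_SIZE` and `to_sequence` are made up for illustration and do not appear in the original script; in the TensorFlow 1.x graph the same step would presumably be a `tf.reshape` of the input placeholder to `[-1, 28, 28]`.

```python
# Pure-Python sketch of the reshape; all names here are illustrative only.
TIME_STEPS = 28   # rows per image = sequence length
INPUT_SIZE = 28   # pixels per row = features per time step

def to_sequence(flat_image):
    """Split a flat list of 784 gray values into 28 rows of 28 values."""
    assert len(flat_image) == TIME_STEPS * INPUT_SIZE
    return [flat_image[r * INPUT_SIZE:(r + 1) * INPUT_SIZE]
            for r in range(TIME_STEPS)]

# A fake "image" whose pixel value is simply its flat index.
batch = [list(range(784)) for _ in range(2)]       # shape: 2 x 784
batch_seq = [to_sequence(img) for img in batch]    # shape: 2 x 28 x 28

print(len(batch_seq), len(batch_seq[0]), len(batch_seq[0][0]))  # 2 28 28
print(batch_seq[0][1][:3])  # first three pixels of row 1: [28, 29, 30]
```

Row `r` of the image is what the LSTM sees at time step `r`.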

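Network definition: a separate function builds a single cell, so that it can be called when constructing the MultiRNNCell that forms the multi-layer LSTM network.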

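# Assumption: TensorFlow 1.x; the import below is not shown in the original snippet.
from tensorflow.contrib import rnn

def get_a_cell(i):
    # One LSTM layer. HIDDEN_CELL, keep_prob and LSTM_LAYER are defined elsewhere in the script.
    lstm_cell = rnn.BasicLSTMCell(num_units=HIDDEN_CELL, forget_bias=1.0, state_is_tuple=True, name='layer_%s' % i)
    print(type(lstm_cell))
    # Dropout on the cell output only; the input is passed through unchanged.
    dropout_wrapped = rnn.DropoutWrapper(cell=lstm_cell, input_keep_prob=1.0, output_keep_prob=keep_prob)
    return dropout_wrapped

# Stack LSTM_LAYER cells in series (i.e. tf.nn.rnn_cell.MultiRNNCell).
multi_lstm = rnn.MultiRNNCell(cells=[get_a_cell(i) for i in range(LSTM_LAYER)],
                              state_is_tuple=True)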
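As a rough illustration of what `output_keep_prob` does to a cell's output, here is a pure-Python sketch of inverted dropout. This is not TensorFlow's code: the function name `inverted_dropout` and the sample values are invented, and the rescaling by `1 / keep_prob` reflects my understanding of how `tf.nn.dropout` behaves.

```python
import random

def inverted_dropout(values, keep_prob, rng):
    """Keep each value with probability keep_prob and rescale the survivors."""
    if keep_prob >= 1.0:
        return list(values)          # nothing is dropped
    return [v / keep_prob if rng.random() < keep_prob else 0.0
            for v in values]

rng = random.Random(0)
h = [0.5, -1.0, 2.0, 0.25]           # made-up cell output
print(inverted_dropout(h, 1.0, rng))  # unchanged
print(inverted_dropout(h, 0.5, rng))  # some entries zeroed, the rest doubled
```

With `keep_prob` fed as 1.0, as in the `sess.run` call later in this post, the wrapper leaves the output untouched.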

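In a multi-layer RNN the state differs from the single-layer case in a few details: every layer is a cell, every cell has its own state, and so each layer has its own LSTMStateTuple. (This example is a classification task, so only the last layer's output is used, but other tasks may well need the states of the intermediate layers.)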

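The cells are connected in series. state[-1] is the state of the last layer, and its h plays the role that the output plays in the single-layer case. I built three layers here, so state[-1] and state[2] are the same: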


outputs, state = tf.nn.dynamic_rnn(multi_lstm, inputs = tf_x_reshaped, initial_state = init_state, time_major = False)
print('state:',state)
print('state[0]:',state[0])#layer 0's LSTMStateTuple
print('state[1]:',state[1])#layer 1's LSTMStateTuple
print('state[2]:',state[2])#layer 2's LSTMStateTuple
print('state[-1]:',state[-1])#layer 2's LSTMStateTuple

state: (LSTMStateTuple(c=<tf.Tensor 'rnn/while/Exit_3:0' shape=(32, 256) dtype=float32>, h=<tf.Tensor 'rnn/while/Exit_4:0' shape=(32, 256) dtype=float32>), LSTMStateTuple(c=<tf.Tensor 'rnn/while/Exit_5:0' shape=(32, 256) dtype=float32>, h=<tf.Tensor 'rnn/while/Exit_6:0' shape=(32, 256) dtype=float32>), LSTMStateTuple(c=<tf.Tensor 'rnn/while/Exit_7:0' shape=(32, 256) dtype=float32>, h=<tf.Tensor 'rnn/while/Exit_8:0' shape=(32, 256) dtype=float32>))
state[0]: LSTMStateTuple(c=<tf.Tensor 'rnn/while/Exit_3:0' shape=(32, 256) dtype=float32>, h=<tf.Tensor 'rnn/while/Exit_4:0' shape=(32, 256) dtype=float32>)
state[1]: LSTMStateTuple(c=<tf.Tensor 'rnn/while/Exit_5:0' shape=(32, 256) dtype=float32>, h=<tf.Tensor 'rnn/while/Exit_6:0' shape=(32, 256) dtype=float32>)
state[2]: LSTMStateTuple(c=<tf.Tensor 'rnn/while/Exit_7:0' shape=(32, 256) dtype=float32>, h=<tf.Tensor 'rnn/while/Exit_8:0' shape=(32, 256) dtype=float32>)
state[-1]: LSTMStateTuple(c=<tf.Tensor 'rnn/while/Exit_7:0' shape=(32, 256) dtype=float32>, h=<tf.Tensor 'rnn/while/Exit_8:0' shape=(32, 256) dtype=float32>)
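The indexing in the printout above can be reproduced without TensorFlow. The sketch below uses a plain `namedtuple` as a stand-in for `LSTMStateTuple` (which, as far as I know, is itself a namedtuple with fields `c` and `h`) and strings in place of real tensors.

```python
from collections import namedtuple

# Stand-in for TensorFlow's LSTMStateTuple; strings replace real tensors.
LSTMStateTuple = namedtuple("LSTMStateTuple", ["c", "h"])

LSTM_LAYER = 3
state = tuple(LSTMStateTuple(c="c_layer_%d" % i, h="h_layer_%d" % i)
              for i in range(LSTM_LAYER))

print(state[-1] is state[2])         # True: with three layers, -1 and 2 are the same entry
print(state[-1][1] == state[-1].h)   # True: index 1 of a tuple is its h
print(state[0].h)                    # h_layer_0
```

So `state[i][1]` and `state[i].h` are two spellings of the same thing: the hidden state of layer `i` after the last time step.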

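Next, a comparison of outputs and the states. outputs corresponds to the h of the last layer's state (h_state_2 below). This is a classification task, i.e. a many-to-one (N vs 1) model, and time_major is False, so dimension 0 is the batch. To take the output of the last time step we therefore index with [:,-1,:]. As the result shows, the two are equal element by element.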


outputs, state = tf.nn.dynamic_rnn(multi_lstm, inputs = tf_x_reshaped, initial_state = init_state, time_major = False)
h_state_0 = state[0][1]
h_state_1 = state[1][1]
h_state = state[-1][1]
h_state_2 = h_state



        _, loss_,outputs_, state_, h_state_0_, h_state_1_, h_state_2_ = \
            sess.run([train_op, cross_entropy,outputs, state, h_state_0, h_state_1, h_state_2], {tf_x:x, tf_y:y, keep_prob:1.0})


        print('h_state_2_ == outputs_[:,-1,:]:', h_state_2_ == outputs_[:,-1,:])


h_state_2_ == outputs_[:,-1,:]: [[ True  True  True ...  True  True  True]
 [ True  True  True ...  True  True  True]
 [ True  True  True ...  True  True  True]
 ...
 [ True  True  True ...  True  True  True]
 [ True  True  True ...  True  True  True]
 [ True  True  True ...  True  True  True]]
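Why the two tensors match can be seen from a toy stacked LSTM written in plain Python. This is a sketch under several assumptions, not TensorFlow's implementation: biases other than the forget bias are omitted, there is no batch dimension, the sizes are tiny, and the helper names (`make_weights`, `lstm_step`, `run_stack`) are invented. The gate order i, j, f, o follows what I believe `BasicLSTMCell` uses.

```python
import math
import random
from collections import namedtuple

LSTMStateTuple = namedtuple("LSTMStateTuple", ["c", "h"])

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def make_weights(input_size, hidden, rng):
    # One weight row per gate unit (i, j, f, o), acting on [x, h_prev].
    return [[rng.uniform(-0.5, 0.5) for _ in range(input_size + hidden)]
            for _ in range(4 * hidden)]

def lstm_step(weights, hidden, x, state, forget_bias=1.0):
    xh = list(x) + list(state.h)
    pre = [sum(w * v for w, v in zip(row, xh)) for row in weights]
    i, j, f, o = (pre[k * hidden:(k + 1) * hidden] for k in range(4))
    c = [sigmoid(f[n] + forget_bias) * state.c[n]
         + sigmoid(i[n]) * math.tanh(j[n]) for n in range(hidden)]
    h = [sigmoid(o[n]) * math.tanh(c[n]) for n in range(hidden)]
    return h, LSTMStateTuple(c=c, h=h)   # the output is the new h

def run_stack(layers, hidden, sequence):
    state = [LSTMStateTuple(c=[0.0] * hidden, h=[0.0] * hidden)
             for _ in layers]
    outputs = []
    for x in sequence:
        inp = x
        for n, weights in enumerate(layers):
            # Each layer feeds its output to the layer above it.
            inp, state[n] = lstm_step(weights, hidden, inp, state[n])
        outputs.append(inp)   # output of the top layer at this time step
    return outputs, tuple(state)

rng = random.Random(0)
INPUT_SIZE, HIDDEN, LAYERS, STEPS = 4, 3, 3, 5
layers = [make_weights(INPUT_SIZE if n == 0 else HIDDEN, HIDDEN, rng)
          for n in range(LAYERS)]
sequence = [[rng.random() for _ in range(INPUT_SIZE)] for _ in range(STEPS)]

outputs, state = run_stack(layers, HIDDEN, sequence)
print(state[-1].h == outputs[-1])   # True: top-layer h is the last output
```

One caveat, as far as I understand DropoutWrapper: `output_keep_prob` is applied to the cell output but not to the state, so the equality would only be expected when `keep_prob` is fed as 1.0, which is the case in the run above.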

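Finally, post-process the output. For convenience, the LSTM cell's output has the same dimension as its hidden state: it equals num_units (set to 256 here) and cannot be configured separately. A transformation to a 10-dimensional output is therefore needed before the handwritten digits can be classified.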

#prediction and loss
W = tf.Variable(initial_value = tf.truncated_normal([HIDDEN_CELL, CLASS_NUM], stddev = 0.1 ), dtype = tf.float32)
print(W)
b = tf.Variable(initial_value = tf.constant(0.1, shape = [CLASS_NUM]), dtype = tf.float32)
predictions = tf.nn.softmax(tf.matmul(h_state, W) + b)
# cross-entropy summed over the batch: -sum(y * log(y_hat))
cross_entropy = -tf.reduce_sum(tf_y * tf.log(predictions))
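The projection, softmax and cross-entropy can be traced by hand on a small example. The sketch below uses plain Python with made-up sizes (4 hidden units, 3 classes) and made-up weights instead of 256 and 10; `linear`, `softmax` and `cross_entropy` are illustrative helpers, not TensorFlow functions. Unlike the snippet above, it subtracts the maximum logit and adds a small epsilon inside the log, a common way to avoid `log(0)`.

```python
import math

def linear(h, W, b):
    """Compute h @ W + b for one example; W has shape [hidden][classes]."""
    return [sum(h[k] * W[k][j] for k in range(len(h))) + b[j]
            for j in range(len(b))]

def softmax(logits):
    m = max(logits)                       # subtract the max for stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(one_hot, probs, eps=1e-12):
    """-sum(y * log(y_hat)) for a single example."""
    return -sum(y * math.log(p + eps) for y, p in zip(one_hot, probs))

HIDDEN, CLASSES = 4, 3                    # stand-ins for 256 and 10
h_last = [0.2, -0.1, 0.4, 0.0]            # made-up last-step hidden state
W = [[0.1 * (k + j) for j in range(CLASSES)] for k in range(HIDDEN)]
b = [0.1] * CLASSES

probs = softmax(linear(h_last, W, b))
loss = cross_entropy([0, 0, 1], probs)
print(len(probs), round(sum(probs), 6))   # 3 1.0
print(loss > 0)                           # True
```

In TensorFlow 1.x, a numerically safer alternative to taking `tf.log` of a softmax is to pass the raw logits to `tf.nn.softmax_cross_entropy_with_logits_v2`, though that changes the code above and is only a suggestion.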

Full code:
