LSTM Neural Network Input and Output Layers

阿新 • Published: 2018-11-16

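Today I finally worked out how the input and output layers of an LSTM network should be set up and connected in TensorFlow and Keras. Writing it down as a memo.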

https://machinelearningmastery.com/how-to-develop-lstm-models-for-time-series-forecasting/

Stacked LSTM

Multiple hidden LSTM layers can be stacked one on top of another in what is referred to as a Stacked LSTM model.
An LSTM layer requires a three-dimensional input and LSTMs by default will produce a two-dimensional output as an interpretation from the end of the sequence.
We can address this by having the LSTM output a value for each time step in the input data, which is done by setting the return_sequences=True argument on the layer. This gives a 3D output from the hidden LSTM layer that can serve as input to the next one.
We can, therefore, define a Stacked LSTM as follows.

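from keras.models import Sequential
from keras.layers import LSTM, Dense

# example values only (assumed; the snippet does not define them)
n_steps, n_features = 3, 1

# define model
model = Sequential()
# first LSTM returns the full sequence (3D) so the next LSTM can consume it
model.add(LSTM(50, activation='relu', return_sequences=True, input_shape=(n_steps, n_features)))
# last LSTM returns only the final time step (2D)
model.add(LSTM(50, activation='relu'))
model.add(Dense(1))
model.compile(optimizer='adam', loss='mse')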
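
To make the 2D versus 3D distinction concrete without needing Keras installed, here is a minimal NumPy sketch. It is not an LSTM: `simple_rnn` is a hypothetical toy tanh recurrent layer with random weights, used only to mimic what the `return_sequences` flag does to the output shape.

```python
import numpy as np

def simple_rnn(x, units, return_sequences=False, seed=0):
    """Toy tanh recurrent layer (NOT an LSTM), used only to illustrate shapes."""
    rng = np.random.default_rng(seed)
    batch, n_steps, n_features = x.shape
    w_in = rng.normal(scale=0.1, size=(n_features, units))  # input-to-hidden
    w_rec = rng.normal(scale=0.1, size=(units, units))      # hidden-to-hidden
    h = np.zeros((batch, units))
    outputs = []
    for t in range(n_steps):
        h = np.tanh(x[:, t, :] @ w_in + h @ w_rec)
        outputs.append(h)
    if return_sequences:
        return np.stack(outputs, axis=1)  # one output per time step: 3D
    return h                              # last time step only: 2D

# 4 samples, n_steps=3, n_features=1 (arbitrary example sizes)
x = np.random.default_rng(1).normal(size=(4, 3, 1))
seq = simple_rnn(x, units=50, return_sequences=True)  # feeds the next layer
last = simple_rnn(seq, units=50)                      # final recurrent layer
print(seq.shape)   # (4, 3, 50)
print(last.shape)  # (4, 50)
```

The first layer has to emit the whole sequence because the second layer needs a 3D input; only the last recurrent layer can collapse the time axis.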

X_train.shape
(500, 40, 1)
y_train.shape
(500, 40, 1)
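
The post does not show how X_train and y_train were built. One common way to get a target with the same shape as the input is to use the input window shifted one step ahead; the sketch below assumes that scheme. `make_windows` and the sine-wave data are hypothetical and only serve to reproduce the (500, 40, 1) shapes above.

```python
import numpy as np

def make_windows(series, n_steps):
    """Slice a 1-D series into (X, y), where y is X shifted one step ahead."""
    series = np.asarray(series, dtype=np.float32)
    n_samples = len(series) - n_steps
    X = np.stack([series[i:i + n_steps] for i in range(n_samples)])
    y = np.stack([series[i + 1:i + n_steps + 1] for i in range(n_samples)])
    # add a trailing feature axis: (samples, n_steps) -> (samples, n_steps, 1)
    return X[..., np.newaxis], y[..., np.newaxis]

series = np.sin(np.linspace(0, 50, 540))  # synthetic data, 540 points
X_demo, y_demo = make_windows(series, n_steps=40)
print(X_demo.shape, y_demo.shape)  # (500, 40, 1) (500, 40, 1)
```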

from keras.models import Sequential
from keras import layers
from keras.optimizers import RMSprop

model = Sequential()
model.add(layers.GRU(100, input_shape=(None, X_train.shape[-1]), return_sequences=True))
model.add(layers.Dense(1))
model.compile(optimizer=RMSprop(), loss='mae')
history = model.fit(X_train, y_train, steps_per_epoch=25, epochs=20)

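import numpy as np
import tensorflow as tf  # TensorFlow 1.x API

reset_graph()  # helper defined elsewhere (not shown in this post)

n_steps = 40
n_inputs = 1
n_neurons = 100
n_outputs = 1  # not defined in the original snippet; 1 matches y_train.shape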

X = tf.placeholder(tf.float32, [None, n_steps, n_inputs])
y = tf.placeholder(tf.float32, [None, n_steps, n_outputs])

# the last layer size must equal n_neurons, because rnn_outputs is reshaped with it below
num_units = [500, 200, 100]
cells = [tf.nn.rnn_cell.GRUCell(num_units=n) for n in num_units]
stacked_rnn_cell = tf.nn.rnn_cell.MultiRNNCell(cells)
rnn_outputs, states = tf.nn.dynamic_rnn(stacked_rnn_cell, X, dtype=tf.float32)

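# Drop one dimension, apply a Dense layer, then add the n_steps dimension back:
# rnn_outputs:          [batch_size, n_steps, n_neurons]
# after first reshape:  [batch_size * n_steps, n_neurons]
# after Dense:          [batch_size * n_steps, n_outputs]
# after second reshape: [batch_size, n_steps, n_outputs]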

stacked_rnn_outputs = tf.reshape(rnn_outputs, [-1, n_neurons])
stacked_outputs = tf.layers.dense(stacked_rnn_outputs, n_outputs)
outputs = tf.reshape(stacked_outputs, [-1, n_steps, n_outputs])

loss = tf.reduce_mean(tf.square(outputs - y))
learning_rate = 0.001  # assumed value; not defined in the original snippet
optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate)
training_op = optimizer.minimize(loss)

init = tf.global_variables_initializer()
saver = tf.train.Saver()

n_iterations = 5000
batch_size = 100

with tf.Session() as sess:
    init.run()
    for iteration in range(n_iterations):
        # next_batch is a data helper that is not shown in this post
        X_batch, y_batch = next_batch(batch_size, n_steps)
        sess.run(training_op, feed_dict={X: X_batch, y: y_batch})
        if iteration % 100 == 0:
            mse = loss.eval(feed_dict={X: X_batch, y: y_batch})
            print(iteration, "\tMSE:", mse)
    
    # time_series and t_instance are defined elsewhere (not shown in this post)
    X_new = time_series(np.array(t_instance[:-1].reshape(-1, n_steps, n_inputs)))
    y_pred = sess.run(outputs, feed_dict={X: X_new})
    
    saver.save(sess, "./my_time_series_model")
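
The reshape, Dense, reshape sequence in the TensorFlow code can be checked with plain NumPy. This is a rough sketch with random stand-in weights (`W` and `b` are hypothetical, not the trained layer): flattening the time axis, projecting, and reshaping back gives the same result as applying the projection along the last axis of the 3D tensor. As far as I know, that last-axis behaviour is also what a Keras Dense layer does when it is given a 3D input, which is why the GRU model above can put Dense(1) directly after return_sequences=True.

```python
import numpy as np

batch_size, n_steps, n_neurons, n_outputs = 2, 40, 100, 1
rng = np.random.default_rng(0)
rnn_outputs = rng.normal(size=(batch_size, n_steps, n_neurons))
W = rng.normal(size=(n_neurons, n_outputs))  # stand-in dense weights
b = np.zeros(n_outputs)                      # stand-in dense bias

flat = rnn_outputs.reshape(-1, n_neurons)            # (80, 100)
projected = flat @ W + b                             # (80, 1)
outputs = projected.reshape(-1, n_steps, n_outputs)  # (2, 40, 1)

# the same projection applied directly along the last axis
direct = rnn_outputs @ W + b

print(outputs.shape)                 # (2, 40, 1)
print(np.allclose(outputs, direct))  # True
```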

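Unlike TensorFlow's tf.nn.dynamic_rnn, which returns the outputs for every time step, a Keras LSTM layer by default outputs only the last time step. Set return_sequences=True to get the full sequence.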
