TensorFlow實現LSTM（迴歸）

阿新 • • 發佈：2019-02-16

最近在學習TensorFlow，並學習了在TensorFlow中實現LSTM的迴歸應用。

下面是示例程式碼：

import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt


BATCH_START = 0
TIME_STEPS = 20
BATCH_SIZE = 50
INPUT_SIZE = 1
OUTPUT_SIZE = 1
CELL_SIZE = 10
LR = 0.006


def get_batch():
    global BATCH_START, TIME_STEPS
    # xs shape (50batch, 20steps)
    xs = np.arange(BATCH_START, BATCH_START+TIME_STEPS*BATCH_SIZE).reshape((BATCH_SIZE, TIME_STEPS)) / (10*np.pi)
    seq = np.sin(xs)
    res = np.cos(xs)
    BATCH_START += TIME_STEPS
    # plt.plot(xs[0, :], res[0, :], 'r', xs[0, :], seq[0, :], 'b--')
    # plt.show()
    # returned seq, res and xs: shape (batch, step, input)
    return [seq[:, :, np.newaxis], res[:, :, np.newaxis], xs]


class LSTMRNN(object):
    def __init__(self, n_steps, input_size, output_size, cell_size, batch_size):
        self.n_steps = n_steps
        self.input_size = input_size
        self.output_size = output_size
        self.cell_size = cell_size
        self.batch_size = batch_size
        with tf.name_scope('inputs'):
            self.xs = tf.placeholder(tf.float32, [None, n_steps, input_size], name='xs')
            self.ys = tf.placeholder(tf.float32, [None, n_steps, output_size], name='ys')
        with tf.variable_scope('in_hidden'):
            self.add_input_layer()
        with tf.variable_scope('LSTM_cell'):
            self.add_cell()
        with tf.variable_scope('out_hidden'):
            self.add_output_layer()
        with tf.name_scope('cost'):
            self.compute_cost()
        with tf.name_scope('train'):
            self.train_op = tf.train.AdamOptimizer(LR).minimize(self.cost)

    def add_input_layer(self,):
        l_in_x = tf.reshape(self.xs, [-1, self.input_size], name='2_2D')  # (batch*n_step, in_size)
        # Ws (in_size, cell_size)
        Ws_in = self._weight_variable([self.input_size, self.cell_size])
        # bs (cell_size, )
        bs_in = self._bias_variable([self.cell_size,])
        # l_in_y = (batch * n_steps, cell_size)
        with tf.name_scope('Wx_plus_b'):
            l_in_y = tf.matmul(l_in_x, Ws_in) + bs_in
        # reshape l_in_y ==> (batch, n_steps, cell_size)
        self.l_in_y = tf.reshape(l_in_y, [-1, self.n_steps, self.cell_size], name='2_3D')

    def add_cell(self):
        lstm_cell = tf.contrib.rnn.BasicLSTMCell(self.cell_size, forget_bias=1.0, state_is_tuple=True)
        with tf.name_scope('initial_state'):
            self.cell_init_state = lstm_cell.zero_state(self.batch_size, dtype=tf.float32)
        self.cell_outputs, self.cell_final_state = tf.nn.dynamic_rnn(
            lstm_cell, self.l_in_y, initial_state=self.cell_init_state, time_major=False)

    def add_output_layer(self):
        # shape = (batch * steps, cell_size)
        l_out_x = tf.reshape(self.cell_outputs, [-1, self.cell_size], name='2_2D')
        Ws_out = self._weight_variable([self.cell_size, self.output_size])
        bs_out = self._bias_variable([self.output_size, ])
        # shape = (batch * steps, output_size)
        with tf.name_scope('Wx_plus_b'):
            self.pred = tf.matmul(l_out_x, Ws_out) + bs_out

    def compute_cost(self):
        losses = tf.contrib.legacy_seq2seq.sequence_loss_by_example(
            [tf.reshape(self.pred, [-1], name='reshape_pred')],
            [tf.reshape(self.ys, [-1], name='reshape_target')],
            [tf.ones([self.batch_size * self.n_steps], dtype=tf.float32)],
            average_across_timesteps=True,
            softmax_loss_function=self.ms_error,
            name='losses'
        )
        with tf.name_scope('average_cost'):
            self.cost = tf.div(
                tf.reduce_sum(losses, name='losses_sum'),
                self.batch_size,
                name='average_cost')
            tf.summary.scalar('cost', self.cost)

    @staticmethod
    def ms_error(labels, logits):
        return tf.square(tf.subtract(labels, logits))

    def _weight_variable(self, shape, name='weights'):
        initializer = tf.random_normal_initializer(mean=0., stddev=1.,)
        return tf.get_variable(shape=shape, initializer=initializer, name=name)

    def _bias_variable(self, shape, name='biases'):
        initializer = tf.constant_initializer(0.1)
        return tf.get_variable(name=name, shape=shape, initializer=initializer)


if __name__ == '__main__':
    model = LSTMRNN(TIME_STEPS, INPUT_SIZE, OUTPUT_SIZE, CELL_SIZE, BATCH_SIZE)
    sess = tf.Session()
    merged = tf.summary.merge_all()
    writer = tf.summary.FileWriter("logs", sess.graph)
    # tf.initialize_all_variables() no long valid from
    # 2017-03-02 if using tensorflow >= 0.12
    if int((tf.__version__).split('.')[1]) < 12 and int((tf.__version__).split('.')[0]) < 1:
        init = tf.initialize_all_variables()
    else:
        init = tf.global_variables_initializer()
    sess.run(init)
    # relocate to the local dir and run this line to view it on Chrome (http://0.0.0.0:6006/):
    # $ tensorboard --logdir='logs'

    plt.ion()
    plt.show()
    for i in range(200):
        seq, res, xs = get_batch()
        if i == 0:
            feed_dict = {
                    model.xs: seq,
                    model.ys: res,
                    # create initial state
            }
        else:
            feed_dict = {
                model.xs: seq,
                model.ys: res,
                model.cell_init_state: state    # use last state as the initial state for this run
            }

        _, cost, state, pred = sess.run(
            [model.train_op, model.cost, model.cell_final_state, model.pred],
            feed_dict=feed_dict)

        # plotting
        plt.plot(xs[0, :], res[0].flatten(), 'r', xs[0, :], pred.flatten()[:TIME_STEPS], 'b--')
        plt.ylim((-1.2, 1.2))
        plt.draw()
        plt.pause(0.3)

        if i % 20 == 0:
            print('cost: ', round(cost, 4))
            result = sess.run(merged, feed_dict)
            writer.add_summary(result, i)

執行結果：

可以看到綠色虛線不斷地去擬合紅色的實線。

TensorFlow實現LSTM（迴歸）

最近在學習TensorFlow，並學習了在TensorFlow中實現LSTM的迴歸應用。下面是示例程式碼：import tensorflow as tf import numpy as np import matplotlib.pyplot as plt BATCH_ST

機器學習與Tensorflow（1）——機器學習基本概念、tensorflow實現簡單線性迴歸

一、機器學習基本概念 1.訓練集和測試集訓練集(training set/data)/訓練樣例（training examples): 用來進行訓練，也就是產生模型或者演算法的資料集測試集(testing set/data)/測試樣例 (testing examples)：用來專門進行測試已經學習好

TensorFlow學習筆記（一）-- Softmax迴歸模型識別MNIST

最近學習Tensorflow，特此筆記，學習資料為21個專案玩轉深度學習基於TensorFlow的實踐詳解 Softmax迴歸是一個線性的多分類模型，它是從Logistic迴歸模型轉化而來的，不同的是Logistic迴歸模型是一個二分類模型，而Softmax迴歸模型是一個多分類模型

TensorFlow學習筆記（3）——CNN在CIFAR10上的實現

CIFAR10是一個對圖片進行10種分類的專案官網提供了資料集的下載，此外官網還有對於資料集的介紹。資料集中資料被分為了兩部分。第一部分是特徵部分，使用一個[10000,3072]的uint8的矩陣進行儲存，每一行向量都是32*32大小的3通道圖片，構成的格式類似於[32,32,3]

Tensorflow機器學習（三）程式碼實現反捲積過程（de-convolution/convolution transpose）

卷積神經網路是深度學習中一個很流行的網路模型，它的原理和過程我就不在此介紹了，感興趣的可以去看一下https://blog.csdn.net/kane7csdn/article/details/83617086。在這裡，介紹一下反捲積過程（可以叫做deconvolution，或者也可

1.CNN圖片單標籤分類（基於TensorFlow實現基礎VGG16網路）

本文所使用的開源資料集（kaggle貓狗大戰）： www.kaggle.com/c/dogs-vs-c… 國內百度網盤下載地址： pan.baidu.com/s/12ab32UNY… 利用本文程式碼訓練並生成的模型（對應專案中的model資料夾）： pan.baidu.com/s/1tBkVQKoH

tensorflow學習筆記（二）實現MNIST

import tensorflow as tf from tensorflow.contrib import rnn import numpy as np import input_data input_vec_size = lstm_size = 28 time_st

機器學習筆記（十二）：TensorFlow實現四（影象識別與卷積神經網路）

1 - 卷積神經網路常用結構 1.1 - 卷積層我們先來介紹卷積層的結構以及其前向傳播的演算法。一個卷積層模組，包含以下幾個子模組：使用0擴充邊界(padding) 卷積視窗過濾器（filter）前向卷積反向卷積（可選） 1.1.2 - 邊界填充

TensorFlow學習筆記（5）--實現卷積神經網路（MNIST資料集）

這裡使用TensorFlow實現一個簡單的卷積神經網路，使用的是MNIST資料集。網路結構為：資料輸入層–卷積層1–池化層1–卷積層2–池化層2–全連線層1–全連線層2（輸出層），這是一個簡單但非常有代表性的卷積神經網路。 import tensorflow

tensorflow 學習專欄（六）：使用卷積神經網路（CNN）在mnist資料集上實現分類

卷積神經網路（Convolutional Neural Network, CNN）是一種前饋神經網路，它的人工神經元可以響應一部分覆蓋範圍內的周圍單元，對於大型影象處理有出色表現。卷積神經網路CNN的結構一般包含這幾個層：輸入層：用於資料的輸入卷積層：使用卷積核進行特徵提取和

Python實現機器學習二（實現多元線性迴歸）

接著上一次的一元線性迴歸http://blog.csdn.net/lulei1217/article/details/49385531往下講，這篇文章要講解的多元線性迴歸。 1、什麼是多元線性迴歸模型？當y值的影響因素不唯一時,採用多元線性迴歸模型。

TensorFlow實現ResNet（ResNet 152網路結構的forward耗時檢測）（轉）

結構有ResNet 50、ResNet 152、ResNet 200，考慮耗時原因只跑了ResNet 152網路結構的forward。 # coding:UTF-8 """ Typical use: from tensorflow.contrib.slim.n

TensorFlow學習筆記（4）--實現多層感知機（MNIST資料集）

前面使用TensorFlow實現一個完整的Softmax Regression，並在MNIST資料及上取得了約92%的正確率。現在建含一個隱層的神經網路模型（多層感知機）。 import tensorflow as tf import numpy as np

Tensorflow學習筆記（五）——結構化模型及Skip-gram模型的實現

一、結構化模型結構化我們的模型，可以方便我們Debug和良好的視覺化。一般我們的模型都是由以下兩步構成，第一步是構建計算圖，第二步是執行計算圖。 Assemble Graph Define placeholders for Inp

【TensorFlow】LSTM（使用TFLearn預測正弦sin函式）

專案已上傳至 GitHub —— sin_pre 資料生成因為標準的迴圈神經網路模型預測的是離散的數值，所以需要將連續的 sin 函式曲線離散化所謂離散化就是在一個給定的區間 [0,MAX] 內，通過有限個取樣點模擬一個連續的曲線，即間

TensorFlow學習筆記（1）：LSTM相關程式碼

LSTM是seq2seq模型中經典的子結構，TensorFlow中提供了相應的結構，供我們使用： tensorflow提供了LSTM實現的一個basic版本，不包含lstm的一些高階擴充套件，同時也提供了一個標準介面，其中包含了lstm的擴充套件。分別為：tf.nn.rnn

深度學習筆記——TensorFlow學習筆記（三）使用TensorFlow實現的神經網路進行MNIST手寫體數字識別

本文是TensorFlow學習的第三部分，參考的是《TensorFlow實戰Google深度學習框架》一書，這部分講述的是使用TensorFlow實現的神經網路進行MNIST手寫體數字識別一個例項。這個例項將第二部分講述的啟用函式、損失函式、優化演算法、正則化等都運用上了

tensorFlow入門實踐（三）初識AlexNet實現結構

參數 variable alexnet with col 展望 port kernel 兩個參考黃文堅《TensorFlow實戰》一書，完成AlexNet的整體實現並展望其訓練和預測過程。 import tensorflow as tf batch_size = 32

Tensorflow學習筆記（7）——CNN識別mnist程式設計實現

1.卷積神經網路構成（CNN）卷積神經網路主要由卷積層和pooling層組成。 (1)卷積層在CNN中的卷積層和普通神經網路的區別：根據生物學上動物視覺上識別事物是通過區域性感知野的啟發，普通神經網路是下一層的神經元與本層神經元之間是全連結的，

TensorFlow學習筆記（7）--實現卷積神經網路（同(5),不同的程式風格）

import tensorflow as tf import numpy as np import input_data mnist = input_data.read_data_sets('data/', one_hot=True) print("MNIST

TensorFlow實現LSTM（迴歸）

相關推薦