Tensorflow中梯度下降法更新引數值

阿新 • • 發佈：2019-01-10

tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy)

TensorFlow經過使用梯度下降法對損失函式中的變數進行修改值，預設修改tf.Variable(tf.zeros([784,10]))

為Variable的引數。

train_step = tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy,var_list=[w,b])

也可以使用var_list引數來定義更新那些引數的值

#匯入Minst資料集
import input_data
mnist = input_data.read_data_sets("data",one_hot=True)

#匯入tensorflow庫
import tensorflow as tf

#輸入變數，把28*28的圖片變成一維陣列（丟失結構資訊）
x = tf.placeholder("float",[None,784])

#權重矩陣，把28*28=784的一維輸入，變成0-9這10個數字的輸出
w = tf.Variable(tf.zeros([784,10]))
#偏置
b = tf.Variable(tf.zeros([10]))

#核心運算，其實就是softmax（x*w+b）
y = tf.nn.softmax(tf.matmul(x,w) + b)

#這個是訓練集的正確結果
y_ = tf.placeholder("float",[None,10])

#交叉熵，作為損失函式
cross_entropy = -tf.reduce_sum(y_ * tf.log(y))

#梯度下降演算法，最小化交叉熵
train_step = tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy)

#初始化，在run之前必須進行的
init = tf.initialize_all_variables()
#建立session以便運算
sess = tf.Session()
sess.run(init)

#迭代1000次
for i in range(1000):
  #獲取訓練資料集的圖片輸入和正確表示數字
  batch_xs, batch_ys = mnist.train.next_batch(100)
  #執行剛才建立的梯度下降演算法，x賦值為圖片輸入，y_賦值為正確的表示數字
  sess.run(train_step,feed_dict = {x:batch_xs, y_: batch_ys})

#tf.argmax獲取最大值的索引。比較運算後的結果和本身結果是否相同。
#這步的結果應該是[1,1,1,1,1,1,1,1,0,1...........1,1,0,1]這種形式。
#1代表正確，0代表錯誤
correct_prediction = tf.equal(tf.argmax(y,1), tf.argmax(y_,1))

#tf.cast先將資料轉換成float，防止求平均不準確。
#tf.reduce_mean由於只有一個引數，就是上面那個陣列的平均值。
accuracy = tf.reduce_mean(tf.cast(correct_prediction,"float"))
#輸出
print(sess.run(accuracy,feed_dict={x:mnist.test.images,y_: mnist.test.labels}))

計算結果如下

"C:\Program Files\Anaconda3\python.exe" D:/pycharmprogram/tensorflow_learn/softmax_learn/softmax_learn.py
Extracting data\train-images-idx3-ubyte.gz
Extracting data\train-labels-idx1-ubyte.gz
Extracting data\t10k-images-idx3-ubyte.gz
Extracting data\t10k-labels-idx1-ubyte.gz
WARNING:tensorflow:From C:\Program Files\Anaconda3\lib\site-packages\tensorflow\python\util\tf_should_use.py:175: initialize_all_variables (from tensorflow.python.ops.variables) is deprecated and will be removed after 2017-03-02.
Instructions for updating:
Use `tf.global_variables_initializer` instead.
2018-05-14 15:49:45.866600: W C:\tf_jenkins\home\workspace\rel-win\M\windows\PY\35\tensorflow\core\platform\cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations.
2018-05-14 15:49:45.866600: W C:\tf_jenkins\home\workspace\rel-win\M\windows\PY\35\tensorflow\core\platform\cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX2 instructions, but these are available on your machine and could speed up CPU computations.
0.9163

Process finished with exit code 0

如果限制，只更新引數W檢視效果

"C:\Program Files\Anaconda3\python.exe" D:/pycharmprogram/tensorflow_learn/softmax_learn/softmax_learn.py
Extracting data\train-images-idx3-ubyte.gz
Extracting data\train-labels-idx1-ubyte.gz
Extracting data\t10k-images-idx3-ubyte.gz
Extracting data\t10k-labels-idx1-ubyte.gz
WARNING:tensorflow:From C:\Program Files\Anaconda3\lib\site-packages\tensorflow\python\util\tf_should_use.py:175: initialize_all_variables (from tensorflow.python.ops.variables) is deprecated and will be removed after 2017-03-02.
Instructions for updating:
Use `tf.global_variables_initializer` instead.
2018-05-14 15:51:08.543600: W C:\tf_jenkins\home\workspace\rel-win\M\windows\PY\35\tensorflow\core\platform\cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations.
2018-05-14 15:51:08.544600: W C:\tf_jenkins\home\workspace\rel-win\M\windows\PY\35\tensorflow\core\platform\cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX2 instructions, but these are available on your machine and could speed up CPU computations.
0.9187

Process finished with exit code 0

可以看出只修改W對結果影響不大，如果設定只修改b

#匯入Minst資料集
import input_data
mnist = input_data.read_data_sets("data",one_hot=True)

#匯入tensorflow庫
import tensorflow as tf

#輸入變數，把28*28的圖片變成一維陣列（丟失結構資訊）
x = tf.placeholder("float",[None,784])

#權重矩陣，把28*28=784的一維輸入，變成0-9這10個數字的輸出
w = tf.Variable(tf.zeros([784,10]))
#偏置
b = tf.Variable(tf.zeros([10]))

#核心運算，其實就是softmax（x*w+b）
y = tf.nn.softmax(tf.matmul(x,w) + b)

#這個是訓練集的正確結果
y_ = tf.placeholder("float",[None,10])

#交叉熵，作為損失函式
cross_entropy = -tf.reduce_sum(y_ * tf.log(y))

#梯度下降演算法，最小化交叉熵
train_step = tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy,var_list=[b])

#初始化，在run之前必須進行的
init = tf.initialize_all_variables()
#建立session以便運算
sess = tf.Session()
sess.run(init)

#迭代1000次
for i in range(1000):
  #獲取訓練資料集的圖片輸入和正確表示數字
  batch_xs, batch_ys = mnist.train.next_batch(100)
  #執行剛才建立的梯度下降演算法，x賦值為圖片輸入，y_賦值為正確的表示數字
  sess.run(train_step,feed_dict = {x:batch_xs, y_: batch_ys})

#tf.argmax獲取最大值的索引。比較運算後的結果和本身結果是否相同。
#這步的結果應該是[1,1,1,1,1,1,1,1,0,1...........1,1,0,1]這種形式。
#1代表正確，0代表錯誤
correct_prediction = tf.equal(tf.argmax(y,1), tf.argmax(y_,1))

#tf.cast先將資料轉換成float，防止求平均不準確。
#tf.reduce_mean由於只有一個引數，就是上面那個陣列的平均值。
accuracy = tf.reduce_mean(tf.cast(correct_prediction,"float"))
#輸出
print(sess.run(accuracy,feed_dict={x:mnist.test.images,y_: mnist.test.labels}))

計算結果：

"C:\Program Files\Anaconda3\python.exe" D:/pycharmprogram/tensorflow_learn/softmax_learn/softmax_learn.py
Extracting data\train-images-idx3-ubyte.gz
Extracting data\train-labels-idx1-ubyte.gz
Extracting data\t10k-images-idx3-ubyte.gz
Extracting data\t10k-labels-idx1-ubyte.gz
WARNING:tensorflow:From C:\Program Files\Anaconda3\lib\site-packages\tensorflow\python\util\tf_should_use.py:175: initialize_all_variables (from tensorflow.python.ops.variables) is deprecated and will be removed after 2017-03-02.
Instructions for updating:
Use `tf.global_variables_initializer` instead.
2018-05-14 15:52:04.483600: W C:\tf_jenkins\home\workspace\rel-win\M\windows\PY\35\tensorflow\core\platform\cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations.
2018-05-14 15:52:04.483600: W C:\tf_jenkins\home\workspace\rel-win\M\windows\PY\35\tensorflow\core\platform\cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX2 instructions, but these are available on your machine and could speed up CPU computations.
0.1135

Process finished with exit code 0

如果只更新b那麼對效果影響很大。

Tensorflow中梯度下降法更新引數值

tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy)TensorFlow經過使用梯度下降法對損失函式中的變數進行修改值，預設修改tf.Variable(tf.zeros([784,10]))為Varia

TensorFlow中梯度下降函式

一介紹下面介紹在TensorFlow中進行隨機梯度下降優化的函式。在TensorFlow中通過一個叫Optimizer的優化器類進行訓練優化。二梯度下降優化器三說明在訓練過程中先例項化一個優化函式如tf.train.GradientDescentOptimizer，並基

神經網路例程-梯度下降法更新權值

以下程式碼來自Deep Learning for Computer Vision with Python第九章。一、梯度下降法（Gradient Decent） # import the necessary packages from sklearn.model_s

機器學習中梯度下降法和牛頓法的比較

在機器學習的優化問題中，梯度下降法和牛頓法是常用的兩種凸函式求極值的方法，他們都是為了求得目標函式的近似解。在邏輯斯蒂迴歸模型的引數求解中，一般用改良的梯度下降法，也可以用牛頓法。由於兩種方法有些相似，我特地拿來簡單地對比一下。下面的內容需要讀者之前熟悉兩種演算

梯度下降法更新權值理論

使用3層的神經網路(包含輸入層和輸出層)來演示是如何工作的。網路：input 3個節點；hidden 3個節點；output 3個節點引數：input：矩陣是3*1的矩陣；為3*3的矩陣；為3*3的矩陣；output：矩陣為3*1的矩陣計算hidden層節點的矩陣

機器學習中梯度下降法原理及用其解決線性迴歸問題的C語言實現

本文講梯度下降（Gradient Descent）前先看看利用梯度下降法進行監督學習（例如分類、迴歸等）的一般步驟： 1，定義損失函式（Loss Function） 2，資訊流forward propagation，直到輸出端 3，誤差訊號back propagation。採用“鏈式法則”，求損失函式關

tensorflow實現svm多分類 iris 3分類——本質上在使用梯度下降法求解線性回歸（loss是定制的而已）

points near plot asi atm lob put matplot ive # Multi-class (Nonlinear) SVM Example # # This function wll illustrate how to # implement

梯度下降法中，為什麼在負梯度方向函式值下降最快

以下內容整理於高數課本以及李巨集毅老師的視訊：我們想要利用梯度下降來求得損失函式的最小值。也就是每次我們更新引數，當前的損失函式總比上一次要小。假設只有兩個引數θ1和θ2，上圖是損失函式的等值線，紅色點是初始值當前的狀態。以紅色點為圓心畫圓，在這個圓的範圍內，我們想要找到

Stanford機器學習課程(Andrew Ng) Week 1 Parameter Learning --- 線性迴歸中的梯度下降法

本節將梯度下降與代價函式結合，並擬合到線性迴歸的函式中這是我們上兩節課得到的函式，包括：梯度下降的公式用於擬合的線性假設和h(x) 平方誤差代價函式 J

ML學習筆記 3 梯度下降法及其線上性迴歸中的應用

背景上一篇文章用最小二乘法（即公式法）求出了線性迴歸的引數 theta ；本篇程式碼介紹用梯度下降法求極小值。原理實在不知道怎麼描述啊，okay，從山頂走向山腳有 n 條路，問題來了：捷徑？最快的那條路徑。以多大的步伐走比較合適？比較走的太快，容易

高斯混合模型（GMM model）以及梯度下降法（gradient descent）更新引數

關於GMM模型的資料和 EM 引數估算的資料，網上已經有很多了，今天想談的是GMM的協方差矩陣的分析、GMM的引數更新方法 1、GMM協方差矩陣的物理含義涉及到每個元素，是這樣求算：用中文來描述就是：注意後面的那個除以（樣本數-1），就是大括號外面的E求期望　（這叫

機器學習中常見的優化方法：梯度下降法、牛頓法擬牛頓法、共軛梯度法、拉格朗日乘數法

機器學習中常見的優化方法：梯度下降法、牛頓法擬牛頓法、共軛梯度法、拉格朗日乘數法主要內容梯度下降法牛頓法擬牛頓法共軛梯度法拉格朗日乘數法許多機器學習演算法，往往建立目標函式（損失函式+正則項），通過優化方法進行優化，根據訓練

機器學習中最小二乘和梯度下降法的個人理解

提前說明一下，這裡不涉及數學公式的推到，只是根據自己的理解來概括一下，有不準確的地方，歡迎指出。最小二乘：我們通常是根據一些離散的點來擬合出一天直線，這條直線也就是我們所說的模型，最小二乘也就是評價損失函式（loss）的一個指標。最小二乘就是那些離散的點與模型上擬合出的點做一

梯度下降法（GD,SGD,Mini-Batch GD）線上性迴歸中的使用

https://github.com/crystal30/SGDLinrearRegression一. 梯度下降法(Batch Gradient Descent)1.梯度下降法的原理(1) 梯度下降法是一種基於搜尋的最優化方法，不是一個機器學習演算法。(2) 作用：

對數幾率回歸法（梯度下降法，隨機梯度下降與牛頓法）與線性判別法(LDA)

3.1 初始屬性 author alt closed sta lose cnblogs 　　本文主要使用了對數幾率回歸法與線性判別法（ＬＤＡ）對數據集（西瓜３.０）進行分類。其中在對數幾率回歸法中，求解最優權重Ｗ時，分別使用梯度下降法，隨機梯度下降與牛頓法。代碼如下：

批量梯度下降法（Batch Gradient Descent）

所有 margin 初始 ont 模型 log eight 梯度下降 img 批量梯度下降：在梯度下降的每一步中都用到了所有的訓練樣本。思想：找能使代價函數減小最大的下降方向（梯度方向）。　　　　ΔΘ = - α▽J α：學習速率梯度下降的線性回歸　　

機器學習之梯度下降法

梯度學習模型最快參數 nbsp 函數 bsp 每一個在吳恩達的機器學習課程中，講了一個模型，如何求得一個參數令錯誤函數值的最小，這裏運用梯度下降法來求得參數。首先任意選取一個θ 令這個θ變化，怎麽變化呢，怎麽讓函數值變化的快，變化的小怎麽變化，那麽函數值怎麽才能

常見的幾種最優化方法（梯度下降法、牛頓法、擬牛頓法、共軛梯度法等）

linear 樣本計算每次理學系統是否底部有效我們每個人都會在我們的生活或者工作中遇到各種各樣的最優化問題，比如每個企業和個人都要考慮的一個問題“在一定成本下，如何使利潤最大化”等。最優化方法是一種數學方法，它是研究在給定約束之下如何尋求某些因素(的量)，以

解梯度下降法的三種形式BGD、SGD以及MBGD

有一個 lis 一行 pri mbg 網絡 () 次數 pen 原帖地址：https://zhuanlan.zhihu.com/p/25765735 在應用機器學習算法時

（轉）梯度下降法及其Python實現

radi 減少 fill 叠代 bbs 方法風險 ews 展示梯度下降法（gradient descent），又名最速下降法（steepest descent）是求解無約束最優化問題最常用的方法，它是一種叠代方法，每一步主要的操作是求解目標函數的梯度向量，將當前位置的負

Tensorflow中梯度下降法更新引數值

相關推薦