Coursera Andrew Ng Deep Learning, Course 2: Improving Deep Neural Networks, Week 1 Programming Assignment Code: Initialization
2 - Zero initialization
# GRADED FUNCTION: initialize_parameters_zeros

def initialize_parameters_zeros(layers_dims):
    """
    Arguments:
    layer_dims -- python array (list) containing the size of each layer.

    Returns:
    parameters -- python dictionary containing your parameters "W1", "b1", ..., "WL", "bL":
                    W1 -- weight matrix of shape (layers_dims[1], layers_dims[0])
                    b1 -- bias vector of shape (layers_dims[1], 1)
                    ...
                    WL -- weight matrix of shape (layers_dims[L], layers_dims[L-1])
                    bL -- bias vector of shape (layers_dims[L], 1)
    """

    parameters = {}
    L = len(layers_dims)            # integer representing the number of layers

    for l in range(1, L):
        ### START CODE HERE ### (≈ 2 lines of code)
        parameters['W' + str(l)] = np.zeros((layers_dims[l], layers_dims[l-1]))
        parameters['b' + str(l)] = np.zeros((layers_dims[l], 1))
        ### END CODE HERE ###
    return parameters
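A quick sanity check (hypothetical layer sizes [3, 2, 1]; assumes numpy is imported as np and the function above is defined): every weight and bias comes out as zero, so all units in a layer compute the same value and the network cannot break symmetry.

import numpy as np

params = initialize_parameters_zeros([3, 2, 1])
print(params["W1"])        # [[0. 0. 0.]
                           #  [0. 0. 0.]]
print(params["W1"].shape)  # (2, 3)
print(params["b1"].shape)  # (2, 1)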
3 - Random initialization

# GRADED FUNCTION: initialize_parameters_random

def initialize_parameters_random(layers_dims):
    """
    Arguments:
    layer_dims -- python array (list) containing the size of each layer.

    Returns:
    parameters -- python dictionary containing your parameters "W1", "b1", ..., "WL", "bL":
                    W1 -- weight matrix of shape (layers_dims[1], layers_dims[0])
                    b1 -- bias vector of shape (layers_dims[1], 1)
                    ...
                    WL -- weight matrix of shape (layers_dims[L], layers_dims[L-1])
                    bL -- bias vector of shape (layers_dims[L], 1)
    """

    np.random.seed(3)               # This seed makes sure your "random" numbers will be the same as ours
    parameters = {}
    L = len(layers_dims)            # integer representing the number of layers

    for l in range(1, L):
        ### START CODE HERE ### (≈ 2 lines of code)
        parameters['W' + str(l)] = np.random.randn(layers_dims[l], layers_dims[l-1]) * 10   # note the number of parentheses
        parameters['b' + str(l)] = np.zeros((layers_dims[l], 1))
        ### END CODE HERE ###
    return parameters
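A brief usage sketch (hypothetical layer sizes [3, 2, 1], again assuming np and the function above): because the weights are scaled by 10, their magnitudes are large, which is what the assignment uses to illustrate poor convergence with overly large initial weights.

params = initialize_parameters_random([3, 2, 1])
print(params["W1"])   # entries on the order of +/- 10 (seeded with np.random.seed(3))
print(params["b1"])   # biases are still initialized to zero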
4 - He initialization

The basic idea of Xavier initialization is to keep the variance of a layer's inputs and outputs consistent, which prevents all of the output values from collapsing toward 0. The idea behind He initialization is that in a ReLU network roughly half of the neurons in each layer are active and the other half output 0, so to keep the variance unchanged you only need to divide by an extra factor of 2 on top of Xavier.

# GRADED FUNCTION: initialize_parameters_he

def initialize_parameters_he(layers_dims):
    """
    Arguments:
    layer_dims -- python array (list) containing the size of each layer.

    Returns:
    parameters -- python dictionary containing your parameters "W1", "b1", ..., "WL", "bL":
                    W1 -- weight matrix of shape (layers_dims[1], layers_dims[0])
                    b1 -- bias vector of shape (layers_dims[1], 1)
                    ...
                    WL -- weight matrix of shape (layers_dims[L], layers_dims[L-1])
                    bL -- bias vector of shape (layers_dims[L], 1)
    """

    np.random.seed(3)
    parameters = {}
    L = len(layers_dims) - 1        # integer representing the number of layers

    for l in range(1, L + 1):
        ### START CODE HERE ### (≈ 2 lines of code)
        parameters['W' + str(l)] = np.random.randn(layers_dims[l], layers_dims[l-1]) * np.sqrt(2. / layers_dims[l-1])
        parameters['b' + str(l)] = np.zeros((layers_dims[l], 1))
        ### END CODE HERE ###
    return parameters
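A small check (hypothetical layer sizes [100, 50, 10], assuming np and the function above): the entries of each weight matrix should have a standard deviation close to sqrt(2 / fan_in), where fan_in is the size of the previous layer.

params = initialize_parameters_he([100, 50, 10])
print(params["W1"].std())     # empirical std over the 50*100 entries, roughly 0.14
print(np.sqrt(2.0 / 100))     # target scale sqrt(2 / fan_in) = 0.1414...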
Question:

If you have heard of "Xavier initialization", this is similar except Xavier initialization uses a scaling factor for the weights W[l] of sqrt(1./layers_dims[l-1]).

The Xavier initialization described in the assignment draws from a normal distribution and scales by sqrt(1 / number of nodes in the previous layer). In the paper [1], however, Xavier initialization draws from a uniform distribution, e.g. in TensorFlow:

def xavier_init(fan_in, fan_out, constant=1):
    low = -constant * np.sqrt(6.0 / (fan_in + fan_out))
    high = constant * np.sqrt(6.0 / (fan_in + fan_out))
    return tf.random_uniform((fan_in, fan_out), minval=low, maxval=high, dtype=tf.float32)

A small NumPy sketch comparing the two conventions is given after the reference.

[1] Xavier Glorot et al., Understanding the Difficulty of Training Deep Feedforward Neural Networks.
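To make the difference concrete, here is a NumPy sketch (not part of the assignment; the shape (fan_out, fan_in) follows the assignment's (current layer, previous layer) convention). The uniform variant on [-a, a] with a = sqrt(6 / (fan_in + fan_out)) has variance a**2 / 3 = 2 / (fan_in + fan_out), while the assignment's normal variant has variance 1 / fan_in; the two coincide when fan_in == fan_out.

import numpy as np

def xavier_normal(fan_in, fan_out):
    # assignment-style: zero-mean normal with variance 1 / fan_in
    return np.random.randn(fan_out, fan_in) * np.sqrt(1.0 / fan_in)

def xavier_uniform(fan_in, fan_out):
    # paper-style: uniform on [-limit, limit] with variance 2 / (fan_in + fan_out)
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return np.random.uniform(-limit, limit, size=(fan_out, fan_in))

print(xavier_normal(100, 100).std())   # ~0.1
print(xavier_uniform(100, 100).std())  # ~0.1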