吳恩達deeplearning作業-Planar data classification with one hidden layer

阿新 • • 發佈：2019-01-31

Planar data classification with one hidden layer

Welcome to your week 3 programming assignment. It’s time to build your first neural network, which will have a hidden layer. You will see a big difference between this model and the one you implemented using logistic regression.

You will learn how to:

- Implement a 2-class classification neural network with a single hidden layer
- Use units with a non-linear activation function, such as tanh
- Compute the cross entropy loss
- Implement forward and backward propagation

1 - Packages

Let’s first import all the packages that you will need during this assignment.
-

numpy is the fundamental package for scientific computing with Python.
- sklearn provides simple and efficient tools for data mining and data analysis.
- matplotlib is a library for plotting graphs in Python.
- testCases provides some test examples to assess the correctness of your functions
- planar_utils provide various useful functions used in this assignment

# Package imports
import numpy as np
import matplotlib.pyplot as plt
from testCases import *
import sklearn
import sklearn.datasets
import sklearn.linear_model
from planar_utils import plot_decision_boundary, sigmoid, load_planar_dataset, load_extra_datasets

%matplotlib inline

np.random.seed(1) # set a seed so that the results are consistent

2 - Dataset

First, let’s get the dataset you will work on. The following code will load a “flower” 2-class dataset into variables X and Y.

X, Y = load_planar_dataset()

Visualize the dataset using matplotlib. The data looks like a “flower” with some red (label y=0) and some blue (y=1) points. Your goal is to build a model to fit this data.

# Visualize the data:
plt.scatter(X[0, :], X[1, :], c=Y.reshape(X[0,:].shape), s=40, cmap=plt.cm.Spectral);

這裡寫圖片描述

You have:
- a numpy-array (matrix) X that contains your features (x1, x2)
- a numpy-array (vector) Y that contains your labels (red:0, blue:1).

Lets first get a better sense of what our data is like.

Exercise: How many training examples do you have? In addition, what is the shape of the variables X and Y?

Hint: How do you get the shape of a numpy array? (help)

### START CODE HERE ### (≈ 3 lines of code)
shape_X = X.shape
shape_Y = Y.shape
m = X.shape[1]  # training set size
### END CODE HERE ###

print ('The shape of X is: ' + str(shape_X))
print ('The shape of Y is: ' + str(shape_Y))
print ('I have m = %d training examples!' % (m))

The shape of X is: (2, 400)
The shape of Y is: (1, 400)
I have m = 400 training examples!

Expected Output:

shape of X	(2, 400)
shape of Y	(1, 400)
m	400

3 - Simple Logistic Regression

Before building a full neural network, lets first see how logistic regression performs on this problem. You can use sklearn’s built-in functions to do that. Run the code below to train a logistic regression classifier on the dataset.

# Train the logistic regression classifier
clf = sklearn.linear_model.LogisticRegressionCV();
clf.fit(X.T, Y.T.ravel());

You can now plot the decision boundary of these models. Run the code below.

# Plot the decision boundary for logistic regression
plot_decision_boundary(lambda x: clf.predict(x), X, Y)
plt.title("Logistic Regression")

# Print accuracy
LR_predictions = clf.predict(X.T)
print ('Accuracy of logistic regression: %d ' % float((np.dot(Y,LR_predictions) + np.dot(1-Y,1-LR_predictions))/float(Y.size)*100) +
       '% ' + "(percentage of correctly labelled datapoints)")

Accuracy of logistic regression: 47 % (percentage of correctly labelled datapoints)

這裡寫圖片描述

Expected Output:

**Accuracy**

47%

Interpretation: The dataset is not linearly separable, so logistic regression doesn’t perform well. Hopefully a neural network will do better. Let’s try this now!

4 - Neural Network model

Logistic regression did not work well on the “flower dataset”. You are going to train a Neural Network with a single hidden layer.

Here is our model:
這裡寫圖片描述

Mathematically:

For one example x(i):

z[1](i)=W[1]x(i)+b[1](i)(1)
a[1](i)=tanh(z[1](i))(2)
z[2](i)=W[2]a[1](i)+b[2](i)(3)
ŷ (i)=a[2](i)=σ(z[2](i))(4)
y(i)prediction={10if a[2](i)>0.5otherwise (5)

Given the predictions on all the examples, you can also compute the cost J as follows:

J=−1m∑i=0m(y(i)log(a[2](i))+(1−y(i))log(1−a[2](i)))(6)

Reminder: The general methodology to build a Neural Network is to:
1. Define the neural network structure ( # of input units, # of hidden units, etc).
2. Initialize the model’s parameters
3. Loop:
- Implement forward propagation
- Compute loss
- Implement backward propagation to get the gradients
- Update parameters (gradient descent)

You often build helper functions to compute steps 1-3 and then merge them into one function we call nn_model(). Once you’ve built nn_model() and learnt the right parameters, you can make predictions on new data.

4.1 - Defining the neural network structure

Exercise: Define three variables:
- n_x: the size of the input layer
- n_h: the size of the hidden layer (set this to 4)
- n_y: the size of the output layer

Hint: Use shapes of X and Y to find n_x and n_y. Also, hard code the hidden layer size to be 4.

# GRADED FUNCTION: layer_sizes

def layer_sizes(X, Y):
    """
    Arguments:
    X -- input dataset of shape (input size, number of examples)
    Y -- labels of shape (output size, number of examples)

    Returns:
    n_x -- the size of the input layer
    n_h -- the size of the hidden layer
    n_y -- the size of the output layer
    """
    ### START CODE HERE ### (≈ 3 lines of code)
    n_x = X.shape[0] # size of input layer
    n_h = 4
    n_y =Y.shape[0]# size of output layer
    ### END CODE HERE ###
    return (n_x, n_h, n_y)

X_assess, Y_assess = layer_sizes_test_case()
(n_x, n_h, n_y) = layer_sizes(X_assess, Y_assess)
print("The size of the input layer is: n_x = " + str(n_x))
print("The size of the hidden layer is: n_h = " + str(n_h))
print("The size of the output layer is: n_y = " + str(n_y))

The size of the input layer is: n_x = 5
The size of the hidden layer is: n_h = 4
The size of the output layer is: n_y = 2

Expected Output (these are not the sizes you will use for your network, they are just used to assess the function you’ve just coded).

n_x	5
n_h	4
n_y	2

4.2 - Initialize the model’s parameters

Exercise: Implement the function initialize_parameters().

Instructions:
- Make sure your parameters’ sizes are right. Refer to the neural network figure above if needed.
- You will initialize the weights matrices with random values.
- Use: np.random.randn(a,b) * 0.01 to randomly initialize a matrix of shape (a,b).
- You will initialize the bias vectors as zeros.
- Use: np.zeros((a,b)) to initialize a matrix of shape (a,b) with zeros.

# GRADED FUNCTION: initialize_parameters

def initialize_parameters(n_x, n_h, n_y):
    """
    Argument:
    n_x -- size of the input layer
    n_h -- size of the hidden layer
    n_y -- size of the output layer

    Returns:
    params -- python dictionary containing your parameters:
                    W1 -- weight matrix of shape (n_h, n_x)
                    b1 -- bias vector of shape (n_h, 1)
                    W2 -- weight matrix of shape (n_y, n_h)
                    b2 -- bias vector of shape (n_y, 1)
    """

    np.random.seed(2) # we set up a seed so that your output matches ours although the initialization is random.

    ### START CODE HERE ### (≈ 4 lines of code)
    W1 = np.random.randn(n_h,n_x)*0.01
    b1 = np.zeros((n_h,1))
    W2 = np.random.randn(n_y,n_h)*0.01
    b2 = np.zeros((n_y,1))
    ### END CODE HERE ###

    assert (W1.shape == (n_h, n_x))
    assert (b1.shape == (n_h, 1))
    assert (W2.shape == (n_y, n_h))
    assert (b2.shape == (n_y, 1))

    parameters = {"W1": W1,
                  "b1": b1,
                  "W2": W2,
                  "b2": b2}

    return parameters

n_x, n_h, n_y = initialize_parameters_test_case()

parameters = initialize_parameters(n_x, n_h, n_y)
print("W1 = " + str(parameters["W1"]))
print("b1 = " + str(parameters["b1"]))
print("W2 = " + str(parameters["W2"]))
print("b2 = " + str(parameters["b2"]))

W1 = [[-0.00416758 -0.00056267]
 [-0.02136196  0.01640271]
 [-0.01793436 -0.00841747]
 [ 0.00502881 -0.01245288]]
b1 = [[ 0.]
 [ 0.]
 [ 0.]
 [ 0.]]
W2 = [[-0.01057952 -0.00909008  0.00551454  0.02292208]]
b2 = [[ 0.]]

Expected Output:

W1	[[-0.00416758 -0.00056267] [-0.02136196 0.01640271] [-0.01793436 -0.00841747] [ 0.00502881 -0.01245288]]
b1	[[ 0.] [ 0.] [ 0.] [ 0.]]
W2	[[-0.01057952 -0.00909008 0.00551454 0.02292208]]
b2	[[ 0.]]

4.3 - The Loop

Question: Implement forward_propagation().

Instructions:
- Look above at the mathematical representation of your classifier.
- You can use the function sigmoid(). It is built-in (imported) in the notebook.
- You can use the function np.tanh(). It is part of the numpy library.
- The steps you have to implement are:
1. Retrieve each parameter from the dictionary “parameters” (which is the output of initialize_parameters()) by using parameters[".."].
2. Implement Forward Propagation. Compute Z[1],A[1],Z[2] and A[2] (the vector of all your predictions on all the examples in the training set).
- Values needed in the backpropagation are stored in “cache“. The cache will be given as an input to the backpropagation function.

# GRADED FUNCTION: forward_propagation

def forward_propagation(X, parameters):
    """
    Argument:
    X -- input data of size (n_x, m)
    parameters -- python dictionary containing your parameters (output of initialization function)

    Returns:
    A2 -- The sigmoid output of the second activation
    cache -- a dictionary containing "Z1", "A1", "Z2" and "A2"
    """
    # Retrieve each parameter from the dictionary "parameters"
    ### START CODE HERE ### (≈ 4 lines of code)
    W1 = parameters["W1"]
    b1 = parameters["b1"]
    W2 = parameters["W2"]
    b2 = parameters["b2"]
    ### END CODE HERE ###

    # Implement F

 
 
              
           
              
              
            
            相關推薦
			   
            
            
            
 

    

    
    吳恩達deeplearning作業-Planar data classification with one hidden layer
      
							
							
							

Planar data classification with one hidden layer

Welcome to your week 3 programming assignment. It’s time to build your first ne 

  
 

    

    
    Planar data classification with one hidden layer
       
 
  
  
 From Logistic Regression with a Neural Network mindset, we achieved the Neural Network which use Logistic Regression to resolve the linear class 

  
 

    

    
    Neural Networks and Deep Learning（week3）Planar data classification with one hidden layer(基於單隱層的平面數據分類)
      one   hid   線性   deep   with   ica   ural   神經網絡   二分   Planar data classification with one hidden layer
你會學習到如何：


用單隱層實現一個二分類神經網絡


使用一個非線性激勵函數，如  

  
 

    

    
    吳恩達deeplearning作業-Logistic Regression with a Neural Network
      
							
							
							Logistic Regression with a Neural Network作業需要用到的資料資料



Logistic Regression with a Neural Network mindset

Welcome to your first (r 

  
 

    

    
    吳恩達 DeepLearning  神經網路基礎 第一課第三週程式設計題目及作業
      
								
								            
							
							
							宣告

此周的作業放在下面的連結那裡，裡面包含了所有資料。

作業連結

在一些程式碼上我寫上了註釋，再次明確了建立一個模型的步驟，此作業由於並不是很複雜，所以一些步驟合併了，但是總體的建模步驟還是跟部 

  
 

    

    
    吳恩達DeepLearning.ai（神經網路和深度學習） 第二週程式設計作業
      轉載於：http://blog.csdn.net/Koala_Tree/article/details/78057033吳恩達Coursera課程 DeepLearning.ai 程式設計作業系列，本文為《神經網路與深度學習》部分的第二週“神經網路基礎”的課程作業（做了無用部分的刪減）。Part 1：Pyth 

  
 

    

    
    吳恩達 DeepLearning  第一課第二週程式設計題目及作業（可免費下載資源）
      
							
							
							提示

作業裡面會有需要用到的 Python 模組以及資料集。所以我下面再附上所需要的檔案下載連結，不把所有檔案連同作業放一起打包好的目的是讓第一次接觸 Python 的人更多的瞭解 Python ， 萬事開頭難，希望大傢伙明白。

檔案連結



宣告

這一 

  
 

    

    
    吳恩達deeplearning課程作業環境
      
							
							
							Deep learning課程安裝

Course 4 prerequisites: 
- 2017/2/14 keras /pydot / graphviz 
- 2017/2/20 pandas for week 3 
- 2017/2/23 opencv  

  
 

    

    
    用純Python實現迴圈神經網路RNN向前傳播過程(吳恩達DeepLearning.ai作業)
      Google TensorFlow程式設計師點讚的文章！
  
前言
目錄:
    - 向量表示以及它的維度
    - rnn cell
    - rnn 向前傳播
&nbs 

  
 

    

    
    吳恩達DeepLearning.ai的Sequence model作業Dinosaurus Island
       
目錄
1 問題設定
 


1.1 資料集和預處理


1.2 概覽整個模型


 
2. 建立模型模組
 


2.1 在優化迴圈中梯度裁剪


2.2 取樣


 
3. 構建語言模型
 


3.1 梯度下降


3.2 訓練模型


&n 

  
 

    

    
    吳恩達 DeepLearning.ai 課程提煉筆記（4-2）卷積神經網絡 --- 深度卷積模型
      mes   and   shift   abd   actual   應用   比賽   特征   通道   1. 經典的卷積網絡
介紹幾種經典的卷積神經網絡結構，分別是LeNet、AlexNet、VGGNet。
LeNet-5：
LeNet-5主要是針對灰度設計的，所以其輸入較小，為  ，其結構如下： 

  
 

    

    
    吳恩達 DeepLearning 第二課第三週程式設計 tensorflow
       
 
 TensorFlow Tutorial 
 Welcome to this week's programming assignment. Until now, you've always used numpy to build neural networks. Now we will step yo 

  
 

    

    
    吳恩達 DeepLearning 第一課第二週 測驗 · Neural Network Basics
      





 


 

 



————————————————–中文翻譯—————————————————————————————–
1、神經元的計算是什麼？(B)
A. 在將輸出應用到啟用函式之前, 神經元計算所有特徵的平均值
B. 神經元計算一個線性函式 (z = Wx + b), 然後是一個啟用函 

  
 

    

    
    吳恩達deeplearning之CNN—卷積神經網路入門
      
1.邊界檢測示例 
假如你有一張如下的影象，你想讓計算機搞清楚影象上有什麼物體，你可以做的事情是檢測影象的垂直邊緣和水平邊緣。  
如下是一個6*6的灰度影象，構造一個3*3的矩陣，在卷積神經網路中通常稱之為filter，對這個6*6的影象進行卷積運算，以左上角的-5計算為例  3*1+ 

  
 

    

    
    吳恩達Deeplearning.ai 知識點梳理（course 5，week 3）
      
							
							
							  本週主要講了多種sequence to sequence結構，包括conditional language model，beam searching，bleu，attention，speech recognition，以及trigger word detec 

  
 

    

    
    吳恩達DeepLearning.ai《深度學習》課程筆記目錄總集
      
							
							
							

本文釋出在知乎的專欄中，為了方便習慣使用CSDN的使用者，更改了下面文章的直鏈到CSDN中的筆記。 
同時，也歡迎大家關注我的知乎：大樹先生，會不定期有新的乾貨更新。一起學習一起進步呀！^_^

DeepLearning.ai簡介
deepLearning. 

  
 

    

    
    吳恩達DeepLearning.ai筆記（5-1）-- 迴圈序列模型
      
							
							
							吳恩達DeepLearning.ai筆記（5-1）– 迴圈序列模型



1.一些序列資料例子





2.數學符號

 
 x<1>x<1>輸入序列X第一個單詞，TixTxi輸入序列X的單詞個數，X（i）<t>X（i）&l 

  
 

    

    
    【Coursera】吳恩達 deeplearning.ai 05.序列模型 第一週 迴圈序列模型 課程筆記
       
 
  
  
 迴圈序列模型 
 為什麼選擇序列模型 
 在語音識別、音樂生成、情感分類、DNA序列分析、機器翻譯、視訊識別、命名實體識別等任務中，共同特點是輸入X和/或輸出Y都是序列。   
 數學符號 
 例如，對於命名實體識別的問題： 
  
  
  對於輸入序列，用
    
     x&l 

  
 

    

    
    【Coursera】吳恩達 deeplearning.ai 04.卷積神經網路 第二週 深度卷積神經網路 課程筆記
       
 
  
  
 深度卷積神經網路 
 2.1 為什麼要進行例項化 
 實際上，在計算機視覺任務中表現良好的神經網路框架，往往也適用於其他任務。 
 2.2 經典網路 
  
  LeNet-5 
  AlexNet 
  VGG 
  
 LeNet-5 
 主要針對灰度影象   
 隨著神經網路的加深 

  
 

    

    
    吳恩達deeplearning.ai第四課學習心得：卷積神經網路與計算機視覺
      
不久前，Coursera 上放出了吳恩達 deeplearning.ai 的第四門課程《卷積神經網路》。本文是加拿大國家銀行首席分析師 Ryan Shrott 在完成該課程後所寫的學習心得，有助於大家直觀地瞭解、學習計算機視覺。




我最近在 Coursera 上完成了吳恩達教授的計算機視覺課程。吳恩達

吳恩達deeplearning作業-Planar data classification with one hidden layer

Planar data classification with one hidden layer

1 - Packages

2 - Dataset

3 - Simple Logistic Regression

4 - Neural Network model

4.1 - Defining the neural network structure

4.2 - Initialize the model’s parameters

4.3 - The Loop

吳恩達deeplearning作業-Planar data classification with one hidden layer

Planar data classification with one hidden layer

Neural Networks and Deep Learning（week3）Planar data classification with one hidden layer(基於單隱層的平面數據分類)

吳恩達deeplearning作業-Logistic Regression with a Neural Network

吳恩達 DeepLearning 神經網路基礎第一課第三週程式設計題目及作業

吳恩達DeepLearning.ai（神經網路和深度學習）第二週程式設計作業

吳恩達 DeepLearning 第一課第二週程式設計題目及作業（可免費下載資源）

吳恩達deeplearning課程作業環境

用純Python實現迴圈神經網路RNN向前傳播過程(吳恩達DeepLearning.ai作業)

吳恩達DeepLearning.ai的Sequence model作業Dinosaurus Island

吳恩達 DeepLearning.ai 課程提煉筆記（4-2）卷積神經網絡 --- 深度卷積模型

吳恩達 DeepLearning 第二課第三週程式設計 tensorflow

吳恩達 DeepLearning 第一課第二週測驗 · Neural Network Basics

吳恩達deeplearning之CNN—卷積神經網路入門

吳恩達Deeplearning.ai 知識點梳理（course 5，week 3）

吳恩達DeepLearning.ai《深度學習》課程筆記目錄總集

吳恩達DeepLearning.ai筆記（5-1）-- 迴圈序列模型

【Coursera】吳恩達 deeplearning.ai 05.序列模型第一週迴圈序列模型課程筆記

【Coursera】吳恩達 deeplearning.ai 04.卷積神經網路第二週深度卷積神經網路課程筆記

吳恩達deeplearning.ai第四課學習心得：卷積神經網路與計算機視覺