
"Deep Learning (Andrew Ng)": Course 4, Week 2 Programming Assignment

Course 4 of the Deep Learning specialization is Convolutional Neural Networks, covering four weeks:

  1. Week 1: Foundations of Convolutional Neural Networks (what convolution means, what each layer does, how to compute the shape of the data at each layer)
  2. Week 2: Deep Convolutional Models: Case Studies (classic networks such as LeNet-5 and ResNet-50, transfer learning, data augmentation)
  3. Week 3: Object Detection (object detection, evaluation metrics, the YOLO algorithm)
  4. Week 4: Special Applications: Face Recognition and Neural Style Transfer (... haven't gotten to this yet)

The link above contains the assignments and answers. Here I mainly summarize the Week 2 assignment and write up what I learned as notes.

The Keras framework

First, here is the Chinese documentation for Keras: Keras中文文件. When something is unclear, consulting the official documentation is still the most effective approach.

Introduction to Keras

Many deep learning frameworks already existed, such as TensorFlow, Caffe, and Theano; Keras appeared to further simplify the process of building neural networks for beginners. If you only have a conceptual understanding of deep learning and are not yet familiar with TensorFlow concepts such as tensors, you can use Keras to assemble your own network quickly, which also fits Ng's philosophy of rapidly prototyping a network.

Using network layers

The network layers in Keras are all simple to use: just import them from keras.layers. Here is the 2D convolution layer as an example:

from keras import layers
from keras.layers import Conv2D

# The output is the X on the left; the input is the X in the trailing parentheses.
# A 2D convolution with eight 3x3 filters.
X = Conv2D(8, kernel_size=(3, 3), strides=(1, 1))(X)

keras.layers.convolutional.Conv2D(filters, kernel_size, strides=(1, 1), padding='valid', data_format=None, dilation_rate=(1, 1), activation=None, use_bias=True, kernel_initializer='glorot_uniform', bias_initializer='zeros', kernel_regularizer=None, bias_regularizer=None, activity_regularizer=None, kernel_constraint=None, bias_constraint=None)

Parameters

  • filters: number of convolution kernels (i.e., the dimensionality of the output)
  • kernel_size: a single integer or a list/tuple of two integers giving the width and height of the convolution kernel. A single integer means the same length in every spatial dimension.
  • strides: a single integer or a list/tuple of two integers giving the convolution stride. A single integer means the same stride in every spatial dimension. Any strides value other than 1 is incompatible with any dilation_rate other than 1.
  • padding: zero-padding strategy, either "valid" or "same". "valid" performs only valid convolutions, i.e., border data is not processed. "same" preserves the convolution result at the borders, which (with stride 1) makes the output shape equal to the input shape (see the shape-check sketch after this list).
  • activation: activation function, either the name of a predefined activation (see activations) or an element-wise Theano function. If unspecified, no activation is applied (i.e., the linear activation a(x) = x).
  • dilation_rate: a single integer or a list/tuple of two integers specifying the dilation rate for dilated convolution. Any dilation_rate other than 1 is incompatible with any strides other than 1.
  • data_format: a string, either "channels_first" or "channels_last", indicating the position of the channel dimension in the image data. This parameter was image_dim_ordering in Keras 1.x: "channels_last" corresponds to the old "tf", "channels_first" to the old "th". For a 128x128 RGB image, "channels_first" organizes the data as (3, 128, 128), while "channels_last" organizes it as (128, 128, 3). The default is the value set in ~/.keras/keras.json, or "channels_last" if it has never been set.
  • use_bias: boolean, whether to use a bias term
  • kernel_initializer: initialization method for the weights, either the string name of a predefined initializer or an initializer object. See initializers.
  • bias_initializer: initialization method for the bias vector, either the string name of a predefined initializer or an initializer object. See initializers.
  • kernel_regularizer: regularizer applied to the weights, a Regularizer object
  • bias_regularizer: regularizer applied to the bias vector, a Regularizer object
  • activity_regularizer: regularizer applied to the output, a Regularizer object
  • kernel_constraint: constraint applied to the weights, a Constraints object
  • bias_constraint: constraint applied to the bias, a Constraints object
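
To make the shape behavior concrete, here is a minimal sketch (the layer sizes are illustrative, not from the assignment) comparing "valid" and "same" padding with the functional API:

from keras.layers import Input, Conv2D
from keras.models import Model

# 32x32 RGB input; the batch dimension is implicit (None).
X_in = Input(shape=(32, 32, 3))
X_valid = Conv2D(8, kernel_size=(3, 3), strides=(1, 1), padding='valid')(X_in)
X_same = Conv2D(8, kernel_size=(3, 3), strides=(1, 1), padding='same',
                activation='relu')(X_in)

print(Model(X_in, X_valid).output_shape)  # (None, 30, 30, 8): borders trimmed
print(Model(X_in, X_same).output_shape)   # (None, 32, 32, 8): shape preserved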

Visualization

The keras.utils.vis_utils module provides functions for plotting Keras models (using graphviz).

Two dependency packages must be installed first (in this order):

  1. graphviz
    sudo apt-get install graphviz
    If it complains that some dependencies cannot be installed, fix the dependencies first and then install graphviz:

    sudo apt-get -f install

    sudo apt-get install graphviz

  2. pydot-ng
    sudo pip3 install pydot-ng

Once installed, you can plot your own network model from code. Here the happyModel from the first assignment serves as an example:

### keras visualization
from keras.utils import plot_model

plot_model(happyModel, to_file='happymodel.png', show_shapes=True)

(Figure: the happyModel architecture diagram produced by plot_model)
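
If graphviz or pydot cannot be installed, the model's built-in text summary is an alternative:

# Plain-text summary of layers and output shapes; needs no extra packages
happyModel.summary()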

ResNet50

There are many explanations of the principle behind residual networks online. Intuitively, I think of it as feeding the data from earlier layers directly to later layers, effectively letting the network "soft-bypass" the intermediate layers, which simplifies a very deep network. (Just my personal take, to be revisited as I keep learning.)
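
In code, that intuition is just an element-wise add between the main path and the shortcut. A minimal sketch (the sizes here are illustrative):

from keras import layers
from keras.layers import Input, Conv2D, Activation
from keras.models import Model

X_in = Input((32, 32, 16))
# Main path: a convolution that preserves shape ('same' padding, stride 1)
F = Conv2D(16, (3, 3), padding='same')(X_in)
# Shortcut: the input itself, added back before the activation
X_out = Activation('relu')(layers.add([F, X_in]))
print(Model(X_in, X_out).output_shape)  # (None, 32, 32, 16)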

Residual network structure diagrams:
(Figures: residual network structure diagrams)

Implementing ResNet50 with Keras

Below are the main functions. They all use Keras: write out the required network layers, then wire them together.

identity block

################# block identity #################
from keras import layers
from keras.layers import Conv2D, BatchNormalization, Activation
from keras.initializers import glorot_uniform

def identity_block(X, f, filters, stage, block):
    """
    Implementation of the identity block as defined in Figure 4

    Arguments:
    X -- input tensor of shape (m, n_H_prev, n_W_prev, n_C_prev)
    f -- integer, specifying the shape of the middle CONV's window for the main path
    filters -- python list of integers, defining the number of filters in the CONV layers of the main path
    stage -- integer, used to name the layers, depending on their position in the network
    block -- string/character, used to name the layers, depending on their position in the network

    Returns:
    X -- output of the identity block, tensor of shape (n_H, n_W, n_C)
    """

    # defining name basis
    conv_name_base = 'res' + str(stage) + block + '_branch'
    bn_name_base = 'bn' + str(stage) + block + '_branch'

    # retrieve filters
    F1, F2, F3 = filters

    # Save the input value. You'll need this later to add back to the main path.
    X_shortcut = X

    # First component of main path
    X = Conv2D(filters = F1, kernel_size=(1,1), strides=(1,1), padding='valid',name=conv_name_base + '2a',
               kernel_initializer=glorot_uniform(seed=0))(X)
    X = BatchNormalization(axis=3, name=bn_name_base + '2a')(X)
    X = Activation('relu')(X)

    # Second component of main path
    X = Conv2D(filters=F2, kernel_size=(f, f), strides=(1, 1), padding='same', name=conv_name_base + '2b',
               kernel_initializer=glorot_uniform(seed=0))(X)
    X = BatchNormalization(axis=3,name=bn_name_base + '2b')(X)
    X = Activation('relu')(X)

    # Third component of main path
    X = Conv2D(filters=F3, kernel_size=(1, 1), strides=(1, 1), padding='valid', name=conv_name_base + '2c',
               kernel_initializer=glorot_uniform(seed=0))(X)
    X = BatchNormalization(axis=3, name=bn_name_base + '2c')(X)

    # Final step: Add shortcut value to main path, and pass it through a RELU activation
    X = layers.add([X, X_shortcut])
    X = Activation('relu')(X)

    return X
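
A quick sanity check (a sketch; the input shape and filter counts are arbitrary, except that F3 must match the input channel count so the add is valid):

from keras.layers import Input
from keras.models import Model

X_in = Input((4, 4, 6))
X_out = identity_block(X_in, f=2, filters=[2, 4, 6], stage=1, block='a')
print(Model(X_in, X_out).output_shape)  # (None, 4, 4, 6): shape is preserved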

convolutional block

################# block convolutional #################
# GRADED FUNCTION: convolutional_block

def convolutional_block(X, f, filters, stage, block, s=2):
    """
    Implementation of the convolutional block as defined in Figure 4

    Arguments:
    X -- input tensor of shape (m, n_H_prev, n_W_prev, n_C_prev)
    f -- integer, specifying the shape of the middle CONV's window for the main path
    filters -- python list of integers, defining the number of filters in the CONV layers of the main path
    stage -- integer, used to name the layers, depending on their position in the network
    block -- string/character, used to name the layers, depending on their position in the network
    s -- Integer, specifying the stride to be used

    Returns:
    X -- output of the convolutional block, tensor of shape (n_H, n_W, n_C)
    """

    # defining name basis
    conv_name_base = 'res' + str(stage) + block + '_branch'
    bn_name_base = 'bn' + str(stage) + block + '_branch'

    # Retrieve Filters
    F1, F2, F3 = filters

    # Save the input value
    X_shortcut = X

    ##### MAIN PATH #####
    # First component of main path
    X = Conv2D(F1, (1, 1), strides=(s, s), name=conv_name_base + '2a', padding='valid',
               kernel_initializer=glorot_uniform(seed=0))(X)
    X = BatchNormalization(axis=3, name=bn_name_base + '2a')(X)
    X = Activation('relu')(X)

    ### START CODE HERE ###

    # Second component of main path (≈3 lines)
    X = Conv2D(F2, (f, f), strides=(1, 1), name=conv_name_base + '2b', padding='same',
               kernel_initializer=glorot_uniform(seed=0))(X)
    X = BatchNormalization(axis=3, name=bn_name_base + '2b')(X)
    X = Activation('relu')(X)

    # Third component of main path (≈2 lines)
    X = Conv2D(F3, (1, 1), strides=(1, 1), name=conv_name_base + '2c', padding='valid',
               kernel_initializer=glorot_uniform(seed=0))(X)
    X = BatchNormalization(axis=3, name=bn_name_base + '2c')(X)

    ##### SHORTCUT PATH #### (≈2 lines)
    X_shortcut = Conv2D(F3, (1, 1), strides=(s, s), name=conv_name_base + '1', padding='valid',
                        kernel_initializer=glorot_uniform(seed=0))(X_shortcut)
    X_shortcut = BatchNormalization(axis=3, name=bn_name_base + '1')(X_shortcut)

    # Final step: Add shortcut value to main path, and pass it through a RELU activation (≈2 lines)
    X = layers.add([X, X_shortcut])
    X = Activation('relu')(X)

    ### END CODE HERE ###

    return X
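
The same kind of check for the convolutional block (again a sketch with arbitrary sizes, reusing Input and Model from the previous check) shows the stride s downsampling the spatial dimensions while F3 sets the output channels:

X_in = Input((8, 8, 3))
X_out = convolutional_block(X_in, f=3, filters=[2, 4, 16], stage=1, block='a', s=2)
print(Model(X_in, X_out).output_shape)  # (None, 4, 4, 16): halved spatially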

Composing ResNet50 from the identity block and the convolutional block

################# ResNet50 #################
from keras.layers import Input, ZeroPadding2D, MaxPooling2D, AveragePooling2D, Flatten, Dense
from keras.models import Model

def ResNet50(input_shape=(64, 64, 3), classes=6):
    """
     Implementation of the popular ResNet50 with the following architecture:
     CONV2D -> BATCHNORM -> RELU -> MAXPOOL -> CONVBLOCK -> IDBLOCK*2 -> CONVBLOCK -> IDBLOCK*3
     -> CONVBLOCK -> IDBLOCK*5 -> CONVBLOCK -> IDBLOCK*2 -> AVGPOOL -> TOPLAYER

     Arguments:
     input_shape -- shape of the images of the dataset
     classes -- integer, number of classes

     Returns:
     model -- a Model() instance in Keras
     """
    # Define the input as a tensor with shape input_shape
    X_input = Input(input_shape)

    # Zero-padding
    X = ZeroPadding2D((3,3))(X_input)

    # Stage 1
    X = Conv2D(64, (7,7), strides=(2,2), name='conv1', kernel_initializer=glorot_uniform(seed=0))(X)
    X = BatchNormalization(axis=3,name='bn_conv1')(X)
    X = Activation('relu')(X)
    X = MaxPooling2D((3,3), strides=(2,2))(X)

    # Stage 2
    X = convolutional_block(X, f=3, filters=[64,64,256], stage=2, block='a', s=1)
    X = identity_block(X, 3, [64,64,256], stage=2, block='b')
    X = identity_block(X, 3, [64,64,256], stage=2, block='c')

    # Stage 3
    X = convolutional_block(X, f=3, filters=[128,128,512], stage=3, block='a',s=2)
    X = identity_block(X, f=3, filters=[128,128,512], stage=3, block='b')
    X = identity_block(X, f=3, filters=[128,128,512], stage=3, block='c')
    X = identity_block(X, f=3, filters=[128,128,512], stage=3, block='d')

    # Stage 4
    X = convolutional_block(X, f=3, filters=[256, 256, 1024], block='a', stage=4, s=2)
    X = identity_block(X, f=3, filters=[256, 256, 1024], block='b', stage=4)
    X = identity_block(X, f=3, filters=[256, 256, 1024], block='c', stage=4)
    X = identity_block(X, f=3, filters=[256, 256, 1024], block='d', stage=4)
    X = identity_block(X, f=3, filters=[256, 256, 1024], block='e', stage=4)
    X = identity_block(X, f=3, filters=[256, 256, 1024], block='f', stage=4)

    # Stage 5 (≈3 lines)
    # The convolutional block uses three sets of filters of size [512, 512, 2048], "f" is 3, "s" is 2 and the block is "a".
    # The 2 identity blocks use three sets of filters of size [256, 256, 2048], "f" is 3 and the blocks are "b" and "c".
    X = convolutional_block(X, f = 3, filters=[512, 512, 2048], stage=5, block='a', s = 2)
    X = identity_block(X, f = 3, filters=[256, 256, 2048], stage=5, block='b')
    X = identity_block(X, f = 3, filters=[256, 256, 2048], stage=5, block='c')

    # Avgpool
    X = AveragePooling2D(pool_size=(2,2))(X)

    # output layer
    X = Flatten()(X)
    X = Dense(classes,activation='softmax', name='fc'+str(classes), kernel_initializer=glorot_uniform(seed=0))(X)

    # create model
    model = Model(inputs=X_input, outputs=X, name='ResNet50')
    return model
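
To actually train the model, the assignment compiles it and fits it on the SIGNS dataset. A minimal usage sketch (X_train/Y_train and X_test/Y_test stand for the preprocessed assignment data and are assumed to be loaded; the hyperparameters follow the assignment's setup):

model = ResNet50(input_shape=(64, 64, 3), classes=6)
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
model.fit(X_train, Y_train, epochs=2, batch_size=32)   # X_train, Y_train assumed loaded
preds = model.evaluate(X_test, Y_test)                 # X_test, Y_test assumed loaded
print("Loss = " + str(preds[0]))
print("Test Accuracy = " + str(preds[1]))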