Resnet50原始碼-tensorflow+keras詳細解析

阿新 • • 發佈：2018-11-26

Resnet50原始碼-tensorflow解析

參考keras中的原始碼進行解析

先載入一些庫的檔案

from __future__ import print_function

import numpy as np
import warnings

from keras.layers import Input
from keras import layers
from keras.layers import Dense
from keras.layers import Activation
from keras.layers import Flatten
from keras.layers import Conv2D
from keras.layers import MaxPooling2D
from keras.layers import GlobalMaxPooling2D
from keras.layers import ZeroPadding2D
from keras.layers import AveragePooling2D
from keras.layers import GlobalAveragePooling2D
from keras.layers import BatchNormalization
from keras.models import Model
from keras.preprocessing import image
import keras.backend as K
from keras.utils import layer_utils
from keras.utils.data_utils import get_file
from keras.applications.imagenet_utils import decode_predictions
from keras.applications.imagenet_utils import preprocess_input
from keras.applications.imagenet_utils import _obtain_input_shape
from keras.engine.topology import get_source_inputs

然後新增預訓練的權重（採用網上的resnet50進行舉例）

WEIGHTS_PATH = 'https://github.com/fchollet/deep-learning-models/releases/download/v0.2/resnet50_weights_tf_dim_ordering_tf_kernels.h5'
WEIGHTS_PATH_NO_TOP = 'https://github.com/fchollet/deep-learning-models/releases/download/v0.2/resnet50_weights_tf_dim_ordering_tf_kernels_notop.h5'

簡單瞭解下殘差網路的原理圖

然後我們可以開始定義結構的主體，首先定義identity(x)

def identity_block(input_tensor, kernel_size, filters, stage, block):
    """The identity block is the block that has no conv layer at shortcut.
    # Arguments
        input_tensor: input tensor  #輸入變數#
        kernel_size: defualt 3, the kernel size of middle conv layer at main path #卷積核的大小#
        filters: list of integers, the filterss of 3 conv layer at main path  #卷積核的數目#
        stage: integer, current stage label, used for generating layer names #當前階段的標籤#
        block: 'a','b'..., current block label, used for generating layer names #當前塊的標籤#
    # Returns
        Output tensor for the block.  #返回塊的輸出變數#
    """
    filters1, filters2, filters3 = filters  #濾波器的名稱#
    if K.image_data_format() == 'channels_last':  #代表影象通道維的位置#
        bn_axis = 3
    else:
        bn_axis = 1
    conv_name_base = 'res' + str(stage) + block + '_branch'
    bn_name_base = 'bn' + str(stage) + block + '_branch'

    x = Conv2D(filters1, (1, 1), name=conv_name_base + '2a')(input_tensor)
    x = BatchNormalization(axis=bn_axis, name=bn_name_base + '2a')(x)
    x = Activation('relu')(x)   #卷積層，BN層，啟用函式#

    x = Conv2D(filters2, kernel_size,
               padding='same', name=conv_name_base + '2b')(x)
    x = BatchNormalization(axis=bn_axis, name=bn_name_base + '2b')(x)
    x = Activation('relu')(x)

    x = Conv2D(filters3, (1, 1), name=conv_name_base + '2c')(x)
    x = BatchNormalization(axis=bn_axis, name=bn_name_base + '2c')(x)

    x = layers.add([x, input_tensor])
    x = Activation('relu')(x)
    return x

定義卷積層的結構

def conv_block(input_tensor, kernel_size, filters, stage, block, strides=(2, 2)):
    """conv_block is the block that has a conv layer at shortcut
    # Arguments
        input_tensor: input tensor
        kernel_size: defualt 3, the kernel size of middle conv layer at main path
        filters: list of integers, the filterss of 3 conv layer at main path
        stage: integer, current stage label, used for generating layer names
        block: 'a','b'..., current block label, used for generating layer names
    # Returns
        Output tensor for the block.
    Note that from stage 3, the first conv layer at main path is with strides=(2,2)
    And the shortcut should have strides=(2,2) as well
    """
    filters1, filters2, filters3 = filters
    if K.image_data_format() == 'channels_last':
        bn_axis = 3
    else:
        bn_axis = 1
    conv_name_base = 'res' + str(stage) + block + '_branch'
    bn_name_base = 'bn' + str(stage) + block + '_branch'

    x = Conv2D(filters1, (1, 1), strides=strides,
               name=conv_name_base + '2a')(input_tensor)
    x = BatchNormalization(axis=bn_axis, name=bn_name_base + '2a')(x)
    x = Activation('relu')(x)

    x = Conv2D(filters2, kernel_size, padding='same',
               name=conv_name_base + '2b')(x)
    x = BatchNormalization(axis=bn_axis, name=bn_name_base + '2b')(x)
    x = Activation('relu')(x)

    x = Conv2D(filters3, (1, 1), name=conv_name_base + '2c')(x)
    x = BatchNormalization(axis=bn_axis, name=bn_name_base + '2c')(x)

    shortcut = Conv2D(filters3, (1, 1), strides=strides,
                      name=conv_name_base + '1')(input_tensor)
    shortcut = BatchNormalization(axis=bn_axis, name=bn_name_base + '1')(shortcut)

    x = layers.add([x, shortcut])
    x = Activation('relu')(x)
    return x

現在讓我們用定義好的兩種塊去構建resnet50的主體結構，先看一下其原理結構圖：

看了原理圖之後開始構建框架圖：

def ResNet50(include_top=True, weights='imagenet',
             input_tensor=None, input_shape=None,
             pooling=None,
             classes=1000):  #這裡採用的權重是imagenet，可以更改，種類為1000#
if weights not in {'imagenet', None}:
        raise ValueError('The `weights` argument should be either '
                         '`None` (random initialization) or `imagenet` '
                         '(pre-training on ImageNet).')

    if weights == 'imagenet' and include_top and classes != 1000:
        raise ValueError('If using `weights` as imagenet with `include_top`'
                         ' as true, `classes` should be 1000')

    # Determine proper input shape
    input_shape = _obtain_input_shape(input_shape,
                                      default_size=224,
                                      min_size=197,
                                      data_format=K.image_data_format(),
                                      include_top=include_top)

    if input_tensor is None:
        img_input = Input(shape=input_shape)
    else:
        if not K.is_keras_tensor(input_tensor):
            img_input = Input(tensor=input_tensor, shape=input_shape)
        else:
            img_input = input_tensor
    if K.image_data_format() == 'channels_last':
        bn_axis = 3
    else:
        bn_axis = 1

    x = ZeroPadding2D((3, 3))(img_input) #對圖片介面填充0，保證特徵圖的大小#
    x = Conv2D(64, (7, 7), strides=(2, 2), name='conv1')(x) #定義卷積層#
    x = BatchNormalization(axis=bn_axis, name='bn_conv1')(x) #批標準化#
    x = Activation('relu')(x) #啟用函式#
    x = MaxPooling2D((3, 3), strides=(2, 2))(x) #最大池化層#
#stage2#
    x = conv_block(x, 3, [64, 64, 256], stage=2, block='a', strides=(1, 1))
    x = identity_block(x, 3, [64, 64, 256], stage=2, block='b')
    x = identity_block(x, 3, [64, 64, 256], stage=2, block='c')
#stage3#
    x = conv_block(x, 3, [128, 128, 512], stage=3, block='a')
    x = identity_block(x, 3, [128, 128, 512], stage=3, block='b')
    x = identity_block(x, 3, [128, 128, 512], stage=3, block='c')
    x = identity_block(x, 3, [128, 128, 512], stage=3, block='d')
#stage4#
    x = conv_block(x, 3, [256, 256, 1024], stage=4, block='a')
    x = identity_block(x, 3, [256, 256, 1024], stage=4, block='b')
    x = identity_block(x, 3, [256, 256, 1024], stage=4, block='c')
    x = identity_block(x, 3, [256, 256, 1024], stage=4, block='d')
    x = identity_block(x, 3, [256, 256, 1024], stage=4, block='e')
    x = identity_block(x, 3, [256, 256, 1024], stage=4, block='f')
#stage5#
    x = conv_block(x, 3, [512, 512, 2048], stage=5, block='a')
    x = identity_block(x, 3, [512, 512, 2048], stage=5, block='b')
    x = identity_block(x, 3, [512, 512, 2048], stage=5, block='c')

    x = AveragePooling2D((7, 7), name='avg_pool')(x) #平均池化層#

    if include_top:
        x = Flatten()(x)
        x = Dense(classes, activation='softmax', name='fc1000')(x)
    else:
        if pooling == 'avg':
            x = GlobalAveragePooling2D()(x)
        elif pooling == 'max':
            x = GlobalMaxPooling2D()(x)

    # Ensure that the model takes into account
    # any potential predecessors of `input_tensor`.
    if input_tensor is not None:
        inputs = get_source_inputs(input_tensor)
    else:
        inputs = img_input
    # Create model.
    model = Model(inputs, x, name='resnet50')  

    # load weights
    if weights == 'imagenet':
        if include_top:
            weights_path = get_file('resnet50_weights_tf_dim_ordering_tf_kernels.h5',
                                    WEIGHTS_PATH,
                                    cache_subdir='models',
                                    md5_hash='a7b3fe01876f51b976af0dea6bc144eb')
        else:
            weights_path = get_file('resnet50_weights_tf_dim_ordering_tf_kernels_notop.h5',
                                    WEIGHTS_PATH_NO_TOP,
                                    cache_subdir='models',
                                    md5_hash='a268eb855778b3df3c7506639542a6af')
        model.load_weights(weights_path)
        if K.backend() == 'theano':
            layer_utils.convert_all_kernels_in_model(model)

        if K.image_data_format() == 'channels_first':
            if include_top:
                maxpool = model.get_layer(name='avg_pool')
                shape = maxpool.output_shape[1:]
                dense = model.get_layer(name='fc1000')
                layer_utils.convert_dense_weights_data_format(dense, shape, 'channels_first')

            if K.backend() == 'tensorflow':
                warnings.warn('You are using the TensorFlow backend, yet you '
                              'are using the Theano '
                              'image data format convention '
                              '(`image_data_format="channels_first"`). '
                              'For best performance, set '
                              '`image_data_format="channels_last"` in '
                              'your Keras config '
                              'at ~/.keras/keras.json.')
    return model

搭建完主體結構之後，便可以開始進行測試了。

if __name__ == '__main__':
    model = ResNet50(include_top=True, weights='imagenet')

    img_path = 'elephant.jpg'
    img = image.load_img(img_path, target_size=(224, 224))
    x = image.img_to_array(img)
    x = np.expand_dims(x, axis=0)
    x = preprocess_input(x)
    print('Input image shape:', x.shape)

    preds = model.predict(x)
    print('Predicted:', decode_predictions(preds))

作為一名深度學習領域的新手，我的建議是大家可以在掌握原理的前提之後，開始看論文的原始碼之後，在把resnet網路的主體結構自己在重新coding 一下，這樣不僅加深自己的理解，同時也會使自己具備看程式碼的耐心，避免之後看到龐大的程式碼庫便心生怯意，同時大家不懂的地方可以在下方評論。

Resnet50原始碼-tensorflow+keras詳細解析

Resnet50原始碼-tensorflow解析原理解析：何凱明論文PPT-秒懂原理專案地址：Resnet50原始碼參考keras中的原始碼進行解析先載入一些庫的檔案 from __future__ import print_function import numpy as

BiSeNet 語義分割公眾號幸運飛艇原始碼下載網路結構詳細解析

針對 BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation.該論文公眾號幸運飛艇原始碼下載QQ2952777280【話仙原始碼論壇】 hxforum.com 提出的語義分割網路，根據第三方實現

SSD-Tensorflow超詳細解析【一】：載入模型對圖片進行測試

SSD-tensorflow——github下載地址：SSD-Tensorflow目標檢測的塊速實現下載完成之後我們開啟工程，可以看到如下圖所示的檔案佈局：首先我們開啟checkpoints檔案，解壓縮ssd_300_vgg.ckpt.zip檔案到checkpoints目錄下

Vue原始碼詳細解析

Vue原始碼詳細解析教程包含了Vue中從資料observe到模板解析、transclude、compile、link、指令的bind、update、dom批處理更新、陣列diff等等環節，基本涵蓋了Vue整個生命週期過程。訂閱新文章請watch本專案。文章主線劇情

詳細解析Android的View事件分發機制附帶原始碼分析

前言在Android中，事件分發機制是一塊很重要的知識點，掌握這個機制能幫你在平時的開發中解決掉很多的View事件衝突問題，這個問題也是面試中問的比較多的一個問題了，今天就來總結下這個知識點。事件分發機制事件分發原因 Android中頁面上的View是以

Android技能樹 — 網路小結(7)之 Retrofit原始碼詳細解析

前言：哈哈，其實寫的還是很水，各位原諒我O(∩_∩)O。介於自己的網路方面知識爛的一塌糊塗，所以準備寫相關網路的文章，但是考慮全部寫在一篇太長了，所以分開寫，希望大家能仔細看，最好可以指出我的錯誤，讓我也能糾正。 1.講解相關的整個網路體系結構： Android技能樹 — 網路小結(1)之網路體系

《TensorFlow+Keras深度學習人工智慧實踐應用》林大貴版-解析

參考部落格：【Keras-CNN】CIFAR-10 https://blog.csdn.net/bryant_meng/article/details/81077196#1_Data_preprocessing_12 目錄：https://blog.csdn.net/b

Word2Vec原始碼詳細解析（上）

相關連結： 1、Word2Vec原始碼最詳細解析（上） 2、Word2Vec原始碼最詳細解析（下） Word2Vec原始碼最詳細解析（上）在這一部分中，主要介紹的是Word2Vec原始碼中的主要資料結構、各個變數的含義與作用，以及所有演算法之外的輔助函式，包括如何

原始碼詳細解析Activity生命週期onResume中Handler.Post(Runnable)和View.Post(Runnable)的UI效果差異原因

一般需求中會出現在Activity啟動中需要獲取Ui控制元件相關大小或者在介面繪製完成之後重新整理資料，我們都知道在UI繪製完成之後，時機最好，不會阻塞主執行緒導致卡頓或者UI控制元件引數獲取失敗。也許大家使用過或知道Handler（MainLooper）.

Word2Vec原始碼詳細解析（下）

相關連結： 1、Word2Vec原始碼最詳細解析（上） 2、Word2Vec原始碼最詳細解析（下） Word2Vec原始碼最詳細解析（下）在這一部分中，重點分析的是Word2Vec原始碼中演算法部分的實現，需要一定得演算法理論基礎，如果對CBOW和

spring4.0 原始碼分析 bean各種屬性詳細解析(四)

一、軟體版本 spring-framework-4.0.7.RELEASE-dist jdk1.7.0.79 myeclipse9.1 二、bean各種屬性詳細解析在上篇的示例中，給出了原始碼，再看一下： public AbstractBeanDefin

Jdk1.8集合框架之HashMap原始碼解析（詳細解析紅黑樹）

HashMap特點不同步，支援null的鍵和值，put或get操作通常是常數時間。 Map介面的實現。去掉了Hashtable的contains(Object value)方法，保留containsKey和containsValue方法。使用

HashMap 原始碼詳細解析 (JDK1.8)

概要 HashMap 最早出現在 JDK 1.2 中，底層基於雜湊演算法實現。HashMap 允許 null 鍵和 null 值，在計算哈鍵的雜湊值時，null 鍵雜湊值為 0。HashMap 並不保證鍵值對的順序，這意味著在進行某些操作後，鍵值對的順序可能會發生變化。另外，需要注意的是，HashMap 是

redis配置詳細解析

keep turn name sort out 配置文件 trac lte eid # redis 配置文件示例 # 當你需要為某個配置項指定內存大小的時候，必須要帶上單位， # 通常的格式就是 1k 5gb 4m 等： # # 1k => 1000 bytes

CDN原理詳細解析

cdn dns負載均衡文件分發網絡 1.用戶向瀏覽器輸入www.web.com這個域名，瀏覽器第一次發現本地沒有dns緩存，則向網站的DNS服務器請求；2.網站的DNS域名解析器設置了CNAME，指向了www.web.51cdn.com,請求指向了CDN網絡中的智能DNS負載均衡系統；3.智能D

2017年軟考各科最新真題詳細解析資料集錦

軟考真題軟考答案軟考真題答案軟考真題資料軟考真題視頻作為51CTO學院的軟考培訓講師，本著對廣大學員負責的態度，在每年同學們參加完軟考考試，我都會盡早的給大家發布各科的真題詳細解析資料。一方面是為了參加軟考考試的同學對自己考試情況做一個準確評估；另一方面是為未來參加軟考考試的學員

Tensorflow之MNIST解析

浪潮每一個 col dir html 相關操作 ros 復雜老師要說2017年什麽技術最火爆，無疑是google領銜的深度學習開源框架Tensorflow。本文簡述一下深度學習的入門例子MNIST。深度學習簡單介紹首先要簡單區別幾個概念：人工智能，機器學習，深

redis.conf配置詳細解析

tip soft notify cross following 模板 guarantee use fast # redis 配置文件示例 # 當你需要為某個配置項指定內存大小的時候，必須要帶上單位， # 通常的格式就是 1k 5gb 4m 等醬紫： # # 1k =&

前端【響應式】開發詳細解析

響應式設計針對標簽 ipad rem img ons 微信公眾 dev 一、響應式設計需要解決的問題是什麽？針對目前大家常見的固定布局、自適應布局都是一種反應遲鈍的web設計，當Web頁面需要在各種顯示屏顯示時，他們就顯得力不從心了。因此，我們就需要相應設計。優勢：

vue-cli中的build.js配置文件詳細解析

刪除 .json directory 內置 tostring file 環境配置輸出 pin 轉載自：https://www.cnblogs.com/ye-hcj/p/7096341.html這是vue-cli腳手架工具的生產環境配置入口 package.json中的"b

Resnet50原始碼-tensorflow+keras詳細解析

相關推薦