《TensorFlow實戰》(6): Implementing the AlexNet Network
I. AlexNet Architecture and Key Features
1. AlexNet Architecture
AlexNet has eight layers with trainable weights (pooling and LRN layers are not counted): the first five are convolutional layers and the last three are fully connected layers. The final layer is a 1000-way softmax used for classification. LRN layers follow the 1st and 2nd convolutional layers, max-pooling layers follow the 1st, 2nd, and 5th convolutional layers, and a ReLU activation is applied after each of the eight weighted layers.
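For comparison, the layer stack in the original AlexNet paper (Krizhevsky et al., 2012) is roughly:
- conv1: 96 kernels of 11×11, stride 4, followed by LRN and max pooling
- conv2: 256 kernels of 5×5, followed by LRN and max pooling
- conv3: 384 kernels of 3×3
- conv4: 384 kernels of 3×3
- conv5: 256 kernels of 3×3, followed by max pooling
- fc6, fc7: 4096 units each (with Dropout)
- fc8: 1000-way softmax
The benchmark implementation in Part II keeps this overall layout but uses smaller kernel counts (64, 192, 384, 256, 256) and stops at the last pooling layer; the fully connected layers are omitted.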
2. Key Techniques in AlexNet
- Successfully used ReLU as the activation function of a CNN, and verified for the first time that in deeper networks it outperforms sigmoid, largely avoiding the vanishing-gradient problem that sigmoid suffers from in deep networks.
- Used Dropout during training to randomly ignore a subset of neurons and reduce overfitting; in AlexNet it is applied mainly in the last few fully connected layers.
- Used overlapping max pooling. Max pooling avoids the blurring effect of average pooling, and letting the pooling windows overlap (stride smaller than the window size) enriches the extracted features.
- Introduced the LRN (Local Response Normalization) layer, which creates a competition mechanism among neighboring neuron responses and improves the model's generalization ability.
- Used CUDA to accelerate the training of the deep network.
- Data augmentation. During training, 224×224 regions are randomly cropped from the 256×256 source images and horizontally flipped, which is equivalent to multiplying the amount of data by about 2048 ((256−224)² × 2) and reduces overfitting while improving generalization. At prediction time, the four corners and the center of the image are cropped and also mirrored, giving 10 patches whose predictions are averaged. In addition, PCA is applied to the RGB values of the training images and Gaussian noise with a standard deviation of 0.1 is added along the principal components, further increasing data diversity. (A minimal augmentation sketch follows this list.)
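As an illustration of the training-time augmentation described above, here is a minimal sketch using TF 1.x image ops (the function name preprocess_for_train is just a placeholder, not part of the original code):
def preprocess_for_train(image_256):
    # image_256: a [256, 256, 3] float32 image tensor.
    # Randomly crop a 224x224 region (32*32 positions, times 2 for the flip, ~2048 variants).
    cropped = tf.random_crop(image_256, [224, 224, 3])
    # Randomly mirror the crop horizontally.
    return tf.image.random_flip_left_right(cropped)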
II. TensorFlow Implementation of AlexNet
1. Imports
from datetime import datetime
import math
import time
import tensorflow as tf
batch_size = 32    # batch size
num_batches = 100  # number of batches to time
2. Network Structure
- The helper print_activations() prints the size of the output tensor of every convolutional and pooling layer: t.op.name is the layer's name and t.get_shape().as_list() returns its shape as a Python list (its definition is sketched just below).
- TensorFlow's name_scope is used: with tf.name_scope('conv1') as scope causes the variables and ops created inside the scope to be automatically named conv1/xxx.
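print_activations() is called in the listing below but its definition is not included in this excerpt; a minimal definition that matches the description above is:
def print_activations(t):
    # Print the layer name and the shape of its output tensor.
    print(t.op.name, ' ', t.get_shape().as_list())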
def inference(images):
    parameters = []
    # conv1
    with tf.name_scope('conv1') as scope:
        kernel = tf.Variable(tf.truncated_normal([11, 11, 3, 64], dtype=tf.float32,
                                                  stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(images, kernel, [1, 4, 4, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[64], dtype=tf.float32),
                             trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv1 = tf.nn.relu(bias, name=scope)
        print_activations(conv1)
        parameters += [kernel, biases]

    # pool1
    lrn1 = tf.nn.lrn(conv1, 4, bias=1.0, alpha=0.001 / 9.0, beta=0.75, name='lrn1')
    pool1 = tf.nn.max_pool(lrn1,
                           ksize=[1, 3, 3, 1],
                           strides=[1, 2, 2, 1],
                           padding='VALID',
                           name='pool1')
    print_activations(pool1)

    # conv2
    with tf.name_scope('conv2') as scope:
        kernel = tf.Variable(tf.truncated_normal([5, 5, 64, 192], dtype=tf.float32,
                                                  stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(pool1, kernel, [1, 1, 1, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[192], dtype=tf.float32),
                             trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv2 = tf.nn.relu(bias, name=scope)
        parameters += [kernel, biases]
        print_activations(conv2)

    # pool2
    lrn2 = tf.nn.lrn(conv2, 4, bias=1.0, alpha=0.001 / 9.0, beta=0.75, name='lrn2')
    pool2 = tf.nn.max_pool(lrn2,
                           ksize=[1, 3, 3, 1],
                           strides=[1, 2, 2, 1],
                           padding='VALID',
                           name='pool2')
    print_activations(pool2)

    # conv3
    with tf.name_scope('conv3') as scope:
        kernel = tf.Variable(tf.truncated_normal([3, 3, 192, 384],
                                                 dtype=tf.float32,
                                                 stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(pool2, kernel, [1, 1, 1, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[384], dtype=tf.float32),
                             trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv3 = tf.nn.relu(bias, name=scope)
        parameters += [kernel, biases]
        print_activations(conv3)

    # conv4
    with tf.name_scope('conv4') as scope:
        kernel = tf.Variable(tf.truncated_normal([3, 3, 384, 256],
                                                 dtype=tf.float32,
                                                 stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(conv3, kernel, [1, 1, 1, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[256], dtype=tf.float32),
                             trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv4 = tf.nn.relu(bias, name=scope)
        parameters += [kernel, biases]
        print_activations(conv4)

    # conv5
    with tf.name_scope('conv5') as scope:
        kernel = tf.Variable(tf.truncated_normal([3, 3, 256, 256],
                                                 dtype=tf.float32,
                                                 stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(conv4, kernel, [1, 1, 1, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[256], dtype=tf.float32),
                             trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv5 = tf.nn.relu(bias, name=scope)
        parameters += [kernel, biases]
        print_activations(conv5)

    # pool5
    pool5 = tf.nn.max_pool(conv5,
                           ksize=[1, 3, 3, 1],
                           strides=[1, 2, 2, 1],
                           padding='VALID',
                           name='pool5')
    print_activations(pool5)

    return pool5, parameters
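With batch_size = 32 and 224×224 inputs, the shapes reported by print_activations() work out as follows (SAME padding with stride 4 for conv1, VALID 3×3 pooling with stride 2 elsewhere):
conv1  [32, 56, 56, 64]
pool1  [32, 27, 27, 64]
conv2  [32, 27, 27, 192]
pool2  [32, 13, 13, 192]
conv3  [32, 13, 13, 384]
conv4  [32, 13, 13, 256]
conv5  [32, 13, 13, 256]
pool5  [32, 6, 6, 256]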
3. Benchmarking AlexNet's Computation Time
- The parameter target is the op (tensor) to be evaluated, and info_string is a label for the test that is printed alongside the statistics.
- num_steps_burn_in warm-up iterations are run first and excluded from the statistics, so that one-off startup effects (memory allocation, caching, and so on) do not distort the results. During the timed iterations, the duration of the current step is printed every 10 steps. total_duration accumulates the total time and total_duration_squared the sum of squared durations.
- After the loop, the mean time per batch mn and the standard deviation sd are computed.
def time_tensorflow_run(session, target, info_string):
    """Run the computation to obtain the target tensor and print timing stats.

    Args:
      session: the TensorFlow session to run the computation under.
      target: the target Tensor that is passed to the session's run() function.
      info_string: a string summarizing this run, to be printed with the stats.

    Returns:
      None
    """
    num_steps_burn_in = 10
    total_duration = 0.0
    total_duration_squared = 0.0
    for i in range(num_batches + num_steps_burn_in):
        start_time = time.time()
        _ = session.run(target)
        duration = time.time() - start_time
        if i >= num_steps_burn_in:
            if not i % 10:
                print('%s: step %d, duration = %.3f' %
                      (datetime.now(), i - num_steps_burn_in, duration))
            total_duration += duration
            total_duration_squared += duration * duration
    mn = total_duration / num_batches
    vr = total_duration_squared / num_batches - mn * mn
    sd = math.sqrt(vr)
    print('%s: %s across %d steps, %.3f +/- %.3f sec / batch' %
          (datetime.now(), info_string, num_batches, mn, sd))
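Note that sd is obtained from the shortcut Var(d) = E[d^2] - (E[d])^2, which is why only a running sum and a running sum of squares need to be kept during the loop.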
4. Main Function
- with tf.Graph().as_default(): defines the default graph on which the benchmark ops are built.
- The forward pass is benchmarked by calling time_tensorflow_run directly with pool5, i.e. the output of the last pooling layer, as the target.
- To benchmark the backward pass, a loss must first be defined (here the L2 loss of pool5); tf.gradients then yields the gradients of that loss with respect to all model parameters, and evaluating these gradients is what gets timed.
def run_benchmark():
    """Run the benchmark on AlexNet."""
    with tf.Graph().as_default():
        # Generate some dummy images.
        image_size = 224
        # Note that our padding definition is slightly different from cuda-convnet.
        # In order to force the model to start with the same activations sizes,
        # we add 3 to the image_size and employ VALID padding above.
        images = tf.Variable(tf.random_normal([batch_size,
                                               image_size,
                                               image_size, 3],
                                              dtype=tf.float32,
                                              stddev=1e-1))
        # Build a Graph that computes the logits predictions from the
        # inference model.
        pool5, parameters = inference(images)
        # Build an initialization operation.
        init = tf.global_variables_initializer()
        # Start running operations on the Graph.
        config = tf.ConfigProto()
        config.gpu_options.allocator_type = 'BFC'
        sess = tf.Session(config=config)
        sess.run(init)
        # Run the forward benchmark.
        time_tensorflow_run(sess, pool5, "Forward")
        # Add a simple objective so we can calculate the backward pass.
        objective = tf.nn.l2_loss(pool5)
        # Compute the gradient with respect to all the parameters.
        grad = tf.gradients(objective, parameters)
        # Run the backward benchmark.
        time_tensorflow_run(sess, grad, "Forward-backward")


run_benchmark()
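When the script runs, it prints one timing line roughly every 10 batches (the '%s: step %d, duration = %.3f' format) for the forward pass, then a summary line of the form '<timestamp>: Forward across 100 steps, x.xxx +/- x.xxx sec / batch', and then repeats the same procedure for the combined pass labelled 'Forward-backward'. The actual durations depend entirely on the hardware used.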