
AlexNet Convolutional Neural Network [Forward Pass]


1. Code Implementation

# -*- coding: utf-8 -*-
"""
Created on Wed Nov 14 17:13:05 2018

@author: zhen
"""

from datetime import datetime
import math
import time
import tensorflow as tf

batch_size = 32
num_batchs = 100

def print_activations(t):
    # print the layer name and the shape of its output tensor
    print(t.op.name, " ", t.get_shape().as_list())

def inference(images):
    parameters = []
    # conv1: 11x11 kernel, 64 output channels, stride 4
    with tf.name_scope('conv1') as scope:
        kernel = tf.Variable(tf.truncated_normal([11, 11, 3, 64], dtype=tf.float32, stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(images, kernel, [1, 4, 4, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[64], dtype=tf.float32), trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv1 = tf.nn.relu(bias, name=scope)
        print_activations(conv1)
        parameters += [kernel, biases]

    lrn1 = tf.nn.lrn(conv1, depth_radius=4, bias=1.0, alpha=0.001 / 9, beta=0.75, name='lrn1')
    pool1 = tf.nn.max_pool(lrn1, ksize=[1, 3, 3, 1], strides=[1, 2, 2, 1], padding='VALID', name='pool1')
    print_activations(pool1)

    # conv2: 5x5 kernel, 128 output channels, stride 1
    with tf.name_scope('conv2') as scope:
        kernel = tf.Variable(tf.truncated_normal([5, 5, 64, 128], dtype=tf.float32, stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(pool1, kernel, [1, 1, 1, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[128], dtype=tf.float32), trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv2 = tf.nn.relu(bias, name=scope)
        parameters += [kernel, biases]
        print_activations(conv2)

    lrn2 = tf.nn.lrn(conv2, 4, bias=1.0, alpha=0.001 / 9, beta=0.75, name='lrn2')
    pool2 = tf.nn.max_pool(lrn2, ksize=[1, 3, 3, 1], strides=[1, 2, 2, 1], padding='VALID', name='pool2')
    print_activations(pool2)

    # conv3: 3x3 kernel, 256 output channels
    with tf.name_scope('conv3') as scope:
        kernel = tf.Variable(tf.truncated_normal([3, 3, 128, 256], dtype=tf.float32, stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(pool2, kernel, [1, 1, 1, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[256], dtype=tf.float32), trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv3 = tf.nn.relu(bias, name=scope)
        parameters += [kernel, biases]
        print_activations(conv3)

    # conv4: 3x3 kernel, 128 output channels
    with tf.name_scope('conv4') as scope:
        kernel = tf.Variable(tf.truncated_normal([3, 3, 256, 128], dtype=tf.float32, stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(conv3, kernel, [1, 1, 1, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[128], dtype=tf.float32), trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv4 = tf.nn.relu(bias, name=scope)
        parameters += [kernel, biases]
        print_activations(conv4)

    # conv5: 3x3 kernel, 128 output channels
    with tf.name_scope('conv5') as scope:
        kernel = tf.Variable(tf.truncated_normal([3, 3, 128, 128], dtype=tf.float32, stddev=1e-1), name='weights')
        conv = tf.nn.conv2d(conv4, kernel, [1, 1, 1, 1], padding='SAME')
        biases = tf.Variable(tf.constant(0.0, shape=[128], dtype=tf.float32), trainable=True, name='biases')
        bias = tf.nn.bias_add(conv, biases)
        conv5 = tf.nn.relu(bias, name=scope)
        parameters += [kernel, biases]
        print_activations(conv5)

    pool5 = tf.nn.max_pool(conv5, ksize=[1, 3, 3, 1], strides=[1, 2, 2, 1], padding='VALID', name='pool5')
    print_activations(pool5)

    return pool5, parameters

# measure the computation time per batch for AlexNet
def fit_date(session, target, info_string):
    num_steps_burn_in = 10  # number of warm-up iterations, excluded from the statistics
    total_duration = 0.0
    total_duration_squared = 0.0

    for i in range(num_batchs + num_steps_burn_in):
        start_time = time.time()
        session.run(target)
        duration = time.time() - start_time
        if i >= num_steps_burn_in:
            if not i % 10:
                print('%s:step %d, duration=%.3f' % (datetime.now(), i - num_steps_burn_in, duration))
            total_duration += duration
            total_duration_squared += duration * duration
    mn = total_duration / num_batchs
    vr = total_duration_squared / num_batchs - mn * mn
    sd = math.sqrt(vr)
    print('%s:%s across %d steps,%.3f +/- %.3f sec / batch' % (datetime.now(), info_string, num_batchs, mn, sd))

def fit_benchmark():
    with tf.Graph().as_default():
        image_size = 224
        # random images stand in for real data: only the computation time is measured
        images = tf.Variable(tf.random_normal([batch_size, image_size, image_size, 3], dtype=tf.float32, stddev=1e-1))
        pool5, parameters = inference(images)
        init = tf.global_variables_initializer()
        sess = tf.Session()
        sess.run(init)

        fit_date(sess, pool5, "Forward")
        # use an L2 loss on pool5 as a dummy objective for the backward pass
        objective = tf.nn.l2_loss(pool5)
        grad = tf.gradients(objective, parameters)
        fit_date(sess, grad, "Forward-backward")

fit_benchmark()

2. Results

conv1   [32, 56, 56, 64]
pool1   [32, 27, 27, 64]
conv2   [32, 27, 27, 128]
pool2   [32, 13, 13, 128]
conv3   [32, 13, 13, 256]
conv4   [32, 13, 13, 128]
conv5   [32, 13, 13, 128]
pool5   [32, 6, 6, 128]
2019-01-27 10:51:37.551617:step 0, duration=1.625
2019-01-27 10:51:54.082824:step 10, duration=1.766
2019-01-27 10:52:10.582787:step 20, duration=1.641
2019-01-27 10:52:27.051502:step 30, duration=1.672
2019-01-27 10:52:43.507558:step 40, duration=1.625
2019-01-27 10:52:59.913772:step 50, duration=1.625
2019-01-27 10:53:16.245750:step 60, duration=1.672
2019-01-27 10:53:32.511337:step 70, duration=1.625
2019-01-27 10:53:48.901938:step 80, duration=1.609
2019-01-27 10:54:05.183145:step 90, duration=1.625
2019-01-27 10:54:19.917492:Forward across 100 steps,1.640 +/- 0.031 sec / batch
2019-01-27 10:55:47.146016:step 0, duration=7.719
2019-01-27 10:57:04.602639:step 10, duration=7.766
2019-01-27 10:58:26.594245:step 20, duration=9.842
2019-01-27 11:00:01.957195:step 30, duration=8.391
2019-01-27 11:01:35.103007:step 40, duration=10.073
2019-01-27 11:03:07.656318:step 50, duration=8.988
2019-01-27 11:04:31.844207:step 60, duration=8.590
2019-01-27 11:06:01.173490:step 70, duration=9.422
2019-01-27 11:07:28.737373:step 80, duration=10.635
2019-01-27 11:09:03.830375:step 90, duration=8.653
2019-01-27 11:10:19.836018:Forward-backward across 100 steps,8.804 +/- 0.817 sec / batch
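
The activation shapes printed above follow from TensorFlow's padding rules: with padding='SAME' the spatial output size is ceil(input / stride), and with padding='VALID' it is ceil((input - window + 1) / stride). Below is a minimal sketch that reproduces the spatial sizes in the log (the helper names same_out and valid_out are illustrative, not part of the original script):

import math

def same_out(size, stride):
    # padding='SAME': output = ceil(input / stride)
    return math.ceil(size / stride)

def valid_out(size, window, stride):
    # padding='VALID': output = ceil((input - window + 1) / stride)
    return math.ceil((size - window + 1) / stride)

s = 224
s = same_out(s, 4)       # conv1: 11x11 kernel, stride 4, SAME -> 56
print("conv1", s)
s = valid_out(s, 3, 2)   # pool1: 3x3 window, stride 2, VALID  -> 27
print("pool1", s)
s = same_out(s, 1)       # conv2: stride 1, SAME               -> 27
print("conv2", s)
s = valid_out(s, 3, 2)   # pool2                               -> 13
print("pool2", s)
# conv3, conv4 and conv5 keep the 13x13 size (stride 1, SAME)
s = valid_out(s, 3, 2)   # pool5                               -> 6
print("pool5", s)

Running this prints 56, 27, 27, 13 and 6, matching the conv1/pool1/conv2/pool2/pool5 shapes in the log above.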

3. Analysis

  1. AlexNet won the 2012 ImageNet classification competition (ILSVRC) with a top-5 error rate of 16.4%, using an 8-layer neural network.

  2. AlexNet incorporated several then-new techniques, and was the first to successfully apply tricks such as ReLU, Dropout and LRN in a CNN (a minimal sketch of these tricks is given after this list).

  3. It uses ReLU, which addresses the vanishing gradients that Sigmoid suffers from when the network becomes deep.

  4. During training it applies Dropout, randomly ignoring some neurons to avoid overfitting.

  5. It uses overlapping max pooling. Earlier CNNs commonly used average pooling; max pooling avoids the blurring effect of average pooling.

  6. It proposed the LRN layer, which creates a competition mechanism among local neuron activities: larger responses become relatively larger while neurons with smaller responses are suppressed, improving generalization.

  7. Data augmentation: 224*224 regions are randomly cropped from the 256*256 original images, together with their horizontally flipped mirrors, which is equivalent to increasing the amount of data by a factor of (256-224)^2 * 2 = 2048 (see the crop/flip sketch after this list).

  Note: without data augmentation, relying only on the original amount of data, a CNN with so many parameters would fall into severe overfitting.
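
A minimal TensorFlow 1.x sketch of the tricks from points 2 to 6 above (the dummy input shape and the keep_prob value are illustrative assumptions, not taken from the script in section 1):

import tensorflow as tf

x = tf.random_normal([32, 27, 27, 64])           # dummy activation map (illustrative shape)

relu = tf.nn.relu(x)                             # point 3: ReLU's gradient is 1 for positive inputs,
                                                 # unlike Sigmoid, whose gradient never exceeds 0.25
drop = tf.nn.dropout(relu, keep_prob=0.5)        # point 4: randomly drop units during training (assumed keep_prob)

pool = tf.nn.max_pool(relu, ksize=[1, 3, 3, 1],  # point 5: overlapping max pooling,
                      strides=[1, 2, 2, 1],      # window (3) larger than stride (2), so windows overlap
                      padding='VALID')

lrn = tf.nn.lrn(relu, depth_radius=4, bias=1.0,  # point 6: local response normalization across channels,
                alpha=0.001 / 9, beta=0.75)      # large responses suppress their smaller neighbours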
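For point 7, the quoted factor is (256-224)^2 * 2 = 32 * 32 * 2 = 2048: roughly 32 * 32 crop positions per image, doubled by horizontal mirroring. A hedged sketch of the two operations using TensorFlow 1.x image utilities (the input tensor is a stand-in, not from the original script):

import tensorflow as tf

image = tf.random_uniform([256, 256, 3])          # stand-in for a 256*256 training image
crop = tf.random_crop(image, [224, 224, 3])       # random 224*224 region
crop = tf.image.random_flip_left_right(crop)      # random horizontal mirror
# roughly (256-224)^2 * 2 = 2048 distinct crop/flip combinations per image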
