AlexNet Learning Summary
Published: 2018-11-16
I recently started working with AlexNet and have learned a lot, so here is a small summary.
I won't repeat the basics here; they are explained in detail in another post of mine.
This post covers only the practical part.
All of the code comes from https://kratzert.github.io/2017/02/24/finetuning-alexnet-with-tensorflow.html
If you're interested, read that directly; it is much better than anything I could write.
First, part one:
Here we define an __init__ method; a quick look at the code shows that it simply parses the input arguments
and then calls a function named create(). The implementation of create() is not shown here, but it appears later, so keep an eye out for it.
load_initial_weights(self) assigns the pretrained weights to the variables we create; its implementation also comes later (to keep things readable, I don't put too much code in any one block).
import tensorflow as tf
import numpy as np


class AlexNet(object):

    def __init__(self, x, keep_prob, num_classes, skip_layer,
                 weights_path='DEFAULT'):
        """
        Inputs:
        - x: tf.placeholder, for the input images
        - keep_prob: tf.placeholder, for the dropout rate
        - num_classes: int, number of classes of the new dataset
        - skip_layer: list of strings, names of the layers you want to reinitialize
        - weights_path: path string, path to the pretrained weights
          (if bvlc_alexnet.npy is not in the same folder)
        """
        # Parse input arguments
        self.X = x
        self.NUM_CLASSES = num_classes
        self.KEEP_PROB = keep_prob
        self.SKIP_LAYER = skip_layer

        if weights_path == 'DEFAULT':
            self.WEIGHTS_PATH = 'bvlc_alexnet.npy'
        else:
            self.WEIGHTS_PATH = weights_path

        # Call the create function to build the computational graph of AlexNet
        self.create()

    def create(self):
        pass

    def load_initial_weights(self):
        pass
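For reference, here is a minimal sketch of how the class might be instantiated once create() is filled in below. The placeholder shapes, the batch size of 128, and the choice of skip_layer are my own illustrative assumptions, not part of the original post:

# TF 1.x usage sketch (assumes the imports above)
x = tf.placeholder(tf.float32, [128, 227, 227, 3])   # AlexNet expects 227x227x3 inputs
keep_prob = tf.placeholder(tf.float32)

# Reinitialize (and later retrain) only the last two layers -- an illustrative choice
model = AlexNet(x, keep_prob, num_classes=2, skip_layer=['fc7', 'fc8'])
score = model.fc8   # unscaled logits of the new classifier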
With that we have the basic class structure.
Next, we'll define a few helper functions to build the various layers.
def conv(x, filter_height, filter_width, num_filters, stride_y, stride_x,
         name, padding='SAME', groups=1):
    # Get number of input channels
    input_channels = int(x.get_shape()[-1])

    # Create lambda function for the convolution
    convolve = lambda i, k: tf.nn.conv2d(i, k,
                                         strides=[1, stride_y, stride_x, 1],
                                         padding=padding)

    with tf.variable_scope(name) as scope:
        # Create tf variables for the weights and biases of the conv layer
        weights = tf.get_variable('weights',
                                  shape=[filter_height, filter_width,
                                         input_channels // groups, num_filters])
        biases = tf.get_variable('biases', shape=[num_filters])

        if groups == 1:
            conv = convolve(x, weights)

        # In the case of multiple groups, split inputs & weights and
        # convolve them separately
        else:
            input_groups = tf.split(axis=3, num_or_size_splits=groups, value=x)
            weight_groups = tf.split(axis=3, num_or_size_splits=groups, value=weights)
            output_groups = [convolve(i, k) for i, k in zip(input_groups, weight_groups)]

            # Concat the convolved output together again
            conv = tf.concat(axis=3, values=output_groups)

        # Add biases
        bias = tf.reshape(tf.nn.bias_add(conv, biases), conv.get_shape().as_list())

        # Apply relu function
        relu = tf.nn.relu(bias, name=scope.name)

        return relu
Note the clever use of the lambda function here: it wraps tf.nn.conv2d so that the same convolution can be applied either to the full input or to each group separately.
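To make the groups logic concrete, here is a small sketch of my own (not from the original post) showing what happens for conv2, where the 96 input channels and 256 filters are split into two groups. The placeholder shape and the 'conv2_demo' name are illustrative, and a fixed batch size is used because conv() reshapes with static shapes:

# Illustrative TF 1.x check of the grouped convolution (assumes conv() above is defined)
x_demo = tf.placeholder(tf.float32, [128, 27, 27, 96])      # hypothetical pool1-sized input
conv2_demo = conv(x_demo, 5, 5, 256, 1, 1, groups=2, name='conv2_demo')

# Internally: x_demo is split into 2 x [128, 27, 27, 48], the [5, 5, 48, 256] weights
# into 2 x [5, 5, 48, 128]; each half is convolved and the results concatenated.
print(conv2_demo.get_shape())   # -> (128, 27, 27, 256)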
Next is the fully connected layer, which is much simpler to define than the convolutional layer:
def fc(x, num_in, num_out, name, relu=True):
    with tf.variable_scope(name) as scope:
        # Create tf variables for the weights and biases
        weights = tf.get_variable('weights', shape=[num_in, num_out], trainable=True)
        biases = tf.get_variable('biases', [num_out], trainable=True)

        # Matrix multiply weights and inputs and add bias
        act = tf.nn.xw_plus_b(x, weights, biases, name=scope.name)

        if relu:
            # Apply ReLU non-linearity
            relu = tf.nn.relu(act)
            return relu
        else:
            return act
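As a side note, tf.nn.xw_plus_b(x, weights, biases) is just a matrix multiplication followed by a bias add, i.e. tf.matmul(x, weights) + biases. An illustrative call (the names and sizes below are my own, following the fc6/fc8 pattern used later) would look like this:

# Hypothetical usage of fc() (TF 1.x, fixed batch size)
flattened = tf.placeholder(tf.float32, [128, 6*6*256])
fc_demo = fc(flattened, 6*6*256, 4096, name='fc6_demo')           # ReLU applied
logits = fc(fc_demo, 4096, 10, relu=False, name='fc_out_demo')    # raw scores, no ReLU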
The remaining helpers are max-pooling, local response normalization, and dropout; they are thin wrappers and should be self-explanatory.
def max_pool(x, filter_height, filter_width, stride_y, stride_x,
             name, padding='SAME'):
    return tf.nn.max_pool(x, ksize=[1, filter_height, filter_width, 1],
                          strides=[1, stride_y, stride_x, 1],
                          padding=padding, name=name)


def lrn(x, radius, alpha, beta, name, bias=1.0):
    return tf.nn.local_response_normalization(x, depth_radius=radius,
                                              alpha=alpha, beta=beta,
                                              bias=bias, name=name)


def dropout(x, keep_prob):
    return tf.nn.dropout(x, keep_prob)
Next come the implementations of the create() and load_initial_weights() functions.
def create(self):
    # 1st Layer: Conv (w ReLU) -> Lrn -> Pool
    conv1 = conv(self.X, 11, 11, 96, 4, 4, padding='VALID', name='conv1')
    norm1 = lrn(conv1, 2, 1e-05, 0.75, name='norm1')
    pool1 = max_pool(norm1, 3, 3, 2, 2, padding='VALID', name='pool1')

    # 2nd Layer: Conv (w ReLU) -> Lrn -> Pool, with 2 groups
    conv2 = conv(pool1, 5, 5, 256, 1, 1, groups=2, name='conv2')
    norm2 = lrn(conv2, 2, 1e-05, 0.75, name='norm2')
    pool2 = max_pool(norm2, 3, 3, 2, 2, padding='VALID', name='pool2')

    # 3rd Layer: Conv (w ReLU)
    conv3 = conv(pool2, 3, 3, 384, 1, 1, name='conv3')

    # 4th Layer: Conv (w ReLU) split into two groups
    conv4 = conv(conv3, 3, 3, 384, 1, 1, groups=2, name='conv4')

    # 5th Layer: Conv (w ReLU) -> Pool, split into two groups
    conv5 = conv(conv4, 3, 3, 256, 1, 1, groups=2, name='conv5')
    pool5 = max_pool(conv5, 3, 3, 2, 2, padding='VALID', name='pool5')

    # 6th Layer: Flatten -> FC (w ReLU) -> Dropout
    flattened = tf.reshape(pool5, [-1, 6*6*256])
    fc6 = fc(flattened, 6*6*256, 4096, name='fc6')
    dropout6 = dropout(fc6, self.KEEP_PROB)

    # 7th Layer: FC (w ReLU) -> Dropout
    fc7 = fc(dropout6, 4096, 4096, name='fc7')
    dropout7 = dropout(fc7, self.KEEP_PROB)

    # 8th Layer: FC and return unscaled activations
    # (for tf.nn.softmax_cross_entropy_with_logits)
    self.fc8 = fc(dropout7, 4096, self.NUM_CLASSES, relu=False, name='fc8')
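To see where the 6*6*256 in the flatten step comes from, here is the shape bookkeeping for the standard 227x227x3 AlexNet input, using output_size = (input_size - filter_size) / stride + 1 for the VALID layers (a different input size would change these numbers):

conv1 (11x11, stride 4, VALID): (227 - 11)/4 + 1 = 55, giving 55x55x96
pool1 (3x3, stride 2, VALID): (55 - 3)/2 + 1 = 27, giving 27x27x96
conv2 (5x5, stride 1, SAME): 27x27x256; pool2: (27 - 3)/2 + 1 = 13, giving 13x13x256
conv3 and conv4: 13x13x384; conv5: 13x13x256
pool5: (13 - 3)/2 + 1 = 6, giving 6x6x256, i.e. 6*6*256 = 9216 values per image after flattening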
def load_initial_weights(self, session):
    # Load the weights into memory
    weights_dict = np.load(self.WEIGHTS_PATH, encoding='bytes').item()

    # Loop over all layer names stored in the weights dict
    for op_name in weights_dict:

        # Check if the layer is one of the layers that should be reinitialized
        if op_name not in self.SKIP_LAYER:

            with tf.variable_scope(op_name, reuse=True):

                # Loop over list of weights/biases and assign them to their
                # corresponding tf variable
                for data in weights_dict[op_name]:

                    # Biases
                    if len(data.shape) == 1:
                        var = tf.get_variable('biases', trainable=False)
                        session.run(var.assign(data))

                    # Weights
                    else:
                        var = tf.get_variable('weights', trainable=False)
                        session.run(var.assign(data))
With these two functions implemented, our AlexNet is complete.
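Finally, a minimal end-to-end sketch of loading the pretrained weights, assuming bvlc_alexnet.npy has been downloaded and the graph has been built as shown earlier (the variable names here are illustrative):

# model = AlexNet(x, keep_prob, num_classes, skip_layer=['fc8'])  # graph built as above
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())

    # Copy the pretrained weights into every layer except those listed in skip_layer
    model.load_initial_weights(sess)

    # From here you would define a loss on model.fc8 and train only the reinitialized layers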
Practice steps and a simple dataset are available at the link below (1 C-coin download, if you'd like to show some support, haha):
https://download.csdn.net/download/pierce_kk/10755256