Caffe 程式碼解讀之全連線層concat layer

阿新 • • 發佈：2019-02-08

今天，我們看一下caffe的拼接層，即將兩個或多個layer進行拼接。
首先，看一下caffe官方文件。
concat

同其他layer一樣，分為setup、reshape、Forward_cpu、Backward_cpu。

//concat_layer 用來實現兩個或者多個blob的連結，即多輸入一輸出
//支援在num 維度上的連結（concat_dim = 0 : (n1+n2+...+nk)∗c∗h∗w ）
//和channel維度上的連結（concat_dim = 1 : n∗(c1+c2+...+ck)∗h∗w）。

//axis ，dim ：0 為 num 維度連結，1 為 channel 維度連結 

//這裡需要給出axis或concat_dim
template <typename Dtype>
void ConcatLayer<Dtype>::LayerSetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
  const ConcatParameter& concat_param = this->layer_param_.concat_param();
  CHECK(!(concat_param.has_axis() && concat_param.has_concat_dim()))
      << "Either axis or concat_dim should be specified; not both." 
;
}

template <typename Dtype>
void ConcatLayer<Dtype>::Reshape(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
  //獲取axis，確定拼接哪一維度
  const int num_axes = bottom[0]->num_axes();
  const ConcatParameter& concat_param = this->layer_param_.concat_param();
  //以下都在獲取、判斷axis的維度 

  if (concat_param.has_concat_dim()) {
    concat_axis_ = static_cast<int>(concat_param.concat_dim());
    // Don't allow negative indexing for concat_dim, a uint32 -- almost
    // certainly unintended.
    CHECK_GE(concat_axis_, 0) << "casting concat_dim from uint32 to int32 "
        << "produced negative result; concat_dim must satisfy "
        << "0 <= concat_dim < " << kMaxBlobAxes;
    CHECK_LT(concat_axis_, num_axes) << "concat_dim out of range.";
  } else {
    concat_axis_ = bottom[0]->CanonicalAxisIndex(concat_param.axis());
  }
  // Initialize with the first blob.
  //這裡有一點需要解釋，可以看到，bottom型別為 vector<Blob<Dtype>*>，這裡只需要使用bottom[0]
  //給shape賦值就好，其實botom本身就是一個Blob的vector
  //比如：我要將兩個layer拼接，那麼久有bottom[0]以及bottom[1]
  vector<int> top_shape = bottom[0]->shape();
  //concat_axis_ = 0 : num_concats_=num;concat_axis_ = 1 : num_concats_=num x channel;
  num_concats_ = bottom[0]->count(0, concat_axis_);
  //concat_axis_ = 0 : concat_input_size_=channel x height x width;
  //concat_axis_ = 1 : concat_input_size_=height x width;
  concat_input_size_ = bottom[0]->count(concat_axis_ + 1);

  int bottom_count_sum = bottom[0]->count();
  //檢測num_axes拼接的層是否相同，num_axes為維度資訊
  for (int i = 1; i < bottom.size(); ++i) {
    CHECK_EQ(num_axes, bottom[i]->num_axes())
        << "All inputs must have the same #axes.";
    for (int j = 0; j < num_axes; ++j) {
      if (j == concat_axis_) { continue; }
      CHECK_EQ(top_shape[j], bottom[i]->shape(j))
          << "All inputs must have the same shape, except at concat_axis.";
    }
    bottom_count_sum += bottom[i]->count();
    top_shape[concat_axis_] += bottom[i]->shape(concat_axis_);
  }
  top[0]->Reshape(top_shape);
  CHECK_EQ(bottom_count_sum, top[0]->count());
}

1、這裡有一點需要解釋，可以看到，bottom型別為 vector blob，這裡只需要使用bottom[0]給shape賦值就好，其實bottom本身就是一個Blob的vector。
2、CHECK_**，這裡給小白們解釋一下，就是判斷是否相等、小於、大於
這裡寫圖片描述
3、 count，這看到有好多的count函式，這些函式在blob層實現，解釋如下：

inline int count() const { return count_; }

  /**
   * @brief Compute the volume of a slice; i.e., the product of dimensions
   *        among a range of axes.
   *
   * @param start_axis The first axis to include in the slice.
   *
   * @param end_axis The first axis to exclude from the slice.
   */
  inline int count(int start_axis, int end_axis) const {
    CHECK_LE(start_axis, end_axis);
    CHECK_GE(start_axis, 0);
    CHECK_GE(end_axis, 0);
    CHECK_LE(start_axis, num_axes());
    CHECK_LE(end_axis, num_axes());
    int count = 1;
    for (int i = start_axis; i < end_axis; ++i) {
      count *= shape(i);
    }
    return count;
  }
  /**
   * @brief Compute the volume of a slice spanning from a particular first
   *        axis to the final axis.
   *
   * @param start_axis The first axis to include in the slice.
   */
  inline int count(int start_axis) const {
    return count(start_axis, num_axes());
  }

前向傳播就是layer的拼接

template <typename Dtype>
void ConcatLayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
  Dtype* top_data = top[0]->mutable_cpu_data();
  int offset_concat_axis = 0;
  const int top_concat_axis = top[0]->shape(concat_axis_);
  //遍歷所有輸入bottom
  for (int i = 0; i < bottom.size(); ++i) {
    const Dtype* bottom_data = bottom[i]->cpu_data();
    const int bottom_concat_axis = bottom[i]->shape(concat_axis_);
    //把 各個bottom data 拷貝到輸出 top data 的對應位置
    for (int n = 0; n < num_concats_; ++n) {
      //case 0：num x channel x h x w;case 1: channel x h x w
      //case 0：bottom_data + n x num x channel x h x w ;
      //case 1：bottom_data + n x channel x h x w ;
      caffe_copy(bottom_concat_axis * concat_input_size_,
          bottom_data + n * bottom_concat_axis * concat_input_size_,
          top_data + (n * top_concat_axis + offset_concat_axis)
              * concat_input_size_);
    }
    offset_concat_axis += bottom_concat_axis;
  }
}

反向傳播，就是layer層之間diff和data的傳播

//反向傳播就是對每一個bottom的 diff 做和 data 相同的連結
template <typename Dtype>
void ConcatLayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
      const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
  const Dtype* top_diff = top[0]->cpu_diff();
  int offset_concat_axis = 0;
  const int top_concat_axis = top[0]->shape(concat_axis_);
  for (int i = 0; i < bottom.size(); ++i) {
    if (!propagate_down[i]) { continue; }
    Dtype* bottom_diff = bottom[i]->mutable_cpu_diff();
    const int bottom_concat_axis = bottom[i]->shape(concat_axis_);
    for (int n = 0; n < num_concats_; ++n) {
      caffe_copy(bottom_concat_axis * concat_input_size_, top_diff +
          (n * top_concat_axis + offset_concat_axis) * concat_input_size_,
          bottom_diff + n * bottom_concat_axis * concat_input_size_);
    }
    offset_concat_axis += bottom_concat_axis;
  }
}

Caffe 程式碼解讀之全連線層concat layer

今天，我們看一下caffe的拼接層，即將兩個或多個layer進行拼接。首先，看一下caffe官方文件。同其他layer一樣，分為setup、reshape、Forward_cpu、Backward_cpu。 //concat_layer 用

caffe詳解之全連線層

全連線層引數說明全連線層，輸出的是一個一維向量,引數跟卷積層一樣。一般將全連線置於卷積神經網路的後幾層。權重值的初始化採用xavier,偏置初始化為0.layer { name: "ip1" type: "InnerProduct" #全連線層 bottom: "poo

卷積神經網路(CNN)中全連線層(FC layer)的作用

前言一般來說，卷積神經網路會有三種類型的隱藏層——卷積層、池化層、全連線層。卷積層和池化層比較好理解，主要很多教程也會解釋。卷積層(Convolutional layer)主要是用一個取樣器從輸入資料中採集關鍵資料內容；池化層(Pooling lay

caffe之(四)全連線層

在caffe中，網路的結構由prototxt檔案中給出，由一些列的Layer（層）組成，常用的層如：資料載入層、卷積操作層、pooling層、非線性變換層、內積運算層、歸一化層、損失計算層等；本篇主要介紹全連線層該層是對元素進行wise to wise的運算 1. 全連線層

圖文+程式碼分析：caffe中全連線層、Pooling層、Relu層的反向傳播原理和實現

1.全連線層反向傳播設CC為loss 全連線層輸入：(bottom_data) aa 全連線層輸出：(top_data) zz 假設 aa維度K_， zz維度N_，則權值矩陣維度為N_行*K_列，batchsize=M_ 全連線層每個輸出zi=b+∑

Caffe 全連線層

深度學習筆記（6）全連線層的實現：全連線層的每一個結點都與上一層的所有結點相連，用來把前邊提取到的特徵綜合起來。由於其全相連的特性，一般全連線層的引數也是最多的。全連線層的前向計算下圖中連線最密集的2個地方就是全連線層，這很明顯的可以看出全連線層的引數的確很多。在前向計算過程，也就是一個

caffe學習筆記31-理解全連線層

理解全連線層：連線層實際就是卷積核大小為上層特徵大小的卷積運算，卷積後的結果為一個節點，就對應全連線層的一個點。（理解）假設最後一個卷積層的輸出為7×7×512，連線此卷積層的全連線層為1×1×4096。如果將這個全連線層轉化為卷積層：1.共有4096組濾波器2.每組濾

ROIPooling的意義？全連線層輸入需要固定尺度？全連線層的實現？為什麼需要兩個全連線層？

ROIPooling的作用，就是resize到統一尺寸，這樣才能利用預訓練的全連線層引數，大多是7*7大小，這是因為全連結層需要固定的輸入尺寸.那麼為什麼需要固定尺寸呢？全連線層的計算其實相當於輸入的特徵圖資料矩陣和全連線層權值矩陣進行內積以vgg16,512*7*7

caffe Python API 之卷積層（Convolution）

pen project tsp otto weight value stride new constant 1 import sys 2 sys.path.append(‘/projects/caffe-ssd/python‘) 3 import caffe 4

為什麼目標檢測中要將全連線層轉化為卷積層？

參考文章： VGG網路中測試時為什麼全連結層改成卷積層為什麼使用卷積層替代CNN末尾的全連線層首先看一下卷積層的特點：區域性連線：提取資料區域性特徵，比如卷積核的感受野權值共享：一個卷積核只需提取一個特徵，降低了網路訓練的難度究竟使用卷積層代替全連線層會帶來什麼好處呢？

Global Average Pooling 對全連線層的可替代性

reference：https://blog.csdn.net/williamyi96/article/details/77530995 Golbal Average Pooling 第一次出現在論文Network in Network中，後來又很多工作延續使用了GAP

CNN卷積層到全連線層的輸入格式變換錯誤 tf.reshape()和slim.flatten()

TypeError: Failed to convert object of type < type ‘list’>to Tensor. Contents: [None, 9216]. Consider casting elements to a supported type.

為什麼要將全連線層轉化為卷積層

轉自：https://www.cnblogs.com/liuzhan709/p/9356960.html 理解為什麼要將全連線層轉化為卷積層 1.全連線層可以視作一種特殊的卷積考慮下面兩種情況：特徵圖和全連線層相連，AlexNet經過五次池化後得到7*7*512的特徵圖，下

[Object Detection]關於“在預訓練網路中增加捲積和全連線層可以改善效能”

Yolo論文裡提到"Ren et al. show that adding both convolutional and connected layers to pretrained networks can improve performance [28]." [28] S. Ren, K. He, R.

對CNN網路全連線層的一些理解

CNN網路的全連線層一般包含兩個部分：線性運算部分：完成線性變換的工作，將輸入經過線性變換轉換成輸出。非線性運算部分（以下簡稱非線性部分）：緊接著線性部分，完成非線性變換。線性運算部分的作用：線性部分從運算過程上看就是線性變換，對於一個輸入向量，線性部分的輸出向量是，線

【深度學習筆記】關於卷積層、池化層、全連線層簡單的比較

卷積層池化層全連線層功能提取特徵壓縮特徵圖，提取主要特徵將學到的“分散式特徵表示”對映到樣本標記空間操作可看這個的動態圖，可惜是二維的。對於三維資料比如RGB影象（3通道），卷積核的深度必須

keras呼叫自己訓練的模型，並去掉全連線層

其實很簡單 from keras.models import load_model base_model = load_model('model_resenet.h5')#載入指定的模型 print(base_model.summary())#輸出網路的結構圖

Keras —— 基於Vgg16模型（含全連線層）的圖片識別

一、載入並顯示圖片 img_path = 'elephant.jpg' img = image.load_img(img_path, target_size=(224, 224)) plt.ims

tensorflow 新增一個全連線層

對於一個全連線層，tensorflow都為我們封裝好了。使用：tf.layers.dense() 1 tf.layers.dense( 2 inputs, 3 units, 4 activation=None, 5 use_bias=True, 6

CNN中全連線層是什麼樣的？

名稱：全連線。意思就是輸出層的神經元和輸入層的每個神經元都連線。例子： AlexNet 網路中第一個全連線層是這樣的： layer { name: "fc6" type: "InnerProduct" bottom: "pool5" top:"fc6"

Caffe 程式碼解讀之全連線層concat layer

相關推薦