Caffe source code: the ReLU layer
阿新 · Published 2019-01-11
This post walks through the /src/caffe/layers/relu_layer.cpp file of the Caffe framework, which implements the ReLU activation function.
ReLU has become a very popular activation function in recent years. Compared with sigmoid and tanh it has certain advantages; a comparison of the three can be found at https://zhuanlan.zhihu.com/p/21462488?refer=intelligentunit. Its formula is f(x) = max(0, x); in other words, the activation simply thresholds its input at 0.
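As a quick standalone illustration (plain C++, not Caffe code), the thresholding can be written directly with std::max:

#include <algorithm>
#include <cstdio>
// Minimal sketch of f(x) = max(0, x): negative inputs are clamped to zero.
int main() {
  const float xs[] = {-2.0f, -0.5f, 0.0f, 0.5f, 3.0f};
  for (float x : xs) {
    std::printf("f(%+.1f) = %.1f\n", x, std::max(x, 0.0f));
  }
  return 0;
}

This prints 0 for every non-positive input and passes positive inputs through unchanged.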
Below are the annotations I made while reading the ReLU layer code:
relu_layer.hpp:
#ifndef CAFFE_RELU_LAYER_HPP_
#define CAFFE_RELU_LAYER_HPP_
#include <vector>
#include "caffe/blob.hpp"
#include "caffe/layer.hpp"
#include "caffe/proto/caffe.pb.h"
#include "caffe/layers/neuron_layer.hpp"
namespace caffe {
/**
* @brief Rectified Linear Unit non-linearity @f$ y = \max(0, x) @f$.
* The simple max is fast to compute, and the function does not saturate.
*/
/* ReLU layer class, derived from NeuronLayer */
template <typename Dtype>
class ReLULayer : public NeuronLayer<Dtype> {
public:
/**
* @param param provides ReLUParameter relu_param,
* with ReLULayer options:
* - negative_slope (\b optional, default 0).
* the value @f$ \nu @f$ by which negative values are multiplied.
*/
/* Constructor: the LayerParameter is forwarded explicitly to the NeuronLayer base class; these are the parameters stored in the protobuf/prototxt net definition (an example is given after this header listing) */
explicit ReLULayer(const LayerParameter& param)
: NeuronLayer<Dtype>(param) {}
/* Inline function that returns the type name of this layer */
virtual inline const char* type() const { return "ReLU"; }
protected:
/**
* @param bottom input Blob vector (length 1)
* -# @f$ (N \times C \times H \times W) @f$
* the inputs @f$ x @f$
* @param top output Blob vector (length 1)
* -# @f$ (N \times C \times H \times W) @f$
* the computed outputs @f$
* y = \max(0, x)
* @f$ by default. If a non-zero negative_slope @f$ \nu @f$ is provided,
* the computed outputs are @f$ y = \max(0, x) + \nu \min(0, x) @f$.
*/
// Forward pass, CPU implementation
virtual void Forward_cpu(const vector<Blob<Dtype>*>& bottom,
const vector<Blob<Dtype>*>& top);
// Forward pass, GPU implementation
virtual void Forward_gpu(const vector<Blob<Dtype>*>& bottom,
const vector<Blob<Dtype>*>& top);
/* Note: the forward pass takes bottom as input and writes to top as output */
/**
* @brief Computes the error gradient w.r.t. the ReLU inputs.
*
* @param top output Blob vector (length 1), providing the error gradient with
* respect to the outputs
* -# @f$ (N \times C \times H \times W) @f$
* containing error gradients @f$ \frac{\partial E}{\partial y} @f$
* with respect to computed outputs @f$ y @f$
* @param propagate_down see Layer::Backward.
* @param bottom input Blob vector (length 1)
* -# @f$ (N \times C \times H \times W) @f$
* the inputs @f$ x @f$; Backward fills their diff with
* gradients @f$
* \frac{\partial E}{\partial x} = \left\{
* \begin{array}{lr}
* 0 & \mathrm{if} \; x \le 0 \\
* \frac{\partial E}{\partial y} & \mathrm{if} \; x > 0
* \end{array} \right.
* @f$ if propagate_down[0], by default.
* If a non-zero negative_slope @f$ \nu @f$ is provided,
* the computed gradients are @f$
* \frac{\partial E}{\partial x} = \left\{
* \begin{array}{lr}
* \nu \frac{\partial E}{\partial y} & \mathrm{if} \; x \le 0 \\
* \frac{\partial E}{\partial y} & \mathrm{if} \; x > 0
* \end{array} \right.
* @f$.
*/
// Backward pass, CPU implementation
virtual void Backward_cpu(const vector<Blob<Dtype>*>& top,
const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
// Backward pass, GPU implementation
virtual void Backward_gpu(const vector<Blob<Dtype>*>& top,
const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
/* Note: in the backward pass top is the input (gradients w.r.t. the outputs) and bottom is the output (gradients w.r.t. the inputs); propagate_down is a vector of bools, one per bottom blob, indicating whether the gradient should be propagated back to that bottom */
};
} // namespace caffe
#endif // CAFFE_RELU_LAYER_HPP_
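The negative_slope option documented above comes from the ReLUParameter message in caffe.proto and is set in the net definition. A minimal prototxt sketch (the layer and blob names here are hypothetical) that turns this layer into a Leaky ReLU might look like:

layer {
  name: "relu1"            # hypothetical layer name
  type: "ReLU"
  bottom: "conv1"          # hypothetical input blob
  top: "conv1"             # ReLU is commonly applied in place
  relu_param {
    negative_slope: 0.01   # default is 0, i.e. the plain ReLU
  }
}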
relu_layer.cpp:
#include <algorithm>
#include <vector>
#include "caffe/layers/relu_layer.hpp"
namespace caffe {
/* Forward pass of the ReLU layer */
template <typename Dtype>
void ReLULayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
const vector<Blob<Dtype>*>& top) {
const Dtype* bottom_data = bottom[0]->cpu_data(); // pointer to the input data
Dtype* top_data = top[0]->mutable_cpu_data(); // mutable pointer to the output data
// number of elements in the input blob
const int count = bottom[0]->count();
// negative_slope is the parameter of Leaky ReLU; its default value is 0, which gives the ordinary ReLU.
// Leaky ReLU is an attempt to fix the "dying ReLU" problem.
// In a plain ReLU the output is 0 whenever x < 0; Leaky ReLU instead gives a small slope on the negative side, e.g. 0.01.
// The Leaky ReLU formula is f(x) = max(x, 0) + alpha * min(x, 0), where alpha is the negative_slope parameter in the code below.
Dtype negative_slope = this->layer_param_.relu_param().negative_slope();
for (int i = 0; i < count; ++i) {
top_data[i] = std::max(bottom_data[i], Dtype(0))
+ negative_slope * std::min(bottom_data[i], Dtype(0));
}
}
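// Worked example (annotation added here, not part of the original file): with
// negative_slope = 0.1 and bottom_data = {-2, 0, 3}, the loop above produces
// top_data = {-0.2, 0, 3}: positive inputs pass through unchanged, negative
// inputs are scaled by the slope.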
/* Backward pass of the ReLU layer */
template <typename Dtype>
void ReLULayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
const vector<bool>& propagate_down,
const vector<Blob<Dtype>*>& bottom) {
// propagate_down controls whether the gradient w.r.t. bottom is computed; it plays a central role in Caffe's backpropagation
if (propagate_down[0]) {
// data saved from the forward pass (this layer's input, i.e. the previous layer's output)
const Dtype* bottom_data = bottom[0]->cpu_data();
// gradient flowing back from the next layer (the input of this layer's backward pass)
const Dtype* top_diff = top[0]->cpu_diff();
// gradient to be passed to the previous layer (the output of this layer's backward pass)
Dtype* bottom_diff = bottom[0]->mutable_cpu_diff();
// number of elements involved in the computation
const int count = bottom[0]->count();
// see the explanation of this parameter in Forward_cpu above
Dtype negative_slope = this->layer_param_.relu_param().negative_slope();
// (bottom_data[i] > 0) is the derivative of ReLU: the comparison is a logical test that yields 1 when bottom_data[i] > 0 and 0 otherwise.
// ((bottom_data[i] > 0) + negative_slope * (bottom_data[i] <= 0)) is the derivative of Leaky ReLU.
// By the chain rule, the gradient w.r.t. this layer's input equals the gradient w.r.t. its output multiplied by the derivative of the layer's function.
for (int i = 0; i < count; ++i) {
bottom_diff[i] = top_diff[i] * ((bottom_data[i] > 0)
+ negative_slope * (bottom_data[i] <= 0));
}
}
}
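// Worked example (annotation added here, not part of the original file): with
// negative_slope = 0.1, if bottom_data[i] = -2 and top_diff[i] = 0.5, the factor
// (bottom_data[i] > 0) + 0.1 * (bottom_data[i] <= 0) evaluates to 0.1, so
// bottom_diff[i] = 0.5 * 0.1 = 0.05; if instead bottom_data[i] = 2, the factor is 1
// and the upstream gradient passes through unchanged. With the default
// negative_slope = 0, the gradient is simply blocked wherever the input was non-positive.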
#ifdef CPU_ONLY
STUB_GPU(ReLULayer);
#endif
INSTANTIATE_CLASS(ReLULayer);
} // namespace caffe
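Finally, here is a minimal driver sketch, modeled loosely on the pattern used in Caffe's unit tests, showing how the layer might be exercised directly. It is not part of the original source and assumes the standard Blob/Layer API (SetUp / Forward / Backward) shown above:

#include <vector>
#include "caffe/blob.hpp"
#include "caffe/layers/relu_layer.hpp"
using namespace caffe;

int main() {
  // A small 1x2x3x4 input blob filled with both negative and positive values.
  Blob<float> bottom(1, 2, 3, 4);
  Blob<float> top;
  float* in = bottom.mutable_cpu_data();
  for (int i = 0; i < bottom.count(); ++i) {
    in[i] = static_cast<float>(i) - 10.0f;
  }
  std::vector<Blob<float>*> bottom_vec(1, &bottom);
  std::vector<Blob<float>*> top_vec(1, &top);

  // negative_slope = 0.1 turns the layer into a Leaky ReLU.
  LayerParameter param;
  param.mutable_relu_param()->set_negative_slope(0.1f);
  ReLULayer<float> layer(param);

  layer.SetUp(bottom_vec, top_vec);    // reshapes top to match bottom
  layer.Forward(bottom_vec, top_vec);  // runs Forward_cpu (or _gpu)

  // Seed dE/dy = 1 everywhere and propagate it back to get dE/dx in bottom.diff.
  float* top_diff = top.mutable_cpu_diff();
  for (int i = 0; i < top.count(); ++i) {
    top_diff[i] = 1.0f;
  }
  std::vector<bool> propagate_down(1, true);
  layer.Backward(top_vec, propagate_down, bottom_vec);
  return 0;
}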