A walkthrough of Caffe's Data_Layer code

1. Caffe provides several data input layers, one per input method. I first worked with feeding data from an LMDB database, which corresponds to the DataLayer; another commonly used one is the ImageDataLayer, which takes image paths directly, with no conversion to a database required.

(Figure: inheritance relationship among the data layer classes)

    The figure above shows the inheritance hierarchy of the input layers: DataLayer inherits from BasePrefetchingDataLayer, which inherits from BaseDataLayer, which in turn inherits from Layer.
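    Of these, BasePrefetchingDataLayer (together with InternalThread) is what gives both DataLayer and ImageDataLayer their asynchronous behaviour: a background thread keeps preparing batches while the forward pass consumes them. The following is only a toy, Caffe-free sketch of that producer/consumer idea (the class and method names here are made up for illustration), not the actual implementation:

#include <condition_variable>
#include <iostream>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>
#include <vector>

// Toy illustration (NOT Caffe code) of the pattern BasePrefetchingDataLayer
// implements via InternalThread: a worker thread keeps filling a queue of
// ready batches while the consumer (the forward pass) pops them.
class PrefetchingSource {
 public:
  PrefetchingSource() : stop_(false), worker_(&PrefetchingSource::Run, this) {}

  ~PrefetchingSource() {
    { std::lock_guard<std::mutex> lk(m_); stop_ = true; }
    cv_.notify_all();
    worker_.join();
  }

  // Analogous to Forward popping a prefetched Batch from the "full" queue.
  std::vector<int> NextBatch() {
    std::unique_lock<std::mutex> lk(m_);
    cv_.wait(lk, [this] { return !full_.empty(); });
    std::vector<int> batch = std::move(full_.front());
    full_.pop();
    cv_.notify_all();                           // wake the producer if it was throttled
    return batch;
  }

 private:
  // Analogous to load_batch() running on the internal thread.
  void Run() {
    int next = 0;
    while (true) {
      std::vector<int> batch(4);
      for (int& v : batch) v = next++;          // "read" four items from the data source
      std::unique_lock<std::mutex> lk(m_);
      cv_.wait(lk, [this] { return stop_ || full_.size() < 3; });
      if (stop_) return;
      full_.push(std::move(batch));
      cv_.notify_all();
    }
  }

  std::mutex m_;
  std::condition_variable cv_;
  std::queue<std::vector<int> > full_;          // queue of ready batches
  bool stop_;
  std::thread worker_;
};

int main() {
  PrefetchingSource source;
  for (int i = 0; i < 3; ++i) {
    std::vector<int> batch = source.NextBatch();
    std::cout << "batch " << i << " starts at " << batch.front() << std::endl;
  }
  return 0;
}

    In real Caffe the queue holds preallocated Batch objects and the producer calls the virtual load_batch() shown further below.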

   data_layer.hpp: the layer declared here really does only one thing: after reshaping, it reads data from the LMDB database, hands the data pointers to the top blobs, and also outputs labels if they are present.

   image_data_layer.cpp: this layer likewise does only one thing: after reshaping, it reads images from a file list, hands the data pointers to the top blobs, and also outputs labels if they are present.

If you only want to use these layers, see my other blog post http://blog.csdn.net/ming5432ming/article/details/78458916 ; there is no need to read further.
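     For quick reference, minimal prototxt definitions for the two layers might look roughly like this (the database path, list file, sizes and batch sizes are placeholders; for ImageData, source is a plain text file with one "image_path label" pair per line):

layer {
  name: "data"
  type: "Data"                       # handled by DataLayer
  top: "data"
  top: "label"
  data_param {
    source: "examples/my_train_lmdb" # placeholder LMDB path
    batch_size: 64
    backend: LMDB
  }
}

layer {
  name: "data"
  type: "ImageData"                  # handled by ImageDataLayer
  top: "data"
  top: "label"
  image_data_param {
    source: "data/train_list.txt"    # placeholder: one "image_path label" per line
    root_folder: "data/images/"
    new_height: 256                  # 0/0 keeps the original image size
    new_width: 256
    batch_size: 32
    shuffle: true
  }
}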

     If you would like to see how they are implemented, read on.

2. Below is data_layer.hpp

      The conclusion first: this layer really does only one thing: after reshaping, it reads data from the LMDB database, hands the data pointers to the top blobs, and also outputs labels if they are present.

#ifndef CAFFE_DATA_LAYER_HPP_
#define CAFFE_DATA_LAYER_HPP_

#include <vector>

#include "caffe/blob.hpp"
#include "caffe/data_transformer.hpp"
#include "caffe/internal_thread.hpp"
#include "caffe/layer.hpp"
#include "caffe/layers/base_data_layer.hpp"
#include "caffe/proto/caffe.pb.h"
#include "caffe/util/db.hpp"

namespace caffe {

template <typename Dtype>
class DataLayer : public BasePrefetchingDataLayer<Dtype> {
 public:
  explicit DataLayer(const LayerParameter& param);
  virtual ~DataLayer();
  virtual void DataLayerSetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);
  virtual inline const char* type() const { return "Data"; }     // The layer type is "Data", matching the type field in the prototxt.
  virtual inline int ExactNumBottomBlobs() const { return 0; }     // A data layer takes no bottom blobs.
  virtual inline int MinTopBlobs() const { return 1; }
  virtual inline int MaxTopBlobs() const { return 2; }

 protected:
  void Next();
  bool Skip();
  virtual void load_batch(Batch<Dtype>* batch);

  shared_ptr<db::DB> db_;             // Handle to the database (LMDB or LevelDB).
  shared_ptr<db::Cursor> cursor_;
  uint64_t offset_;
};

}  // namespace caffe

#endif  // CAFFE_DATA_LAYER_HPP_

Below is data_layer.cpp

#ifdef USE_OPENCV
#include <opencv2/core/core.hpp>
#endif  // USE_OPENCV
#include <stdint.h>

#include <vector>

#include "caffe/data_transformer.hpp"
#include "caffe/layers/data_layer.hpp"
#include "caffe/util/benchmark.hpp"

namespace caffe {

template <typename Dtype>
DataLayer<Dtype>::DataLayer(const LayerParameter& param)  // constructor: open the database
  : BasePrefetchingDataLayer<Dtype>(param),
    offset_() {
  db_.reset(db::GetDB(param.data_param().backend()));
  db_->Open(param.data_param().source(), db::READ);
  cursor_.reset(db_->NewCursor());
}

template <typename Dtype>
DataLayer<Dtype>::~DataLayer() {
  this->StopInternalThread();
}

template <typename Dtype>
void DataLayer<Dtype>::DataLayerSetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
  const int batch_size = this->layer_param_.data_param().batch_size();  // read the batch size from the layer parameters
  // Read a data point, and use it to initialize the top blob.
  Datum datum;
  datum.ParseFromString(cursor_->value());               // read one record into datum; Datum is a message defined in caffe.proto holding channels, height, width, the raw data and an optional label

  // Use data_transformer to infer the expected blob shape from datum.
  vector<int> top_shape = this->data_transformer_->InferBlobShape(datum);   // infer the top blob shape from the datum just read
  this->transformed_data_.Reshape(top_shape);
  // Reshape top[0] and prefetch_data according to the batch_size.
  top_shape[0] = batch_size;         // the datum carries no batch dimension, so set it here
  top[0]->Reshape(top_shape);
  for (int i = 0; i < this->prefetch_.size(); ++i) {
    this->prefetch_[i]->data_.Reshape(top_shape);    // reshape the data blob of every prefetch buffer
  }
  LOG_IF(INFO, Caffe::root_solver())
      << "output data size: " << top[0]->num() << ","
      << top[0]->channels() << "," << top[0]->height() << ","
      << top[0]->width();
  // label
  if (this->output_labels_) {
    vector<int> label_shape(1, batch_size);    // if labels are output, shape them as well
    top[1]->Reshape(label_shape);
    for (int i = 0; i < this->prefetch_.size(); ++i) {
      this->prefetch_[i]->label_.Reshape(label_shape);
    }
  }
}

template <typename Dtype>
bool DataLayer<Dtype>::Skip() {
  int size = Caffe::solver_count();
  int rank = Caffe::solver_rank();
  bool keep = (offset_ % size) == rank ||
              // In test mode, only rank 0 runs, so avoid skipping
              this->layer_param_.phase() == TEST;
  return !keep;
}

template<typename Dtype>
void DataLayer<Dtype>::Next() {
  cursor_->Next();
  if (!cursor_->valid()) {         // if the cursor reaches the end of the database, restart from the beginning
    LOG_IF(INFO, Caffe::root_solver())
        << "Restarting data prefetching from start.";
    cursor_->SeekToFirst();            
  }
  offset_++;
}

// This function is called on prefetch thread
template<typename Dtype>
void DataLayer<Dtype>::load_batch(Batch<Dtype>* batch) {      // runs on the prefetch thread
  CPUTimer batch_timer;
  batch_timer.Start();
  double read_time = 0;
  double trans_time = 0;
  CPUTimer timer;
  CHECK(batch->data_.count());
  CHECK(this->transformed_data_.count());
  const int batch_size = this->layer_param_.data_param().batch_size();  

  Datum datum;
  for (int item_id = 0; item_id < batch_size; ++item_id) {
    timer.Start();
    while (Skip()) {
      Next();
    }
    datum.ParseFromString(cursor_->value());      // read one record
    read_time += timer.MicroSeconds();

    if (item_id == 0) {
      // Reshape according to the first datum of each batch
      // on single input batches allows for inputs of varying dimension.
      // Use data_transformer to infer the expected blob shape from datum.
      vector<int> top_shape = this->data_transformer_->InferBlobShape(datum);
      this->transformed_data_.Reshape(top_shape);
      // Reshape batch according to the batch_size.
      top_shape[0] = batch_size;
      batch->data_.Reshape(top_shape);      // reshape the batch's data blob
    }

    // Apply data transformations (mirror, scale, crop...)
    timer.Start();
    int offset = batch->data_.offset(item_id);      // offset of this item within the batch
    Dtype* top_data = batch->data_.mutable_cpu_data();    // CPU pointer to the batch's data blob
    this->transformed_data_.set_cpu_data(top_data + offset);    // point transformed_data_ at this item's slot
    this->data_transformer_->Transform(datum, &(this->transformed_data_));   // apply the configured transformations and write the result into the batch
    // Copy label.
    if (this->output_labels_) {     // copy the label
      Dtype* top_label = batch->label_.mutable_cpu_data();
      top_label[item_id] = datum.label();
    }
    trans_time += timer.MicroSeconds();
    Next();
  }
  timer.Stop();
  batch_timer.Stop();
  DLOG(INFO) << "Prefetch batch: " << batch_timer.MilliSeconds() << " ms.";
  DLOG(INFO) << "     Read time: " << read_time / 1000 << " ms.";
  DLOG(INFO) << "Transform time: " << trans_time / 1000 << " ms.";
}

INSTANTIATE_CLASS(DataLayer);
REGISTER_LAYER_CLASS(Data);

}  // namespace caffe

Summary: this layer really does only one thing: after reshaping, it reads records from the database and hands the resulting data pointers (and labels) to the top blobs.
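As a standalone illustration of the db:: helpers the constructor and load_batch rely on, here is a minimal sketch (the database path is a placeholder and a Caffe build with LMDB support is assumed) that opens an LMDB and deserializes its first record into a Datum, just as DataLayerSetUp does before training starts:

#include <iostream>
#include <string>

#include <boost/scoped_ptr.hpp>

#include "caffe/proto/caffe.pb.h"
#include "caffe/util/db.hpp"

int main() {
  const std::string source = "examples/my_train_lmdb";    // placeholder path
  boost::scoped_ptr<caffe::db::DB> db(caffe::db::GetDB("lmdb"));
  db->Open(source, caffe::db::READ);                      // read-only, as in the constructor
  boost::scoped_ptr<caffe::db::Cursor> cursor(db->NewCursor());
  caffe::Datum datum;
  datum.ParseFromString(cursor->value());                 // deserialize one record
  std::cout << "channels=" << datum.channels()
            << " height="  << datum.height()
            << " width="   << datum.width()
            << " label="   << datum.label() << std::endl;
  return 0;
}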






3. Having come this far, a few words about the ImageDataLayer as well.
Below is image_data_layer.hpp
 
#ifndef CAFFE_IMAGE_DATA_LAYER_HPP_
#define CAFFE_IMAGE_DATA_LAYER_HPP_

#include <string>
#include <utility>
#include <vector>

#include "caffe/blob.hpp"
#include "caffe/data_transformer.hpp"
#include "caffe/internal_thread.hpp"
#include "caffe/layer.hpp"
#include "caffe/layers/base_data_layer.hpp"
#include "caffe/proto/caffe.pb.h"

namespace caffe {

/**
 * @brief Provides data to the Net from image files.
 *
 * TODO(dox): thorough documentation for Forward and proto params.
 */
template <typename Dtype>
class ImageDataLayer : public BasePrefetchingDataLayer<Dtype> {
 public:
  explicit ImageDataLayer(const LayerParameter& param)
      : BasePrefetchingDataLayer<Dtype>(param) {}
  virtual ~ImageDataLayer();
  virtual void DataLayerSetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);

  virtual inline const char* type() const { return "ImageData"; }   // this is the type used in the prototxt
  virtual inline int ExactNumBottomBlobs() const { return 0; }
  virtual inline int ExactNumTopBlobs() const { return 2; }

 protected:
  shared_ptr<Caffe::RNG> prefetch_rng_;
  virtual void ShuffleImages();
  virtual void load_batch(Batch<Dtype>* batch);

  vector<std::pair<std::string, int> > lines_;
  int lines_id_;
};


}  // namespace caffe

#endif  // CAFFE_IMAGE_DATA_LAYER_HPP_



Below is image_data_layer.cpp
#ifdef USE_OPENCV
#include <opencv2/core/core.hpp>

#include <fstream>  // NOLINT(readability/streams)
#include <iostream>  // NOLINT(readability/streams)
#include <string>
#include <utility>
#include <vector>

#include "caffe/data_transformer.hpp"
#include "caffe/layers/base_data_layer.hpp"
#include "caffe/layers/image_data_layer.hpp"
#include "caffe/util/benchmark.hpp"
#include "caffe/util/io.hpp"
#include "caffe/util/math_functions.hpp"
#include "caffe/util/rng.hpp"

namespace caffe {

template <typename Dtype>
ImageDataLayer<Dtype>::~ImageDataLayer<Dtype>() {
  this->StopInternalThread();
}

template <typename Dtype>
void ImageDataLayer<Dtype>::DataLayerSetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
  const int new_height = this->layer_param_.image_data_param().new_height();   // read parameters from the network prototxt
  const int new_width  = this->layer_param_.image_data_param().new_width();
  const bool is_color  = this->layer_param_.image_data_param().is_color();
  string root_folder = this->layer_param_.image_data_param().root_folder();

  CHECK((new_height == 0 && new_width == 0) ||
      (new_height > 0 && new_width > 0)) << "Current implementation requires "
      "new_height and new_width to be set at the same time.";
  // Read the file with filenames and labels
  const string& source = this->layer_param_.image_data_param().source();
  LOG(INFO) << "Opening file " << source;
  std::ifstream infile(source.c_str());               // open the image list file named in the prototxt
  string line;
  size_t pos;
  int label;
  while (std::getline(infile, line)) {
    pos = line.find_last_of(' ');                        // image path and label are separated by a space
    label = atoi(line.substr(pos + 1).c_str());
    lines_.push_back(std::make_pair(line.substr(0, pos), label));  // store this line's image path and label
  }

  CHECK(!lines_.empty()) << "File is empty";

  if (this->layer_param_.image_data_param().shuffle()) {
    // randomly shuffle data
    LOG(INFO) << "Shuffling data";
    const unsigned int prefetch_rng_seed = caffe_rng_rand();
    prefetch_rng_.reset(new Caffe::RNG(prefetch_rng_seed));
    ShuffleImages();
  } else {
    if (this->phase_ == TRAIN && Caffe::solver_rank() > 0 &&
        this->layer_param_.image_data_param().rand_skip() == 0) {
      LOG(WARNING) << "Shuffling or skipping recommended for multi-GPU";
    }
  }
  LOG(INFO) << "A total of " << lines_.size() << " images.";

  lines_id_ = 0;
  // Check if we would need to randomly skip a few data points
  if (this->layer_param_.image_data_param().rand_skip()) {
    unsigned int skip = caffe_rng_rand() %
        this->layer_param_.image_data_param().rand_skip();
    LOG(INFO) << "Skipping first " << skip << " data points.";
    CHECK_GT(lines_.size(), skip) << "Not enough points to skip";
    lines_id_ = skip;
  }
  // Read an image, and use it to initialize the top blob.
  cv::Mat cv_img = ReadImageToCVMat(root_folder + lines_[lines_id_].first,    // read one image to initialize the blob shapes
                                    new_height, new_width, is_color);
  CHECK(cv_img.data) << "Could not load " << lines_[lines_id_].first;
  // Use data_transformer to infer the expected blob shape from a cv_image.
  vector<int> top_shape = this->data_transformer_->InferBlobShape(cv_img);
  this->transformed_data_.Reshape(top_shape);
  // Reshape prefetch_data and top[0] according to the batch_size.
  const int batch_size = this->layer_param_.image_data_param().batch_size();
  CHECK_GT(batch_size, 0) << "Positive batch size required";
  top_shape[0] = batch_size;
  for (int i = 0; i < this->prefetch_.size(); ++i) {
    this->prefetch_[i]->data_.Reshape(top_shape);                   // reshape the data blob of each prefetch buffer
  }
  top[0]->Reshape(top_shape);

  LOG(INFO) << "output data size: " << top[0]->num() << ","
      << top[0]->channels() << "," << top[0]->height() << ","
      << top[0]->width();
  // label
  vector<int> label_shape(1, batch_size);            // reshape the label blobs
  top[1]->Reshape(label_shape);
  for (int i = 0; i < this->prefetch_.size(); ++i) {
    this->prefetch_[i]->label_.Reshape(label_shape);
  }
}

template <typename Dtype>
void ImageDataLayer<Dtype>::ShuffleImages() {
  caffe::rng_t* prefetch_rng =
      static_cast<caffe::rng_t*>(prefetch_rng_->generator());
  shuffle(lines_.begin(), lines_.end(), prefetch_rng);
}

// This function is called on prefetch thread
template <typename Dtype>
void ImageDataLayer<Dtype>::load_batch(Batch<Dtype>* batch) {
  CPUTimer batch_timer;
  batch_timer.Start();
  double read_time = 0;
  double trans_time = 0;
  CPUTimer timer;
  CHECK(batch->data_.count());
  CHECK(this->transformed_data_.count());
  ImageDataParameter image_data_param = this->layer_param_.image_data_param();
  const int batch_size = image_data_param.batch_size();
  const int new_height = image_data_param.new_height();
  const int new_width = image_data_param.new_width();
  const bool is_color = image_data_param.is_color();
  string root_folder = image_data_param.root_folder();

  // Reshape according to the first image of each batch
  // on single input batches allows for inputs of varying dimension.
  cv::Mat cv_img = ReadImageToCVMat(root_folder + lines_[lines_id_].first,
      new_height, new_width, is_color);
  CHECK(cv_img.data) << "Could not load " << lines_[lines_id_].first;
  // Use data_transformer to infer the expected blob shape from a cv_img.
  vector<int> top_shape = this->data_transformer_->InferBlobShape(cv_img);
  this->transformed_data_.Reshape(top_shape);
  // Reshape batch according to the batch_size.
  top_shape[0] = batch_size;
  batch->data_.Reshape(top_shape);

  Dtype* prefetch_data = batch->data_.mutable_cpu_data();
  Dtype* prefetch_label = batch->label_.mutable_cpu_data();

  // datum scales
  const int lines_size = lines_.size();
  for (int item_id = 0; item_id < batch_size; ++item_id) {
    // get a blob
    timer.Start();
    CHECK_GT(lines_size, lines_id_);
    cv::Mat cv_img = ReadImageToCVMat(root_folder + lines_[lines_id_].first,
        new_height, new_width, is_color);
    CHECK(cv_img.data) << "Could not load " << lines_[lines_id_].first;
    read_time += timer.MicroSeconds();
    timer.Start();
    // Apply transformations (mirror, crop...) to the image
    int offset = batch->data_.offset(item_id);
    this->transformed_data_.set_cpu_data(prefetch_data + offset);
    this->data_transformer_->Transform(cv_img, &(this->transformed_data_));
    trans_time += timer.MicroSeconds();

    prefetch_label[item_id] = lines_[lines_id_].second;
    // go to the next iter
    lines_id_++;
    if (lines_id_ >= lines_size) {
      // We have reached the end. Restart from the first.
      DLOG(INFO) << "Restarting data prefetching from start.";
      lines_id_ = 0;
      if (this->layer_param_.image_data_param().shuffle()) {
        ShuffleImages();
      }
    }
  }
  batch_timer.Stop();
  DLOG(INFO) << "Prefetch batch: " << batch_timer.MilliSeconds() << " ms.";
  DLOG(INFO) << "     Read time: " << read_time / 1000 << " ms.";
  DLOG(INFO) << "Transform time: " << trans_time / 1000 << " ms.";
}

INSTANTIATE_CLASS(ImageDataLayer);
REGISTER_LAYER_CLASS(ImageData);

}  // namespace caffe
#endif  // USE_OPENCV

Summary: the structure of this .cpp mirrors that of data_layer.cpp; its job is to read images from the file list and feed them (together with their labels) to the top blobs.
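To make the file-list convention concrete, here is a minimal standalone sketch (paths are placeholders; a Caffe build with USE_OPENCV is assumed) that parses "image_path label" lines and loads each image with ReadImageToCVMat, the same helper the layer uses above:

#include <cstdlib>
#include <fstream>
#include <iostream>
#include <string>
#include <utility>
#include <vector>

#include <opencv2/core/core.hpp>

#include "caffe/util/io.hpp"

int main() {
  const std::string root_folder = "data/images/";        // placeholder
  std::ifstream infile("data/train_list.txt");           // placeholder list file
  std::vector<std::pair<std::string, int> > lines;
  std::string line;
  while (std::getline(infile, line)) {
    const size_t pos = line.find_last_of(' ');            // "image_path label" per line
    lines.push_back(std::make_pair(line.substr(0, pos),
                                   atoi(line.substr(pos + 1).c_str())));
  }
  for (size_t i = 0; i < lines.size(); ++i) {
    // height/width of 0 keep the original size; true loads a color image
    cv::Mat img = caffe::ReadImageToCVMat(root_folder + lines[i].first, 0, 0, true);
    if (!img.data) {
      std::cerr << "Could not load " << lines[i].first << std::endl;
      continue;
    }
    std::cout << lines[i].first << " label=" << lines[i].second
              << " (" << img.cols << "x" << img.rows << ")" << std::endl;
  }
  return 0;
}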