Memory-Efficient Implementation of DenseNets

阿新 • Published: 2018-11-22

**Paper:** https://arxiv.org/abs/1707.06990
**PyTorch implementation:** https://github.com/gpleiss/efficient_densenet_pytorch
**TensorFlow implementation:** https://github.com/joeyearsley/efficient_densenet_tensorflow

This work improves how DenseNets are implemented; it does not change the network architecture.

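Intermediate features are kept in shared storage to reduce the model's GPU memory usage. The price is longer training time, because the outputs of some layers have to be recomputed.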
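The trade-off can be illustrated with a toy, pure-Python sketch. It is not the code from either repository: `SharedBuffer`, `concat_into`, `forward`, and `backward_wrt_concat` are hypothetical names, and the "layer" is just concat, ReLU, and a weighted sum. The point is that the concatenated features are not stored per layer; they are rebuilt in the shared buffer when the backward pass needs them.

```python
class SharedBuffer:
    """One pre-allocated scratch area reused by every layer (hypothetical helper)."""

    def __init__(self, size):
        self.data = [0.0] * size


def concat_into(buffer, inputs):
    """Copy all input feature lists into the shared buffer; return the length used."""
    used = 0
    for features in inputs:
        buffer.data[used:used + len(features)] = features
        used += len(features)
    return used


def forward(inputs, weights, buffer):
    """Toy layer: concat -> ReLU -> weighted sum. The concat is not kept afterwards."""
    used = concat_into(buffer, inputs)
    return sum(w * max(v, 0.0) for w, v in zip(weights, buffer.data[:used]))


def backward_wrt_concat(inputs, weights, buffer, grad_out=1.0):
    """Gradient of the layer output with respect to each concatenated value."""
    # The buffer may have been overwritten by later layers, so the concat is
    # recomputed from the (still stored) inputs. This is the extra compute.
    used = concat_into(buffer, inputs)
    return [grad_out * w if v > 0.0 else 0.0
            for w, v in zip(weights, buffer.data[:used])]


buffer = SharedBuffer(size=3)
layer1_inputs = [[1.0, -2.0], [3.0]]
layer1_weights = [0.5, 0.5, 2.0]
out1 = forward(layer1_inputs, layer1_weights, buffer)

# A later layer reuses the same buffer and overwrites layer 1's concat.
out2 = forward([[-1.0, -1.0, -1.0]], [1.0, 1.0, 1.0], buffer)

# Layer 1's gradient is still correct because its concat is recomputed.
grads1 = backward_wrt_concat(layer1_inputs, layer1_weights, buffer)
```

Only the layer inputs stay alive between the forward and backward passes; the concatenation costs one extra copy per layer during backpropagation.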

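Earlier convolutional network implementations allocate new memory for every concatenation and batch normalization (BN) operation. Here, the intermediate features produced by the concat and BN operations are instead written, through pointers, into pre-allocated shared memory storage that serves as temporary storage buffers, which greatly reduces the amount of memory needed.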
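A rough count shows why this matters. The sketch below is a simplification under stated assumptions: it counts only the channels held by concatenation outputs inside one dense block, ignores BN outputs and spatial size, and uses hypothetical function names and an assumed 24 initial channels.

```python
def naive_concat_features(num_layers, growth_rate, init_features):
    """Channels held when every layer keeps its own concatenated input."""
    # Layer l (0-based) sees init_features + l * growth_rate input channels,
    # and a naive implementation allocates a fresh tensor for each concatenation.
    return sum(init_features + l * growth_rate for l in range(num_layers))


def shared_buffer_features(num_layers, growth_rate, init_features):
    """Channels held when all concatenations reuse one pre-allocated buffer."""
    # A single buffer sized for the widest concatenation (the last layer's
    # input) is overwritten by each layer in turn.
    return init_features + (num_layers - 1) * growth_rate


if __name__ == "__main__":
    for layers in (6, 12, 24, 48):
        naive = naive_concat_features(layers, growth_rate=12, init_features=24)
        shared = shared_buffer_features(layers, growth_rate=12, init_features=24)
        print(f"layers={layers:3d}  naive={naive:6d}  shared={shared:5d}")
```

The naive count grows quadratically with the number of layers, while the shared-buffer count grows linearly.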

With a fixed GPU memory budget, this makes it possible to train deeper networks, which can in turn give better results.

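TensorFlow implementation:
Hyperparameters:
--batch_size (int) - number of images per batch (default 3750)
--fp16 (bool) - whether to run with FP16 (default False)
--efficient (bool) - whether to run with gradient checkpointing (default False)
This implementation uses Horovod, which helps users set up distributed training.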
PyTorch implementation:
Hyperparameters:
The efficient parameter controls whether storage is shared; set efficient=True to enable it.
--depth (int) - depth of the network (number of convolution layers) (default 40)
--growth_rate (int) - number of features added per DenseNet layer (default 12)
--n_epochs (int) - number of epochs for training (default 300)
--batch_size (int) - size of minibatch (default 256)
--seed (int) - manually set the random seed (default None)
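To make the defaults concrete, here is a hypothetical `argparse` parser that mirrors the options listed for the PyTorch version. This is only a sketch: the repository's actual command-line handling may be implemented differently, and exposing `efficient` as an `--efficient` switch is an assumption.

```python
import argparse


def build_parser():
    """Hypothetical CLI mirroring the hyperparameters listed above."""
    parser = argparse.ArgumentParser(description="DenseNet training options (sketch)")
    parser.add_argument("--depth", type=int, default=40,
                        help="depth of the network (number of convolution layers)")
    parser.add_argument("--growth_rate", type=int, default=12,
                        help="number of features added per DenseNet layer")
    parser.add_argument("--n_epochs", type=int, default=300,
                        help="number of epochs for training")
    parser.add_argument("--batch_size", type=int, default=256,
                        help="size of minibatch")
    parser.add_argument("--seed", type=int, default=None,
                        help="manually set the random seed")
    # Assumption: shared storage is toggled by a flag; off unless requested.
    parser.add_argument("--efficient", action="store_true",
                        help="share storage for intermediate features")
    return parser


# Parse explicit argument lists so the example does not depend on sys.argv.
defaults = build_parser().parse_args([])
custom = build_parser().parse_args(["--depth", "100", "--efficient"])
```

Here `defaults` carries the values listed above, and `custom` shows a deeper network with shared storage switched on.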

Other implementations are also available:
LuaTorch (by Gao Huang)
MxNet (by Danlu Chen)
Caffe (by Tongcheng Li)
(All of them are authors of the paper.)
