深度學習必讀的一些資料

阿新 • • 發佈：2019-02-10

List of reading lists and survey papers:

Books
- Deep Learning, Yoshua Bengio, Ian Goodfellow, Aaron Courville, MIT Press, In preparation.

Review Papers
- The monograph or review paper Learning Deep Architectures for AI (Foundations & Trends in Machine Learning, 2009).
- Deep Machine Learning – A New Frontier in Artificial Intelligence Research – a
  
  survey paper by Itamar Arel, Derek C. Rose, and Thomas P. Karnowski.
- Graves, A. (2012). Supervised sequence labelling with recurrent neural networks(Vol. 385). Springer.
- Schmidhuber, J. (2014). Deep Learning in Neural Networks: An Overview. 75 pages, 850+ references, http://arxiv.org/abs/1404.7828, PDF & LATEX source & complete public BIBTEX file under
  
  http://www.idsia.ch/~juergen/deep-learning-overview.html.
Reinforcement Learning
- Mnih, Volodymyr, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. “Playing Atari with deep reinforcement learning.” arXiv preprint arXiv:1312.5602 (2013).
- Volodymyr Mnih, Nicolas Heess, Alex Graves, Koray Kavukcuoglu. “
  
  Recurrent Models of Visual Attention” ArXiv e-print, 2014.

Computer Vision
- Going Deeper with Convolutions, Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich, 19-Sept-2014.
- Learning Hierarchical Features for Scene Labeling, Clement Farabet, Camille Couprie, Laurent Najman and Yann LeCun, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013.
- Learning Convolutional Feature Hierachies for Visual Recognition, Koray Kavukcuoglu, Pierre Sermanet, Y-Lan Boureau, Karol Gregor, Michaël Mathieu and Yann LeCun, Advances in Neural Information Processing Systems (NIPS 2010), 23, 2010.
- Cireşan, D. C., Meier, U., Gambardella, L. M., & Schmidhuber, J. (2010). Deep, big, simple neural nets for handwritten digit recognition. Neural computation, 22(12), 3207-3220.
- Ciresan, Dan, Ueli Meier, and Jürgen Schmidhuber. “Multi-column deep neural networks for image classification.” Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2012.
- Ciresan, D., Meier, U., Masci, J., & Schmidhuber, J. (2011, July). A committee of neural networks for traffic sign classification. In Neural Networks (IJCNN), The 2011 International Joint Conference on (pp. 1918-1921). IEEE.
Disentangling Factors and Variations with Depth
- Goodfellow, Ian, et al. “Measuring invariances in deep networks.” Advances in neural information processing systems 22 (2009): 646-654.
- Bengio, Yoshua, et al. “Better Mixing via Deep Representations.” arXiv preprint arXiv:1207.4404 (2012).
Transfer Learning and domain adaptation
- Raina, Rajat, et al. “Self-taught learning: transfer learning from unlabeled data.” Proceedings of the 24th international conference on Machine learning. ACM, 2007.
- R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu and P. Kuksa. Natural Language Processing (Almost) from Scratch. Journal of Machine Learning Research, 12:2493-2537, 2011.
- Mesnil, Grégoire, et al. “Unsupervised and transfer learning challenge: a deep learning approach.” Unsupervised and Transfer Learning Workshop, in conjunction with ICML. 2011.
- Ciresan, D. C., Meier, U., & Schmidhuber, J. (2012, June). Transfer learning for Latin and Chinese characters with deep neural networks. In Neural Networks (IJCNN), The 2012 International Joint Conference on (pp. 1-6). IEEE.
Practical Tricks and Guides
- Practical recommendations for gradient-based training of deep architectures, Yoshua Bengio, U. Montreal, arXiv report:1206.5533, Lecture Notes in Computer Science Volume 7700, Neural Networks: Tricks of the Trade Second Edition, Editors: Grégoire Montavon, Geneviève B. Orr, Klaus-Robert Müller, 2012.
- A practical guide to training Restricted Boltzmann Machines, by Geoffrey Hinton.
Foundation Theory and Motivation
- Hinton, Geoffrey E. “Deterministic Boltzmann learning performs steepest descent in weight-space.” Neural computation 1.1 (1989): 143-150.
- Bengio, Yoshua, and Samy Bengio. “Modeling high-dimensional discrete data with multi-layer neural networks.” Advances in Neural Information Processing Systems 12 (2000): 400-406.
- Bengio, Yoshua, et al. “Greedy layer-wise training of deep networks.” Advances in neural information processing systems 19 (2007): 153.
- Bengio, Yoshua, Martin Monperrus, and Hugo Larochelle. “Nonlocal estimation of manifold structure.” Neural Computation 18.10 (2006): 2509-2528.
- Hinton, Geoffrey E., and Ruslan R. Salakhutdinov. “Reducing the dimensionality of data with neural networks.” Science 313.5786 (2006): 504-507.
- Marc’Aurelio Ranzato, Y., Lan Boureau, and Yann LeCun. “Sparse feature learning for deep belief networks.” Advances in neural information processing systems 20 (2007): 1185-1192.
- Bengio, Yoshua, and Yann LeCun. “Scaling learning algorithms towards AI.” Large-Scale Kernel Machines 34 (2007).
- Le Roux, Nicolas, and Yoshua Bengio. “Representational power of restricted boltzmann machines and deep belief networks.” Neural Computation 20.6 (2008): 1631-1649.
- Sutskever, Ilya, and Geoffrey Hinton. “Temporal-Kernel Recurrent Neural Networks.” Neural Networks 23.2 (2010): 239-243.
- Le Roux, Nicolas, and Yoshua Bengio. “Deep belief networks are compact universal approximators.” Neural computation 22.8 (2010): 2192-2207.
- Bengio, Yoshua, and Olivier Delalleau. “On the expressive power of deep architectures.” Algorithmic Learning Theory. Springer Berlin/Heidelberg, 2011.
- Montufar, Guido F., and Jason Morton. “When Does a Mixture of Products Contain a Product of Mixtures?.” arXiv preprint arXiv:1206.0387 (2012).
- Montúfar, Guido, Razvan Pascanu, Kyunghyun Cho, and Yoshua Bengio. “On the Number of Linear Regions of Deep Neural Networks.” arXiv preprint arXiv:1402.1869 (2014).
Supervised Feedfoward Neural Networks
- The Manifold Tangent Classifier, Salah Rifai, Yann Dauphin, Pascal Vincent, Yoshua Bengio and Xavier Muller, in: NIPS’2011.
- Goodfellow, I., Warde-Farley, D., Mirza, M., Courville, A., and Bengio, Y. (2013). Maxout networks. Technical Report, Universite de Montreal.
- Wang, Sida, and Christopher Manning. “Fast dropout training.” In Proceedings of the 30th International Conference on Machine Learning (ICML-13), pp. 118-126. 2013.
- Glorot, Xavier, Antoine Bordes, and Yoshua Bengio. “Deep sparse rectifier networks.” In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. JMLR W&CP Volume, vol. 15, pp. 315-323. 2011.
Large Scale Deep Learning
- Building High-level Features Using Large Scale Unsupervised Learning Quoc V. Le, Marc’Aurelio Ranzato, Rajat Monga, Matthieu Devin, Kai Chen, Greg S. Corrado, Jeffrey Dean, and Andrew Y. Ng, ICML 2012.
- Bengio, Yoshua, et al. “Neural probabilistic language models.” Innovations in Machine Learning (2006): 137-186. Specifically Section 3 of this paper discusses the asynchronous SGD.
- Dean, Jeffrey, et al. “Large scale distributed deep networks.” Advances in Neural Information Processing Systems. 2012.
Hyper Parameters

Optimization
- Schaul, Tom, Sixin Zhang, and Yann LeCun. “No More Pesky Learning Rates.” arXiv preprint arXiv:1206.1106 (2012).
- Le Roux, Nicolas, Pierre-Antoine Manzagol, and Yoshua Bengio. “Topmoumoute online natural gradient algorithm.” Neural Information Processing Systems (NIPS). 2007.
- Bordes, Antoine, Léon Bottou, and Patrick Gallinari. “SGD-QN: Careful quasi-Newton stochastic gradient descent.” The Journal of Machine Learning Research 10 (2009): 1737-1754.
- Glorot, Xavier, and Yoshua Bengio. “Understanding the difficulty of training deep feedforward neural networks.” Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS’10). Society for Artificial Intelligence and Statistics. 2010.
- Glorot, Xavier, Antoine Bordes, and Yoshua Bengio. “Deep Sparse Rectifier Networks.” Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. JMLR W&CP Volume. Vol. 15. 2011.
- “Deep learning via Hessian-free optimization.” Martens, James. Proceedings of the 27th International Conference on Machine Learning (ICML). Vol. 951. 2010.
- Hochreiter, Sepp, and Jürgen Schmidhuber. “Flat minima.” Neural Computation, 9.1 (1997): 1-42.
- Pascanu, Razvan, and Yoshua Bengio. “Revisiting natural gradient for deep networks.” arXiv preprint arXiv:1301.3584 (2013).
- Advances in Neural Information Processing Systems, pp. 2933-2941. 2014.
Unsupervised Feature Learning
- Salakhutdinov, Ruslan, and Geoffrey E. Hinton. “Deep boltzmann machines.” Proceedings of the international conference on artificial intelligence and statistics. Vol. 5. No. 2. Cambridge, MA: MIT Press, 2009.
- Deep Boltzmann Machines
  - An Efficient Learning Procedure for Deep Boltzmann Machines, Ruslan Salakhutdinov and Geoffrey Hinton, Neural Computation August 2012, Vol. 24, No. 8: 1967 — 2006.
  - Montavon, Grégoire, and Klaus-Robert Müller. “Deep Boltzmann Machines and the Centering Trick.” Neural Networks: Tricks of the Trade (2012): 621-637.
  - Salakhutdinov, Ruslan, and Hugo Larochelle. “Efficient learning of deep boltzmann machines.” International Conference on Artificial Intelligence and Statistics. 2010.
  - Salakhutdinov, Ruslan. . Diss. University of Toronto, 2009.
  - Goodfellow, Ian, et al. “Multi-prediction deep Boltzmann machines.” Advances in Neural Information Processing Systems. 2013.

深度學習的一些資料集介紹

資料集分為三類：影象處理相關資料集，自然語言處理相關資料集和語音處理相關資料集。參考：here 以下主要是影象處理相關資料集。 1、mnist：詳情 MNIST資料來自美國國家標準與技術研究所，National Institute of Standards and Technology（

深度學習必讀的一些資料

List of reading lists and survey papers: Books Deep Learning, Yoshua Bengio, Ian Goodfellow, Aaron Courville, MIT Press, In preparation. Review Papers

深度學習（四）轉--入門深度學習的一些開源代碼

姿態估計 multi 入門 nat project bic obj algorithm taf 原文作者：aircraft 原文鏈接：沒錯這篇又是轉發的，因為覺得學習深度學習難免要從別人的代碼開始，所以就轉發了。不過轉發的時候沒找到原作者是誰，所以原作者看到不要

讀懂人工智慧、機器學習、深度學習、大資料，自然語言處理……

從機器學習談起　　在本篇文章中，我將對機器學習做個概要的介紹。本文的目的是能讓即便完全不瞭解機器學習的人也能瞭解機器學習，並且上手相關的實踐。這篇文件也算是EasyPR開發的番外篇，從這裡開始，必須對機器學習瞭解才能進一步介紹EasyPR的核心。當然，本文也面對一般讀者，不會

機器學習 Machine Learning 深度學習 Deep Learning 資料

分享一下我老師大神的人工智慧教程！零基礎，通俗易懂！http://blog.csdn.net/jiangjunshow 也歡迎大家轉載本篇文章。分享知識，造福人民，實現我們中華民族偉大復興！

機器學習 Machine Learning 深度學習 Deep Learning 資料 Chapter 1

關於在深度學習中訓練資料集的batch的經驗總結

由於深度學習的網格很大，用來訓練的資料集也很大。因此不可能一下子將所有資料集都輸入到網路中，便引入了batch_size的概念，下面總結自己兩種常用的呼叫batch的方法 1、使用TensorFlow， tf.train.batch（）。 2、 offset = (offset

分享《深度學習與計算機視覺演算法原理框架應用》《大資料架構詳解從資料獲取到深度學習》PDF資料集

下載：https://pan.baidu.com/s/12-s95JrHek82tLRk3UQO_w 更多資料分享：http://blog.51cto.com/3215120 《深度學習與計算機視覺演算法原理、框架應用》PDF，帶書籤，347頁。《大資料架構詳解：從資料獲取到深度學習》PDF，帶書籤，3

分享《深度學習與計算機視覺演算法原理框架應用》PDF《大資料架構詳解從資料獲取到深度學習》PDF +資料集

下載：https://pan.baidu.com/s/12-s95JrHek82tLRk3UQO_w 更多分享資料：https://www.cnblogs.com/javapythonstudy/ 《深度學習與計算機視覺演算法原理、框架應用》PDF，帶書籤，347頁。《大資料架構詳解：從資料獲取到深度學

深度學習中的資料增廣

問題一：為什麼需要大量的資料當訓練機器學習模型的時候，實際上實在調整它的引數，使得可以跟一個特定的輸入符合。優化的目標是 chase that sweet spot where our model’s loss is low。當前最好的神經網路擁有的引數量是上百萬的量級。

**ubuntu 14.04 安裝好後關於深度學習的一些簡單操作*

1.換原 sudo vim /etc/apt/sources.list #修改原如果沒有vim，可以使用gedit編輯器來修改原 sudo gedit /etc/apt/sources.list 內容替換如下（選用阿里原）： deb http://mirror

Tensorflow深度學習入門——自制資料集

python 將自己的圖片資料集製作成tensorflow可讀取的資料集檔案*.cvs 這裡假設你已經有了樣本圖片資料集，而且正樣本和負樣本已經分好類了說明下製作正樣本資料集*.csv的過程，負樣本資料集的製作也同樣 import os,os.path imp

機器學習，深度學習，免費資料集彙總

【第一波】目前系統整理了一些網上開放的免費科研資料集，以下是分類列表以及下載地址，供高校和科研機構免費下載和使用。金融美國勞工部統計局官方釋出資料上證A股日線資料，1999.12.09 至 2016.06.08，前復權，1095支股票深證A股日線資料，1999

深度學習：醫療資料

一、乳腺：MIAS MiniMammographic Database MIAS MiniMammographic Database(來自researchgate的一個問答)：322例，尺寸：1024*1024pixel，8位，影象資料是PGM格式，找到一個介紹

Peter Cnudde談雅虎如何使用Hadoop、深度學習和大資料平臺

本文要點　　瞭解雅虎如何利用Hadoop和大資料平臺技術; 　　在類似Flickr和Esports這樣的產品中，雅虎如何使用深度學習技術進行場景檢測和物件識別; 　　機器學習在影象識別、定向廣告、搜尋排名、濫用檢測和個性化中的應用; 　　Hadoop叢集上用於分類和排名的機器

深度學習之TFRecord資料集讀、寫的製作、讀取及驗證具體操作過程

如題，TensorFlow官方為我們提供了資料讀取的標準格式：TFRecord，本文主要闡述了該資料格式的製作、讀取及驗證三個具體操作過程。簡要介紹：tfrecord資料檔案是一種將影象資料和標籤統一儲存的二進位制檔案，能更好的利用記憶體，在tensorflow中快速的複製，

深度學習筆記8 資料預處理

資料預處理標準流程自然灰度影象（1）灰度影象具有平穩特性，對每個資料樣本分別做均值消減（即減去直流分量）——每個影象塊，計算平均畫素值，並將影象每個畫素點減去均值。每個影象塊有一個不同的均值。 x=x-repmat(mean(x,1),size(x

關於深度學習的一些比較好的網站總結

神經網路模型之AlexNet的一些總結 http://www.cnblogs.com/gongxijun/p/6027747.html 卷積與濾波的一些特點

深度學習中的資料增強方法

對於較深層次的深度神經網路，其效能會隨著訓練資料的提升而進一步提升。目前深度學習方法廣泛採用的資料增強方法，主要有： multi-scale：多尺度； translate：平移，[-6, -6

【深度學習】IMDB資料集上電影評論二分類

任務描述根據電影評論的文字內容來將電影劃分為正面或者負面。 IMDB資料集 50000條兩級分化的評論。正面負面各為50%。 # 載入資料 from keras.datasets import imdb (train_data, train_labels), (test

深度學習必讀的一些資料

Books

Review Papers

Reinforcement Learning

Computer Vision

Disentangling Factors and Variations with Depth

Transfer Learning and domain adaptation

Practical Tricks and Guides

Foundation Theory and Motivation

Supervised Feedfoward Neural Networks

Large Scale Deep Learning

Hyper Parameters

Optimization

Unsupervised Feature Learning

Deep Boltzmann Machines

相關推薦