1. 程式人生 > >RNN 資源

RNN 資源

A curated list of resources dedicated to recurrent neural networks (closely related todeep learning).

Table of Contents

Codes

Codes

Keras: Theano-based Deep Learning Library

Passage: Library for text analysis with RNNs

Caffe- C++ with MATLAB/Python wrappers

LRCNby Jeff Donahue

Torch- Lua

char-rnnby Andrej Karpathy : multi-layer RNN/LSTM/GRU for training/sampling from character-level language models

LSTMby Wojciech Zaremba : Long Short Term Memory Units to train a language model on word level Penn Tree Bank dataset

Oxfordby Nando de Freitas : Oxford Computer Science - Machine Learning 2015 Practicals

rnnby Nicholas Leonard : general library for implementing RNN, LSTM, BRNN and BLSTM (highly unit tested).

Etc.

RNNLIBby Alex Graves : C++ based LSTM library

RNNLMby Tomas Mikolov : C++ based simple code

neuraltalkby Andrej Karpathy : numpy-based RNN/LSTM implementation

gistby Andrej Karpathy : raw numpy code that implements an efficient batched LSTM

Stanford NLP (CS224d) by Richard Socher

Lecture Note 3: neural network basics

Lecture Note 4: RNN language models, bi-directional RNN, GRU, LSTM

Lecture 12: Recurrent neural networks and LSTMs

Lecture 13: (guest lecture) Alex Graves on Hallucination with RNNs

Alex Graves (2008)

Tomas Mikolov (2012)

Ilya Sutskever (2013)

Richard Socher (2014)

Bi-directional RNN [Paper]

Mike Schuster and Kuldip K. Paliwal,Bidirectional Recurrent Neural Networks, Trans. on Signal Processing 1997

LSTM [Paper]

Sepp Hochreiter and Jurgen Schmidhuber,Long Short-Term Memory, Neural Computation 1997

Multi-dimensional RNN [Paper]

Alex Graves, Santiago Fernandez, and Jurgen Schmidhuber,Multi-Dimensional Recurrent Neural Networks, ICANN 2007

GRU (Gated Recurrent Unit) [Paper]

Kyunghyun Cho, Bart van Berrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio,Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, arXiv:1406.1078 / EMNLP 2014

Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, Yoshua Bengio,Gated Feedback Recurrent Neural Networks, arXiv:1502.02367 / ICML 2015

Tree-Structured LSTM [Paper]

Kai Sheng Tai, Richard Socher, and Christopher D. Manning, arXiv:1503.00075 / ACL 2015

Grid LSTM [Paper]

Nal Kalchbrenner, Ivo Danihelka, and Alex Graves,Grid Long Short-Term Memory, arXiv:1507.01526

Klaus Greff, Rupesh Kumar Srivastava, Jan Koutnik, Bas R. Steunebrink, Jurgen Schmidhuber,LSTM: A Search Space Odyssey, arXiv:1503.04069

Tomas Mikolov, Martin Karafiat, Lukas Burget, Jan "Honza" Cernocky, Sanjeev Khudanpur,Recurrent Neural Network based Language Model, Interspeech 2010 [Paper]

Tomas Mikolov, Stefan Kombrink, Lukas Burget, Jan "Honza" Cernocky, Sanjeev Khudanpur,Extensions of Recurrent Neural Network Language Model, ICASSP 2011 [Paper]

Stefan Kombrink, Tomas Mikolov, Martin Karafiat, Lukas Burget,Recurrent Neural Network based Language Modeling in Meeting Recognition, Interspeech 2011 [Paper]

Jiwei Li, Minh-Thang Luong, and Dan Jurafsky,A Hierarchical Neural Autoencoder for Paragraphs and Documents, ACL 2015 [Paper], [Code]

Geoffrey Hinton, Li Deng, Dong Yu, George E. Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N. Sainath, and Brian Kingsbury,Deep Neural Networks for Acoustic Modeling in Speech Recognition, IEEE Signam Processing Magazine 2012 [Paper]

Alex Graves, Abdel-rahman Mohamed, and Geoffrey Hinton,Speech Recognition with Deep Recurrent Neural Networks, arXiv:1303.5778 / ICASSP 2013 [Paper]

Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho, and Yoshua Bengio,Attention-Based Models for Speech Recognition, arXiv:1506.07503 [Paper]

Univ. Montreal [Paper]

Kyunghyun Cho, Bart van Berrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio,Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, arXiv:1406.1078 / EMNLP 2014

Google [Paper]

Ilya Sutskever, Oriol Vinyals, and Quoc V. Le,Sequence to Sequence Learning with Neural Networks, arXiv:1409.3215 / NIPS 2014

Univ. Montreal [Paper]

Dzmitry Bahdanau, KyungHyun Cho, and Yoshua Bengio,Neural Machine Translation by Jointly Learning to Align and Translate, arXiv:1409.0473 / ICLR 2015

Google + NYU [Paper]

Minh-Thang Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, and Wojciech Zaremba,Addressing the Rare Word Problem in Neural Machine Transltaion, ACL 2015

Lifeng Shang, Zhengdong Lu, and Hang Li,Neural Responding Machine for Short-Text Conversation, arXiv:1503.02364 / ACL 2015 [Paper]

Oriol Vinyals and Quoc V. Le,A Neural Conversational Model, arXiv:1506.05869 [Paper]

Ryan Lowe, Nissan Pow, Iulian V. Serban, and Joelle Pineau,The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems, arXiv:1506.08909 [Paper]

Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, and Alan L. Yuille,Explain Images with Multimodal Recurrent Neural Networks, arXiv:1410.1090

Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, and Alan L. Yuille,Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN), arXiv:1412.6632 / ICLR 2015

Univ. Toronto [Paper] [Web demo]

Ryan Kiros, Ruslan Salakhutdinov, and Richard S. Zemel,Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models, arXiv:1411.2539 / TACL 2015

Berkeley [Web] [Paper]

Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell,Long-term Recurrent Convolutional Networks for Visual Recognition and Description, arXiv:1411.4389 / CVPR 2015

Google [Paper]

Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan,Show and Tell: A Neural Image Caption Generator, arXiv:1411.4555 / CVPR 2015

Microsoft [Paper]

Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, Lawrence Zitnick, and Geoffrey Zweig,From Captions to Visual Concepts and Back, arXiv:1411.4952 / CVPR 2015

Xinlei Chen, and C. Lawrence Zitnick,Learning a Recurrent Visual Representation for Image Caption Generation

Xinlei Chen, and C. Lawrence Zitnick,Mind’s Eye: A Recurrent Visual Representation for Image Caption Generation, CVPR 2015

Univ. Montreal + Univ. Toronto [Web] [Paper]

Kelvin Xu, Jimmy Lei Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard S. Zemel, and Yoshua Bengio,Show, Attend, and Tell: Neural Image Caption Generation with Visual Attention, arXiv:1502.03044 / ICML 2015

Idiap + EPFL + Facebook [Paper]

Remi Lebret, Pedro O. Pinheiro, and Ronan Collobert,Phrase-based Image Captioning, arXiv:1502.03671 / ICML 2015

UCLA + Baidu [Paper]

Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, and Alan L. Yuille,Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images, arXiv:1504.06692

Berkeley [Web] [Paper]

Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell,Long-term Recurrent Convolutional Networks for Visual Recognition and Description, arXiv:1411.4389 / CVPR 2015

UT Austin + UML + Berkeley [Paper]

Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, and Kate Saenko,Translating Videos to Natural Language Using Deep Recurrent Neural Networks, arXiv:1412.4729

Microsoft [Paper]

Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, and Yong Rui,Joint Modeling Embedding and Translation to Bridge Video and Language, arXiv:1505.01861

UT Austin + Berkeley + UML [Paper]

Subhashini Venugopalan, Marcus Rohrbach, Jeff Donahue, Raymond Mooney, Trevor Darrell, and Kate Saenko,Sequence to Sequence--Video to Text, arXiv:1505.00487

Virginia Tech. + MSR [Web] [Paper]

Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, and Devi Parikh,VQA: Visual Question Answering, arXiv:1505.00468 / CVPR 2015 SUNw:Scene Understanding workshop

MPI + Berkeley [Web] [Paper]

Mateusz Malinowski, Marcus Rohrbach, and Mario Fritz,Ask Your Neurons: A Neural-based Approach to Answering Questions about Images, arXiv:1505.01121

Univ. Toronto [Paper] [Dataset]

Mengye Ren, Ryan Kiros, and Richard Zemel,Exploring Models and Data for Image Question Answering, arXiv:1505.02074 / ICML 2015 deep learning workshop

Baidu + UCLA [Paper] [Dataset]

Hauyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, and Wei Xu,Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering, arXiv:1505.05612

Karol Gregor, Ivo Danihelka, Alex Graves, Danilo J. Rezende, and Daan Wierstra,DRAW: A Recurrent Neural Network for Image Generation,ICML 2015 [Paper]

Angeliki Lazaridou, Dat T. Nguyen, R. Bernardi, and M. Baroni,Unveiling the Dreams of Word Embeddings: Towards Language-Driven Image Generation,arXiv:1506.03500 [Paper]

Lucas Theis and Matthias Bethge,Generative Image Modeling Using Spatial LSTMs,arXiv:1506.03478 [Paper]

A.Graves, G. Wayne, and I. Danihelka.,Neural Turing Machines,arXiv preprint arXiv:1410.5401 [Paper]

Jason Weston, Sumit Chopra, Antoine Bordes,Memory Networks,arXiv:1410.3916 [Paper]

Sainbayar Sukhbaatar, Arthur Szlam, Jason Weston, and Rob Fergus,End-To-End Memory Networks, arXiv:1503.08895 [Paper]

Wojciech Zaremba and Ilya Sutskever,Reinforcement Learning Neural Turing Machines,arXiv:1505.00521 [Paper]

Marvin Zhang, Sergey Levine, Zoe McCarthy, Chelsea Finn, Pieter Abbeel,Policy Learning with Continuous Memory States for Partially Observed Robotic Control,arXiv:1507.01273.[Paper]

Speech Recognition

OpenSLR(Open Speech and Language Resources)

Image Captioning

Image Question Answering - all based on MS COCO images

VQA

[Multilingual Image QA] : in Chinese, with English translation