論文筆記-Sequence to Sequence Learning with Neural Networks

阿新 • • 發佈：2017-12-23

map tran between work down all 9.png ever onf

大體思想和RNN encoder-decoder是一樣的，只是用來LSTM來實現。

技術分享圖片

paper提到三個important point：

1）encoder和decoder的LSTM是兩個不同的模型

2）deep LSTM表現比shallow好，選用了4層的LSTM

3）實踐中發現將輸入句子reverse後再進行訓練效果更好。So for example, instead of mapping the sentence a,b,c to the sentence α,β,γ, the LSTM is asked to map c,b,a to α,β,γ, where α, β, γ is the translation of a, b, c. This way, a is in close proximity to α, b is fairly close to β, and so on, a fact that makes it easy for SGD to “establish communication” between the input and the output.

論文筆記-Sequence to Sequence Learning with Neural Networks

map tran between work down all 9.png ever onf 大體思想和RNN encoder-decoder是一樣的，只是用來LSTM來實現。 paper提到三個important point： 1）encoder和decoder的LSTM

論文筆記-Sequence to Sequence Learning with Neural Networks

論文筆記-Sequence to Sequence Learning with Neural Networks

【論文閱讀】Sequence to Sequence Learning with Neural Networks

論文復現Sequence to sequence learning with neural networks

Sequence to Sequence Learning with Neural Networks論文閱讀

Sutskever2014_Sequence to Sequence Learning with Neural Networks

Sequence to Sequence Learning with Neural Networks

（翻譯）Sequence to Sequence Learning with Neural Networks

Deep Learning 16：用自編碼器對資料進行降維_讀論文“Reducing the Dimensionality of Data with Neural Networks”的筆記

論文筆記-Personal Recommendation Using Deep Recurrent Neural Networks in NetEase

An Introduction to Deep Learning and Neural Networks

Convolutional Sequence to Sequence Learning 論文筆記

AliMe Chat: A Sequence to Sequence and Rerank based Chatbot Engine論文筆記

Convolutional Sequence to Sequence Learning筆記

機器翻譯模型之Fairseq：《Convolutional Sequence to Sequence Learning》

Facebook的Fairseq模型詳解(Convolutional Sequence to Sequence Learning)

part-aligned系列論文：1707.Deep Representation Learning with Part Loss for Person ReID 論文閱讀筆記

Introduction.to.Machine.Learning.with.Python 筆記

Deep Learning讀書筆記（一）：Reducing the Dimensionality of Data with Neural Networks

論文筆記：SGM: Sequence Generation Model for Multi-label Classification

基於CNN的Seq2Seq模型-Convolutional Sequence to Sequence Learning

論文筆記-Sequence to Sequence Learning with Neural Networks

相關推薦