Utterance-Wise Recurrent Dropout And Iterative Speaker Adaptation For Robust Monaural Speech Recognition

阿新 • • 發佈：2018-06-07

back hid eve 以及 pre learn line sig ann

單聲道語音識別的逐句循環Dropout叠代說話人自適應

WRBN（wide residual BLSTM network，寬殘差雙向長短時記憶網絡）

[2] J. Heymann, L. Drude, and R. Haeb-Umbach, "Wide residual blstm network with discriminative speaker adaptation for robust speech recognition," submitted to the CHiME, vol. 4, 2016.

reverberation，n. [聲] 混響；反射；反響；回響

CLDNN（convolutional, long short-term memory, fully connected deep neural networks，卷積-長短時記憶-全連接深度神經網絡）

[1] T.N. Sainath, O. Vinyals, A. Senior, and H. Sak, "Convolutional, long short-term memory, fully connected deep neural networks," in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. IEEE, 2015, pp. 4580

–4584.

speech separation，語音分離，將多說話人同時說話的語句分離為各個說話人獨立說話的語句。

在LSTM訓練中使用Dropout能有效緩解過擬合。

[3] G.E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R.R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors," arXiv preprint arXiv:1207.0580, 2012.

在輸出門、遺忘門以及輸入門使用基於語句采樣丟幀Mask

能取得最優結果（Cheng dropout）。

[7] G. Cheng, V. Peddinti, D. Povey, V. Manohar, S. Khudanpur, and Y. Yan, "An exploration of dropout with lstms," in Proceedings of Interspeech, 2017.

基於MLLR的叠代自適應方法，使用上一次叠代的解碼結果來更新高斯參數。

[10] P.C. Woodland, D. Pye, and M.J.F. Gales, "Iterative unsupervised adaptation using maximum likelihood linear regression," inSpokenLanguage, 1996.ICSLP96.Proceedings., Fourth International Conference on. IEEE, 1996, vol. 2, pp. 1133–1136.

近期提出了一種batch正則化說話人自適應。

[14] P. Swietojanski, J. Li, and S. Renals, "Learning hidden unit contributions for unsupervised acoustic model adaptation," IEEE/ACMTransactionsonAudio,Speech, and Language Processing, vol. 24, no. 8, pp. 1450– 1463, 2016.

本文使用了無監督的LIN說話人自適應

[11]

使用的LIN層矩陣維數為80*80，該層被三個輸入特征共享（原始、delta、delta-delta）。

本文嘗試使用以下兩種方式進行叠代的說話人自適應：

在叠代時使用上一次叠代的模型生成新標簽進行訓練。
每次叠代堆疊一個額外的線性輸入層（數學上，多個線性層相當於一個隱層）

傳統DNN訓練方式是segment-wise

實驗得出，使用RNN時，Iter（叠代方案）更優；使用tri-gram時，Stack（堆疊）方案更優

Utterance-Wise Recurrent Dropout And Iterative Speaker Adaptation For Robust Monaural Speech Recognition

back hid eve 以及 pre learn line sig ann 單聲道語音識別的逐句循環Dropout叠代說話人自適應 WRBN（wide residual BLSTM network，寬殘差雙向長短時記憶網絡） [2] J. Heymann

A Bayesian Approach to Deep Neural Network Adaptation with Applications to Robust Automatic Speech Recognition

機器學習屬於瓶頸特征 oid ack enter 變換表示基於貝葉斯的深度神經網絡自適應及其在魯棒自動語音識別中的應用直接貝葉斯DNN自適應使用高斯先驗對DNN進行MAP自適應為何貝葉斯在模型自適應中很有用？因為自適應問題可以視為後驗估計

Setup Apache2 in Debian 9 and enable two ports for two sites

listen apachectl art 127.0.0.1 etc bsp default star oot [email protected]/* */:~# apt-get install apache2 [email protected]/*

Talbot basically sweating the idea and can be happy for the repeat functionality

appear cas actual pear value edi fort lte finally Practically no one enjoyed more NHL hockey when compared with Cam Talbot last time. It

論文筆記-Joint Deep Modeling of Users and Items Using Reviews for Recommendation

一個 solved default view http ati onf 評分分享基本思路：利用用戶和商品的評論構建CNN預測評分。網絡結構： user review網絡與 item review網絡結構一致，僅就前者進行說明從user review tex

python安裝失敗提示“one or more issues caused the setup to fail . Please fix the issues and then retry setup.For more information see the log file”

ase ice body orm bubuko mat 解決方法 3.4 mage 換了項目組，換了新電腦，重裝Python時遇到提示如下圖所示：原因：需要安裝Windows 7 Service Pack 1 直接點擊“update your

Xcode 10 升級專案報錯 “directory not found for option” and “library not found for -libstdc++.6 ~解決方法

聯絡人:石虎 QQ:1224614774 暱稱: 嗡嘛呢叭咪哄 &

《An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its...》論文閱讀之CRNN

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition paper: CRNN 翻譯：CRNN

深度學習論文翻譯解析（二）：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

論文標題：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition 論文作者： Baoguang Shi, Xiang B

Utterance-Wise Recurrent Dropout And Iterative Speaker Adaptation For Robust Monaural Speech Recognition

Utterance-Wise Recurrent Dropout And Iterative Speaker Adaptation For Robust Monaural Speech Recognition

A Bayesian Approach to Deep Neural Network Adaptation with Applications to Robust Automatic Speech Recognition

Setup Apache2 in Debian 9 and enable two ports for two sites

Talbot basically sweating the idea and can be happy for the repeat functionality

論文筆記-Joint Deep Modeling of Users and Items Using Reviews for Recommendation

python安裝失敗提示“one or more issues caused the setup to fail . Please fix the issues and then retry setup.For more information see the log file”

Xcode 10 升級專案報錯 “directory not found for option” and “library not found for -libstdc++.6 ~解決方法

《An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its...》論文閱讀之CRNN

深度學習論文翻譯解析（二）：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

url override and HttpSession implements session for form

url override and HttpSession implements session for real

50+ Data Structure and Algorithms Interview Questions for Programmers

【文獻翻譯】ICE-BA: Incremental, Consistent and Efficient Bundle Adjustment for Visual-Inertial SLAM

網路結構搜尋（3） —— Simple and efficient architecture search for convolutional neural network

How to Develop and Evaluate Naive Methods for Forecasting Household Electricity Consumption

What is Cryptocurrency and Why It Matters for You

The Noose Tightens Around Malta and What it Means for the Island of Crypto

Fujitsu Laboratories and MIT’s Center for Brains, Minds and Machines Broaden Partnership

Made by Google Event 2018: What we expect and don't expect for Pixel, Google Home and more

4 Data Visualization and Web Reporting Tools for your BI solution

Utterance-Wise Recurrent Dropout And Iterative Speaker Adaptation For Robust Monaural Speech Recognition

相關推薦