1. 程式人生 > >Fundamentals of Speech Recognition: Lawrence Rabiner, Biing

Fundamentals of Speech Recognition: Lawrence Rabiner, Biing

This book is a comprehensive and excellent introduction to the ever-expanding
field of Automatic Speech Recognition. Starting with models of speech
production, speech characterization, methods of analysis (transforms etc),
the authors go onto discuss pattern comparison, hidden Markov models (HMMs),
and design and implementation of speech recognition systems, right from
isolated word recognition to large vocabulary continuous speech recognition
systems. Neural networks and their use in speech recognition is also presented,
though somewhat briefly.
Rabiner was the author of the first widely-read tutorial on HMMs, so
naturally the presentation of HMMs is one of the strong points of this
textbook. The theory is developed in detail, but in an easy to follow
fashion, starting with the very basics and with plenty of helpful examples.
The implementation is discussed at great length as well, starting with
the simplest of tasks and progressing to the state-of-the-art (circa 1993).
That isn't to say that HMMs are the only good part of this book - indeed,
practically every topic, whether it be perception, transforms, vector quantization
or dynamic programming, is presented with great clarity. This book really is easy to
learn from, with numerous examples and illustrations.
The field of speech recognition is inherently multi-disciplinary in nature,
drawing upon various areas of study, including Physics, Physiology, Acoustics,
Signal Processing and Computer Science, to name but a few. The authors do a
great job of explaining all these facets, as well as the mathematics that
is an essential tool.

The only caveat is that it's now a little old (published 1993), since the
field has been growing by leaps and bounds - so while the basics remain
the same, things have changed and hence what's said here should not be
taken as the last word on the subject.
Perhaps a new edition is due, and would certainly be most welcome.
However, for an excellent, accessible introduction to this exciting field,
this is still a great choice.

相關推薦

Fundamentals of Speech Recognition: Lawrence Rabiner, Biing

This book is a comprehensive and excellent introduction to the ever-expanding field of Automatic Speech Recognition. Starting with models of speech produ

The Past, Present, and Future of Speech Recognition Technology

The earliest advances in speech recognition focused mainly on the creation of vowel sounds, as the basis of a system that might also learn to interpret pho

A Bayesian Approach to Deep Neural Network Adaptation with Applications to Robust Automatic Speech Recognition

機器學習 屬於 瓶頸 特征 oid ack enter 變換 表示 基於貝葉斯的深度神經網絡自適應及其在魯棒自動語音識別中的應用 直接貝葉斯DNN自適應 使用高斯先驗對DNN進行MAP自適應 為何貝葉斯在模型自適應中很有用? 因為自適應問題可以視為後驗估計

Utterance-Wise Recurrent Dropout And Iterative Speaker Adaptation For Robust Monaural Speech Recognition

back hid eve 以及 pre learn line sig ann 單聲道語音識別的逐句循環Dropout叠代說話人自適應 WRBN(wide residual BLSTM network,寬殘差雙向長短時記憶網絡) [2] J. Heymann

[Unit Testing] Fundamentals of Testing in Javascript

sar help catch ret same develop more ESS cts In this lesson, we’ll get the most fundamental understanding of what an automated test

CS2204 Fundamentals of Internet Application Development

代做CS2204留學生作業、代寫HTML5語言作業、代做Internet Application Development作業、代寫HTML/CSS/Web作業Department of Computer ScienceCity University of Hong KongCS2204 Fundamental

斯坦福大學-自然語言處理入門 筆記 第十二課 詞性標註(Part-of-speech tagging)

一、詞性(part-of-speech)介紹 詞性:名詞(Nouns),動詞(Verbs),形容詞(Adjectives), 副詞(Adverbs)等等就是我們想要研究的詞性 我們可以把詞性分為開放類(open class)和閉合類(closed class)。

人工智慧入門(一):Fundamentals of Artificial Intelligence

參考教材:https://people.cs.kuleuven.be/~danny.deschreye/FAI/ 在FAI的introduction課中,有一個很基本的目標是:實現一個可以通過圖靈測試的chatbox。 主要知識點涉及: 1.搜尋演算法:包括basic search(blind,heur

人工智能入門(一):Fundamentals of Artificial Intelligence

博弈 trac 一個 chat const esc 構建 人工智 constrain 參考教材:https://people.cs.kuleuven.be/~danny.deschreye/FAI/ 在FAI的introduction課中,有一個很基本的目標是:實現一個可以

A CONVERSATIONAL NEURAL LANGUAGE MODEL FOR SPEECH RECOGNITION IN DIGITAL ASSISTANTS文獻閱讀筆記

摘要:對話序列有利於提高數字助手(可以理解為手機的siri,微軟小冰等)的能力,我們探索了神經網路語言模型模擬數字助手的對話。我們提出的結果可以有效刻畫對話特徵,在識別率上相對提高了%4. 1.     不同於其他領域的語音識別,數字助手主要為對話形式的。所以應該建立一個

GNG1106 – Fundamentals of Engineering Computation

GNG1106作業代寫、代做Electrical Engineering作業、代寫python, C/C++程式設計作業GNG1106 – Fundamentals of Engineering ComputationCourse ProjectElectrical EngineeringDesign of

Fundamentals of Logic

Fundamentals of Logic         To make complicated mathematical relationships clear,it is convenient to use the notation of symbolic lo

ECE 150: Fundamentals of Programming

ECE 150作業代做、代寫C/C++語言作業、代做Programming作業、C/C++程式作業代寫ECE 150: Fundamentals of Programming(Sections 001 and 002)Project 3Deadline: 11:59pm Monday December 3,

Rewiew: Unsupervised Learning of Digit Recognition Using Spike-Timing-Dependent Plasticity(IEEE)

閱讀時間:2017年12月 更新時間:2018年6月,合併了在《Frontiers in computational neuroscience》上發表的同名文章 文章資訊 題目:基於STDP非監督學習的數字識別 刊物:IEEE Transactions on Neur

Fundamentals of Power Electronics 中文版譯文

寫在前面的話: · R. W. Erickson 的《Fundamentals of Power Electronics》是一部非常經典的著作,非常全面且基礎性的闡述了電力電子基礎技術。 · 最近正在拜讀,出於個人愛好對其中部分進行了翻譯,原文對應為Funda

word2vec, LSTM Speech Recognition實戰, 圖資料庫

word2vec word2vec是Google於2013年開源推出的一個用於獲取word vector的工具包。作者是Tomas Mikolov。 Github: 注:Tomas Mikolov,捷克布林諾科技大學博士。先後在Google、Facebook

Fundamentals of display technologies for Augmented and Virtual Reality

Display technologiesDisplay typesFully immersiveThese are standard fully immersive virtual reality displays. These stereoscopic displays are combined with

So why is freedom of speech important anyway?

For those who don’t know (and I didn’t), Alex Jones is some conspiracy theory fanatic with a podcast. If you’re curious, you can read on Wikipedia about …

Ask HN: Resources about the fundamentals of programming languages?

Hi HN, what is your favorite book/course to learn about the fundamentals of programming languages?My goal is to learn more about things like different ways

Ask HN: Best way to learn the fundamentals of operating systems

I think 'tomes' is where it's at these days.An operating system covers quite a lot these days so a short book would either skip a lot of topics or be very