Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2025 (Pearson) - Comprehensive textbook covering the principles of ASR, including feature extraction, acoustic modeling (HMM/GMM and deep learning), language modeling, and decoding strategies.
Deep Neural Networks for Acoustic Modeling in Speech Recognition, Geoffrey Hinton, Li Deng, Dong Yu, George Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N. Sainath, and Brian Kingsbury, 2012IEEE Signal Processing Magazine, Vol. 29 (IEEE)DOI: 10.1109/MSP.2012.2205597 - A seminal paper that details the application and effectiveness of deep neural networks in acoustic modeling, marking a major advancement in ASR.
An Introduction to Hidden Markov Models, Lawrence R. Rabiner, 1986IEEE ASSP Magazine, Vol. 3 (IEEE)DOI: 10.1109/MASSP.1986.1165342 - Foundational paper providing a clear exposition of Hidden Markov Models, which form the basis for traditional acoustic modeling in speech recognition systems.