Statistical Methods for Speech Recognition, Fred Jelinek, 1998 (The MIT Press) - This book presents the foundational statistical methods for speech recognition, providing a detailed explanation of acoustic models, language models, and their combination for optimal transcription.
Deep Learning for Speech Recognition: An Overview, Li Deng and Dong Yu, 2014, IEEE Signal Processing Magazine, Vol. 31 (IEEE), DOI: 10.1109/MSP.2013.2290903 - This overview article discusses the application of deep learning methods to speech recognition, demonstrating how neural networks enhance both acoustic and language modeling components to improve accuracy.