Contextual Listen, Attend and Spell, Etienne Bruguier, Olivier Siohan, Anjishnu Kumar, George Tucker, 2019, Interspeech 2019 (International Speech Communication Association (ISCA)), DOI: 10.21437/Interspeech.2019-1784 - This paper details the 'Contextual Listen, Attend and Spell (CLAS)' model, a specific neural approach to incorporating context into attention-based end-to-end ASR systems.
Speech and Language Processing (3rd ed. draft), Daniel Jurafsky and James H. Martin, 2025 (Stanford University) - A comprehensive textbook providing a broad theoretical foundation for speech recognition, including language modeling concepts and adaptation strategies.