Speech and Language Processing (3rd ed. draft), Daniel Jurafsky, James H. Martin, 2025 - A widely used textbook on speech and natural language processing, with detailed explanations of N-gram models, maximum likelihood estimation, and smoothing techniques.
Foundations of Statistical Natural Language Processing, Christopher D. Manning, Hinrich Schütze, 1999 (MIT Press) - A classic textbook providing a statistical treatment of language models, including the mathematical basis of N-grams and maximum likelihood estimation.
Fundamentals of Speech Recognition, Lawrence R. Rabiner, Biing-Hwang Juang, 1993 (PTR Prentice Hall) - A foundational speech recognition text covering the language modeling and probability estimation used in ASR systems.