Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2025 (Stanford University (online edition)) - A comprehensive and widely-used textbook that provides a detailed explanation of N-gram language models, their mathematical foundations, and applications in speech recognition.
Foundations of Statistical Natural Language Processing, Christopher Manning and Hinrich Schütze, 1999 (The MIT Press) - A seminal textbook presenting statistical methods, including N-gram models, that underpin much of early natural language processing research.