Speech and Language Processing (3rd Edition Draft), Daniel Jurafsky and James H. Martin, 2025 - This textbook provides a comprehensive treatment of traditional frequency-based models, their limitations, and the transition to distributional semantic models and word embeddings.
Efficient Estimation of Word Representations in Vector Space, Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean, 2013ICLR Workshop PaperDOI: 10.48550/arXiv.1301.3781 - This foundational paper introduces the Word2Vec model, which represents words as dense vectors (embeddings) to capture semantic relationships, addressing key limitations of frequency-based approaches.
Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, 2008 (Cambridge University Press) - This book provides a detailed explanation of vector space models and TF-IDF, clarifying the inherent assumptions and structural properties that lead to the limitations discussed in the section.