Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2025 (Pearson) - A comprehensive textbook covering all fundamental NLP techniques, including text representation methods like BoW, TF-IDF, N-grams, and LSA.
Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, 2008 (Cambridge University Press) - A standard textbook on information retrieval, providing detailed explanations of Bag-of-Words, TF-IDF, and vector space models for text representation.
Indexing by Latent Semantic Analysis, Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman, 1990Journal of the American Society for Information Science, Vol. 41 (Wiley)DOI: 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9 - The original paper introducing Latent Semantic Analysis (LSA), which uses Singular Value Decomposition (SVD) for dimensionality reduction in text.
Feature Hashing for Large-Scale Text Categorization, Kilian Q. Weinberger, Anirban Dasgupta, John Langford, Alex Smola, Tong Zhang, 2009Advances in Neural Information Processing Systems, Vol. 22 (NeurIPS)DOI: 10.5591/978-1-57735-420-3.2009.207 - Introduces and analyzes the 'hashing trick' (feature hashing) as an efficient method for dimensionality reduction in large-scale machine learning, particularly useful for text data.