Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, 2008 (Cambridge University Press) - Comprehensive coverage of text representation models like Bag-of-Words and TF-IDF, serving as foundational methods in text feature extraction.
GloVe: Global Vectors for Word Representation, Jeffrey Pennington, Richard Socher, Christopher Manning, 2014Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (Association for Computational Linguistics)DOI: 10.3115/v1/D14-1162 - Presents GloVe, an unsupervised learning algorithm for obtaining vector representations for words that captures global corpus statistics.