Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, 2008 (Cambridge University Press) - Covers the bag-of-words model, TF-IDF, and other text representation methods in detail; a foundation for text feature extraction.
GloVe: Global Vectors for Word Representation, Jeffrey Pennington, Richard Socher, Christopher D. Manning, 2014, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (Association for Computational Linguistics), DOI: 10.3115/v1/D14-1162 - Introduces GloVe, an unsupervised learning algorithm for obtaining word vector representations that capture global corpus statistics.