Feature Hashing for Large Scale Multitask Learning, Kilian Weinberger, Anirban Dasgupta, John Langford, Alex Smola, Josh Attenberg, 2009Proceedings of the 26th International Conference on Machine Learning (ICML) (Proceedings of Machine Learning Research (PMLR))DOI: 10.48550/arXiv.0902.2206 - 这篇开创性论文介绍了并推广了“哈希技巧”,用于机器学习中高效的特征表示,解决了高维数据的可伸缩性问题。
Mining of Massive Datasets, Jure Leskovec, Anand Rajaraman, Jeff Ullman, 2014 (Cambridge University Press) - 一本关于大规模数据分析技术的综合教科书,其中包含了基于哈希的降维方法讨论,为特征哈希提供了更广阔的背景。