Feature Hashing for Large Scale Multitask Learning, Kilian Weinberger, Anirban Dasgupta, John Langford, Alex Smola, Josh Attenberg, 2009. Proceedings of the 26th International Conference on Machine Learning (ICML) (Proceedings of Machine Learning Research (PMLR)). DOI: 10.48550/arXiv.0902.2206 - This seminal paper introduced and popularized the 'hashing trick' for efficient feature representation in machine learning, addressing scalability issues with high-dimensional data.
Mining of Massive Datasets, Jure Leskovec, Anand Rajaraman, Jeff Ullman, 2014 (Cambridge University Press) - A comprehensive textbook on techniques for analyzing large-scale data, which includes discussions on hashing-based methods for dimensionality reduction, providing a broader context for feature hashing.
sklearn.feature_extraction.FeatureHasher, scikit-learn developers, 2023 - The official documentation for scikit-learn's implementation of feature hashing, offering practical usage details, parameter explanations, and a concrete example of its application.