sklearn.preprocessing.OneHotEncoder, scikit-learn developers, 2024 - Official documentation explaining the principles and usage of one-hot encoding for categorical features, including practical examples with the scikit-learn library.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A foundational textbook providing a comprehensive theoretical background on deep learning, including discussions on distributed representations (embeddings) and their role in handling high-dimensional data.
Feature Hashing for Large Scale Multitask Learning, Kilian Q. Weinberger, Anirban Dasgupta, John Langford, Alex Smola, Tony Attwood, 2009Proceedings of the 26th International Conference on Machine Learning (ICML), Vol. 5 (ACM)DOI: 10.1145/1553374.1553516 - A seminal paper introducing feature hashing (the hashing trick) as an efficient technique for handling high-dimensional categorical features, especially in large-scale machine learning problems.