Preprocessing data (sklearn.preprocessing), Scikit-learn developers, 2024 - Official documentation for Scikit-learn's data preprocessing module, providing technical details and usage examples for encoders like OneHotEncoder and LabelEncoder.
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, Aurélien Géron, 2022 (O'Reilly Media, Inc.) - A practical guide that includes essential data preprocessing steps, such as categorical feature encoding, within the context of building machine learning models.
CatBoost: unbiased boosting with categorical features, Liudmila Prokhorenkova, Gleb Gusev, Aleksandr Vorobev, Anna Veronika Dorogush, Andrey Gulin, 2018 (Advances in Neural Information Processing Systems, Vol. 31), DOI: 10.5555/3295222.3295325 - An academic paper introducing CatBoost, a gradient boosting algorithm that natively handles categorical features and employs specialized encoding techniques to avoid data leakage and bias, particularly relevant for understanding target encoding limitations.