Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, Aurélien Géron, 2022 (O'Reilly Media) - A practical guide to building machine learning systems, covering the entire workflow from data preparation to deployment with Python examples (3rd edition).
Python for Data Analysis, Wes McKinney, 2022 (O'Reilly Media) - Authoritative resource for data manipulation, cleaning, and preparation using Python's Pandas library, essential for exploratory data analysis (3rd edition).
Preprocessing data, Scikit-learn developers, 2024 - Official documentation explaining various data preprocessing techniques and tools available in Scikit-learn, directly relevant to the chapter's focus.
Hidden Technical Debt in Machine Learning Systems, D. Sculley, Gary Holt, Daniel Golovin, Eugene Davydov, Todd Phillips, Dietmar Ebner, Vinay Chaudhary, Michael Young, Jean-François Crespo, Dan Dennison, 2015Advances in Neural Information Processing Systems, Vol. 2 (MIT Press) - A widely cited paper discussing the engineering challenges and costs of machine learning systems, highlighting the importance of infrastructure, data dependencies, and the non-model components of the workflow.