Applied Predictive Modeling, Max Kuhn, Kjell Johnson, 2013 (Springer)DOI: 10.1007/978-1-4614-6849-3 - A comprehensive book on predictive modeling that includes extensive discussions on data preprocessing and feature creation from a statistical perspective.
CS229 Lecture Notes, Andrew Ng, Christopher Ré, Moses Charikar, Tengyu Ma, Anand Avati, Kian Katanforoosh, Yoann Le Calonnec, John Duchi, 2018 - Official lecture notes from Stanford's foundational machine learning course, covering data preparation and feature representation principles.