Designing Machine Learning Systems: An Iterative Process for Production-Ready Applications, Chip Huyen, 2022 (O'Reilly Media) - A broad treatment of building robust machine learning systems, with detailed coverage of feature engineering, feature stores, and strategies for preventing training-serving skew.
Stream Processing with Apache Flink: Fundamentals, Implementation, and Operation of Streaming Applications, Fabian Hueske, Vasiliki Kalavri, 2019 (O'Reilly Media) - A guide to Apache Flink that details its real-time data processing capabilities and how they apply to unified batch and stream processing architectures for feature computation.
Hidden Technical Debt in Machine Learning Systems, D. Sculley, Gary Holt, Daniel Golovin, Eugene Davydov, Todd Phillips, Dietmar Ebner, Vinay Chaudhary, Michael Young, Jean-François Crespo, Dan Dennison, 2015, Advances in Neural Information Processing Systems 28 (NIPS 2015) (Neural Information Processing Systems Foundation, Inc.), DOI: 10.5555/2969442.2969562 - A seminal paper on the challenges of putting machine learning into production, including the data dependency and pipeline problems that underlie training-serving skew.