Hidden Technical Debt in Machine Learning Systems, D. Sculley, Gary Holt, Daniel Golovin, Eugene Davydov, Todd Phillips, Dietmar Ebner, Vinay Chaudhary, Michael Young, Jean-François Crespo, Dan Dennison, 2015, Advances in Neural Information Processing Systems (NeurIPS) 28 (Neural Information Processing Systems Foundation, Inc.) - Identifies core challenges leading to complexity and maintenance burden in production ML systems.
Engineering MLOps: Machine Learning Operations at Scale, Emmanuel Raj, Harish Lakshmanan, Anurag Agarwal, 2022 (O'Reilly Media) - A comprehensive guide to designing and implementing scalable MLOps platforms, covering abstraction, workflows, and infrastructure.