Machine Learning Engineering, Andriy Burkov, 2020 (True Positive Inc.) - Offers practical guidance on building, deploying, and maintaining ML systems, including architectural considerations for serving models in different operational contexts.
MLOps: A guide to operations for machine learning, Google Cloud, 2023 (Google Cloud) - Provides an industry perspective on MLOps best practices, covering model deployment patterns such as online and batch prediction in production ML systems.