Machine Learning Design Patterns, Valliappa Lakshmanan, Sara Robinson, Michael Munn, 2020 (O'Reilly Media) - This book covers patterns for building reliable, scalable, and maintainable machine learning systems, with sections relevant to model deployment, serving, and infrastructure. Chapter 13 specifically discusses 'Prediction Serving'.
NVIDIA Deep Learning Performance Guide, NVIDIA Corporation, 2023 (NVIDIA Corporation) - Provides comprehensive guidance on optimizing deep learning training and inference performance on NVIDIA GPUs, covering software, hardware, and best practices.