Learning Spark: Lightning-Fast Data Analytics, Jules S. Damji, Brooke Wenig, Tathagata Das, Denny Lee, 2020 (O'Reilly Media) - A guide to Apache Spark, explaining its architecture and APIs for distributed batch and stream processing, which are essential for scaling monitoring computations and detailed historical analysis.
MLOps Engineering at Scale, Carl Osipov, Nick Schwendeman, Benjamin Shaver, 2022 (O'Reilly Media) - This book provides current practices for MLOps, including methods for building scalable and reliable monitoring systems for machine learning models in production, covering architectural patterns and relevant tools.