Designing Machine Learning Systems: An Iterative Process for Production-Ready AI, Chip Huyen, 2022 (O'Reilly Media) - A comprehensive guide to designing and building machine learning systems, covering the entire lifecycle from data to deployment, with significant focus on hardware considerations for training and inference.
Memory Saving Techniques for Training Large Language Models, Paulius Micikevicius, NVIDIA Corporation, 2023NVIDIA Developer Blog (NVIDIA) - Discusses advanced techniques and best practices to optimize GPU memory usage, particularly for training large language models, directly addressing VRAM constraints and calculation methods.