Dive into Deep Learning, Aston Zhang, Zack C. Lipton, Mu Li, and Alex Smola, 2024 (Cambridge University Press) - This open-source book offers practical guidance on deep learning, implicitly covering the computational demands that necessitate distributed training for modern large-scale models and datasets.
Distributed communication package - torch.distributed, PyTorch Documentation, 2024 (PyTorch Foundation) - This official documentation introduces the core concepts of distributed training in PyTorch, reflecting the industry's response to the challenges of scaling model and data sizes.