NVIDIA NCCL Documentation, NVIDIA Corporation, 2024 - Presents specifications and usage information for the NVIDIA Collective Communications Library, a core component for high-performance GPU communication.
Communication-Efficient Distributed Deep Learning: A Comprehensive Survey, Ruian Jiang, Lei Yang, Jianyi Zhang, Zibin Zheng, 2020ACM Computing Surveys, Vol. 53 (Association for Computing Machinery (ACM))DOI: 10.1145/3389437 - A systematic review of communication strategies in distributed deep learning, covering data and model parallelism and the influence of network design.