High-Bandwidth Interconnects for Distributed Systems
Was this section helpful?
NVIDIA NVLink, NVIDIA Corporation, 2023 (NVIDIA Corporation) - Provides an authoritative technical overview of NVLink and NVSwitch technologies, explaining their role in high-bandwidth intra-node GPU communication.
What is RDMA?, Rick Merritt, 2020 (NVIDIA Blog) - Offers a clear, authoritative explanation of Remote Direct Memory Access (RDMA) technology and its benefits for high-performance, low-latency communication in distributed systems, especially relevant for InfiniBand.
A Survey of Communication-Efficient Distributed Deep Learning Methods, Haibin Lin, Jin-Hau Li, Xiaogang Zhang, and Bin Luo, 2020Journal of Parallel and Distributed Computing, Vol. 146 (Elsevier)DOI: 10.1016/j.jpdc.2020.08.012 - Offers a comprehensive review of methods to improve communication efficiency in distributed deep learning, providing context on why high-bandwidth interconnects are essential.