CUDA C++ Programming Guide, NVIDIA Corporation, 2024 (NVIDIA Corporation) - Provides fundamental information on GPU architecture and parallel programming techniques using CUDA C++.
PyTorch Geometric Documentation, Matthias Fey and Jan E. Lenssen, 2024 - Offers practical instructions and examples for using PyTorch Geometric, including GPU acceleration.
Deep Graph Library (DGL) Documentation, Deep Graph Library Contributors, 2024 - Contains resources for effectively utilizing the Deep Graph Library, covering GPU acceleration and graph handling.
Mixed-Precision Training, Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory Diamos, Erich Elsen, David Garcia, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, Hao Wu, 2018ICLR 2018DOI: 10.48550/arXiv.1710.03740 - Introduces the method of mixed-precision training using half-precision floating points (FP16) to reduce memory consumption and accelerate deep learning models.