CUDA C++ Programming Guide, NVIDIA, 2024 (NVIDIA) - The authoritative guide to the CUDA C++ programming model, covering kernel execution, memory management, synchronization, and performance optimization techniques essential for writing custom CUDA kernels.
Professional CUDA C Programming, John Cheng, Max Grossman, and Ty McKercher, 2014 (Wiley) - A comprehensive book on CUDA architecture, programming models, and advanced techniques for optimizing GPU applications, all directly applicable to writing efficient custom CUDA extensions.