Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A fundamental textbook that covers the mathematical foundations of neural networks, including matrix operations, which are central to LLM calculations.
NVIDIA Ampere Architecture In-Depth, Ronny Krashinsky, Olivier Giroux, Stephen Jones, Nick Stam, and Sridhar Ramaswamy, 2020, NVIDIA Technical Blog (NVIDIA) - An official technical overview of a modern GPU architecture designed for AI, detailing its capabilities for the high FLOPS throughput that deep learning workloads depend on.