Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A foundational textbook covering neural network architectures, including explanations of activations and their role in computation.
Transformers Documentation: Large Model Training & Inference, Hugging Face, 2024 - Provides practical guidance on using LLMs, covering aspects like memory management, batching, and inference optimizations within the popular transformers library.