Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - This textbook provides a comprehensive theoretical background on optimization methods, including gradient descent, used in deep learning.
Neural Networks Part 3: Learning and Optimization, Stanford University CS231n Course Staff, 2023 - These widely-cited lecture notes offer a practical and intuitive explanation of optimization techniques, including gradient descent, specifically for training neural networks.
Neural Networks and Deep Learning, Michael Nielsen, 2019 - This online book provides an accessible introduction to neural network fundamentals, with a clear explanation of gradient descent and its role in learning.