Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - This foundational textbook introduces optimization algorithms, including detailed explanations of gradient descent and its variants in the context of deep learning.
Lecture Notes: Gradient Descent, Andrew Ng, Tengyu Ma, 2023 (Stanford University) - These widely-cited lecture notes from Stanford University's Machine Learning course offer a clear and concise explanation of gradient descent as a fundamental optimization algorithm.
Numerical Optimization, Jorge Nocedal and Stephen J. Wright, 2006 (Springer)DOI: 10.1007/978-0-387-40065-9 - This authoritative textbook on optimization methods provides a rigorous mathematical treatment of gradient descent, discussing its theoretical underpinnings, convergence properties, and various forms.