Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive academic textbook covering optimization techniques like gradient descent as a core component of deep learning.
CS229 Lecture Notes: Supervised Learning, Andrew Ng, Tengyu Ma, 2023Stanford University CS229 Course Materials - Provides a clear and academic explanation of gradient descent within the context of supervised machine learning.
Numerical Optimization, Jorge Nocedal and Stephen J. Wright, 2006 (Springer)DOI: 10.1007/978-0-387-40065-5 - A rigorous and authoritative textbook on optimization methods, including detailed mathematical foundations of gradient-based algorithms.