Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Provides a comprehensive academic treatment of gradient descent and its variants, suitable for readers seeking a detailed understanding of the algorithm's mathematical underpinnings and practical applications in deep learning.
Mathematics for Machine Learning, Marc Peter Deisenroth, A. Aldo Faisal, and Cheng Soon Ong, 2020 (Cambridge University Press) - Connects foundational mathematics with machine learning, with dedicated sections on optimization and derivatives essential for understanding gradient descent.