Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive introduction to gradient descent and its variants, useful for understanding its mathematical basis in machine learning.
Mathematics for Machine Learning, Marc Peter Deisenroth, A. Aldo Faisal, and Cheng Soon Ong, 2020 (Cambridge University Press), DOI: 10.1017/9781108679901 - Covers calculus concepts, including derivatives and gradient descent, foundational for understanding machine learning algorithms.