Mathematics for Machine Learning, Marc Peter Deisenroth, A. Aldo Faisal, Cheng Soon Ong, 2020 (Cambridge University Press) - Provides a solid mathematical foundation for machine learning, including coverage of calculus, partial derivatives, and optimization techniques relevant to cost functions.
Calculus: Early Transcendentals, James Stewart, 2015 (Cengage Learning) - A widely recognized textbook for multivariable calculus, offering rigorous explanations of partial derivatives, the chain rule, and the concept of gradients.
Lecture Notes: Linear Regression (CS229), Andrew Ng, Tengyu Ma, 2023Stanford University CS229 Lecture Notes (Stanford University) - These lecture notes from a renowned machine learning course detail the derivation of cost functions and their gradients for linear regression, a direct application of the section's content.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Chapter 4 covers numerical computation, including gradient-based optimization and the mathematical foundations for calculating gradients of objective functions.