Mathematics for Machine Learning, Marc Peter Deisenroth, A. Aldo Faisal, and Cheng Soon Ong, 2020 (Cambridge University Press) - Provides the mathematical rigor behind machine learning algorithms, including detailed explanations of derivatives, gradients, and optimization techniques essential for understanding linear regression training.
Machine Learning, Andrew Ng, 2012Coursera (DeepLearning.AI and Stanford Online) - A widely acclaimed online course that clearly demonstrates how simple linear regression is formulated and optimized using gradient descent, providing practical examples and intuition.