Optimization with Stochastic Gradient Descent (SGD)
Was this section helpful?
Matrix Factorization Techniques for Recommender Systems, Yehuda Koren, Robert Bell, Chris Volinsky, 2009Computer, Vol. 42 (IEEE Computer Society)DOI: 10.1109/MC.2009.263 - This paper introduced and popularized matrix factorization with SGD for recommendation systems, especially after its success in the Netflix Prize. It describes the objective function and iterative optimization process.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - Chapter 8 covers optimization algorithms, including SGD, and various regularization methods. This material is fundamental for understanding the underlying mechanics of iterative model training.
Matrix Factorization Methods for Recommender Systems, Yehuda Koren, Robert Bell, 2015 (Springer)DOI: 10.1007/978-1-4939-2708-2_5 - This chapter from the Recommender Systems Handbook (2nd Edition) provides a detailed overview of matrix factorization in recommender systems, explaining various models and optimization strategies, including the use of SGD.
Lecture Notes on Gradient Descent, CS229, Andrew Ng, 2022 - These well-regarded lecture notes offer a clear introduction to gradient descent, providing foundational understanding applicable to SGD optimization in matrix factorization.