Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - 全面介绍深度学习基本概念的教材,包括本节讨论的各种优化算法。
On the Importance of Initialization and Momentum in Deep Learning, Ilya Sutskever, James Martens, George Dahl, and Geoffrey Hinton, 2013Proceedings of the 30th International Conference on Machine Learning (ICML), Vol. 28 (PMLR) - 探讨动量在加速深度神经网络训练和改善收敛方面作用的重要论文。