Debugging Training Issues Related to Optimization/Regularization
Was this section helpful?
Dive into Deep Learning, Aston Zhang, Zack C. Lipton, Mu Li, and Alex Smola, 2024 (Cambridge University Press) - Provides a comprehensive, practical guide to deep learning, including detailed explanations of optimization and regularization methods, common training pitfalls, and debugging strategies with executable code.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A foundational textbook covering the mathematical and conceptual background of deep learning, including extensive sections on optimization algorithms, regularization techniques, and practical guidelines for training neural networks.
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, Sergey Ioffe and Christian Szegedy, 2015Proceedings of the 32nd International Conference on Machine Learning (ICML), Vol. 37 (PMLR (Proceedings of Machine Learning Research))DOI: 10.5555/3045118.3045167 - Introduces Batch Normalization, a technique vital for stabilizing and accelerating deep network training by addressing internal covariate shift, directly impacting gradient stability and regularization interactions.