Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - This comprehensive textbook provides a fundamental grounding in deep learning, including detailed explanations of training workflows, regularization techniques, optimization algorithms, and practical considerations for building robust models.
Adam: A Method for Stochastic Optimization, Diederik P. Kingma and Jimmy Ba, 2015International Conference on Learning Representations (ICLR)DOI: 10.48550/arXiv.1412.6980 - Introduces the Adam optimization algorithm, widely used in deep learning training, demonstrating its effectiveness in achieving fast convergence and good generalization on various tasks.
PyTorch Documentation, PyTorch Developers, 2024 - The official PyTorch documentation provides detailed information on implementing deep learning models, including modules for defining networks (torch.nn), optimizers (torch.optim), data loading utilities, and the usage of model.train() and model.eval() within the training loop.