Adam: A Method for Stochastic Optimization, Diederik P. Kingma, Jimmy Ba, 2015International Conference on Learning Representations (ICLR 2015)DOI: 10.48550/arXiv.1412.6980 - The original research paper introducing the Adam optimizer, detailing its algorithm, bias correction, and empirical performance.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A comprehensive textbook that covers the theoretical foundations and practical aspects of deep learning, including a detailed explanation of Adam and other optimization algorithms in Chapter 8.