Learning representations by back-propagating errors, David E. Rumelhart, Geoffrey E. Hinton, and Ronald J. Williams, 1986Nature, Vol. 323 (Springer Nature)DOI: 10.1038/323533a0 - Foundational paper popularizing backpropagation, an application of reverse-mode automatic differentiation, for training neural networks.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Chapter 6 offers an accessible explanation of backpropagation, highlighting its role as reverse-mode AD in deep learning.