Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Provides an introduction to automatic differentiation in the context of deep learning, discussing gradient computation and potential issues with non-differentiable activation functions or operations.