Flux.jl Documentation: Training Models, The Flux.jl Contributors, 2025 - Offers practical details on gradient computation and application in Flux.jl's training, covering Flux.params and Zygote.gradient.
Zygote.jl Documentation: Gradients, The Zygote.jl Contributors, 2023 - Explains Zygote.jl's automatic differentiation, covering Zygote.gradient and Zygote.pullback.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Presents deep learning fundamentals, including automatic differentiation, backpropagation, and gradient-based optimization.
Neural Networks and Deep Learning, Michael A. Nielsen, 2015 (Determination Press) - Clear, step-by-step explanation of backpropagation and gradient calculation in neural networks.