Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Provides a comprehensive theoretical and practical explanation of Batch Normalization within the context of deep learning optimization.
Neural Networks Part 3: Learning and Evaluation, Stanford University CS231n Course Staff, 2022 - Explains Batch Normalization and its practical implications, including benefits for training deep networks, with clear illustrations.