Weight Initialization Strategies for Deep Networks
Understanding the difficulty of training deep feedforward neural networks, Xavier Glorot, Yoshua Bengio, 2010, Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS), Vol. 9 (JMLR.org) - Introduces Xavier initialization for stable signal propagation in deep networks with symmetric activation functions.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A textbook covering the foundations of deep learning, including discussions on weight initialization and gradient issues.
torch.nn.init, PyTorch Contributors, 2022 (PyTorch Foundation) - Official PyTorch documentation for initialization methods, including kaiming_normal_ and xavier_uniform_, with usage examples.
Keras initializers API, TensorFlow Authors, 2024 (TensorFlow) - Official Keras documentation detailing the available weight initializers, such as HeNormal and GlorotUniform.