On the difficulty of training Recurrent Neural Networks, Razvan Pascanu, Tomas Mikolov, Yoshua Bengio, 2013, Proceedings of the 30th International Conference on Machine Learning (ICML), Vol. 28 - Introduces gradient clipping, which stabilizes the training of recurrent neural networks by addressing the exploding gradient problem.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A deep learning textbook with detailed explanations of gradient problems and their solutions.