Mixed-Precision Training for Deep Neural Networks, Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory Diamos, Erich Elsen, David Garcia, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, Hao Wu, 2018International Conference on Learning Representations (ICLR)DOI: 10.48550/arXiv.1710.03740 - 介绍深度神经网络混合精度训练的开创性论文,涵盖FP16的使用、损失缩放以防止下溢,以及其在带有Tensor Cores的NVIDIA GPU上的优势。