Mixed-Precision Training, Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory Diamos, Erich Elsen, David Garcia, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, Hao Wu, 2018International Conference on Learning RepresentationsDOI: 10.48550/arXiv.1710.03740 - 介绍了混合精度训练的概念及FP16的损失缩放等核心技术。
Training with BFloat16 on NVIDIA GPUs, Nikolaos Markidis, Andrew P. Overman, Michael Garland, and Jan-Dirk Wegner, 2020 (NVIDIA) - 描述了BF16的设计、相对于FP16的优点以及在NVIDIA GPU上的实现。