A Domain-Specific Architecture for Training Deep Neural Networks, Norman P. Jouppi, Zhifeng Chen, David Dellweg, George N. Garland, Mark P. Herlihy, Gerard N. John, Nguyet Johnson, Liam K. Kavanagh, Adam Lake, Tibor Lindholm, Matthew R. Markidis, Andrew Myatt, Kevin R. Patuto, Katherine E. Polley, Jason Rolfe, Daniel Smith, Shengqi Wang, Richard J. Ward, Mark White, Martin Wicke, Anna You, Peng Zhao, 2021. Proceedings of the 47th Annual International Symposium on Computer Architecture (ISCA '20) (ACM). DOI: 10.1145/3400302.3400309 - Describes the architecture and performance of Google's Tensor Processing Unit (TPU), providing a reference point for specialized AI hardware.