A Cloud-Scale Architecture for Accelerating AI, Norman P. Jouppi, Zhifeng Chen, David Pattison, Carla J. del Rosario, Nils Hogstrom, David A. Huffman, Daniel F. Kramer, Andrew B. New, Christopher B. Perry, Evan E. Radecki, Timothy B. Smith, Ryan M. Some, David R. Turnbull, Andrew J. Veit, John W. Weigelt, Dean M. Wilkes, Cliff C. Young, Yifeng Chao, Andrew L. Chien, Patrick K. H. Chiang, John A. Gunnels, Mark R. O'Connor, Anant N. Agarwal, Jeffrey Dean, Paul N. Hilfinger, Jeffrey S. Riegel, Edward A. Sacks, Manjot Singh, Stephen W. Smith, Jonathan S. Taylor, David L. Wells, Kenneth C. Yocum, 2021. Proceedings of the 48th Annual International Symposium on Computer Architecture (ISCA '21) (ACM). DOI: 10.1145/3465033.3467614 - Introduces the architecture of Google's Tensor Processing Unit (TPU), which is designed specifically for cloud-scale AI workloads, and details its dedicated interconnect and processing capabilities.