In-Datacenter Performance Analysis of a Tensor Processing Unit, Norman P. Jouppi, Cliff Young, David Patil, Dustin Patterson, Gaurav Agrawal, Haixin Liao, Kevin Schardl, Mike Smith, Dave Washington, Zhifeng Chen, Yuan Cao, Branden Holt, Luke Taylor, Alan Knaggs, Rohan Kumar, Sanjay Nigam, Guang Gao, Al Gleason, Robert Horton, Daniel Kini, Danny Kwon, David Leyda, Francois Obermeyer, Rajkumar Rathod, Kevin Ren, Aditya Taraporevala, Grant Vert, Xinan Long, Niels Dekker, Andy Hölzle, Quoc Le, Chengyu Sun, 2017, Proceedings of the 44th Annual International Symposium on Computer Architecture (ISCA '17) (ACM), DOI: 10.1145/3079893.3080246 - Describes the architecture and performance characteristics of Google's first-generation Tensor Processing Unit (TPU).