In-Datacenter Performance Analysis of a Tensor Processing Unit, Norman P. Jouppi, Cliff Young, Nishant Patil, David A. Patterson, et al., 2017Proceedings of the 44th ACM/IEEE Annual International Symposium on Computer Architecture (ISCA '17) (IEEE and ACM)DOI: 10.1145/3079856.3080246 - This seminal paper introduces the Google Tensor Processing Unit (TPU), detailing its architecture, the systolic array, and its performance characteristics for machine learning inference workloads.
Tensor Processing Units (TPUs) | Google Cloud, Google Cloud, 2023 (Google Cloud) - Official documentation providing current details on Google's TPUs, their capabilities, and how to utilize them within the Google Cloud ecosystem.
The Deep Learning Hardware Landscape, Albert Reuther, Jeremy Kepner, Andrew P. Sage, Jeremy R. Holland, Robert T. Bond, Jeffrey K. Oddson, Roger Pearce, Charles M. Leiserson, William M. McMahon, and Peter A. Michaleas, 2021MIT Lincoln Laboratory Journal, Vol. 22 (MIT Lincoln Laboratory) - This survey article presents an overview of hardware options for deep learning, discussing the strengths and weaknesses of CPUs, GPUs, and specialized ASICs, illustrating their respective trade-offs.