A Domain-Specific Compiler for Tensor Processing Units, Heejin Jo, Andrew Siena, Mark Weiser, Nicholas P. Johnson, Robert S. French, Paul M. Smith, Kevin S. Lee, Cliff L. Biffle, Eric S. Chung, and Norman P. Jouppi, 2021Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '21) (ACM)DOI: 10.1145/3445814.3446700 - 研究了 XLA 编译器如何针对 Google 的张量处理单元 (TPU) 进行优化,提供了针对专用硬件的后端特定编译器策略的示例。