Optimize TensorFlow performance with XLA, TensorFlow authors, 2024 (TensorFlow (Google)) - Official guide explaining XLA's role in TensorFlow, how to enable it, its advantages, and practical considerations for performance.
TVM: An Automated End-to-End Optimizing Compiler for Deep Learning, Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy, 2018, Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI '18) (USENIX Association) - Presents another prominent domain-specific compiler for deep learning, illustrating similar graph optimization techniques (e.g., fusion) that are also employed by XLA.