Correlating Framework Operations to Compiled Kernels
Was this section helpful?
MLIR: A Compiler Infrastructure for the End of Moore's Law, Chris Lattner, Mehdi Amini, Uday Bondhugula, Albert Cohen, Andy Davis, Jason Eckhardt, River Riddle, Tate Ulsaker, Sven Verdoolaege, 2021ACM Transactions on Programming Languages and Systems (TOPLAS), Vol. 43 (Association for Computing Machinery)DOI: 10.1145/3472064 - Explains the design of MLIR, a multi-level IR, which is for propagating metadata and source location information through compiler transformations, supporting the correlation of high-level operations to generated code.
NVIDIA Nsight Systems Documentation, NVIDIA Corporation, 2025 (NVIDIA Corporation) - Provides guidance on using Nsight Systems for profiling GPU workloads, specifically detailing the use of NVTX ranges for explicit runtime event annotation and correlation with high-level framework operations.
PyTorch Profiler, PyTorch Developers, 2020 (PyTorch Foundation) - Official guide to the PyTorch Profiler, demonstrating its capabilities for analyzing CPU and GPU performance, including correlation of PyTorch operators with underlying CUDA kernel launches and source code.