Compiler and Runtime Support for Sparse Operations
Was this section helpful?
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, Richard Barrett, Michael Berry, Tony F. Chan, Jack Demmel, James Donato, June Dongarra, Victor Eijkhout, Roldan Pozo, Charles Romine, and Henk van der Vorst, 1994 (SIAM)DOI: 10.1137/1.9781611971538 - Covers standard sparse matrix storage formats (CSR, CSC, COO) and fundamental sparse linear algebra algorithms, which are essential for understanding compiler/runtime implementation.
NVIDIA Ampere Architecture Whitepaper, NVIDIA, 2020 (NVIDIA) - Provides a detailed description of the NVIDIA Ampere architecture, including the introduction and function of Sparse Tensor Cores for accelerating sparse deep learning computations.
Deep Learning with Sparse Tensors: A Survey, Xiaoming Sun, Yining Ma, Yifei Li, Junbo Zhao, and Jianling Liu, 2020Tsinghua Science and Technology, Vol. 25 (IEEE)DOI: 10.26599/TST.2020.9010046 - Comprehensive survey on sparse tensor operations in deep learning, covering representation, computation, and optimization techniques relevant for both compilers and runtimes.