Compiler and Runtime Support for Sparse Operations
New ยท Open Source
Kerb - LLM Development Toolkit
Python toolkit for building production-ready LLM applications. Modular utilities for prompts, RAG, agents, structured outputs, and multi-provider support.
Was this section helpful?
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, Richard Barrett, Michael Berry, Tony F. Chan, Jack Demmel, James Donato, June Dongarra, Victor Eijkhout, Roldan Pozo, Charles Romine, and Henk van der Vorst, 1994 (SIAM)DOI: 10.1137/1.9781611971538 - Covers standard sparse matrix storage formats (CSR, CSC, COO) and fundamental sparse linear algebra algorithms, which are essential for understanding compiler/runtime implementation.
NVIDIA Ampere Architecture Whitepaper, NVIDIA, 2020 (NVIDIA) - Provides a detailed description of the NVIDIA Ampere architecture, including the introduction and function of Sparse Tensor Cores for accelerating sparse deep learning computations.
Deep Learning with Sparse Tensors: A Survey, Xiaoming Sun, Yining Ma, Yifei Li, Junbo Zhao, and Jianling Liu, 2020Tsinghua Science and Technology, Vol. 25 (IEEE)DOI: 10.26599/TST.2020.9010046 - Comprehensive survey on sparse tensor operations in deep learning, covering representation, computation, and optimization techniques relevant for both compilers and runtimes.