Introduction to ML Compiler Optimization
章节 1: The ML Compilation Stack
The Framework-Hardware Gap
Anatomy of an ML Compiler
AOT versus JIT Compilation
Tracing and Graph Capture
Environment Setup Practice
章节 2: Intermediate Representations
Role of Intermediate Representation
Dataflow Graphs and Dependencies
Static versus Dynamic Shapes
章节 3: Graph-Level Optimizations
Operator Fusion Strategies
Constant Folding and Propagation
Memory Layout Transformation
Common Subexpression Elimination
Implementing a Graph Pass
章节 4: Kernel and Loop Optimization
Loop Tiling and Cache Locality
Loop Unrolling and Reordering
Parallelization Strategies
Matrix Multiplication Practice
章节 5: Auto-Tuning and Code Generation
Defining the Search Space
Cost Models in Auto-Tuning
Automated Schedule Search
Running an Auto-Tuning Session