NVIDIA TensorRT Developer Guide, NVIDIA Corporation, 2023 (NVIDIA Corporation) - The official developer guide for NVIDIA TensorRT. It details its architecture, optimization techniques like layer fusion, precision calibration, kernel auto-tuning, and deployment workflow.
OpenVINO Documentation, Intel Corporation, 2023 (Intel Corporation) - The official documentation for the Intel OpenVINO Toolkit. It covers the Model Optimizer, Inference Engine, and hardware-specific optimizations for Intel platforms.
The Deep Learning Compiler: A Unified Solution for Hardware Acceleration, Hao Wang, Dit-Yan Yeung, 2020ACM Computing Surveys, Vol. 53 (Association for Computing Machinery)DOI: 10.1145/3409383 - A comprehensive survey that introduces the architecture, optimization techniques, and challenges in deep learning compilers, including graph optimization and hardware-specific code generation.