NVIDIA TensorRT-LLM Documentation, NVIDIA, 2024 - Official documentation detailing the features, usage, and API of TensorRT-LLM for LLM deployment.
NVIDIA TensorRT Documentation, NVIDIA, 2024 - Official documentation for NVIDIA TensorRT, providing core concepts of the deep learning optimizer and runtime.