Quantization (beta), PyTorch Documentation Team, 2019 - Official documentation explaining PyTorch's quantization support, including practical details on per-tensor and per-channel quantization schemes and their implementation.