Python toolkit for building production-ready LLM applications. Modular utilities for prompts, RAG, agents, structured outputs, and multi-provider support.
Was this section helpful?
Post Training Quantization (PTQ), PyTorch Development Team, 2019 (PyTorch Foundation) - Official documentation for PyTorch's Post-Training Quantization module, providing practical details on PTQ algorithms and their implementation.
Post-training quantization, Google AI for Developers, 2023 - A practical guide from Google Developers explaining the principles and common algorithms of post-training quantization, including MinMax and KL divergence methods.