Python toolkit for building production-ready LLM applications. Modular utilities for prompts, RAG, agents, structured outputs, and multi-provider support.
Hugging Face Optimum Documentation, Hugging Face, 2024 (Hugging Face) - Official documentation for Hugging Face Optimum, offering practical guides, tutorials, and API references for optimizing and quantizing models using various advanced techniques, including GPTQ and AWQ.