Python toolkit for building production-ready LLM applications. Modular utilities for prompts, RAG, agents, structured outputs, and multi-provider support.
Product Quantization for Nearest Neighbor Search, Herve Jégou, Matthijs Douze, and Cordelia Schmid, 2011IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 33 (IEEE)DOI: 10.1109/TPAMI.2010.57 - Introduces Product Quantization, a fundamental technique for compressing high-dimensional vectors, directly relevant to memory optimization and scalability.