Scaling Vector Search: Sharding, Replication, and Indexing
Product Quantization for Nearest Neighbor Search, Hervé Jégou, Matthijs Douze, and Cordelia Schmid, 2011. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 33 (IEEE). DOI: 10.1109/TPAMI.2010.207 - The original paper detailing Product Quantization (PQ), a fundamental technique for compressing high-dimensional vectors to reduce memory footprint.