Milvus: A Purpose-Built Vector Data Management System, Gu, Xinyu and Song, Jun and Zheng, Xiaomeng and Li, Bowen and Zhang, Bin and Chen, Xiaohan and Su, Jincheng and Wu, Weiping and Zhang, Chao and Chen, Xiaolong and Jiang, Gang and Hu, Yu, 2021Proceedings of the VLDB Endowment, Vol. 14 (VLDB Endowment)DOI: 10.14778/3476239.3476274 - This paper details the architecture of Milvus, an open-source vector database, which includes sharding strategies and distributed query processing for large-scale vector search. It provides insights into how a production vector search system handles scalability.
Designing Data-Intensive Applications, Martin Kleppmann, 2017 (O'Reilly Media) - Chapter 6, 'Partitioning Data,' offers an explanation of various sharding (partitioning) strategies, including hash-based and range-based methods, load balancing, and rebalancing. These are general principles applicable to distributed vector indexes.