Product Quantization for Nearest Neighbor Search, Hervé Jégou, Matthijs Douze, Cordelia Schmid, 2011IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 33 (IEEE Computer Society)DOI: 10.1109/TPAMI.2010.57 - Describes the fundamental Product Quantization technique for efficient approximate nearest neighbor search, crucial for reducing memory footprint.
DiskANN: Fast Accurate Billion-point Nearest Neighbor Search on a Single Node, Suhas Jayaram, Ravishankar Krishnaswamy, Somesh Singh, Advitya Gemawat, Shivani Agarwal, Karthikeyan Ramasamy, Sanjog Panda, 2019Proceedings of the 36th International Conference on Machine Learning (ICML), Vol. 97 (Proceedings of Machine Learning Research (PMLR))DOI: 10.5591/978-1-5108-9635-4_17 - Presents an ANN algorithm optimized for large datasets residing primarily on SSDs, offering significant memory cost reduction at the trade-off of higher latency.
AWS Well-Architected Framework - Cost Optimization Pillar, Amazon Web Services, 2024 (Amazon Web Services) - Provides architectural best practices for optimizing costs in cloud environments, covering resource selection, autoscaling, and operational efficiency.