Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS), Vol. 33 (NeurIPS)DOI: 10.5591/978-1-950305-24-1_223 - Introduces Retrieval-Augmented Generation (RAG), providing the foundational context for efficient data retrieval to enhance LLM responses.
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks, Nils Reimers, Iryna Gurevych, 2019Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP)DOI: 10.18653/v1/D19-1410 - Describes Sentence-BERT, a model for generating semantically meaningful sentence embeddings, directly relevant to the 'Role of Embeddings' section.
Approximate Nearest Neighbor Search in High Dimensions: A Survey, Xin Dong, Ben Moses, Yifan Li, 2019arXiv preprint arXiv:1905.10519 - A survey providing an overview of techniques and algorithms for efficient approximate nearest neighbor search, fundamental to vector stores and similarity search.