Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks, Nils Reimers and Iryna Gurevych, 2019Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)DOI: 10.48550/arXiv.1908.10084 - This paper introduces Sentence-BERT, a modification of BERT that yields semantically meaningful sentence embeddings useful for tasks like semantic similarity search.
Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, 2008 (Cambridge University Press) - This widely used textbook provides fundamental concepts of information retrieval, including vector space models and ranking, which are pertinent to understanding semantic search.
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems 33 (NeurIPS 2020), Vol. 33DOI: 10.48550/arXiv.2005.11401 - This paper presents the Retrieval-Augmented Generation (RAG) framework, combining a pre-trained retriever with a generator for better knowledge-intensive NLP tasks, directly relevant to the mention of RAG systems.