Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks, Nils Reimers and Iryna Gurevych, 2019Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (Association for Computational Linguistics (ACL))DOI: 10.18653/v1/D19-1410 - 本文介绍了Sentence-BERT,一种生成具有语义意义的句子嵌入的方法,该方法对于基于余弦相似度的语义搜索(RAG系统中的核心技术)非常有效。
Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, 2008 (Cambridge University Press) - 一本基础教科书,对向量空间模型、余弦相似度和信息检索的理论基础进行了详细阐述,与语义搜索直接相关。其中第六章尤其有价值。
Approximate Nearest Neighbor Search, Alexandr Andoni, Piotr Indyk, Thijs Laarhoven, Ilya Razenshteyn, Ludwig Schmidt, 2021Communications of the ACM, Vol. 64 (Association for Computing Machinery)DOI: 10.1145/3472066 - 这篇综述文章全面介绍了近似最近邻(ANN)搜索算法,这些算法对于在大规模向量数据库中高效扩展相似度搜索至关重要。
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksa Gordić, Vladimir Karpukhin, Katja Filippova, Marc'Aurelio Ranzato, Mihai S. Rădulescu, Mike Lewis, Yann N. Dauphin, and Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS) (NeurIPS Foundation)DOI: 10.5591/neurips.2020.00949 - 这篇基础论文介绍了检索增强生成(RAG)框架,详细阐述了如何将依赖相似度搜索的检索组件与大型语言模型相结合,以提高性能。