Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS)DOI: 10.48550/arXiv.2005.11401 - 介绍检索增强生成(RAG)框架的开创性论文,展示了其架构以及结合检索器和生成器的有效性。
Dense Passage Retrieval for Open-Domain Question Answering, Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih, 2020Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)DOI: 10.48550/arXiv.2004.04906 - 关于使用密集向量嵌入进行高效准确的段落检索的开创性工作,这对于RAG检索器的“搜索”阶段至关重要。