Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems, Vol. 33DOI: 10.48550/arXiv.2005.11401 - Introduces the foundational architecture and principles of Retrieval-Augmented Generation, essential for understanding RAG system design and its potential failure points.
Dense Passage Retrieval for Open-Domain Question Answering, Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih, 2020Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)DOI: 10.48550/arXiv.2004.04906 - Introduces Dense Passage Retrieval (DPR), a foundational method for the retrieval component of RAG, impacting the effectiveness of finding relevant chunks and mitigating retrieval failures.