Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS 2020)DOI: 10.48550/arXiv.2005.11401 - Describes the original RAG architecture, emphasizing the role of a retrieved knowledge base and its implications for knowledge access.
Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, 2008 (Cambridge University Press) - A comprehensive textbook covering document representation, indexing, and retrieval methods, fundamental to building search and RAG systems.
Documents, LangChain, 2024 (LangChain) - Official documentation explaining the Document object structure and its role in data pipelines for LLM applications.
Retrieval-Augmented Generation (RAG) on Google Cloud, Google Cloud, 2024 (Google Cloud) - An official guide explaining the RAG architecture, including the initial data ingestion and knowledge base creation process on a cloud platform.