Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS)DOI: 10.48550/arXiv.2005.11401 - Introduces the core Retrieval Augmented Generation (RAG) architecture, establishing the need for external data retrieval to augment LLMs, which sets the stage for document loading and splitting.
Document loaders, LangChain, 2024 (LangChain) - Official documentation describing various methods and tools for ingesting data from diverse sources into a format suitable for RAG applications.
Text splitters, LangChain, 2024 (LangChain) - Official documentation detailing strategies and implementations for breaking down large documents into smaller, manageable chunks, including different splitting techniques and overlap considerations.
Building LLM-Powered Applications, Andrew Catton, Shreyas Subramanian, Matthew Ready, and Karan Singh, 2024 (O'Reilly Media) - Provides practical guidance on developing LLM applications, including sections on data preparation, document processing, and retrieval strategies essential for RAG systems.