Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems, Vol. 33 (Neural Information Processing Systems Foundation, Inc.)DOI: 10.55917/cb.2023-1123 - 介绍了检索增强生成(RAG)架构,这是理解本节讨论的系统数据加载的基础。
Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2023 (Online (3rd Edition Draft)) - 权威NLP教材的第10章“信息抽取”,讨论从文本中抽取结构化信息和元数据的方法,与RAG数据准备相关。