Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS)DOI: 10.48550/arXiv.2005.11401 - 该论文介绍了检索增强生成(RAG),一个结合预训练参数和非参数记忆的框架,使模型能够访问和利用外部知识,为许多知识库交互模式奠定了基础。
Toolformer: Language Models That Can Use Tools, Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, Thomas Scialom, 2023arXiv preprint arXiv:2302.04761DOI: 10.48550/arXiv.2302.04761 - 该论文介绍了Toolformer,一种通过自监督学习教授语言模型使用外部工具的方法,展示了模型如何学习调用API并整合其输出。