Alpaca: A Strong Open-Source Instruction-Following Model, Rohan Taori, Ishaan Gulrajani, Tianyi Zhang, Yann Dubois, Xuechen Li, Carlos Guestrin, Percy Liang, Tatsunori B. Hashimoto, 2023 (Stanford CRFM) - 描述了Alpaca数据集,该数据集使用Self-Instruct方法生成,用于指令遵循模型。
OpenAssistant Conversations - A New Dataset for Open-Source Instruction Tuning, Andreas Köpf, Yannic Kilcher, Dimitri von Rütte, Sotiris Anagnostidis, Zhi-Rui Tam, Keith Stevens, Abdullah Barhoum, Nguyen Minh Duc, Oliver Stanley, Richárd Nagyfi, Shahul ES, Sameer Suri, David Glushkov, Arnav Dantuluri, Andrew Maguire, Christoph Schuhmann, Huu Nguyen, Alexander Mattick, 2023NeurIPS 2023 Datasets and BenchmarksDOI: 10.48550/arXiv.2304.07327 - 介绍了OpenAssistant Conversations数据集,这是一个大型众包数据集,用于多轮指令调优。