Approaches to Maximize Data Originality and Variety
Was this section helpful?
Self-Instruct: Aligning LLMs with Your Own Data, Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi, 2022ACL 2023DOI: 10.48550/arXiv.2212.10560 - Introduces Self-Instruct, a framework for improving LLM instruction-following by having the model generate its own instructions and outputs, which inherently boosts diversity.