Approaches to Maximize Data Originality and Variety
New · Open Source
Kerb - LLM Development Toolkit
Python toolkit for building production-ready LLM applications. Modular utilities for prompts, RAG, agents, structured outputs, and multi-provider support.
Was this section helpful?
Self-Instruct: Aligning LLMs with Your Own Data, Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi, 2022ACL 2023DOI: 10.48550/arXiv.2212.10560 - Introduces Self-Instruct, a framework for improving LLM instruction-following by having the model generate its own instructions and outputs, which inherently boosts diversity.