The Supervised Learning Phase (Critique and Revision)
Self-Refine: Iterative Refinement with Self-Feedback, Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Shashank Gupta, Bodhisattwa Prasad Majumder, Katherine Hermann, Sean Welleck, Amir Yazdanbakhsh, Peter Clark, 2023. arXiv preprint arXiv:2303.17651. DOI: 10.48550/arXiv.2303.17651 - This paper presents a general framework for iterative self-correction in LLMs, which provides a broader conceptual understanding of the critique and revision loop used in Constitutional AI.