ApX logoApX logo
Reinforcement Learning from AI Feedback (RLAIF)