© 2025 ApX Machine Learning

Training Objectives for Reward Models (Bradley-Terry)