What Makes a Reward Model a Good Teacher? An Optimization Perspective

Publication
Advances in Neural Information Processing Systems (NeurIPS), 2025