Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

Publication
International Conference on Learning Representations (ICLR), 2025

Related