MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization

Published: 2025, Last Modified: 15 Jan 2026ICLR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading