Iterative Self-training for Code Generation via Reinforced Re-ranking

Published: 01 Jan 2025, Last Modified: 11 May 2025ECIR (3) 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Generating high-quality code that solves complex programming tasks is challenging, especially with current decoder-based models that produce highly stochastic outputs. In code generation, even minor errors can easily break the entire solution. Leveraging multiple sampled solutions can significantly improve the overall output quality.
Loading