Reinforcement Learning for Large Language Model Fine-Tuning: A Systematic Literature Review

Published: 27 Nov 2025, Last Modified: 26 May 2026, Crossref, CC BY-SA 4.0
Abstract: Large Language Models (LLMs) have been developed for a wide range of language-based tasks, while Reinforcement Learning (RL) has been primarily applied to decision-making problems such as robotics, game theory, and control systems. Nowadays, these two paradigms are integrated through different...