# Reward Functions

This module contains some useful reward functions, primarily intended for use with the [`GRPOTrainer`] and [`RLOOTrainer`].

## accuracy_reward

[[autodoc]] rewards.accuracy_reward

## reasoning_accuracy_reward

[[autodoc]] rewards.reasoning_accuracy_reward

## think_format_reward

[[autodoc]] rewards.think_format_reward

## get_soft_overlong_punishment

[[autodoc]] rewards.get_soft_overlong_punishment
