Reward Loop
===========

Last updated: 10/10/2025.

.. warning::
   Reward Loop is still under development.

Reward Loop is designed to make reward computation more flexible and easier to use.

**Design goals**:

- Support more efficient reward computation through an asynchronous design
- Provide a more flexible reward model interface for user-customized reward functions
- Provide request-level load balancing across multiple reward servers
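
To make the first and third goals concrete, the following is a minimal sketch, not the Reward Loop API. It assumes a round-robin policy for request-level load balancing (the actual policy is not specified here), and the names ``SERVERS``, ``score_on_server``, and ``compute_rewards`` are hypothetical. The network call to a reward server is simulated with ``asyncio.sleep``.

```python
import asyncio
import itertools

# Hypothetical server names; not part of the Reward Loop API.
SERVERS = ["reward-server-0", "reward-server-1"]


async def score_on_server(server: str, response: str) -> dict:
    """Stand-in for a network call to a reward server (hypothetical)."""
    await asyncio.sleep(0.01)  # simulates request latency
    # Toy reward: the length of the response, for illustration only.
    return {"server": server, "score": float(len(response))}


async def compute_rewards(responses: list[str]) -> list[dict]:
    # Request-level round-robin (an assumed policy): each individual
    # request is assigned to the next server in turn.
    assignment = zip(itertools.cycle(SERVERS), responses)
    tasks = [score_on_server(server, resp) for server, resp in assignment]
    # All requests are awaited concurrently; gather preserves input order.
    return await asyncio.gather(*tasks)


results = asyncio.run(compute_rewards(["a", "bb", "ccc", "dddd"]))
```

Because the requests are awaited concurrently, the total wall-clock time in this sketch is close to the latency of a single request rather than the sum of all of them, which is the kind of efficiency gain an asynchronous design targets.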
