# NOTE
This supplementary package contains the core code for our TRM framework, covering:
- TRM training pipeline
- Asynchronous tool calling mechanism
- TRM reward calculation
- Integration of TRM with PPO and GRPO

We plan to release the complete codebase once conference and institutional guidelines permit.