# C²GSPG: Confidence-calibrated Group Sequence Policy Gradient towards Self-aware Reasoning

This repository contains the implementation of the paper:
**C²GSPG: Confidence-calibrated Group Sequence Policy Gradient towards Self-aware Reasoning.**

It provides the experimental setups and source code for two categories of reasoning tasks:

## Mathematical Reasoning

The implementation is provided in the **`MATH_Code`** folder.
Please refer to the `README.md` inside this folder for detailed instructions on how to run the experiments.

## Logical Reasoning

The implementation is provided in the **`KK_Code`** folder.
Please refer to the `README.md` inside this folder for detailed instructions on how to run the experiments.
