# Construction of Uncertainty Keywords

We explain the semi-supervised construction of uncertainty keywords, which involves the following steps:

#### Step 1: Train a Random Forest Classifier
Train a random forest classifier with TF-IDF $k$-gram features ($k=2,3,4,5$) to discriminate between the two types of traces. Use the following steps:

1. Apply 5-fold stratified cross-validation to obtain five random forest models.
2. Average the feature importance scores across all folds.
3. Select the top features as initial candidates.

To do this, run:
```bash
python -m src.detect.classifier
```

This produces `classification_results/gsm8k_classification_results.json`, which contains the top $k$-gram features.

#### Step 2: Categorizing the Keyword Set

Organize the extracted $k$-gram features into three semantic categories:
- **Insufficiency**
- **Impossibility**
- **Epistemic Uncertainty**

Assign each feature to a category using a keyword-based filtering approach. For each category, specify a small set of words that reflect its characteristic type of uncertainty. The filtering terms are listed below:

| Category         | Filtering Terms                                                                 |
|------------------|---------------------------------------------------------------------------------|
| Insufficiency    | missing, insufficient, incomplete, no numbers, no information, not enough, without, additional, not given, unless, lack, unclear, shot dark |
| Impossibility    | not possible, impossible, hard, can't, cannot, difficult, unlikely              |
| Epistemic Uncertainty | maybe, perhaps, possibly, possible, might, not sure, I think, guess, assume, would, could |

Break ties by prioritizing:
1. **Insufficiency** over **Impossibility**.
2. **Impossibility** over **Epistemic Uncertainty**.

This process yields three disjoint sets:
- $K_{\mathrm{ins}}$: Insufficiency keywords.
- $K_{\mathrm{imp}}$: Impossibility keywords.
- $K_{\mathrm{epi}}$: Epistemic Uncertainty keywords.

The full uncertainty keyword set is $K = K_{\mathrm{ins}} \cup K_{\mathrm{imp}} \cup K_{\mathrm{epi}}$.

In our experiments, our script `src/simulator.py` calls `get_uncertainty_keywords` in `src/detect/uncertainty_keywords.py` to retrieve the full keyword set $K$ for uncertainty detection.

#### Detect Abstention Using LLM Judge

The `src/detect/detectors.py` module provides the `LLMJudgeAbstentionDetector` class, which is used to determine whether a model's response constitutes an abstention. This is achieved by leveraging a large language model (LLM) to judge the response based on a predefined prompt.

In the `compute_metrics.py` script, the `LLMJudgeAbstentionDetector` is used to perform batched abstention detection. 

# LLM Judge Abstention Detection

`src/detect/detectors.py` implements the `LLMJudgeAbstentionDetector` class, which uses an LLM to determine if a model's response is an abstention. The system prompt is included in `evaluation_judge_prompt.py`.