# FlashTrace Experiment 4 (Aider Attribution Faithfulness / Row-only)

This directory provides token-level attribution faithfulness evaluation tools on the Aider dataset, **outputting only row-part RISE/MAS**, not saving sample-level traces.

Evaluation scope (fixed):
- Dataset: `exp/exp4/data/aider.jsonl`
- Methods:
  - `ifr_all_positions`
  - `ifr_multi_hop_both` (FlashTrace)
- Metrics: `RISE`, `MAS` (row attribution only)

Main files:
- `run_exp.py`: Attribution + faithfulness evaluation, outputs to `exp/exp4/output/`

---

## Data Format

`exp/exp4/data/aider.jsonl` has one JSON per line, corresponding to one sample:
- `input`: prompt (used directly as user prompt content)
- `output`: target (used directly as model generation text; script internally appends EOS for scoring)
- `length`: field from data (script doesn't depend on it, only passes through to metadata)

Note: Aider's `output` looks like:
1) First line `xxx.py`
2) Second line opening fence ```
3) Middle is code
4) Last line is closing fence ```

---

## Attribution and Sink Selection

The script uses `input` as `prompt` and `output` as `target` for each sample (no re-generation), and selects different sinks on the attribution results (`indices_to_explain=[start_tok,end_tok]`, all based on token span from `tokenizer(target, add_special_tokens=False)`; excludes EOS).

### `ifr_all_positions` (outputs two sinks)

- `last_line`: Takes the **last non-empty, non-``` line before closing fence** in `output`, and maps that line's character span to token span; falls back to `full_output` if parsing fails.
- `last_token`: Takes the last token of `last_line` (single-point span `[end,end]`).

Note: The script computes `ifr_all_positions` attribution matrix only once per sample, then takes row attribution on both sinks separately and calculates faithfulness.

### `ifr_multi_hop_both` (FlashTrace, outputs one sink only)

- `full_output`: Uses complete `output` as sink (token span `[0, n_tok-1]`).
- Faithfulness perturbation side follows exp2 protocol: skips stop tokens on prompt-side (determined by `ft_ifr_improve.py` stop-token configuration).

---

## Metric Output (Row-only)

Output CSV only contains row attribution's `RISE/MAS` aggregate statistics:
- `Method,Sink,Row_RISE_Mean,Row_RISE_Std,Row_MAS_Mean,Row_MAS_Std,Used,Skipped,Avg_Sample_Time_s`

Output path:
- `exp/exp4/output/faithfulness/aider/<model_tag>/row_only_<N>_examples.csv`

Where `<model_tag>` prioritizes `--model`, otherwise uses `--model_path` directory name.

---

## Usage

Recommended to run from repo root (ensures relative paths work):

```bash
python exp/exp4/run_exp.py \
  --data_path exp/exp4/data/aider.jsonl \
  --output_root exp/exp4/output \
  --model qwen-8B \
  --model_path /opt/share/models/Qwen/Qwen3-8B/ \
  --cuda 2,3,4,5,6,7 \
  --num_examples 100 \
  --n_hops 1 \
  --k 20
```

Common parameters:
- `--model_path` / `--model`: Local model path or HF repo id (provide at least one)
- `--tokenizer_path`: Optional; defaults to model path/id if not provided
- `--cuda`: Supports `0` (single GPU) or `0,1,2` (multi-GPU, internally sets `CUDA_VISIBLE_DEVICES` and uses `device_map=auto`)
- `--num_examples`: Evaluate first N samples (in file order; `--seed` reserved, currently no random sampling)
- `--n_hops`: FlashTrace (`ifr_multi_hop_both`) hop count
- `--k`: MAS/RISE perturbation steps
- `--chunk_tokens` / `--sink_chunk_tokens`: IFR computation chunk parameters (generally keep defaults)
