# FlashTrace

Token-level attribution for LLMs with multi-hop reasoning chains (Chain-of-Thought).

## Overview

FlashTrace implements and evaluates attribution methods for understanding how LLMs trace information through reasoning steps:

- **IFR (Information Flow Routes)**: Gradient-based attribution via attention-weighted value propagation
- **AttnLRP**: Attention-aware Layer-wise Relevance Propagation
- **FlashTrace (FT)**: Multi-hop recursive attribution that traces reasoning through thinking→output spans
- **Perturbation baselines**: RISE, MAS, KL-divergence ablation methods

## Repository Structure

```
├── llm_attr.py          # Main LLMAttribution class
├── ifr_core.py          # Low-level IFR tensor operations
├── ft_ifr_improve.py    # FlashTrace enhancements
├── lrp_patches.py       # LRP forward patches for model layers
├── attnlrp.py           # AttnLRP implementation
├── evaluations/         # Faithfulness and recovery metrics
│   ├── faithfulness.py
│   └── attribution_recovery.py
└── exp/
    ├── exp1/            # Time/memory benchmarks
    ├── exp2/            # Faithfulness & recovery evaluation
    ├── exp3-exp5/       # Additional experiments
    └── case_study/      # Visualization tools
```

## Installation

```bash
# Python 3.13+ required
pip install -r requirements.txt

# spaCy model for sentence splitting
python -m spacy download en_core_web_sm
```

Or with uv:
```bash
uv sync
```

## Usage

### Time/Memory Benchmarks (exp1)

```bash
python exp/exp1/run_time_curve.py \
  --model qwen-8B \
  --model_path /path/to/Qwen3-8B/ \
  --cuda 0,1,2 \
  --attr_funcs ifr_multi_hop,attnlrp,perturbation_all \
  --input_lengths 1024,4096 \
  --output_lengths 256,512
```

### Faithfulness & Recovery Evaluation (exp2)

```bash
# Step 1: Sample and filter
python exp/exp2/sample_and_filter.py --dataset morehopqa --max_examples 100

# Step 2: Run attribution evaluation
python exp/exp2/run_exp.py \
  --datasets morehopqa \
  --attr_funcs ifr_multi_hop,attnlrp,IG \
  --model qwen-8B \
  --model_path /path/to/model/ \
  --cuda 0,1 \
  --mode faithfulness_gen,recovery_ruler \
  --num_examples 100
```

### Case Study Visualization

```bash
python exp/case_study/run_ifr_case.py \
  --mode ft \
  --dataset exp/exp2/data/morehopqa.jsonl \
  --index 0 \
  --model_path /path/to/model/ \
  --cuda 0 \
  --n_hops 3

# Serve HTML output
python -m http.server 8888 --directory exp/case_study/out
```

## Model Support

Tested architectures:
- Qwen3, Qwen2
- Llama

Requires models with standard decoder structure.

## Multi-GPU Usage

Use `--cuda 0,1,2` to specify GPUs for model sharding.

## Output Locations

- `exp/exp1/out/`: Time/memory benchmark CSVs
- `exp/exp2/data/`: Cached filtered samples (JSONL)
- `exp/exp2/output/`: Faithfulness/recovery CSVs
- `exp/case_study/out/`: HTML visualizations
