# FT Multi-hop Case Analysis & IFR Standard Visualization (exp/case_study)

This directory provides a lightweight single-sample IFR visualization workflow, without modifying core evaluation code.

## Features
- Reads a single sample (default `exp/exp2/data/morehopqa.jsonl`, index 0).
- Supports multiple modes:
  - `ft`: Current multi-hop FT attribution (internally calls `LLMIFRAttribution.calculate_ifr_multi_hop`).
  - `ifr`: Standard IFR (single hop), defaults to **aggregated IFR** on specified sink span (shows 1 panel only).
  - `ifr_all_positions_output_only`: Computes IFR token-level matrix only for output tokens within `sink_span`, and generates Row / Recursive (CAGE) two panels based on that matrix.
  - `attnlrp`: AttnLRP hop0 (reuses FT-AttnLRP span-aggregate logic, equivalent to `LLMLRPAttribution.calculate_attnlrp_multi_hop(n_hops=0)`, visualizes `raw_attributions[0].token_importance_total`).
  - `ft_attnlrp`: FT-AttnLRP (strictly reuses `LLMLRPAttribution.calculate_attnlrp_aggregated_multi_hop`, consistent with `exp/exp2/`; directly visualizes each hop's `token_importance_total`).
- Visualizes two views:
  - **Pre-trim token-level (full)**: Complete sequence heatmap with chat template (template + user prompt + generation).
  - **Prompt-only token-level**: Heatmap showing only user prompt tokens (excluding generation tokens).
- Heatmaps colored by `|score|` (no positive/negative distinction); each panel's full/prompt two images independently normalize color depth using p99.5(`|score|`).
- Outputs JSON (complete values) and HTML (per-hop heatmaps).
- Additionally provides MAS (faithfulness / token perturbation) visualization: performs token-level perturbation evaluation on specified attribution method, and renders perturbation impact heatmap + MAS score.

## Quick Start
```bash
# Modify model/model_path according to local model
# Multi-hop FT (default) ft_split_hop,ft_improve
python exp/case_study/run_ifr_case.py \
  --mode ft_split_hop \
  --dataset exp/exp2/data/morehopqa.jsonl \
  --index 0 \
  --model qwen-8B \
  --model_path /opt/share/models/Qwen/Qwen3-8B/ \
  --cuda 0 \
  --n_hops 3

# Standard IFR (single hop, can specify sink span)
python exp/case_study/run_ifr_case.py \
  --mode ifr \
  --dataset exp/exp2/data/morehopqa.jsonl \
  --index 0 \
  --model qwen-8B \
  --model_path /opt/share/models/Qwen/Qwen3-8B/ \
  --cuda 0 \
  --sink_span 0 0

# IFR output-only: compute IFR matrix only on output range, generate Row/Recursive (CAGE) two panels
python exp/case_study/run_ifr_case.py \
  --mode ifr_all_positions_output_only \
  --dataset exp/exp2/data/short-morehopqa.jsonl \
  --index 0 \
  --model qwen-8B \
  --model_path /opt/share/models/Qwen/Qwen3-8B/ \
  --cuda 0

# AttnLRP hop0 (reuses FT-AttnLRP span-aggregate; visualizes hop0 raw vector)
python exp/case_study/run_ifr_case.py \
  --mode attnlrp \
  --dataset exp/exp2/data/morehopqa.jsonl \
  --index 0 \
  --model qwen-8B \
  --model_path /opt/share/models/Qwen/Qwen3-8B/ \
  --cuda 0 \
  --sink_span 0 20

# FT-attnLRP (multi-hop recursive AttnLRP)
python exp/case_study/run_ifr_case.py \
  --mode ft_attnlrp \
  --dataset exp/exp2/data/morehopqa.jsonl \
  --index 0 \
  --model qwen-8B \
  --model_path /opt/share/models/Qwen/Qwen3-8B/ \
  --cuda 0,2,3,4,5,7 \
  --n_hops 3 \
  --attnlrp_neg_handling abs \
  --attnlrp_norm_mode norm
```

Outputs located in `exp/case_study/out/`, filename prefix varies by mode, e.g.:
- `ft_case_<dataset>_idx<idx>.json/html`
- `ifr_case_<dataset>_idx<idx>.json/html`
- `ifr_output_only_case_<dataset>_idx<idx>.json/html`
- `attnlrp_case_<dataset>_idx<idx>.json/html`
- `ft_attnlrp_case_<dataset>_idx<idx>.json/html`

## MAS (Faithfulness / Token Perturbation) Visualization

> Note: MAS here is consistent with `llm_attr_eval.LLMAttributionEvaluator.faithfulness_test()`:
> 1) First run specified method's attribution on sample, and get token-level attribution (Seq / Row / Recursive).
> 2) Sort prompt tokens by importance, progressively replace token id with `tokenizer.pad_token_id` (token-level perturbation).
> 3) Use `sum log p(generation + EOS | prompt)` to get score curve, compute RISE / MAS / RISE+AP.
> 4) Visualization uses "marginal logprob change from each perturbation step" as token score, rendered as token spans "perturbation impact heatmap".

```bash
# FT-IFR (ifr_multi_hop; default --method ft)
python exp/case_study/run_mas_case.py \
  --dataset exp/exp2/data/short-morehopqa.jsonl \
  --index 0 \
  --model qwen-8B \
  --model_path /opt/share/models/Qwen/Qwen3-8B/ \
  --cuda 0 \
  --method ft \
  --n_hops 3
```

Common method choices (aligned with `run_ifr_case.py` mode names):
```bash
# IFR (needs sink_span; default uses dataset cache field)
python exp/case_study/run_mas_case.py --method ifr --sink_span 0 20 ...

# IFR output-only (computes IFR token-level matrix only for output tokens within sink_span)
python exp/case_study/run_mas_case.py --method ifr_all_positions_output_only --sink_span 0 20 ...

# FT-IFR (ifr_multi_hop)
python exp/case_study/run_mas_case.py --method ft --n_hops 1 --sink_span 0 20 --thinking_span 0 20 ...

# AttnLRP hop0 (reuses FT-AttnLRP hop0; still needs indices_to_explain/sink_span for Seq/Row/Rec)
python exp/case_study/run_mas_case.py --method attnlrp --sink_span 0 20 ...

# FT-AttnLRP (attnlrp_aggregated_multi_hop)
python exp/case_study/run_mas_case.py --method ft_attnlrp --n_hops 1 --sink_span 0 20 --thinking_span 0 20 ...
```

Outputs located in `exp/case_study/out/`, filename prefix:
- `mas_case_<method>_<dataset>_idx<idx>.json/html`

HTML defaults to 3 attribution perspective panels (Seq / Row / Recursive), each panel has 2 rows of token-level heatmaps:
- **Method attribution (token weights)**: The method's token attribution weights (for sorting/density).
- **Attribution-guided MAS marginal (path deltas)**: Marginal impact from progressive replacement sorted by attribution (this is the perturbation path actually used in evaluation).

## Viewing HTML in Browser
1) First run above commands to generate `.html` (terminal prints like `wrote exp/case_study/out/...html`).

2) Start a static file server at repo root (pick any port, e.g., 8888):
```bash
python -m http.server 8888 --directory exp/case_study/out
```

3) Open in browser (note it's `http://`, not `https://`):
- Local: `http://127.0.0.1:8888/<your-html-filename>`
- Remote machine (recommend port forwarding): Run locally `ssh -L 8888:127.0.0.1:8888 <user>@<server>`, then open `http://127.0.0.1:8888/<your-html-filename>` in local browser

If you see lots of `400 Bad request version` with garbled text in `http.server` logs, it's usually some client connecting with HTTPS to HTTP port; confirm browser address bar is `http://...`.

## Optional Parameters
- `--sink_span a b` / `--thinking_span a b`: Override generation-side sink/thinking sentence span (default uses cache fields).
- `--attnlrp_neg_handling drop|abs`: FT-AttnLRP per-hop negative value handling (drop=clamp>=0, abs=absolute value).
- `--attnlrp_norm_mode norm|no_norm`: FT-AttnLRP normalization and hop ratio switch (norm=global+thinking normalization and enable ratio; no_norm=disable all three).
- `--chunk_tokens` / `--sink_chunk_tokens`: IFR chunking parameters.
- `--output_dir`: Modify output directory.

## File Descriptions
- `run_ifr_case.py`: CLI entry and output (supports `ft`/`ifr`/`ifr_all_positions_output_only`/`attnlrp`/`ft_attnlrp` modes).
- `run_mas_case.py`: MAS (faithfulness / token perturbation) visualization entry and output (supports `ifr`/`ifr_all_positions_output_only`/`ft`/`attnlrp`/`ft_attnlrp`).
- `analysis.py`: Per-hop cleanup and encapsulation (token-level).
- `viz.py`: HTML rendering and heatmaps.
