# Reproducing ACE-RAG

## Artifact Scope

This artifact reproduces ACE-RAG on Wikipedia-style QA benchmarks.
It focuses on ACE-RAG outputs, ablations, and evaluation protocol, not on
regenerating every baseline from scratch.

Document/page titles are used as canonical doc anchors.
The default doc-anchor mode is `title`, matching the submitted paper.
Title-free doc-anchor induction is outside the scope of this artifact and left for future work.

Public datasets and model checkpoints are not included.
Users must obtain datasets from their original sources.
Baseline values are reference values unless raw baseline predictions are explicitly provided.
Common-prompt and native-prompt protocols are separate evaluation protocols, not arbitrary tuned variants.
Official paper protocols are specified in `configs/`.

## Environment Setup

```bash
conda env create -f environment.yml
conda activate ace-rag
pip install -r requirements.txt
```

## Dataset Layout

Prepare a data root from files downloaded from the original public sources:

```bash
python scripts/prepare_data_layout.py --src /path/to/downloaded/files --dst /path/to/data
export ACE_RAG_DATA_DIR=/path/to/data
export ACE_RAG_OUTPUT_DIR=/path/to/output
```

Expected files:

```text
${ACE_RAG_DATA_DIR}/popqa/popqa.json
${ACE_RAG_DATA_DIR}/popqa/popqa_corpus.json
${ACE_RAG_DATA_DIR}/hotpotqa/hotpotqa.json
${ACE_RAG_DATA_DIR}/hotpotqa/hotpotqa_corpus.json
${ACE_RAG_DATA_DIR}/2wiki/2wikimultihopqa.json
${ACE_RAG_DATA_DIR}/2wiki/2wikimultihopqa_corpus.json
${ACE_RAG_DATA_DIR}/musique/musique.json
${ACE_RAG_DATA_DIR}/musique/musique_corpus.json
```

## Model Setup

Serve the generator:

```bash
vllm serve Qwen/Qwen2.5-7B-Instruct \
  --gpu-memory-utilization 0.55 \
  --tensor-parallel-size 1 \
  --max_model_len 8192 \
  --port 8013
```

Configure the OpenAI-compatible endpoint:

```bash
export ACE_RAG_LLM_BASE_URL=http://localhost:8013/v1
export VLLM_API_KEY=EMPTY
```

Download or provide NV-Embed-v2 locally:

```bash
huggingface-cli download nvidia/NV-Embed-v2 \
  --local-dir /path/to/nvidia/NV-Embed-v2
export NVEMBED_MODEL_PATH=/path/to/nvidia/NV-Embed-v2
```

NV-Embed-v2 is not included in the archive. The embedding loader uses local
files, so the model must already exist at `NVEMBED_MODEL_PATH`.

## Preflight Checks

```bash
python scripts/check_env.py
python scripts/check_vllm.py
python scripts/check_nvembed.py
```

## Smoke Test

```bash
python -m compileall main.py scripts utils
python main.py --help
bash scripts/run_smoke_all.sh 3
```

## Paper Protocol Reproduction

Common-prompt protocol:

```bash
bash scripts/run_paper_common_prompt.sh hotpotqa
LIMIT=1000 bash scripts/run_paper_common_prompt.sh 2wiki
```

Native-prompt protocol:

```bash
bash scripts/run_paper_native_prompt.sh hotpotqa
LIMIT=1000 bash scripts/run_paper_native_prompt.sh 2wiki
```

The official configs fix the retrieval policy, prompt protocol, rendering
policy, compact context budget, and doc-anchor mode. CLI and environment
settings are intended for datasets, limits, output locations, model endpoints,
and local model paths.

## Ablation Reproduction

```bash
bash scripts/run_paper_ablation.sh hotpotqa
```

The ablation config covers full ACE-RAG, w/o Mention Edge, w/o Residual Cues,
w/o Chain Order, and w/o Anchor Bundle.

## Release Check

```bash
ACE_RAG_LLM_BASE_URL=http://localhost:8013/v1 bash scripts/run_release_reproduction_check.sh
```

The release check runs HotpotQA and 2Wiki with a small limit under the
common-prompt and native-prompt protocols, prints metric summaries, and deletes
its temporary output directory unless `ACE_RAG_KEEP_RELEASE_CHECK_OUTPUTS=1` is set.

## Expected Outputs

Paper-scale runs write under `${ACE_RAG_OUTPUT_DIR:-outputs}`:

```text
{dataset}/indexing/{timestamp}/
{dataset}/eval/{timestamp}/predictions.jsonl
{dataset}/eval/{timestamp}/eval.json
{dataset}/eval/{timestamp}/eval_summary.md
{dataset}/eval/{timestamp}/timing_summary.json
```

Generated outputs, logs, caches, datasets, model checkpoints, PDFs, paper
sources, and figures are not part of the release archive.
