# Artifact Guide (Anonymous)

## Entry points
- `examples/ce_graph_refine/toy_demo.py`: sanity-check run.
- `verl/ce_graph/refine.py`: main refinement loop.

## Core abstractions
- `Workflow`: DAG of nodes and edges.
- `ExecutionTrace`: per-instance trace (including failure indicator).
- `CounterexamplePool`: stores failed traces.
- `FailureSignature`: structured vector / fields distilled from trace.
- `FailureClustering`: mode discovery over signatures.
- `OperatorLibrary`: constrained edits (local graph modifications).

## Reproducibility
The toy demo is deterministic and does not call external models. For full benchmarks, replace the runner with your
agent executor and verifier.
