We evaluate on the 400 ARC public evaluation tasks using a verifier that executes candidate programs on the provided training pairs and we count a task as solved if any of 16 diverse methods returns a verified solution. We release drivers and modular implementations (BARC, MARC, MCTS, plan search, mixture-of-agents, self-consistency) and instructions to recreate submission.json files from public ARC JSONs; logs and intermediate artifacts can be placed in the provided folder to replay aggregation. We specify environment details and seeds, and document dependencies.