# Repository Cleanup Notes

## Active Paths

- `index.ts`, `src/`, `skills/`, `agent-runtime.plugin.json`, `package.json`: active CKM-HypoGen Agent Runtime plugin code and packaged skills.
- `ckm-eval/`: active CKM evaluation harness used for the paper experiments. The documented entry points are in `ckm-eval/scripts/EXPERIMENTS.md`.
- `docs/`: plugin architecture and cleanup documentation.
- `workflows/`, `.releaserc.json`, `pnpm-lock.yaml`, `tsconfig.json`: release, CI, and build metadata.

## Archived Paths

- `legacy/`: deprecated experiments and prototypes. This directory is already outside the plugin build path and should remain the landing area for superseded code.

## Generated Or Local-Only Paths

- `analysis/`, `experiments/`, `figures/`, `paper/`, `reference/`: paper drafts, analysis outputs, downloaded references, and generated figures.
- `ckm-eval/results/`: large experiment output directory.
- `ckm-eval/.venv/`: local Python virtual environment.
- `ckm-eval/.cache/`: arXiv/full-text metadata cache.
- `**/__pycache__/`, `*.pyc`, `.DS_Store`, `.env`, `.claude/`, `.vscode/`: machine-local files, secrets, editor settings, or generated artifacts.
- `*.pdf`, `*.doc`, `*.docx`: local binary research artifacts. Use Markdown or source files for tracked documentation.
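
To see how much disk space the local-only directories above occupy, a small script can total their file sizes. This is a minimal sketch, not part of the repository: the `LOCAL_ONLY_DIRS` tuple and the `dir_size_bytes` and `report` helpers are hypothetical names introduced here, and the directory list simply repeats the `ckm-eval/` entries from this section.

```python
import os
from pathlib import Path

# Assumption: these mirror the ignored ckm-eval directories listed in this section.
LOCAL_ONLY_DIRS = ("ckm-eval/results", "ckm-eval/.venv", "ckm-eval/.cache")


def dir_size_bytes(root: Path) -> int:
    """Sum the sizes of regular files under root, skipping symlinks."""
    total = 0
    # os.walk does not follow directory symlinks by default and
    # yields nothing if root does not exist.
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in filenames:
            file_path = Path(dirpath) / filename
            if not file_path.is_symlink():
                total += file_path.stat().st_size
    return total


def report(repo_root: Path) -> dict[str, int]:
    """Map each local-only directory that exists to its size in bytes."""
    return {
        rel: dir_size_bytes(repo_root / rel)
        for rel in LOCAL_ONLY_DIRS
        if (repo_root / rel).is_dir()
    }
```

Calling `report(Path("."))` from the repository root returns a dictionary keyed by relative path, which can be printed or compared before and after a cleanup.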

## Cleanup Completed

- Fixed the malformed `.gitignore` entry that had combined `.cache` and `ckm-eval/.env`.
- Ensured local cache and secret paths are explicitly ignored.
- Removed tracked generated files from the git index:
  - 562 files under `ckm-eval/.cache/arxiv_metadata/`
  - 2 tracked Python bytecode files under `ckm-eval/**/__pycache__/`
- Removed tracked binary attachments that are not part of the plugin release:
  - `2603.08127v1.pdf`
  - `figures/ckm_framework.pdf`
  - `记录文档.docx`
- Rewrote `README.md` and `README.zh.md` as release-facing project overviews.

The cache files remain on disk for local reuse; only their git tracking was removed.
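
Removing files from the index while keeping them on disk corresponds to `git rm --cached`. The sketch below shows one way to script that step and to check for files that are still tracked despite matching an ignore rule. It is an illustration under assumptions, not a record of the commands actually run: the helper names (`untrack_argv`, `tracked_but_ignored_argv`, `list_tracked_but_ignored`) are hypothetical, and only standard `git rm` and `git ls-files` options are used.

```python
import subprocess
from pathlib import Path
from typing import Sequence


def untrack_argv(paths: Sequence[str]) -> list[str]:
    """Build the git command that drops paths from the index only."""
    # --cached keeps the working-tree copies; -r recurses into directories.
    # "--" ends option parsing so every following argument is read as a path.
    return ["git", "rm", "-r", "--cached", "--", *paths]


def tracked_but_ignored_argv() -> list[str]:
    """Build the git command that lists tracked files matching ignore rules."""
    # -z separates entries with NUL bytes, so non-ASCII names are not quoted.
    return ["git", "ls-files", "-z", "--cached", "--ignored", "--exclude-standard"]


def list_tracked_but_ignored(repo_root: Path) -> list[str]:
    """Run the listing command in repo_root and return the matching paths."""
    completed = subprocess.run(
        tracked_but_ignored_argv(),
        cwd=repo_root,
        check=True,  # raise CalledProcessError if git exits non-zero
        capture_output=True,
        text=True,
        encoding="utf-8",
    )
    return [entry for entry in completed.stdout.split("\0") if entry]
```

An empty list from `list_tracked_but_ignored(Path("."))` suggests that no ignored file is still tracked. The argument list from `untrack_argv([...])` can be passed to `subprocess.run` in the same way, followed by a commit to record the removal.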

## Next Cleanup Candidates

- Consolidate duplicate experiment runners in `ckm-eval/scripts/{pool,ablation1,lite,ablation2}/batch_run.py` into a parameterized shared runner. The current scripts are still active, so this should be a refactor, not a deletion.
- Consolidate common logic across `ckm-eval/scripts/*/eval_single.py`. The ablations are documented active experiments, but their copied scaffolding is a maintenance risk.
- Decide whether paper draft directories should remain ignored (`paper/`, `arxiv_paper/`) or become tracked publication sources.
- If the current experiment outputs are no longer needed locally, delete the ignored directories `ckm-eval/results/`, `ckm-eval/.venv/`, and `ckm-eval/.cache/` to reclaim about 5.1 GB.
