# MPF Cluster Scripts (standalone bundle)

Minimal extract from the hyperparameter-tuning dashboard: only what is needed to run the cluster scripts. No Flask app, RQ, or dashboard UI.

## Layout

- **cluster_scripts/** — Entry points and configs
  - `run_cluster_experiment.py` — Run a single experiment (config + run-id)
  - `run_custom_dataset_experiment.py` — Optuna tuning on a custom synthetic dataset
  - `generate_cluster_config.py` — Build cluster config JSON from datasets/models
  - `aggregate_cluster_results.py` — Summarize results (optionally `--import_to` DB)
  - `save_model_from_log.py` — Save MPF model from an Optuna log file
  - `submit_cluster_jobs.sh` — SLURM submit script
  - `configs/` — Example cluster config JSONs
  - `hyperparams/` — blackbox.json, interpretable.json
- **experiment_runner/** — Optuna execution and default models
- **services/datasets.py** — Dataset loading (OpenML, Friedman, local_npy, etc.)
- **models.py** — DB models (used only by `aggregate_cluster_results.py --import_to`)

## Usage

Run everything from this directory (the bundle root):

```bash
cd reproducibility/mpf_cluster_scripts

# Generate config
python cluster_scripts/generate_cluster_config.py \
  --suite "353[1-10]" --models MPFRegressor XGBRegressor \
  --hyperparams cluster_scripts/hyperparams/interpretable.json \
  --output cluster_config.json

# Submit jobs (SLURM)
bash cluster_scripts/submit_cluster_jobs.sh cluster_config.json results/

# Or run one experiment locally
python cluster_scripts/run_cluster_experiment.py --config cluster_config.json --run-id 0 --output results/

# Aggregate results
python cluster_scripts/aggregate_cluster_results.py results/ --output summary.csv
```

See **cluster_scripts/CLUSTER_EXECUTION.md** for full workflow and options.

## Dependencies

Install the same ML stack as the dashboard (optuna, openml, scikit-learn, xgboost, lightgbm, interpret, mpf-py, etc.). For `aggregate_cluster_results.py --import_to` you need Flask and flask-sqlalchemy. See `requirements.txt` in the full dashboard or use the project’s main `reproducibility/requirements.txt` if it already lists these.
