# Slight (paper code)

## Overview
This repository contains the training + evaluation pipeline for the Slight traffic signal control agent.
The entry point is `run_slight.py`, which wires dataset selection, configuration overrides, CityFlow
simulation, and the training loop (generator -> sample construction -> updater -> evaluation).

## Repository layout
- `run_slight.py`: CLI entry point; chooses a dataset and launches the pipeline.
- `models/`: Slight agent, embedding, and Q-network code.
- `utils/`: pipeline, generator, updater, CityFlow environment wrapper, and helpers.
- `data/`: CityFlow roadnet + traffic flow JSON files for bundled datasets.
- `env.yml`: conda environment spec used during development.

## Requirements
- Python 3.8+ (the bundled `env.yml` pins Python 3.8.20).
- PyTorch (used by the Slight agent and embeddings).
- CityFlow Python package (`cityflow`) used by `utils/cityflow_env.py`.

### Recommended setup (conda)
```bash
conda env create -f env.yml
conda activate Slight
```

If you prefer not to use conda, ensure your environment includes a compatible
Python version plus `torch` and `cityflow`.

## Data
Datasets are stored under `data/<Template>/<GRID>/` where `<GRID>` matches the
`roadnet_*.json` file name. The shipped datasets are:

```
data/
  Hangzhou/4_4/
    anon_4_4_hangzhou_real.json
    anon_4_4_hangzhou_real_5816.json
    roadnet_4_4.json
  Jinan/3_4/
    anon_3_4_jinan_real.json
    anon_3_4_jinan_real_2000.json
    anon_3_4_jinan_real_2500.json
    roadnet_3_4.json
  newyork_28_7/28_7/
    anon_28_7_newyork_real_double.json
    anon_28_7_newyork_real_triple.json
    roadnet_28_7.json
```

`run_slight.py` selects a default flow file per dataset:
- `hangzhou`: `anon_4_4_hangzhou_real.json` and `anon_4_4_hangzhou_real_5816.json`
- `jinan`: `anon_3_4_jinan_real.json`, `anon_3_4_jinan_real_2000.json`, `anon_3_4_jinan_real_2500.json`
- `ny`: `anon_28_7_newyork_real_double.json`
- `custom`: expects `data/SumoConstruct2/16_16/anon_16_16_custom_sumo_2.json` and `roadnet_16_16.json`

To add your own dataset, mimic this layout and update `run_slight.py` if the
template name, grid size, or traffic file names differ.

## Quick start
From this folder:

```bash
python run_slight.py --dataset ny
```

Other datasets:

```bash
python run_slight.py --dataset hangzhou
python run_slight.py --dataset jinan
python run_slight.py --dataset custom
```

## CLI options
`run_slight.py` exposes common experiment switches. Defaults are shown in parentheses.

General:
- `--memo` (`slight_benchmark`): output subfolder name.
- `--mod` (`Slight`): model identifier recorded in logs.

Execution:
- `--generators` (`1`): number of generator processes per traffic file.
- `--workers` (`3`): max concurrent traffic files when using multi-process.
- `--multi-process` (default): run each traffic file in its own process.
- `--single-process`: run sequentially in the current process.

Dataset selection:
- `--dataset {hangzhou|jinan|custom|ny}` (default: `ny`)

Slight-specific:
- `--num-skills` (`4`): number of discrete skills.
- `--meta-freq` (`10`): steps between meta-controller decisions.
- `--cvae-latent` (`16`): CVAE latent size.

Embeddings and CVAE:
- `--lstm-hidden` (`128`), `--lstm-layers` (`1`), `--lstm-dropout` (`0.0`)
- `--cvae-hidden` (`128`), `--cvae-lr` (`1e-3`), `--cvae-kl` (`1e-4`)

Example:

```bash
python run_slight.py --dataset ny --num-skills 4 --meta-freq 10 \
  --cvae-latent 16 --lstm-hidden 128 --cvae-hidden 128
```

## What happens when you run
- For each traffic flow file in the selected dataset, the pipeline:
  1) copies CityFlow JSON files into a new working directory,
  2) generates rollouts, constructs samples, updates the network, and
  3) evaluates the model each round.
- Each dataset is configured for 50 rounds and 3600 simulation steps per round
  (see `run_slight.py` if you need to adjust these).
- CityFlow uses a fixed seed of `0` in the generated `cityflow.config`.

## Outputs
Runs write to:
- `model/<memo>/...` (checkpoints)
- `records/<memo>/...` (logs, configs, training artifacts)
- `errors/<memo>/...` (if any)

The `--memo` flag controls the experiment subfolder name. If a folder already
exists (and it is not `model/default` or `records/default`), the pipeline will
raise `FileExistsError`. Use a new `--memo` value or delete the old directory.

## Troubleshooting
- `ModuleNotFoundError: cityflow`: install the CityFlow Python package and make
  sure any native dependencies are available.
- `FileExistsError` when rerunning: choose a new `--memo` or delete the prior
  `model/<memo>` and `records/<memo>` directories.
- Multiprocessing hangs: try `--single-process` or reduce `--workers`.
- Missing dataset files: confirm the data layout and file names match the
  dataset configuration in `run_slight.py`.

## Notes
- The default NY run uses `anon_28_7_newyork_real_double.json`; if you want the
  triple flow, update `run_slight.py` or pass a custom dataset.
- The pipeline writes a `cityflow.config` into the working directory each run
  and copies the selected roadnet and flow JSON files into `records/...`.
