# BenchRL v2 Modular Architecture Design

Please do not publish.

## Core Design Principles

1. Simplicity over Complexity: Follow CleanRL's philosophy of clear, readable code
2. Modular but not Over-engineered: Avoid the complexity of garage while maintaining extensibility
3. Minimize CPU/GPU transfers and use PyTorch best practices
4. Research-Friendly: Easy for researchers to understand and extend algorithms

## Architecture Overview

```txt
  BenchRL v2 Structure:
  ├── environments/        # Environment management (keep as-is)
  ├── algorithms/         # Algorithm implementations
  ├── configs/         # experiment YAML configurations
  ├── modules/          # Network architectures
  ├── utils/buffers/           # Experience storage
  ├── policies/          # Policy interfaces
  └── trainers/          # Training orchestration
```

# Requirements

Before everything, you need to have poetry installed, then run:

```sh
poetry install
```

to install dependencies

# How to run an experiment

Simply run the following command:

```sh
poetry run python ./src/benchrl/experiments/run_v2.py
```

This uses the main config file `src/benchrl/configs/config.yaml` (you need to change the correspondic configs).

To override a configuration you can run the following command:

```sh
poetry run python ./src/benchrl/experiments/run_v2.py experiment.seed=0 environment="<env-config-filename-without-extention>" algorithm="<algorithm-config-filename-without-extention>"
```

If you are on a cluster managed by slurm, you can use `sbatch submit_job_array.sh` to launch mutiple experiments with different configurations.