Anonymized ICLR supplementary material version.

# PPO-EWMA

This is code for training agents using PPO-EWMA and PPG-EWMA, introduced in the paper _Batch size-invariance for policy optimization_.

## Installation

Supported platforms: MacOS and Ubuntu, Python 3.7

Installation using [Miniconda](https://docs.conda.io/en/latest/miniconda.html), after cloning the repo into `ppo-ewma`:

```
conda env update --name ppo-ewma --file ppo-ewma/environment.yml
conda activate ppo-ewma
pip install -e ppo-ewma
```

Alternatively, install the dependencies from [`environment.yml`](environment.yml) manually.