# 2p0s-solver

2p0s-solver is a library for running various 2p0s or deep RL algorithms on OpenSpiel games. Furthermore, we compute exact exploitability using the [exp-a-spiel library]() which currently supports Phantom Tic-Tac-Toe and 3x3 Dark Hex, as well as their abrupt versions.

## Installation

```bash
# Install and build the main dependencies
cd 2p0s-solver
pixi install
pixi shell

# Make sure OpenSpiel is uninstalled then install our custom OpenSpiel build (a few fixes and additional features for PTTT and DH3)
# Note: The code has been extensively tested with Python 3.11 on Linux. However, other wheels for MacOS or different Python versions are available in ./wheels.
pip uninstall open_spiel -y
pip install wheels/open_spiel-1.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl

# Lastly, install the remaining requirements
pip install -r requirements.txt
```

## Usage

Test commands for all game variants with a PPO policy and a DQN best reponse.

```bash
python main.py --algorithm ppo --run-best-response dqn game="classical_dark_hex" max_steps=10_000 dqn_max_steps=10_000 cuda=False

python main.py --algorithm ppo --run-best-response dqn game="abrupt_dark_hex" max_steps=10_000 dqn_max_steps=10_000 cuda=False

python main.py --algorithm ppo --run-best-response dqn game="classical_phantom_ttt" max_steps=10_000 dqn_max_steps=10_000 cuda=False

python main.py --algorithm ppo --run-best-response dqn game="abrupt_phantom_ttt" max_steps=10_000 dqn_max_steps=10_000 cuda=False
```

# Evaluating head2head 

- Input the paths and algorithm names into a `yaml ` file like the example in `head2head/example.yaml`

```
python head2head/head2head_eval.py --agents-yaml path/to/example.yaml --save-dir <save-dir>
```

This command will kick off head2head evaluation for the agents listed in the eval `yaml` using the `exp-a-spiel` library. 

## Appendix
### pip installing exp-a-spiel (NOT NECESARY, only as alternative debugging tool)
```
cd 2p0s-solver
pip install ./exp-a-spiel 
```
