# AMS: Action Manifold Smoothing - Supplementary Material

## Contents

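- `code/`: Training scripts (`ams_td3.py`, `ams_sac.py`) and the plotting script (`visualize_logs.py`)
- `logs/`: TensorBoard training logs (5 seeds per environment)
- `figures/`: Generated learning curves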

## Requirements

```bash
pip install gymnasium shimmy torch tensorboard stable-baselines3 matplotlib numpy
pip install dm_control
```

## Visualizing Results

```bash
cd logs
python ../code/visualize_logs.py
```

This generates `figures/all_results.pdf`, which shows the learning curves for all environments.

Alternatively, view the logs directly with TensorBoard (run this from the top-level directory, not from inside `logs/`):

```bash
tensorboard --logdir logs/
```
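
For custom plots, the logs can also be read programmatically. The following is a minimal sketch, not the contents of `visualize_logs.py`: it uses TensorBoard's `EventAccumulator` to load one scalar per run and averages the curves over seeds. The function names, the run directory layout, and the scalar tag are assumptions; list the tags actually present with `EventAccumulator(run_dir).Tags()["scalars"]` after calling `Reload()`.

```python
from collections import defaultdict
from statistics import mean


def load_scalar(run_dir, tag):
    """Return [(step, value), ...] for one scalar tag in one run directory."""
    # Imported lazily so the averaging helper below works without TensorBoard.
    from tensorboard.backend.event_processing.event_accumulator import EventAccumulator

    acc = EventAccumulator(run_dir)
    acc.Reload()  # parse the event files on disk
    return [(event.step, event.value) for event in acc.Scalars(tag)]


def average_over_seeds(curves):
    """Average several [(step, value), ...] curves at the steps they all share."""
    by_step = defaultdict(list)
    for curve in curves:
        for step, value in curve:
            by_step[step].append(value)
    # Keep only steps that were logged by every seed.
    return [(s, mean(v)) for s, v in sorted(by_step.items()) if len(v) == len(curves)]
```

Steps logged by only some of the seeds are dropped rather than interpolated, which keeps the sketch short but may thin out the curve if runs log at different intervals.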

## Training

```bash
# AMS-TD3
python code/ams_td3.py --env_id dm_control/dog-run-v0 --total_timesteps 1000000

# AMS-SAC
python code/ams_sac.py --env_id dm_control/dog-run-v0 --total_timesteps 1000000
```

## Environments

| Environment    | Action Dim | Training Steps |
| -------------- | ---------- | -------------- |
| Quadruped-Walk | 12         | 500K           |
| Quadruped-Run  | 12         | 500K           |
| Dog-Walk       | 38         | 500K           |
| Dog-Run        | 38         | 1M             |

## Hyperparameters

All experiments use identical hyperparameters without task-specific tuning:

| Parameter              | Value |
| ---------------------- | ----- |
| Neighborhood samples K | 8     |
| Neighborhood radius    | 0.2   |
| Learning rate          | 3e-4  |
| Batch size             | 256   |
| Parallel environments  | 8     |
| Replay buffer size     | 1M    |
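
As an illustration of "one configuration for every task", the table can be written as a single frozen config object. This is a sketch only: the class and field names are hypothetical and need not match the argument names used in `ams_td3.py` or `ams_sac.py`.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class AMSConfig:
    # Values copied from the hyperparameter table; field names are illustrative.
    neighborhood_samples: int = 8      # K
    neighborhood_radius: float = 0.2
    learning_rate: float = 3e-4
    batch_size: int = 256
    num_envs: int = 8                  # parallel environments
    buffer_size: int = 1_000_000       # replay buffer capacity


SHARED = AMSConfig()


def config_for(env_id: str) -> dict:
    """Every task gets the same settings; only the environment ID differs."""
    return {"env_id": env_id, **asdict(SHARED)}
```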

## Seeds

All experiments use 5 random seeds: {0, 10, 42, 77, 123}
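
To reproduce the full sweep, the environment table and the seed list can be combined into one command per run. The sketch below only builds the commands. It rests on two assumptions: a `--seed` flag, which this README does not document, and environment IDs other than `dm_control/dog-run-v0`, which are inferred from the `dm_control/<domain>-<task>-v0` naming pattern.

```python
# Step budgets come from the Environments table. Env IDs other than dog-run
# are inferred from the dm_control/<domain>-<task>-v0 pattern (assumption).
ENV_STEPS = {
    "dm_control/quadruped-walk-v0": 500_000,
    "dm_control/quadruped-run-v0": 500_000,
    "dm_control/dog-walk-v0": 500_000,
    "dm_control/dog-run-v0": 1_000_000,
}
SEEDS = [0, 10, 42, 77, 123]


def sweep_commands(script="code/ams_td3.py"):
    """Return one argument list per (environment, seed) pair."""
    commands = []
    for env_id, steps in ENV_STEPS.items():
        for seed in SEEDS:
            commands.append([
                "python", script,
                "--env_id", env_id,
                "--total_timesteps", str(steps),
                "--seed", str(seed),  # hypothetical flag, not confirmed by this README
            ])
    return commands
```

Each argument list can be passed to `subprocess.run` or joined with spaces and pasted into a shell. Passing `script="code/ams_sac.py"` gives the matching AMS-SAC sweep.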