# Instructions

To run SuperB on AntMaze, please use the following command:

```
python src/rvs/train.py --configs experiments/config/d4rl/rvs_b/maze2d_rvs_b.cfg --use_gpu --store_dataset_gpu True --seed 0 --env_name antmaze-umaze-v2 --bootstrap_iters 1 --bts_model qr --bts_batch_size 1024 --bts_hidden_size 512 --bts_learning_rate 0.001 --bts_epochs 10 --bts_quantiles 5 --bts_only_sample_last True --bts_only_sample_last_policy True --bts_relabel_style greedy --epochs 100 --learning_rate_scheduler cosine --checkpoint_every_n_epochs 400 --dropout_p 0.0 --obs_noise 0.0 --weight_decay 1e-2 --hidden_size 1024 --depth 2 --discretize False --bts_value_fn True --reward_targets 0,20,40,60,80,100 --discount_factor 0.997 --reward_preprocessing antmaze --eval_return_quantile -1 --trajectory_samples 52 --num_cpu 4 --deterministic False
```

To run the Gym tasks, please use the following command:

```
python src/rvs/train.py --configs experiments/config/d4rl/rvs_b/maze2d_rvs_b.cfg --store_dataset_gpu True --use_gpu --seed 2 --env_name walker2d-medium-replay-v2 --bootstrap_iters 2 --bts_model qr --bts_batch_size 1024 --bts_hidden_size 512 --bts_learning_rate 3e-4 --bts_epochs 10 --bts_only_sample_last true --bts_relabel_style greedy --bts_only_sample_last_policy true --bts_quantiles 5 --epochs 100 --checkpoint_every_n_epochs 400 --trajectory_samples 30 --num_cpu 6 --obs_noise 0.0 --dropout_p 0.0 --discount_factor 1.0 --weight_decay 1e-2 --hidden_size 1024 --depth 2 --reward_targets 0,20,40,60,80,100 --num_cpu 6 --reward_preprocessing conservative --eval_return_quantile -1 --deterministic true --run_tag ULTRA10 --bts_ensemble min --clean_wandb_after --learning_rate_scheduler cosine --discretize false
```
