# Code for the paper "Adaptive Destruction Processes for Diffusion Samplers".

## Setup 

First, create a conda environment and install the requirements from the root directory:

```
conda create -n gfn_diffusion python=3.9
conda activate gfn_diffusion
pip install -r requirements.txt
```

## Sampling

To run experiments, go to **energy_sampling** directory:

```
cd energy_sampling
```

### Exemplary commands to run experiments
Below are commands to reproduce some of the results:

#### 25GMM:

To run a baseline $\text{TB}_{\theta}$ (fixed var.) on 25GMM with 5 steps:
```
python train.py --energy distorted_gmm --distortion_coef 0.0 --epochs 25000 --seed 0 --T 5 --t_scale 5  --zero_init --clipping --lr_policy 1e-3 --lr_flow 1e-1 --replay_ratio_n 2 --gamma 0.9999 --batch_size 512 --clip_grad_norm 200 --discretizer harmonic_discretizer --process_param standard --data_dim 2 --pf_mode_fwd tb --pf_mode_bwd tb --pb_mode_fwd no_update --pb_mode_bwd no_update --exploratory --exploration_wd --exploration_factor 0.3 --both_ways --local_search --buffer_size 600000 --prioritized rank --rank_weight 0.01 --ld_step 0.1 --ld_schedule --target_acceptance_rate 0.574
```

To run a baseline $\text{TB}_{\theta}$ (learned var.) on 25GMM with 5 steps, add `--learned_variance` flag:
```
python train.py --energy distorted_gmm --distortion_coef 0.0 --epochs 25000 --seed 0 --T 5 --t_scale 5  --zero_init --clipping --lr_policy 1e-3 --lr_flow 1e-1 --replay_ratio_n 2 --gamma 0.9999 --batch_size 512 --clip_grad_norm 200 --discretizer harmonic_discretizer --process_param standard --data_dim 2 --pf_mode_fwd tb --pf_mode_bwd tb --pb_mode_fwd no_update --pb_mode_bwd no_update --learned_variance --exploratory --exploration_wd --exploration_factor 0.3 --both_ways --local_search --buffer_size 600000 --prioritized rank --rank_weight 0.01 --ld_step 0.1 --ld_schedule --target_acceptance_rate 0.574
```

To run an algorithm with learned destruction process $\text{TB}_{\theta, \varphi}$ on 25GMM with 5 steps:
```
python train.py --energy distorted_gmm --distortion_coef 0.0 --epochs 25000 --seed 0 --T 5 --t_scale 5  --zero_init --clipping --lr_policy 1e-3 --lr_flow 1e-1 --replay_ratio_n 2 --gamma 0.9999 --batch_size 512 --huber_loss_quantile 0.99 --clip_grad_norm 200 --grad_history_window_size 3000 --discretizer harmonic_discretizer --process_param standard --data_dim 2 --pf_mode_fwd tb --pf_mode_bwd tb --pb_mode_fwd tb --pb_mode_bwd tb --learned_variance --exploratory --exploration_wd --exploration_factor 0.3 --both_ways --local_search --buffer_size 600000 --prioritized rank --rank_weight 0.01 --ld_step 0.1 --ld_schedule --target_acceptance_rate 0.574 --learn_pb --pb_scale_range 0.9 --lr_back_multiplier 1.0 --tau 0.05 --target_pf --target_pb --use_2optimizers --share_backbone
```

To run an algorithm with learned destruction process $\text{TB}_{\theta}$, $\text{TLM}_{\varphi}$ on 25GMM with 5 steps, change `--pb_mode_fwd` flag on `tlm`:
```
python train.py --energy distorted_gmm --distortion_coef 0.0 --epochs 25000 --seed 0 --T 5 --t_scale 5  --zero_init --clipping --lr_policy 1e-3 --lr_flow 1e-1 --replay_ratio_n 2 --gamma 0.9999 --batch_size 512 --huber_loss_quantile 0.99 --clip_grad_norm 200 --grad_history_window_size 3000 --discretizer harmonic_discretizer --process_param standard --data_dim 2 --pf_mode_fwd tb --pf_mode_bwd tb --pb_mode_fwd tlm --pb_mode_bwd tb --learned_variance --exploratory --exploration_wd --exploration_factor 0.3 --both_ways --local_search --buffer_size 600000 --prioritized rank --rank_weight 0.01 --ld_step 0.1 --ld_schedule --target_acceptance_rate 0.574 --learn_pb --pb_scale_range 0.9 --lr_back_multiplier 1.0 --tau 0.05 --target_pf --target_pb --use_2optimizers --share_backbone
```

#### Distorted 25GMM:

To run a baseline $\text{TB}_{\theta}$ (fixed var.) on Distorted 25GMM with 5 steps, change `--distortion_coef` to $0.1$:
```
python train.py --energy distorted_gmm --distortion_coef 0.1 --epochs 25000 --seed 0 --T 5 --t_scale 5  --zero_init --clipping --lr_policy 1e-3 --lr_flow 1e-1 --replay_ratio_n 2 --gamma 0.9999 --batch_size 512 --clip_grad_norm 200 --discretizer harmonic_discretizer --process_param standard --data_dim 2 --pf_mode_fwd tb --pf_mode_bwd tb --pb_mode_fwd no_update --pb_mode_bwd no_update --exploratory --exploration_wd --exploration_factor 0.3 --both_ways --local_search --buffer_size 600000 --prioritized rank --rank_weight 0.01 --ld_step 0.1 --ld_schedule --target_acceptance_rate 0.574
```

To run a baseline $\text{TB}_{\theta}$ (fixed var.) on Distorted 25GMM with 10 steps, set `--T` to $10$:
```
python train.py --energy distorted_gmm --distortion_coef 0.1 --epochs 25000 --seed 0 --T 10 --t_scale 5  --zero_init --clipping --lr_policy 1e-3 --lr_flow 1e-1 --replay_ratio_n 2 --gamma 0.9999 --batch_size 512 --clip_grad_norm 200 --discretizer harmonic_discretizer --process_param standard --data_dim 2 --pf_mode_fwd tb --pf_mode_bwd tb --pb_mode_fwd no_update --pb_mode_bwd no_update --exploratory --exploration_wd --exploration_factor 0.3 --both_ways --local_search --buffer_size 600000 --prioritized rank --rank_weight 0.01 --ld_step 0.1 --ld_schedule --target_acceptance_rate 0.574
```

#### Hard Funnel:

To run a baseline $\text{TB}_{\theta}$ (fixed var.) on Hard Funnel with 10 steps:
```
python train.py --energy hard_funnel --epochs 25000 --seed 0 --T 10 --t_scale 1 --zero_init --clipping --lr_policy 1e-3 --lr_flow 1e-1 --replay_ratio_n 2 --gamma 0.9999 --batch_size 512 --clip_grad_norm 200 --discretizer uniform_discretizer --process_param standard --data_dim 10 --pf_mode_fwd tb --pf_mode_bwd tb --pb_mode_fwd no_update --pb_mode_bwd no_update --exploratory --exploration_wd --exploration_factor 0.2 --both_ways --local_search --buffer_size 600000 --prioritized rank --rank_weight 0.01 --ld_step 0.1 --ld_schedule --target_acceptance_rate 0.574 
```

To run an algorithm with learned destruction process $\text{TB}_{\theta}$ on Hard Funnel with 10 steps with all stabilisation techniques:
```
python train.py --energy hard_funnel --epochs 25000 --seed 0 --T 10 --t_scale 1 --zero_init --clipping --lr_policy 1e-3 --lr_flow 1e-1 --replay_ratio_n 2 --gamma 0.9999 --batch_size 512 --clip_grad_norm 200 --discretizer uniform_discretizer --process_param standard --data_dim 10 --pf_mode_fwd tb --pf_mode_bwd tb --pb_mode_fwd tb --pb_mode_bwd tb --learned_variance --exploratory --exploration_wd --exploration_factor 0.2 --both_ways --local_search --buffer_size 600000 --prioritized rank --rank_weight 0.01 --ld_step 0.1 --ld_schedule --target_acceptance_rate 0.574 --learn_pb --pb_scale_range 0.9 --lr_back_multiplier 0.001 --tau 0.05 --target_pf --target_pb --use_2optimizers --share_backbone 
```

#### Other

If you want to log results in Wandb, add the flag `--wandb`.

## References

This code is based on the published code for the paper of [Sendera, 2024](https://arxiv.org/abs/2402.05098).
