# Paper

## Installation 
### Create en environment and activate it
```shell
python3 -m venv <location-of-env>
source <location-of-env>/bin/activate
```
Or use Conda. 

### Install JAX 
```shell
pip install --upgrade pip
pip install --upgrade "jax[cpu]"
```
To enable GPU computations, use instead
```shell 
pip install --upgrade pip

# CUDA 12 installation
# Note: wheels only available on linux.
pip install --upgrade "jax[cuda12_pip]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html

# CUDA 11 installation
# Note: wheels only available on linux.
pip install --upgrade "jax[cuda11_pip]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
```

### Install remaining dependencies 
```shell
cd <this-directory>
pip install -r requirements.txt
```

## Data 

In this codebase we include runs generated using the code from Campbell et al. at https://github.com/andrew-cr/online_var_fil, where we have put the data and results in the folder `data/crnn/dim_5_500_obs`.

## Experiments 

Experiments for the paper are in the notebook of the root folder, which should run from start to finish.

## Some settings

There multiple ELBO modes depending on the data:
- When $p$ and $q$ are both linear-Gaussian HMMs, setting `elbo_mode=closed_form` will perform training by differentiation through the analytical recursions of the ELBO, but this has only been programmed for the offline scenario.
- In other cases, there are multiple options:
    - For an unbiased offline elbo, set `elbo_mode=autodiff_on_batch`. This will compute the ELBO by drawing trajectories from $q_{0:t}^\lambda$ via the backward kernels. 
    - For an online ELBO, set `elbo_mode=score,resampling,bptt_depth_2` which corresponds to our elbo and gradients using the resampling technique and with a backpropagation truncated at $\Delta=2$. This is the most common setting used in our experiments. If you remove `resampling` you will get heavier updates with similar performance
    - In the non-amortized setting, set `elbo_mode=score,truncated,resampling,bptt_depth_2` to get the ELBO updates associated with $G_{t-1} =0$ as explained in Appendix.

- For training regimes, there are multiple options:
    - The offline (batch) mode corresponds to `training_mode=reset,<length>,<num-grad-steps>` meaning that the ELBO gradients will be reset after processing `<length>` observations, and that `<num-grad-steps>` will be taken after processing this length. In the paper we only explore `<num-grad-steps>` set to 1 in this offline case. Note that if you set `<length>` inferior to the length of the sequence you're training on you will get ELBO and gradient updates for consecutive sub-batches of your original sequence (not explored in the paper).
    - The online mode can either be `training_mode=streaming,<num-grad-steps` or `training_mode=streaming,<num-grad-steps>,difference` where `<num-grad-steps>` corresponds to the number of parameter updates per timestep. The `difference` term means that we use updates according to $\nabla (\mathcal{L}_t^\lambda - \mathcal{L}_{t-1}^\lambda)$. 