# OUFlow

This is a PyTorch implementation of OUFlow, a generative model for multivariate irregular time series.

## Table of Contents

- [Requirements](#requirements)
- [Usage](#usage)

## Requirements

The code has been tested with `Python 3.11.5` and `torch==2.5.1`.
Other requirements are listed in the `requirements.txt`.

## Usage

### Use Prepared Dataset

- You can train and test the OUFlow model by following these procedures.

#### 1. Prepare dataset

```bash
python main.py --config [config_file_name] -d
```

- For Solar dataset, please download `solar_AL.txt` from [here](https://github.com/laiguokun/multivariate-time-series-data) and place it in the `./data` directory and then run the script above.

- `config_file_name` is the relative path of the config file from `./config` without ".yaml" extension.
- For example:

```bash
python main.py --config lorenz63 -d
```

#### 2. Train the model

```bash
python main.py --config [config_file_name] -t
```

- The checkpoint with the lowest validation loss is saved in `./output/[experiment_name]/checkpoint`.

#### 3. Test the model

```bash
py -m benchmark.eval --config [config_file_name] --checkpoint [checkpoint_file] -[g, f, i]
```

- `checkpoint_file` must be specified as the relative path from `./output/[experiment_name]/checkpoint`.
- Option -g, -f, and/or -i must be specified for generation, forecasting, and/or imputation task, respectively.
- For example:

```bash
py -m benchmark.eval --config lorenz63 --checkpoint checkpoint_000000_0.000e+00.pt -g
```

- The evaluation result is exported to `./output/[experiment_name]/benchmark`.

### Use your dataset 

#### 1. Prepare config file and dataset

- The config file should be placed in `./config`.
- Dataset must be prepared as a `PackedSequence` of PyTorch.
    - Create `./output/[experiment_name]/dataset` and dataset must be placed therein. `experiment_name` is specified in config file.
    - The dataset files must be named as `train_x.pt`, `train_t.pt`, `val_x.pt`, and `val_t.pt`. Files with "_x" correspond to target variables and those with "_t" are corrsponding timepoints.
    - `train_x` and `val_x` pack some scenarios with the shape (n_timestamps, dim), and `train_t` and `val_t` pack corresponding timepoints with the shape (n_timestamps, 1); after padding, their shapes are (n_scenario, n_timestamps, dim) and (n_scenario, n_timestamps, 1).

#### 2. Train the model

```bash
python main.py --config [config_file_name] -t
```

- The checkpoint with the lowest validation loss is saved in `./output/[experiment_name]/checkpoint`.

#### 3. Use the model

- First, load the trained OUFlow model using code such as following:

```python
import yaml
import torch
from torch import Tensor
from torch.nn.utils.rnn import pack_sequence
from src.nn import OUFlow

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

config_path = [your config path]
with open(config_path, 'r') as f:
    config = yaml.safe_load(f)
dim = 3 # dim. of target variables
model = OUFlow(dim, **config['model']).to(device)
checkpoint_path = [your checkpoint path]
model.load_state_dict(torch.load(checkpoint_path, weights_only=False)['model'])
model.eval()
```

- Adjust the placeholders like [your config path] and [your checkpoint path] to match your actual paths.
- By using the loaded model, you can generate time series by following way:

```python
n_sample = 16   # Number of generated path
ts_pred = torch.arange(0, 1, 0.01, device=device).unsqueeze(0).unsqueeze(-1) # The timepoints of generated time series given by a Tensor with shape (batch_size, n_timestamps, 1)

# Generate from prior
with torch.no_grad():
    xs_pred = model(ts_pred, n_sample)  # Generated time series with the shape (batch_size, n_sample, n_timestamps, dim)

# Generate with conditions
ts_cond = pack_sequence([Tensor([[0.0], [0.1]]).to(device, torch.double)])   # The timepoints of condition data, which packs batch_size scenarios of Tensor with the shape (n_time_condition, 1)
xs_cond = pack_sequence([Tensor([[1.0, 0.0, 0.0], [0.9, 0.0, 0.1]]).to(device, torch.double)])   # The target variables of condition data, which packs batch_size scenarios of Tensor with the shape (n_time_condition, dim)
with torch.no_grad():
    xs_pred = model(ts_pred, n_sample, xs_cond, ts_cond)
```
