# Information Geometry Loss for Time Series Forecasting

## :sparkles: Abstract

![model](InfoGeo_Loss.png)

Time series forecasting plays a critical role in various domains, from financial markets to climate prediction. While existing methods have achieved remarkable progress, most rely on point-wise loss functions (e.g., MSE, MAE) that treat predictions as deterministic values, overlooking the inherent uncertainty and distributional characteristics of time series data.

To address this limitation, we propose **InfoGeo Loss**, a novel loss function grounded in **Information Geometry** that captures the geometric structure of probability distributions in time series. By modeling predictions and ground truth as probability distributions rather than point estimates, InfoGeo Loss leverages two key components:

1. **Fisher-Rao Distance**: Measures the intrinsic geometric distance between distributions on the statistical manifold
2. **Bregman Divergence**: Quantifies distributional discrepancies through KL divergence

Our approach introduces a **gradient-based dynamic weighting mechanism** that automatically balances these two components during training, adapting to the learning dynamics of the model. InfoGeo Loss seamlessly integrates with existing forecasting architectures and can be combined with traditional point-wise losses and structural losses (e.g., PS Loss) for enhanced performance.

**Key Contributions:**
- A principled information-geometric framework for time series forecasting
- Dynamic gradient-based weighting for adaptive loss balancing
- Compatibility with existing architectures and loss functions
- State-of-the-art performance across multiple benchmarks

InfoGeo Loss establishes a new paradigm for time series forecasting by bridging the gap between deterministic predictions and probabilistic modeling, providing both theoretical rigor and practical effectiveness.

## :sparkles: Usage

The InfoGeo loss is implemented in the `./[model_name]/exp/exp_main.py` and `./[model_name]/exp/exp_long_term_forecasting.py` files.

### 1. Installation

Install PyTorch and the necessary dependencies:

```bash
pip install -r requirements.txt
```

### 2. Dataset Preparation

Download the datasets from [Google Drive](https://drive.google.com/file/d/1l51QsKvQPcqILT3DwfjCgx8Dsg2rpjot/view?pli=1) and store them in the `datasets` folder.

### 3. Training and Evaluation

We provide experiment scripts for all backbone models under the `./scripts/` folder. You can reproduce the results using the following commands. Results will be stored in `./logs/MSE/` and `./logs/InfoGeo/`.

#### PatchTST Example:
```bash
cd PatchTST

# Train with MSE loss only
bash ./scripts/ETTh1.sh 0

# Train with InfoGeo loss
bash ./scripts/ETTh1.sh 1
```

#### iTransformer Example:
```bash
cd iTransformer

# Train with MSE loss only
bash ./scripts/ETTh1.sh 0

# Train with InfoGeo loss
bash ./scripts/ETTh1.sh 1
```

#### TimeMixer Example:
```bash
cd TimeMixer

# Train with MSE loss only
bash ./scripts/ETTh1.sh 0

# Train with InfoGeo loss
bash ./scripts/ETTh1.sh 1
```

#### TimesNet Example:
```bash
cd TimesNet

# Train with MSE loss only
bash ./scripts/ETTh1.sh 0

# Train with InfoGeo loss
bash ./scripts/ETTh1.sh 1
```

### 4. Hyperparameters

Key hyperparameters for InfoGeo Loss:

- `--use_infogeo_loss`: Enable InfoGeo Loss (0: disabled, 1: enabled)
- `--infogeo_lambda`: Weight coefficient for InfoGeo Loss (default: 0.5)
- `--window_size`: Window size for distribution estimation (default: 5)

You can also combine InfoGeo Loss with PS Loss:

- `--use_ps_loss`: Enable PS Loss (0: disabled, 1: enabled)
- `--ps_lambda`: Weight coefficient for PS Loss (default: 0.1)
- `--patch_len_threshold`: Patch length threshold (default: 24)

### 5. Supported Datasets

- **ETT** (Electricity Transformer Temperature): ETTh1, ETTh2, ETTm1, ETTm2
- **Electricity**: ECL
- **Weather**: Weather
- **Exchange Rate**: Exchange

### 6. Supported Models

- **PatchTST**: Patch-based Transformer for time series
- **iTransformer**: Inverted Transformer architecture
- **TimeMixer**: Multi-scale mixing for time series
- **TimesNet**: Temporal 2D-variation modeling
- **DLinear**: Decomposition Linear model

## :chart_with_upwards_trend: Results

InfoGeo Loss consistently improves forecasting performance across different models and datasets. Detailed results can be found in our paper.


## :sparkling_heart: Acknowledgements

We appreciate the following GitHub repositories for their valuable code and efforts:

- **iTransformer**: https://github.com/thuml/iTransformer
- **PatchTST**: https://github.com/yuqinie98/PatchTST
- **TimeMixer**: https://github.com/kwuking/TimeMixer
- **DLinear**: https://github.com/cure-lab/LTSF-Linear
- **TimesNet**: https://github.com/thuml/Time-Series-Library