
# Generative Modeling of Battery Degradation with Flow Matching and Diffusion Transformers

## Abstract

Battery degradation remains a critical challenge in the pursuit of green technologies and sustainable energy solutions. Despite significant research efforts, predicting battery cycle life accurately remains difficult due to the complex interplay of aging and cycling behaviors. To address this challenge, we introduce FlowBatt, a general-purpose model for battery degradation prediction and synthesis trained via flow matching. FlowBatt leverages a scalable diffusion transformer (DiT) backbone, enabling high expressivity and scalability. The model operates as a probabilistic predictor of entire cycle life trajectories and as a generative model capable of synthesizing realistic degradation curves for data augmentation. We demonstrate the advantages of flow-based generative approaches by comparing models trained with flow matching, diffusion processes, and supervised learning. FlowBatt achieves results that are comparable to or better than state-of-the-art performance for the remaining useful life prediction task and provides accurate and generalizable state-of-health predictions while capturing uncertainty in aging dynamics. Beyond prediction accuracy, this work advances the development of foundational and scalable models for battery degradation.


## Repository Structure

    ├── batlinet/              # Data preprocessing (forked and modified)
    ├── cache/                 # Cache directory
    ├── configs/               # Configurations for experiments
    ├── workspaces/            # Pre-trained models
    ├── Configs.py             # Training config for FlowBatt
    ├── Configs_diffbatt.py    # Training configs for DiffBatt
    ├── Configs_trf.py         # Training Configs for Transformer
    ├── data_utils.py          # Data loading & preprocessing
    ├── nns.py                 # FlowBatt model implementation
    ├── nns_diffbatt.py        # DiffBatt model implementation
    ├── nns_trf.py             # Transformer model implementation
    ├── pipeline.py            # Data pipeline
    ├── plot_helper.py         # Plotting utilities
    ├── pp_utils.py            # Post-processing utilities
    ├── requirements.txt       # Python dependencies
    ├── run_model.py           # Testing pre-trained models
    ├── training.py            # Training script for FlowBatt
    ├── training_diffbatt.py   # Training script for DiffBatt
    ├── training_trf.py        # Training script for Transformer


## Requirements

    requirements.txt

-----

### Running Pre-trained Models

Pre-trained models can be run using the `run_model.py` script. The script requires the model architecture and dataset to be specified as arguments.

To reproduce the results from the paper, the following command is run in the terminal:

```bash
python run_model.py --model flowbatt --dataset matr_1_Q
```

The model and dataset can be swapped out by changing the values for the `--model` and `--dataset` arguments.

**Available Models (`--model`):**

  * `flowbatt`
  * `diffbatt`
  * `trf`

**Available Datasets (`--dataset`):**

  * `matr_1_Q`
  * `matr_2_Q`
  * `hust_Q`
  * `mix_100_Q`
  * `mix_20_Q`

For example, to run the `flowbatt` model on the `hust_Q` dataset, the following command is used:

```bash
python run_model.py --model flowbatt --dataset hust_Q
```
