# Being More Lightweight and Practical: Mini-sized Contrastive Learning Pre-trained Models for Fine-Grained Traffic Tasks

This repository provides the **PyTorch implementation of MiniTraffic**, a lightweight pre-trained model tailored for **fine-grained traffic prediction**.

---

## ❓ Why Mini-sized?

Most existing pre-trained traffic models are large and focus on coarse-grained, road-level prediction. However, real-world applications—such as lane-level traffic control—demand more fine-grained forecasting under limited data and computational resources.

**MiniTraffic is designed for this scenario:**

- 🔍 **Fine-grained need:** Lane-level prediction is crucial for precise traffic management, but the data is often sparse and noisy.
- ⚙️ **Efficiency matters:** Large models are hard to train, deploy, and maintain—especially on edge devices or in real-time.
- 🚫 **Limited generalization:** Existing pre-trained models often fail to transfer effectively to lane-level tasks.

✅ **MiniTraffic tackles these challenges via a compact pre-training framework that:**

- Leverages only road-level data during pre-training  
- Simulates lane-level variation using frequency-domain augmentation  
- Learns transferable representations for downstream fine-tuning  
- Maintains a compact model size for fast, resource-efficient deployment

---

## 🧩 Key Designs

![MiniTraffic Overview](./fig/Mini_Traffic.png "Overview of MiniTraffic.")

- 🔁 **Frequency Domain Stability Augmentation (FDA):**  
  Exploits the correlation between road- and lane-level traffic patterns to simulate realistic lane-level data, enabling effective cross-granularity transfer.

- 🧊 **Contrastive Clustering Backbone:**  
  Uses contrastive learning to cluster spatio-temporally similar patches, and applies attention-based message passing within sparse, small-scale patch graphs. This design reduces computational cost while preserving structure.

---

## 🛠️ Requirements

- Python 3.8  
- PyTorch 1.11.0  
- NumPy  
- SciPy  
- Pandas  
- Matplotlib  
- Scikit-learn  
- Argparse  

Install dependencies via:

```bash
pip install -r requirements.txt
```

---

## 🗂️ Project Structure

```plaintext
MiniTraffic/
├── dataset/                                        #  Datasets (PEMS, METR-LA, HuaNan, etc.)
├── fig/                                            #  Visual assets (e.g., model overview)
├── Mini_Traffc_Prediction/src/utils.py             #  Utilities for data processing, graph building
├── Mini_Traffc_Prediction/src/models/              #  Model components (FDA, Backbone, etc.)
├── requirements.txt                                #  Dependencies
├── Mini_Traffc_Prediction/minitraffic_pretrain.py  #  Entry script for pre-training
├── Mini_Traffc_Prediction/minitraffic_finetune.py  #  Entry script for fine-tuning
```

---

## 📂 `dataset/` — Traffic Datasets

Place ll datasets in the `dataset/` directory.

-  For **METR-LA** and **PEMS-BAY**, refer to the [DCRNN repository](https://github.com/liyaguang/DCRNN)
-  For **HuaNan**, **PeMS**, and **PeMS_F**, see the [Lane-Level Traffic-Benchmark](https://github.com/ShuhaoLii/LaneLevel-Traffic-Benchmark)

You can also use your own data—just ensure it follows the same format and is stored in `dataset/`.

---

## 🧠 Mini_Traffc_Prediction/` — Pre-training MiniTraffic

Pre-training uses only road-level data to simulate lane-level behavior through frequency-domain augmentation and contrastive clustering.

To launch pre-training, run:

```bash
python minitraffic_pretrain.py
```

## 🔧 `Mini_Traffc_Prediction/` — Fine-tuning on Downstream Tasks

MiniTraffic supports both road-level and lane-level fine-tuning with minimal computation.

To fine-tune on a downstream dataset, run:

```bash
python minitraffic_finetune.py
```
