# Capacity Large Model for Trajectory Prediction

This directory contains the implementation of the **Capacity Large Model** used in our paper for trajectory prediction, based on the **MiniMind** architecture with supervised fine-tuning (SFT) and trajectory enrollment.
This model is provided **alongside MP-LSTM** as a complementary method, representing a large-capacity, representation-based approach to trajectory forecasting.



## Directory Structure

CapacityLargeModel/
├── data_preprocessing/          # Data preprocessing notebooks (available after acceptance)
│   ├── data_processing.ipynb
│   └── dataset_preperation.ipynb
│
├── models/                       # Model architectures
│   ├── model.py                  # Base model utilities
│   ├── model_minimind.py         # MiniMind backbone
│   ├── model_minimind_sft.py     # Supervised fine-tuning (SFT) model
│   └── model_minimind_final.py   # Final model with trajectory enrollment
│
├── utils/                        # Utility functions
│   ├── dataloader_Public.py      # Public dataset loader
│   ├── earth_computation.py      # Geodesic & coordinate utilities
│   └── metrics.py                # Evaluation metrics
│
├── weights_pretrain/             # Pretrained MiniMind weights
│   └── 830_statedict_0.16575811230219328.pth
│ 
├── weights_sft/                  # Supervised fine-tuned weights
│   ├── 17_statedict_large_95.75.pth
│   └── 28_statedict_light_96.86.pth
│ 
├── samples/             # Qualitative prediction examples
│   └── trajectory_prediction.png   # Prediction result figures
│
├── enrolled_trajectory.npy       # Enrolled trajectory memory
├── evaluate_final_dataloader_Public.py    # Evaluate the model
└── README.md



## Method Overview

The Capacity Large Model follows a **representation-based trajectory prediction paradigm**, consisting of three stages:

1. **Pretraining (MiniMind Backbone)**  
   A transformer-based sequence model is pretrained on large-scale trajectory data to learn generic motion representations.

2. **Supervised Fine-Tuning (SFT)**  
   The pretrained model is fine-tuned with trajectory supervision to enhance destination-aware representation learning.

3. **Trajectory Enrollment & Retrieval**  
   - Representative trajectories are encoded and stored in a latent space (`enrolled_trajectory.npy`)
   - During inference, test trajectories are matched against enrolled trajectories via cosine similarity
   - The most similar enrolled trajectory provides destination guidance for prediction

This approach leverages **model capacity + memory-based retrieval** to improve long-horizon prediction stability.



## Environment Requirements

- Python ≥ 3.8  
- PyTorch ≥ 1.13  
- Transformers  
- NumPy  
- Pandas  
- Scikit-learn  
- Matplotlib  

Example installation:
```bash
pip install torch transformers numpy pandas scikit-learn matplotlib
```


## Data Preparation (will be available after acceptance)

Raw trajectory data should be placed under the `data/` directory in CSV format.
Data preprocessing and dataset construction are provided in:
```
data_preprocessing/
├── data_processing.ipynb
└── dataset_preperation.ipynb
```

These notebooks:
* Clean and segment trajectory data
* Compute auxiliary features (e.g., speed, heading, coastline distance)
* Prepare training and evaluation datasets



## Trajectory Enrollment

The enrolled trajectory memory is stored in:
```
enrolled_trajectory.npy
```
This file contains:
* Encoded trajectory representations
* Corresponding destination labels


## Qualitative Results

The `samples/` directory contains qualitative visualization results used for analysis and demonstration purposes, including:
- Predicted trajectory figures (`.png`) illustrating long-horizon forecasting behavior
These files are **not required for training or evaluation**, but are provided to facilitate qualitative inspection and result interpretation.



## Evaluation

To evaluate the final Capacity Large Model on public dataset (AIS_dataset):
```bash
python evaluate_final_dataloader_Public.py
```

The evaluation script:
* Loads pretrained + SFT model weights
* Performs autoregressive multi-step prediction
* Reports metrics including:
  * Mean squared positional error
  * Fréchet distance
  * Mean squared error of curvature



## Code Availability

For this Capacity Large Model, **training and standalone inference scripts are not included** in this repository.
This repository provides:
- The complete **model architectures**
- The **final pretrained and SFT model weights**
- The **trajectory enrollment mechanism**
- The **evaluation pipeline** used in the paper

This is sufficient to:
- Reproduce the reported experimental results
- Evaluate the model on the provided datasets
- Analyze the model behavior and prediction performance

The exclusion of separate training and inference scripts does **not affect the reproducibility of the experimental results** presented in the paper.



## Notes

* This model is **independent of MP-LSTM** and uses a fundamentally different prediction paradigm.
* Public datasets are used for evaluation by default.
* Longitude and latitude are modeled jointly in a unified representation space.
* This code is intended for **research and reproducibility purposes**.