# MP-LSTM: Multi-step Ship Trajectory Prediction

This repository provides a Python implementation for reproducing the **MP-LSTM** method proposed in the paper:
> **A novel MP-LSTM method for ship trajectory prediction based on AIS data**  
> *Da-wei Gao et al., Ocean Engineering, 2021*
The MP-LSTM framework combines **historical trajectory matching**, **support point prediction**, and **cubic spline interpolation** to achieve accurate multi-step ship trajectory prediction based on AIS data.



## Project Structure

MP-LSTM/
├── LSTM-Public/        # Main MP-LSTM implementation
│ ├── create_database.py        # AIS preprocessing & trajectory database construction
│ ├── process.py       # Trajectory processing logic
│ ├── train_model.py       # MP-LSTM training pipeline
│ ├── inference.py       # Inference & trajectory prediction
│ └── utils/
│  ├── earth_computation.py         # Geodesic & coordinate utilities
│  ├── metrics.py        # Evaluation metrics (Frechet, curvature, etc.)
│  └── visualization.py 
│
├── mp_lstm_models2/ # Pre-trained MP-LSTM models
│ ├── lstm_longitude.pth # Longitude support-point LSTM
│ ├── lstm_latitude.pth # Latitude support-point LSTM
│ └── scalers.pkl # Feature & target scalers
│
└── README.md        



## Method Overview

The MP-LSTM framework consists of four main components:

1. **AIS Data Preprocessing**  
   - Time alignment via cubic spline interpolation  
   - Noise reduction using median filtering  

2. **Reference Trajectory Generation**  
   - Historical trajectories are filtered based on motion similarity  
   - Min-max normalization and MSE are used to select the closest reference trajectory  

3. **Support Point Prediction (LSTM)**  
   - Longitude and latitude displacements are predicted independently  
   - Input features: displacement, speed (SOG), and course (COG)  

4. **Multi-step Trajectory Reconstruction**  
   - Destination point obtained from reference trajectory  
   - Full trajectory generated via cubic spline interpolation  



## Environment Requirements

- Python ≥ 3.8  
- PyTorch  
- NumPy  
- Pandas  
- Scikit-learn  
- SciPy  
- Matplotlib  
You may install dependencies via:
```bash
pip install numpy pandas scipy scikit-learn matplotlib torch
```


## Data Preparation

AIS data should contain at least the following fields:
* `mmsi`
* `postime`
* `lat`
* `lon`
* `sog`
* `cog`

To build the trajectory database:
```bash
python LSTM-Public/create_database.py
```
This step preprocesses AIS data and generates fixed-length trajectory segments for training and testing.



## Training

To train the MP-LSTM model from scratch:
```bash
python LSTM-Public/train_model.py
```

This will:
* Load the trajectory database
* Train two independent LSTM models (longitude & latitude)
* Save model weights and scalers



## Inference

To perform trajectory prediction using trained models:
```bash
python LSTM-Public/inference.py
```
You may also directly use the provided pretrained models in `mp_lstm_models2/`.



## Required Intermediate Files

The inference pipeline requires the following intermediate files:
- `trajectory_database_288.pkl`
- `test_trajectories.pkl`
These files are **not included in this repository** due to file size limitations on OpenReview.

### File Descriptions

- **trajectory_database_288.pkl**  
  A serialized trajectory database containing fixed-length trajectory segments (history length = 288), generated from raw AIS data.  
  This file is used as the **reference trajectory database** during inference.

- **test_trajectories.pkl**  
  A serialized test set of trajectory segments used for evaluation and inference.

### How to Generate These Files

Both files can be **automatically generated** using the provided preprocessing scripts.
Specifically, database is produced by running the trajectory processing logic in:
```
LSTM-Public/create_database.py
```
trajectory segments for inference is produced by:
```
LSTM-Public/process.py
```
depending on the dataset used.

After preprocessing, the generated trajectory database can be saved as:
```python
db.save_database("trajectory_database_288.pkl")
```

The test trajectories are obtained by splitting the full trajectory database during preprocessing or training, and can be saved as:
```python
with open("test_trajectories.pkl", "wb") as f:
    pickle.dump(test_trajectories, f)
```

Thus, all required intermediate files for inference can be **fully reproduced** from raw AIS data using this repository.


## Evaluation

Trajectory prediction performance can be evaluated using:
* Positional Mean Square Error (MSE)
* Curvature MSE
* Fréchet Distance
These are implemented in:
```
LSTM-Public/utils/metrics.py
```

## Notes

* Longitude and latitude displacements are modeled independently, following the paper.
* This repository is an **independent re-implementation** of the MP-LSTM method described in the paper; the original authors' code has not been released.



## Reference

```
Gao, D., Zhu, Y., Zhang, J., He, Y., Yan, K., & Yan, B. (2021).
A novel MP-LSTM method for ship trajectory prediction based on AIS data.
Ocean Engineering, 228, 108956.
```