# Environment setup

Create environment
```
conda create -n car4cast python=3.10
conda activate car4cast
conda install conda-forge::ffmpeg
```

Install Pandaset devkit
```
pip install git+https://github.com/scaleapi/pandaset-devkit.git@master#subdirectory=python
```

Install langchain and openai for forecasting
```
pip install openai
pip install -qU "langchain[openai]"
pip install -U langgraph
```

Install ArgoVerse devkit following https://argoverse.github.io/user-guide/getting_started.html#installing-via-pip 

Others
```
pip install -r requirements.txt
```

```
pip uninstall polars
pip install polars-lts-cpu
```
# Data processing

For each of the following source datasets, we show how to produce JSON data in the Car4Cast format and BEV pointmaps. JSON data will be now saved in raw format, i.e., without partitioning into history/forecast and without spatial clustering.

## ArgoVerse
Follow the directions (https://argoverse.github.io/user-guide/getting_started.html#downloading-the-data) to download the ArgoVerse2 **sensor** dataset.

Update the variables in the beginning of `process/process_argoverse.py` and `map/generate_map_argoverse.py`.
```
fps = 2  # desired annotation frequency in Hz
root_dir = '/your/root/dir'
dataset_name = 'argoverse3D'
split_name = 'train' OR 'val' OR 'test'
out_dir = '/your/output/dir/'
```
The dataset should live at `{root_dir}/{dataset_name}/sensor/{split_name}`.

Run
```
python process/process_argoverse.py
python map/generate_map_argoverse.py
```

## CADC
Follow the directions to download CADC (https://github.com/mpitropov/cadc_devkit/blob/master/download_cadcd.py).

Update the variables in the beginning of `process/process_cadc.py` and `map/generate_map_cadc.py`.

```
dataset_root = '/path/to/cadcd'
out_dir = '/your/output/dir/'
```

Run
```
python process/process_cadc.py
python map/generate_map_cadc.py
```

Since CADC has a native annotation frequency of 3 Hz (not a multiple of the Car4Cast frequency of 2 Hz), the resulting JSON files need to be downsampled.

Run
```
process/resample_dict.py --source_freq 3 --target_freq 2 --json_dir /your/previous/output/dir --out_dir /your/new/output/dir
```

## KITTI
Follow the directions to download the KITTI 3D Object Detection dataset (https://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d).

Make sure your directory has the following subdirectories, dowloaded and extracted from the corresponding links on the KITTI webpage:
- `calib_02`: "Download camera calibration matrices of object data set (16 MB)"
- `label_02`: "Download training labels of object data set (5 MB)"
- `velodyne`: "Download Velodyne point clouds, if you want to use laser information (29 GB)"
- `oxts_02`: "Download GPS/IMU data, if you want to use map information (8 MB)" from https://www.cvlibs.net/datasets/kitti/eval_tracking.php

Update the variables in the beginning of `process/process_kitti.py` and `map/generate_map_kitti.py`.

```
fps = 2  # desired annotation frequency in Hz
data_root = '/path/to/kitti'
out_root = '/your/output/dir/'
```

Run
```
python process/process_kitti.py
python map/generate_map_kitti.py
```

## NuScenes

Follow the directions to download NuScenes (https://www.nuscenes.org/nuscenes#download)

Update the variables in the beginning of `process/process_nuscenes.py` and `map/generate_map_nuscenes.py`.

```
data_root = '/path/to/nuscenes'
out_root = '/your/output/dir/'
```

Run
```
python process/process_nuscenes.py
python map/generate_map_nuscenes.py
```

## PandaSet

This is the only case where JSON files will be already partitioned, so there is no need to apply further processing steps (details will be explained in the following sections).

Download and extract PandaSet from HuggingFace (https://huggingface.co/datasets/georghess/pandaset/tree/main)

Update the variables in the beginning of `process/process_pandaset.py` and `map/generate_map_pandaset.py`.

```
data_root = '/path/to/pandaset'
output_root = '/your/output/dir/'
```

Run
```
python process/process_pandaset.py
python map/generate_map_pandaset.py
```

## Data partitioning

For all datasets except PandaSet, raw JSON files need to be subdivided in scenes of 8 historical and 8 future steps, as well as clustered spatially to ensure ~10 instances are in each final scene.

Run
```
python process/split_dicts.py
```

with the following command line arguments:
| Argument              | Description                                                                                           |
|-----------------------|---------------------------------------------------------------------------------------------------|
| `--num_history 8`   | Number of history steps (8)                                      |
| `--num_forecast 8`  | Number of forecast steps (8)                                |
| `--json_dir`| Path to the directory of source, full-scene JSON files |
| `--out_dir`       | Path to the directory to save divided JSON files (in `out_dir/history` and `out_dir/forecast_gt`)                       |

## Dataset structure
Your dataset directory should have the following structure:

```
/path/to/ds
|---- bounds
|---- |---- scene_id.pkl
|---- |---- ...
|---- forecast_gt
|---- |---- scene_id.json  
|---- |---- ...
|---- history
|---- |---- scene_id.json  
|---- |---- ...
|---- maps
|---- |---- scene_id.png  
|---- |---- ...
|---- maps_with_bounds
|---- |---- scene_id.png  
|---- |---- ...
```

## Decimal truncation & map filtering
We finally truncate all float numbers to 3 decimal places and filter out scenes where the map modality is not available (because the original pointcloud was not available).

Both operations are carried out by `process/convert_decimal.py`, where you need to specify your history, forecast, and map directories at the end of the script.

```
src_dirs = [
        "/path/to/your/history",
        "/path/to/your/forecast_gt",
    ]    
dst_dirs = [
        "/path/to/new/history",
        "/path/to/new/forecast_gt",
    ]
check_dirs = [
       "/path/to/your/bounds",
       "/path/to/your/bounds",
    ]
```

## Conversion to ChatML
To convert the dataset into the ChatML format (useful for finetuning), run:

```
python process/convert_to_chat_jsonl.py \
        --ds_root /path/to/ds \
        --prompt_file /path/to/prompt.txt \
        --out_dir /path/to/ds          # default: same as ds_root
```

# Forecasting

We use the OpenAI API to perform forecasting experiments with pre-trained LLMs. The dedicated script is `forecast/forecast_deep.py` (or `forecast/forecast_deep_nothink.py` to deactivate reasoning in Qwen3 models).

Add your API key in the beginning of the script by either specifying its path or setting it directly:

```
# Create an OpenAI client with your token and endpoint
with open("/path/to/key.txt") as f:
    api_key = f.read().strip()
openai = OpenAI(
    api_key=api_key,   
    base_url="your/provider/endpoint",
)
```

To start forecasting on an individual scene, run either script with the following arguments:

| Argument             | Description                                                                                         |
|----------------------|-----------------------------------------------------------------------------------------------------|
| `--model`            | The model to use (default: `'meta-llama/Meta-Llama-3-8B-Instruct'`).                                |
| `--json`             | Path to the input historical JSON file.                                                          |
| `--txt`              | Path to the input text file containing the prompt (default: `'prompts/forecast_A_3D.txt'`).           |
| `--out`              | Path to the output directory where responses will be saved.                           |
| `--maps`             | Optional. Path to a directory containing map PNG files, used for multimodal forecasting.            |

## Linear baseline

To foreacast with the linear baseline on a set of scenes, run `python forecast/forecast_linear.py` with the following arguments:

| Argument         | Description                                                         |
|------------------|---------------------------------------------------------------------|
| `--history_dir`  | Path to the directory of historical JSON files.       |
| `--out_dir`      | Path to the output directory for forecasted JSON files. |


# Evaluation

Please refer to `eval/README_eval.md`.

# Visualization

We provide a script to visualize animated historical and future trajectories, and (optionally) trajectories predicted by a model.

Run
```
python visualize/vis_forecasts.py
```

with the following command line arguments:
| Argument              | Description                                                                                           |
|-----------------------|---------------------------------------------------------------------------------------------------|
| `-H`, `--history_file`   | Path to the historical input data file.                                      |
| `-F`, `--forecast_file`  | Path to the ground-truth forecast data file.                                |
| `-P`, `--prediction_file`| Path to the predicted forecast data file. |
| `-O`, `--output_dir`       | Path to the directory to save output video.                       |

Historical trajectories will be shown in blue, future ones in green, and predicted ones in yellow.