

This repository contains a streamlined training pipeline for the 3DSAV QA task with DualLPSS.
Early LPSS guides cross-modal fusion in the unified encoder, and Late LPSS injects logits bias
in the QA head for answer prediction.

## Install

```
conda env create -f environment.yml
conda activate ./.conda_env/3dsav_dual_lpss
```

## Data

All dataset paths are relative to the repo root. The loader expects:

```
data/scanfamily/
  annotations/
    qa/
      3DSAV_v1.0_train.json
      3DSAV_v1.0_val.json
      3DSAV_v1.0_test.json
    meta_data/
      scannetv2_raw_categories.json
      scannetv2-labels.combined.tsv
      cat2glove42b.json
  scan_data/
    instance_id_to_gmm_color/
    instance_id_to_loc/
    instance_id_to_name/
    pcd_with_global_alignment/
  save_mask/  # optional, only for pc_type: pred
```

If you use a custom subset, place it under `data/scanfamily/annotations/qa/<subdir>/` and set
`qa_dataset.args.data_subdir` in `configs/3dsav_dual_lpss.yaml`.

## Weights

- BERT weights (local files only): `pretrain_weights/bert-base-uncased`
- Optional 3DSAV pretrained checkpoint: `project/pretrain_weights/3dsav.pth`
  If you do not have it, set `pipeline.restore_model: false` in `configs/3dsav_dual_lpss.yaml`.

## PointNet2 Ops

```
cd model/vision/pointnet2
python setup.py install
```

## Training

Single GPU:

```
python run.py --config configs/3dsav_dual_lpss.yaml
```

Multi-GPU (DDP):

```
torchrun --nproc_per_node=4 run.py --config configs/3dsav_dual_lpss.yaml
```

## Outputs

With the experiment manager enabled (default), outputs are written to:
`experiments/dual_lpss_<timestamp>/{tensorboard,checkpoints,logs}`.
