# Model — SNN‑Driven Multimodal HAR

This module implements the SNN-based multimodal model and training pipeline. It integrates:
- SGN (skeleton backbone)
- Spiking Mamba + SPS (event backbone)
- SSE (Sparse Semantic Extractor)
- SCM (Spiking Cross‑Mamba) fusion
- DIB (Discretized Information Bottleneck)

## Folder Structure (high‑level)
- `feeders/`: dataloaders and utilities
- `graph/`: graph definitions (e.g., NTU graph)
- `mmodel/`: model implementations
  - `Skeletonbackbone.py`: SGN skeleton encoder
  - `Eventbackbone.py`: event encoder (SPS + Spiking Mamba)
  - `Semanticlearning.py`: SSE (Hypergraph + GSA)
  - `Multimodal.py`: builds the full fusion network (SSE → shared MLP → SCM → DIB)
  - `ITHU.py`: DIB (two‑stage Bernoulli KL + cosine; `alpha, lambda1, lambda2` are non‑learnable)
  - `multfft.py`: spectral utilities (if enabled)
  - `trainMM.py`: module‑level trainer (optional)
- `config/`: YAML configs (e.g., `nturgbd-cross-subject/defaultMM.yaml`)
- `trainMM.py`: top‑level training script

## Quickstart
Install
```bash
pip install -r Model/requirements.txt
```

Train
```bash
python Model/trainMM.py \
  --config Model/config/nturgbd-cross-subject/defaultMM.yaml \
  --work-dir runs/ntu_cs \
  --phase train
```

Test
```bash
python Model/trainMM.py \
  --config Model/config/nturgbd-cross-subject/defaultMM.yaml \
  --phase test \
  --weights runs/ntu_cs/latest.pt
```

## Config Highlights (YAML)
- `feeder`: dataset class and paths
- `model_args`:
  - skeleton/event encoder settings (channels, frames `T`, joints `V`)
  - SSE: `k_neighbors`, groups for GSA
  - SCM: hidden dims for Mamba
  - DIB: `alpha`, `lambda1`, `lambda2` (constants; set via model init)
- `optim`: optimizer, lr schedule, epochs

## Notes
- DIB hyperparameters follow paper notation and are fixed (buffers). Override at model init if needed.
- Ensure event inputs’ shape/resolution matches the SPS settings or resize upstream.
- ROI & V2E parameters are set in the data pipeline (see top‑level README), not here.

## Minimal API Reference
- `mmodel.Multimodal.MultiModalFusionNetwork`
  - inputs: `skeleton_input (N,C,T,V,M)`, `event_input (N,T,C,H,W)`
  - returns: logits, `IB_total`

## Troubleshooting
- Shape mismatches: verify `T, V, resolution` across feeders and backbones.
- Missing CUDA ops (Mamba/SpikingJelly): update deps or disable optional paths in code.
- Checkpoints: use `--work-dir` to manage logs and weights.
