# PID-Guided Partial Alignment for Multimodal Decentralized Federated Learning


## 🔧 Project Structure

- `main.py`: Entry point for parsing arguments, setting up the dataset, clients, topology, and training loop.
- `Clients/`: Contains implementations of various client training schemes and models (e.g., `ParseDL`, `TaskDL`).
- `datasets/`: Contains dataloading logic (e.g., `get_loaders`).
- `communications.py`: Contains mixing logic such as `mix_partial_parameters()` for decentralized parameter aggregation.

## ⚙️ Features

- **Flexible Client Initialization**: Supports heterogeneous clients with different modality configurations via `modality_type` bitmasks.
- **Modular Communication Topologies**:
  - Ring
  - Random Gossip
  - Chordal Ring (1-D Torus)
- **Support for Multiple Datasets**: Including `cap`, `har`, `ave`, `modelnet`, with dataset-specific configurations.

## 🚀 Usage

### 1. Requirements

The project requires the following packages:

```bash
networkx==3.2.1
numpy==2.2.6
torch==2.4.0
torchaudio==2.4.0
torchvision==0.19.0
```

To install all dependencies, run:

```bash
pip install -r requirements.txt
```

### 2. Run Training

```bash
# Run with the IEMOCAP dataset (3 modalities: Audio/Video/Text)
python main.py --dataset cap --mode parse --topology ring --n_clients 40 --group_size 10 --non_iid_alpha 0.5 --learning_rate 5e-5 --batch_size 64

# Run with the KU-HAR dataset (2 modalities: Acc/Gyro)
python main.py --dataset har --mode parse --topology ring --n_clients 30 --group_size 10 --non_iid_alpha 0.5 --learning_rate 0.1 --batch_size 64

# Run with the AVE dataset (2 modalities: Audio/Visual)
python main.py --dataset ave --mode parse --topology ring --n_clients 30 --group_size 10 --non_iid_alpha 0.5 --learning_rate 0.005 --batch_size 32

# Run with the ModelNet-40 dataset
python main.py --dataset modelnet --mode parse --topology ring --n_clients 30 --group_size 10 --non_iid_alpha 0.5 --learning_rate 0.01 --batch_size 64
```
