# NeuMoSync 🧠

Official implementation of **NeuMoSync** 
<!-- 
## 📑 Table of Contents

- [Installation](#installation)
- [Datasets](#datasets)
- [Usage](#usage)
- [Configuration](#configuration)
- [Examples](#examples)
- [Citation](#citation)
- [License](#license)
- [Acknowledgments](#acknowledgments) -->

## 🚀 Installation

Set up the environment using the provided Conda configuration:

```bash
# Create environment from the repository root
conda env create -f environment.yml

# Activate the environment
conda activate first
```

> **📌 Note:** If Conda reports an error about the `prefix` field, remove or edit the `prefix:` line at the bottom of `environment.yml`.

## 📊 Datasets

Most benchmarks use standard datasets that download automatically via TorchVision when not present locally.

### CIFAR-10 / CIFAR-100 / MNIST

- **Auto-download:** Yes
- **Default location:** `./data`
- **Task implementations:** `Random-label MNIST`, `Permuted MNIST`, `Random-label CIFAR`, `Permuted-label CIFAR10`, `Continual CIFAR100`,
- **Configuration files:** `configs/sl/task/*.yaml`

### Tiny-ImageNet

- **Implementation:** `continual_tiny_imagenet.py`
- **Auto-download:** Yes (from Stanford's official URL)

### Mini-ImageNet

- **Implementation:** `Permuted-label Mini-ImageNet`
- **Auto-download:** No (requires preprocessed cache files)

**Expected directory structure:**

```
<root>/mini_imagenet/1/
├── mini-imagenet-cache-train.pkl
├── mini-imagenet-cache-val.pkl
└── mini-imagenet-cache-test.pkl
```

**Default root path:** `./tasks/data`

To use a different location, modify the `root` parameter in `tasks/task_generator.py` or pass it when constructing the `MiniImageNet` class.

## 💻 Usage

### Supervised Continual Learning

The main training script is `main_sl.py`, configured using Hydra.

**Basic command:**

```bash
python main_sl.py [overrides...]
```

**Configuration structure:**
- **Base path:** `configs/sl/`
- **Main config:** `configs/sl/config.yaml`

**Default settings:**

```yaml
defaults:
  - arch: mlp
  - agent: neuro_sync
  - task: random_label_cifar10

seed: 42
wandb: true
proj_name: test
```

### 🏗️ Available Architectures

- Simple MLP
- Simple CNN
- Larger CNN with MLP layers
- ResNet-18

### 🤖 Available Methods (Agents)

| Method | Reference |
|--------|-----------|
| **baseline** | Standard baseline |
| **continual_backprop** | [Continual Backpropagation](https://arxiv.org/abs/2108.06325) |
| **crelu** | [CReLU](https://proceedings.mlr.press/v232/abbas23a/abbas23a.pdf) |
| **deep_fourier** | [Deep Fourier](https://openreview.net/forum?id=NIkfix2eDQ) |
| **ewc** | [EWC](https://arxiv.org/pdf/1612.00796) |
| **l2_init** | [L2 Init](https://arxiv.org/abs/2308.11958) |
| **ewc_l2_init** | EWC + L2 Init |
| **hat** | [HAT](https://arxiv.org/pdf/1801.01423) |
| **prelu** | [PReLU](https://arxiv.org/abs/2505.09486) |
| **shrink_perturb** | [Shrink and Perturb](https://arxiv.org/abs/1910.08475) |
| **vit** | [Vision Transformer](https://arxiv.org/abs/2010.11929) |
| **neuro_sync** | NeuMoSync (this work) ⭐ |

> **🔧 NeuMoSync Configuration Options:**
> 
> NeuMoSync offers different controller versions and neuron feature fusion strategies:
> 
> - **`attention_neuro_sync=self`** (Encoder-only) - Used for all experiments except ImageNet
> - **`attention_neuro_sync=mae`** (Encoder-decoder with cross-attention) - Used for ResNet experiments
> 
> Choose the appropriate configuration based on your architecture and dataset requirements.

### 🎯 MAML-based Experiments

For MAML-style meta-learning experiments, use:

```bash
python maml_main_sl.py \
  arch=cnn \
  agent=baseline \
  task=continual_cifar10 \
  seed=0 \
  wandb=false
```

**Implementation files:**
- `maml.py`
- `maml_main_sl.py`

## ⚙️ Configuration

Hydra creates an output directory under `./outputs/` for each run, capturing the full configuration and logs.

**Configuration hierarchy:**
- **Architecture:** `configs/sl/arch/*.yaml`
- **Agent:** `configs/sl/agent/*.yaml`
- **Task:** `configs/sl/task/*.yaml`

Override any parameter using command-line arguments:

```bash
python main_sl.py arch=cnn seed=42 wandb=true
```

## 📝 Examples

### NeuMoSync on Random-Label CIFAR-10

```bash
python main_sl.py \
  arch=cnn \
  agent=neuro_sync \
  task=random_label_cifar10 \
  seed=0 \
  wandb=false \
  proj_name=local_debug
```

### Baseline on Continual CIFAR-100

```bash
python main_sl.py \
  arch=cnn \
  agent=baseline \
  task=continual_cifar100 \
  seed=0 \
  wandb=true \
  proj_name=continual_cifar100_baseline
```

### EWC on Permuted-MNIST

```bash
python main_sl.py \
  arch=mlp \
  agent=ewc \
  task=permuted_MNIST \
  seed=1 \
  wandb=false
```

### Vision Transformer on Shuffled CIFAR-10

```bash
python main_sl.py \
  agent=vit \
  task=shuffle_cifar10 \
  seed=0 \
  wandb=true \
  proj_name=vit_shuffle_cifar10
```

## 📚 Citation

If you use this code in your research, please cite our paper:

```bibtex
@article{neumosync2024,
}
```

## 📄 License



## 🙏 Acknowledgments

Some baseline implementations are adapted from [L2_Init](https://github.com/skumar9876/L2_Init). We thank the authors for making their code publicly available.