# AXIS: Efficient Multi-Source Knowledge Transfer by Model Merging - Code

This repository contains the official implementation for the paper "Efficient Multi-Source Knowledge Transfer by Model Merging".

## Experiment-to-File Mapping

This section maps the experiments described in the paper to the corresponding script files in the `src/` directory.

### Note on Hyperparameters

-   **Trainable Parameters (N) for AXIS:** The percentage of trainable singular values (e.g., N=10%, N=20%) is controlled by the `--svd_threshold` argument. For example, `N=10%` corresponds to `--svd_threshold 0.1`.
-   **Trainable Parameters for aTLAS:** The total number of learnable parameters for the aTLAS baseline is set by `--total-trainable-params` to match the budget of AXIS.
-   **Number of Aggregated Components (K):** The number of SVD components to aggregate is set by the `--svd_k` argument.
-   **Number of Source Task Vectors:** The number of source models used for merging is controlled by the `--end-index` argument. By default, this will merge models from the 1st up to the specified index. To control the exact number of vectors, you can use `--resume-from-idx` to specify the starting index.
-   **Target Dataset:** The target dataset for fine-tuning is specified using the `--target-dataset-name` argument.
-   **Model Architecture:** The model can be specified using the `--model` argument. The available options are `ViT-B-32` and `ViT-L-14`.
-   **Script-Specific Configurations:** Please note that for some experiment scripts use hardcoded hyperparameters instead of command-line arguments. Please check the corresponding script for the exact configuration.

---

### 1. Main Performance Comparison (Figs. 5, 6, 7, 15)

Compares AXIS and aTLAS on target datasets with varying numbers of source models and parameter budgets.

-   **AXIS:** `src/axis_experiments/axis_top_global_main.py`
-   **aTLAS Baseline:** `src/atlas_experiments/atlas_main.py`

---

### 2. Robustness to Parameter Perturbations (Fig. 9)

Evaluates model resilience to noisy or pruned source task vectors.

-   **Corrupted (Noisy) Parameters:**
    -   **AXIS:** `src/axis_experiments/axis_top_global_corrupted_parameters.py`
    -   **aTLAS:** `src/atlas_experiments/atlas_corrupted.py`
-   **Pruned Parameters:**
    -   **AXIS:** `src/axis_experiments/axis_top_global_prunned.py`
    -   **aTLAS:** `src/atlas_experiments/atlas_prunned.py`

---

### 3. Robustness to Input Degradation (Fig. 11, Table 2)

Tests performance on images with dropped patches or common corruptions.

-   **Patch Dropout:**
    -   **AXIS:** `src/axis_experiments/axis_top_global_droppatches.py`
    -   **aTLAS:** `src/atlas_experiments/atlas_droppatches.py`
-   **Common Image Corruptions:**
    -   **AXIS:** `src/axis_experiments/axis_top_global_corruption_input.py`
    -   **aTLAS:** `src/atlas_experiments/atlas_corruption.py`

---

### 4. Ablation Studies & Analysis (Sec. 5)

-   **Zero-Shot Evaluation (Fig. 10):**
    -   **AXIS (SVD-based):** `src/axis_experiments/axis_top_global_zeroshot.py`
    -   **Baseline (Averaging):** `src/axis_experiments/axis_zeroshot_average.py`

-   **Varying Number of K Components (Fig. 12):**
    -   **Script:** `src/axis_experiments/axis_top_global_topK.py`

-   **Component Selection Strategy (Fig. 13):**
    -   **Top Components:** `src/axis_experiments/axis_top_global.py`
    -   **Bottom Components:** `src/axis_experiments/axis_down_global.py`
    -   **Arbitrary Components:** `src/axis_experiments/axis_arbitrary_global.py`
