# The Modality Focusing Hypothesis

### Synthetic Gaussian
(1) Generate multimodal data and apply cross-modal knowledge distillation (KD):
```shell
python gauss/main.py
```
Experiment 1: vary γ
![image](gauss/figs/exp1.png)

Experiment 2: vary α
![image](gauss/figs/exp2.png)
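
For readers unfamiliar with the objective, the following is a minimal sketch of a standard soft-label KD loss applied across modalities: the teacher logits come from one modality and the student logits from the paired samples of the other. It is not taken from `gauss/main.py`; the function names, the temperature, and the loss weight are illustrative assumptions, and the script may use a different formulation.

```python
import numpy as np


def softmax(logits, temperature=1.0):
    """Row-wise softmax with temperature scaling."""
    z = logits / temperature
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def kd_loss(student_logits, teacher_logits, labels, temperature=2.0, weight=0.5):
    """Hypothetical cross-modal KD loss: hard-label CE plus soft-label KL.

    student_logits: (n, c) outputs of the student on modality B
    teacher_logits: (n, c) outputs of the teacher on the paired modality-A samples
    labels:         (n,) integer class labels
    temperature, weight: assumed hyperparameters, not values from the repo
    """
    eps = 1e-12
    n = labels.shape[0]

    # Cross-entropy between the student predictions and the ground-truth labels.
    p_student = softmax(student_logits)
    ce = -np.log(p_student[np.arange(n), labels] + eps).mean()

    # KL(teacher || student) on temperature-softened distributions.
    q_teacher = softmax(teacher_logits, temperature)
    q_student = softmax(student_logits, temperature)
    kl = (q_teacher * (np.log(q_teacher + eps) - np.log(q_student + eps))).sum(axis=1).mean()

    # The T^2 factor keeps the soft-label gradient scale comparable across temperatures.
    return (1.0 - weight) * ce + weight * temperature ** 2 * kl


# Toy usage with random logits standing in for real model outputs.
rng = np.random.default_rng(0)
teacher_out = rng.normal(size=(8, 2))
student_out = rng.normal(size=(8, 2))
y = rng.integers(0, 2, size=8)
loss = kd_loss(student_out, teacher_out, y)
```

When the student logits match the teacher logits exactly, the KL term vanishes and only the weighted cross-entropy term remains.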

(2) Use Algorithm 1 to rank features by their modality-general decisive information, then re-apply cross-modal KD:
```shell
python gauss/main_remove_gd.py
```
Experiment 1: vary modality-general feature dimensions
![image](gauss/figs/exp_remove_gd1.png)

Experiment 2: vary modality-specific feature dimensions
![image](gauss/figs/exp_remove_gd2.png)

Modified T denotes the modality-general teacher, and modified S denotes the student distilled from it.
As in Table 2 of the main paper, the modality-general teacher performs worse itself but gives better cross-modal KD performance.


### Other datasets

Coming soon.

