# A Theory of Unimodal Bias in Multimodal Learning: Supplementary Material

This folder contains videos of feature evolution corresponding to the training-trajectory figures in the paper. Each video is listed below with the figure it corresponds to.

Fig. 2b: `L2_early_feat.mp4`

Fig. 2d: `L2_late_feat.mp4`

Fig. 3a: `L2_late_rho0.5_feat.mp4`

Fig. 3d: `L2_late_rho-0.5_feat.mp4`

Fig. 5a: `L4_Lf1_feat.mp4`, `L4_Lf2_feat.mp4`, `L4_Lf3_feat.mp4`, and `L4_Lf4_feat.mp4`

Fig. 11a: `ReLU_L2_early_xor_var1_feat.mp4`

Fig. 11b: `ReLU_L2_late_xor_var1.mp4`

Fig. 11c: `ReLU_L2_early_xor_var2_feat.mp4`

Fig. 11d: `ReLU_L2_late_xor_var2_feat.mp4`

Fig. 11e: `ReLU_L2_early_xor_var3_feat.mp4`

Fig. 11f: `ReLU_L2_late_xor_var3_feat.mp4`

Videos `L2_early_rho0.5_feat.mp4` and `L2_early_rho0.5_feat.mp4` do not correspond to any figure in the paper. We include them as a supplementary experiment for comparing two-layer early fusion linear networks with two-layer late fusion linear networks.
