# Mixture of Experts for Audio-Visual Learning

This is the PyTorch implementation of our paper.


Due to time constraints, this repository has not yet been thoroughly cleaned up. We plan to release a more streamlined version of the code, along with model checkpoints, in the future. Please stay tuned for further updates.

## 👍Acknowledgments

Our code is based on [DG-SCT](https://github.com/haoyi-duan/DG-SCT), [CMBS](https://github.com/marmot-xy/CMBS), [AVSBench](https://github.com/OpenNLPLab/AVSBench), [MGN](https://github.com/stoneMo/MGN), [MUSIC-AVQA](https://github.com/GeWu-Lab/MUSIC-AVQA), and [LAVisH](https://github.com/GenjiB/LAVISH).

------

### 📝Requirements and Installation

- ###### Getting Started

```bash
cd AVMOE
pip install -r requirements.txt
```


## AVE
## AVQA
## AVVP

## AVS
