Acknowledgment: The source code is based on FairSeq and github repository of AV-HuBERT.