A Multi-Modal Fusion Approach for Audio-Visual Scene Classification Enhanced by CLIP Variants

Published: 2021, Last Modified: 05 Nov 2023, DCASE 2021