ACAVCaps: Enabling large-scale training for fine-grained and diverse audio understanding

Yadong Niu, Tianzi Wang, Heinrich Dinkel, Xingwei Sun, Jiahao Zhou, Gang Li, Jizhong Liu, Junbo Zhang, Jian Luan

Published: 2026, Last Modified: 06 May 2026, CoRR 2026, CC BY-SA 4.0