ICAGC 2024: Inspirational and Convincing Audio Generation Challenge 2024

Published: 01 Jan 2024, Last Modified: 22 Jul 2025, Venue: ISCSLP 2024, License: CC BY-SA 4.0
Abstract: The Inspirational and Convincing Audio Generation Challenge 2024 (ICAGC 2024) is part of the ISCSLP 2024 Competitions and Challenges track. While current text-to-speech (TTS) technology can generate high-quality audio, its ability to convey complex emotions and to control fine-grained content remains limited. This limitation leads to a discrepancy between the generated audio and human subjective perception in practical applications such as companion robots for children and marketing bots. The core issue is the mismatch between high-quality audio generation and the ultimate human subjective experience. This challenge therefore aims to enhance the persuasiveness and acceptability of synthesized audio, focusing on human-aligned, convincing, and inspirational audio generation. A total of 19 teams registered for the challenge; this paper describes the competition and its results.