TTIC 2025 Workshop SpeechAI Submissions

ConvM2D2: Improving Generative Music Evaluation using Self-Supervised Alternative to CLAP
Kehinde Abdulsalam Elelu, Joshua E Siegel, Saffary Ali, Luong Duc Hung, Babatunde Simeon, Ebuka Okpala
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
Explainable and Automatic Interruption Strategies for Full-Duplex Conversational AI
Takyoung Kim, Dilek Hakkani-Tür
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
A Multimodal, Multi-Turn Large Speech-Language Model for Real-Time Emotion Tracking and Empathetic Responding
Zijia Liu, Xiaocheng Yang, Dilek Hakkani-Tür
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing
Zhisheng Zheng, Puyuan Peng, Anuj Diwan, Cong Phuoc Huynh, Xiaohang Sun, Zhu Liu, Vimal Bhat, David Harwath
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition
Martijn Bartelds, Ananjan Nandi, Moussa Koulako Bala Doumbouya, Dan Jurafsky, Tatsunori Hashimoto, Karen Livescu
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
Self-Supervised Speech Models For Word-Level Stuttered Speech Detection
Yi-Jen Shih, David Harwath, Alex Dimakis, Zoi Gkalitsiou
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
On-device Streaming Discrete Speech Units
Kwanghee Choi, Masao Someki, Emma Strubell, Shinji Watanabe
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
Scaling Rich Style-Prompted Text-to-Speech Datasets
Anuj Diwan, Zhisheng Zheng, David Harwath, Eunsol Choi
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
MLA-Conformer: A Latent Attention-Enhanced Conformer for Efficient Speech Recognition
Prasanth
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
Collaborative Spoken and Written Models for Conversational Language Modeling
Chung-Ming Chien, Karen Livescu
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster
Flow-SLM: Joint Learning of Linguistic and Acoustic Information for Spoken Language Modeling
Ju-Chieh Chou, Jiawei Zhou, Karen Livescu
- Published: 01 Aug 2025, Last Modified: 26 Aug 2025
- SpeechAI TTIC 2025 OralorPoster