Toggle navigation
OpenReview
.net
Login
×
Back to
TTIC
TTIC 2025 Workshop SpeechAI Submissions
ConvM2D2: Improving Generative Music Evaluation using Self-Supervised Alternative to CLAP
Kehinde Abdulsalam Elelu
,
Joshua E Siegel
,
Saffary Ali
,
Luong, Duc Hung
,
Babatunde Simeon
,
Ebuka Okpala
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
Explainable and Automatic Interruption Strategies for Full-Duplex Conversational AI
Takyoung Kim
,
Dilek Hakkani-Tür
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
A Multimodal, Multi-Turn Large Speech-Language Model for Real-Time Emotion Tracking and Empathetic Responding
Zijia Liu
,
Xiaocheng Yang
,
Dilek Hakkani-Tür
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing
Zhisheng Zheng
,
Puyuan Peng
,
Anuj Diwan
,
Cong Phuoc Huynh
,
Xiaohang Sun
,
Zhu Liu
,
Vimal Bhat
,
David Harwath
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition
Martijn Bartelds
,
Ananjan Nandi
,
Moussa Koulako Bala Doumbouya
,
Dan Jurafsky
,
Tatsunori Hashimoto
,
Karen Livescu
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
Self-Supervised Speech Models For Word-Level Stuttered Speech Detection
Yi-Jen Shih
,
David Harwath
,
Alex Dimakis
,
Zoi Gkalitsiou
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
On-device Streaming Discrete Speech Units
Kwanghee Choi
,
Masao Someki
,
Emma Strubell
,
Shinji Watanabe
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
Scaling Rich Style-Prompted Text-to-Speech Datasets
Anuj Diwan
,
Zhisheng Zheng
,
David Harwath
,
Eunsol Choi
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
MLA-Conformer: A Latent Attention-Enhanced Conformer for Efficient Speech Recognition
Prasanth
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
Collaborative Spoken and Written Models for Conversational Language Modeling
Chung-Ming Chien
,
Karen Livescu
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
Flow-SLM: Joint Learning of Linguistic and Acoustic Information for Spoken Language Modeling
Ju-Chieh Chou
,
Jiawei Zhou
,
Karen Livescu
Published: 01 Aug 2025, Last Modified: 26 Aug 2025
SpeechAI TTIC 2025 OralorPoster
Readers:
Everyone
«
‹
1
2
›
»