OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Yuhang Jia
Joined
May 2025
Names
Yuhang Jia
(Preferred)
,
YuhangJia
Emails
****@mail.nankai.edu.cn
(Confirmed)
Personal Links
Homepage
DBLP
Career & Education History
MS student
School of Computer science,
Nankai University
(nankai.edu.cn)
2024
–
2025
Undergrad student
School of Computer science,
Nankai University
(nankai.edu.cn)
2020
–
2024
Advisors, Relations & Conflicts
Coauthor
Yong Qin
2022
–
2025
Expertise
Speech
,
Audio and language processing
2023
–
2025
Publications
AudioEval: Automatic Dual-Perspective and Multi-Dimensional Evaluation of Text-to-Audio-Generation
Hui Wang
,
Jinghua Zhao
,
Cheng Liu
,
Yuhang Jia
,
Haoqin Sun
,
Jiaming Zhou
,
Yong Qin
CoRR 2025
Readers:
Everyone
RealTalk-CN: A Realistic Chinese Speech Task-Oriented Dialogue Benchmark with Cross-Modal Analysis
Enzhi Wang
,
Jiaming Zhou
,
Yuhang Jia
,
Aobo Kong
,
Qicheng Li
,
Yong Qin
ICLR 2026 Conference Desk Rejected Submission
Readers:
Everyone
Interpretable Audio Editing Evaluation via Chain-of-Thought Difference-Commonality Reasoning with Multimodal LLMs
Yuhang Jia
,
Xu Zhang
,
Yang Chen
,
Hui Wang
,
Enzhi Wang
,
Yong Qin
CoRR 2025
Readers:
Everyone
GLAD: Global-Local Aware Dynamic Mixture-of-Experts for Multi-Talker ASR
Yujie Guo
,
Jiaming Zhou
,
Yuhang Jia
,
Shiwan Zhao
,
Yong Qin
CoRR 2025
Readers:
Everyone
TTA-Bench: A Comprehensive Benchmark for Evaluating Text-to-Audio Models
Hui Wang
,
Cheng Liu
,
Junyang Chen
,
Haoze Liu
,
Yuhang Jia
,
Shiwan Zhao
,
Jiaming Zhou
,
Haoqin Sun
,
Hui Bu
,
Yong Qin
CoRR 2025
Readers:
Everyone
Cross-Modal Knowledge Distillation for Speech Large Language Models
Enzhi Wang
,
Qicheng Li
,
Zhiyuan Tang
,
Yuhang Jia
CoRR 2025
Readers:
Everyone
Towards Automatic Evaluation and High-Quality Pseudo-Parallel Dataset Construction for Audio Editing: A Human-in-the-Loop Method
Yuhang Jia
,
Hui Wang
,
Xin Nie
,
Yujie Guo
,
Lianru Gao
,
Yong Qin
CoRR 2025
Readers:
Everyone
From Contrast to Commonality: Audio Commonality Captioning for Enhanced Audio-Text Cross-modal Understanding in Multimodal LLMs
Yuhang Jia
,
Xu Zhang
,
Yong Qin
CoRR 2025
Readers:
Everyone
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides
Jinghua Zhao
,
Yuhang Jia
,
Shiyao Wang
,
Jiaming Zhou
,
Hui Wang
,
Yong Qin
CoRR 2025
Readers:
Everyone
Chinese-LiPS: A Chinese Audio-Visual Speech Recognition Dataset with Lip-Reading and Presentation Slides
Jinghua Zhao
,
Yuhang Jia
,
Shiyao Wang
,
Jiaming Zhou
,
Hui Wang
,
Yong Qin
ICME 2025
Readers:
Everyone
View all 13 publications
Co-Authors
Aobo Kong
Cheng Liu
Enzhi Wang
Haoqin Sun
Haoran Li
Haoze Liu
Hui Bu
Hui Wang
Jiaming Zhou
Jiarong Kang
Jinghua Zhao
Junyang Chen
Lianru Gao
Qicheng Li
Shiwan Zhao
Shiyao Wang
Wenjia Zeng
Xin Nie
Xu Zhang
Yang Chen
Yong Chen
Yong Qin
Yujie Guo
Zhiyuan Tang
Ziyue Jiang