OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Kaisi Guan
MS student, Gaoling School of artificial intelligence, Renmin University of China
Joined
March 2024
Names
Kaisi Guan
(Preferred)
,
Kaisi-Guan
Emails
****@ruc.edu.cn
(Confirmed)
Personal Links
Homepage
Google Scholar
DBLP
Semantic Scholar
Career & Education History
MS student
Gaoling School of artificial intelligence,
Renmin University of China
(ruc.edu.cn)
2024
–
2027
Undergrad student
Gaoling School of artificial intelligence,
Renmin University of China
(ruc.edu.cn)
2020
–
2024
Advisors, Relations & Conflicts
PhD Advisor
Ruihua Song
2024
–
2027
Expertise
Multimodal Learning
2024
–
2027
Audio-Visual Joint Generation & Understanding
2024
–
2027
Publications
ChronusOmni: Improving Time Awareness of Omni Large Language Models
Yijing Chen
,
Yihan Wu
,
Kaisi Guan
,
Yuchen Ren
,
Yuyue Wang
,
Ruihua Song
,
Liyun Ru
CoRR 2025
Readers:
Everyone
Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction
Kaisi Guan
,
Xihua Wang
,
Zhengfeng Lai
,
Xin Cheng
,
Peng Zhang
,
Xiaojiang Liu
,
Ruihua Song
,
Meng Cao
CoRR 2025
Readers:
Everyone
VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning
Xin Cheng
,
Yuyue Wang
,
Xihua Wang
,
Yihan Wu
,
Kaisi Guan
,
Yijing Chen
,
Peng Zhang
,
Xiaojiang Liu
,
Meng Cao
,
Ruihua Song
Submitted to ICLR 2026
Readers:
Everyone
Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction
Kaisi Guan
,
Xihua Wang
,
Zhengfeng Lai
,
Xin Cheng
,
Peng Zhang
,
Xiaojiang Liu
,
Ruihua Song
,
Meng Cao
Submitted to ICLR 2026
Readers:
Everyone
VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning
Xin Cheng
,
Yuyue Wang
,
Xihua Wang
,
Yihan Wu
,
Kaisi Guan
,
Yijing Chen
,
Peng Zhang
,
Xiaojiang Liu
,
Meng Cao
,
Ruihua Song
CoRR 2025
Readers:
Everyone
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering
Kaisi Guan
,
Zhengfeng Lai
,
Yuchong Sun
,
Peng Zhang
,
Wei Liu
,
Kieran Liu
,
Meng Cao
,
Ruihua Song
CoRR 2025
Readers:
Everyone
ETVA: Evaluation of Text-to-Video Alignment via Fine-Grained Question Generation and Answering
Kaisi Guan
,
Zhengfeng Lai
,
Yuchong Sun
,
Peng Zhang
,
Wei Liu
,
Kieran Liu
,
Meng Cao
,
Ruihua Song
ICCV 2025
Readers:
Everyone
BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain
Kaisi Guan
,
Qian Cao
,
Yuchong Sun
,
Xiting Wang
,
Ruihua Song
EMNLP (Findings) 2024
Readers:
Everyone
Co-Authors
Kieran Liu
Liyun Ru
Meng Cao
Peng Zhang
Qian Cao
Ruihua Song
Wei Liu
Xiaojiang Liu
Xihua Wang
Xin Cheng
Xiting Wang
Yihan Wu
Yijing Chen
Yuchen Ren
Yuchong Sun
Yuyue Wang
Zhengfeng Lai