OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Gouthaman KV
Researcher, Dolby
Joined
January 2020
Names
Gouthaman KV
(Preferred)
,
GOUTHAMAN KV
Emails
****@cse.iitm.ac.in
(Confirmed)
,
****@dolby.com
(Confirmed)
,
****@gmail.com
(Confirmed)
Personal Links
Homepage
Google Scholar
DBLP
Semantic Scholar
Career & Education History
Researcher
Dolby
(dolby.com)
2023
–
Present
PhD student
Indian Institute of Technology Madras
(iitm.ac.in)
2017
–
2022
Advisors, Relations & Conflicts
Coauthor
Gauri Jagatap
2025
–
Present
Coauthor
Lie Lu
2023
–
Present
Coauthor
Andrea Fanelli
2023
–
Present
Coauthor
Shanti Stewart
2024
–
2024
PhD Advisor
Anurag Mittal
2017
–
2022
Coauthor
Athira M Nambiar
2017
–
2022
Expertise
Deep learning
Present
Vision-Languge Problems
Present
Computer Vision
Present
Multimodal deep learning (Video+Audio+Text)
Present
Audio Processing
Present
Music Processing
Present
Multimodal fusion
Present
Publications
Towards Mitigating Hallucinations in Large Vision-Language Models by Refining Textual Embeddings
Aakriti Agrawal
,
Gouthaman KV
,
Rohith Aralikatti
,
Gauri Jagatap
,
Jiaxin Yuan
,
Vijay Kamarshi
,
Andrea Fanelli
,
Furong Huang
ICLR 2026 Workshop MM Intelligence Poster
Readers:
Everyone
A robust PPG foundation model using multimodal physiological supervision
Eloy Geenjaar
,
Vince D. Calhoun
,
scott daly
,
Gouthaman KV
,
Lie Lu
,
Trisha Mittal
,
Daniel P. Darcy
Submitted to ICLR 2026
Readers:
Everyone
Moment Sampling in Video LLMs for Long-Form Video QA
Mustafa Chasmai
,
Gauri Jagatap
,
Gouthaman KV
,
Grant Van Horn
,
Subhransu Maji
,
Andrea Fanelli
CoRR 2025
Readers:
Everyone
Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval
Shanti Stewart
,
Gouthaman KV
,
Lie Lu
,
Andrea Fanelli
ICASSP 2025
Readers:
Everyone
On the role of question encoder sequence model in robust visual question answering
Gouthaman KV
,
Anurag Mittal
Pattern Recognit. 2022
Readers:
Everyone
On the Significance of Question Encoder Sequence Model in the Out-of-Distribution Performance in Visual Question Answering
Gouthaman KV
,
Anurag Mittal
CoRR 2021
Readers:
Everyone
Linguistically-aware attention for reducing the semantic gap in vision-language tasks
Gouthaman KV
,
Athira M. Nambiar
,
Kancheti Sai Srinivas
,
Anurag Mittal
Pattern Recognit. 2021
Readers:
Everyone
Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder
Gouthaman KV
,
Anurag Mittal
ECCV (13) 2020
Readers:
Everyone
View all 11 publications
Co-Authors
Aakriti Agrawal
Andrea Fanelli
Anurag Mittal
Athira M. Nambiar
Daniel P. Darcy
Eloy Geenjaar
Furong Huang
Gauri Jagatap
Grant Van Horn
Jiaxin Yuan
Kancheti Sai Srinivas
Lie Lu
Mustafa Chasmai
Rohith Aralikatti
Shanti Stewart
Subhransu Maji
Trisha Mittal
Vijay Kamarshi
Vince D. Calhoun
scott daly