Gouthaman KV

Researcher, Dolby

Joined

January 2020

Names

Gouthaman KV (Preferred)

GOUTHAMAN KV

Emails

****@cse.iitm.ac.in (Confirmed)

****@dolby.com (Confirmed)

****@gmail.com (Confirmed)

Personal Links

Career & Education History

Researcher

Dolby (dolby.com)

2023 – Present

PhD student

Indian Institute of Technology Madras (iitm.ac.in)

2017 – 2022

Advisors, Relations & Conflicts

Coauthor

Gauri Jagatap

2025 – Present

Coauthor

Lie Lu

2023 – Present

Coauthor

Andrea Fanelli

2023 – Present

Coauthor

Shanti Stewart

2024 – 2024

PhD Advisor

Anurag Mittal

2017 – 2022

Coauthor

Athira M Nambiar

2017 – 2022

Expertise

Deep learning

Present

Vision-Languge Problems

Present

Computer Vision

Present

Multimodal deep learning (Video+Audio+Text)

Present

Audio Processing

Present

Music Processing

Present

Multimodal fusion

Present

Publications

Towards Mitigating Hallucinations in Large Vision-Language Models by Refining Textual Embeddings
Aakriti Agrawal, Gouthaman KV, Rohith Aralikatti, Gauri Jagatap, Jiaxin Yuan, Vijay Kamarshi, Andrea Fanelli, Furong Huang
- ICLR 2026 Workshop MM Intelligence Poster
- Readers: Everyone
A robust PPG foundation model using multimodal physiological supervision
Eloy Geenjaar, Vince D. Calhoun, scott daly, Gouthaman KV, Lie Lu, Trisha Mittal, Daniel P. Darcy
- Submitted to ICLR 2026
- Readers: Everyone
Moment Sampling in Video LLMs for Long-Form Video QA
Mustafa Chasmai, Gauri Jagatap, Gouthaman KV, Grant Van Horn, Subhransu Maji, Andrea Fanelli
- CoRR 2025
- Readers: Everyone
Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval
Shanti Stewart, Gouthaman KV, Lie Lu, Andrea Fanelli
- ICASSP 2025
- Readers: Everyone
On the role of question encoder sequence model in robust visual question answering
Gouthaman KV, Anurag Mittal
- Pattern Recognit. 2022
- Readers: Everyone
On the Significance of Question Encoder Sequence Model in the Out-of-Distribution Performance in Visual Question Answering
Gouthaman KV, Anurag Mittal
- CoRR 2021
- Readers: Everyone
Linguistically-aware attention for reducing the semantic gap in vision-language tasks
Gouthaman KV, Athira M. Nambiar, Kancheti Sai Srinivas, Anurag Mittal
- Pattern Recognit. 2021
- Readers: Everyone
Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder
Gouthaman KV, Anurag Mittal
- ECCV (13) 2020
- Readers: Everyone

View all 11 publications

Gouthaman KV

Names

Emails

Personal Links

Career & Education History

Advisors, Relations & Conflicts

Expertise

Publications

Co-Authors