Generating and Adapting Audio Description with Vision–Language Models for Blind and Low-Vision Users

Published: 22 Sept 2025, Last Modified: 03 Jan 2026WiML @ NeurIPS 2025EveryoneRevisionsBibTeXCC BY 4.0
Keywords: vision language models, audio description, blind and low-vision users, prompt engineering, accessibility
Submission Number: 360
Loading