DARE: Deceiving Audio-Visual speech Recognition model

Published: 01 Jan 2021, Last Modified: 13 Nov 2024Knowl. Based Syst. 2021EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We initiate a targeted attack on AVSR model and detection network simultaneously.•Our attack shows promising results on the publicly available well known LRW dataset.•We successfully circumvent popular defences while maintaining imperceptibility.
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview