DARE: Deceiving Audio-Visual speech Recognition model

Published: 01 Jan 2021, Last Modified: 13 Nov 2024, Knowl. Based Syst. 2021, CC BY-SA 4.0
Abstract: Highlights
• We initiate a targeted attack on an AVSR model and a detection network simultaneously.
• Our attack shows promising results on the publicly available, well-known LRW dataset.
• We successfully circumvent popular defences while maintaining imperceptibility.