Multi-space channel representation learning for mono-to-binaural conversion based audio deepfake detection

Published: 01 Jan 2024, Last Modified: 06 Feb 2025Inf. Fusion 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We pioneer the definition of a feature representation of binaural audio as three spaces.•We present MSCR-ADD, a novel ADD scheme with a novel multi-space channel representation learning strategy.•Experimental results on four datasets verify the power of our method.
Loading