ViXNet: Vision Transformer with Xception Network for deepfakes based video and image forgery detection

Published: 01 Jan 2022, Last Modified: 13 Nov 2024
Venue: Expert Syst. Appl. 2022
License: CC BY-SA 4.0
Abstract: Highlights
• Proposed a deep learning based model for deepfake image/video detection.
• It has a patch-wise self-attention module which learns local image artifacts.
• It consists of a vision transformer which learns correlation among masked patches.
• Xception based global image features are stacked with patch based local features.
• The model achieves good results on some standard video forgery detection datasets.