Keywords: Malware Detection, Concept Drift, Test-Time Adaptation, Self-Supervised Learning, Masked Autoencoder, Pseudo-Labeling, Binary Classification
TL;DR: We propose MADCAT, a test-time adaptation framework for malware detection using self-supervised masked autoencoders, which improves robustness to concept drift without requiring labels.
Abstract: We present MADCAT (MAlware Detection under Concept Drift through Adaptation during Test-time), a self-supervised approach designed to address the concept drift problem in malware detection. MADCAT employs an encoder-decoder architecture and works by test-time training of the encoder on a small, balanced subset of the test-time data using a self-supervised objective. During test-time training, the model learns features that are useful for detecting both previously seen (old) data and newly arriving samples. We demonstrate the effectiveness of MADCAT in continuous Android malware detection settings. MADCAT consistently outperforms baseline methods in detection performance at test-time. We also show the synergy between MADCAT and prior approaches in addressing concept drift in malware detection.
Submission Number: 14
Loading