Augmentation Backdoors

Joseph Rance; Yiren Zhao; Ilia Shumailov; Robert D. Mullins

Augmentation Backdoors

Joseph Rance, Yiren Zhao, Ilia Shumailov, Robert D. Mullins

Published: 04 Mar 2023, Last Modified: 04 Aug 2025ICLR 2023 BANDS SpotlightReaders: Everyone

Keywords: training time attacks, backdoors, augmentation

TL;DR: We present three backdoor attacks that can be covertly inserted into data augmentation functions.

Abstract: Data augmentation is used extensively to improve model generalisation. However, reliance on external libraries to implement augmentation methods introduces a vulnerability into the machine learning pipeline. It is well known that backdoors can be inserted into machine learning models through serving a modified dataset to train on. Augmentation therefore presents a perfect opportunity to perform this modification without requiring an initially backdoored dataset. In this paper we present three backdoor attacks that can be covertly inserted into data augmentation. Our attacks each insert a backdoor using a different type of computer vision augmentation transform, covering simple image transforms, GAN-based augmentation, and composition-based augmentation. By inserting the backdoor using these augmentation transforms, we make our backdoors difficult to detect, while still supporting arbitrary backdoor functionality. We evaluate our attacks on a range of computer vision benchmarks and demonstrate that an attacker is able to introduce backdoors through just a malicious augmentation routine.

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 3 code implementations](https://www.catalyzex.com/paper/augmentation-backdoors/code)

0 Replies

Loading