TrojanNet: Exposing the Danger of Trojan Horse Attack on Neural Networks

Chuan Guo; Ruihan Wu; Kilian Q. Weinberger

TrojanNet: Exposing the Danger of Trojan Horse Attack on Neural Networks

Chuan Guo, Ruihan Wu, Kilian Q. Weinberger

25 Sept 2019 (modified: 05 May 2023)ICLR 2020 Conference Blind SubmissionReaders: Everyone

Keywords: machine learning security

TL;DR: Parameters of a trained neural network can be permuted to produce a completely separate model for a different task, enabling the embedding of Trojan horse networks inside another network.

Abstract: The complexity of large-scale neural networks can lead to poor understanding of their internal details. We show that this opaqueness provides an opportunity for adversaries to embed unintended functionalities into the network in the form of Trojan horse attacks. Our novel framework hides the existence of a malicious network within a benign transport network. Our attack is flexible, easy to execute, and difficult to detect. We prove theoretically that the malicious network's detection is computationally infeasible and demonstrate empirically that the transport network does not compromise its disguise. Our attack exposes an important, previously unknown loophole that unveils a new direction in machine learning security.

Data: [CIFAR-10](https://paperswithcode.com/dataset/cifar-10), [CIFAR-100](https://paperswithcode.com/dataset/cifar-100), [LFW](https://paperswithcode.com/dataset/lfw), [SVHN](https://paperswithcode.com/dataset/svhn)

Original Pdf: pdf

5 Replies

Loading