Future Internet | |
TwinNet: A Double Sub-Network Framework for Detecting Universal Adversarial Perturbations | |
Yibin Ruan1  Jiazhu Dai1  | |
[1] School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China; | |
关键词: deep neural network; universal adversarial perturbation; double sub-network; detecting; PCA; ImageNet; | |
DOI : 10.3390/fi10030026 | |
来源: DOAJ |
【 摘 要 】
Deep neural network has achieved great progress on tasks involving complex abstract concepts. However, there exist adversarial perturbations, which are imperceptible to humans, which can tremendously undermine the performance of deep neural network classifiers. Moreover, universal adversarial perturbations can even fool classifiers on almost all examples with just a single perturbation vector. In this paper, we propose TwinNet, a framework for neural network classifiers to detect such adversarial perturbations. TwinNet makes no modification of the protected classifier. It detects adversarially perturbated examples by enhancing different types of features in dedicated networks and fusing the output of the networks later. The paper empirically shows that our framework can identify adversarial perturbations effectively with a slight loss in accuracy when predicting normal examples, which outperforms state-of-the-art works.
【 授权许可】
Unknown