Detecting Localized Adversarial Examples: A Generic Approach using Critical Region Analysis

Deep neural networks (DNNs) have been applied in a wide range of applications, e.g., face recognition and image classification; however, they are vulnerable to adversarial examples. By adding a small amount of imperceptible perturbations, an attacker can easily manipulate the outputs of a DNN. Parti...

Full description

Saved in:

Bibliographic Details
Published in	IEEE INFOCOM 2021 - IEEE Conference on Computer Communications pp. 1 - 10
Main Authors	Li, Fengting, Liu, Xuankai, Zhang, Xiaoli, Li, Qi, Sun, Kun, Li, Kang
Format	Conference Proceeding
Language	English
Published	IEEE 10.05.2021
Subjects	Conferences Deep learning Face recognition Image classification Perturbation methods Robustness Training
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Deep neural networks (DNNs) have been applied in a wide range of applications, e.g., face recognition and image classification; however, they are vulnerable to adversarial examples. By adding a small amount of imperceptible perturbations, an attacker can easily manipulate the outputs of a DNN. Particularly, the localized adversarial examples only perturb a small and contiguous region of the target object, so that they are robust and effective in both digital and physical worlds. Although the localized adversarial examples have more severe real-world impacts than traditional pixel attacks, they have not been well addressed in the literature. In this paper, we propose a generic defense system called TaintRadar to accurately detect localized adversarial examples via analyzing critical regions that have been manipulated by attackers. The main idea is that when removing critical regions from input images, the ranking changes of adversarial labels will be larger than those of benign labels. Compared with existing defense solutions, TaintRadar can effectively capture sophisticated localized partial attacks, e.g., the eye-glasses attack, while not requiring additional training or fine-tuning of the original model's structure. Comprehensive experiments have been conducted in both digital and physical worlds to verify the effectiveness and robustness of our defense.
ISSN:	2641-9874
DOI:	10.1109/INFOCOM42981.2021.9488754