Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network

Bibliographic Details
Published in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 15105-15115
Main Authors: Lee, Byung-Kwan; Kim, Junho; Ro, Yong Man
Format: Conference Proceeding
Language: English
Published: IEEE, 01.06.2022
Summary: Adversarial examples undermine reliability and raise potential security issues in deep neural networks. Although adversarial training has been widely studied as a way to improve adversarial robustness, it operates in an over-parameterized regime and demands heavy computation and large memory budgets. To bridge adversarial robustness and model compression, we propose a novel adversarial pruning method, Masking Adversarial Damage (MAD), that employs second-order information of the adversarial loss. With it, we can accurately estimate adversarial saliency for model parameters and determine which parameters can be pruned without weakening adversarial robustness. Furthermore, we reveal that the model parameters of the initial layers are highly sensitive to adversarial examples and show that the compressed feature representation retains semantic information about the target objects. Through extensive experiments on three public datasets, we demonstrate that MAD effectively prunes adversarially trained networks without losing adversarial robustness and outperforms previous adversarial pruning methods.
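To make the second-order saliency idea concrete, the sketch below scores each weight with an Optimal-Brain-Damage-style criterion, 0.5 * H_ii * w_i^2, approximating the Hessian diagonal of the adversarial loss by the squared gradient (an empirical-Fisher proxy), then masks the lowest-scoring weights. This is illustrative only, not the paper's implementation: the function names (adversarial_saliency, prune_by_saliency), the pre-computed adversarial inputs x_adv (e.g., from a PGD attack), and the diagonal approximation are all assumptions; MAD's actual second-order estimator differs in its details.

```python
import torch
import torch.nn.functional as F

def adversarial_saliency(model, x_adv, y):
    # Adversarial loss evaluated on pre-computed adversarial examples
    # x_adv (assumed to come from, e.g., a PGD attack -- not shown here).
    model.zero_grad()
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    saliency = {}
    for name, p in model.named_parameters():
        if p.grad is None:
            continue
        h_diag = p.grad.detach() ** 2                    # empirical-Fisher proxy for H_ii
        saliency[name] = 0.5 * h_diag * p.detach() ** 2  # OBD-style score: 0.5 * H_ii * w_i^2
    return saliency

def prune_by_saliency(model, saliency, sparsity=0.9):
    # Zero out the `sparsity` fraction of weights with the lowest saliency.
    scores = torch.cat([s.flatten() for s in saliency.values()])
    threshold = torch.quantile(scores.float(), sparsity)
    with torch.no_grad():
        for name, p in model.named_parameters():
            if name in saliency:
                p.mul_((saliency[name] > threshold).float())
```

A diagonal approximation keeps the saliency computation to a single backward pass, at the cost of ignoring parameter interactions; richer block-wise second-order estimates, as pursued in the paper, trade extra computation for a more faithful ranking.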
ISSN: 2575-7075
DOI: 10.1109/CVPR52688.2022.01470