Handling the adversarial attacks

Bibliographic Details
Published in: Journal of Ambient Intelligence and Humanized Computing, Vol. 10, No. 8, pp. 2929-2943
Main Authors: Cao, Ning; Li, Guofu; Zhu, Pengjia; Sun, Qian; Wang, Yingying; Li, Jing; Yan, Maoling; Zhao, Yongbin
Format: Journal Article
Language: English
Published: Heidelberg: Springer Nature B.V., 01.08.2019

Summary: The i.i.d. assumption is the cornerstone of most conventional machine learning algorithms. However, reducing the bias and variance of a learning model on an i.i.d. dataset may not prevent it from failing on adversarial samples, which are intentionally generated by malicious users or rival programs. This paper gives a brief introduction to machine learning and adversarial learning, discussing the research frontier of adversarial issues noticed by both the machine learning and the network security fields. We argue that one key reason for the adversarial issue is that learning algorithms may not exploit the input feature set enough, so that attackers can focus on a small set of features to trick the model. To address this issue, we consider two important classes of classifiers. For random forests, we propose a variant called Weighted Random Forest (WRF) that encourages the model to give even credit to the input features; this approach can be further improved by carefully selecting a subset of trees based on clustering analysis at run time. For neural networks, we propose to introduce extra soft constraints based on the weight variance into the objective function, such that the model bases its classification decisions on a more evenly distributed feature impact. Empirical experiments show that these approaches effectively improve the robustness of the learned models over their baseline systems.
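The weight-variance soft constraint mentioned in the summary can be illustrated with a minimal sketch. The PyTorch snippet below adds a penalty on the variance of per-input-feature weight mass to a standard cross-entropy loss; the layer choice, the `feature_mass` proxy, and the coefficient `lam` are illustrative assumptions, not the authors' exact formulation.

```python
import torch
import torch.nn as nn

# Minimal sketch (assumptions, not the paper's exact method): a small classifier
# whose training loss adds a penalty on the variance of per-feature weight mass,
# nudging the model to spread its decision over many input features.
class SmallNet(nn.Module):
    def __init__(self, n_features: int, n_classes: int):
        super().__init__()
        self.fc1 = nn.Linear(n_features, 64)
        self.fc2 = nn.Linear(64, n_classes)

    def forward(self, x):
        return self.fc2(torch.relu(self.fc1(x)))

def loss_with_weight_variance_penalty(model, x, y, lam=0.1):
    """Cross-entropy plus lam * variance of per-input-feature weight mass."""
    logits = model(x)
    ce = nn.functional.cross_entropy(logits, y)
    # Per-feature importance proxy: total |weight| attached to each input feature.
    feature_mass = model.fc1.weight.abs().sum(dim=0)  # shape: (n_features,)
    penalty = feature_mass.var()                      # low variance = even credit
    return ce + lam * penalty

# Usage sketch with random data
model = SmallNet(n_features=20, n_classes=2)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.randn(32, 20)
y = torch.randint(0, 2, (32,))
loss = loss_with_weight_variance_penalty(model, x, y)
opt.zero_grad()
loss.backward()
opt.step()
```

The intuition, under these assumptions, is that an attacker who perturbs only a handful of features gains less leverage when no single feature dominates the decision.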
ISSN: 1868-5137; 1868-5145
DOI: 10.1007/s12652-018-0714-6