Network Intrusion Detection System Based on an Adversarial Auto-Encoder with Few Labeled Training Samples

Bibliographic Details
Published in	Journal of Network and Systems Management, Vol. 31, no. 1, p. 5
Main Author	Shiomoto, Kohei
Format	Journal Article
Language	English
Published	New York: Springer US, 01.03.2023; Springer Nature B.V.
Subjects	Annotations; Coders; Communications Engineering; Communications traffic; Computer Communication Networks; Computer Science; Computer Systems Organization and Communication Networks; Cybercrime; Cybersecurity; Information Systems and Communication Service; Intrusion detection systems; Machine learning; Multilayer perceptrons; Multilayers; Networks; Operations Research/Decision Theory; Semi-supervised learning; Training; Network intrusion detection system; Adversarial auto-encoder

Summary:	Network intrusion detection systems (NIDS) are critical to defending network systems from cyber attacks. Recently, machine learning has been applied to enhance NIDS capability. To train a supervised machine-learning model, a large number of labeled training samples are required to achieve practical performance. However, labeling data samples is a costly task. Additionally, obtaining anomaly data samples is difficult because trends in the network traffic monitored by NIDS change daily, and new attacks continue to be generated. To address this issue, we propose a semi-supervised machine-learning-based NIDS that reduces the required number of labeled training samples by applying an adversarial auto-encoder (AAE) technique. We evaluated the proposed method through a series of experiments and confirmed that the proposed AAE-based NIDS achieves performance comparable to that of multi-layer perceptron-based NIDS with only 0.1% of the labeled training samples. We also confirmed that the selection of data samples for annotation does not affect the performance of the proposed AAE-based NIDS. In addition, we evaluated the relationship between the performance of the proposed method and the dimension of its latent-variable vector. The best performance as measured by recall and F1 score occurred when the dimensionality of the latent-variable vector was 10, which suggests that this structure allows for accurate decomposition of attack and normal traffic. This study presents promising results obtained by the proposed semi-supervised learning method with a reduced number of labeled training samples, which reduces the operational costs of a machine-learning-based NIDS.
ISSN:	1064-7570, 1573-7705
DOI:	10.1007/s10922-022-09698-w