A new data normalization method for unsupervised anomaly intrusion detection

Unsupervised anomaly detection can detect attacks without the need for clean or labeled training data. This paper studies the application of clustering to unsupervised anomaly detection (ACUAD). Data records are mapped to a feature space. Anomalies are detected by determining which points lie in the...

Full description

Saved in:
Bibliographic Details
Published inFrontiers of information technology & electronic engineering Vol. 11; no. 10; pp. 778 - 784
Main Authors Cai, Long-zheng, Chen, Jian, Ke, Yun, Chen, Tao, Li, Zhi-gang
Format Journal Article
LanguageEnglish
Published Heidelberg SP Zhejiang University Press 01.10.2010
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Unsupervised anomaly detection can detect attacks without the need for clean or labeled training data. This paper studies the application of clustering to unsupervised anomaly detection (ACUAD). Data records are mapped to a feature space. Anomalies are detected by determining which points lie in the sparse regions of the feature space. A critical element for this method to be effective is the definition of the distance function between data records. We propose a unified normalization distance framework for records with numeric and nominal features mixed data. A heuristic method that computes the distance for nominal features is proposed, taking advantage of an important characteristic of nominal features--their probability distribution. Then, robust methods are proposed for mapping numeric features and computing their distance, these being able to tolerate the impact of the value difference in scale and diversification among features, and outliers introduced by intrusions. Empirical experiments with the KDD 1999 dataset showed that ACUAD can detect intrusions with relatively low false alarm rates compared with other approaches.
Bibliography:Unsupervised anomaly detection, Data mining, Intrusion detection, Network security
TP393.08
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:1869-1951
2095-9184
1869-196X
2095-9230
DOI:10.1631/jzus.C0910625