High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning

High-dimensional problem domains pose significant challenges for anomaly detection. The presence of irrelevant features can conceal the presence of anomalies. This problem, known as the ‘curse of dimensionality’, is an obstacle for many anomaly detection techniques. Building a robust anomaly detecti...

Full description

Saved in:
Bibliographic Details
Published inPattern recognition Vol. 58; pp. 121 - 134
Main Authors Erfani, Sarah M., Rajasegarar, Sutharshan, Karunasekera, Shanika, Leckie, Christopher
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.10.2016
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:High-dimensional problem domains pose significant challenges for anomaly detection. The presence of irrelevant features can conceal the presence of anomalies. This problem, known as the ‘curse of dimensionality’, is an obstacle for many anomaly detection techniques. Building a robust anomaly detection model for use in high-dimensional spaces requires the combination of an unsupervised feature extractor and an anomaly detector. While one-class support vector machines are effective at producing decision surfaces from well-behaved feature vectors, they can be inefficient at modelling the variation in large, high-dimensional datasets. Architectures such as deep belief networks (DBNs) are a promising technique for learning robust features. We present a hybrid model where an unsupervised DBN is trained to extract generic underlying features, and a one-class SVM is trained from the features learned by the DBN. Since a linear kernel can be substituted for nonlinear ones in our hybrid model without loss of accuracy, our model is scalable and computationally efficient. The experimental results show that our proposed model yields comparable anomaly detection performance with a deep autoencoder, while reducing its training and testing time by a factor of 3 and 1000, respectively. •We use a combination of a one-class SVM and deep learning.•In our model linear kernels can be used rather than nonlinear ones.•Our model delivers a comparable accuracy with a deep autoencoder.•Our model executes 3times faster in training and 1000 faster than a deep autoencoder.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2016.03.028