IoT Dataset Validation Using Machine Learning Techniques for Traffic Anomaly Detection

With advancements in engineering and science, the application of smart systems is increasing, generating a faster growth of the IoT network traffic. The limitations due to IoT restricted power and computing devices also raise concerns about security vulnerabilities. Machine learning-based techniques...

Full description

Saved in:
Bibliographic Details
Published inElectronics (Basel) Vol. 10; no. 22; p. 2857
Main Authors Vigoya, Laura, Fernandez, Diego, Carneiro, Victor, Nóvoa, Francisco J.
Format Journal Article
LanguageEnglish
Published Basel MDPI AG 01.11.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:With advancements in engineering and science, the application of smart systems is increasing, generating a faster growth of the IoT network traffic. The limitations due to IoT restricted power and computing devices also raise concerns about security vulnerabilities. Machine learning-based techniques have recently gained credibility in a successful application for the detection of network anomalies, including IoT networks. However, machine learning techniques cannot work without representative data. Given the scarcity of IoT datasets, the DAD emerged as an instrument for knowing the behavior of dedicated IoT-MQTT networks. This paper aims to validate the DAD dataset by applying Logistic Regression, Naive Bayes, Random Forest, AdaBoost, and Support Vector Machine to detect traffic anomalies in IoT. To obtain the best results, techniques for handling unbalanced data, feature selection, and grid search for hyperparameter optimization have been used. The experimental results show that the proposed dataset can achieve a high detection rate in all the experiments, providing the best mean accuracy of 0.99 for the tree-based models, with a low false-positive rate, ensuring effective anomaly detection.
ISSN:2079-9292
2079-9292
DOI:10.3390/electronics10222857