A survey on data stream clustering and classification

Nowadays, with the advance of technology, many applications generate huge amounts of data streams at very high speed. Examples include network traffic, web click streams, video surveillance, and sensor networks. Data stream mining has become a hot research topic. Its goal is to extract hidden knowle...

Full description

Saved in:

Bibliographic Details
Published in	Knowledge and information systems Vol. 45; no. 3; pp. 535 - 569
Main Authors	Nguyen, Hai-Long, Woon, Yew-Kwong, Ng, Wee-Keong
Format	Journal Article
Language	English
Published	London Springer London 01.12.2015 Springer Nature B.V
Subjects	Algorithms Classification Clustering Computer Science Data analysis Data mining Data Mining and Knowledge Discovery Data transmission Database Management Datasets Decision trees High speed Information Storage and Retrieval Information systems Information Systems and Communication Service Information Systems Applications (incl.Internet) Internet service providers IT in Business Networks Queries Sensors Surveillance Survey Paper Survey Clustering Data stream mining Classification
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Nowadays, with the advance of technology, many applications generate huge amounts of data streams at very high speed. Examples include network traffic, web click streams, video surveillance, and sensor networks. Data stream mining has become a hot research topic. Its goal is to extract hidden knowledge/patterns from continuous data streams. Unlike traditional data mining where the dataset is static and can be repeatedly read many times, data stream mining algorithms face many challenges and have to satisfy constraints such as bounded memory, single-pass, real-time response, and concept-drift detection. This paper presents a comprehensive survey of the state-of-the-art data stream mining algorithms with a focus on clustering and classification because of their ubiquitous usage. It identifies mining constraints, proposes a general model for data stream mining, and depicts the relationship between traditional data mining and data stream mining. Furthermore, it analyzes the advantages as well as limitations of data stream algorithms and suggests potential areas for future research.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0219-1377 0219-3116
DOI:	10.1007/s10115-014-0808-1