Effect of Data Repair on Mining Network Streams

Data quality issues have special implications in network data. Data glitches are propagated rapidly along pathways dictated by the hierarchy and topology of the network. In this paper, we use temporal data from a vast data network to study data glitches and their effect on network monitoring tasks s...

Full description

Saved in:
Bibliographic Details
Published in2012 IEEE 12th International Conference on Data Mining Workshops pp. 226 - 233
Main Authors Ji Meng Loh, Dasu, T.
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2012
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Data quality issues have special implications in network data. Data glitches are propagated rapidly along pathways dictated by the hierarchy and topology of the network. In this paper, we use temporal data from a vast data network to study data glitches and their effect on network monitoring tasks such as anomaly detection. We demonstrate the consequences of cleaning the data, and develop targeted and customized cleaning strategies by exploiting the network hierarchy.
ISSN:2375-9232
2375-9259
DOI:10.1109/ICDMW.2012.125