Towards Discovering the Intrinsic Cardinality and Dimensionality of Time Series Using MDL

Most algorithms for mining or indexing time series data do not operate directly on the original data, but instead they consider alternative representations that include transforms, quantization, approximation, and multi-resolution abstractions. Choosing the best representation and abstraction level...

Full description

Saved in:
Bibliographic Details
Published inAlgorithmic Probability and Friends. Bayesian Prediction and Artificial Intelligence pp. 184 - 197
Main Authors Hu, Bing, Rakthanmanon, Thanawin, Hao, Yuan, Evans, Scott, Lonardi, Stefano, Keogh, Eamonn
Format Book Chapter
LanguageEnglish
Published Berlin, Heidelberg Springer Berlin Heidelberg
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Most algorithms for mining or indexing time series data do not operate directly on the original data, but instead they consider alternative representations that include transforms, quantization, approximation, and multi-resolution abstractions. Choosing the best representation and abstraction level for a given task/dataset is arguably the most critical step in time series data mining. In this paper, we investigate techniques that discover the natural intrinsic representation model, dimensionality and alphabet cardinality of a time series. The ability to discover these intrinsic features has implications beyond selecting the best parameters for particular algorithms, as characterizing data in such a manner is useful in its own right and an important sub-routine in algorithms for classification, clustering and outlier discovery. We will frame the discovery of these intrinsic features in the Minimal Description Length (MDL) framework. Extensive empirical tests show that our method is simpler, more general and significantly more accurate than previous methods, and has the important advantage of being essentially parameter-free.
ISBN:3642449573
9783642449574
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-642-44958-1_14