Adaptive fuzzy partitions for evolving association rules in big data stream
Published in | International journal of approximate reasoning Vol. 93; pp. 463 - 486 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published | Elsevier Inc, 01.02.2018 |
Subjects | |
Summary: | The amount of data generated in industrial and scientific applications is constantly increasing. These data often arrive as a chronologically ordered, unlabeled flow that exceeds usual storage and processing capacities. Association stream mining is an appealing field that models complex environments online by finding relationships among the attributes without presupposing any a priori structure. The discovered relationships are continuously adapted to the dynamics of the problem in a purely online way, and the approach can handle both categorical and continuous attributes. This paper presents Fuzzy-CSar-AFP, a new advanced version of an online genetic fuzzy system designed to obtain interesting fuzzy association rules from data streams. It can manage partitions of different granularity for the variables, which allows the algorithm to adapt to the precision requirements of each variable in the rule. It can also work with data streams without knowing the domains of the attributes in advance, as it includes a mechanism that updates them in real time. The performance of Fuzzy-CSar-AFP is validated on an original real-world psychophysiology problem in which associations between different electroencephalogram signals are analyzed in subjects exposed to different stimuli.
• Presents a new advanced version of an online genetic fuzzy system. • The algorithm is designed to obtain fuzzy association rules from data streams. • Able to manage partitions of different granularity without knowing the attributes' domains. • Performance is evaluated on an original real-world psychophysiology problem. • Proposes new visualizations and numeric measures to assess the algorithm's performance. |
---|---|
ISSN: | 0888-613X 1873-4731 |
DOI: | 10.1016/j.ijar.2017.11.014 |
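
The summary mentions two mechanisms: partitions of different granularity per variable, and attribute domains that are updated as the stream arrives. This record does not describe how Fuzzy-CSar-AFP implements either, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes uniform triangular fuzzy partitions and a simple running minimum/maximum for the domain; the class `AdaptiveVariable` and its methods are hypothetical names introduced here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AdaptiveVariable:
    """Hypothetical helper: tracks the observed domain of one stream attribute."""

    low: float = float("inf")
    high: float = float("-inf")

    def update(self, x: float) -> None:
        # Assumption: the domain simply widens when a value falls outside it.
        self.low = min(self.low, x)
        self.high = max(self.high, x)

    def memberships(self, x: float, n_labels: int) -> List[float]:
        """Membership degrees of x in a uniform triangular partition.

        n_labels is the granularity: the number of linguistic terms whose
        peaks are spread evenly over the currently known domain.
        """
        if n_labels < 2:
            raise ValueError("a partition needs at least two labels")
        if self.high <= self.low:
            # Domain not yet established (fewer than two distinct values seen).
            return [1.0] + [0.0] * (n_labels - 1)
        step = (self.high - self.low) / (n_labels - 1)
        x = min(max(x, self.low), self.high)  # clamp to the known domain
        return [
            max(0.0, 1.0 - abs(x - (self.low + i * step)) / step)
            for i in range(n_labels)
        ]


if __name__ == "__main__":
    var = AdaptiveVariable()
    for value in (0.0, 10.0, 5.0):  # toy stream
        var.update(value)
    print(var.memberships(2.5, 3))  # coarse partition: 3 labels
    print(var.memberships(2.5, 5))  # finer partition: 5 labels
```

In this sketch the same value can be evaluated under a coarse or a fine partition of the same variable, which is one simple way to picture a rule choosing the precision it needs for each variable.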