Anytime Subgroup Discovery in Numerical Domains with Guarantees

Subgroup discovery is the task of discovering patterns that accurately discriminate a class label from the others. Existing approaches can uncover such patterns either through an exhaustive or an approximate exploration of the pattern search space. However, an exhaustive exploration is generally unf...

Full description

Saved in:
Bibliographic Details
Published inMachine Learning and Knowledge Discovery in Databases Vol. 11052; pp. 500 - 516
Main Authors Belfodil, Aimene, Belfodil, Adnene, Kaytoue, Mehdi
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2019
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3030109275
9783030109271
ISSN0302-9743
1611-3349
DOI10.1007/978-3-030-10928-8_30

Cover

Loading…
More Information
Summary:Subgroup discovery is the task of discovering patterns that accurately discriminate a class label from the others. Existing approaches can uncover such patterns either through an exhaustive or an approximate exploration of the pattern search space. However, an exhaustive exploration is generally unfeasible whereas approximate approaches do not provide guarantees bounding the error of the best pattern quality nor the exploration progression (“How far are we of an exhaustive search”). We design here an algorithm for mining numerical data with three key properties w.r.t. the state of the art: (i) It yields progressively interval patterns whose quality improves over time; (ii) It can be interrupted anytime and always gives a guarantee bounding the error on the top pattern quality and (iii) It always bounds a distance to the exhaustive exploration. After reporting experimentations showing the effectiveness of our method, we discuss its generalization to other kinds of patterns. Code related to this paper is available at: https://github.com/Adnene93/RefineAndMine.
Bibliography:Electronic supplementary materialThe online version of this chapter (https://doi.org/10.1007/978-3-030-10928-8_30) contains supplementary material, which is available to authorized users.
A. Belfodil and A. Belfodil—Both authors contributed equally to this work.
ISBN:3030109275
9783030109271
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-030-10928-8_30