An active learning paradigm based on a priori data reduction and organization


Bibliographic Details
Published in: Expert Systems with Applications, Vol. 41, no. 14, pp. 6086-6097
Main Authors: Saito, Priscila T.M.; de Rezende, Pedro J.; Falcão, Alexandre X.; Suzuki, Celso T.N.; Gomes, Jancarlo F.
Format: Journal Article
Language: English
Published: Amsterdam, Elsevier Ltd, 15.10.2014
Summary:
• A novel active learning paradigm, called DROP, based on a priori data reduction and organization.
• DROP does not require classification and reorganization of all non-annotated samples in the dataset at each iteration.
• The proposed paradigm achieves high accuracy quickly with minimal user interaction.
• Results are shown with different clustering and classification strategies, and on a variety of real-world datasets.

In the past few years, active learning has been reasonably successful and has drawn a lot of attention. However, recent active learning methods have focused on strategies in which a large unlabeled dataset has to be reprocessed at each learning iteration. As datasets grow, these strategies become inefficient or even computationally prohibitive. To address these issues, we propose an effective and efficient active learning paradigm that significantly reduces the size of the learning set by applying an a priori process of identification and organization of a small relevant subset. Furthermore, because classification and selection run concomitantly, only a very small number of samples needs to be classified while the informative ones are selected. Experimental results show that the proposed paradigm achieves high accuracy quickly with minimal user interaction, further improving its efficiency.
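The general idea in the summary (reduce and order the data once, then classify only a small batch per iteration) can be illustrated with a minimal sketch. This is not the authors' DROP implementation: the record does not specify the clustering, ranking, or classification steps, so k-means, a boundary-margin ranking, and a 1-nearest-neighbour classifier are stand-ins chosen here for illustration, and every function name (`kmeans`, `reduce_and_organize`, `predict`, `active_learn`) is hypothetical.

```python
import math

def kmeans(points, k, iters=20):
    """Plain k-means with deterministic farthest-point initialisation (assumed stand-in)."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: math.dist(p, centers[j]))
            groups[nearest].append(p)
        # Recompute each center as the mean of its group; keep the old one if the group is empty.
        centers = [tuple(sum(axis) / len(g) for axis in zip(*g)) if g else centers[j]
                   for j, g in enumerate(groups)]
    return centers

def reduce_and_organize(points, k, keep):
    """One-off step (requires k >= 2): rank samples by closeness to a cluster
    boundary and keep only the `keep` most boundary-like indices."""
    centers = kmeans(points, k)

    def margin(i):
        d = sorted(math.dist(points[i], c) for c in centers)
        return d[1] - d[0]  # small margin = roughly equidistant from two clusters

    return sorted(range(len(points)), key=margin)[:keep]

def predict(labeled, points, x):
    """1-nearest-neighbour over the annotated samples only."""
    nearest = min(labeled, key=lambda i: math.dist(points[i], x))
    return labeled[nearest]

def active_learn(points, oracle, k, keep, batch):
    """Walk through the pre-organized subset in batches; `oracle(i)` plays the annotator."""
    order = reduce_and_organize(points, k, keep)  # computed once, never redone
    labeled, corrections = {}, 0
    for start in range(0, len(order), batch):
        chunk = order[start:start + batch]
        # Classify only this chunk, not the whole unlabeled pool.
        guesses = {i: predict(labeled, points, points[i]) if labeled else None
                   for i in chunk}
        for i in chunk:
            labeled[i] = oracle(i)  # annotator confirms or corrects the guess
            corrections += guesses[i] != labeled[i]
    return order, labeled, corrections
```

In this sketch the per-iteration cost depends on the batch size and the number of already annotated samples, not on the size of the full dataset, which is the property the summary emphasizes. Points are expected as tuples of floats, and `oracle` is any callable mapping a sample index to its true label.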
ISSN: 0957-4174, 1873-6793
DOI: 10.1016/j.eswa.2014.04.007