Real-world-events data sifting through ultra-small labeled datasets and graph fusion

Bibliographic Details
Published in Applied soft computing Vol. 132; p. 109865
Main Authors Vega-Oliveros, Didier A., Nascimento, José, Lavi, Bahram, Rocha, Anderson
Format Journal Article
Language English
Published Elsevier B.V 01.01.2023
Summary: Information on social media is vital, especially for events such as natural disasters or terrorist attacks, which can cause rapid growth in data sharing through social media networks. However, collecting and processing data about an event is challenging and requires a great deal of data cleaning and filtering to separate what is relevant to the event from what is not. The data sifting task aims to identify the content related to the depicted event. We propose a learning strategy that dynamically learns complementary contributions from different data-driven features through a semi-supervised graph-fusion technique. The proposed method relies on a minimal number of labeled training samples (ultra-small data learning). Learning from a small labeled set is also of particular interest to forensic investigators and medical researchers, given the cost of massive data labeling and the need for energy-efficient computing that reduces redundancy and repetition. We assess the effectiveness of the proposed semi-supervised method on five datasets from real-world events. Compared with prior art (both supervised and semi-supervised), experimental results show that the proposed method achieves the best classification results and the most efficient computational footprint.
• We tackle the data sifting problem as a classification task with ultra-small labeled data.
• We extract specialized features and aggregate them in a graph-based fusion approach.
• We propose a Graph-SSL-Fusion technique to learn with ultra-small labeled examples.
• We present a Green AI method for data sifting in an ultra-small labeling regime.
• Runtime and F1-score are competitive with the prior-art supervised method.
ISSN: 1568-4946
1872-9681
DOI: 10.1016/j.asoc.2022.109865