A Modified Boosted Ensemble Classifier on Location Based Social Networking

One of the research issues that researchers are interested in is unbalanced data classification techniques. Boosting approaches like Wang's Boosting and Modified Boosted SVM (MBSVM) have been demonstrated to be more effective for unbalanced data. Our proposal The Modified Boosted Random Forest...

Full description

Saved in:

Bibliographic Details
Published in	Journal of information and organizational sciences Vol. 47; no. 1; pp. 65 - 82
Main Authors	Shree K, Lakshmi, Ashok Kumar, Ranganath
Format	Journal Article
Language	English
Published	Varazdin Faculty of organization and informatics, University of Zagreb 01.01.2023 Fakultet organizacije i informatike, Sveučilište u Zagrebu Sveuciliste u Zagrebu, Fakultet Organizacije i Informatike University of Zagreb, Faculty of organization and informatics
Subjects	Algorithms Classifiers Datasets Decision trees ICT Information and Communications Technologies Imbalanced data learning Location Based Social Media (LBSM) Random Forest Recommender systems Social Network Services (SNS) Social Network Services (SNS) Imbalanced data learning Location Based Social Media (LBSM) Random Forest
Online Access	Get full text

Cover

Loading…

More Information
Summary:	One of the research issues that researchers are interested in is unbalanced data classification techniques. Boosting approaches like Wang's Boosting and Modified Boosted SVM (MBSVM) have been demonstrated to be more effective for unbalanced data. Our proposal The Modified Boosted Random Forest (MBRF) classifier is a Random Forest classifier that uses the Boosting approach. The main motivation of the study is to analyze sentiment of geotagged tweets understanding the state of mind of people at FIFA and Olympics datasets. Tree based model Random Forest algorithm using boosting approach classifies the tweets to build a recommendation system with an idea of providing commercial suggestions to participants, recommending local places to visit or perform activities. MBRF employs various strategies: i) a distance-based weight-update method based on K-Medoids ii) a sign-based classifier elimination technique. We have equally partitioned the datasets as 70% of data allocated for training and the remaining 30% data as test data. Our imbalanced data ratio measured 3.1666 and 4.6 for FIFA and Olympics datasets. We looked at accuracy, precision, recall and ROC curves for each event. The average AUC achieved by MBRF on FIFA dataset is 0.96 and Olympics is 0.97. A comparison of MBRF and Decision tree model using 'Entropy' proved MBRF better.
ISSN:	1846-3312 1846-9418
DOI:	10.31341/jios.47.1.4