Handling Imbalanced Data in Predictive Maintenance: A Resampling-Based Approach

Imbalanced data is a common problem in many areas, and it can have significant impacts on the performance and generalizability of machine learning models. This is because the models fail to create a good representation of the examples in the minority class. This study aims at improving the classific...

Full description

Saved in:
Bibliographic Details
Published in2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA) pp. 1 - 6
Main Authors Cicak, Sejma, Avci, Umut
Format Conference Proceeding
LanguageEnglish
Published IEEE 08.06.2023
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Imbalanced data is a common problem in many areas, and it can have significant impacts on the performance and generalizability of machine learning models. This is because the models fail to create a good representation of the examples in the minority class. This study aims at improving the classification success for the predictive maintenance tasks in which the data is generally imbalanced. To this end, we use resampling methods that target creating balanced data. We present various oversampling and undersampling techniques and apply them to both synthetic and real-world datasets. We then perform classification experiments with imbalanced and balanced datasets by using different classifiers. The performances of different classifiers have been compared. More importantly, we evaluate the effectiveness of resampling techniques to provide insights into their usefulness in handling class imbalance. Our study contributes to the growing body of literature on addressing the class imbalance in classification tasks and provides practical guidance for selecting appropriate sampling methods based on the characteristics of the dataset.
AbstractList Imbalanced data is a common problem in many areas, and it can have significant impacts on the performance and generalizability of machine learning models. This is because the models fail to create a good representation of the examples in the minority class. This study aims at improving the classification success for the predictive maintenance tasks in which the data is generally imbalanced. To this end, we use resampling methods that target creating balanced data. We present various oversampling and undersampling techniques and apply them to both synthetic and real-world datasets. We then perform classification experiments with imbalanced and balanced datasets by using different classifiers. The performances of different classifiers have been compared. More importantly, we evaluate the effectiveness of resampling techniques to provide insights into their usefulness in handling class imbalance. Our study contributes to the growing body of literature on addressing the class imbalance in classification tasks and provides practical guidance for selecting appropriate sampling methods based on the characteristics of the dataset.
Author Cicak, Sejma
Avci, Umut
Author_xml – sequence: 1
  givenname: Sejma
  surname: Cicak
  fullname: Cicak, Sejma
  email: sejma.cicak@gmail.com
  organization: Yaşar University,Engineering Faculty,Bornova/İzmir,Türkiye
– sequence: 2
  givenname: Umut
  surname: Avci
  fullname: Avci, Umut
  email: umut.avci@yasar.edu.tr
  organization: Yaşar University,Engineering Faculty,Bornova/İzmir,Türkiye
BookMark eNo1j9FKwzAUhiPohc69gWBeoDXpaZoc7-p0djCpDL0ep02igTYrbRF8ezfUq__i4_vgv2Ln8RAdY7dSpFIKvKvqXakMaJNmIoNUCqkKjXjGlqjRgBIAWmXqktUVRduF-ME3fUMdxdZZ_kgz8RD56-hsaOfw5fgLhTi7eOL3vOQ7N1E_nLzkgaajUg7DeKD285pdeOomt_zbBXtfP72tqmRbP29W5TYJUuKcaFJaoAephG2ELhC9sdbbVuVoQRNmjXY296oQBknmFn1rCplnCjQcGSzYzW83OOf2wxh6Gr_3_z_hB_PQTEc
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/HORA58378.2023.10156799
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Explore
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350337525
EndPage 6
ExternalDocumentID 10156799
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i119t-7a5709f3150db07699f8ddfdc549d37a92b7ed4f56089a14d9fc8614253732b73
IEDL.DBID RIE
IngestDate Thu Jan 18 11:13:49 EST 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i119t-7a5709f3150db07699f8ddfdc549d37a92b7ed4f56089a14d9fc8614253732b73
PageCount 6
ParticipantIDs ieee_primary_10156799
PublicationCentury 2000
PublicationDate 2023-June-8
PublicationDateYYYYMMDD 2023-06-08
PublicationDate_xml – month: 06
  year: 2023
  text: 2023-June-8
  day: 08
PublicationDecade 2020
PublicationTitle 2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)
PublicationTitleAbbrev HORA
PublicationYear 2023
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8825588
Snippet Imbalanced data is a common problem in many areas, and it can have significant impacts on the performance and generalizability of machine learning models. This...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms classification
Data models
Human computer interaction
imbalanced data
Machine learning
Machine learning algorithms
Prediction algorithms
predictive maintenance
Robots
Task analysis
Title Handling Imbalanced Data in Predictive Maintenance: A Resampling-Based Approach
URI https://ieeexplore.ieee.org/document/10156799
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1dS8MwFA3OJ59UnPhNHnxNbZukSXybH6MK22Q42NvIpwyxE-1e_PXmdp2iIPhS2ibQktCe3NxzzkXo3EHqKxQ50YozwnjKiAmckyBSLj2VsQkCxcGwKCfsfsqnrVi90cJ47xvymU_gtMnlu4VdwlZZ_MJjtCGU6qBOPK7EWi1nK0vVRTka9zgYpCdQEzxZ9_5RN6WBjf42Gq4fuGKLPCfL2iT245cX47_faAd1vxV6-OELe3bRhq_20KgEy4R4je9eDFAWrXf4Rtcaz6vYG1Iy8HPDAw0mEeC04S9xD4_9uwZeefVEriKmOdxrfca7aNK_fbwuSVswgcyzTNVEaC5SFWhc5DmTikKpIJ0LzsYg0FGhVW6EdyzEVY5UOmNOBSsjPuecChrb6D7arBaVP0BYWl4oAfLKnDIIKqS1zqhC-zwonfpD1IXRmL2uPDFm64E4-uP-MdqCSWlIVvIEbdZvS38a4bw2Z800fgJtUp74
link.rule.ids 310,311,783,787,792,793,799,27938,55087
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3JTsMwELWgHOAEiCJ2fODqkM2xza0sVQpdUNVKvVVeUYVIEaQXvh5PmoJAQuKWxJYS2UreTOa9NwhdGCh9uSwmUtCUpDRMiXKUEsdCym3C_RAkir1-lo_T-wmd1GL1Sgtjra3IZzaAw6qWb-Z6Ab_K_Bvusw0mxDra8IE1Z0u5Vs3aikJxmQ-GLQoW6QF0BQ9W8390TqmAo72N-qtbLvkiz8GiVIH--OXG-O9n2kHNb40efvxCn120Zos9NMjBNMGf486LAtKitgbfylLiWeFnQ1EGPm-4J8EmArw27BVu4aF9l8AsL57ItUc1g1u103gTjdt3o5uc1C0TyCyKREmYpCwULvFhnlEhy4Rw3BhntE8DTcKkiBWzJnU-zuFCRqkRTnOP0DFNWOLHkn3UKOaFPUCYa5oJBgLLOEkhreBaGyUyaWMnZGgPURNWY_q6dMWYrhbi6I_r52gzH_W6026n_3CMtmCDKsoVP0GN8m1hTz24l-qs2tJP7-KiRA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2023+5th+International+Congress+on+Human-Computer+Interaction%2C+Optimization+and+Robotic+Applications+%28HORA%29&rft.atitle=Handling+Imbalanced+Data+in+Predictive+Maintenance%3A+A+Resampling-Based+Approach&rft.au=Cicak%2C+Sejma&rft.au=Avci%2C+Umut&rft.date=2023-06-08&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FHORA58378.2023.10156799&rft.externalDocID=10156799