Handling Imbalanced Data in Predictive Maintenance: A Resampling-Based Approach
Imbalanced data is a common problem in many areas, and it can have significant impacts on the performance and generalizability of machine learning models. This is because the models fail to create a good representation of the examples in the minority class. This study aims at improving the classific...
Saved in:
Published in | 2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA) pp. 1 - 6 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
08.06.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Imbalanced data is a common problem in many areas, and it can have significant impacts on the performance and generalizability of machine learning models. This is because the models fail to create a good representation of the examples in the minority class. This study aims at improving the classification success for the predictive maintenance tasks in which the data is generally imbalanced. To this end, we use resampling methods that target creating balanced data. We present various oversampling and undersampling techniques and apply them to both synthetic and real-world datasets. We then perform classification experiments with imbalanced and balanced datasets by using different classifiers. The performances of different classifiers have been compared. More importantly, we evaluate the effectiveness of resampling techniques to provide insights into their usefulness in handling class imbalance. Our study contributes to the growing body of literature on addressing the class imbalance in classification tasks and provides practical guidance for selecting appropriate sampling methods based on the characteristics of the dataset. |
---|---|
AbstractList | Imbalanced data is a common problem in many areas, and it can have significant impacts on the performance and generalizability of machine learning models. This is because the models fail to create a good representation of the examples in the minority class. This study aims at improving the classification success for the predictive maintenance tasks in which the data is generally imbalanced. To this end, we use resampling methods that target creating balanced data. We present various oversampling and undersampling techniques and apply them to both synthetic and real-world datasets. We then perform classification experiments with imbalanced and balanced datasets by using different classifiers. The performances of different classifiers have been compared. More importantly, we evaluate the effectiveness of resampling techniques to provide insights into their usefulness in handling class imbalance. Our study contributes to the growing body of literature on addressing the class imbalance in classification tasks and provides practical guidance for selecting appropriate sampling methods based on the characteristics of the dataset. |
Author | Cicak, Sejma Avci, Umut |
Author_xml | – sequence: 1 givenname: Sejma surname: Cicak fullname: Cicak, Sejma email: sejma.cicak@gmail.com organization: Yaşar University,Engineering Faculty,Bornova/İzmir,Türkiye – sequence: 2 givenname: Umut surname: Avci fullname: Avci, Umut email: umut.avci@yasar.edu.tr organization: Yaşar University,Engineering Faculty,Bornova/İzmir,Türkiye |
BookMark | eNo1j9FKwzAUhiPohc69gWBeoDXpaZoc7-p0djCpDL0ep02igTYrbRF8ezfUq__i4_vgv2Ln8RAdY7dSpFIKvKvqXakMaJNmIoNUCqkKjXjGlqjRgBIAWmXqktUVRduF-ME3fUMdxdZZ_kgz8RD56-hsaOfw5fgLhTi7eOL3vOQ7N1E_nLzkgaajUg7DeKD285pdeOomt_zbBXtfP72tqmRbP29W5TYJUuKcaFJaoAephG2ELhC9sdbbVuVoQRNmjXY296oQBknmFn1rCplnCjQcGSzYzW83OOf2wxh6Gr_3_z_hB_PQTEc |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/HORA58378.2023.10156799 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Explore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9798350337525 |
EndPage | 6 |
ExternalDocumentID | 10156799 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i119t-7a5709f3150db07699f8ddfdc549d37a92b7ed4f56089a14d9fc8614253732b73 |
IEDL.DBID | RIE |
IngestDate | Thu Jan 18 11:13:49 EST 2024 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i119t-7a5709f3150db07699f8ddfdc549d37a92b7ed4f56089a14d9fc8614253732b73 |
PageCount | 6 |
ParticipantIDs | ieee_primary_10156799 |
PublicationCentury | 2000 |
PublicationDate | 2023-June-8 |
PublicationDateYYYYMMDD | 2023-06-08 |
PublicationDate_xml | – month: 06 year: 2023 text: 2023-June-8 day: 08 |
PublicationDecade | 2020 |
PublicationTitle | 2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA) |
PublicationTitleAbbrev | HORA |
PublicationYear | 2023 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.8825588 |
Snippet | Imbalanced data is a common problem in many areas, and it can have significant impacts on the performance and generalizability of machine learning models. This... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1 |
SubjectTerms | classification Data models Human computer interaction imbalanced data Machine learning Machine learning algorithms Prediction algorithms predictive maintenance Robots Task analysis |
Title | Handling Imbalanced Data in Predictive Maintenance: A Resampling-Based Approach |
URI | https://ieeexplore.ieee.org/document/10156799 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1dS8MwFA3OJ59UnPhNHnxNbZukSXybH6MK22Q42NvIpwyxE-1e_PXmdp2iIPhS2ibQktCe3NxzzkXo3EHqKxQ50YozwnjKiAmckyBSLj2VsQkCxcGwKCfsfsqnrVi90cJ47xvymU_gtMnlu4VdwlZZ_MJjtCGU6qBOPK7EWi1nK0vVRTka9zgYpCdQEzxZ9_5RN6WBjf42Gq4fuGKLPCfL2iT245cX47_faAd1vxV6-OELe3bRhq_20KgEy4R4je9eDFAWrXf4Rtcaz6vYG1Iy8HPDAw0mEeC04S9xD4_9uwZeefVEriKmOdxrfca7aNK_fbwuSVswgcyzTNVEaC5SFWhc5DmTikKpIJ0LzsYg0FGhVW6EdyzEVY5UOmNOBSsjPuecChrb6D7arBaVP0BYWl4oAfLKnDIIKqS1zqhC-zwonfpD1IXRmL2uPDFm64E4-uP-MdqCSWlIVvIEbdZvS38a4bw2Z800fgJtUp74 |
link.rule.ids | 310,311,783,787,792,793,799,27938,55087 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3JTsMwELWgHOAEiCJ2fODqkM2xza0sVQpdUNVKvVVeUYVIEaQXvh5PmoJAQuKWxJYS2UreTOa9NwhdGCh9uSwmUtCUpDRMiXKUEsdCym3C_RAkir1-lo_T-wmd1GL1Sgtjra3IZzaAw6qWb-Z6Ab_K_Bvusw0mxDra8IE1Z0u5Vs3aikJxmQ-GLQoW6QF0BQ9W8390TqmAo72N-qtbLvkiz8GiVIH--OXG-O9n2kHNb40efvxCn120Zos9NMjBNMGf486LAtKitgbfylLiWeFnQ1EGPm-4J8EmArw27BVu4aF9l8AsL57ItUc1g1u103gTjdt3o5uc1C0TyCyKREmYpCwULvFhnlEhy4Rw3BhntE8DTcKkiBWzJnU-zuFCRqkRTnOP0DFNWOLHkn3UKOaFPUCYa5oJBgLLOEkhreBaGyUyaWMnZGgPURNWY_q6dMWYrhbi6I_r52gzH_W6026n_3CMtmCDKsoVP0GN8m1hTz24l-qs2tJP7-KiRA |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2023+5th+International+Congress+on+Human-Computer+Interaction%2C+Optimization+and+Robotic+Applications+%28HORA%29&rft.atitle=Handling+Imbalanced+Data+in+Predictive+Maintenance%3A+A+Resampling-Based+Approach&rft.au=Cicak%2C+Sejma&rft.au=Avci%2C+Umut&rft.date=2023-06-08&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FHORA58378.2023.10156799&rft.externalDocID=10156799 |