A review of ensemble learning and data augmentation models for class imbalanced problems: Combination, implementation and evaluation

Class imbalance (CI) in classification problems arises when the number of observations belonging to one class is lower than the other. Ensemble learning combines multiple models to obtain a robust model and has been prominently used with data augmentation methods to address class imbalance problems....

Full description

Saved in:
Bibliographic Details
Published inExpert systems with applications Vol. 244; p. 122778
Main Authors Khan, Azal Ahmad, Chaudhari, Omkar, Chandra, Rohitash
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 15.06.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Class imbalance (CI) in classification problems arises when the number of observations belonging to one class is lower than the other. Ensemble learning combines multiple models to obtain a robust model and has been prominently used with data augmentation methods to address class imbalance problems. In the last decade, a number of strategies have been added to enhance ensemble learning and data augmentation methods, along with new methods such as generative adversarial networks (GANs). A combination of these has been applied in many studies, and the evaluation of different combinations would enable a better understanding and guidance for different application domains. In this paper, we present a computational study to evaluate data augmentation and ensemble learning methods used to address prominent benchmark CI problems. We present a general framework that evaluates 9 data augmentation and 9 ensemble learning methods for CI problems. Our objective is to identify the most effective combination for improving classification performance on imbalanced datasets. The results indicate that combinations of data augmentation methods with ensemble learning can significantly improve classification performance on imbalanced datasets. We find that traditional data augmentation methods such as the synthetic minority oversampling technique (SMOTE) and random oversampling (ROS) are not only better in performance for selected CI problems, but also computationally less expensive than GANs. Our study is vital for the development of novel models for handling imbalanced datasets. •Class imbalance (CI) in classification problems arises one class is lower than the other classes.•We present a computational review to evaluate data augmentation and ensemble learning methods for CI problems.•We propose a general framework that evaluates 10 data augmentation and 10 ensemble learning methods for CI problems.•Our objective is to identify the most effective combination for improving classification performance on imbalanced datasets.•The results indicate that the combinations can significantly improve classification performance on imbalanced datasets.
AbstractList Class imbalance (CI) in classification problems arises when the number of observations belonging to one class is lower than the other. Ensemble learning combines multiple models to obtain a robust model and has been prominently used with data augmentation methods to address class imbalance problems. In the last decade, a number of strategies have been added to enhance ensemble learning and data augmentation methods, along with new methods such as generative adversarial networks (GANs). A combination of these has been applied in many studies, and the evaluation of different combinations would enable a better understanding and guidance for different application domains. In this paper, we present a computational study to evaluate data augmentation and ensemble learning methods used to address prominent benchmark CI problems. We present a general framework that evaluates 9 data augmentation and 9 ensemble learning methods for CI problems. Our objective is to identify the most effective combination for improving classification performance on imbalanced datasets. The results indicate that combinations of data augmentation methods with ensemble learning can significantly improve classification performance on imbalanced datasets. We find that traditional data augmentation methods such as the synthetic minority oversampling technique (SMOTE) and random oversampling (ROS) are not only better in performance for selected CI problems, but also computationally less expensive than GANs. Our study is vital for the development of novel models for handling imbalanced datasets. •Class imbalance (CI) in classification problems arises one class is lower than the other classes.•We present a computational review to evaluate data augmentation and ensemble learning methods for CI problems.•We propose a general framework that evaluates 10 data augmentation and 10 ensemble learning methods for CI problems.•Our objective is to identify the most effective combination for improving classification performance on imbalanced datasets.•The results indicate that the combinations can significantly improve classification performance on imbalanced datasets.
ArticleNumber 122778
Author Chaudhari, Omkar
Chandra, Rohitash
Khan, Azal Ahmad
Author_xml – sequence: 1
  givenname: Azal Ahmad
  orcidid: 0009-0000-9435-5328
  surname: Khan
  fullname: Khan, Azal Ahmad
  email: k.azal@iitg.ac.in
  organization: Department of Chemistry, Indian Institute of Technology Guwahati, Assam, India
– sequence: 2
  givenname: Omkar
  surname: Chaudhari
  fullname: Chaudhari, Omkar
  email: c.omkar@iitg.ac.in
  organization: Department of Chemistry, Indian Institute of Technology Guwahati, Assam, India
– sequence: 3
  givenname: Rohitash
  orcidid: 0000-0001-6353-1464
  surname: Chandra
  fullname: Chandra, Rohitash
  email: rohitash.chandra@unsw.edu.au
  organization: Transitional Artificial Intelligence Research Group, School of Mathematics and Statistics, University of New South Wales, Sydney, Australia
BookMark eNp9kMtKxDAUQIMoOI7-gKt8gK1J0zatuBkGXzDgRtchj5shQ5sMSWfEvR9uOyMuXLgKN9xzuJwLdOqDB4SuKckpofXtJof0IfOCFCynRcF5c4JmtOEsq3nLTtGMtBXPSsrLc3SR0oYQygnhM_S1wBH2Dj5wsBh8gl51gDuQ0Tu_xtIbbOQgsdyte_CDHFzwuA8GuoRtiFh3MiXseiU76TUYvI1hNPTpDi9Dr5w_EDfjxnb8_TVMXtjLbncYL9GZlV2Cq593jt4fH96Wz9nq9elluVhluqRkyCqmaFtbUxNVMaoaQpVmVjJTMl7wxlRSgQXgraq1YjUBW9lSWd3q1mhKFJuj5ujVMaQUwQrtjgcNUbpOUCKmmmIjpppiqimONUe0-INuo-tl_Pwfuj9CY60pchRJO5gyuQh6ECa4__Bvt_SUvA
CitedBy_id crossref_primary_10_1016_j_envsoft_2024_106072
crossref_primary_10_3390_life14111372
crossref_primary_10_1039_D4TA06452F
crossref_primary_10_2478_ijcss_2024_0007
crossref_primary_10_1080_01431161_2025_2465916
crossref_primary_10_1007_s11227_024_06108_7
crossref_primary_10_1145_3700791
crossref_primary_10_3390_cancers16193417
crossref_primary_10_1109_LGRS_2025_3541770
crossref_primary_10_1016_j_gsd_2024_101345
crossref_primary_10_2166_wpt_2024_264
crossref_primary_10_1007_s12243_025_01072_6
crossref_primary_10_1088_1402_4896_ad564c
crossref_primary_10_1186_s12911_024_02819_2
crossref_primary_10_1016_j_apradiso_2025_111714
crossref_primary_10_3390_cancers16234046
crossref_primary_10_1007_s10796_024_10576_w
crossref_primary_10_1016_j_csite_2025_105888
crossref_primary_10_1016_j_oceaneng_2025_120460
crossref_primary_10_1080_19475705_2024_2425732
crossref_primary_10_1016_j_knosys_2024_112761
crossref_primary_10_1016_j_eswa_2024_125595
crossref_primary_10_37394_23207_2024_21_162
crossref_primary_10_1109_ACCESS_2024_3457753
crossref_primary_10_3390_math12050701
crossref_primary_10_1016_j_eswa_2024_125945
crossref_primary_10_1177_00220345241311888
crossref_primary_10_1542_peds_2024_066675
crossref_primary_10_4236_ojapps_2024_142036
crossref_primary_10_1016_j_aei_2024_102606
crossref_primary_10_4236_ojsst_2025_151004
crossref_primary_10_1186_s13040_024_00397_7
crossref_primary_10_1016_j_jiph_2024_102541
crossref_primary_10_1016_j_compeleceng_2024_109863
crossref_primary_10_3390_electronics13030613
crossref_primary_10_3390_s25020543
crossref_primary_10_3390_jmse12122212
crossref_primary_10_1007_s11600_024_01466_5
crossref_primary_10_3390_analytics4010010
crossref_primary_10_1016_j_engappai_2024_109552
crossref_primary_10_1248_bpb_b24_00506
crossref_primary_10_3390_bdcc9010015
crossref_primary_10_1109_ACCESS_2024_3473028
crossref_primary_10_3390_su16188015
crossref_primary_10_2478_ebtj_2024_0020
crossref_primary_10_32604_cmes_2024_054766
crossref_primary_10_1177_08953996241296249
crossref_primary_10_1038_s41529_025_00573_y
crossref_primary_10_1016_j_jece_2025_115463
crossref_primary_10_1016_j_aei_2024_103079
crossref_primary_10_1016_j_aei_2024_102737
crossref_primary_10_3390_math12213423
crossref_primary_10_1093_ijlct_ctae292
crossref_primary_10_1186_s40249_025_01273_0
crossref_primary_10_1016_j_neucom_2025_129896
crossref_primary_10_3390_bioengineering11080770
crossref_primary_10_1016_j_compbiomed_2025_110008
crossref_primary_10_3390_ma17112549
crossref_primary_10_3390_math13050835
crossref_primary_10_2196_56022
crossref_primary_10_1109_ACCESS_2024_3411774
crossref_primary_10_1007_s12145_024_01631_w
crossref_primary_10_1016_j_jastp_2024_106338
crossref_primary_10_1111_risa_17708
crossref_primary_10_3390_atmos16020127
crossref_primary_10_1007_s42044_025_00240_0
crossref_primary_10_1038_s41598_025_90612_0
crossref_primary_10_3390_app15062977
crossref_primary_10_1038_s41598_024_80495_y
crossref_primary_10_1109_ACCESS_2025_3544625
crossref_primary_10_3390_electronics14040705
crossref_primary_10_1061_AJRUA6_RUENG_1480
crossref_primary_10_1016_j_cmpb_2025_108657
crossref_primary_10_1111_jsr_70044
crossref_primary_10_1109_ACCESS_2025_3528079
crossref_primary_10_3390_cancers17010121
crossref_primary_10_1038_s41598_025_91882_4
crossref_primary_10_1007_s10462_025_11107_y
crossref_primary_10_1016_j_agwat_2024_109147
crossref_primary_10_1016_j_jcsr_2025_109458
crossref_primary_10_1186_s13040_025_00440_1
crossref_primary_10_1080_10255842_2025_2475466
crossref_primary_10_1007_s41939_024_00551_y
crossref_primary_10_1080_14680629_2024_2374885
crossref_primary_10_14801_jkiit_2024_22_1_61
Cites_doi 10.1613/jair.953
10.4310/SII.2009.v2.n3.a8
10.1023/A:1010933404324
10.1016/j.patcog.2017.07.024
10.1186/s12911-022-01821-w
10.1016/j.pnucene.2017.07.015
10.1109/ACCESS.2022.3158977
10.1155/2019/3761203
10.1038/s41436-019-0439-8
10.1016/j.asoc.2018.09.029
10.1109/MCI.2014.2350953
10.1016/j.isprsjprs.2016.01.011
10.1002/bimj.200710415
10.1038/nbt1206-1565
10.1016/j.patrec.2016.10.006
10.1016/j.ecolmodel.2006.05.021
10.1023/A:1016409317640
10.1007/s00521-017-3128-z
10.1016/j.eswa.2022.118835
10.1007/s00500-018-3629-4
10.3390/info14070415
10.1016/j.jbi.2020.103465
10.1109/TNNLS.2017.2771290
10.1016/j.cose.2016.12.004
10.1016/j.ins.2017.05.008
10.1007/s10796-020-10031-6
10.1023/A:1007465907571
10.1016/j.neucom.2010.06.024
10.1002/cpt.2266
10.1371/journal.pone.0249338
10.1186/1472-6947-11-51
10.1109/ACCESS.2019.2949286
10.1016/j.aiopen.2021.08.002
10.1007/s12551-018-0449-9
10.1111/insr.12016
10.1109/TIT.1967.1053964
10.3390/su14148707
10.1016/j.physa.2018.10.060
10.1109/TNN.2006.880583
10.1016/j.watres.2021.117821
10.3233/IDA-2002-6504
10.15294/sji.v9i1.31648
10.1145/3485128
10.3390/forecast4010011
10.1111/j.1541-0420.2006.00578.x
10.22215/timreview/1282
10.1007/BF00116251
10.1109/TNNLS.2013.2274735
10.1016/j.neucom.2022.08.055
10.1016/j.ins.2019.06.007
10.1016/j.patcog.2014.11.014
10.1016/j.aap.2019.105405
10.1016/j.enbuild.2017.11.039
10.1109/ACCESS.2021.3111898
10.1007/s00180-015-0642-2
10.1186/s40537-021-00514-x
10.1016/j.envsoft.2023.105654
10.1016/j.neucom.2017.03.011
10.1002/(SICI)1097-4571(199401)45:1<12::AID-ASI2>3.0.CO;2-L
10.1002/sam.10061
10.1023/A:1022699900025
10.1186/s40537-022-00648-6
10.1016/j.knosys.2022.109902
10.1186/s40537-021-00460-8
10.1007/s10462-020-09896-5
10.1007/s10994-006-6226-1
10.3390/electronics11091495
10.1109/ACCESS.2019.2947359
10.1016/S0893-6080(05)80023-1
10.1016/j.acalib.2019.02.013
10.1109/JSEN.2021.3131166
10.1016/j.eswa.2022.118732
10.3389/fgene.2020.00820
10.3390/info12090374
10.1016/j.neucom.2023.126726
10.3390/s150715974
10.1155/2021/2565488
10.1613/jair.1.11192
10.1257/.41.2.478
10.1002/cpe.4281
10.1016/j.knosys.2018.05.037
10.1049/bme2.12031
10.1016/j.jclinepi.2021.01.010
10.1109/TSMCB.2007.914695
10.1007/s11023-020-09548-1
10.1109/TASE.2020.2998467
10.1109/TSMCC.2011.2161285
10.1016/j.neucom.2021.10.045
10.5121/ijdkp.2015.5201
10.1016/j.patcog.2019.02.023
10.1016/j.asoc.2019.105837
10.1142/S1469026813400014
10.1016/j.knosys.2023.110273
10.1109/ACCESS.2018.2789428
10.1016/j.eswa.2022.116624
10.1109/ACCESS.2021.3088999
10.1371/journal.pone.0067863
10.1016/j.compbiomed.2020.103735
10.1109/TNN.2004.836201
10.1016/j.aiopen.2022.03.001
10.1016/j.ins.2021.10.029
10.1016/j.neunet.2016.04.008
10.1186/s40537-019-0197-0
10.1016/j.neucom.2020.03.064
10.1016/j.cose.2022.102846
10.3390/rs14153547
10.1007/BF00116900
10.1109/ACCESS.2018.2890693
10.1017/S026988891300043X
10.1186/s40537-018-0151-6
10.1109/ACCESS.2019.2927266
10.1016/j.dss.2007.12.002
10.1016/j.imu.2019.100180
10.1016/j.ins.2018.06.056
10.1109/ACCESS.2022.3207287
10.1016/j.ins.2021.02.056
10.1016/j.knosys.2013.03.012
10.1016/j.knosys.2014.06.004
10.1016/S0031-3203(96)00142-2
10.1016/j.renene.2018.10.062
10.1145/3422622
10.1016/j.eswa.2010.08.028
10.1016/j.engappai.2007.07.001
10.1016/j.neucom.2021.04.112
10.3390/jpm13020373
10.1109/ACCESS.2019.2946980
10.1007/s00521-019-04378-4
10.1016/j.ins.2022.04.058
10.1016/j.ins.2020.08.068
10.1016/j.procs.2022.01.143
10.1007/s42979-021-00655-z
10.1016/j.patcog.2012.03.014
10.1007/BF00994018
10.1007/s10346-019-01286-5
10.1016/j.eswa.2016.12.035
10.1002/prot.21870
10.1109/TCBB.2021.3095482
10.1007/s100440050003
10.1109/LGRS.2018.2803259
10.3390/s19061476
10.3390/risks6020045
10.1016/j.measurement.2019.107377
10.1016/j.trc.2015.02.019
10.3390/a13010017
10.1109/TIP.2013.2277800
10.1186/s12911-020-01201-2
10.17694/bajece.679662
10.1016/j.asoc.2022.109588
10.1007/s11749-016-0481-7
10.1016/j.neucom.2014.03.075
10.1109/TCBB.2019.2911071
10.1016/j.neucom.2015.01.068
10.1016/j.cmet.2022.03.002
10.1007/s42979-021-00558-z
10.1214/aos/1079120128
10.1155/2017/1827016
10.1016/j.jhydrol.2009.06.005
10.1177/0962280220980484
10.1016/j.compbiomed.2022.105909
10.1109/MSP.2017.2765202
10.1016/j.knosys.2016.05.048
10.1016/j.ijforecast.2020.07.007
10.3233/JIFS-169526
10.3390/pr10040749
10.1198/10618600152418584
10.1038/nbt0908-1011
10.1186/s40537-019-0192-5
10.1177/15501329221106935
10.1109/TNNLS.2018.2878400
10.1016/j.frl.2018.12.032
10.1007/s10462-011-9272-4
10.1007/s11069-020-04409-7
10.3389/fnbot.2013.00021
10.1145/3178582
10.1007/s11280-012-0178-0
10.1016/j.watres.2020.115788
10.1109/TNNLS.2015.2461436
10.1016/j.future.2022.01.026
10.1016/j.artmed.2020.101935
10.1016/j.isprsjprs.2010.11.001
10.1109/ACCESS.2022.3163270
10.1002/sim.1228
10.1016/j.ecoinf.2020.101202
10.1109/TIE.2017.2726961
10.1137/140988826
10.2495/DATA050031
10.3844/jcssp.2014.1151.1155
10.1016/j.patrec.2020.05.035
10.3390/s22186766
10.1371/journal.pone.0254841
10.3390/app12147189
10.1016/j.eswa.2022.117233
10.1145/3544558
10.1109/MCI.2014.2307227
10.3390/sym10070250
10.3390/info12070266
10.1016/j.procs.2019.09.229
10.1016/j.engappai.2022.105151
10.1016/j.neucom.2014.07.064
10.1016/j.is.2015.02.006
10.1093/comjnl/bxab039
10.1007/s11704-019-8208-z
10.3390/electronics11172703
10.1016/j.procs.2017.05.365
10.1109/TSMCA.2010.2084081
10.1109/ACCESS.2020.3034015
10.1016/j.cmpb.2020.105568
10.1109/ACCESS.2022.3172432
10.1109/MCE.2020.3015439
10.3389/fnagi.2017.00329
10.1007/s10844-017-0446-7
ContentType Journal Article
Copyright 2023 The Author(s)
Copyright_xml – notice: 2023 The Author(s)
DBID 6I.
AAFTH
AAYXX
CITATION
DOI 10.1016/j.eswa.2023.122778
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1873-6793
ExternalDocumentID 10_1016_j_eswa_2023_122778
S0957417423032803
GroupedDBID --K
--M
.DC
.~1
0R~
13V
1B1
1RT
1~.
1~5
4.4
457
4G.
5GY
5VS
6I.
7-5
71M
8P~
9JN
9JO
AAAKF
AABNK
AACTN
AAEDT
AAEDW
AAFTH
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AARIN
AAXUO
AAYFN
ABBOA
ABFNM
ABMAC
ABMVD
ABMYL
ABUCO
ABYKQ
ACDAQ
ACGFS
ACHRH
ACNTT
ACRLP
ACZNC
ADBBV
ADEZE
ADTZH
AEBSH
AECPX
AEKER
AENEX
AFKWA
AFTJW
AGHFR
AGJBL
AGUBO
AGUMN
AGYEJ
AHHHB
AHJVU
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJOXV
AKRWK
ALEQD
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
APLSM
AXJTR
BJAXD
BKOJK
BLXMC
BNSAS
CS3
DU5
EBS
EFJIC
EFLBG
EO8
EO9
EP2
EP3
F5P
FDB
FIRID
FNPLU
FYGXN
G-Q
GBLVA
GBOLZ
HAMUX
IHE
J1W
JJJVA
KOM
M41
MO0
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
PQQKQ
Q38
RIG
ROL
RPZ
SDF
SDG
SDP
SDS
SES
SEW
SPC
SPCBC
SSB
SSD
SSL
SST
SSV
SSZ
T5K
TN5
~G-
29G
AAAKG
AAQXK
AATTM
AAXKI
AAYWO
AAYXX
ABJNI
ABKBG
ABWVN
ABXDB
ACNNM
ACRPL
ACVFH
ADCNI
ADJOM
ADMUD
ADNMO
AEIPS
AEUPX
AFJKZ
AFPUW
AFXIZ
AGCQF
AGQPQ
AGRNS
AIGII
AIIUN
AKBMS
AKYEP
ANKPU
APXCP
ASPBG
AVWKF
AZFZN
BNPGV
CITATION
EJD
FEDTE
FGOYB
G-2
HLZ
HVGLF
HZ~
LG9
LY1
LY7
R2-
SBC
SET
SSH
WUQ
XPP
ZMT
ID FETCH-LOGICAL-c410t-53b196fd60b531b801bc3fa3d437278d5abefee79b6cb360ef5f4bfc9c9dc10b3
IEDL.DBID .~1
ISSN 0957-4174
IngestDate Tue Jul 01 04:06:19 EDT 2025
Thu Apr 24 23:11:48 EDT 2025
Sat Mar 23 16:41:20 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords Data augmentation
Ensemble learning
Class imbalance
Machine learning
Language English
License This is an open access article under the CC BY-NC-ND license.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c410t-53b196fd60b531b801bc3fa3d437278d5abefee79b6cb360ef5f4bfc9c9dc10b3
ORCID 0009-0000-9435-5328
0000-0001-6353-1464
OpenAccessLink https://www.sciencedirect.com/science/article/pii/S0957417423032803
ParticipantIDs crossref_citationtrail_10_1016_j_eswa_2023_122778
crossref_primary_10_1016_j_eswa_2023_122778
elsevier_sciencedirect_doi_10_1016_j_eswa_2023_122778
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2024-06-15
PublicationDateYYYYMMDD 2024-06-15
PublicationDate_xml – month: 06
  year: 2024
  text: 2024-06-15
  day: 15
PublicationDecade 2020
PublicationTitle Expert systems with applications
PublicationYear 2024
Publisher Elsevier Ltd
Publisher_xml – name: Elsevier Ltd
References Padurariu, Breaban (b225) 2019; 159
Gao, Zhang, Liu, Wu (b95) 2020; 108
Poon, Granger (b235) 2003; 41
Jayapermana, Aradea, Kurniati (b136) 2022; 9
Ding, Xu, Tong, Liu (b66) 2022
Hoi, Sahoo, Lu, Zhao (b122) 2021; 459
Al-Azani, El-Alfy (b8) 2017; 109
Parsa, Movahedi, Taghipour, Derrible, Mohammadian (b229) 2020; 136
Sauber-Cole, Khoshgoftaar (b262) 2022; 9
Vitianingsih, A. V., Othman, Z., Baharin, S. S. K., Suraji, A., & Maukar, A. L. Application of the synthetic over-sampling method to increase the sensitivity of algorithm classification for class imbalance in small spatial datasets.
Li (b161) 2018; 7
Shahani, Kamran, Zheng, Liu, Guo (b265) 2021; 2021
Haixiang, Yijing, Shang, Mingyun, Yuanyue, Bing (b104) 2017; 73
Ruff, Vandermeulen, Goernitz, Deecke, Siddiqui, Binder (b255) 2018
Xu, Shen, Nie, Kou (b330) 2020; 107
Agarwal, Farid, Gu, He, Nagano, Li (b5) 2019
Devi, Kavitha (b62) 2017
Lango, Stefanowski (b156) 2018; 50
Parmar, Vaswani, Uszkoreit, Kaiser, Shazeer, Ku (b228) 2018
Wang, She, Ward (b316) 2021; 54
Islam, Hridi, Hossain, Narman (b131) 2020
Pedregosa, Varoquaux, Gramfort, Michel, Thirion, Grisel (b230) 2011; 12
Zuech, Hancock, Khoshgoftaar (b359) 2021; 8
Flach, Kull (b84) 2015; 28
De Bin (b59) 2016; 31
Li, Zheng, Wang, Cao, Guo, Fu (b170) 2021; 70
Nanni, Fantozzi, Lazzarini (b212) 2015; 158
Han, Wang, Mao (b108) 2005
Japkowicz, Stephen (b135) 2002; 6
Zhou, Lee, Selvam, Lee, Lin, Ren (b355) 2020
Chen, Xu, Ying, Chen, Feng, Fang (b52) 2019; 7
Breiman (b35) 2001; 45
Liu, Wu, Li, Li, Tan, Bai (b184) 2022; 22
Johnson, Khoshgoftaar (b140) 2019; 6
Loyola-Gonzalez (b189) 2019; 7
Walker, Jiang (b303) 2019; 45
Yu, Qu, Zhang, Lu, Qin, Liu (b337) 2021
Li, Zhang (b168) 2020; 32
Shin, Yoon, Kim, Kim, Go, Cha (b272) 2021; 61
Touzani, Granderson, Fernandes (b292) 2018; 158
Yen, Lee (b335) 2006
Pintelas, Livieris, Pintelas (b232) 2020; 13
Wang, Liu, Zio (b311) 2022
Kumar, Choudhary, Cho (b155) 2020
Tama, Rhee (b286) 2019; 31
He, Bai, Garcia, Li (b116) 2008
Dong, Yu, Cao, Shi, Ma (b68) 2020; 14
Hempstalk, Frank, Witten (b118) 2008
Belgiu, Drăguţ (b23) 2016; 114
Hido, Kashima, Takahashi (b119) 2009; 2
Pang, Chen, Peng, Ma, Zhao, Ji (b227) 2019
Luque, Carrasco, Martín, de Las Heras (b191) 2019; 91
Pérez-Ortiz, Gutiérrez, Tino, Hervás-Martínez (b231) 2015; 27
Belouch, hadaj (b24) 2017
Yanabe, Nishi, Hashimoto (b333) 2020
Abro, Taşci, Aybars (b3) 2020; 8
Bunkhumpornpat, Sinapiromsaran, Lursinsap (b38) 2011
Shin, Kim, Chung, Chung, Han, Cho (b271) 2020; 10
Wang, Shi, Lyu, Deng (b317) 2017
Wen, Sun, Yang, Song, Gao, Wang (b321) 2020
Caruana, Niculescu-Mizil (b42) 2006
Mishra, Patnaik, Bansal, Naidoo, Naik, Nayak (b204) 2021
Wang, Li, Wu, Hovy, Sun (b309) 2022
Wang, Minku, Yao (b313) 2013; 12
Bader-El-Den, Teitei, Perry (b14) 2018; 30
Alfaro, García, Gámez, Elizondo (b10) 2008; 45
Zhu, Xia, Jin, Yan, Cai, Yan (b357) 2018; 6
Han, Zhang, Ding, Gu, Liu, Huo (b109) 2021; 2
Rodriguez, Herraiz, Harrison, Dolado, Riquelme (b252) 2014
Liu, Wu, Zhou (b185) 2008; 39
Chandra, Bhagat, Maharana, Krivitsky (b44) 2021; 9
Quinlan (b241) 1986; 1
Ghourabi (b100) 2022; 10
Polikar (b234) 2012
Iwana, Uchida (b132) 2021; 16
Mediavilla-Relaño, Lázaro, Figueiras-Vidal (b199) 2022
Hasanin, Khoshgoftaar (b112) 2018
Cambria, White (b41) 2014; 9
Lin (b176) 2020
Grandini, Bagli, Visani (b103) 2020
Li, Liu (b164) 2018; 30
Bentéjac, Csörgő, Martínez-Muñoz (b27) 2021; 54
Ragni, Knill, Rath, Gales (b244) 2014
Nichols, Herbert Chan, Baker (b218) 2019; 11
Tang, Xia, Zhang, Long (b287) 2020
Geurts, Ernst, Wehenkel (b99) 2006; 63
Liu, Meric, Havulinna, Teo, Åberg, Ruuskanen (b181) 2022; 34
Calo, Efendiev, Galvis, Li (b40) 2016; 14
Goutte, Gaussier (b102) 2005
Vassallo, Vella, Ellul (b299) 2021; 2
Wang, Deng, Wang (b305) 2020; 136
Liang, Huang, Saratchandran, Sundararajan (b172) 2006; 17
Sun, Song, Zhu, Sun, Xu, Zhou (b281) 2015; 48
Le, Hoang Son, Vo, Lee, Baik (b157) 2018; 10
Fu, Lu, Zhang, Lu, Wang, Zhang (b89) 2023; 213
Hancock, Khoshgoftaar (b110) 2020
Liu, Dong, Zhao, Tian (b180) 2022; 10
Khan, Madden (b148) 2014; 29
Kingma, Ba (b151) 2014
Gao, Gao (b93) 2010; 73
Zhang, Zhan (b350) 2017
Liu, Zhang, Fan (b186) 2022; 195
Sherazi, Bae, Lee (b268) 2021; 16
Hothorn, Bühlmann, Kneib, Schmid, Hofner (b124) 2010; 11
Zhao, Li, Chen, Aihara (b352) 2008; 70
Ganaie, Hu, Malik, Tanveer, Suganthan (b91) 2022; 115
Nash, Sellers, Talbot, Cawthorn, Ford (b214) 1995
Shao, Geng, Liu, Dai, Yan, Yang (b266) 2021
Vasudevan (b300) 2014; 10
Wang, Chen, Zhang, Li (b304) 2022; 22
Chandra, Jain, Maharana, Krivitsky (b45) 2022; 10
Hajek, Munk (b106) 2023
Floridi, Chiriatti (b85) 2020; 30
Mirza, Lin (b202) 2016; 80
Yang, Liu, Zeng, Xie (b334) 2019; 133
Loh (b187) 2011; 1
Sun, He, Xu, Wu, He, Li (b279) 2022; 149
Ben-David, Kushilevitz, Mansour (b25) 1997; 29
Li, Chen, Tan, Li, Gu, Zhang (b162) 2021; 105
Mountrakis, Im, Ogole (b209) 2011; 66
Rocha, Goldenstein (b251) 2013; 25
Wu, Ye, Zhang, Ng, Ho (b326) 2014; 67
Cai, Wang, Jiang, Zhang, Peng (b39) 2022; 584
Xiwen, Xiaosong (b328) 2021
Zhou, Chawla, Jin, Williams (b354) 2014; 9
Ling, Sheng (b179) 2008
Tutz, Binder (b294) 2006; 62
Gao, Jin, Xia, Wu, Gu, Yan (b94) 2020; 11
Ma, Peng, Wang, Dong (b194) 2021
Minastireanu, Mesnita (b201) 2019; 2019
Rätsch, Onoda, Müller (b246) 1998; 11
Gandhi, Pandey (b92) 2015
Le, Oktian, Kim (b158) 2022; 14
Wu, Emerton, Duan, Wood, Wetterhall, Robertson (b325) 2020; 7
Suthaharan (b282) 2016
Ning, Zhao, Ma (b219) 2021; 19
Abdelgayed, Morsi, Sidhu (b2) 2017; 65
Domingo, Watanabe (b67) 2000
Galar, Fernandez, Barrenechea, Bustince, Herrera (b90) 2011; 42
Hu, Liang, Ma, He (b127) 2009
Lin, Tsai, Hu, Jhang (b177) 2017; 409
Dou, Yunus, Bui, Merghadi, Sahana, Zhu (b70) 2020; 17
Zhang, Ramezani, Naeim (b348) 2019
Johnson, Khoshgoftaar (b139) 2019
Lu, Wu, Tai, Tang (b190) 2018
Raffel, Shazeer, Roberts, Lee, Narang, Matena (b243) 2020; 21
Utgoff (b296) 1989; 4
Dua, Graff (b73) 2017
Moreo, Esuli, Sebastiani (b208) 2016
Dasarathy, Sánchez, Townsend (b58) 2000; 3
Devi, kr. Biswas, Purkayastha (b61) 2017; 93
Barua, Islam, Murase (b20) 2011
Bradley (b34) 1997; 30
Zhang, Krawczyk, Garcia, Rosales-Pérez, Herrera (b346) 2016; 106
Devlin, Chang, Lee, Toutanova (b63) 2018
Kim, Shin, Lee, Lee, Kang, Cho (b150) 2021; 207
Farajzadeh-Zanjani, Razavi-Far, Saif (b81) 2016
Ogunleye, Wang (b222) 2019; 17
Rolnick, Donti, Kaack, Kochanski, Lacoste, Sankaran (b254) 2022; 55
Kendall, Gal (b145) 2017; 30
Shilong (b269) 2021
Friedman (b88) 2001
Wei, Dunbrack (b319) 2013; 8
Yuan, Ma (b339) 2012
Re, M., & Valentini, G. 1 ensemble methods: a review 3 (1).
Li, Stasinakis, Yeo (b165) 2022; 4
Van Dyk, Meng (b298) 2001; 10
Akbani, Kwek, Japkowicz (b7) 2004
Seliya, Abdollah Zadeh, Khoshgoftaar (b263) 2021; 8
Bénard, Biau, Da Veiga, Scornet (b26) 2021
Natekin, Knoll (b215) 2013; 7
Wolpert (b324) 1992; 5
Acheampong, Nunoo-Mensah, Chen (b4) 2021
Kannapiran, Sindha (b142) 2023; 30
Liu, Wang, Zhang, Chen, Xiang (b183) 2017; 69
Mohammed, Rawashdeh, Abdullah (b205) 2020
Prachuabsupakij, Soonthornphisaj (b236) 2012
Bi, Zhang (b28) 2018; 158
Sarica, Cerasa, Quattrone (b261) 2017; 9
Chang, Chang, Wu (b47) 2018; 73
Li, Wang, Sung (b166) 2008; 21
He, Thiesson (b117) 2007
Wang, Yeung (b318) 2020; 53
Liu, Meric, Havulinna, Teo, Ruuskanen, Sanders (b182) 2020
Gaye, Zhang, Wulamu (b96) 2021; 12
Oono, Suzuki (b223) 2020; 33
Widmer, Kubat (b323) 1996; 23
Demirkıran, Çayır, Ünal, Dağ (b60) 2022; 121
Ekpo, Takyi, Gyening (b75) 2022
Khan, Madden (b147) 2010
Nanni, Franco (b213) 2011; 38
Freund, Schapire (b87) 1996
Sun, Liu, Sima (b280) 2020; 32
Zhou (b353) 2012
Van Calster, Van Belle, Condous, Bourne, Timmerman, Van Huffel (b297) 2008
Zenko, Todorovski, Dzeroski (b342) 2001
Ho (b120) 1995
Chen, Guestrin (b49) 2016
Bartlett, Traskin (b19) 2006; 19
Ge, Gu, Chang, Cai (b97) 2020
Wang, Minku, Yao (b314) 2016
Zhang, Karaman, Chang (b345) 2019
Yu, Xia, Fei, Lu (b338) 2021; 10
Syarif, Zaluska, Prugel-Bennett, Wills (b283) 2012
Ding, Chen, Dong, Fu, Cui (b65) 2022; 131
Tsymbal (b293) 2004
Zeng, Yang, Zhang, Wu, Zhang, Dai (b341) 2019; 2019
Zhu, Lin, Liu (b356) 2017; 72
Liao, Hu, Yang, Rosenhahn (b175) 2022
Chamseddine, Mansouri, Soui, Abed (b43) 2022; 129
Biau, Scornet (b29) 2016; 25
Dai, Liu, Yang (b57) 2022; 257
Hofner, Mayr, Schmid (b121) 2014
Chawla, Bowyer, Hall, Kegelmeyer (b48) 2002; 16
Hatwell, Gaber, Atif Azad (b115) 2020; 20
Feng, Gangal, Wei, Chandar, Vosoughi, Mitamura (b82) 2021
Tax (b288) 2002
Tahir, Kittler, Yan (b285) 2012; 45
Kamalov, Moussa, Avante Reyes (b141) 2022; 11
Fernández, Garcia, Herrera, Chawla (b83) 2018; 61
Kapoor, Negi, Marshall, Chandra (b143) 2023
Kingsford, Salzberg (b152) 2008; 26
Mirza, Lin, Liu (b203) 2015; 149
Cortes, Vapnik (b54) 1995; 20
Japkowicz (b134) 2000
Wang, Li, Jiang, Lu, Liu, Jian (b308) 2020
Markoski, Ivanković, Ratgeber, Pecev, Glušac (b198) 2015; 12
Błaszczyński, Stefanowski, Idkowiak (b31) 2013
Alam, Rahman, Rahman (b9) 2019; 15
Badirli, Liu, Xing, Bhowmik, Doan, Keerthi (b15) 2020
Solomatine, Shrestha (b278) 2004
Noble (b220) 2006; 24
Shobana, Umamaheswari (b273) 2021
Faraggi, Reiser (b80) 2002; 21
Timofeev (b289) 2004
Mienye, Sun (b200) 2022; 10
Naik, Mohan (b211) 2021; 9
Agrawal, Mamidi (b6) 2022
Khoshgoftaar, Van Hulse, Napolitano (b149) 2011; 41
Hu, Hu, Maybank (b126) 2008; 38
Hu, Chen, Zhang (b125) 2019
Mao, Liu, Ding, Li (b197) 2019; 7
Puri, Kumar Gupta (b238) 2022; 65
Siers, Islam (b275) 2015; 51
Džeroski, Ženko (b74) 2002
Hossin, Sulaiman (b123) 2015; 5
Dorogush, Ershov, Gulin (b69) 2018
Han, Hayashi, Rundo, Araki, Shimoda, Muramatsu (b107) 2018
Zhang, Zhang, Yang (b351) 2013; 22
Anaby-Tavor, Carmeli, Goldbraich, Kantor, Kour, Shlomov (b12) 2020
Błaszczyński, Stefanowski (b30) 2015; 150
Creswell, White, Dumoulin, Arulkumaran, Sengupta, Bharath (b56) 2018; 35
Emu, Jahin, Akter, Patwary, Akter (b76) 2022
Goodfellow, Pouget-Abadie, Mirza, Xu, Warde-Farley, Ozair (b101) 2020; 63
Douzas, Bacao, Last (b72) 2018; 465
Quinto (b242) 2020
Xu, Coco, Neale (b329) 2020; 177
Fonseca, Douzas, Bacao (b86) 2021; 12
Ranjan, Castillo, Chellappa (b245) 2017
Zhang, Li, Jia, Ma, Luo, Li (b347) 2020; 152
Hajek, Abedin, Sivarajah (b105) 2022
Huang (b129) 2020
Buckland, Gey (b37) 1994; 45
Fan, Stolfo, Zhang (b79) 1999
Wang, Li, Zhao (b310) 2022; 602
Loh (b188) 2014; 82
Qin, Liu, Wang, Liu, Deng, Ma (b239
Barua (10.1016/j.eswa.2023.122778_b20) 2011
Minastireanu (10.1016/j.eswa.2023.122778_b201) 2019; 2019
Sharma (10.1016/j.eswa.2023.122778_b267) 2022
Bai (10.1016/j.eswa.2023.122778_b17) 2023; 558
Wang (10.1016/j.eswa.2023.122778_b317) 2017
Wang (10.1016/j.eswa.2023.122778_b311) 2022
Xu (10.1016/j.eswa.2023.122778_b332) 2019; 32
Lin (10.1016/j.eswa.2023.122778_b176) 2020
Liu (10.1016/j.eswa.2023.122778_b186) 2022; 195
Rätsch (10.1016/j.eswa.2023.122778_b246) 1998; 11
Qin (10.1016/j.eswa.2023.122778_b239) 2021; 133
Lu (10.1016/j.eswa.2023.122778_b190) 2018
Zhang (10.1016/j.eswa.2023.122778_b347) 2020; 152
Haixiang (10.1016/j.eswa.2023.122778_b104) 2017; 73
Fu (10.1016/j.eswa.2023.122778_b89) 2023; 213
Zhang (10.1016/j.eswa.2023.122778_b345) 2019
Kim (10.1016/j.eswa.2023.122778_b150) 2021; 207
Dorogush (10.1016/j.eswa.2023.122778_b69) 2018
Ofek (10.1016/j.eswa.2023.122778_b221) 2017; 243
Resende (10.1016/j.eswa.2023.122778_b249) 2018; 51
Liao (10.1016/j.eswa.2023.122778_b174) 2021
Kingma (10.1016/j.eswa.2023.122778_b151) 2014
Prachuabsupakij (10.1016/j.eswa.2023.122778_b236) 2012
Ben-David (10.1016/j.eswa.2023.122778_b25) 1997; 29
Dietterich (10.1016/j.eswa.2023.122778_b64) 2002
Cloke (10.1016/j.eswa.2023.122778_b53) 2009; 375
Devlin (10.1016/j.eswa.2023.122778_b63) 2018
Quinlan (10.1016/j.eswa.2023.122778_b241) 1986; 1
Abro (10.1016/j.eswa.2023.122778_b3) 2020; 8
Bahlmann (10.1016/j.eswa.2023.122778_b16) 2002
Li (10.1016/j.eswa.2023.122778_b165) 2022; 4
Semanjski (10.1016/j.eswa.2023.122778_b264) 2015; 15
Dai (10.1016/j.eswa.2023.122778_b57) 2022; 257
Ranjan (10.1016/j.eswa.2023.122778_b245) 2017
Chamseddine (10.1016/j.eswa.2023.122778_b43) 2022; 129
Puri (10.1016/j.eswa.2023.122778_b238) 2022; 65
Wang (10.1016/j.eswa.2023.122778_b309) 2022
Chen (10.1016/j.eswa.2023.122778_b49) 2016
Nanni (10.1016/j.eswa.2023.122778_b212) 2015; 158
Li (10.1016/j.eswa.2023.122778_b169) 2019; 21
Agrawal (10.1016/j.eswa.2023.122778_b6) 2022
Friedman (10.1016/j.eswa.2023.122778_b88) 2001
Chawla (10.1016/j.eswa.2023.122778_b48) 2002; 16
Kumar (10.1016/j.eswa.2023.122778_b154) 2019; 23
Loh (10.1016/j.eswa.2023.122778_b188) 2014; 82
Flach (10.1016/j.eswa.2023.122778_b84) 2015; 28
Wei (10.1016/j.eswa.2023.122778_b320) 2013; 16
Rocha (10.1016/j.eswa.2023.122778_b251) 2013; 25
Markoski (10.1016/j.eswa.2023.122778_b198) 2015; 12
Japkowicz (10.1016/j.eswa.2023.122778_b135) 2002; 6
Bria (10.1016/j.eswa.2023.122778_b36) 2020; 120
Pintelas (10.1016/j.eswa.2023.122778_b232) 2020; 13
Wen (10.1016/j.eswa.2023.122778_b321) 2020
Bobadilla (10.1016/j.eswa.2023.122778_b32) 2013; 46
Ezzat (10.1016/j.eswa.2023.122778_b78) 2016; 17
Mishra (10.1016/j.eswa.2023.122778_b204) 2021
Galar (10.1016/j.eswa.2023.122778_b90) 2011; 42
Kamalov (10.1016/j.eswa.2023.122778_b141) 2022; 11
Pérez-Ortiz (10.1016/j.eswa.2023.122778_b231) 2015; 27
Tang (10.1016/j.eswa.2023.122778_b287) 2020
Hothorn (10.1016/j.eswa.2023.122778_b124) 2010; 11
Sherazi (10.1016/j.eswa.2023.122778_b268) 2021; 16
Zhang (10.1016/j.eswa.2023.122778_b344) 2022; 18
Chen (10.1016/j.eswa.2023.122778_b50) 2015
Hu (10.1016/j.eswa.2023.122778_b127) 2009
Zeng (10.1016/j.eswa.2023.122778_b341) 2019; 2019
Liu (10.1016/j.eswa.2023.122778_b182) 2020
Natras (10.1016/j.eswa.2023.122778_b216) 2022; 14
Rayhan (10.1016/j.eswa.2023.122778_b247) 2017
Solomatine (10.1016/j.eswa.2023.122778_b278) 2004
Alam (10.1016/j.eswa.2023.122778_b9) 2019; 15
Ghourabi (10.1016/j.eswa.2023.122778_b100) 2022; 10
Li (10.1016/j.eswa.2023.122778_b166) 2008; 21
Banga (10.1016/j.eswa.2023.122778_b18) 2021
Yu (10.1016/j.eswa.2023.122778_b338) 2021; 10
Devi (10.1016/j.eswa.2023.122778_b61) 2017; 93
Hajek (10.1016/j.eswa.2023.122778_b106) 2023
Zhuang (10.1016/j.eswa.2023.122778_b358) 2018; 12
Zhang (10.1016/j.eswa.2023.122778_b343) 2015; 58
Mohammed (10.1016/j.eswa.2023.122778_b205) 2020
Parmar (10.1016/j.eswa.2023.122778_b228) 2018
Liang (10.1016/j.eswa.2023.122778_b173) 2019
Jiang (10.1016/j.eswa.2023.122778_b138) 2021; 18
Salcedo-Sanz (10.1016/j.eswa.2023.122778_b259) 2014; 4
Siers (10.1016/j.eswa.2023.122778_b275) 2015; 51
Cover (10.1016/j.eswa.2023.122778_b55) 1967; 13
Shin (10.1016/j.eswa.2023.122778_b271) 2020; 10
Kannapiran (10.1016/j.eswa.2023.122778_b142) 2023; 30
Saeed (10.1016/j.eswa.2023.122778_b258) 2023
Vassallo (10.1016/j.eswa.2023.122778_b299) 2021; 2
Bénard (10.1016/j.eswa.2023.122778_b26) 2021
Mirza (10.1016/j.eswa.2023.122778_b203) 2015; 149
Li (10.1016/j.eswa.2023.122778_b167) 2020
Faraggi (10.1016/j.eswa.2023.122778_b80) 2002; 21
Dasarathy (10.1016/j.eswa.2023.122778_b58) 2000; 3
Gaye (10.1016/j.eswa.2023.122778_b96) 2021; 12
Bee (10.1016/j.eswa.2023.122778_b22) 2018; 6
Kingsford (10.1016/j.eswa.2023.122778_b152) 2008; 26
Džeroski (10.1016/j.eswa.2023.122778_b74) 2002
Tama (10.1016/j.eswa.2023.122778_b286) 2019; 31
Utgoff (10.1016/j.eswa.2023.122778_b296) 1989; 4
Goodfellow (10.1016/j.eswa.2023.122778_b101) 2020; 63
He (10.1016/j.eswa.2023.122778_b116) 2008
Yang (10.1016/j.eswa.2023.122778_b334) 2019; 133
Yuan (10.1016/j.eswa.2023.122778_b339) 2012
Hancock (10.1016/j.eswa.2023.122778_b110) 2020
Ning (10.1016/j.eswa.2023.122778_b219) 2021; 19
Bradley (10.1016/j.eswa.2023.122778_b34) 1997; 30
Wang (10.1016/j.eswa.2023.122778_b312) 2017; 2017
Li (10.1016/j.eswa.2023.122778_b161) 2018; 7
Agarwal (10.1016/j.eswa.2023.122778_b5) 2019
Sanchez (10.1016/j.eswa.2023.122778_b260) 2018; 34
Ma (10.1016/j.eswa.2023.122778_b193) 2022
Abd Al Rahman (10.1016/j.eswa.2023.122778_b1) 2022; 210
Huang (10.1016/j.eswa.2023.122778_b129) 2020
Li (10.1016/j.eswa.2023.122778_b164) 2018; 30
Lango (10.1016/j.eswa.2023.122778_b156) 2018; 50
Liu (10.1016/j.eswa.2023.122778_b184) 2022; 22
Natekin (10.1016/j.eswa.2023.122778_b215) 2013; 7
Van Dyk (10.1016/j.eswa.2023.122778_b298) 2001; 10
Błaszczyński (10.1016/j.eswa.2023.122778_b30) 2015; 150
Ling (10.1016/j.eswa.2023.122778_b179) 2008
Xiao (10.1016/j.eswa.2023.122778_b327) 2019; 517
Polikar (10.1016/j.eswa.2023.122778_b234) 2012
Ge (10.1016/j.eswa.2023.122778_b97) 2020
Buckland (10.1016/j.eswa.2023.122778_b37) 1994; 45
Lyashevska (10.1016/j.eswa.2023.122778_b192) 2021; 30
Podgorelec (10.1016/j.eswa.2023.122778_b233) 2002; 26
Sun (10.1016/j.eswa.2023.122778_b280) 2020; 32
Ma (10.1016/j.eswa.2023.122778_b194) 2021
Prusty (10.1016/j.eswa.2023.122778_b237) 2017; 100
Nash (10.1016/j.eswa.2023.122778_b214) 1995
Zhang (10.1016/j.eswa.2023.122778_b350) 2017
10.1016/j.eswa.2023.122778_b301
Liu (10.1016/j.eswa.2023.122778_b185) 2008; 39
Zhu (10.1016/j.eswa.2023.122778_b356) 2017; 72
He (10.1016/j.eswa.2023.122778_b117) 2007
Ekpo (10.1016/j.eswa.2023.122778_b75) 2022
Le (10.1016/j.eswa.2023.122778_b157) 2018; 10
Gao (10.1016/j.eswa.2023.122778_b93) 2010; 73
Wang (10.1016/j.eswa.2023.122778_b315) 2018; 29
Létinier (10.1016/j.eswa.2023.122778_b160) 2021; 110
Chen (10.1016/j.eswa.2023.122778_b51) 2021
Li (10.1016/j.eswa.2023.122778_b168) 2020; 32
Ganaie (10.1016/j.eswa.2023.122778_b91) 2022; 115
Qin (10.1016/j.eswa.2023.122778_b240) 2020; 195
Gao (10.1016/j.eswa.2023.122778_b95) 2020; 108
Ho (10.1016/j.eswa.2023.122778_b120) 1995
Islam (10.1016/j.eswa.2023.122778_b131) 2020
Lin (10.1016/j.eswa.2023.122778_b178) 2017; 409
Bayer (10.1016/j.eswa.2023.122778_b21) 2022; 55
Han (10.1016/j.eswa.2023.122778_b109) 2021; 2
Hido (10.1016/j.eswa.2023.122778_b119) 2009; 2
Chang (10.1016/j.eswa.2023.122778_b47) 2018; 73
Xu (10.1016/j.eswa.2023.122778_b331) 2021; 572
Wang (10.1016/j.eswa.2023.122778_b305) 2020; 136
Douzas (10.1016/j.eswa.2023.122778_b72) 2018; 465
Zhao (10.1016/j.eswa.2023.122778_b352) 2008; 70
Oza (10.1016/j.eswa.2023.122778_b224) 2004
Akbani (10.1016/j.eswa.2023.122778_b7) 2004
Mienye (10.1016/j.eswa.2023.122778_b200) 2022; 10
Bunkhumpornpat (10.1016/j.eswa.2023.122778_b38) 2011
Chandra (10.1016/j.eswa.2023.122778_b45) 2022; 10
Liao (10.1016/j.eswa.2023.122778_b175) 2022
Mirza (10.1016/j.eswa.2023.122778_b202) 2016; 80
Sarica (10.1016/j.eswa.2023.122778_b261) 2017; 9
Fernández (10.1016/j.eswa.2023.122778_b83) 2018; 61
Padurariu (10.1016/j.eswa.2023.122778_b225) 2019; 159
Pan (10.1016/j.eswa.2023.122778_b226) 2018
Yoon (10.1016/j.eswa.2023.122778_b336) 2023; 13
Bader-El-Den (10.1016/j.eswa.2023.122778_b14) 2018; 30
Khan (10.1016/j.eswa.2023.122778_b148) 2014; 29
Li (10.1016/j.eswa.2023.122778_b170) 2021; 70
Liu (10.1016/j.eswa.2023.122778_b183) 2017; 69
10.1016/j.eswa.2023.122778_b248
Lin (10.1016/j.eswa.2023.122778_b177) 2017; 409
Moreo (10.1016/j.eswa.2023.122778_b208) 2016
Goutte (10.1016/j.eswa.2023.122778_b102) 2005
Mushava (10.1016/j.eswa.2023.122778_b210) 2022; 202
Yu (10.1016/j.eswa.2023.122778_b337) 2021
Wu (10.1016/j.eswa.2023.122778_b326) 2014; 67
Hastie (10.1016/j.eswa.2023.122778_b114) 2009; 2
Khan (10.1016/j.eswa.2023.122778_b147) 2010
Wang (10.1016/j.eswa.2023.122778_b314) 2016
Hofner (10.1016/j.eswa.2023.122778_b121) 2014
Grandini (10.1016/j.eswa.2023.122778_b103) 2020
Vasudevan (10.1016/j.eswa.2023.122778_b300) 2014; 10
Walach (10.1016/j.eswa.2023.122778_b302) 2016
Błaszczyński (10.1016/j.eswa.2023.122778_b31) 2013
Ding (10.1016/j.eswa.2023.122778_b66) 2022
Liu (10.1016/j.eswa.2023.122778_b181) 2022; 34
Shorten (10.1016/j.eswa.2023.122778_b274) 2019; 6
Geurts (10.1016/j.eswa.2023.122778_b99) 2006; 63
Ke (10.1016/j.eswa.2023.122778_b144) 2017; 30
Zhang (10.1016/j.eswa.2023.122778_b348) 2019
Creswell (10.1016/j.eswa.2023.122778_b56) 2018; 35
Li (10.1016/j.eswa.2023.122778_b163) 2022
Oono (10.1016/j.eswa.2023.122778_b223) 2020; 33
Emu (10.1016/j.eswa.2023.122778_b76) 2022
Parsa (10.1016/j.eswa.2023.122778_b229) 2020; 136
Westerlund (10.1016/j.eswa.2023.122778_b322) 2019; 9
Rodriguez (10.1016/j.eswa.2023.122778_b252) 2014
Belouch (10.1016/j.eswa.2023.122778_b24) 2017
Japkowicz (10.1016/j.eswa.2023.122778_b134) 2000
Wang (10.1016/j.eswa.2023.122778_b304) 2022; 22
Xu (10.1016/j.eswa.2023.122778_b329) 2020; 177
Demirkıran (10.1016/j.eswa.2023.122778_b60) 2022; 121
Douzas (10.1016/j.eswa.2023.122778_b71) 2019; 501
Espíndola (10.1016/j.eswa.2023.122778_b77) 2005; 35
Wang (10.1016/j.eswa.2023.122778_b313) 2013; 12
Cai (10.1016/j.eswa.2023.122778_b39) 2022; 584
Huang (10.1016/j.eswa.2023.122778_b130) 2018; 15
Hasanin (10.1016/j.eswa.2023.122778_b112) 2018
Wang (10.1016/j.eswa.2023.122778_b310) 2022; 602
Shao (10.1
References_xml – volume: 2017
  year: 2017
  ident: b312
  article-title: A novel ensemble method for imbalanced data learning: bagging of extrapolation-SMOTE SVM
  publication-title: Computational Intelligence and Neuroscience
– volume: 45
  start-page: 12
  year: 1994
  end-page: 19
  ident: b37
  article-title: The relationship between recall and precision
  publication-title: Journal of the American Society for Information Science
– volume: 149
  year: 2022
  ident: b279
  article-title: Multi-label classification of fundus images with graph convolutional network and LightGBM
  publication-title: Computers in Biology and Medicine
– year: 2023
  ident: b143
  article-title: Cyclone trajectory and intensity prediction with uncertainty quantification using variational recurrent neural networks
  publication-title: Environmental Modelling & Software
– volume: 7
  start-page: 149890
  year: 2019
  end-page: 149899
  ident: b307
  article-title: Feature learning viewpoint of AdaBoost and a new algorithm
  publication-title: IEEE Access
– start-page: 150
  year: 2019
  end-page: 153
  ident: b173
  article-title: Product marketing prediction based on XGboost and LightGBM algorithm
  publication-title: Proceedings of the 2nd international conference on artificial intelligence and pattern recognition
– start-page: 31
  year: 2004
  end-page: 40
  ident: b224
  article-title: Aveboost2: Boosting for noisy data
  publication-title: International workshop on multiple classifier systems
– volume: 14
  start-page: 8707
  year: 2022
  ident: b158
  article-title: XGBoost for imbalanced multiclass classification-based industrial internet of things intrusion detection systems
  publication-title: Sustainability
– start-page: 38
  year: 2019
  ident: b5
  article-title: Protecting world leaders against deep fakes
  publication-title: CVPR workshops, Vol. 1
– year: 2017
  ident: b245
  article-title: L2-constrained softmax loss for discriminative face verification
– volume: 9
  start-page: 98
  year: 2022
  ident: b262
  article-title: The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey
  publication-title: Journal of Big Data
– start-page: 201
  year: 2002
  end-page: 211
  ident: b74
  article-title: Stacking with multi-response model trees
  publication-title: International workshop on multiple classifier systems
– year: 2021
  ident: b204
  article-title: DTCDWT-SMOTE-XGBoost-based islanding detection for distributed generation systems: An approach of class-imbalanced issue
  publication-title: IEEE Systems Journal
– volume: 572
  start-page: 574
  year: 2021
  end-page: 589
  ident: b331
  article-title: A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data
  publication-title: Information Sciences
– volume: 199
  start-page: 1128
  year: 2022
  end-page: 1135
  ident: b306
  article-title: Research on personal credit risk evaluation based on XGBoost
  publication-title: Procedia Computer Science
– volume: 37
  start-page: 587
  year: 2021
  end-page: 603
  ident: b33
  article-title: Kaggle forecasting competitions: An overlooked learning opportunity
  publication-title: International Journal of Forecasting
– volume: 12
  start-page: 7189
  year: 2022
  ident: b11
  article-title: Toward an efficient automatic self-augmentation labeling tool for intrusion detection based on a semi-supervised approach
  publication-title: Applied Sciences
– volume: 32
  start-page: 1971
  year: 2020
  end-page: 1979
  ident: b168
  article-title: Research on orthopedic auxiliary classification and prediction model based on XGBoost algorithm
  publication-title: Neural Computing and Applications
– volume: 63
  start-page: 139
  year: 2020
  end-page: 144
  ident: b101
  article-title: Generative adversarial networks
  publication-title: Communications of the ACM
– year: 2020
  ident: b103
  article-title: Metrics for multi-class classification: an overview
– volume: 11
  start-page: 2703
  year: 2022
  ident: b141
  article-title: KDE-based ensemble learning for imbalanced data
  publication-title: Electronics
– volume: 41
  start-page: 478
  year: 2003
  end-page: 539
  ident: b235
  article-title: Forecasting volatility in financial markets: A review
  publication-title: Journal of Economic Literature
– volume: 67
  start-page: 105
  year: 2014
  end-page: 116
  ident: b326
  article-title: ForesTexter: An efficient random forest algorithm for imbalanced text categorization
  publication-title: Knowledge-Based Systems
– volume: 45
  start-page: 110
  year: 2008
  end-page: 122
  ident: b10
  article-title: Bankruptcy forecasting: An empirical comparison of AdaBoost and neural networks
  publication-title: Decision Support Systems
– volume: 16
  start-page: 321
  year: 2002
  end-page: 357
  ident: b48
  article-title: SMOTE: synthetic minority over-sampling technique
  publication-title: Journal of Artificial Intelligence Research
– start-page: 49
  year: 2002
  end-page: 54
  ident: b16
  article-title: Online handwriting recognition with support vector machines-a kernel approach
  publication-title: Proceedings eighth international workshop on frontiers in handwriting recognition
– volume: 15
  start-page: 607
  year: 2018
  end-page: 611
  ident: b98
  article-title: Very high resolution object-based land use–land cover urban classification using extreme gradient boosting
  publication-title: IEEE Geoscience and Remote Sensing Letters
– start-page: 111
  year: 2000
  end-page: 117
  ident: b134
  article-title: The class imbalance problem: Significance and strategies
  publication-title: Proc. of the int’l conf. on artificial intelligence, Vol. 56
– volume: 66
  start-page: 247
  year: 2011
  end-page: 259
  ident: b209
  article-title: Support vector machines in remote sensing: A review
  publication-title: ISPRS Journal of Photogrammetry and Remote Sensing
– volume: 136
  start-page: 190
  year: 2020
  end-page: 197
  ident: b305
  article-title: Imbalance-XGBoost: leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost
  publication-title: Pattern Recognition Letters
– start-page: 352
  year: 2022
  end-page: 356
  ident: b6
  article-title: LastResort at SemEval-2022 task 4: Towards patronizing and condescending language detection using pre-trained transformer based models ensembles
  publication-title: Proceedings of the 16th international workshop on semantic evaluation (SemEval-2022)
– start-page: 110
  year: 2002
  end-page: 125
  ident: b64
  article-title: Ensemble learning
  publication-title: The handbook of brain theory and neural networks, Vol. 2
– start-page: 937
  year: 2021
  end-page: 945
  ident: b26
  article-title: Interpretable random forests via rule extraction
  publication-title: International conference on artificial intelligence and statistics
– volume: 55
  start-page: 1
  year: 2022
  end-page: 39
  ident: b21
  article-title: A survey on data augmentation for text classification
  publication-title: ACM Computing Surveys
– volume: 8
  start-page: 1
  year: 2021
  end-page: 31
  ident: b263
  article-title: A literature review on one-class classification and its potential applications in big data
  publication-title: Journal of Big Data
– volume: 29
  start-page: 345
  year: 2014
  end-page: 374
  ident: b148
  article-title: One-class classification: taxonomy of study and review of techniques
  publication-title: The Knowledge Engineering Review
– volume: 30
  year: 2018
  ident: b164
  article-title: A comparative study of the class imbalance problem in Twitter spam detection
  publication-title: Concurrency and Computation: Practice and Experience
– volume: 16
  start-page: 114
  year: 2005
  end-page: 131
  ident: b270
  article-title: Incremental training of support vector machines
  publication-title: IEEE Transactions on Neural Networks
– volume: 6
  start-page: 45
  year: 2018
  ident: b22
  article-title: Estimating and forecasting conditional risk measures with extreme value theory: a review
  publication-title: Risks
– volume: 12
  start-page: 2825
  year: 2011
  end-page: 2830
  ident: b230
  article-title: Scikit-learn: Machine learning in Python
  publication-title: Journal of Machine Learning Research
– volume: 17
  start-page: 267
  year: 2016
  end-page: 276
  ident: b78
  article-title: Drug-target interaction prediction via class imbalance-aware ensemble learning
  publication-title: BMC Bioinformatics
– volume: 42
  start-page: 463
  year: 2011
  end-page: 484
  ident: b90
  article-title: A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)
– start-page: 180
  year: 2000
  end-page: 189
  ident: b67
  article-title: MadaBoost: A modification of AdaBoost
  publication-title: COLT
– start-page: 1163
  year: 2004
  end-page: 1168
  ident: b278
  article-title: AdaBoost. RT: a boosting algorithm for regression problems
  publication-title: 2004 IEEE international joint conference on neural networks (IEEE Cat. No. 04CH37541), Vol. 2
– volume: 2
  start-page: 1
  year: 2021
  end-page: 15
  ident: b299
  article-title: Application of gradient boosting algorithms for anti-money laundering in cryptocurrencies
  publication-title: SN Computer Science
– volume: 10
  start-page: 42
  year: 2020
  end-page: 48
  ident: b271
  article-title: Emergency department return prediction system using blood samples with LightGBM for smart health care services
  publication-title: IEEE Consumer Electronics Magazine
– volume: 136
  year: 2020
  ident: b229
  article-title: Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis
  publication-title: Accident Analysis and Prevention
– volume: 8
  start-page: 181
  year: 2020
  end-page: 185
  ident: b3
  article-title: A stacking-based ensemble learning method for outlier detection
  publication-title: Balkan Journal of Electrical and Computer Engineering
– volume: 107
  year: 2020
  ident: b330
  article-title: A hybrid sampling algorithm combining M-SMOTE and ENN based on Random forest for medical imbalanced data
  publication-title: Journal of Biomedical Informatics
– volume: 17
  start-page: 2131
  year: 2019
  end-page: 2140
  ident: b222
  article-title: XGBoost model for chronic kidney disease diagnosis
  publication-title: IEEE/ACM Transactions on Computational Biology and Bioinformatics
– year: 2022
  ident: b309
  article-title: Pre-trained language models and their applications
  publication-title: Engineering
– volume: 9
  start-page: 86230
  year: 2021
  end-page: 86242
  ident: b211
  article-title: Novel stock crisis prediction technique—a study on indian stock market
  publication-title: IEEE Access
– start-page: 952
  year: 2020
  end-page: 958
  ident: b333
  article-title: Anomaly detection based on histogram methodology and factor analysis using LightGBM for cooling systems
  publication-title: 2020 25th IEEE international conference on emerging technologies and factory automation (ETFA), Vol. 1
– start-page: 231
  year: 2008
  end-page: 235
  ident: b179
  article-title: Cost-sensitive learning and the class imbalance problem
  publication-title: Encyclopedia of machine learning, Vol. 2011
– volume: 177
  year: 2020
  ident: b329
  article-title: A predictive model of recreational water quality based on adaptive synthetic sampling algorithms and machine learning
  publication-title: Water Research
– volume: 409
  year: 2017
  ident: b178
  article-title: Clustering-based undersampling in class-imbalanced data
  publication-title: Information Sciences
– volume: 34
  start-page: 719
  year: 2022
  end-page: 730
  ident: b181
  article-title: Early prediction of incident liver disease using conventional risk factors and gut-microbiome-augmented gradient boosting
  publication-title: Cell Metabolism
– volume: 465
  start-page: 1
  year: 2018
  end-page: 20
  ident: b72
  article-title: Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE
  publication-title: Information Sciences
– start-page: 34
  year: 2022
  end-page: 47
  ident: b75
  article-title: LightGBM-RF: A hybrid model for anomaly detection in smart building
  publication-title: Frontiers in cyber security: 5th international conference, FCS 2022, Kumasi, Ghana, December 13–15, 2022, Proceedings
– volume: 32
  year: 2020
  ident: b280
  article-title: A novel cryptocurrency price trend forecasting model based on LightGBM
  publication-title: Finance Research Letters
– volume: 6
  start-page: 448
  year: 1976
  end-page: 452
  ident: b290
  article-title: An experiment with the edited nearest-neighbor rule
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics
– start-page: 3207
  year: 2020
  end-page: 3216
  ident: b167
  article-title: Celeb-df: A large-scale challenging dataset for deepfake forensics
  publication-title: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition
– reference: Re, M., & Valentini, G. 1 ensemble methods: a review 3 (1).
– start-page: 1
  year: 2012
  end-page: 6
  ident: b339
  article-title: Sampling + reweighting: Boosting the performance of AdaBoost on imbalanced datasets
  publication-title: The 2012 international joint conference on neural networks (IJCNN)
– start-page: 1390
  year: 2008
  end-page: 1396
  ident: b297
  article-title: Multi-class AUC metrics and weighted alternatives
  publication-title: 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence)
– volume: 18
  year: 2022
  ident: b344
  article-title: Research and application of XGBoost in imbalanced data
  publication-title: International Journal of Distributed Sensor Networks
– start-page: 1189
  year: 2001
  end-page: 1232
  ident: b88
  article-title: Greedy function approximation: a gradient boosting machine
  publication-title: Annals of Statistics
– volume: 11
  start-page: 820
  year: 2020
  ident: b94
  article-title: Identification of orphan genes in unbalanced datasets based on ensemble learning
  publication-title: Frontiers in Genetics
– volume: 12
  start-page: 189
  year: 2015
  end-page: 207
  ident: b198
  article-title: Application of adaboost algorithm in basketball player detection
  publication-title: Acta Polytechnica Hungarica
– start-page: 660
  year: 2016
  end-page: 676
  ident: b302
  article-title: Learning to count with cnn boosting
  publication-title: European conference on computer vision
– year: 2020
  ident: b321
  article-title: Time series data augmentation for deep learning: A survey
– start-page: 1
  year: 2012
  end-page: 34
  ident: b234
  article-title: Ensemble learning
  publication-title: Ensemble machine learning
– volume: 6
  start-page: 1
  year: 2019
  end-page: 48
  ident: b274
  article-title: A survey on image data augmentation for deep learning
  publication-title: Journal of Big Data
– volume: 10
  start-page: 1
  year: 2001
  end-page: 50
  ident: b298
  article-title: The art of data augmentation
  publication-title: Journal of Computational and Graphical Statistics
– start-page: 278
  year: 1995
  end-page: 282
  ident: b120
  article-title: Random decision forests
  publication-title: Proceedings of 3rd international conference on document analysis and recognition, Vol. 1
– start-page: 810
  year: 2014
  end-page: 814
  ident: b244
  article-title: Data augmentation for low resource languages
  publication-title: INTERSPEECH 2014: 15th annual conference of the international speech communication association
– volume: 7
  year: 2020
  ident: b325
  article-title: Ensemble flood forecasting: Current status and future opportunities
  publication-title: Wiley Interdisciplinary Reviews: Water
– volume: 39
  start-page: 261
  year: 2013
  end-page: 283
  ident: b153
  article-title: Decision trees: a recent overview
  publication-title: Artificial Intelligence Review
– start-page: 4393
  year: 2018
  end-page: 4402
  ident: b255
  article-title: Deep one-class classification
  publication-title: International conference on machine learning
– start-page: 1
  year: 2021
  end-page: 14
  ident: b18
  article-title: Performance analysis of regression algorithms and feature selection techniques to predict PM 2.5 in smart cities
  publication-title: International Journal of Systems Assurance Engineering and Management
– start-page: 1322
  year: 2008
  end-page: 1328
  ident: b116
  article-title: ADASYN: Adaptive synthetic sampling approach for imbalanced learning
  publication-title: 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence)
– volume: 58
  start-page: 308
  year: 2015
  end-page: 324
  ident: b343
  article-title: A gradient boosting method to improve travel time prediction
  publication-title: Transportation Research Part C (Emerging Technologies)
– volume: 6
  start-page: 4641
  year: 2018
  end-page: 4652
  ident: b357
  article-title: Class weights random forest algorithm for processing class imbalanced medical data
  publication-title: IEEE Access
– volume: 129
  year: 2022
  ident: b43
  article-title: Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
  publication-title: Applied Soft Computing
– volume: 14
  start-page: 3547
  year: 2022
  ident: b216
  article-title: Ensemble machine learning of Random Forest, AdaBoost and XGBoost for vertical total electron content forecasting
  publication-title: Remote Sensing
– volume: 120
  year: 2020
  ident: b36
  article-title: Addressing class imbalance in deep learning for small lesion detection on medical images
  publication-title: Computers in Biology and Medicine
– volume: 12
  year: 2021
  ident: b86
  article-title: Improving imbalanced land cover classification with K-means SMOTE: Detecting and oversampling distinctive minority spectral signatures
  publication-title: Information
– start-page: 160
  year: 2021
  end-page: 164
  ident: b328
  article-title: Speaker recognition system with limited data based on LightGBM and fusion features
  publication-title: 2021 6th international conference on computational intelligence and applications (ICCIA)
– volume: 6
  start-page: 429
  year: 2002
  end-page: 449
  ident: b135
  article-title: The class imbalance problem: A systematic study
  publication-title: Intelligent Data Analysis
– volume: 5
  start-page: 241
  year: 1992
  end-page: 259
  ident: b324
  article-title: Stacked generalization
  publication-title: Neural Networks
– volume: 29
  start-page: 45
  year: 1997
  end-page: 63
  ident: b25
  article-title: Online learning versus offline learning
  publication-title: Machine Learning
– year: 2022
  ident: b199
  article-title: Imbalance example-dependent cost classification: A Bayesian based method
  publication-title: Expert Systems with Applications
– volume: 31
  start-page: 955
  year: 2019
  end-page: 965
  ident: b286
  article-title: An in-depth experimental study of anomaly detection using gradient boosted machine
  publication-title: Neural Computing and Applications
– volume: 54
  start-page: 1
  year: 2021
  end-page: 38
  ident: b316
  article-title: Generative adversarial networks in computer vision: A survey and taxonomy
  publication-title: ACM Computing Surveys
– volume: 3
  start-page: 19
  year: 2000
  end-page: 30
  ident: b58
  article-title: Nearest neighbour editing and condensing tools–synergy exploitation
  publication-title: Pattern Analysis & Applications
– volume: 105
  start-page: 2499
  year: 2021
  end-page: 2522
  ident: b162
  article-title: Application of the borderline-SMOTE method in susceptibility assessments of debris flows in Pinggu District, Beijing, China
  publication-title: Natural Hazards
– start-page: 1
  year: 2020
  end-page: 7
  ident: b131
  article-title: Network anomaly detection using lightgbm: A gradient boosting classifier
  publication-title: 2020 30th international telecommunication networks and applications conference (ITNAC)
– volume: 9
  year: 2019
  ident: b322
  article-title: The emergence of deepfake technology: A review
  publication-title: Technology Innovation Management Review
– volume: 33
  start-page: 18917
  year: 2020
  end-page: 18930
  ident: b223
  article-title: Optimization and generalization analysis of transduction through gradient boosting and application to multi-scale graph neural networks
  publication-title: Advances in Neural Information Processing Systems
– volume: 21
  start-page: 785
  year: 2008
  end-page: 795
  ident: b166
  article-title: AdaBoost with SVM-based component classifiers
  publication-title: Engineering Applications of Artificial Intelligence
– volume: 17
  start-page: 1411
  year: 2006
  end-page: 1423
  ident: b172
  article-title: A fast and accurate online sequential learning algorithm for feedforward networks
  publication-title: IEEE Transactions on Neural Networks
– year: 2022
  ident: b267
  article-title: SMOTified-GAN for class imbalanced pattern classification problems
  publication-title: IEEE Access
– volume: 27
  start-page: 1947
  year: 2015
  end-page: 1961
  ident: b231
  article-title: Oversampling the minority class in the feature space
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
– start-page: 346
  year: 2019
  end-page: 356
  ident: b113
  article-title: Investigating random undersampling and feature selection on bioinformatics big data
  publication-title: 2019 IEEE fifth international conference on big data computing service and applications (BigDataService)
– volume: 9
  start-page: 48
  year: 2014
  end-page: 57
  ident: b41
  article-title: Jumping NLP curves: A review of natural language processing research
  publication-title: IEEE Computational Intelligence Magazine
– volume: 26
  start-page: 445
  year: 2002
  end-page: 463
  ident: b233
  article-title: Decision trees: an overview and their use in medicine
  publication-title: Journal of Medical Systems
– year: 2021
  ident: b266
  article-title: Cpt: A pre-trained unbalanced transformer for both chinese language understanding and generation
– volume: 30
  year: 2017
  ident: b145
  article-title: What uncertainties do we need in bayesian deep learning for computer vision?
  publication-title: Advances in Neural Information Processing Systems
– volume: 28
  year: 2015
  ident: b84
  article-title: Precision-recall-gain curves: PR analysis done right
  publication-title: Advances in Neural Information Processing Systems
– year: 2020
  ident: b15
  article-title: Gradient boosting neural networks: Grownet
– year: 2021
  ident: b82
  article-title: A survey of data augmentation approaches for NLP
– start-page: 18187
  year: 2022
  end-page: 18196
  ident: b175
  article-title: Text to image generation with semantic-spatial aware GAN
  publication-title: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition
– volume: 61
  year: 2021
  ident: b272
  article-title: Effects of class imbalance on resampling and ensemble learning for improved prediction of cyanobacteria blooms
  publication-title: Ecological Informatics
– volume: 133
  start-page: 121
  year: 2021
  end-page: 129
  ident: b239
  article-title: Natural language processing was effective in assisting rapid title and abstract screening when updating systematic reviews
  publication-title: Journal of Clinical Epidemiology
– volume: 243
  start-page: 88
  year: 2017
  end-page: 102
  ident: b221
  article-title: Fast-CBUS: A fast clustering-based undersampling method for addressing the class imbalance problem
  publication-title: Neurocomputing
– volume: 100
  start-page: 355
  year: 2017
  end-page: 364
  ident: b237
  article-title: Weighted-SMOTE: A modification to SMOTE for event classification in sodium cooled fast reactors
  publication-title: Progress in Nuclear Energy
– volume: 12
  year: 2013
  ident: b313
  article-title: Online class imbalance learning and its applications in fault detection
  publication-title: International Journal of Computational Intelligence and Applications
– start-page: 2118
  year: 2016
  end-page: 2124
  ident: b314
  article-title: Dealing with multiple classes in online class imbalance learning
– volume: 30
  start-page: 1145
  year: 1997
  end-page: 1159
  ident: b34
  article-title: The use of the area under the ROC curve in the evaluation of machine learning algorithms
  publication-title: Pattern Recognition
– volume: 150
  start-page: 529
  year: 2015
  end-page: 542
  ident: b30
  article-title: Neighbourhood sampling in bagging for imbalanced data
  publication-title: Neurocomputing
– year: 2020
  ident: b182
  article-title: Early prediction of liver disease using conventional risk factors and gut microbiome-augmented gradient boosting
  publication-title: MedRxiv
– volume: 7
  start-page: 154096
  year: 2019
  end-page: 154113
  ident: b189
  article-title: Black-box vs. white-box: Understanding their advantages and weaknesses from a practical point of view
  publication-title: IEEE Access
– volume: 19
  year: 2019
  ident: b171
  article-title: Improved PSO AdaBoost ensemble algorithm for imbalanced data
  publication-title: Sensors
– volume: 15
  start-page: 15974
  year: 2015
  end-page: 15987
  ident: b264
  article-title: Smart city mobility application—gradient boosting trees for mobility prediction and analysis based on crowdsourced data
  publication-title: Sensors
– year: 2002
  ident: b288
  article-title: One-class classification: Concept learning in the absence of counter-examples
– start-page: 269
  year: 2013
  end-page: 278
  ident: b31
  article-title: Extending bagging for imbalanced data
  publication-title: Proceedings of the 8th international conference on computer recognition systems CORES 2013
– start-page: 237
  year: 2016
  end-page: 269
  ident: b282
  article-title: Decision tree learning
  publication-title: Machine learning models and algorithms for big data classification
– start-page: 718
  year: 2021
  end-page: 729
  ident: b174
  article-title: Study of application of composite sampling and improved LightGBM algorithm to the diagnosis of unbalanced transformer fault samples
  publication-title: International conference on mechanical engineering, measurement control, and instrumentation, Vol. 11930
– volume: 41
  start-page: 552
  year: 2011
  end-page: 568
  ident: b149
  article-title: Comparing boosting and bagging techniques with noisy and imbalanced data
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans
– volume: 30
  year: 2017
  ident: b144
  article-title: Lightgbm: A highly efficient gradient boosting decision tree
  publication-title: Advances in Neural Information Processing Systems
– volume: 4
  start-page: 184
  year: 2022
  end-page: 207
  ident: b165
  article-title: A hybrid XGBoost-MLP model for credit risk assessment on digital supply chain finance
  publication-title: Forecasting
– start-page: 878
  year: 2005
  end-page: 887
  ident: b108
  article-title: Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning
  publication-title: International conference on intelligent computing
– volume: 158
  start-page: 48
  year: 2015
  end-page: 61
  ident: b212
  article-title: Coupling different methods for overcoming the class imbalance problem
  publication-title: Neurocomputing
– year: 2017
  ident: b73
  article-title: UCI machine learning repository
– volume: 109
  start-page: 359
  year: 2017
  end-page: 366
  ident: b8
  article-title: Using word embedding and ensemble learning for highly imbalanced data sentiment analysis in short arabic text
  publication-title: Procedia Computer Science
– volume: 39
  start-page: 539
  year: 2008
  end-page: 550
  ident: b185
  article-title: Exploratory undersampling for class-imbalance learning
  publication-title: IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics)
– start-page: 58
  year: 2004
  ident: b293
  article-title: The problem of concept drift: definitions and related work, Vol. 106
– start-page: 12299
  year: 2021
  end-page: 12310
  ident: b51
  article-title: Pre-trained image processing transformer
  publication-title: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition
– volume: 32
  start-page: 13
  year: 2004
  end-page: 29
  ident: b137
  article-title: Process consistency for adaboost
  publication-title: The Annals of Statistics
– volume: 501
  start-page: 118
  year: 2019
  end-page: 135
  ident: b71
  article-title: Geometric SMOTE a geometrically enhanced drop-in replacement for SMOTE
  publication-title: Information Sciences
– volume: 9
  start-page: 8
  year: 2022
  end-page: 15
  ident: b136
  article-title: Implementation of stacking ensemble classifier for multi-class classification of COVID-19 vaccines topics on Twitter
  publication-title: Scientific Journal of Informatics
– start-page: 1
  year: 2022
  end-page: 19
  ident: b105
  article-title: Fraud detection in mobile payment systems using an XGBoost-based framework
  publication-title: Information Systems Frontiers
– volume: 13
  start-page: 21
  year: 1967
  end-page: 27
  ident: b55
  article-title: Nearest neighbor pattern classification
  publication-title: IEEE Transactions on Information Theory
– volume: 131
  start-page: 240
  year: 2022
  end-page: 254
  ident: b65
  article-title: Imbalanced data classification: A KNN and generative adversarial networks-based hybrid approach for intrusion detection
  publication-title: Future Generation Computer Systems
– volume: 10
  start-page: 1151
  year: 2014
  ident: b300
  article-title: Iterative dichotomiser-3 algorithm in data mining applied to diabetes database
  publication-title: Journal of Computer Science
– start-page: 735
  year: 2011
  end-page: 744
  ident: b20
  article-title: A novel synthetic minority oversampling technique for imbalanced data set learning
  publication-title: International conference on neural information processing
– volume: 16
  start-page: 449
  year: 2013
  end-page: 475
  ident: b320
  article-title: Effective detection of sophisticated online banking fraud on extremely imbalanced data
  publication-title: World Wide Web
– volume: 10
  start-page: 607
  year: 2021
  end-page: 624
  ident: b338
  article-title: A survey on deepfake video detection
  publication-title: IET Biometrics
– volume: 35
  start-page: 53
  year: 2018
  end-page: 65
  ident: b56
  article-title: Generative adversarial networks: An overview
  publication-title: IEEE Signal Processing Magazine
– start-page: 1
  year: 2017
  end-page: 4
  ident: b24
  article-title: Comparison of ensemble learning methods applied to network intrusion detection
  publication-title: Proceedings of the second international conference on internet of things, data and cloud computing
– year: 2007
  ident: b117
  article-title: Asymmetric gradient boosting with application to spam filtering
  publication-title: CEAS
– volume: 10
  start-page: 99129
  year: 2022
  end-page: 99149
  ident: b200
  article-title: A survey of ensemble learning: Concepts, algorithms, applications, and prospects
  publication-title: IEEE Access
– volume: 510
  start-page: 1
  year: 2022
  end-page: 14
  ident: b217
  article-title: Evolutionary bagging for ensemble learning
  publication-title: Neurocomputing
– start-page: 1
  year: 2023
  end-page: 15
  ident: b106
  article-title: Speech emotion recognition and text sentiment analysis for financial distress prediction
  publication-title: Neural Computing and Applications
– year: 2020
  ident: b155
  article-title: Data augmentation using pre-trained transformer models
– volume: 10
  start-page: 40482
  year: 2022
  end-page: 40495
  ident: b45
  article-title: Revisiting Bayesian autoencoders with MCMC
  publication-title: IEEE Access
– volume: 7
  start-page: 9515
  year: 2019
  end-page: 9530
  ident: b197
  article-title: Imbalanced fault diagnosis of rolling bearing based on generative adversarial network: A comparative study
  publication-title: IEEE Access
– start-page: 7383
  year: 2020
  end-page: 7390
  ident: b12
  article-title: Do not have enough data? Deep learning to the rescue!
  publication-title: Proceedings of the AAAI conference on artificial intelligence, Vol. 34
– volume: 22
  start-page: 1
  year: 2022
  end-page: 16
  ident: b184
  article-title: Solving the class imbalance problem using ensemble algorithm: application of screening for aortic dissection
  publication-title: BMC Medical Informatics and Decision Making
– volume: 8
  year: 2013
  ident: b319
  article-title: The role of balanced training and testing data sets for binary classifiers in bioinformatics
  publication-title: PLoS One
– year: 2012
  ident: b340
  article-title: Adadelta: an adaptive learning rate method
– volume: 14
  start-page: 415
  year: 2023
  ident: b13
  article-title: Multi-class skin cancer classification using vision transformer networks and convolutional neural network-based pre-trained models
  publication-title: Information
– start-page: 468
  year: 2020
  end-page: 481
  ident: b308
  article-title: Malicious domain detection based on k-means and smote
  publication-title: International conference on computational science
– volume: 159
  start-page: 736
  year: 2019
  end-page: 745
  ident: b225
  article-title: Dealing with data imbalance in text classification
  publication-title: Procedia Computer Science
– start-page: 1
  year: 2011
  end-page: 4
  ident: b38
  article-title: MUTE: Majority under-sampling technique
  publication-title: 2011 8th international conference on information, communications & signal processing
– start-page: 91
  year: 2019
  end-page: 94
  ident: b125
  article-title: Short paper: Credit card fraud detection using LightGBM with asymmetric error control
  publication-title: 2019 second international conference on artificial intelligence for industries (AI4I)
– start-page: 669
  year: 2001
  end-page: 670
  ident: b342
  article-title: A comparison of stacking with meta decision trees to bagging, boosting, and stacking with other methods
  publication-title: Proceedings 2001 IEEE international conference on data mining
– volume: 30
  start-page: 681
  year: 2020
  end-page: 694
  ident: b85
  article-title: GPT-3: Its nature, scope, limits, and consequences
  publication-title: Minds and Machines
– start-page: 243
  year: 2020
  end-page: 248
  ident: b205
  article-title: Machine learning with oversampling and undersampling techniques: overview study and experimental results
  publication-title: 2020 11th international conference on information and communication systems (ICICS)
– volume: 38
  start-page: 2395
  year: 2011
  end-page: 2400
  ident: b213
  article-title: Reduced Reward-punishment editing for building ensembles of classifiers
  publication-title: Expert Systems with Applications
– volume: 459
  start-page: 249
  year: 2021
  end-page: 289
  ident: b122
  article-title: Online learning: A comprehensive survey
  publication-title: Neurocomputing
– volume: 86
  year: 2020
  ident: b250
  article-title: Ensemble approach based on bagging, boosting and stacking for short-term prediction in agribusiness time series
  publication-title: Applied Soft Computing
– volume: 23
  start-page: 69
  year: 1996
  end-page: 101
  ident: b323
  article-title: Learning in the presence of concept drift and hidden contexts
  publication-title: Machine Learning
– volume: 158
  start-page: 81
  year: 2018
  end-page: 93
  ident: b28
  article-title: An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme
  publication-title: Knowledge-Based Systems
– volume: 2
  start-page: 349
  year: 2009
  end-page: 360
  ident: b114
  article-title: Multi-class adaboost
  publication-title: Statistics and its Interface
– volume: 22
  start-page: 6766
  year: 2022
  ident: b295
  article-title: Explainable malware detection system using transformers-based transfer learning and multi-model visual representation
  publication-title: Sensors
– volume: 110
  start-page: 392
  year: 2021
  end-page: 400
  ident: b160
  article-title: Artificial intelligence for unstructured healthcare data: application to coding of patient reporting of adverse drug reactions
  publication-title: Clinical Pharmacology & Therapeutics
– volume: 72
  start-page: 327
  year: 2017
  end-page: 340
  ident: b356
  article-title: Synthetic minority oversampling technique for multiclass imbalance problems
  publication-title: Pattern Recognition
– volume: 9
  start-page: 130353
  year: 2021
  end-page: 130365
  ident: b44
  article-title: Bayesian graph convolutional neural networks via tempered MCMC
  publication-title: IEEE Access
– start-page: 505
  year: 2008
  end-page: 519
  ident: b118
  article-title: One-class classification by combining density and class probability estimation
  publication-title: Joint European conference on machine learning and knowledge discovery in databases
– volume: 50
  start-page: 419
  year: 2008
  end-page: 430
  ident: b257
  article-title: Youden Index and optimal cut-point estimated from observations affected by a lower limit of detection
  publication-title: Biometrical Journal: Journal of Mathematical Methods in Biosciences
– start-page: 1
  year: 2016
  end-page: 7
  ident: b81
  article-title: Efficient sampling techniques for ensemble learning and diagnosing bearing defects under class imbalanced condition
  publication-title: 2016 IEEE symposium series on computational intelligence (SSCI)
– volume: 45
  start-page: 3738
  year: 2012
  end-page: 3750
  ident: b285
  article-title: Inverse random under sampling for class imbalance problem and its application to multi-label classification
  publication-title: Pattern Recognition
– volume: 7
  start-page: 150960
  year: 2019
  end-page: 150968
  ident: b52
  article-title: Prediction of extubation failure for intensive care unit patients using light gradient boosting machine
  publication-title: IEEE Access
– volume: 2019
  year: 2019
  ident: b201
  article-title: Light gbm machine learning algorithm to online click fraud detection
  publication-title: Journal of Information Assurance & Cybersecurity
– start-page: 593
  year: 2012
  end-page: 602
  ident: b283
  article-title: Application of bagging, boosting and stacking to intrusion detection
  publication-title: International workshop on machine learning and data mining in pattern recognition
– year: 2014
  ident: b121
  article-title: gamboostLSS: An R package for model building and variable selection in the GAMLSS framework
– volume: 210
  year: 2022
  ident: b1
  article-title: Waveguide quality inspection in quantum cascade lasers: A capsule neural network approach
  publication-title: Expert Systems with Applications
– start-page: 1371
  year: 2017
  end-page: 1374
  ident: b350
  article-title: Machine learning in rock facies classification: An application of XGBoost
  publication-title: International geophysical conference, Qingdao, China, 17-20 April 2017
– volume: 14
  start-page: 482
  year: 2016
  end-page: 501
  ident: b40
  article-title: Randomized oversampling for generalized multiscale finite element methods
  publication-title: Multiscale Modeling and Simulation
– volume: 23
  start-page: 10755
  year: 2019
  end-page: 10767
  ident: b154
  article-title: TLUSBoost algorithm: a boosting solution for class imbalance problem
  publication-title: Soft Computing
– start-page: 1111
  year: 2019
  end-page: 1116
  ident: b195
  article-title: LightGBM: An effective decision tree gradient boosting method to predict customer loyalty in the finance industry
  publication-title: 2019 14th international conference on computer science & education (ICCSE)
– volume: 602
  start-page: 259
  year: 2022
  end-page: 268
  ident: b310
  article-title: Corporate finance risk prediction based on LightGBM
  publication-title: Information Sciences
– year: 2014
  ident: b151
  article-title: Adam: A method for stochastic optimization
– volume: 213
  year: 2023
  ident: b89
  article-title: Automatic grading of Diabetic macular edema based on end-to-end network
  publication-title: Expert Systems with Applications
– start-page: 13622
  year: 2021
  end-page: 13631
  ident: b194
  article-title: MUST-GAN: Multi-level statistics transfer for self-driven person image generation
  publication-title: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition
– start-page: 731
  year: 2006
  end-page: 740
  ident: b335
  article-title: Under-sampling approaches for improving prediction of the minority class in an imbalanced dataset
  publication-title: Intelligent control and automation
– volume: 195
  year: 2022
  ident: b186
  article-title: A two-stage hybrid credit risk prediction model based on XGBoost and graph-based deep neural network
  publication-title: Expert Systems with Applications
– volume: 29
  start-page: 4802
  year: 2018
  end-page: 4821
  ident: b315
  article-title: A systematic study of online class imbalance learning with concept drift
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
– start-page: 4055
  year: 2018
  end-page: 4064
  ident: b228
  article-title: Image transformer
  publication-title: International conference on machine learning
– volume: 2
  start-page: 268
  year: 2021
  ident: b111
  article-title: Gradient boosted decision tree algorithms for medicare fraud detection
  publication-title: SN Computer Science
– volume: 16
  year: 2021
  ident: b132
  article-title: An empirical survey of data augmentation for time series classification with neural networks
  publication-title: Plos One
– volume: 25
  start-page: 289
  year: 2013
  end-page: 302
  ident: b251
  article-title: Multiclass from binary: Expanding one-versus-all, one-versus-one and ecoc-based approaches
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
– volume: 65
  start-page: 1595
  year: 2017
  end-page: 1605
  ident: b2
  article-title: Fault detection and classification based on co-training of semisupervised machine learning
  publication-title: IEEE Transactions on Industrial Electronics
– volume: 108
  year: 2020
  ident: b95
  article-title: Handling imbalanced medical image data: A deep-learning-based one-class classification approach
  publication-title: Artificial Intelligence in Medicine
– volume: 133
  start-page: 433
  year: 2019
  end-page: 441
  ident: b334
  article-title: Real-time condition monitoring and fault detection of components based on machine-learning reconstruction model
  publication-title: Renewable Energy
– volume: 7
  start-page: 93010
  year: 2019
  end-page: 93022
  ident: b196
  article-title: An experimental study with imbalanced classification approaches for credit card fraud detection
  publication-title: IEEE Access
– volume: 24
  start-page: 1565
  year: 2006
  end-page: 1567
  ident: b220
  article-title: What is a support vector machine?
  publication-title: Nature biotechnology
– start-page: 1
  year: 2017
  end-page: 5
  ident: b247
  article-title: Cusboost: Cluster-based under-sampling with boosting for imbalanced classification
  publication-title: 2017 2nd international conference on computational systems and information technology for sustainable solution (CSITSS)
– start-page: 205
  year: 2018
  end-page: 220
  ident: b190
  article-title: Image generation from sketch constraint using contextual GAN
  publication-title: Proceedings of the European conference on computer vision (ECCV)
– volume: 517
  start-page: 29
  year: 2019
  end-page: 35
  ident: b327
  article-title: SVM and KNN ensemble learning for traffic incident detection
  publication-title: Physica A. Statistical Mechanics and its Applications
– year: 2023
  ident: b258
  article-title: Explainable AI (XIA): A systematic meta-survey of current challenges and future opportunities
  publication-title: Knowledge-Based Systems
– volume: 4
  start-page: 161
  year: 1989
  end-page: 186
  ident: b296
  article-title: Incremental induction of decision trees
  publication-title: Machine Learning
– volume: 73
  start-page: 3079
  year: 2010
  end-page: 3088
  ident: b93
  article-title: Edited AdaBoost by weighted kNN
  publication-title: Neurocomputing
– volume: 48
  start-page: 1623
  year: 2015
  end-page: 1637
  ident: b281
  article-title: A novel ensemble method for classifying imbalanced data
  publication-title: Pattern Recognition
– volume: 46
  start-page: 109
  year: 2013
  end-page: 132
  ident: b32
  article-title: Recommender systems survey
  publication-title: Knowledge-Based Systems
– volume: 22
  start-page: 1067
  year: 2020
  end-page: 1083
  ident: b276
  article-title: Bankruptcy prediction using deep learning approach based on borderline SMOTE
  publication-title: Information Systems Frontiers
– year: 2020
  ident: b355
  article-title: Pre-training text-to-text transformers for concept-centric common sense
– start-page: 161
  year: 2006
  end-page: 168
  ident: b42
  article-title: An empirical comparison of supervised learning algorithms
  publication-title: Proceedings of the 23rd international conference on Machine learning
– year: 2020
  ident: b129
  article-title: An optimized lightgbm model for fraud detection
  publication-title: Journal of physics: Conference series, Vol. 1651
– volume: 152
  year: 2020
  ident: b347
  article-title: Machinery fault diagnosis with imbalanced data using deep generative adversarial networks
  publication-title: Measurement
– year: 2018
  ident: b226
  article-title: Application of XGBoost algorithm in hourly PM2. 5 concentration prediction
  publication-title: IOP conference series: earth and environmental science, Vol. 113
– volume: 212
  year: 2023
  ident: b256
  article-title: An ensemble credit scoring model based on logistic regression with heterogeneous balancing and weighting effects
  publication-title: Expert Systems with Applications
– volume: 54
  start-page: 1937
  year: 2021
  end-page: 1967
  ident: b27
  article-title: A comparative analysis of gradient boosting algorithms
  publication-title: Artificial Intelligence Review
– volume: 21
  start-page: 2126
  year: 2019
  end-page: 2134
  ident: b169
  article-title: Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
  publication-title: Genetics in Medicine
– volume: 202
  year: 2022
  ident: b210
  article-title: A novel XGBoost extension for credit scoring class-imbalanced data combining a generalized extreme value link and a modified focal loss function
  publication-title: Expert Systems with Applications
– start-page: 188
  year: 2010
  end-page: 197
  ident: b147
  article-title: A survey of recent trends in one class classification
  publication-title: Artificial intelligence and cognitive science
– volume: 51
  start-page: 62
  year: 2015
  end-page: 71
  ident: b275
  article-title: Software defect prediction using a cost sensitive decision forest and voting, and a potential solution to the class imbalance problem
  publication-title: Information Systems
– start-page: 734
  year: 2018
  end-page: 738
  ident: b107
  article-title: GAN-based synthetic brain MR image generation
  publication-title: 2018 IEEE 15th international symposium on biomedical imaging (ISBI 2018)
– volume: 50
  start-page: 97
  year: 2018
  end-page: 127
  ident: b156
  article-title: Multi-class and feature selection extensions of roughly balanced bagging for imbalanced data
  publication-title: Journal of Intelligent Information Systems
– start-page: 345
  year: 2005
  end-page: 359
  ident: b102
  article-title: A probabilistic interpretation of precision, recall and F-score, with implication for evaluation
  publication-title: European conference on information retrieval
– year: 2020
  ident: b242
  article-title: Next-generation machine learning with spark: Covers XGBoost, LightGBM, Spark NLP, distributed deep learning with Keras, and more
– volume: 10
  start-page: 749
  year: 2022
  ident: b180
  article-title: Predictive classifier for cardiovascular disease based on stacking model fusion
  publication-title: Processes
– volume: 8
  start-page: 1
  year: 2021
  end-page: 20
  ident: b359
  article-title: Detecting web attacks using random undersampling and ensemble learners
  publication-title: Journal of Big Data
– year: 2004
  ident: b289
  article-title: Classification and regression trees (CART) theory and applications, Vol. 54
– start-page: 125
  year: 2017
  end-page: 131
  ident: b62
  article-title: Fraud detection in credit card transactions by using classification algorithms
  publication-title: 2017 international conference on current trends in computer, electrical, electronics and communication (CTCEEC)
– volume: 18
  start-page: 1206
  year: 2021
  end-page: 1217
  ident: b138
  article-title: Data augmentation classifier for imbalanced fault classification
  publication-title: IEEE Transactions on Automation Science and Engineering
– volume: 20
  start-page: 273
  year: 1995
  end-page: 297
  ident: b54
  article-title: Support-vector networks
  publication-title: Machine Learning
– volume: 25
  start-page: 197
  year: 2016
  end-page: 227
  ident: b29
  article-title: A random forest guided tour
  publication-title: Test
– volume: 2
  start-page: 412
  year: 2009
  end-page: 426
  ident: b119
  article-title: Roughly balanced bagging for imbalanced data
  publication-title: Statistical Analysis and Data Mining: The ASA Data Science Journal
– volume: 63
  start-page: 3
  year: 2006
  end-page: 42
  ident: b99
  article-title: Extremely randomized trees
  publication-title: Machine Learning
– start-page: 181
  year: 2021
  end-page: 188
  ident: b337
  article-title: Speech recognition based on concatenated acoustic feature and lightGBM model
  publication-title: Twelfth international conference on signal processing systems, Vol. 11719
– start-page: 486
  year: 2022
  end-page: 491
  ident: b76
  article-title: A novel technique to solve class imbalance problem
  publication-title: 2022 international conference on innovations in science, engineering and technology (ICISET)
– volume: 10
  start-page: 250
  year: 2018
  ident: b157
  article-title: A cluster-based boosting algorithm for bankruptcy prediction in a highly imbalanced dataset
  publication-title: Symmetry
– volume: 73
  start-page: 914
  year: 2018
  end-page: 920
  ident: b47
  article-title: Application of eXtreme gradient boosting trees in the construction of credit risk assessment models for financial institutions
  publication-title: Applied Soft Computing
– volume: 70
  start-page: 1125
  year: 2008
  end-page: 1132
  ident: b352
  article-title: Protein classification with imbalanced data
  publication-title: Proteins: Structure, Function, and Bioinformatics
– year: 2022
  ident: b46
  article-title: Distributed Bayesian optimisation framework for deep neuroevolution
  publication-title: Neurocomputing
– start-page: 572
  year: 2020
  end-page: 579
  ident: b110
  article-title: Performance of catboost and xgboost in medicare fraud detection
  publication-title: 2020 19th IEEE international conference on machine learning and applications (ICMLA)
– volume: 7
  start-page: 21
  year: 2013
  ident: b215
  article-title: Gradient boosting machines, a tutorial
  publication-title: Frontiers in Neurorobotics
– volume: 207
  year: 2021
  ident: b150
  article-title: Improving the performance of machine learning models for early warning of harmful algal blooms using an adaptive synthetic sampling method
  publication-title: Water Research
– volume: 20
  start-page: 1
  year: 2020
  end-page: 25
  ident: b115
  article-title: Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences
  publication-title: BMC Medical Informatics and Decision Making
– volume: 25
  year: 2012
  ident: b277
  article-title: Practical Bayesian optimization of machine learning algorithms
  publication-title: Advances in Neural Information Processing Systems
– start-page: 785
  year: 2016
  end-page: 794
  ident: b49
  article-title: Xgboost: A scalable tree boosting system
  publication-title: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining
– volume: 14
  start-page: 241
  year: 2020
  end-page: 258
  ident: b68
  article-title: A survey on ensemble learning
  publication-title: Frontiers of Computer Science
– year: 2012
  ident: b353
  article-title: Ensemble methods: Foundations and algorithms
– start-page: 148
  year: 1996
  end-page: 156
  ident: b87
  article-title: Experiments with a new boosting algorithm
  publication-title: Icml, Vol. 96
– volume: 69
  start-page: 35
  year: 2017
  end-page: 49
  ident: b183
  article-title: Addressing the class imbalance problem in twitter spam detection using ensemble learning
  publication-title: Computers & Security
– volume: 19
  start-page: 2632
  year: 2021
  end-page: 2641
  ident: b219
  article-title: A novel method for Identification of Glutarylation sites combining Borderline-SMOTE with Tomek links technique in imbalanced data
  publication-title: IEEE/ACM Transactions on Computational Biology and Bioinformatics
– volume: 404
  start-page: 351
  year: 2020
  end-page: 366
  ident: b284
  article-title: AdaBoost-CNN: An adaptive boosting algorithm for convolutional neural networks to classify multi-class imbalanced datasets using transfer learning
  publication-title: Neurocomputing
– start-page: 608
  year: 2020
  end-page: 612
  ident: b287
  article-title: A customer churn prediction model based on XGBoost and MLP
  publication-title: 2020 international conference on computer engineering and application (ICCEA)
– volume: 55
  start-page: 1
  year: 2022
  end-page: 96
  ident: b254
  article-title: Tackling climate change with machine learning
  publication-title: ACM Computing Surveys
– volume: 199
  start-page: 176
  year: 2006
  end-page: 187
  ident: b206
  article-title: Predicting tree species presence and basal area in Utah: a comparison of stochastic gradient boosting, generalized additive models, and tree-based methods
  publication-title: Ecological Modelling
– volume: 93
  start-page: 3
  year: 2017
  end-page: 12
  ident: b61
  article-title: Redundancy-driven modified Tomek-link based undersampling: A solution to class imbalance
  publication-title: Pattern Recognition Letters
– volume: 11
  start-page: 1
  year: 2011
  end-page: 13
  ident: b146
  article-title: Predicting disease risks from highly imbalanced data using random forest
  publication-title: BMC Medical Informatics and Decision Making
– start-page: 2523
  year: 2019
  end-page: 2531
  ident: b348
  article-title: WOTBoost: Weighted oversampling technique in boosting for imbalanced learning
  publication-title: 2019 IEEE international conference on big data (Big data)
– start-page: 256
  year: 2019
  end-page: 263
  ident: b227
  article-title: A signature-based assistant random oversampling method for malware detection
  publication-title: 2019 18th IEEE International conference on trust, security and privacy in computing and communications/13th IEEE international conference on big data science and engineering (TrustCom/BigDataSE)
– volume: 11
  start-page: 1495
  year: 2022
  ident: b349
  article-title: Coronary artery disease detection model based on class balancing methods and LightGBM algorithm
  publication-title: Electronics
– volume: 26
  start-page: 1011
  year: 2008
  end-page: 1013
  ident: b152
  article-title: What are decision trees ?
  publication-title: Nature biotechnology
– volume: 30
  start-page: 916
  year: 2021
  end-page: 925
  ident: b192
  article-title: Class imbalance in gradient boosting classification algorithms: Application to experimental stroke data
  publication-title: Statistical Methods in Medical Research
– volume: 65
  start-page: 124
  year: 2022
  end-page: 138
  ident: b238
  article-title: Improved hybrid bag-boost ensemble with K-means-SMOTE–ENN technique for handling noisy class imbalanced data
  publication-title: The Computer Journal
– start-page: 362
  year: 1999
  end-page: 366
  ident: b79
  article-title: The application of AdaBoost for distributed, scalable and on-line learning
  publication-title: Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
– volume: 5
  start-page: 1
  year: 2018
  end-page: 30
  ident: b159
  article-title: A survey on addressing high-class imbalance in big data
  publication-title: Journal of Big Data
– volume: 62
  start-page: 961
  year: 2006
  end-page: 971
  ident: b294
  article-title: Generalized additive modeling with implicit variable selection by likelihood-based boosting
  publication-title: Biometrics
– volume: 22
  start-page: 4664
  year: 2013
  end-page: 4677
  ident: b351
  article-title: Real-time object tracking via online discriminative feature selection
  publication-title: IEEE Transactions on Image Processing
– volume: 17
  start-page: 641
  year: 2020
  end-page: 658
  ident: b70
  article-title: Improved landslide assessment using support vector machine with bagging, boosting, and stacking ensemble machine learning framework in a mountainous watershed, Japan
  publication-title: Landslides
– start-page: 3468
  year: 2022
  end-page: 3476
  ident: b193
  article-title: Retrieval-based gradient boosting decision trees for disease risk assessment
  publication-title: Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining
– start-page: 84
  year: 2022
  end-page: 92
  ident: b311
  article-title: A modified generative adversarial network for fault diagnosis in high-speed train components with imbalanced and heterogeneous monitoring data
– volume: 1
  start-page: 14
  year: 2011
  end-page: 23
  ident: b187
  article-title: Classification and regression trees
  publication-title: Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
– volume: 115
  year: 2022
  ident: b91
  article-title: Ensemble deep learning: A review
  publication-title: Engineering Applications of Artificial Intelligence
– volume: 12
  start-page: 374
  year: 2021
  ident: b96
  article-title: A tweet sentiment classification approach using a hybrid stacked ensemble technique
  publication-title: Information
– volume: 30
  start-page: 2163
  year: 2018
  end-page: 2172
  ident: b14
  article-title: Biased random forest for dealing with the class imbalance problem
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
– volume: 2
  start-page: 225
  year: 2021
  end-page: 250
  ident: b109
  article-title: Pre-trained models: Past, present and future
  publication-title: AI Open
– volume: 45
  start-page: 203
  year: 2019
  end-page: 212
  ident: b303
  article-title: Application of adaptive boosting (AdaBoost) in demand-driven acquisition (DDA) prediction: A machine-learning approach
  publication-title: The Journal of Academic Librarianship
– volume: 34
  start-page: 3463
  year: 2018
  end-page: 3473
  ident: b260
  article-title: Feature ranking for multi-fault diagnosis of rotating machinery by using random forest and KNN
  publication-title: Journal of Intelligent & Fuzzy Systems
– volume: 149
  start-page: 316
  year: 2015
  end-page: 329
  ident: b203
  article-title: Ensemble of subset online sequential extreme learning machine for class imbalance and concept drift
  publication-title: Neurocomputing
– volume: 51
  start-page: 1
  year: 2018
  end-page: 36
  ident: b249
  article-title: A survey of random forest based methods for intrusion detection systems
  publication-title: ACM Computing Surveys
– year: 2018
  ident: b69
  article-title: CatBoost: gradient boosting with categorical features support
– start-page: 32
  year: 2020
  end-page: 36
  ident: b176
  article-title: Sentiment analysis of e-commerce customer reviews based on natural language processing
  publication-title: Proceedings of the 2020 2nd international conference on big data and artificial intelligence
– volume: 11
  year: 1998
  ident: b246
  article-title: Regularizing adaboost
  publication-title: Advances in Neural Information Processing Systems
– volume: 9
  start-page: 329
  year: 2017
  ident: b261
  article-title: Random forest algorithm for the classification of neuroimaging data in Alzheimer’s disease: a systematic review
  publication-title: Frontiers in Aging Neuroscience
– reference: Vitianingsih, A. V., Othman, Z., Baharin, S. S. K., Suraji, A., & Maukar, A. L. Application of the synthetic over-sampling method to increase the sensitivity of algorithm classification for class imbalance in small spatial datasets.
– volume: 12
  start-page: 51
  year: 2018
  end-page: 56
  ident: b358
  article-title: Research on E-commerce customer churn prediction based on improved value model and XG-boost algorithm
  publication-title: Management Science and Engineering
– volume: 53
  start-page: 1
  year: 2020
  end-page: 37
  ident: b318
  article-title: A survey on Bayesian deep learning
  publication-title: ACM Computing Surveys
– volume: 106
  start-page: 251
  year: 2016
  end-page: 263
  ident: b346
  article-title: Empowering one-vs-one decomposition with ensemble learning for multi-class imbalanced data
  publication-title: Knowledge-Based Systems
– volume: 35
  start-page: 25
  year: 2005
  end-page: 34
  ident: b77
  article-title: On extending f-measure and g-mean metrics to multi-class problems
  publication-title: WIT Transactions on Information and Communication Technologies
– volume: 11
  start-page: 2109
  year: 2010
  end-page: 2113
  ident: b124
  article-title: Model-based boosting 2.0
  publication-title: Journal of Machine Learning Research
– volume: 30
  start-page: 282
  year: 2023
  end-page: 291
  ident: b142
  article-title: Voice-based gender recognition model using FRT and light GBM
  publication-title: Tehnički Vjesnik
– start-page: 717
  year: 2012
  end-page: 724
  ident: b236
  article-title: Clustering and combined sampling approaches for multi-class imbalanced data classification
  publication-title: Advances in information technology and industry applications
– volume: 121
  year: 2022
  ident: b60
  article-title: An ensemble of pre-trained transformer models for imbalanced multiclass malware classification
  publication-title: Computers & Security
– volume: 19
  year: 2006
  ident: b19
  article-title: Adaboost is consistent
  publication-title: Advances in Neural Information Processing Systems
– volume: 31
  start-page: 513
  year: 2016
  end-page: 531
  ident: b59
  article-title: Boosting in Cox regression: a comparison between the likelihood-based and the model-based approaches with focus on the R-packages CoxBoost and mboost
  publication-title: Computational Statistics
– volume: 22
  start-page: 1474
  year: 2022
  end-page: 1485
  ident: b304
  article-title: Dual-attention generative adversarial networks for fault diagnosis under the class-imbalanced conditions
  publication-title: IEEE Sensors Journal
– volume: 6
  start-page: 1
  year: 2019
  end-page: 54
  ident: b140
  article-title: Survey on deep learning with class imbalance
  publication-title: Journal of Big Data
– volume: 11
  start-page: 111
  year: 2019
  end-page: 118
  ident: b218
  article-title: Machine learning: applications of artificial intelligence to imaging and diagnosis
  publication-title: Biophysical Reviews
– volume: 195
  year: 2020
  ident: b240
  article-title: A GAN-based image synthesis method for skin lesion classification
  publication-title: Computer Methods and Programs in Biomedicine
– start-page: 1
  year: 2019
  end-page: 6
  ident: b345
  article-title: Detecting and simulating artifacts in GAN fake images
  publication-title: 2019 IEEE international workshop on information forensics and security (WIFS)
– volume: 73
  start-page: 220
  year: 2017
  end-page: 239
  ident: b104
  article-title: Learning from class-imbalanced data: Review of methods and applications
  publication-title: Expert Systems with Applications
– volume: 13
  start-page: 373
  year: 2023
  ident: b336
  article-title: Multi-modal stacking ensemble for the diagnosis of cardiovascular diseases
  publication-title: Journal of Personalized Medicine
– volume: 21
  start-page: 5485
  year: 2020
  end-page: 5551
  ident: b243
  article-title: Exploring the limits of transfer learning with a unified text-to-text transformer
  publication-title: Journal of Machine Learning Research
– volume: 15
  year: 2019
  ident: b9
  article-title: A Random Forest based predictor for medical data classification using feature ranking
  publication-title: Informatics in Medicine Unlocked
– start-page: 232
  year: 2020
  end-page: 236
  ident: b97
  article-title: Credit card fraud detection using lightgbm model
  publication-title: 2020 international conference on E-commerce and internet technology (ECIT)
– start-page: 175
  year: 2019
  end-page: 183
  ident: b139
  article-title: Deep learning and data sampling with imbalanced big data
  publication-title: 2019 IEEE 20th international conference on information reuse and integration for data science (IRI)
– volume: 70
  start-page: 1
  year: 2021
  end-page: 17
  ident: b170
  article-title: A novel method for imbalanced fault diagnosis of rotating machinery based on generative adversarial networks
  publication-title: IEEE Transactions on Instrumentation and Measurement
– volume: 10
  start-page: 48890
  year: 2022
  end-page: 48903
  ident: b100
  article-title: A security model based on LightGBM and transformer to protect healthcare systems from cyberattacks
  publication-title: IEEE Access
– volume: 5
  start-page: 1
  year: 2015
  ident: b123
  article-title: A review on evaluation metrics for data classification evaluations
  publication-title: International Journal of Data Mining & Knowledge Management Process
– volume: 584
  start-page: 50
  year: 2022
  end-page: 64
  ident: b39
  article-title: A new clustering mining algorithm for multi-source imbalanced location data
  publication-title: Information Sciences
– volume: 2019
  year: 2019
  ident: b341
  article-title: A lightGBM-based EEG analysis method for driver mental states classification
  publication-title: Computational Intelligence and Neuroscience
– volume: 2021
  year: 2021
  ident: b265
  article-title: Application of gradient boosting machine learning algorithms to predict uniaxial compressive strength of soft sedimentary rocks at Thar Coalfield
  publication-title: Advances in Civil Engineering
– year: 2010
  ident: b253
  article-title: Pattern classification using ensemble methods, Vol. 75
– volume: 9
  start-page: 62
  year: 2014
  end-page: 74
  ident: b354
  article-title: Big data opportunities and challenges: Discussions from data analytics perspectives [discussion forum]
  publication-title: IEEE Computational Intelligence Magazine
– volume: 91
  start-page: 216
  year: 2019
  end-page: 231
  ident: b191
  article-title: The impact of class imbalance in classification performance metrics based on the binary confusion matrix
  publication-title: Pattern Recognition
– volume: 38
  start-page: 577
  year: 2008
  end-page: 583
  ident: b126
  article-title: Adaboost-based algorithm for network intrusion detection
  publication-title: IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics)
– volume: 4
  start-page: 234
  year: 2014
  end-page: 267
  ident: b259
  article-title: Support vector machines in engineering: an overview
  publication-title: Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
– start-page: 378
  year: 2013
  end-page: 389
  ident: b291
  article-title: SMOTE for regression
  publication-title: Portuguese conference on artificial intelligence
– volume: 409
  start-page: 17
  year: 2017
  end-page: 26
  ident: b177
  article-title: Clustering-based undersampling in class-imbalanced data
  publication-title: Information Sciences
– year: 1995
  ident: b214
  article-title: Abalone
– year: 2018
  ident: b63
  article-title: Bert: Pre-training of deep bidirectional transformers for language understanding
– volume: 375
  start-page: 613
  year: 2009
  end-page: 626
  ident: b53
  article-title: Ensemble flood forecasting: A review
  publication-title: Journal of Hydrology
– volume: 114
  start-page: 24
  year: 2016
  end-page: 31
  ident: b23
  article-title: Random forest in remote sensing: A review of applications and future directions
  publication-title: ISPRS Journal of Photogrammetry and Remote Sensing
– start-page: 1
  year: 2014
  end-page: 10
  ident: b252
  article-title: Preliminary comparison of techniques for dealing with imbalance in software defect prediction
  publication-title: Proceedings of the 18th international conference on evaluation and assessment in software engineering
– volume: 257
  year: 2022
  ident: b57
  article-title: Class-imbalanced positive instances augmentation via three-line hybrid
  publication-title: Knowledge-Based Systems
– volume: 1
  start-page: 81
  year: 1986
  end-page: 106
  ident: b241
  article-title: Induction of decision trees
  publication-title: Machine Learning
– volume: 13
  start-page: 17
  year: 2020
  ident: b232
  article-title: A grey-box ensemble model exploiting black-box accuracy and white-box intrinsic interpretability
  publication-title: Algorithms
– volume: 8
  start-page: 195741
  year: 2020
  end-page: 195751
  ident: b128
  article-title: A novel wireless network intrusion detection method based on adaptive synthetic sampling and an improved convolutional neural network
  publication-title: IEEE Access
– volume: 158
  start-page: 1533
  year: 2018
  end-page: 1543
  ident: b292
  article-title: Gradient boosting machine for modeling the energy consumption of commercial buildings
  publication-title: Energy and Buildings
– volume: 32
  year: 2019
  ident: b332
  article-title: Modeling tabular data using conditional GAN
  publication-title: Advances in Neural Information Processing Systems
– volume: 61
  start-page: 863
  year: 2018
  end-page: 905
  ident: b83
  article-title: SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary
  publication-title: Journal of Artificial Intelligence Research
– start-page: 1
  year: 2021
  end-page: 41
  ident: b4
  article-title: Transformer models for text-based emotion detection: a review of BERT-based approaches
  publication-title: Artificial Intelligence Review
– start-page: 805
  year: 2016
  end-page: 808
  ident: b208
  article-title: Distributional random oversampling for imbalanced text classification
  publication-title: Proceedings of the 39th international ACM SIGIR conference on research and development in information retrieval
– volume: 558
  year: 2023
  ident: b17
  article-title: Gradient boosting Bayesian neural networks via Langevin MCMC
  publication-title: Neurocomputing
– volume: 82
  start-page: 329
  year: 2014
  end-page: 348
  ident: b188
  article-title: Fifty years of classification and regression trees
  publication-title: International Statistical Review
– volume: 21
  start-page: 3093
  year: 2002
  end-page: 3106
  ident: b80
  article-title: Estimation of the area under the ROC curve
  publication-title: Statistics in Medicine
– year: 2022
  ident: b66
  article-title: Data augmentation for deep graph learning: A survey
– start-page: 70
  year: 2018
  end-page: 79
  ident: b112
  article-title: The effects of random undersampling with simulated class imbalance for big data
  publication-title: 2018 IEEE international conference on information reuse and integration (IRI)
– start-page: 39
  year: 2004
  end-page: 50
  ident: b7
  article-title: Applying support vector machines to imbalanced datasets
  publication-title: European conference on machine learning
– start-page: 13
  year: 2009
  end-page: 17
  ident: b127
  article-title: MSMOTE: Improving classification performance when training data is imbalanced
  publication-title: 2009 second international workshop on computer science and engineering, Vol. 2
– year: 2017
  ident: b317
  article-title: Electricity consumption prediction using XGBoost based on discrete wavelet transform
  publication-title: DEStech Transactions on Computer Science and Engineering
– volume: 45
  start-page: 5
  year: 2001
  end-page: 32
  ident: b35
  article-title: Random forests
  publication-title: Machine Learning
– start-page: 72
  year: 2017
  end-page: 78
  ident: b207
  article-title: Review of random forest classification techniques to resolve data imbalance
  publication-title: 2017 1st international conference on intelligent systems and information management (ICISIM)
– year: 2022
  ident: b163
  article-title: Data augmentation approaches in natural language processing: A survey
  publication-title: AI Open
– volume: 15
  start-page: 41
  year: 2018
  end-page: 51
  ident: b130
  article-title: Applications of support vector machine (SVM) learning in cancer genomics
  publication-title: Cancer Genomics & Proteomics
– volume: 16
  year: 2021
  ident: b268
  article-title: A soft voting ensemble classifier for early prediction and diagnosis of occurrences of major adverse cardiovascular events for STEMI and NSTEMI during 2-year follow-up in patients with acute coronary syndrome
  publication-title: PLoS One
– start-page: 1
  year: 2015
  end-page: 4
  ident: b50
  article-title: Xgboost: extreme gradient boosting
– volume: 7
  year: 2018
  ident: b161
  article-title: Monthly housing rent forecast based on lightgbm (light gradient boosting) model
  publication-title: International Journal of Intelligent Information and Management Science
– start-page: 399
  year: 2015
  end-page: 404
  ident: b92
  article-title: Hybrid ensemble of classifiers using voting
  publication-title: 2015 international conference on green computing and Internet of Things (ICGCIoT)
– volume: 80
  start-page: 79
  year: 2016
  end-page: 94
  ident: b202
  article-title: Meta-cognitive online sequential extreme learning machine for imbalanced and concept-drifting data classification
  publication-title: Neural Networks
– volume: 547
  start-page: 777
  year: 2021
  end-page: 796
  ident: b133
  article-title: A distributed sensor-fault detection and diagnosis framework using machine learning
  publication-title: Information Sciences
– start-page: 480
  year: 2021
  end-page: 483
  ident: b269
  article-title: Machine learning model for sales forecasting by using XGBoost
  publication-title: 2021 IEEE international conference on consumer electronics and computer engineering (ICCECE)
– start-page: 1223
  year: 2021
  end-page: 1229
  ident: b273
  article-title: Prediction of liver disease using gradient boost machine learning techniques with feature scaling
  publication-title: 2021 5th international conference on computing methodologies and communication (ICCMC)
– volume: 54
  start-page: 1
  issue: 2
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b316
  article-title: Generative adversarial networks in computer vision: A survey and taxonomy
  publication-title: ACM Computing Surveys
– volume: 16
  start-page: 321
  year: 2002
  ident: 10.1016/j.eswa.2023.122778_b48
  article-title: SMOTE: synthetic minority over-sampling technique
  publication-title: Journal of Artificial Intelligence Research
  doi: 10.1613/jair.953
– start-page: 12299
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b51
  article-title: Pre-trained image processing transformer
– volume: 2
  start-page: 349
  issue: 3
  year: 2009
  ident: 10.1016/j.eswa.2023.122778_b114
  article-title: Multi-class adaboost
  publication-title: Statistics and its Interface
  doi: 10.4310/SII.2009.v2.n3.a8
– volume: 45
  start-page: 5
  issue: 1
  year: 2001
  ident: 10.1016/j.eswa.2023.122778_b35
  article-title: Random forests
  publication-title: Machine Learning
  doi: 10.1023/A:1010933404324
– start-page: 188
  year: 2010
  ident: 10.1016/j.eswa.2023.122778_b147
  article-title: A survey of recent trends in one class classification
– volume: 72
  start-page: 327
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b356
  article-title: Synthetic minority oversampling technique for multiclass imbalance problems
  publication-title: Pattern Recognition
  doi: 10.1016/j.patcog.2017.07.024
– volume: 22
  start-page: 1
  issue: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b184
  article-title: Solving the class imbalance problem using ensemble algorithm: application of screening for aortic dissection
  publication-title: BMC Medical Informatics and Decision Making
  doi: 10.1186/s12911-022-01821-w
– ident: 10.1016/j.eswa.2023.122778_b248
– start-page: 180
  year: 2000
  ident: 10.1016/j.eswa.2023.122778_b67
  article-title: MadaBoost: A modification of AdaBoost
– volume: 100
  start-page: 355
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b237
  article-title: Weighted-SMOTE: A modification to SMOTE for event classification in sodium cooled fast reactors
  publication-title: Progress in Nuclear Energy
  doi: 10.1016/j.pnucene.2017.07.015
– start-page: 717
  year: 2012
  ident: 10.1016/j.eswa.2023.122778_b236
  article-title: Clustering and combined sampling approaches for multi-class imbalanced data classification
– year: 2022
  ident: 10.1016/j.eswa.2023.122778_b267
  article-title: SMOTified-GAN for class imbalanced pattern classification problems
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2022.3158977
– volume: 2019
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b341
  article-title: A lightGBM-based EEG analysis method for driver mental states classification
  publication-title: Computational Intelligence and Neuroscience
  doi: 10.1155/2019/3761203
– volume: 21
  start-page: 2126
  issue: 9
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b169
  article-title: Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
  publication-title: Genetics in Medicine
  doi: 10.1038/s41436-019-0439-8
– volume: 73
  start-page: 914
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b47
  article-title: Application of eXtreme gradient boosting trees in the construction of credit risk assessment models for financial institutions
  publication-title: Applied Soft Computing
  doi: 10.1016/j.asoc.2018.09.029
– volume: 25
  year: 2012
  ident: 10.1016/j.eswa.2023.122778_b277
  article-title: Practical Bayesian optimization of machine learning algorithms
  publication-title: Advances in Neural Information Processing Systems
– volume: 9
  start-page: 62
  issue: 4
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b354
  article-title: Big data opportunities and challenges: Discussions from data analytics perspectives [discussion forum]
  publication-title: IEEE Computational Intelligence Magazine
  doi: 10.1109/MCI.2014.2350953
– volume: 114
  start-page: 24
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b23
  article-title: Random forest in remote sensing: A review of applications and future directions
  publication-title: ISPRS Journal of Photogrammetry and Remote Sensing
  doi: 10.1016/j.isprsjprs.2016.01.011
– start-page: 1163
  year: 2004
  ident: 10.1016/j.eswa.2023.122778_b278
  article-title: AdaBoost. RT: a boosting algorithm for regression problems
– volume: 50
  start-page: 419
  issue: 3
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b257
  article-title: Youden Index and optimal cut-point estimated from observations affected by a lower limit of detection
  publication-title: Biometrical Journal: Journal of Mathematical Methods in Biosciences
  doi: 10.1002/bimj.200710415
– volume: 24
  start-page: 1565
  issue: 12
  year: 2006
  ident: 10.1016/j.eswa.2023.122778_b220
  article-title: What is a support vector machine?
  publication-title: Nature biotechnology
  doi: 10.1038/nbt1206-1565
– volume: 93
  start-page: 3
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b61
  article-title: Redundancy-driven modified Tomek-link based undersampling: A solution to class imbalance
  publication-title: Pattern Recognition Letters
  doi: 10.1016/j.patrec.2016.10.006
– volume: 199
  start-page: 176
  issue: 2
  year: 2006
  ident: 10.1016/j.eswa.2023.122778_b206
  article-title: Predicting tree species presence and basal area in Utah: a comparison of stochastic gradient boosting, generalized additive models, and tree-based methods
  publication-title: Ecological Modelling
  doi: 10.1016/j.ecolmodel.2006.05.021
– volume: 26
  start-page: 445
  issue: 5
  year: 2002
  ident: 10.1016/j.eswa.2023.122778_b233
  article-title: Decision trees: an overview and their use in medicine
  publication-title: Journal of Medical Systems
  doi: 10.1023/A:1016409317640
– volume: 31
  start-page: 955
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b286
  article-title: An in-depth experimental study of anomaly detection using gradient boosted machine
  publication-title: Neural Computing and Applications
  doi: 10.1007/s00521-017-3128-z
– volume: 213
  year: 2023
  ident: 10.1016/j.eswa.2023.122778_b89
  article-title: Automatic grading of Diabetic macular edema based on end-to-end network
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.118835
– volume: 23
  start-page: 10755
  issue: 21
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b154
  article-title: TLUSBoost algorithm: a boosting solution for class imbalance problem
  publication-title: Soft Computing
  doi: 10.1007/s00500-018-3629-4
– volume: 14
  start-page: 415
  issue: 7
  year: 2023
  ident: 10.1016/j.eswa.2023.122778_b13
  article-title: Multi-class skin cancer classification using vision transformer networks and convolutional neural network-based pre-trained models
  publication-title: Information
  doi: 10.3390/info14070415
– start-page: 1
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b247
  article-title: Cusboost: Cluster-based under-sampling with boosting for imbalanced classification
– year: 2017
  ident: 10.1016/j.eswa.2023.122778_b73
– volume: 107
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b330
  article-title: A hybrid sampling algorithm combining M-SMOTE and ENN based on Random forest for medical imbalanced data
  publication-title: Journal of Biomedical Informatics
  doi: 10.1016/j.jbi.2020.103465
– start-page: 269
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b31
  article-title: Extending bagging for imbalanced data
– volume: 29
  start-page: 4802
  issue: 10
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b315
  article-title: A systematic study of online class imbalance learning with concept drift
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
  doi: 10.1109/TNNLS.2017.2771290
– start-page: 256
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b227
  article-title: A signature-based assistant random oversampling method for malware detection
– year: 2020
  ident: 10.1016/j.eswa.2023.122778_b242
– volume: 69
  start-page: 35
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b183
  article-title: Addressing the class imbalance problem in twitter spam detection using ensemble learning
  publication-title: Computers & Security
  doi: 10.1016/j.cose.2016.12.004
– volume: 409
  start-page: 17
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b177
  article-title: Clustering-based undersampling in class-imbalanced data
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2017.05.008
– start-page: 1111
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b195
  article-title: LightGBM: An effective decision tree gradient boosting method to predict customer loyalty in the finance industry
– volume: 22
  start-page: 1067
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b276
  article-title: Bankruptcy prediction using deep learning approach based on borderline SMOTE
  publication-title: Information Systems Frontiers
  doi: 10.1007/s10796-020-10031-6
– start-page: 660
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b302
  article-title: Learning to count with cnn boosting
– start-page: 1
  year: 2012
  ident: 10.1016/j.eswa.2023.122778_b339
  article-title: Sampling + reweighting: Boosting the performance of AdaBoost on imbalanced datasets
– volume: 19
  year: 2006
  ident: 10.1016/j.eswa.2023.122778_b19
  article-title: Adaboost is consistent
  publication-title: Advances in Neural Information Processing Systems
– volume: 29
  start-page: 45
  year: 1997
  ident: 10.1016/j.eswa.2023.122778_b25
  article-title: Online learning versus offline learning
  publication-title: Machine Learning
  doi: 10.1023/A:1007465907571
– volume: 73
  start-page: 3079
  issue: 16–18
  year: 2010
  ident: 10.1016/j.eswa.2023.122778_b93
  article-title: Edited AdaBoost by weighted kNN
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2010.06.024
– volume: 110
  start-page: 392
  issue: 2
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b160
  article-title: Artificial intelligence for unstructured healthcare data: application to coding of patient reporting of adverse drug reactions
  publication-title: Clinical Pharmacology & Therapeutics
  doi: 10.1002/cpt.2266
– volume: 16
  issue: 6
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b268
  article-title: A soft voting ensemble classifier for early prediction and diagnosis of occurrences of major adverse cardiovascular events for STEMI and NSTEMI during 2-year follow-up in patients with acute coronary syndrome
  publication-title: PLoS One
  doi: 10.1371/journal.pone.0249338
– start-page: 346
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b113
  article-title: Investigating random undersampling and feature selection on bioinformatics big data
– volume: 7
  issue: 6
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b161
  article-title: Monthly housing rent forecast based on lightgbm (light gradient boosting) model
  publication-title: International Journal of Intelligent Information and Management Science
– volume: 11
  start-page: 1
  issue: 1
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b146
  article-title: Predicting disease risks from highly imbalanced data using random forest
  publication-title: BMC Medical Informatics and Decision Making
  doi: 10.1186/1472-6947-11-51
– volume: 7
  start-page: 154096
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b189
  article-title: Black-box vs. white-box: Understanding their advantages and weaknesses from a practical point of view
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2019.2949286
– volume: 2
  start-page: 225
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b109
  article-title: Pre-trained models: Past, present and future
  publication-title: AI Open
  doi: 10.1016/j.aiopen.2021.08.002
– volume: 11
  start-page: 111
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b218
  article-title: Machine learning: applications of artificial intelligence to imaging and diagnosis
  publication-title: Biophysical Reviews
  doi: 10.1007/s12551-018-0449-9
– volume: 82
  start-page: 329
  issue: 3
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b188
  article-title: Fifty years of classification and regression trees
  publication-title: International Statistical Review
  doi: 10.1111/insr.12016
– year: 2020
  ident: 10.1016/j.eswa.2023.122778_b321
– volume: 13
  start-page: 21
  issue: 1
  year: 1967
  ident: 10.1016/j.eswa.2023.122778_b55
  article-title: Nearest neighbor pattern classification
  publication-title: IEEE Transactions on Information Theory
  doi: 10.1109/TIT.1967.1053964
– start-page: 148
  year: 1996
  ident: 10.1016/j.eswa.2023.122778_b87
  article-title: Experiments with a new boosting algorithm
– start-page: 468
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b308
  article-title: Malicious domain detection based on k-means and smote
– volume: 14
  start-page: 8707
  issue: 14
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b158
  article-title: XGBoost for imbalanced multiclass classification-based industrial internet of things intrusion detection systems
  publication-title: Sustainability
  doi: 10.3390/su14148707
– volume: 517
  start-page: 29
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b327
  article-title: SVM and KNN ensemble learning for traffic incident detection
  publication-title: Physica A. Statistical Mechanics and its Applications
  doi: 10.1016/j.physa.2018.10.060
– start-page: 1
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b50
– volume: 17
  start-page: 1411
  issue: 6
  year: 2006
  ident: 10.1016/j.eswa.2023.122778_b172
  article-title: A fast and accurate online sequential learning algorithm for feedforward networks
  publication-title: IEEE Transactions on Neural Networks
  doi: 10.1109/TNN.2006.880583
– volume: 207
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b150
  article-title: Improving the performance of machine learning models for early warning of harmful algal blooms using an adaptive synthetic sampling method
  publication-title: Water Research
  doi: 10.1016/j.watres.2021.117821
– volume: 6
  start-page: 429
  issue: 5
  year: 2002
  ident: 10.1016/j.eswa.2023.122778_b135
  article-title: The class imbalance problem: A systematic study
  publication-title: Intelligent Data Analysis
  doi: 10.3233/IDA-2002-6504
– volume: 9
  start-page: 8
  issue: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b136
  article-title: Implementation of stacking ensemble classifier for multi-class classification of COVID-19 vaccines topics on Twitter
  publication-title: Scientific Journal of Informatics
  doi: 10.15294/sji.v9i1.31648
– volume: 55
  start-page: 1
  issue: 2
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b254
  article-title: Tackling climate change with machine learning
  publication-title: ACM Computing Surveys
  doi: 10.1145/3485128
– volume: 4
  start-page: 184
  issue: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b165
  article-title: A hybrid XGBoost-MLP model for credit risk assessment on digital supply chain finance
  publication-title: Forecasting
  doi: 10.3390/forecast4010011
– volume: 62
  start-page: 961
  issue: 4
  year: 2006
  ident: 10.1016/j.eswa.2023.122778_b294
  article-title: Generalized additive modeling with implicit variable selection by likelihood-based boosting
  publication-title: Biometrics
  doi: 10.1111/j.1541-0420.2006.00578.x
– volume: 9
  issue: 11
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b322
  article-title: The emergence of deepfake technology: A review
  publication-title: Technology Innovation Management Review
  doi: 10.22215/timreview/1282
– start-page: 237
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b282
  article-title: Decision tree learning
– volume: 1
  start-page: 81
  issue: 1
  year: 1986
  ident: 10.1016/j.eswa.2023.122778_b241
  article-title: Induction of decision trees
  publication-title: Machine Learning
  doi: 10.1007/BF00116251
– volume: 28
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b84
  article-title: Precision-recall-gain curves: PR analysis done right
  publication-title: Advances in Neural Information Processing Systems
– volume: 25
  start-page: 289
  issue: 2
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b251
  article-title: Multiclass from binary: Expanding one-versus-all, one-versus-one and ecoc-based approaches
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
  doi: 10.1109/TNNLS.2013.2274735
– volume: 510
  start-page: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b217
  article-title: Evolutionary bagging for ensemble learning
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2022.08.055
– volume: 15
  start-page: 41
  issue: 1
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b130
  article-title: Applications of support vector machine (SVM) learning in cancer genomics
  publication-title: Cancer Genomics & Proteomics
– year: 2017
  ident: 10.1016/j.eswa.2023.122778_b245
– volume: 501
  start-page: 118
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b71
  article-title: Geometric SMOTE a geometrically enhanced drop-in replacement for SMOTE
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2019.06.007
– start-page: 18187
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b175
  article-title: Text to image generation with semantic-spatial aware GAN
– volume: 48
  start-page: 1623
  issue: 5
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b281
  article-title: A novel ensemble method for classifying imbalanced data
  publication-title: Pattern Recognition
  doi: 10.1016/j.patcog.2014.11.014
– start-page: 1
  year: 2012
  ident: 10.1016/j.eswa.2023.122778_b234
  article-title: Ensemble learning
– volume: 136
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b229
  article-title: Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis
  publication-title: Accident Analysis and Prevention
  doi: 10.1016/j.aap.2019.105405
– volume: 158
  start-page: 1533
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b292
  article-title: Gradient boosting machine for modeling the energy consumption of commercial buildings
  publication-title: Energy and Buildings
  doi: 10.1016/j.enbuild.2017.11.039
– start-page: 38
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b5
  article-title: Protecting world leaders against deep fakes
– volume: 9
  start-page: 130353
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b44
  article-title: Bayesian graph convolutional neural networks via tempered MCMC
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2021.3111898
– volume: 31
  start-page: 513
  issue: 2
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b59
  article-title: Boosting in Cox regression: a comparison between the likelihood-based and the model-based approaches with focus on the R-packages CoxBoost and mboost
  publication-title: Computational Statistics
  doi: 10.1007/s00180-015-0642-2
– volume: 8
  start-page: 1
  issue: 1
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b263
  article-title: A literature review on one-class classification and its potential applications in big data
  publication-title: Journal of Big Data
  doi: 10.1186/s40537-021-00514-x
– year: 2023
  ident: 10.1016/j.eswa.2023.122778_b143
  article-title: Cyclone trajectory and intensity prediction with uncertainty quantification using variational recurrent neural networks
  publication-title: Environmental Modelling & Software
  doi: 10.1016/j.envsoft.2023.105654
– volume: 17
  start-page: 267
  issue: 19
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b78
  article-title: Drug-target interaction prediction via class imbalance-aware ensemble learning
  publication-title: BMC Bioinformatics
– volume: 243
  start-page: 88
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b221
  article-title: Fast-CBUS: A fast clustering-based undersampling method for addressing the class imbalance problem
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2017.03.011
– volume: 45
  start-page: 12
  issue: 1
  year: 1994
  ident: 10.1016/j.eswa.2023.122778_b37
  article-title: The relationship between recall and precision
  publication-title: Journal of the American Society for Information Science
  doi: 10.1002/(SICI)1097-4571(199401)45:1<12::AID-ASI2>3.0.CO;2-L
– start-page: 232
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b97
  article-title: Credit card fraud detection using lightgbm model
– volume: 2
  start-page: 412
  issue: 5–6
  year: 2009
  ident: 10.1016/j.eswa.2023.122778_b119
  article-title: Roughly balanced bagging for imbalanced data
  publication-title: Statistical Analysis and Data Mining: The ASA Data Science Journal
  doi: 10.1002/sam.10061
– volume: 4
  start-page: 161
  issue: 2
  year: 1989
  ident: 10.1016/j.eswa.2023.122778_b296
  article-title: Incremental induction of decision trees
  publication-title: Machine Learning
  doi: 10.1023/A:1022699900025
– start-page: 1
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b131
  article-title: Network anomaly detection using lightgbm: A gradient boosting classifier
– start-page: 1
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b345
  article-title: Detecting and simulating artifacts in GAN fake images
– start-page: 161
  year: 2006
  ident: 10.1016/j.eswa.2023.122778_b42
  article-title: An empirical comparison of supervised learning algorithms
– volume: 39
  start-page: 539
  issue: 2
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b185
  article-title: Exploratory undersampling for class-imbalance learning
  publication-title: IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics)
– volume: 9
  start-page: 98
  issue: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b262
  article-title: The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey
  publication-title: Journal of Big Data
  doi: 10.1186/s40537-022-00648-6
– volume: 257
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b57
  article-title: Class-imbalanced positive instances augmentation via three-line hybrid
  publication-title: Knowledge-Based Systems
  doi: 10.1016/j.knosys.2022.109902
– volume: 8
  start-page: 1
  issue: 1
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b359
  article-title: Detecting web attacks using random undersampling and ensemble learners
  publication-title: Journal of Big Data
  doi: 10.1186/s40537-021-00460-8
– volume: 54
  start-page: 1937
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b27
  article-title: A comparative analysis of gradient boosting algorithms
  publication-title: Artificial Intelligence Review
  doi: 10.1007/s10462-020-09896-5
– start-page: 399
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b92
  article-title: Hybrid ensemble of classifiers using voting
– volume: 63
  start-page: 3
  issue: 1
  year: 2006
  ident: 10.1016/j.eswa.2023.122778_b99
  article-title: Extremely randomized trees
  publication-title: Machine Learning
  doi: 10.1007/s10994-006-6226-1
– start-page: 1390
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b297
  article-title: Multi-class AUC metrics and weighted alternatives
– start-page: 378
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b291
  article-title: SMOTE for regression
– volume: 11
  start-page: 1495
  issue: 9
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b349
  article-title: Coronary artery disease detection model based on class balancing methods and LightGBM algorithm
  publication-title: Electronics
  doi: 10.3390/electronics11091495
– volume: 7
  start-page: 149890
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b307
  article-title: Feature learning viewpoint of AdaBoost and a new algorithm
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2019.2947359
– volume: 11
  start-page: 2109
  year: 2010
  ident: 10.1016/j.eswa.2023.122778_b124
  article-title: Model-based boosting 2.0
  publication-title: Journal of Machine Learning Research
– start-page: 669
  year: 2001
  ident: 10.1016/j.eswa.2023.122778_b342
  article-title: A comparison of stacking with meta decision trees to bagging, boosting, and stacking with other methods
– volume: 5
  start-page: 241
  issue: 2
  year: 1992
  ident: 10.1016/j.eswa.2023.122778_b324
  article-title: Stacked generalization
  publication-title: Neural Networks
  doi: 10.1016/S0893-6080(05)80023-1
– volume: 45
  start-page: 203
  issue: 3
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b303
  article-title: Application of adaptive boosting (AdaBoost) in demand-driven acquisition (DDA) prediction: A machine-learning approach
  publication-title: The Journal of Academic Librarianship
  doi: 10.1016/j.acalib.2019.02.013
– volume: 22
  start-page: 1474
  issue: 2
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b304
  article-title: Dual-attention generative adversarial networks for fault diagnosis under the class-imbalanced conditions
  publication-title: IEEE Sensors Journal
  doi: 10.1109/JSEN.2021.3131166
– year: 2018
  ident: 10.1016/j.eswa.2023.122778_b63
– volume: 212
  year: 2023
  ident: 10.1016/j.eswa.2023.122778_b256
  article-title: An ensemble credit scoring model based on logistic regression with heterogeneous balancing and weighting effects
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.118732
– volume: 11
  start-page: 820
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b94
  article-title: Identification of orphan genes in unbalanced datasets based on ensemble learning
  publication-title: Frontiers in Genetics
  doi: 10.3389/fgene.2020.00820
– start-page: 4393
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b255
  article-title: Deep one-class classification
– volume: 12
  start-page: 374
  issue: 9
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b96
  article-title: A tweet sentiment classification approach using a hybrid stacked ensemble technique
  publication-title: Information
  doi: 10.3390/info12090374
– volume: 558
  year: 2023
  ident: 10.1016/j.eswa.2023.122778_b17
  article-title: Gradient boosting Bayesian neural networks via Langevin MCMC
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2023.126726
– start-page: 1
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b252
  article-title: Preliminary comparison of techniques for dealing with imbalance in software defect prediction
– volume: 15
  start-page: 15974
  issue: 7
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b264
  article-title: Smart city mobility application—gradient boosting trees for mobility prediction and analysis based on crowdsourced data
  publication-title: Sensors
  doi: 10.3390/s150715974
– volume: 2021
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b265
  article-title: Application of gradient boosting machine learning algorithms to predict uniaxial compressive strength of soft sedimentary rocks at Thar Coalfield
  publication-title: Advances in Civil Engineering
  doi: 10.1155/2021/2565488
– volume: 61
  start-page: 863
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b83
  article-title: SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary
  publication-title: Journal of Artificial Intelligence Research
  doi: 10.1613/jair.1.11192
– volume: 41
  start-page: 478
  issue: 2
  year: 2003
  ident: 10.1016/j.eswa.2023.122778_b235
  article-title: Forecasting volatility in financial markets: A review
  publication-title: Journal of Economic Literature
  doi: 10.1257/.41.2.478
– start-page: 608
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b287
  article-title: A customer churn prediction model based on XGBoost and MLP
– volume: 30
  issue: 5
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b164
  article-title: A comparative study of the class imbalance problem in Twitter spam detection
  publication-title: Concurrency and Computation: Practice and Experience
  doi: 10.1002/cpe.4281
– start-page: 1
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b4
  article-title: Transformer models for text-based emotion detection: a review of BERT-based approaches
  publication-title: Artificial Intelligence Review
– start-page: 3468
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b193
  article-title: Retrieval-based gradient boosting decision trees for disease risk assessment
– start-page: 125
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b62
  article-title: Fraud detection in credit card transactions by using classification algorithms
– start-page: 175
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b139
  article-title: Deep learning and data sampling with imbalanced big data
– start-page: 718
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b174
  article-title: Study of application of composite sampling and improved LightGBM algorithm to the diagnosis of unbalanced transformer fault samples
– volume: 158
  start-page: 81
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b28
  article-title: An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme
  publication-title: Knowledge-Based Systems
  doi: 10.1016/j.knosys.2018.05.037
– start-page: 72
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b207
  article-title: Review of random forest classification techniques to resolve data imbalance
– volume: 10
  start-page: 607
  issue: 6
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b338
  article-title: A survey on deepfake video detection
  publication-title: IET Biometrics
  doi: 10.1049/bme2.12031
– volume: 133
  start-page: 121
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b239
  article-title: Natural language processing was effective in assisting rapid title and abstract screening when updating systematic reviews
  publication-title: Journal of Clinical Epidemiology
  doi: 10.1016/j.jclinepi.2021.01.010
– volume: 38
  start-page: 577
  issue: 2
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b126
  article-title: Adaboost-based algorithm for network intrusion detection
  publication-title: IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics)
  doi: 10.1109/TSMCB.2007.914695
– start-page: 735
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b20
  article-title: A novel synthetic minority oversampling technique for imbalanced data set learning
– year: 2020
  ident: 10.1016/j.eswa.2023.122778_b129
  article-title: An optimized lightgbm model for fraud detection
– year: 2010
  ident: 10.1016/j.eswa.2023.122778_b253
– volume: 30
  start-page: 681
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b85
  article-title: GPT-3: Its nature, scope, limits, and consequences
  publication-title: Minds and Machines
  doi: 10.1007/s11023-020-09548-1
– year: 2020
  ident: 10.1016/j.eswa.2023.122778_b15
– volume: 18
  start-page: 1206
  issue: 3
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b138
  article-title: Data augmentation classifier for imbalanced fault classification
  publication-title: IEEE Transactions on Automation Science and Engineering
  doi: 10.1109/TASE.2020.2998467
– volume: 42
  start-page: 463
  issue: 4
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b90
  article-title: A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)
  doi: 10.1109/TSMCC.2011.2161285
– year: 2022
  ident: 10.1016/j.eswa.2023.122778_b46
  article-title: Distributed Bayesian optimisation framework for deep neuroevolution
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2021.10.045
– start-page: 1223
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b273
  article-title: Prediction of liver disease using gradient boost machine learning techniques with feature scaling
– volume: 5
  start-page: 1
  issue: 2
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b123
  article-title: A review on evaluation metrics for data classification evaluations
  publication-title: International Journal of Data Mining & Knowledge Management Process
  doi: 10.5121/ijdkp.2015.5201
– volume: 91
  start-page: 216
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b191
  article-title: The impact of class imbalance in classification performance metrics based on the binary confusion matrix
  publication-title: Pattern Recognition
  doi: 10.1016/j.patcog.2019.02.023
– volume: 86
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b250
  article-title: Ensemble approach based on bagging, boosting and stacking for short-term prediction in agribusiness time series
  publication-title: Applied Soft Computing
  doi: 10.1016/j.asoc.2019.105837
– volume: 12
  issue: 04
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b313
  article-title: Online class imbalance learning and its applications in fault detection
  publication-title: International Journal of Computational Intelligence and Applications
  doi: 10.1142/S1469026813400014
– start-page: 572
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b110
  article-title: Performance of catboost and xgboost in medicare fraud detection
– year: 2023
  ident: 10.1016/j.eswa.2023.122778_b258
  article-title: Explainable AI (XIA): A systematic meta-survey of current challenges and future opportunities
  publication-title: Knowledge-Based Systems
  doi: 10.1016/j.knosys.2023.110273
– start-page: 110
  year: 2002
  ident: 10.1016/j.eswa.2023.122778_b64
  article-title: Ensemble learning
– volume: 6
  start-page: 4641
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b357
  article-title: Class weights random forest algorithm for processing class imbalanced medical data
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2018.2789428
– volume: 195
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b186
  article-title: A two-stage hybrid credit risk prediction model based on XGBoost and graph-based deep neural network
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.116624
– volume: 9
  start-page: 86230
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b211
  article-title: Novel stock crisis prediction technique—a study on indian stock market
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2021.3088999
– start-page: 345
  year: 2005
  ident: 10.1016/j.eswa.2023.122778_b102
  article-title: A probabilistic interpretation of precision, recall and F-score, with implication for evaluation
– start-page: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b105
  article-title: Fraud detection in mobile payment systems using an XGBoost-based framework
  publication-title: Information Systems Frontiers
– volume: 4
  start-page: 234
  issue: 3
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b259
  article-title: Support vector machines in engineering: an overview
  publication-title: Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
– start-page: 84
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b311
– volume: 8
  issue: 7
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b319
  article-title: The role of balanced training and testing data sets for binary classifiers in bioinformatics
  publication-title: PLoS One
  doi: 10.1371/journal.pone.0067863
– year: 2014
  ident: 10.1016/j.eswa.2023.122778_b121
– start-page: 13622
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b194
  article-title: MUST-GAN: Multi-level statistics transfer for self-driven person image generation
– volume: 120
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b36
  article-title: Addressing class imbalance in deep learning for small lesion detection on medical images
  publication-title: Computers in Biology and Medicine
  doi: 10.1016/j.compbiomed.2020.103735
– start-page: 58
  year: 2004
  ident: 10.1016/j.eswa.2023.122778_b293
– volume: 16
  start-page: 114
  issue: 1
  year: 2005
  ident: 10.1016/j.eswa.2023.122778_b270
  article-title: Incremental training of support vector machines
  publication-title: IEEE Transactions on Neural Networks
  doi: 10.1109/TNN.2004.836201
– start-page: 1
  year: 2023
  ident: 10.1016/j.eswa.2023.122778_b106
  article-title: Speech emotion recognition and text sentiment analysis for financial distress prediction
  publication-title: Neural Computing and Applications
– start-page: 480
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b269
  article-title: Machine learning model for sales forecasting by using XGBoost
– start-page: 878
  year: 2005
  ident: 10.1016/j.eswa.2023.122778_b108
  article-title: Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning
– start-page: 39
  year: 2004
  ident: 10.1016/j.eswa.2023.122778_b7
  article-title: Applying support vector machines to imbalanced datasets
– start-page: 937
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b26
  article-title: Interpretable random forests via rule extraction
– year: 2022
  ident: 10.1016/j.eswa.2023.122778_b163
  article-title: Data augmentation approaches in natural language processing: A survey
  publication-title: AI Open
  doi: 10.1016/j.aiopen.2022.03.001
– start-page: 2523
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b348
  article-title: WOTBoost: Weighted oversampling technique in boosting for imbalanced learning
– volume: 584
  start-page: 50
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b39
  article-title: A new clustering mining algorithm for multi-source imbalanced location data
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2021.10.029
– volume: 80
  start-page: 79
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b202
  article-title: Meta-cognitive online sequential extreme learning machine for imbalanced and concept-drifting data classification
  publication-title: Neural Networks
  doi: 10.1016/j.neunet.2016.04.008
– volume: 6
  start-page: 1
  issue: 1
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b274
  article-title: A survey on image data augmentation for deep learning
  publication-title: Journal of Big Data
  doi: 10.1186/s40537-019-0197-0
– volume: 404
  start-page: 351
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b284
  article-title: AdaBoost-CNN: An adaptive boosting algorithm for convolutional neural networks to classify multi-class imbalanced datasets using transfer learning
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2020.03.064
– year: 2022
  ident: 10.1016/j.eswa.2023.122778_b199
  article-title: Imbalance example-dependent cost classification: A Bayesian based method
  publication-title: Expert Systems with Applications
– year: 2004
  ident: 10.1016/j.eswa.2023.122778_b289
– volume: 121
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b60
  article-title: An ensemble of pre-trained transformer models for imbalanced multiclass malware classification
  publication-title: Computers & Security
  doi: 10.1016/j.cose.2022.102846
– volume: 14
  start-page: 3547
  issue: 15
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b216
  article-title: Ensemble machine learning of Random Forest, AdaBoost and XGBoost for vertical total electron content forecasting
  publication-title: Remote Sensing
  doi: 10.3390/rs14153547
– start-page: 486
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b76
  article-title: A novel technique to solve class imbalance problem
– start-page: 3207
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b167
  article-title: Celeb-df: A large-scale challenging dataset for deepfake forensics
– year: 2007
  ident: 10.1016/j.eswa.2023.122778_b117
  article-title: Asymmetric gradient boosting with application to spam filtering
– start-page: 34
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b75
  article-title: LightGBM-RF: A hybrid model for anomaly detection in smart building
– volume: 23
  start-page: 69
  issue: 1
  year: 1996
  ident: 10.1016/j.eswa.2023.122778_b323
  article-title: Learning in the presence of concept drift and hidden contexts
  publication-title: Machine Learning
  doi: 10.1007/BF00116900
– start-page: 111
  year: 2000
  ident: 10.1016/j.eswa.2023.122778_b134
  article-title: The class imbalance problem: Significance and strategies
– volume: 7
  start-page: 9515
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b197
  article-title: Imbalanced fault diagnosis of rolling bearing based on generative adversarial network: A comparative study
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2018.2890693
– volume: 29
  start-page: 345
  issue: 3
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b148
  article-title: One-class classification: taxonomy of study and review of techniques
  publication-title: The Knowledge Engineering Review
  doi: 10.1017/S026988891300043X
– volume: 5
  start-page: 1
  issue: 1
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b159
  article-title: A survey on addressing high-class imbalance in big data
  publication-title: Journal of Big Data
  doi: 10.1186/s40537-018-0151-6
– volume: 7
  start-page: 93010
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b196
  article-title: An experimental study with imbalanced classification approaches for credit card fraud detection
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2019.2927266
– volume: 45
  start-page: 110
  issue: 1
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b10
  article-title: Bankruptcy forecasting: An empirical comparison of AdaBoost and neural networks
  publication-title: Decision Support Systems
  doi: 10.1016/j.dss.2007.12.002
– volume: 15
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b9
  article-title: A Random Forest based predictor for medical data classification using feature ranking
  publication-title: Informatics in Medicine Unlocked
  doi: 10.1016/j.imu.2019.100180
– volume: 12
  start-page: 51
  issue: 3
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b358
  article-title: Research on E-commerce customer churn prediction based on improved value model and XG-boost algorithm
  publication-title: Management Science and Engineering
– volume: 21
  start-page: 5485
  issue: 1
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b243
  article-title: Exploring the limits of transfer learning with a unified text-to-text transformer
  publication-title: Journal of Machine Learning Research
– volume: 465
  start-page: 1
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b72
  article-title: Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2018.06.056
– volume: 10
  start-page: 99129
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b200
  article-title: A survey of ensemble learning: Concepts, algorithms, applications, and prospects
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2022.3207287
– year: 2021
  ident: 10.1016/j.eswa.2023.122778_b266
– volume: 572
  start-page: 574
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b331
  article-title: A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2021.02.056
– volume: 46
  start-page: 109
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b32
  article-title: Recommender systems survey
  publication-title: Knowledge-Based Systems
  doi: 10.1016/j.knosys.2013.03.012
– volume: 12
  start-page: 189
  issue: 1
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b198
  article-title: Application of adaboost algorithm in basketball player detection
  publication-title: Acta Polytechnica Hungarica
– start-page: 181
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b337
  article-title: Speech recognition based on concatenated acoustic feature and lightGBM model
– volume: 67
  start-page: 105
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b326
  article-title: ForesTexter: An efficient random forest algorithm for imbalanced text categorization
  publication-title: Knowledge-Based Systems
  doi: 10.1016/j.knosys.2014.06.004
– year: 2018
  ident: 10.1016/j.eswa.2023.122778_b69
– start-page: 952
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b333
  article-title: Anomaly detection based on histogram methodology and factor analysis using LightGBM for cooling systems
– volume: 30
  start-page: 1145
  issue: 7
  year: 1997
  ident: 10.1016/j.eswa.2023.122778_b34
  article-title: The use of the area under the ROC curve in the evaluation of machine learning algorithms
  publication-title: Pattern Recognition
  doi: 10.1016/S0031-3203(96)00142-2
– volume: 133
  start-page: 433
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b334
  article-title: Real-time condition monitoring and fault detection of components based on machine-learning reconstruction model
  publication-title: Renewable Energy
  doi: 10.1016/j.renene.2018.10.062
– start-page: 201
  year: 2002
  ident: 10.1016/j.eswa.2023.122778_b74
  article-title: Stacking with multi-response model trees
– volume: 63
  start-page: 139
  issue: 11
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b101
  article-title: Generative adversarial networks
  publication-title: Communications of the ACM
  doi: 10.1145/3422622
– start-page: 160
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b328
  article-title: Speaker recognition system with limited data based on LightGBM and fusion features
– volume: 38
  start-page: 2395
  issue: 3
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b213
  article-title: Reduced Reward-punishment editing for building ensembles of classifiers
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2010.08.028
– volume: 21
  start-page: 785
  issue: 5
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b166
  article-title: AdaBoost with SVM-based component classifiers
  publication-title: Engineering Applications of Artificial Intelligence
  doi: 10.1016/j.engappai.2007.07.001
– start-page: 49
  year: 2002
  ident: 10.1016/j.eswa.2023.122778_b16
  article-title: Online handwriting recognition with support vector machines-a kernel approach
– volume: 459
  start-page: 249
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b122
  article-title: Online learning: A comprehensive survey
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2021.04.112
– start-page: 1322
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b116
  article-title: ADASYN: Adaptive synthetic sampling approach for imbalanced learning
– volume: 13
  start-page: 373
  issue: 2
  year: 2023
  ident: 10.1016/j.eswa.2023.122778_b336
  article-title: Multi-modal stacking ensemble for the diagnosis of cardiovascular diseases
  publication-title: Journal of Personalized Medicine
  doi: 10.3390/jpm13020373
– volume: 7
  start-page: 150960
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b52
  article-title: Prediction of extubation failure for intensive care unit patients using light gradient boosting machine
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2019.2946980
– volume: 32
  start-page: 1971
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b168
  article-title: Research on orthopedic auxiliary classification and prediction model based on XGBoost algorithm
  publication-title: Neural Computing and Applications
  doi: 10.1007/s00521-019-04378-4
– start-page: 91
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b125
  article-title: Short paper: Credit card fraud detection using LightGBM with asymmetric error control
– volume: 602
  start-page: 259
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b310
  article-title: Corporate finance risk prediction based on LightGBM
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2022.04.058
– volume: 547
  start-page: 777
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b133
  article-title: A distributed sensor-fault detection and diagnosis framework using machine learning
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2020.08.068
– start-page: 1
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b18
  article-title: Performance analysis of regression algorithms and feature selection techniques to predict PM 2.5 in smart cities
  publication-title: International Journal of Systems Assurance Engineering and Management
– volume: 199
  start-page: 1128
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b306
  article-title: Research on personal credit risk evaluation based on XGBoost
  publication-title: Procedia Computer Science
  doi: 10.1016/j.procs.2022.01.143
– volume: 2
  start-page: 268
  issue: 4
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b111
  article-title: Gradient boosted decision tree algorithms for medicare fraud detection
  publication-title: SN Computer Science
  doi: 10.1007/s42979-021-00655-z
– volume: 45
  start-page: 3738
  issue: 10
  year: 2012
  ident: 10.1016/j.eswa.2023.122778_b285
  article-title: Inverse random under sampling for class imbalance problem and its application to multi-label classification
  publication-title: Pattern Recognition
  doi: 10.1016/j.patcog.2012.03.014
– year: 2012
  ident: 10.1016/j.eswa.2023.122778_b340
– volume: 20
  start-page: 273
  issue: 3
  year: 1995
  ident: 10.1016/j.eswa.2023.122778_b54
  article-title: Support-vector networks
  publication-title: Machine Learning
  doi: 10.1007/BF00994018
– volume: 17
  start-page: 641
  issue: 3
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b70
  article-title: Improved landslide assessment using support vector machine with bagging, boosting, and stacking ensemble machine learning framework in a mountainous watershed, Japan
  publication-title: Landslides
  doi: 10.1007/s10346-019-01286-5
– volume: 73
  start-page: 220
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b104
  article-title: Learning from class-imbalanced data: Review of methods and applications
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2016.12.035
– volume: 70
  start-page: 1125
  issue: 4
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b352
  article-title: Protein classification with imbalanced data
  publication-title: Proteins: Structure, Function, and Bioinformatics
  doi: 10.1002/prot.21870
– volume: 19
  start-page: 2632
  issue: 5
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b219
  article-title: A novel method for Identification of Glutarylation sites combining Borderline-SMOTE with Tomek links technique in imbalanced data
  publication-title: IEEE/ACM Transactions on Computational Biology and Bioinformatics
  doi: 10.1109/TCBB.2021.3095482
– start-page: 32
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b176
  article-title: Sentiment analysis of e-commerce customer reviews based on natural language processing
– start-page: 2118
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b314
  article-title: Dealing with multiple classes in online class imbalance learning
– volume: 3
  start-page: 19
  issue: 1
  year: 2000
  ident: 10.1016/j.eswa.2023.122778_b58
  article-title: Nearest neighbour editing and condensing tools–synergy exploitation
  publication-title: Pattern Analysis & Applications
  doi: 10.1007/s100440050003
– year: 2021
  ident: 10.1016/j.eswa.2023.122778_b82
– volume: 15
  start-page: 607
  issue: 4
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b98
  article-title: Very high resolution object-based land use–land cover urban classification using extreme gradient boosting
  publication-title: IEEE Geoscience and Remote Sensing Letters
  doi: 10.1109/LGRS.2018.2803259
– volume: 19
  issue: 6
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b171
  article-title: Improved PSO AdaBoost ensemble algorithm for imbalanced data
  publication-title: Sensors
  doi: 10.3390/s19061476
– year: 2012
  ident: 10.1016/j.eswa.2023.122778_b353
– start-page: 810
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b244
  article-title: Data augmentation for low resource languages
– volume: 6
  start-page: 45
  issue: 2
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b22
  article-title: Estimating and forecasting conditional risk measures with extreme value theory: a review
  publication-title: Risks
  doi: 10.3390/risks6020045
– start-page: 278
  year: 1995
  ident: 10.1016/j.eswa.2023.122778_b120
  article-title: Random decision forests
– volume: 152
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b347
  article-title: Machinery fault diagnosis with imbalanced data using deep generative adversarial networks
  publication-title: Measurement
  doi: 10.1016/j.measurement.2019.107377
– start-page: 1
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b24
  article-title: Comparison of ensemble learning methods applied to network intrusion detection
– volume: 58
  start-page: 308
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b343
  article-title: A gradient boosting method to improve travel time prediction
  publication-title: Transportation Research Part C (Emerging Technologies)
  doi: 10.1016/j.trc.2015.02.019
– volume: 13
  start-page: 17
  issue: 1
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b232
  article-title: A grey-box ensemble model exploiting black-box accuracy and white-box intrinsic interpretability
  publication-title: Algorithms
  doi: 10.3390/a13010017
– volume: 12
  start-page: 2825
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b230
  article-title: Scikit-learn: Machine learning in Python
  publication-title: Journal of Machine Learning Research
– year: 2018
  ident: 10.1016/j.eswa.2023.122778_b226
  article-title: Application of XGBoost algorithm in hourly PM2. 5 concentration prediction
– volume: 22
  start-page: 4664
  issue: 12
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b351
  article-title: Real-time object tracking via online discriminative feature selection
  publication-title: IEEE Transactions on Image Processing
  doi: 10.1109/TIP.2013.2277800
– start-page: 362
  year: 1999
  ident: 10.1016/j.eswa.2023.122778_b79
  article-title: The application of AdaBoost for distributed, scalable and on-line learning
– start-page: 150
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b173
  article-title: Product marketing prediction based on XGboost and LightGBM algorithm
– start-page: 4055
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b228
  article-title: Image transformer
– start-page: 593
  year: 2012
  ident: 10.1016/j.eswa.2023.122778_b283
  article-title: Application of bagging, boosting and stacking to intrusion detection
– year: 2017
  ident: 10.1016/j.eswa.2023.122778_b317
  article-title: Electricity consumption prediction using XGBoost based on discrete wavelet transform
  publication-title: DEStech Transactions on Computer Science and Engineering
– volume: 20
  start-page: 1
  issue: 1
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b115
  article-title: Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences
  publication-title: BMC Medical Informatics and Decision Making
  doi: 10.1186/s12911-020-01201-2
– year: 2020
  ident: 10.1016/j.eswa.2023.122778_b155
– volume: 8
  start-page: 181
  issue: 2
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b3
  article-title: A stacking-based ensemble learning method for outlier detection
  publication-title: Balkan Journal of Electrical and Computer Engineering
  doi: 10.17694/bajece.679662
– volume: 129
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b43
  article-title: Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
  publication-title: Applied Soft Computing
  doi: 10.1016/j.asoc.2022.109588
– start-page: 231
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b179
  article-title: Cost-sensitive learning and the class imbalance problem
– year: 2022
  ident: 10.1016/j.eswa.2023.122778_b309
  article-title: Pre-trained language models and their applications
  publication-title: Engineering
– volume: 25
  start-page: 197
  issue: 2
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b29
  article-title: A random forest guided tour
  publication-title: Test
  doi: 10.1007/s11749-016-0481-7
– volume: 149
  start-page: 316
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b203
  article-title: Ensemble of subset online sequential extreme learning machine for class imbalance and concept drift
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2014.03.075
– year: 2014
  ident: 10.1016/j.eswa.2023.122778_b151
– start-page: 1
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b38
  article-title: MUTE: Majority under-sampling technique
– volume: 17
  start-page: 2131
  issue: 6
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b222
  article-title: XGBoost model for chronic kidney disease diagnosis
  publication-title: IEEE/ACM Transactions on Computational Biology and Bioinformatics
  doi: 10.1109/TCBB.2019.2911071
– volume: 158
  start-page: 48
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b212
  article-title: Coupling different methods for overcoming the class imbalance problem
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2015.01.068
– start-page: 1189
  year: 2001
  ident: 10.1016/j.eswa.2023.122778_b88
  article-title: Greedy function approximation: a gradient boosting machine
  publication-title: Annals of Statistics
– start-page: 205
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b190
  article-title: Image generation from sketch constraint using contextual GAN
– volume: 11
  year: 1998
  ident: 10.1016/j.eswa.2023.122778_b246
  article-title: Regularizing adaboost
  publication-title: Advances in Neural Information Processing Systems
– volume: 34
  start-page: 719
  issue: 5
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b181
  article-title: Early prediction of incident liver disease using conventional risk factors and gut-microbiome-augmented gradient boosting
  publication-title: Cell Metabolism
  doi: 10.1016/j.cmet.2022.03.002
– volume: 2
  start-page: 1
  issue: 3
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b299
  article-title: Application of gradient boosting algorithms for anti-money laundering in cryptocurrencies
  publication-title: SN Computer Science
  doi: 10.1007/s42979-021-00558-z
– volume: 32
  start-page: 13
  issue: 1
  year: 2004
  ident: 10.1016/j.eswa.2023.122778_b137
  article-title: Process consistency for adaboost
  publication-title: The Annals of Statistics
  doi: 10.1214/aos/1079120128
– volume: 2017
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b312
  article-title: A novel ensemble method for imbalanced data learning: bagging of extrapolation-SMOTE SVM
  publication-title: Computational Intelligence and Neuroscience
  doi: 10.1155/2017/1827016
– volume: 375
  start-page: 613
  issue: 3–4
  year: 2009
  ident: 10.1016/j.eswa.2023.122778_b53
  article-title: Ensemble flood forecasting: A review
  publication-title: Journal of Hydrology
  doi: 10.1016/j.jhydrol.2009.06.005
– volume: 30
  start-page: 916
  issue: 3
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b192
  article-title: Class imbalance in gradient boosting classification algorithms: Application to experimental stroke data
  publication-title: Statistical Methods in Medical Research
  doi: 10.1177/0962280220980484
– ident: 10.1016/j.eswa.2023.122778_b301
– start-page: 1371
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b350
  article-title: Machine learning in rock facies classification: An application of XGBoost
– volume: 149
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b279
  article-title: Multi-label classification of fundus images with graph convolutional network and LightGBM
  publication-title: Computers in Biology and Medicine
  doi: 10.1016/j.compbiomed.2022.105909
– volume: 35
  start-page: 53
  issue: 1
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b56
  article-title: Generative adversarial networks: An overview
  publication-title: IEEE Signal Processing Magazine
  doi: 10.1109/MSP.2017.2765202
– year: 2021
  ident: 10.1016/j.eswa.2023.122778_b204
  article-title: DTCDWT-SMOTE-XGBoost-based islanding detection for distributed generation systems: An approach of class-imbalanced issue
  publication-title: IEEE Systems Journal
– volume: 106
  start-page: 251
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b346
  article-title: Empowering one-vs-one decomposition with ensemble learning for multi-class imbalanced data
  publication-title: Knowledge-Based Systems
  doi: 10.1016/j.knosys.2016.05.048
– volume: 37
  start-page: 587
  issue: 2
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b33
  article-title: Kaggle forecasting competitions: An overlooked learning opportunity
  publication-title: International Journal of Forecasting
  doi: 10.1016/j.ijforecast.2020.07.007
– start-page: 1
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b81
  article-title: Efficient sampling techniques for ensemble learning and diagnosing bearing defects under class imbalanced condition
– start-page: 70
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b112
  article-title: The effects of random undersampling with simulated class imbalance for big data
– start-page: 31
  year: 2004
  ident: 10.1016/j.eswa.2023.122778_b224
  article-title: Aveboost2: Boosting for noisy data
– volume: 1
  start-page: 14
  issue: 1
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b187
  article-title: Classification and regression trees
  publication-title: Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
– volume: 7
  issue: 3
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b325
  article-title: Ensemble flood forecasting: Current status and future opportunities
  publication-title: Wiley Interdisciplinary Reviews: Water
– volume: 34
  start-page: 3463
  issue: 6
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b260
  article-title: Feature ranking for multi-fault diagnosis of rotating machinery by using random forest and KNN
  publication-title: Journal of Intelligent & Fuzzy Systems
  doi: 10.3233/JIFS-169526
– start-page: 505
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b118
  article-title: One-class classification by combining density and class probability estimation
– volume: 10
  start-page: 749
  issue: 4
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b180
  article-title: Predictive classifier for cardiovascular disease based on stacking model fusion
  publication-title: Processes
  doi: 10.3390/pr10040749
– volume: 2019
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b201
  article-title: Light gbm machine learning algorithm to online click fraud detection
  publication-title: Journal of Information Assurance & Cybersecurity
– volume: 10
  start-page: 1
  issue: 1
  year: 2001
  ident: 10.1016/j.eswa.2023.122778_b298
  article-title: The art of data augmentation
  publication-title: Journal of Computational and Graphical Statistics
  doi: 10.1198/10618600152418584
– start-page: 352
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b6
  article-title: LastResort at SemEval-2022 task 4: Towards patronizing and condescending language detection using pre-trained transformer based models ensembles
– volume: 26
  start-page: 1011
  issue: 9
  year: 2008
  ident: 10.1016/j.eswa.2023.122778_b152
  article-title: What are decision trees ?
  publication-title: Nature biotechnology
  doi: 10.1038/nbt0908-1011
– volume: 6
  start-page: 1
  issue: 1
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b140
  article-title: Survey on deep learning with class imbalance
  publication-title: Journal of Big Data
  doi: 10.1186/s40537-019-0192-5
– volume: 18
  issue: 6
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b344
  article-title: Research and application of XGBoost in imbalanced data
  publication-title: International Journal of Distributed Sensor Networks
  doi: 10.1177/15501329221106935
– start-page: 734
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b107
  article-title: GAN-based synthetic brain MR image generation
– volume: 30
  start-page: 2163
  issue: 7
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b14
  article-title: Biased random forest for dealing with the class imbalance problem
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
  doi: 10.1109/TNNLS.2018.2878400
– year: 2022
  ident: 10.1016/j.eswa.2023.122778_b66
– year: 2002
  ident: 10.1016/j.eswa.2023.122778_b288
– volume: 30
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b145
  article-title: What uncertainties do we need in bayesian deep learning for computer vision?
  publication-title: Advances in Neural Information Processing Systems
– volume: 32
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b280
  article-title: A novel cryptocurrency price trend forecasting model based on LightGBM
  publication-title: Finance Research Letters
  doi: 10.1016/j.frl.2018.12.032
– volume: 39
  start-page: 261
  issue: 4
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b153
  article-title: Decision trees: a recent overview
  publication-title: Artificial Intelligence Review
  doi: 10.1007/s10462-011-9272-4
– volume: 105
  start-page: 2499
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b162
  article-title: Application of the borderline-SMOTE method in susceptibility assessments of debris flows in Pinggu District, Beijing, China
  publication-title: Natural Hazards
  doi: 10.1007/s11069-020-04409-7
– volume: 7
  start-page: 21
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b215
  article-title: Gradient boosting machines, a tutorial
  publication-title: Frontiers in Neurorobotics
  doi: 10.3389/fnbot.2013.00021
– volume: 51
  start-page: 1
  issue: 3
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b249
  article-title: A survey of random forest based methods for intrusion detection systems
  publication-title: ACM Computing Surveys
  doi: 10.1145/3178582
– volume: 16
  start-page: 449
  issue: 4
  year: 2013
  ident: 10.1016/j.eswa.2023.122778_b320
  article-title: Effective detection of sophisticated online banking fraud on extremely imbalanced data
  publication-title: World Wide Web
  doi: 10.1007/s11280-012-0178-0
– volume: 6
  start-page: 448
  issue: 6
  year: 1976
  ident: 10.1016/j.eswa.2023.122778_b290
  article-title: An experiment with the edited nearest-neighbor rule
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics
– start-page: 805
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b208
  article-title: Distributional random oversampling for imbalanced text classification
– volume: 177
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b329
  article-title: A predictive model of recreational water quality based on adaptive synthetic sampling algorithms and machine learning
  publication-title: Water Research
  doi: 10.1016/j.watres.2020.115788
– volume: 53
  start-page: 1
  issue: 5
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b318
  article-title: A survey on Bayesian deep learning
  publication-title: ACM Computing Surveys
– start-page: 7383
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b12
  article-title: Do not have enough data? Deep learning to the rescue!
– volume: 27
  start-page: 1947
  issue: 9
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b231
  article-title: Oversampling the minority class in the feature space
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
  doi: 10.1109/TNNLS.2015.2461436
– year: 2020
  ident: 10.1016/j.eswa.2023.122778_b182
  article-title: Early prediction of liver disease using conventional risk factors and gut microbiome-augmented gradient boosting
  publication-title: MedRxiv
– volume: 131
  start-page: 240
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b65
  article-title: Imbalanced data classification: A KNN and generative adversarial networks-based hybrid approach for intrusion detection
  publication-title: Future Generation Computer Systems
  doi: 10.1016/j.future.2022.01.026
– volume: 108
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b95
  article-title: Handling imbalanced medical image data: A deep-learning-based one-class classification approach
  publication-title: Artificial Intelligence in Medicine
  doi: 10.1016/j.artmed.2020.101935
– volume: 66
  start-page: 247
  issue: 3
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b209
  article-title: Support vector machines in remote sensing: A review
  publication-title: ISPRS Journal of Photogrammetry and Remote Sensing
  doi: 10.1016/j.isprsjprs.2010.11.001
– volume: 10
  start-page: 40482
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b45
  article-title: Revisiting Bayesian autoencoders with MCMC
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2022.3163270
– volume: 21
  start-page: 3093
  issue: 20
  year: 2002
  ident: 10.1016/j.eswa.2023.122778_b80
  article-title: Estimation of the area under the ROC curve
  publication-title: Statistics in Medicine
  doi: 10.1002/sim.1228
– volume: 61
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b272
  article-title: Effects of class imbalance on resampling and ensemble learning for improved prediction of cyanobacteria blooms
  publication-title: Ecological Informatics
  doi: 10.1016/j.ecoinf.2020.101202
– year: 1995
  ident: 10.1016/j.eswa.2023.122778_b214
– volume: 65
  start-page: 1595
  issue: 2
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b2
  article-title: Fault detection and classification based on co-training of semisupervised machine learning
  publication-title: IEEE Transactions on Industrial Electronics
  doi: 10.1109/TIE.2017.2726961
– volume: 14
  start-page: 482
  issue: 1
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b40
  article-title: Randomized oversampling for generalized multiscale finite element methods
  publication-title: Multiscale Modeling and Simulation
  doi: 10.1137/140988826
– volume: 35
  start-page: 25
  year: 2005
  ident: 10.1016/j.eswa.2023.122778_b77
  article-title: On extending f-measure and g-mean metrics to multi-class problems
  publication-title: WIT Transactions on Information and Communication Technologies
  doi: 10.2495/DATA050031
– volume: 10
  start-page: 1151
  issue: 7
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b300
  article-title: Iterative dichotomiser-3 algorithm in data mining applied to diabetes database
  publication-title: Journal of Computer Science
  doi: 10.3844/jcssp.2014.1151.1155
– volume: 136
  start-page: 190
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b305
  article-title: Imbalance-XGBoost: leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost
  publication-title: Pattern Recognition Letters
  doi: 10.1016/j.patrec.2020.05.035
– start-page: 243
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b205
  article-title: Machine learning with oversampling and undersampling techniques: overview study and experimental results
– volume: 22
  start-page: 6766
  issue: 18
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b295
  article-title: Explainable malware detection system using transformers-based transfer learning and multi-model visual representation
  publication-title: Sensors
  doi: 10.3390/s22186766
– volume: 16
  issue: 7
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b132
  article-title: An empirical survey of data augmentation for time series classification with neural networks
  publication-title: Plos One
  doi: 10.1371/journal.pone.0254841
– volume: 12
  start-page: 7189
  issue: 14
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b11
  article-title: Toward an efficient automatic self-augmentation labeling tool for intrusion detection based on a semi-supervised approach
  publication-title: Applied Sciences
  doi: 10.3390/app12147189
– volume: 202
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b210
  article-title: A novel XGBoost extension for credit scoring class-imbalanced data combining a generalized extreme value link and a modified focal loss function
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.117233
– volume: 55
  start-page: 1
  issue: 7
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b21
  article-title: A survey on data augmentation for text classification
  publication-title: ACM Computing Surveys
  doi: 10.1145/3544558
– volume: 9
  start-page: 48
  issue: 2
  year: 2014
  ident: 10.1016/j.eswa.2023.122778_b41
  article-title: Jumping NLP curves: A review of natural language processing research
  publication-title: IEEE Computational Intelligence Magazine
  doi: 10.1109/MCI.2014.2307227
– volume: 10
  start-page: 250
  issue: 7
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b157
  article-title: A cluster-based boosting algorithm for bankruptcy prediction in a highly imbalanced dataset
  publication-title: Symmetry
  doi: 10.3390/sym10070250
– volume: 12
  issue: 7
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b86
  article-title: Improving imbalanced land cover classification with K-means SMOTE: Detecting and oversampling distinctive minority spectral signatures
  publication-title: Information
  doi: 10.3390/info12070266
– volume: 159
  start-page: 736
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b225
  article-title: Dealing with data imbalance in text classification
  publication-title: Procedia Computer Science
  doi: 10.1016/j.procs.2019.09.229
– year: 2020
  ident: 10.1016/j.eswa.2023.122778_b355
– volume: 32
  year: 2019
  ident: 10.1016/j.eswa.2023.122778_b332
  article-title: Modeling tabular data using conditional GAN
  publication-title: Advances in Neural Information Processing Systems
– volume: 115
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b91
  article-title: Ensemble deep learning: A review
  publication-title: Engineering Applications of Artificial Intelligence
  doi: 10.1016/j.engappai.2022.105151
– volume: 33
  start-page: 18917
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b223
  article-title: Optimization and generalization analysis of transduction through gradient boosting and application to multi-scale graph neural networks
  publication-title: Advances in Neural Information Processing Systems
– volume: 30
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b144
  article-title: Lightgbm: A highly efficient gradient boosting decision tree
  publication-title: Advances in Neural Information Processing Systems
– volume: 150
  start-page: 529
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b30
  article-title: Neighbourhood sampling in bagging for imbalanced data
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2014.07.064
– volume: 51
  start-page: 62
  year: 2015
  ident: 10.1016/j.eswa.2023.122778_b275
  article-title: Software defect prediction using a cost sensitive decision forest and voting, and a potential solution to the class imbalance problem
  publication-title: Information Systems
  doi: 10.1016/j.is.2015.02.006
– start-page: 13
  year: 2009
  ident: 10.1016/j.eswa.2023.122778_b127
  article-title: MSMOTE: Improving classification performance when training data is imbalanced
– volume: 65
  start-page: 124
  issue: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b238
  article-title: Improved hybrid bag-boost ensemble with K-means-SMOTE–ENN technique for handling noisy class imbalanced data
  publication-title: The Computer Journal
  doi: 10.1093/comjnl/bxab039
– year: 2020
  ident: 10.1016/j.eswa.2023.122778_b103
– volume: 14
  start-page: 241
  issue: 2
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b68
  article-title: A survey on ensemble learning
  publication-title: Frontiers of Computer Science
  doi: 10.1007/s11704-019-8208-z
– volume: 11
  start-page: 2703
  issue: 17
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b141
  article-title: KDE-based ensemble learning for imbalanced data
  publication-title: Electronics
  doi: 10.3390/electronics11172703
– volume: 409
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b178
  article-title: Clustering-based undersampling in class-imbalanced data
  publication-title: Information Sciences
– volume: 109
  start-page: 359
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b8
  article-title: Using word embedding and ensemble learning for highly imbalanced data sentiment analysis in short arabic text
  publication-title: Procedia Computer Science
  doi: 10.1016/j.procs.2017.05.365
– volume: 41
  start-page: 552
  issue: 3
  year: 2011
  ident: 10.1016/j.eswa.2023.122778_b149
  article-title: Comparing boosting and bagging techniques with noisy and imbalanced data
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans
  doi: 10.1109/TSMCA.2010.2084081
– volume: 8
  start-page: 195741
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b128
  article-title: A novel wireless network intrusion detection method based on adaptive synthetic sampling and an improved convolutional neural network
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2020.3034015
– volume: 195
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b240
  article-title: A GAN-based image synthesis method for skin lesion classification
  publication-title: Computer Methods and Programs in Biomedicine
  doi: 10.1016/j.cmpb.2020.105568
– volume: 70
  start-page: 1
  year: 2021
  ident: 10.1016/j.eswa.2023.122778_b170
  article-title: A novel method for imbalanced fault diagnosis of rotating machinery based on generative adversarial networks
  publication-title: IEEE Transactions on Instrumentation and Measurement
– volume: 10
  start-page: 48890
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b100
  article-title: A security model based on LightGBM and transformer to protect healthcare systems from cyberattacks
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2022.3172432
– volume: 30
  start-page: 282
  issue: 1
  year: 2023
  ident: 10.1016/j.eswa.2023.122778_b142
  article-title: Voice-based gender recognition model using FRT and light GBM
  publication-title: Tehnički Vjesnik
– volume: 10
  start-page: 42
  issue: 3
  year: 2020
  ident: 10.1016/j.eswa.2023.122778_b271
  article-title: Emergency department return prediction system using blood samples with LightGBM for smart health care services
  publication-title: IEEE Consumer Electronics Magazine
  doi: 10.1109/MCE.2020.3015439
– volume: 9
  start-page: 329
  year: 2017
  ident: 10.1016/j.eswa.2023.122778_b261
  article-title: Random forest algorithm for the classification of neuroimaging data in Alzheimer’s disease: a systematic review
  publication-title: Frontiers in Aging Neuroscience
  doi: 10.3389/fnagi.2017.00329
– volume: 210
  year: 2022
  ident: 10.1016/j.eswa.2023.122778_b1
  article-title: Waveguide quality inspection in quantum cascade lasers: A capsule neural network approach
  publication-title: Expert Systems with Applications
– start-page: 785
  year: 2016
  ident: 10.1016/j.eswa.2023.122778_b49
  article-title: Xgboost: A scalable tree boosting system
– volume: 50
  start-page: 97
  issue: 1
  year: 2018
  ident: 10.1016/j.eswa.2023.122778_b156
  article-title: Multi-class and feature selection extensions of roughly balanced bagging for imbalanced data
  publication-title: Journal of Intelligent Information Systems
  doi: 10.1007/s10844-017-0446-7
– start-page: 731
  year: 2006
  ident: 10.1016/j.eswa.2023.122778_b335
  article-title: Under-sampling approaches for improving prediction of the minority class in an imbalanced dataset
SSID ssj0017007
Score 2.7193584
SecondaryResourceType review_article
Snippet Class imbalance (CI) in classification problems arises when the number of observations belonging to one class is lower than the other. Ensemble learning...
SourceID crossref
elsevier
SourceType Enrichment Source
Index Database
Publisher
StartPage 122778
SubjectTerms Class imbalance
Data augmentation
Ensemble learning
Machine learning
Title A review of ensemble learning and data augmentation models for class imbalanced problems: Combination, implementation and evaluation
URI https://dx.doi.org/10.1016/j.eswa.2023.122778
Volume 244
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07T8MwELYqWFh4I8qj8sAGaZPYzoOtqqgKiC5QqVsUO3ZV1KYVbcXGxA_nLnEKSKgDS6JEthP5bN9Z_u77CLmKsyxNUSMMboHDFc-cyNfcMXGkOEOn5GJy8lM_6A34w1AMa6RT5cIgrNKu_eWaXqzW9k3L9mZrPh63niE4AHeIJ43ICVcwfnIe4ihvfqxhHkg_F5Z8e6GDpW3iTInx0ot35B7yWdPz_RCl1v5yTj8cTnef7NpIkbbLnzkgNZ0fkr1KhYHaSXlEPtu0zD-hM0NhU6qncqKpVYMY0TTPKMJAaboaTW2iUU4LAZwFhYiVKoyf6XgqEeQI_UGtxszilsLHYONc1LiBEhXUvGgB2_2mCj8mg-7dS6fnWG0FR3HPXTqCSZh7JgtcCbNQgp-SipmUZXiOF0aZSKU2WoexDJRkgauNMFwaFas4U54r2QnZyme5PiU0jBgLhOQxCwKuXS-KoVW4MAhOhCtVnXhVpybKEo-j_sUkqRBmrwkaIkFDJKUh6uR6XWde0m5sLC0qWyW_Bk8CfmFDvbN_1jsnO_DEETHmiQuytXxb6UuITZayUQy-Btlu3z_2-l8fz-Ws
linkProvider Elsevier
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07T8MwELaqdoCFN6I8PbBBaBLbebBVFVVLHwut1C2KHQcVtWlFW_EH-OHcNU4BCXVgSaTE50Q-23fWfXcfIbdhksQxcoTBzbO44okVuJpbaRgoztAo2Zic3Ot7rSF_HolRiTSKXBiEVZq9P9_T17u1eVIzo1mbj8e1F3AOwBxipBFrwmHFzwpWpxJlUqm3O63-Jpjg23nWNLS3UMDkzuQwL734wPJDLntwXNdHtrW_7NMPm9M8IHvGWaT1_H8OSUlnR2S_IGKgZl0ek886zVNQ6CylcC7VUznR1BBCvNI4SygiQWm8ep2aXKOMrjlwFhScVqrQhabjqUScIwwJNTQzi0cKH4Oz81riHloUaPN1D9jvd7XwEzJsPg0aLcvQK1iKO_bSEkzC8ksTz5awECWYKqlYGrMEQ3l-kIhY6lRrP5SeksyzdSpSLlMVqjBRji3ZKSlns0yfEeoHjHlC8pB5Hte2E4TQK1wY-CfClqpKnGJQI2VqjyMFxiQqQGZvESoiQkVEuSKq5G4jM88rb2xtLQpdRb_mTwSmYYvc-T_lbshOa9DrRt12v3NBduENRwCZIy5Jefm-0lfgqizltZmKXwDP6F0
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+review+of+ensemble+learning+and+data+augmentation+models+for+class+imbalanced+problems%3A+Combination%2C+implementation+and+evaluation&rft.jtitle=Expert+systems+with+applications&rft.au=Khan%2C+Azal+Ahmad&rft.au=Chaudhari%2C+Omkar&rft.au=Chandra%2C+Rohitash&rft.date=2024-06-15&rft.pub=Elsevier+Ltd&rft.issn=0957-4174&rft.eissn=1873-6793&rft.volume=244&rft_id=info:doi/10.1016%2Fj.eswa.2023.122778&rft.externalDocID=S0957417423032803
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0957-4174&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0957-4174&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0957-4174&client=summon