A hybrid bat and grey wolf optimizer for gene selection in cancer classification A hybrid bat and grey wolf optimizer for gene selection

DNA microarray is a technique in which a chip containing numerous DNA codes is used for the expression estimation of an extensive number of genes simultaneously. These genes are arranged in a table or data format. The gene expression data can be employed in pattern recognition algorithms to differen...

Full description

Saved in:
Bibliographic Details
Published inKnowledge and information systems Vol. 67; no. 1; pp. 455 - 495
Main Authors Tbaishat, Dina, Tubishat, Mohammad, Makhadmeh, Sharif Naser, Alomari, Osama Ahmad
Format Journal Article
LanguageEnglish
Published London Springer London 01.01.2025
Subjects
Online AccessGet full text

Cover

Loading…
Abstract DNA microarray is a technique in which a chip containing numerous DNA codes is used for the expression estimation of an extensive number of genes simultaneously. These genes are arranged in a table or data format. The gene expression data can be employed in pattern recognition algorithms to differentiate between samples obtained from healthy individuals and those with cancer. However, recognizing biomarkers’ patterns from gene selection data is considered challenging because of its huge dimensionality and the presence of noisy, irrelevant, and unwanted genes, leading to mislearning process and, thus, declining in the classification performance. Therefore, in this paper, an intelligent gene selection approach is proposed on the basis of robust minimum redundancy maximum relevancy as the filter and hybrid improved bat algorithm (BA) with grey wolf optimizer (GWO) (BA-GWO). The BA-GWO is introduced to determinate a limited number of biomarker genes that significantly enhance the classification performance. In this approach, the k -nearest neighbor algorithm was employed for the classification task. The proposed BA-GWO is mainly introduced to improve the BA search agents’ performance in searching for the best candidate gene subset that carries the biomarkers for cancer classification. Furthermore, the BA-GWO is designed to enhance both exploitation and exploration capabilities while ensuring a balanced approach and preventing stagnation in local optima. The primary function of this proposed approach is to enhance the solutions acquired through the BA by utilizing them as the initial population for the GWO. The proposed approach is evaluated using ten widely recognized microarray datasets in the experimental stage, including CNS, Colon, Leukemia 3c, Leukemia 4c, Leukemia, Lung Cancer, Lymphoma, MLL, Ovarian, and SRBCT. The performance of the hybridization of BA and GWO, as well as recent and base optimization algorithms, is evaluated. Afterward, the hybrid versions are compared with their individual optimization algorithms. Moreover, the hybridization algorithms are compared with each other. For further validation, the proposed approach performance is compared with twelve state-of-the-art comparative methods in terms of accuracy and the selected genes. The findings indicate that the proposed approach yields superior outcomes in two out of eight datasets, while also delivering highly competitive results in the remaining datasets.
AbstractList DNA microarray is a technique in which a chip containing numerous DNA codes is used for the expression estimation of an extensive number of genes simultaneously. These genes are arranged in a table or data format. The gene expression data can be employed in pattern recognition algorithms to differentiate between samples obtained from healthy individuals and those with cancer. However, recognizing biomarkers’ patterns from gene selection data is considered challenging because of its huge dimensionality and the presence of noisy, irrelevant, and unwanted genes, leading to mislearning process and, thus, declining in the classification performance. Therefore, in this paper, an intelligent gene selection approach is proposed on the basis of robust minimum redundancy maximum relevancy as the filter and hybrid improved bat algorithm (BA) with grey wolf optimizer (GWO) (BA-GWO). The BA-GWO is introduced to determinate a limited number of biomarker genes that significantly enhance the classification performance. In this approach, the k -nearest neighbor algorithm was employed for the classification task. The proposed BA-GWO is mainly introduced to improve the BA search agents’ performance in searching for the best candidate gene subset that carries the biomarkers for cancer classification. Furthermore, the BA-GWO is designed to enhance both exploitation and exploration capabilities while ensuring a balanced approach and preventing stagnation in local optima. The primary function of this proposed approach is to enhance the solutions acquired through the BA by utilizing them as the initial population for the GWO. The proposed approach is evaluated using ten widely recognized microarray datasets in the experimental stage, including CNS, Colon, Leukemia 3c, Leukemia 4c, Leukemia, Lung Cancer, Lymphoma, MLL, Ovarian, and SRBCT. The performance of the hybridization of BA and GWO, as well as recent and base optimization algorithms, is evaluated. Afterward, the hybrid versions are compared with their individual optimization algorithms. Moreover, the hybridization algorithms are compared with each other. For further validation, the proposed approach performance is compared with twelve state-of-the-art comparative methods in terms of accuracy and the selected genes. The findings indicate that the proposed approach yields superior outcomes in two out of eight datasets, while also delivering highly competitive results in the remaining datasets.
Author Tubishat, Mohammad
Alomari, Osama Ahmad
Makhadmeh, Sharif Naser
Tbaishat, Dina
Author_xml – sequence: 1
  givenname: Dina
  surname: Tbaishat
  fullname: Tbaishat, Dina
  email: dina.tbaishat@zu.ac.ae
  organization: College of Technological Innovation, Zayed University
– sequence: 2
  givenname: Mohammad
  surname: Tubishat
  fullname: Tubishat, Mohammad
  organization: College of Technological Innovation, Zayed University
– sequence: 3
  givenname: Sharif Naser
  surname: Makhadmeh
  fullname: Makhadmeh, Sharif Naser
  email: sharif.makhadmeh@uop.edu.jo
  organization: Department of Information Technology, King Abdullah II School for Information Technology, The University of Jordan, Artificial Intelligence Research Center (AIRC), Ajman University
– sequence: 4
  givenname: Osama Ahmad
  surname: Alomari
  fullname: Alomari, Osama Ahmad
  organization: Department of Computer Science and Information Technology, College of Engineering, Abu Dhabi University
BookMark eNqdjkGKAjEQRYMoqDNewFVdoLWqW21cyjCDSxfuQzpWt5G2IknLoKc3zngCF5__4fHhjVVfvLBSU8IZIZbzSEi0zDBfpOR5Wj01wpzWWUG06r82FWU5VOMYT4hUrohGareB460K7gCV6cDIAZrAN_j1bQ3-0rmzu3OA2gdoWBgit2w75wWcgDViE7StidHVzpon-FSD2rSRJ6_-UMXP9_5rm8VLcNJw0Cd_DZKQJtRPef0vr5O8_pPXWLz3egDs309j
ContentType Journal Article
Copyright The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024 Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Copyright_xml – notice: The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024 Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
DOI 10.1007/s10115-024-02225-0
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 0219-3116
EndPage 495
ExternalDocumentID 10_1007_s10115_024_02225_0
GrantInformation_xml – fundername: Zayed University
  grantid: R22078; R22078
  funderid: http://dx.doi.org/10.13039/501100008675
GroupedDBID -59
-5G
-BR
-EM
-Y2
-~C
.4S
.86
.DC
.VR
06D
0R~
0VY
1N0
1SB
203
29L
2J2
2JN
2JY
2KG
2LR
2P1
2VQ
2~H
30V
3V.
4.4
406
408
409
40D
40E
5GY
5VS
67Z
6KP
6NX
7WY
8AO
8FE
8FG
8FL
8FW
8TC
8UJ
95-
95.
95~
96X
AAAVM
AABHQ
AACDK
AAHNG
AAIAL
AAJBT
AAJKR
AANZL
AARHV
AARTL
AASML
AATNV
AATVU
AAUYE
AAWCG
AAYIU
AAYQN
AAYTO
AAYZH
ABAKF
ABBBX
ABBXA
ABDZT
ABECU
ABFTD
ABFTV
ABHLI
ABHQN
ABJNI
ABJOX
ABKCH
ABKTR
ABMNI
ABMQK
ABNWP
ABQBU
ABQSL
ABSXP
ABTEG
ABTHY
ABTKH
ABTMW
ABULA
ABUWG
ABWNU
ABXPI
ACAOD
ACBXY
ACGFO
ACGFS
ACHSB
ACHXU
ACKNC
ACMDZ
ACMLO
ACOKC
ACOMO
ACPIV
ACREN
ACSNA
ACZOJ
ADHHG
ADHIR
ADINQ
ADKNI
ADKPE
ADMLS
ADRFC
ADTPH
ADURQ
ADYFF
ADYOE
ADZKW
AEBTG
AEFQL
AEGAL
AEGNC
AEJHL
AEJRE
AEKMD
AEMSY
AENEX
AEOHA
AEPYU
AESKC
AETLH
AEVLU
AEXYK
AFBBN
AFGCZ
AFKRA
AFLOW
AFQWF
AFWTZ
AFYQB
AFZKB
AGAYW
AGDGC
AGJBK
AGMZJ
AGQEE
AGQMX
AGRTI
AGWIL
AGWZB
AGYKE
AHAVH
AHBYD
AHKAY
AHSBF
AHYZX
AIAKS
AIGIU
AIIXL
AILAN
AITGF
AJBLW
AJRNO
AJZVZ
ALMA_UNASSIGNED_HOLDINGS
ALWAN
AMKLP
AMTXH
AMXSW
AMYLF
AMYQR
AOCGG
ARAPS
ARCSS
ARMRJ
ASPBG
AVWKF
AXYYD
AYJHY
AZFZN
AZQEC
B-.
BA0
BDATZ
BENPR
BEZIV
BGLVJ
BGNMA
BPHCQ
BSONS
CAG
CCPQU
COF
CS3
CSCUP
DDRTE
DL5
DNIVK
DPUIP
DU5
DWQXO
EBLON
EBS
EDO
EIOEI
EJD
ESBYG
F5P
FEDTE
FERAY
FFXSO
FIGPU
FINBP
FNLPD
FRNLG
FRRFC
FSGXE
FWDCC
GGCAI
GGRSB
GJIRD
GNUQQ
GNWQR
GQ6
GQ7
GQ8
GROUPED_ABI_INFORM_COMPLETE
GXS
H13
HCIFZ
HF~
HG5
HG6
HMJXF
HQYDN
HRMNR
HVGLF
HZ~
I-F
I09
IHE
IJ-
IKXTQ
ITM
IWAJR
IXC
IXE
IZIGR
IZQ
I~X
I~Z
J-C
J0Z
JBSCW
JCJTX
JZLTJ
K60
K6V
K6~
K7-
KDC
KOV
LAS
LLZTM
M0C
M0N
M4Y
MA-
MK~
ML~
N2Q
NB0
NPVJJ
NQJWS
NU0
O9-
O93
O9J
OAM
P2P
P62
P9O
PF0
PQBIZ
PQBZA
PQQKQ
PROAC
PT4
PT5
Q2X
QOS
R89
R9I
RIG
ROL
RPX
RSV
S16
S1Z
S27
S3B
SAP
SCO
SDH
SHX
SISQX
SJYHP
SNE
SNPRN
SNX
SOHCF
SOJ
SPISZ
SRMVM
SSLCW
STPWE
SZN
T13
TSG
TSK
TSV
TUC
TUS
U2A
UG4
UOJIU
UTJUX
UZXMN
VC2
VFIZW
W23
W48
WK8
YLTOR
Z45
Z5O
Z7R
Z7S
Z7X
Z7Y
Z7Z
Z81
Z83
Z88
ZMTXR
~A9
ID FETCH-springer_journals_10_1007_s10115_024_02225_03
IEDL.DBID U2A
ISSN 0219-1377
IngestDate Fri Feb 21 02:37:37 EST 2025
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Bat algorithm
rMRMR
Classification
Gene selection optimization
Grey wolf optimizer
Language English
LinkModel DirectLink
MergedId FETCHMERGED-springer_journals_10_1007_s10115_024_02225_03
ParticipantIDs springer_journals_10_1007_s10115_024_02225_0
PublicationCentury 2000
PublicationDate 20250100
PublicationDateYYYYMMDD 2025-01-01
PublicationDate_xml – month: 1
  year: 2025
  text: 20250100
PublicationDecade 2020
PublicationPlace London
PublicationPlace_xml – name: London
PublicationSubtitle An International Journal
PublicationTitle Knowledge and information systems
PublicationTitleAbbrev Knowl Inf Syst
PublicationYear 2025
Publisher Springer London
Publisher_xml – name: Springer London
SSID ssj0017611
Score 4.748979
Snippet DNA microarray is a technique in which a chip containing numerous DNA codes is used for the expression estimation of an extensive number of genes...
SourceID springer
SourceType Publisher
StartPage 455
SubjectTerms Computer Science
Data Mining and Knowledge Discovery
Database Management
Information Storage and Retrieval
Information Systems and Communication Service
Information Systems Applications (incl.Internet)
IT in Business
Regular Paper
Subtitle A hybrid bat and grey wolf optimizer for gene selection
Title A hybrid bat and grey wolf optimizer for gene selection in cancer classification
URI https://link.springer.com/article/10.1007/s10115-024-02225-0
Volume 67
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEB6kvXjxLT7LHDwaWPbV7nErXYti8WChnpYkTbGgW-iulPrr_RJ3VdBLT3sJIZPJ5JtvZyZDdAWMi2dTX4puLJUIdYJ7UIG1atk1UaL9pGdsRPdhFA_H4d0kmtRFYWWT7d6EJN1N_avYDd6LAKYIR1IEiHo7Ane3iVxjP_2OHYCYuz55sEVh39OrS2X-n-NPANThSrZHO7VDyOmXBvdpyxQHtNs0W-Da9g7pMeWXtS2vYiUrBv9nMOU1rxavM17A7t_mHxgOF5RxJAyXrr8NNp3nBWur2iVr6ynb1CCnjSMKssHTzVA0K8vrc1XmP48MW0FyCJI7QXIvOKZWsSjMCXEAn0VpLzLgViF2vKckgDzxprGvZaKiU7reZOazzYaf07Zve-S63xQX1KqW7-YSwF2pDrXTrN8f2e_t8_2g4_T2Ce1MlpE
linkProvider Springer Nature
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8NAEB5qPejFt_h2DnpzIWybNDkoBLW09oGHFnoLu9stFmoCTaTEo7_c2TVRQS_V-2YI-ebxTWZmB-CCYpw3GXPBGp6QrK4C8oOSslYlGtoNFA98bSq6vb7XGtYfRu6oAm_lLIztdi9LktZTfxt2I_bCKKYwm6Qwp2il7Oh8QYlaet2-I1QvOW_eD25brBQSFaCk0dcNvUZSRJIiKylyavTUCqwStfCNZQx5-FmJoDTfbt0jy2bmdr5i8Ob3N_pRTrVRqrkFGwW9xPBDH7ahouMd2CxXN2BhybvwGOJTboa1UIoMRTxGyrtzXCSzCSbkRZ6nr3ScCC2SgmlM7bYcghCnMSqjKHNUhnebRiOL7R7U_vBB9qEaJ7E-AKwRA5LKcTVlanXCz5eCaEHgjD2uRCDdQ7haRvLRcsfPYa016HWjbrvfOYZ1brbv2h8gJ1DN5i_6lChBJs8sZgg3_1OBdwA1wPo
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8NAEB5qBfHiW3w7B725NGyTtDkoBGtprZYeLPQWdjdbLGgqTaTUH-FvdnabqKCX6n0zhMzMzjeZ-WYAzijG-cOYC1bzhWSuCugelJS1KlHTXqB4UNemonvf9Vt993bgDUrwXnBhbLd7UZKccxrMlKYkq7zEw8o34hshGUbxhdmEhTl5W2VHz6aUtKWX7QZp-Jzz5s3DdYsVAqNcQWn0Na3XSIpIUmQlRU6VnlqCZddwi8k_-jz8rEpQym838JGXMzOpLyfh_P5GP0qrNmI1N2Ath5oYzm1jE0o62YL1Yo0D5l69Db0QH2eGuIVSZCiSGCkHn-F0_DTEMd0oz6M3Ok7gFsnYNKZ2cw6pE0cJKmM0E1QGg5umI6vnHaj-4YPsQjkZJ3oPsEpoSCrH05S1uaTLuhQEEQIn9rkSgfT24WIRyQeLHT-FlV6jGd21u51DWOVmEa_9F3IE5Wzyqo8JHWTyxKoM4ep_FvABb1TFLQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+hybrid+bat+and+grey+wolf+optimizer+for+gene+selection+in+cancer+classification&rft.jtitle=Knowledge+and+information+systems&rft.au=Tbaishat%2C+Dina&rft.au=Tubishat%2C+Mohammad&rft.au=Makhadmeh%2C+Sharif+Naser&rft.au=Alomari%2C+Osama+Ahmad&rft.date=2025-01-01&rft.pub=Springer+London&rft.issn=0219-1377&rft.eissn=0219-3116&rft.volume=67&rft.issue=1&rft.spage=455&rft.epage=495&rft_id=info:doi/10.1007%2Fs10115-024-02225-0&rft.externalDocID=10_1007_s10115_024_02225_0
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0219-1377&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0219-1377&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0219-1377&client=summon