Gene Expression Data Analysis using Supervised Machine Learning Algorithm in Data Mining for Breast Cancer Prediction

The human body is composed of cells, and each cell has a variety of components. The human body's cells are essential for both the construction and operation of living things. Cells in the human body are capable of dividing and self-destructing as necessary. This natural functioning can occasion...

Full description

Saved in:
Bibliographic Details
Published in2024 5th International Conference on Image Processing and Capsule Networks (ICIPCN) pp. 550 - 553
Main Authors Nathiya, S., Sumitha, J.
Format Conference Proceeding
LanguageEnglish
Published IEEE 03.07.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The human body is composed of cells, and each cell has a variety of components. The human body's cells are essential for both the construction and operation of living things. Cells in the human body are capable of dividing and self-destructing as necessary. This natural functioning can occasionally alter; for example, certain cells may grow uncontrollably or improperly, giving rise to tumors. When these tumors develop into malignant ones and spread to nearby body parts, the condition is known as cancer. Specifically, if the cells in the breast can grow uncontrollably then this condition is known as breast cancer. A gene is a piece of genetic material that is present in every human cell. The genes that code for the proteins that determine how each cell functions. Gene expression is the name given to this function that produce protein. Occasionally, variations in the genetic material, or mutated genes, produce alterations in the values of gene expression. Breast cancer is caused by these altered gene expression levels. Some diagnostic techniques are necessary to detect this altered gene expression for breast cancer, and they are highly effective in helping medical professionals to pinpoint the disease's biomarker. Thus, the goal of this research is to use gene expression data to create a model that can accurately identify breast cancer biomarkers. The suggested model in this study, a pruned neural network method, is designed to predict breast cancer while requiring less memory and execution time and offering the best accuracy. This suggested model is evaluated against current techniques, including support vector machines and decision trees, and it outperforms them with a \mathbf{98 \%} accuracy rate. Having a larger number of gene characteristics and fewer samples is a drawback of using gene expression data. Therefore, the wrapper approaches are employed as a preprocessing strategy in this research work to remove the undesirable genes. The gene expression data is preprocessed before being applied to both current and proposed models.
AbstractList The human body is composed of cells, and each cell has a variety of components. The human body's cells are essential for both the construction and operation of living things. Cells in the human body are capable of dividing and self-destructing as necessary. This natural functioning can occasionally alter; for example, certain cells may grow uncontrollably or improperly, giving rise to tumors. When these tumors develop into malignant ones and spread to nearby body parts, the condition is known as cancer. Specifically, if the cells in the breast can grow uncontrollably then this condition is known as breast cancer. A gene is a piece of genetic material that is present in every human cell. The genes that code for the proteins that determine how each cell functions. Gene expression is the name given to this function that produce protein. Occasionally, variations in the genetic material, or mutated genes, produce alterations in the values of gene expression. Breast cancer is caused by these altered gene expression levels. Some diagnostic techniques are necessary to detect this altered gene expression for breast cancer, and they are highly effective in helping medical professionals to pinpoint the disease's biomarker. Thus, the goal of this research is to use gene expression data to create a model that can accurately identify breast cancer biomarkers. The suggested model in this study, a pruned neural network method, is designed to predict breast cancer while requiring less memory and execution time and offering the best accuracy. This suggested model is evaluated against current techniques, including support vector machines and decision trees, and it outperforms them with a \mathbf{98 \%} accuracy rate. Having a larger number of gene characteristics and fewer samples is a drawback of using gene expression data. Therefore, the wrapper approaches are employed as a preprocessing strategy in this research work to remove the undesirable genes. The gene expression data is preprocessed before being applied to both current and proposed models.
Author Nathiya, S.
Sumitha, J.
Author_xml – sequence: 1
  givenname: S.
  surname: Nathiya
  fullname: Nathiya, S.
  email: nathiyasri24@gmail.com
  organization: Dr.SNS Rajalakshmi College of Arts and Science,Department of Computer Science,Coimbatore,Tamil Nadu,India
– sequence: 2
  givenname: J.
  surname: Sumitha
  fullname: Sumitha, J.
  email: sumivenkat2006@gmail.com
  organization: Dr.SNS Rajalakshmi College of Arts and Science,Department of Computer Science,Coimbatore,Tamil Nadu,India
BookMark eNqFzcFOwkAQxvElwYMib0DMvIB1tpUte4SCSiKGRO9kUgaYpGyb2ZbI24uGu6fv8Ms_353phzqwMQ8WE2vRPy2L5br4cNkkTZMU0-cEEb3rmaHP_SQbY-Zym9tb071yYFh8N8oxSh1gTi3BNFB1jhKhixL28Nk1rCeJvIUVlQe5FO9MGn5tWu1rlfZwBLnGK_mDXa0wU6bYQkGhZIW18lbK9vJyb252VEUeXndgRi-Lr-LtUZh506gcSc8bi86hR5f9wz9tCExq
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICIPCN63822.2024.00096
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350367171
EndPage 553
ExternalDocumentID 10660906
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-ieee_primary_106609063
IEDL.DBID RIE
IngestDate Wed Sep 18 05:50:16 EDT 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-ieee_primary_106609063
ParticipantIDs ieee_primary_10660906
PublicationCentury 2000
PublicationDate 2024-July-3
PublicationDateYYYYMMDD 2024-07-03
PublicationDate_xml – month: 07
  year: 2024
  text: 2024-July-3
  day: 03
PublicationDecade 2020
PublicationTitle 2024 5th International Conference on Image Processing and Capsule Networks (ICIPCN)
PublicationTitleAbbrev ICIPCN
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
Score 3.8477237
Snippet The human body is composed of cells, and each cell has a variety of components. The human body's cells are essential for both the construction and operation of...
SourceID ieee
SourceType Publisher
StartPage 550
SubjectTerms Accuracy
Biological system modeling
Breast cancer
Gene Expression
Machine learning algorithms
Neural networks
Predictive models
Proteins
Pruned Neural Network
Title Gene Expression Data Analysis using Supervised Machine Learning Algorithm in Data Mining for Breast Cancer Prediction
URI https://ieeexplore.ieee.org/document/10660906
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwEA66J59UrPhjyj342tpmW9s8at3YhJaBCnsb_ZHVobajpiD-9d6lrQNR8K2Etjlyl1zIfd8Xxq6472BW8rgZY_iYwxRneuLjYihWMfp7hTmCE8E5jNzp0_B-MVq0ZHXNhZFSavCZtOhR1_KzMq3pqAxnuOvaggS2dz0hGrJWy_p1bHE9C2bzIMKA4sSw4iSLbWsx_u21KTprTPZZ1PXXgEVerFolVvr5Q4rx3wYdMGNL0IP5d-o5ZDuyOGI1SUjD-KOFthZwF6sYOtURIIR7Dg_1hlaHd5lBqHGUElqJ1RxuXvOyWqvnN1i3H4f6-gjAjS3cEnpdQUBRUmHnVOAhpxqsPxk_BlOT7F5uGvGKZWfy4Jj1irKQJwzEKMvs2HNlhrurJIkFlV2cDF9yE-FzccqMX39x9kf7Oduj4deg1kGf9VRVywtM3Sq51C77Al0snqI
link.rule.ids 310,311,783,787,792,793,799,27939,55088
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEB2kHvSkYsWPqnPw2pimTZo9amxptAkFK_RWNs22FjUtMQHx1zuzSSyIgrewJNmBmd0Zdt57C3BluS3KSl2rKSl8mp0ZrfTIpc1QzCX5e045wmKCcxA6g6fO_cSelGR1zYVRSmnwmTL4Uffy49Us56MyWuGOYwoW2N62ubAo6Fol77dlimvf80deSCFlMcfKYmFsU8vxby5O0XmjvwdhNWMBF3kx8iwyZp8_xBj_bdI-1DcUPRx9J58D2FLJIeQsIo29jxLcmuCdzCRWuiPIGPcFPuZr3h_eVYyBRlIqLEVWF3jzulily-z5DZflx4G-QAKptMVbxq9n6HGcpDQ5t3jYrXVo9Htjb9Bku6frQr5iWpncPoJaskrUMaCw49iUXUfFVF9FkRTceGnF9JITCdcSJ1D_9Renf4xfws5gHAynQz98OINddoWGuLYbUMvSXJ1TIs-iC-2-L9Vaoe8
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+5th+International+Conference+on+Image+Processing+and+Capsule+Networks+%28ICIPCN%29&rft.atitle=Gene+Expression+Data+Analysis+using+Supervised+Machine+Learning+Algorithm+in+Data+Mining+for+Breast+Cancer+Prediction&rft.au=Nathiya%2C+S.&rft.au=Sumitha%2C+J.&rft.date=2024-07-03&rft.pub=IEEE&rft.spage=550&rft.epage=553&rft_id=info:doi/10.1109%2FICIPCN63822.2024.00096&rft.externalDocID=10660906