Gene Expression Data Analysis using Supervised Machine Learning Algorithm in Data Mining for Breast Cancer Prediction
The human body is composed of cells, and each cell has a variety of components. The human body's cells are essential for both the construction and operation of living things. Cells in the human body are capable of dividing and self-destructing as necessary. This natural functioning can occasion...
Saved in:
Published in | 2024 5th International Conference on Image Processing and Capsule Networks (ICIPCN) pp. 550 - 553 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
03.07.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | The human body is composed of cells, and each cell has a variety of components. The human body's cells are essential for both the construction and operation of living things. Cells in the human body are capable of dividing and self-destructing as necessary. This natural functioning can occasionally alter; for example, certain cells may grow uncontrollably or improperly, giving rise to tumors. When these tumors develop into malignant ones and spread to nearby body parts, the condition is known as cancer. Specifically, if the cells in the breast can grow uncontrollably then this condition is known as breast cancer. A gene is a piece of genetic material that is present in every human cell. The genes that code for the proteins that determine how each cell functions. Gene expression is the name given to this function that produce protein. Occasionally, variations in the genetic material, or mutated genes, produce alterations in the values of gene expression. Breast cancer is caused by these altered gene expression levels. Some diagnostic techniques are necessary to detect this altered gene expression for breast cancer, and they are highly effective in helping medical professionals to pinpoint the disease's biomarker. Thus, the goal of this research is to use gene expression data to create a model that can accurately identify breast cancer biomarkers. The suggested model in this study, a pruned neural network method, is designed to predict breast cancer while requiring less memory and execution time and offering the best accuracy. This suggested model is evaluated against current techniques, including support vector machines and decision trees, and it outperforms them with a \mathbf{98 \%} accuracy rate. Having a larger number of gene characteristics and fewer samples is a drawback of using gene expression data. Therefore, the wrapper approaches are employed as a preprocessing strategy in this research work to remove the undesirable genes. The gene expression data is preprocessed before being applied to both current and proposed models. |
---|---|
AbstractList | The human body is composed of cells, and each cell has a variety of components. The human body's cells are essential for both the construction and operation of living things. Cells in the human body are capable of dividing and self-destructing as necessary. This natural functioning can occasionally alter; for example, certain cells may grow uncontrollably or improperly, giving rise to tumors. When these tumors develop into malignant ones and spread to nearby body parts, the condition is known as cancer. Specifically, if the cells in the breast can grow uncontrollably then this condition is known as breast cancer. A gene is a piece of genetic material that is present in every human cell. The genes that code for the proteins that determine how each cell functions. Gene expression is the name given to this function that produce protein. Occasionally, variations in the genetic material, or mutated genes, produce alterations in the values of gene expression. Breast cancer is caused by these altered gene expression levels. Some diagnostic techniques are necessary to detect this altered gene expression for breast cancer, and they are highly effective in helping medical professionals to pinpoint the disease's biomarker. Thus, the goal of this research is to use gene expression data to create a model that can accurately identify breast cancer biomarkers. The suggested model in this study, a pruned neural network method, is designed to predict breast cancer while requiring less memory and execution time and offering the best accuracy. This suggested model is evaluated against current techniques, including support vector machines and decision trees, and it outperforms them with a \mathbf{98 \%} accuracy rate. Having a larger number of gene characteristics and fewer samples is a drawback of using gene expression data. Therefore, the wrapper approaches are employed as a preprocessing strategy in this research work to remove the undesirable genes. The gene expression data is preprocessed before being applied to both current and proposed models. |
Author | Nathiya, S. Sumitha, J. |
Author_xml | – sequence: 1 givenname: S. surname: Nathiya fullname: Nathiya, S. email: nathiyasri24@gmail.com organization: Dr.SNS Rajalakshmi College of Arts and Science,Department of Computer Science,Coimbatore,Tamil Nadu,India – sequence: 2 givenname: J. surname: Sumitha fullname: Sumitha, J. email: sumivenkat2006@gmail.com organization: Dr.SNS Rajalakshmi College of Arts and Science,Department of Computer Science,Coimbatore,Tamil Nadu,India |
BookMark | eNqFzcFOwkAQxvElwYMib0DMvIB1tpUte4SCSiKGRO9kUgaYpGyb2ZbI24uGu6fv8Ms_353phzqwMQ8WE2vRPy2L5br4cNkkTZMU0-cEEb3rmaHP_SQbY-Zym9tb071yYFh8N8oxSh1gTi3BNFB1jhKhixL28Nk1rCeJvIUVlQe5FO9MGn5tWu1rlfZwBLnGK_mDXa0wU6bYQkGhZIW18lbK9vJyb252VEUeXndgRi-Lr-LtUZh506gcSc8bi86hR5f9wz9tCExq |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/ICIPCN63822.2024.00096 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE/IET Electronic Library IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9798350367171 |
EndPage | 553 |
ExternalDocumentID | 10660906 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-ieee_primary_106609063 |
IEDL.DBID | RIE |
IngestDate | Wed Sep 18 05:50:16 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-ieee_primary_106609063 |
ParticipantIDs | ieee_primary_10660906 |
PublicationCentury | 2000 |
PublicationDate | 2024-July-3 |
PublicationDateYYYYMMDD | 2024-07-03 |
PublicationDate_xml | – month: 07 year: 2024 text: 2024-July-3 day: 03 |
PublicationDecade | 2020 |
PublicationTitle | 2024 5th International Conference on Image Processing and Capsule Networks (ICIPCN) |
PublicationTitleAbbrev | ICIPCN |
PublicationYear | 2024 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 3.8477237 |
Snippet | The human body is composed of cells, and each cell has a variety of components. The human body's cells are essential for both the construction and operation of... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 550 |
SubjectTerms | Accuracy Biological system modeling Breast cancer Gene Expression Machine learning algorithms Neural networks Predictive models Proteins Pruned Neural Network |
Title | Gene Expression Data Analysis using Supervised Machine Learning Algorithm in Data Mining for Breast Cancer Prediction |
URI | https://ieeexplore.ieee.org/document/10660906 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwEA66J59UrPhjyj342tpmW9s8at3YhJaBCnsb_ZHVobajpiD-9d6lrQNR8K2Etjlyl1zIfd8Xxq6472BW8rgZY_iYwxRneuLjYihWMfp7hTmCE8E5jNzp0_B-MVq0ZHXNhZFSavCZtOhR1_KzMq3pqAxnuOvaggS2dz0hGrJWy_p1bHE9C2bzIMKA4sSw4iSLbWsx_u21KTprTPZZ1PXXgEVerFolVvr5Q4rx3wYdMGNL0IP5d-o5ZDuyOGI1SUjD-KOFthZwF6sYOtURIIR7Dg_1hlaHd5lBqHGUElqJ1RxuXvOyWqvnN1i3H4f6-gjAjS3cEnpdQUBRUmHnVOAhpxqsPxk_BlOT7F5uGvGKZWfy4Jj1irKQJwzEKMvs2HNlhrurJIkFlV2cDF9yE-FzccqMX39x9kf7Oduj4deg1kGf9VRVywtM3Sq51C77Al0snqI |
link.rule.ids | 310,311,783,787,792,793,799,27939,55088 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEB2kHvSkYsWPqnPw2pimTZo9amxptAkFK_RWNs22FjUtMQHx1zuzSSyIgrewJNmBmd0Zdt57C3BluS3KSl2rKSl8mp0ZrfTIpc1QzCX5e045wmKCcxA6g6fO_cSelGR1zYVRSmnwmTL4Uffy49Us56MyWuGOYwoW2N62ubAo6Fol77dlimvf80deSCFlMcfKYmFsU8vxby5O0XmjvwdhNWMBF3kx8iwyZp8_xBj_bdI-1DcUPRx9J58D2FLJIeQsIo29jxLcmuCdzCRWuiPIGPcFPuZr3h_eVYyBRlIqLEVWF3jzulily-z5DZflx4G-QAKptMVbxq9n6HGcpDQ5t3jYrXVo9Htjb9Bku6frQr5iWpncPoJaskrUMaCw49iUXUfFVF9FkRTceGnF9JITCdcSJ1D_9Renf4xfws5gHAynQz98OINddoWGuLYbUMvSXJ1TIs-iC-2-L9Vaoe8 |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+5th+International+Conference+on+Image+Processing+and+Capsule+Networks+%28ICIPCN%29&rft.atitle=Gene+Expression+Data+Analysis+using+Supervised+Machine+Learning+Algorithm+in+Data+Mining+for+Breast+Cancer+Prediction&rft.au=Nathiya%2C+S.&rft.au=Sumitha%2C+J.&rft.date=2024-07-03&rft.pub=IEEE&rft.spage=550&rft.epage=553&rft_id=info:doi/10.1109%2FICIPCN63822.2024.00096&rft.externalDocID=10660906 |