VTrans: A VAE-Based Pre-Trained Transformer Method for Microbiome Data Analysis
Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient surv...
Saved in:
Published in | Journal of computational biology Vol. 32; no. 9; pp. 85 - 864 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
United States
Mary Ann Liebert, Inc., publishers
01.09.2025
|
Subjects | |
Online Access | Get full text |
ISSN | 1557-8666 1557-8666 |
DOI | 10.1089/cmb.2024.0884 |
Cover
Abstract | Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at
https://github.com/wenwenmin/VTrans
and
https://doi.org/10.5281/zenodo.14166580
. |
---|---|
AbstractList | Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580. Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580 . Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580.Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580. |
Author | Min, Wenwen Zhu, Fangfang Shi, Xinyuan |
Author_xml | – sequence: 1 givenname: Xinyuan surname: Shi fullname: Shi, Xinyuan – sequence: 2 givenname: Fangfang surname: Zhu fullname: Zhu, Fangfang – sequence: 3 givenname: Wenwen orcidid: 0000-0002-2558-2911 surname: Min fullname: Min, Wenwen |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/40295093$$D View this record in MEDLINE/PubMed |
BookMark | eNqFkDtPwzAQgC1URGlhZEUeWVLsOH6ErZTykFqVoXSNnOQsghKn2OnQf49DC2Jjutd3J903QgPbWkDoipIJJSq9LZp8EpM4mRClkhN0TjmXkRJCDP7kQzTy_oMQygSRZ2iYkDjlJGXnaLVZO239HZ7izXQe3WsPJX51EIV2ZUP-PTata8DhJXTvbYlDhZdV4dq8ahvAD7rTeGp1vfeVv0CnRtceLo9xjN4e5-vZc7RYPb3MpouoiGPWRTQRheCmZGBkQiRIzrkWqVZSEp2kuqCElyxWPDWgBDFxSROSG56XYU8ww8bo5nB369rPHfguaypfQF1rC-3OZ4ymQhAlw5NjdH1Ed3kDZbZ1VaPdPvuREIDoAISXvHdgfhFKsl5yFiRnveSslxx4duB7RltbV5CD6_7Z-gLj7n3b |
Cites_doi | 10.1038/s41598-020-63159-5 10.1093/nar/gkad801 10.1016/S1470-2045(18)30952-5 10.1126/scisignal.2004088 10.3389/fmicb.2017.00752 10.1186/s12859-018-2205-3 10.1186/s12874-018-0482-1 10.1038/s41586-020-2095-1 10.1038/nature09922 10.1146/annurev-genom-091416-035324 10.1038/s41598-021-83184-2 10.1038/nrg3182 10.1093/bioinformatics/btad286 10.1038/s43705-022-00182-9 10.5114/wo.2014.47136 |
ContentType | Journal Article |
Copyright | 2025, Mary Ann Liebert, Inc., publishers |
Copyright_xml | – notice: 2025, Mary Ann Liebert, Inc., publishers |
DBID | AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8 |
DOI | 10.1089/cmb.2024.0884 |
DatabaseName | CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
DatabaseTitleList | MEDLINE MEDLINE - Academic |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology Mathematics |
EISSN | 1557-8666 |
EndPage | 864 |
ExternalDocumentID | 40295093 10_1089_cmb_2024_0884 |
Genre | Journal Article |
GroupedDBID | --- 0R~ 29K 4.4 53G 5GY ABBKN ACGFO ADBBV AENEX ALMA_UNASSIGNED_HOLDINGS BAWUL BNQNF CS3 D-I DIK DU5 EBS F5P IAO IHR IM4 MV1 NQHIM O9- P2P RML RNS SCNPE TN5 TR2 UE5 AAYXX CITATION 34G 39C ABEFU AI. CAG CGR COF CUY CVF ECM EIF EJD IER IGS ITC NPM R.V RMSOB VH1 7X8 |
ID | FETCH-LOGICAL-c223t-146c65fd3ef7407e7555a69a8770a49ac105d32859fe860f2d140bf5bd6c663f3 |
ISSN | 1557-8666 |
IngestDate | Fri Sep 05 17:33:32 EDT 2025 Wed Sep 03 02:28:26 EDT 2025 Wed Aug 27 16:40:51 EDT 2025 Wed Aug 27 07:13:53 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 9 |
Keywords | saliency map Transformer multihead-co-attention variational autoencoder microbiome data pretraining |
Language | English |
License | https://www.liebertpub.com/nv/resources-tools/text-and-data-mining-policy/121 |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c223t-146c65fd3ef7407e7555a69a8770a49ac105d32859fe860f2d140bf5bd6c663f3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ORCID | 0000-0002-2558-2911 |
PMID | 40295093 |
PQID | 3196608709 |
PQPubID | 23479 |
PageCount | 780 |
ParticipantIDs | proquest_miscellaneous_3196608709 pubmed_primary_40295093 crossref_primary_10_1089_cmb_2024_0884 maryannliebert_primary_10_1089_cmb_2024_0884 |
PublicationCentury | 2000 |
PublicationDate | 2025-Sep |
PublicationDateYYYYMMDD | 2025-09-01 |
PublicationDate_xml | – month: 09 year: 2025 text: 2025-Sep |
PublicationDecade | 2020 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | Journal of computational biology |
PublicationTitleAlternate | J Comput Biol |
PublicationYear | 2025 |
Publisher | Mary Ann Liebert, Inc., publishers |
Publisher_xml | – name: Mary Ann Liebert, Inc., publishers |
References | B20 B23 B15 B26 Vaswani A (B27) 2017; 30 B16 B28 B29 B19 B1 B2 B3 Lu J (B14) 2016; 29 B4 B5 B6 B8 |
References_xml | – ident: B19 doi: 10.1038/s41598-020-63159-5 – ident: B29 doi: 10.1093/nar/gkad801 – ident: B15 doi: 10.1016/S1470-2045(18)30952-5 – ident: B5 doi: 10.1126/scisignal.2004088 – ident: B16 doi: 10.3389/fmicb.2017.00752 – ident: B20 doi: 10.1186/s12859-018-2205-3 – ident: B8 doi: 10.1186/s12874-018-0482-1 – ident: B23 doi: 10.1038/s41586-020-2095-1 – volume: 30 start-page: 5998 year: 2017 ident: B27 publication-title: Advances In Neural Information Processing Systems – ident: B28 doi: 10.1038/nature09922 – ident: B2 doi: 10.1146/annurev-genom-091416-035324 – volume: 29 year: 2016 ident: B14 publication-title: Advances in Neural Information Processing Systems – ident: B1 doi: 10.1038/s41598-021-83184-2 – ident: B3 doi: 10.1038/nrg3182 – ident: B4 doi: 10.1093/bioinformatics/btad286 – ident: B6 doi: 10.1038/s43705-022-00182-9 – ident: B26 doi: 10.5114/wo.2014.47136 |
SSID | ssj0013607 |
Score | 2.4410923 |
Snippet | Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of... |
SourceID | proquest pubmed crossref maryannliebert |
SourceType | Aggregation Database Index Database Publisher |
StartPage | 85 |
SubjectTerms | Algorithms Computational Biology - methods Deep Learning Humans Microbiota - genetics Neoplasms - genetics Neoplasms - microbiology Neoplasms - mortality Original Articles |
Title | VTrans: A VAE-Based Pre-Trained Transformer Method for Microbiome Data Analysis |
URI | https://www.liebertpub.com/doi/abs/10.1089/cmb.2024.0884 https://www.ncbi.nlm.nih.gov/pubmed/40295093 https://www.proquest.com/docview/3196608709 |
Volume | 32 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELagCAkkKiivLQ8ZCXGBtGniR9LbgrYq0G0v2bLiYtmODT00RTQr1P56xnbidFUqCpfIcZKJNJ81L3tmEHpdUE3rLRdxr02akFLTRKaaJbni3MIzy5WLd0z32e6MfJrT-dAV0GeXtGpDn_8xr-R_UIU5wNVlyf4DspEoTMAY8IUrIAzXa2F86DVNSC4_HE-S96CS3Nl_k1Su8wOMq94uNa5cj2sW7c8VTo9C_aVjA7C3MpYmucJU1b71Qx827Oo2xdiMbwv8dn7UnC2Gpfb1-8LbxbL5ZmWnHX3qoZdyX0zzq8tB6yIOGY1HqqKQpKDZGOtKWF-euySW08JVNdXHChzyjGyAZCOD_un33PcPxM5sb09Uk3l1E93KOA_77h8_D9tCzOe_x591RVOB_OYS8SUj457LAZRNAza9O6t-tSvhTYrqPlrtGIzHAdgH6IZp1tDt0B30bA3dncaSuqcP0UEAexuPcYQaX4AaX4AaB6gx3OEBauygxj3Uj9BsZ1J92E26bhiJBhOuTUClaUZtnRvLwQs3nFIqWSkLzlNJSqnBUq5zV4_QmoKlNqvBd1aWqhq-Y7nNH6OV5qQxTxHeKiz4sUTl8DIhppAwVlmhsgzIFCUZoTc9-8SPUPRE-MMKRSmAz8LxWTg-j9C7Zeb-7fVXPesFSDG3NSUbc7I4FU4RsBR0RzlCTwImkRRJsxLM2nz9Gl8_Q3eGFfscrbQ_F-YFWI2teulX0m9FimoV |
linkProvider | Flying Publisher |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=VTrans%3A+A+VAE-Based+Pre-Trained+Transformer+Method+for+Microbiome+Data+Analysis&rft.jtitle=Journal+of+computational+biology&rft.au=Shi%2C+Xinyuan&rft.au=Zhu%2C+Fangfang&rft.au=Min%2C+Wenwen&rft.date=2025-09-01&rft.issn=1557-8666&rft.eissn=1557-8666&rft_id=info:doi/10.1089%2Fcmb.2024.0884&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1557-8666&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1557-8666&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1557-8666&client=summon |