VTrans: A VAE-Based Pre-Trained Transformer Method for Microbiome Data Analysis

Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient surv...

Full description

Saved in:

Bibliographic Details
Published in	Journal of computational biology Vol. 32; no. 9; pp. 85 - 864
Main Authors	Shi, Xinyuan, Zhu, Fangfang, Min, Wenwen
Format	Journal Article
Language	English
Published	United States Mary Ann Liebert, Inc., publishers 01.09.2025
Subjects	Algorithms Computational Biology - methods Deep Learning Humans Microbiota - genetics Neoplasms - genetics Neoplasms - microbiology Neoplasms - mortality Original Articles saliency map Transformer multihead-co-attention variational autoencoder microbiome data pretraining
Online Access	Get full text
ISSN	1557-8666 1557-8666
DOI	10.1089/cmb.2024.0884

Cover

Abstract	Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580 .
AbstractList	Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580. Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580 . Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580.Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580.
Author	Min, Wenwen Zhu, Fangfang Shi, Xinyuan
Author_xml	– sequence: 1 givenname: Xinyuan surname: Shi fullname: Shi, Xinyuan – sequence: 2 givenname: Fangfang surname: Zhu fullname: Zhu, Fangfang – sequence: 3 givenname: Wenwen orcidid: 0000-0002-2558-2911 surname: Min fullname: Min, Wenwen
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/40295093$$D View this record in MEDLINE/PubMed
BookMark	eNqFkDtPwzAQgC1URGlhZEUeWVLsOH6ErZTykFqVoXSNnOQsghKn2OnQf49DC2Jjutd3J903QgPbWkDoipIJJSq9LZp8EpM4mRClkhN0TjmXkRJCDP7kQzTy_oMQygSRZ2iYkDjlJGXnaLVZO239HZ7izXQe3WsPJX51EIV2ZUP-PTata8DhJXTvbYlDhZdV4dq8ahvAD7rTeGp1vfeVv0CnRtceLo9xjN4e5-vZc7RYPb3MpouoiGPWRTQRheCmZGBkQiRIzrkWqVZSEp2kuqCElyxWPDWgBDFxSROSG56XYU8ww8bo5nB369rPHfguaypfQF1rC-3OZ4ymQhAlw5NjdH1Ed3kDZbZ1VaPdPvuREIDoAISXvHdgfhFKsl5yFiRnveSslxx4duB7RltbV5CD6_7Z-gLj7n3b
Cites_doi	10.1038/s41598-020-63159-5 10.1093/nar/gkad801 10.1016/S1470-2045(18)30952-5 10.1126/scisignal.2004088 10.3389/fmicb.2017.00752 10.1186/s12859-018-2205-3 10.1186/s12874-018-0482-1 10.1038/s41586-020-2095-1 10.1038/nature09922 10.1146/annurev-genom-091416-035324 10.1038/s41598-021-83184-2 10.1038/nrg3182 10.1093/bioinformatics/btad286 10.1038/s43705-022-00182-9 10.5114/wo.2014.47136
ContentType	Journal Article
Copyright	2025, Mary Ann Liebert, Inc., publishers
Copyright_xml	– notice: 2025, Mary Ann Liebert, Inc., publishers
DBID	AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8
DOI	10.1089/cmb.2024.0884
DatabaseName	CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic
DatabaseTitle	CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic
DatabaseTitleList	MEDLINE MEDLINE - Academic
Database_xml	– sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Biology Mathematics
EISSN	1557-8666
EndPage	864
ExternalDocumentID	40295093 10_1089_cmb_2024_0884
Genre	Journal Article
GroupedDBID	--- 0R~ 29K 4.4 53G 5GY ABBKN ACGFO ADBBV AENEX ALMA_UNASSIGNED_HOLDINGS BAWUL BNQNF CS3 D-I DIK DU5 EBS F5P IAO IHR IM4 MV1 NQHIM O9- P2P RML RNS SCNPE TN5 TR2 UE5 AAYXX CITATION 34G 39C ABEFU AI. CAG CGR COF CUY CVF ECM EIF EJD IER IGS ITC NPM R.V RMSOB VH1 7X8
ID	FETCH-LOGICAL-c223t-146c65fd3ef7407e7555a69a8770a49ac105d32859fe860f2d140bf5bd6c663f3
ISSN	1557-8666
IngestDate	Fri Sep 05 17:33:32 EDT 2025 Wed Sep 03 02:28:26 EDT 2025 Wed Aug 27 16:40:51 EDT 2025 Wed Aug 27 07:13:53 EDT 2025
IsPeerReviewed	true
IsScholarly	true
Issue	9
Keywords	saliency map Transformer multihead-co-attention variational autoencoder microbiome data pretraining
Language	English
License	https://www.liebertpub.com/nv/resources-tools/text-and-data-mining-policy/121
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-c223t-146c65fd3ef7407e7555a69a8770a49ac105d32859fe860f2d140bf5bd6c663f3
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ORCID	0000-0002-2558-2911
PMID	40295093
PQID	3196608709
PQPubID	23479
PageCount	780
ParticipantIDs	proquest_miscellaneous_3196608709 pubmed_primary_40295093 crossref_primary_10_1089_cmb_2024_0884 maryannliebert_primary_10_1089_cmb_2024_0884
PublicationCentury	2000
PublicationDate	2025-Sep
PublicationDateYYYYMMDD	2025-09-01
PublicationDate_xml	– month: 09 year: 2025 text: 2025-Sep
PublicationDecade	2020
PublicationPlace	United States
PublicationPlace_xml	– name: United States
PublicationTitle	Journal of computational biology
PublicationTitleAlternate	J Comput Biol
PublicationYear	2025
Publisher	Mary Ann Liebert, Inc., publishers
Publisher_xml	– name: Mary Ann Liebert, Inc., publishers
References	B20 B23 B15 B26 Vaswani A (B27) 2017; 30 B16 B28 B29 B19 B1 B2 B3 Lu J (B14) 2016; 29 B4 B5 B6 B8
References_xml	– ident: B19 doi: 10.1038/s41598-020-63159-5 – ident: B29 doi: 10.1093/nar/gkad801 – ident: B15 doi: 10.1016/S1470-2045(18)30952-5 – ident: B5 doi: 10.1126/scisignal.2004088 – ident: B16 doi: 10.3389/fmicb.2017.00752 – ident: B20 doi: 10.1186/s12859-018-2205-3 – ident: B8 doi: 10.1186/s12874-018-0482-1 – ident: B23 doi: 10.1038/s41586-020-2095-1 – volume: 30 start-page: 5998 year: 2017 ident: B27 publication-title: Advances In Neural Information Processing Systems – ident: B28 doi: 10.1038/nature09922 – ident: B2 doi: 10.1146/annurev-genom-091416-035324 – volume: 29 year: 2016 ident: B14 publication-title: Advances in Neural Information Processing Systems – ident: B1 doi: 10.1038/s41598-021-83184-2 – ident: B3 doi: 10.1038/nrg3182 – ident: B4 doi: 10.1093/bioinformatics/btad286 – ident: B6 doi: 10.1038/s43705-022-00182-9 – ident: B26 doi: 10.5114/wo.2014.47136
SSID	ssj0013607
Score	2.4410923
Snippet	Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of...
SourceID	proquest pubmed crossref maryannliebert
SourceType	Aggregation Database Index Database Publisher
StartPage	85
SubjectTerms	Algorithms Computational Biology - methods Deep Learning Humans Microbiota - genetics Neoplasms - genetics Neoplasms - microbiology Neoplasms - mortality Original Articles
Title	VTrans: A VAE-Based Pre-Trained Transformer Method for Microbiome Data Analysis
URI	https://www.liebertpub.com/doi/abs/10.1089/cmb.2024.0884 https://www.ncbi.nlm.nih.gov/pubmed/40295093 https://www.proquest.com/docview/3196608709
Volume	32
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELagCAkkKiivLQ8ZCXGBtGniR9LbgrYq0G0v2bLiYtmODT00RTQr1P56xnbidFUqCpfIcZKJNJ81L3tmEHpdUE3rLRdxr02akFLTRKaaJbni3MIzy5WLd0z32e6MfJrT-dAV0GeXtGpDn_8xr-R_UIU5wNVlyf4DspEoTMAY8IUrIAzXa2F86DVNSC4_HE-S96CS3Nl_k1Su8wOMq94uNa5cj2sW7c8VTo9C_aVjA7C3MpYmucJU1b71Qx827Oo2xdiMbwv8dn7UnC2Gpfb1-8LbxbL5ZmWnHX3qoZdyX0zzq8tB6yIOGY1HqqKQpKDZGOtKWF-euySW08JVNdXHChzyjGyAZCOD_un33PcPxM5sb09Uk3l1E93KOA_77h8_D9tCzOe_x591RVOB_OYS8SUj457LAZRNAza9O6t-tSvhTYrqPlrtGIzHAdgH6IZp1tDt0B30bA3dncaSuqcP0UEAexuPcYQaX4AaX4AaB6gx3OEBauygxj3Uj9BsZ1J92E26bhiJBhOuTUClaUZtnRvLwQs3nFIqWSkLzlNJSqnBUq5zV4_QmoKlNqvBd1aWqhq-Y7nNH6OV5qQxTxHeKiz4sUTl8DIhppAwVlmhsgzIFCUZoTc9-8SPUPRE-MMKRSmAz8LxWTg-j9C7Zeb-7fVXPesFSDG3NSUbc7I4FU4RsBR0RzlCTwImkRRJsxLM2nz9Gl8_Q3eGFfscrbQ_F-YFWI2teulX0m9FimoV
linkProvider	Flying Publisher
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=VTrans%3A+A+VAE-Based+Pre-Trained+Transformer+Method+for+Microbiome+Data+Analysis&rft.jtitle=Journal+of+computational+biology&rft.au=Shi%2C+Xinyuan&rft.au=Zhu%2C+Fangfang&rft.au=Min%2C+Wenwen&rft.date=2025-09-01&rft.issn=1557-8666&rft.eissn=1557-8666&rft_id=info:doi/10.1089%2Fcmb.2024.0884&rft.externalDBID=NO_FULL_TEXT
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1557-8666&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1557-8666&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1557-8666&client=summon