VTrans: A VAE-Based Pre-Trained Transformer Method for Microbiome Data Analysis

Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient surv...

Full description

Saved in:
Bibliographic Details
Published inJournal of computational biology Vol. 32; no. 9; pp. 85 - 864
Main Authors Shi, Xinyuan, Zhu, Fangfang, Min, Wenwen
Format Journal Article
LanguageEnglish
Published United States Mary Ann Liebert, Inc., publishers 01.09.2025
Subjects
Online AccessGet full text
ISSN1557-8666
1557-8666
DOI10.1089/cmb.2024.0884

Cover

Abstract Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580 .
AbstractList Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580.
Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580 .
Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580.Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580.
Author Min, Wenwen
Zhu, Fangfang
Shi, Xinyuan
Author_xml – sequence: 1
  givenname: Xinyuan
  surname: Shi
  fullname: Shi, Xinyuan
– sequence: 2
  givenname: Fangfang
  surname: Zhu
  fullname: Zhu, Fangfang
– sequence: 3
  givenname: Wenwen
  orcidid: 0000-0002-2558-2911
  surname: Min
  fullname: Min, Wenwen
BackLink https://www.ncbi.nlm.nih.gov/pubmed/40295093$$D View this record in MEDLINE/PubMed
BookMark eNqFkDtPwzAQgC1URGlhZEUeWVLsOH6ErZTykFqVoXSNnOQsghKn2OnQf49DC2Jjutd3J903QgPbWkDoipIJJSq9LZp8EpM4mRClkhN0TjmXkRJCDP7kQzTy_oMQygSRZ2iYkDjlJGXnaLVZO239HZ7izXQe3WsPJX51EIV2ZUP-PTata8DhJXTvbYlDhZdV4dq8ahvAD7rTeGp1vfeVv0CnRtceLo9xjN4e5-vZc7RYPb3MpouoiGPWRTQRheCmZGBkQiRIzrkWqVZSEp2kuqCElyxWPDWgBDFxSROSG56XYU8ww8bo5nB369rPHfguaypfQF1rC-3OZ4ymQhAlw5NjdH1Ed3kDZbZ1VaPdPvuREIDoAISXvHdgfhFKsl5yFiRnveSslxx4duB7RltbV5CD6_7Z-gLj7n3b
Cites_doi 10.1038/s41598-020-63159-5
10.1093/nar/gkad801
10.1016/S1470-2045(18)30952-5
10.1126/scisignal.2004088
10.3389/fmicb.2017.00752
10.1186/s12859-018-2205-3
10.1186/s12874-018-0482-1
10.1038/s41586-020-2095-1
10.1038/nature09922
10.1146/annurev-genom-091416-035324
10.1038/s41598-021-83184-2
10.1038/nrg3182
10.1093/bioinformatics/btad286
10.1038/s43705-022-00182-9
10.5114/wo.2014.47136
ContentType Journal Article
Copyright 2025, Mary Ann Liebert, Inc., publishers
Copyright_xml – notice: 2025, Mary Ann Liebert, Inc., publishers
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
DOI 10.1089/cmb.2024.0884
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE

MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
Mathematics
EISSN 1557-8666
EndPage 864
ExternalDocumentID 40295093
10_1089_cmb_2024_0884
Genre Journal Article
GroupedDBID ---
0R~
29K
4.4
53G
5GY
ABBKN
ACGFO
ADBBV
AENEX
ALMA_UNASSIGNED_HOLDINGS
BAWUL
BNQNF
CS3
D-I
DIK
DU5
EBS
F5P
IAO
IHR
IM4
MV1
NQHIM
O9-
P2P
RML
RNS
SCNPE
TN5
TR2
UE5
AAYXX
CITATION
34G
39C
ABEFU
AI.
CAG
CGR
COF
CUY
CVF
ECM
EIF
EJD
IER
IGS
ITC
NPM
R.V
RMSOB
VH1
7X8
ID FETCH-LOGICAL-c223t-146c65fd3ef7407e7555a69a8770a49ac105d32859fe860f2d140bf5bd6c663f3
ISSN 1557-8666
IngestDate Fri Sep 05 17:33:32 EDT 2025
Wed Sep 03 02:28:26 EDT 2025
Wed Aug 27 16:40:51 EDT 2025
Wed Aug 27 07:13:53 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 9
Keywords saliency map
Transformer
multihead-co-attention
variational autoencoder
microbiome data
pretraining
Language English
License https://www.liebertpub.com/nv/resources-tools/text-and-data-mining-policy/121
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c223t-146c65fd3ef7407e7555a69a8770a49ac105d32859fe860f2d140bf5bd6c663f3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0002-2558-2911
PMID 40295093
PQID 3196608709
PQPubID 23479
PageCount 780
ParticipantIDs proquest_miscellaneous_3196608709
pubmed_primary_40295093
crossref_primary_10_1089_cmb_2024_0884
maryannliebert_primary_10_1089_cmb_2024_0884
PublicationCentury 2000
PublicationDate 2025-Sep
PublicationDateYYYYMMDD 2025-09-01
PublicationDate_xml – month: 09
  year: 2025
  text: 2025-Sep
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Journal of computational biology
PublicationTitleAlternate J Comput Biol
PublicationYear 2025
Publisher Mary Ann Liebert, Inc., publishers
Publisher_xml – name: Mary Ann Liebert, Inc., publishers
References B20
B23
B15
B26
Vaswani A (B27) 2017; 30
B16
B28
B29
B19
B1
B2
B3
Lu J (B14) 2016; 29
B4
B5
B6
B8
References_xml – ident: B19
  doi: 10.1038/s41598-020-63159-5
– ident: B29
  doi: 10.1093/nar/gkad801
– ident: B15
  doi: 10.1016/S1470-2045(18)30952-5
– ident: B5
  doi: 10.1126/scisignal.2004088
– ident: B16
  doi: 10.3389/fmicb.2017.00752
– ident: B20
  doi: 10.1186/s12859-018-2205-3
– ident: B8
  doi: 10.1186/s12874-018-0482-1
– ident: B23
  doi: 10.1038/s41586-020-2095-1
– volume: 30
  start-page: 5998
  year: 2017
  ident: B27
  publication-title: Advances In Neural Information Processing Systems
– ident: B28
  doi: 10.1038/nature09922
– ident: B2
  doi: 10.1146/annurev-genom-091416-035324
– volume: 29
  year: 2016
  ident: B14
  publication-title: Advances in Neural Information Processing Systems
– ident: B1
  doi: 10.1038/s41598-021-83184-2
– ident: B3
  doi: 10.1038/nrg3182
– ident: B4
  doi: 10.1093/bioinformatics/btad286
– ident: B6
  doi: 10.1038/s43705-022-00182-9
– ident: B26
  doi: 10.5114/wo.2014.47136
SSID ssj0013607
Score 2.4410923
Snippet Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of...
SourceID proquest
pubmed
crossref
maryannliebert
SourceType Aggregation Database
Index Database
Publisher
StartPage 85
SubjectTerms Algorithms
Computational Biology - methods
Deep Learning
Humans
Microbiota - genetics
Neoplasms - genetics
Neoplasms - microbiology
Neoplasms - mortality
Original Articles
Title VTrans: A VAE-Based Pre-Trained Transformer Method for Microbiome Data Analysis
URI https://www.liebertpub.com/doi/abs/10.1089/cmb.2024.0884
https://www.ncbi.nlm.nih.gov/pubmed/40295093
https://www.proquest.com/docview/3196608709
Volume 32
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELagCAkkKiivLQ8ZCXGBtGniR9LbgrYq0G0v2bLiYtmODT00RTQr1P56xnbidFUqCpfIcZKJNJ81L3tmEHpdUE3rLRdxr02akFLTRKaaJbni3MIzy5WLd0z32e6MfJrT-dAV0GeXtGpDn_8xr-R_UIU5wNVlyf4DspEoTMAY8IUrIAzXa2F86DVNSC4_HE-S96CS3Nl_k1Su8wOMq94uNa5cj2sW7c8VTo9C_aVjA7C3MpYmucJU1b71Qx827Oo2xdiMbwv8dn7UnC2Gpfb1-8LbxbL5ZmWnHX3qoZdyX0zzq8tB6yIOGY1HqqKQpKDZGOtKWF-euySW08JVNdXHChzyjGyAZCOD_un33PcPxM5sb09Uk3l1E93KOA_77h8_D9tCzOe_x591RVOB_OYS8SUj457LAZRNAza9O6t-tSvhTYrqPlrtGIzHAdgH6IZp1tDt0B30bA3dncaSuqcP0UEAexuPcYQaX4AaX4AaB6gx3OEBauygxj3Uj9BsZ1J92E26bhiJBhOuTUClaUZtnRvLwQs3nFIqWSkLzlNJSqnBUq5zV4_QmoKlNqvBd1aWqhq-Y7nNH6OV5qQxTxHeKiz4sUTl8DIhppAwVlmhsgzIFCUZoTc9-8SPUPRE-MMKRSmAz8LxWTg-j9C7Zeb-7fVXPesFSDG3NSUbc7I4FU4RsBR0RzlCTwImkRRJsxLM2nz9Gl8_Q3eGFfscrbQ_F-YFWI2teulX0m9FimoV
linkProvider Flying Publisher
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=VTrans%3A+A+VAE-Based+Pre-Trained+Transformer+Method+for+Microbiome+Data+Analysis&rft.jtitle=Journal+of+computational+biology&rft.au=Shi%2C+Xinyuan&rft.au=Zhu%2C+Fangfang&rft.au=Min%2C+Wenwen&rft.date=2025-09-01&rft.issn=1557-8666&rft.eissn=1557-8666&rft_id=info:doi/10.1089%2Fcmb.2024.0884&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1557-8666&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1557-8666&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1557-8666&client=summon