Hidden challenges in evaluating spillover risk of zoonotic viruses using machine learning models

Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. Methods I...

Full description

Saved in:
Bibliographic Details
Published inCommunications medicine Vol. 5; no. 1; pp. 187 - 10
Main Authors Kawasaki, Junna, Suzuki, Tadaki, Hamada, Michiaki
Format Journal Article
LanguageEnglish
Published London Nature Publishing Group UK 20.05.2025
Springer Nature B.V
Nature Portfolio
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. Methods In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences. Results Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2. Conclusions Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats. Plain language summary To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness. Kawasaki et al. construct a dataset covering 26 viral families and use large language models pre-trained on nucleotide sequences to identify zoonotic viruses with human infectivity potential. High predictive performance was obtained, even with partial viral sequences, but not all zoonotic lineages could be identified.
AbstractList Abstract Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. Methods In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences. Results Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2. Conclusions Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.
Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences. Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model's applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2. Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.
BackgroundMachine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses.MethodsIn this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences.ResultsHere we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2.ConclusionsOur study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.Plain language summaryTo prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness.
To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness. Kawasaki et al. construct a dataset covering 26 viral families and use large language models pre-trained on nucleotide sequences to identify zoonotic viruses with human infectivity potential. High predictive performance was obtained, even with partial viral sequences, but not all zoonotic lineages could be identified.
Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses.BACKGROUNDMachine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses.In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences.METHODSIn this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences.Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model's applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2.RESULTSHere we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model's applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2.Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.CONCLUSIONSOur study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.
Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. Methods In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences. Results Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2. Conclusions Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats. Plain language summary To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness. Kawasaki et al. construct a dataset covering 26 viral families and use large language models pre-trained on nucleotide sequences to identify zoonotic viruses with human infectivity potential. High predictive performance was obtained, even with partial viral sequences, but not all zoonotic lineages could be identified.
ArticleNumber 187
Author Kawasaki, Junna
Suzuki, Tadaki
Hamada, Michiaki
Author_xml – sequence: 1
  givenname: Junna
  orcidid: 0000-0002-6609-5300
  surname: Kawasaki
  fullname: Kawasaki, Junna
  email: jrt13mpmuk@gmail.com
  organization: Faculty of Science and Engineering, Waseda University, Department of Infectious Disease Pathobiology, Graduate School of Medicine, Chiba University
– sequence: 2
  givenname: Tadaki
  orcidid: 0000-0002-3820-9542
  surname: Suzuki
  fullname: Suzuki, Tadaki
  organization: Department of Infectious Disease Pathobiology, Graduate School of Medicine, Chiba University, Department of Infectious Disease Pathology, National Institute of Infectious Diseases, Japan Institute for Health Security
– sequence: 3
  givenname: Michiaki
  orcidid: 0000-0001-9466-1034
  surname: Hamada
  fullname: Hamada, Michiaki
  email: mhamada@waseda.jp
  organization: Faculty of Science and Engineering, Waseda University, Cellular and Molecular Biotechnology Research Institute (CMB), National Institute of Advanced Industrial Science and Technology (AIST), Graduate School of Medicine, Nippon Medical School
BackLink https://www.ncbi.nlm.nih.gov/pubmed/40394176$$D View this record in MEDLINE/PubMed
BookMark eNp9kktv1DAUhSNUREvpH2CBIrFhE_ArfqwQqoBWqsQGJHbGia9nPHjswU6mgl-POymlZcHK1vV3zr3yPU-bo5giNM1zjF5jROWbwqjseYdI3yGkEO2uHzUnRFDUcc6-Ht27HzdnpWwQQkRwxSR60hwzRBXDgp803y68tRDbcW1CgLiC0vrYwt6E2Uw-rtqy8yGkPeQ2-_K9Ta79lVJMkx_bvc9zqYK53IBbM659hDaAyfFQSBZCedY8diYUOLs9T5svH95_Pr_orj59vDx_d9WNTLGpE0pYIe2AsePS9oJTTCQeLbCRS2cMc8I4xwnmgigsuWWWGekGS4AqSwk9bS4XX5vMRu-y35r8Uyfj9aGQ8kqbXKcOoJ1AAOBAKGmYEnzoiZEEnOFY9YOE6vV28drNwxbsCHHKJjwwffgS_Vqv0l5jghQRBFWHV7cOOf2YoUx668sIIZgIaS6aEsQJ5bRnFX35D7pJc471rw4UpYgLWqkX90e6m-XPJitAFmDMqZQM7g7BSN8kRi-J0TUx-pAYfV1FdBGVCtfl57-9_6P6DU9RxT0
Cites_doi 10.1128/mBio.01638-21
10.1038/nature24039
10.1128/JVI.02005-07
10.1098/rstb.2020.0358
10.1038/s41586-023-06617-0
10.1371/journal.ppat.1011116
10.1016/S0168-9525(00)02024-2
10.1126/science.aap7463
10.3390/v8030066
10.1016/j.coviro.2023.101346
10.3201/eid3007.240508
10.1093/nar/gkx932
10.1093/bioinformatics/bts565
10.1093/molbev/msx281
10.1186/s12859-023-05582-9
10.1101/2024.05.01.591751
10.1093/bioinformatics/btl158
10.15585/mmwr.mm7321e1
10.1038/s41564-024-01879-4
10.5281/zenodo.11103091
10.1093/bioinformatics/bty407
10.1038/s41586-022-04532-4
10.1016/j.virusres.2017.10.014
10.1093/bioinformatics/btab083
10.1016/j.tim.2015.11.002
10.1109/BIBM58861.2023.10385501
10.1093/nar/gku1207
10.5281/zenodo.11102793
10.1093/molbev/mst010
10.1126/science.aap9072
10.1093/gbe/evac018
10.1093/bib/bbac204
10.5281/zenodo.11103056
10.1038/s41586-021-04332-2
10.1186/1471-2105-10-421
10.1002/cpbi.96
10.1371/journal.pbio.3001135
10.1038/nmeth.4285
10.1111/tbed.13314
10.1093/bioinformatics/btaa705
10.1371/journal.pbio.3001390
10.1093/molbev/msaa015
10.1073/pnas.2202871119
10.1371/journal.pcbi.1012597
10.1093/nargab/lqab004
10.5281/zenodo.11103079
ContentType Journal Article
Copyright The Author(s) 2025
2025. The Author(s).
Copyright Springer Nature B.V. Dec 2025
The Author(s) 2025 2025
Copyright_xml – notice: The Author(s) 2025
– notice: 2025. The Author(s).
– notice: Copyright Springer Nature B.V. Dec 2025
– notice: The Author(s) 2025 2025
DBID C6C
AAYXX
CITATION
NPM
3V.
7X7
7XB
88C
88E
8C1
8FI
8FJ
8FK
ABUWG
AFKRA
AZQEC
BENPR
CCPQU
COVID
DWQXO
FYUFA
GHDGH
K9.
M0S
M0T
M1P
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQQKQ
PQUKI
PRINS
7X8
5PM
DOA
DOI 10.1038/s43856-025-00903-w
DatabaseName Springer Nature OA Free Journals
CrossRef
PubMed
ProQuest Central (Corporate)
ProQuest Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Healthcare Administration Database (Alumni)
Medical Database (Alumni Edition)
Public Health Database
ProQuest Hospital Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
ProQuest Central Essentials
ProQuest Central
ProQuest One
Coronavirus Research Database
ProQuest Central
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Health & Medical Complete (Alumni)
ProQuest Health & Medical Collection
Healthcare Administration Database
Medical Database
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
PubMed
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Central China
ProQuest Central
ProQuest Health & Medical Research Collection
Health Research Premium Collection
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Health & Medical Research Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
ProQuest Public Health
ProQuest One Academic Eastern Edition
ProQuest Health Management
Coronavirus Research Database
ProQuest Hospital Collection
Health Research Premium Collection (Alumni)
ProQuest Hospital Collection (Alumni)
ProQuest Health & Medical Complete
ProQuest Medical Library
ProQuest One Academic UKI Edition
ProQuest Health Management (Alumni Edition)
ProQuest One Academic
ProQuest One Academic (New)
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList
PubMed
Publicly Available Content Database

MEDLINE - Academic

Database_xml – sequence: 1
  dbid: C6C
  name: Springer Nature OA Free Journals
  url: http://www.springeropen.com/
  sourceTypes: Publisher
– sequence: 2
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 3
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 4
  dbid: BENPR
  name: ProQuest Central
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
EISSN 2730-664X
EndPage 10
ExternalDocumentID oai_doaj_org_article_f70eeefe798a4976b52a82efa6195b8e
PMC12092720
40394176
10_1038_s43856_025_00903_w
Genre Journal Article
GrantInformation_xml – fundername: MEXT | Japan Society for the Promotion of Science (JSPS)
  grantid: JP22KJ2901
  funderid: https://doi.org/10.13039/501100001691
– fundername: MEXT | JST | Precursory Research for Embryonic Science and Technology (PRESTO)
  grantid: JPMJPR23R4
  funderid: https://doi.org/10.13039/501100009023
– fundername: MEXT | JST | Precursory Research for Embryonic Science and Technology (PRESTO)
  grantid: JPMJPR23R4
– fundername: MEXT | Japan Society for the Promotion of Science (JSPS)
  grantid: JP22KJ2901
GroupedDBID 0R~
53G
7X7
88E
8C1
8FI
8FJ
AAJSJ
AASML
ABUWG
ACLNF
AFKRA
ALIPV
ALMA_UNASSIGNED_HOLDINGS
BENPR
C6C
CCPQU
EBLON
FYUFA
GROUPED_DOAJ
HMCUK
M0T
M1P
M~E
NAO
OK1
PGMZT
PHGZM
PHGZT
PIMPY
PSQYO
RPM
SNYQT
UKHRP
AAYXX
CITATION
NPM
PJZUB
PPXIY
3V.
7XB
8FK
AARCD
AZQEC
COVID
DWQXO
K9.
PKEHL
PQEST
PQQKQ
PQUKI
PRINS
7X8
5PM
PUEGO
ID FETCH-LOGICAL-c494t-797d78db11f68d57631281cde4c68faa4f7aff6216729186d4d4a8fbd2e39d323
IEDL.DBID C6C
ISSN 2730-664X
IngestDate Wed Aug 27 01:11:33 EDT 2025
Thu Aug 21 18:30:44 EDT 2025
Fri Jul 11 17:23:57 EDT 2025
Wed Aug 13 11:26:42 EDT 2025
Mon Jul 21 06:07:07 EDT 2025
Sun Jul 06 05:05:44 EDT 2025
Wed May 21 12:01:58 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
License 2025. The Author(s).
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c494t-797d78db11f68d57631281cde4c68faa4f7aff6216729186d4d4a8fbd2e39d323
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0002-6609-5300
0000-0002-3820-9542
0000-0001-9466-1034
OpenAccessLink https://www.nature.com/articles/s43856-025-00903-w
PMID 40394176
PQID 3206330673
PQPubID 5642959
PageCount 10
ParticipantIDs doaj_primary_oai_doaj_org_article_f70eeefe798a4976b52a82efa6195b8e
pubmedcentral_primary_oai_pubmedcentral_nih_gov_12092720
proquest_miscellaneous_3206236354
proquest_journals_3206330673
pubmed_primary_40394176
crossref_primary_10_1038_s43856_025_00903_w
springer_journals_10_1038_s43856_025_00903_w
PublicationCentury 2000
PublicationDate 2025-05-20
PublicationDateYYYYMMDD 2025-05-20
PublicationDate_xml – month: 05
  year: 2025
  text: 2025-05-20
  day: 20
PublicationDecade 2020
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle Communications medicine
PublicationTitleAbbrev Commun Med
PublicationTitleAlternate Commun Med (Lond)
PublicationYear 2025
Publisher Nature Publishing Group UK
Springer Nature B.V
Nature Portfolio
Publisher_xml – name: Nature Publishing Group UK
– name: Springer Nature B.V
– name: Nature Portfolio
References D Carroll (903_CR1) 2018; 359
MA Takata (903_CR33) 2017; 550
G Yu (903_CR30) 2020; 69
JM Bartoszewicz (903_CR8) 2021; 3
Y Bao (903_CR16) 2008; 82
L Fu (903_CR24) 2012; 28
N Mollentze (903_CR3) 2021; 19
RC Edgar (903_CR5) 2022; 602
S Temmam (903_CR41) 2022; 604
Z Zhang (903_CR7) 2019; 66
J Hadfield (903_CR18) 2018; 34
J Kawasaki (903_CR36) 2021; 12
F Mock (903_CR9) 2020; 37
N Mollentze (903_CR13) 2023; 61
Z Ming (903_CR10) 2023; 24
903_CR17
SA Babayan (903_CR32) 2018; 362
903_CR39
903_CR38
903_CR37
M Wille (903_CR14) 2021; 19
W Li (903_CR25) 2006; 22
903_CR35
903_CR12
903_CR11
P Rice (903_CR19) 2000; 16
Y Ji (903_CR20) 2021; 37
EJ Lefkowitz (903_CR23) 2018; 46
K Katoh (903_CR26) 2013; 30
MA Martínez (903_CR34) 2016; 24
JR Brister (903_CR15) 2015; 43
C Camacho (903_CR22) 2009; 10
CJ Carlson (903_CR2) 2021; 376
AL Greninger (903_CR4) 2018; 244
GT Keusch (903_CR6) 2022; 119
BQ Minh (903_CR27) 2020; 37
S Kalyaanamoorthy (903_CR28) 2017; 14
DT Hoang (903_CR29) 2018; 35
903_CR48
903_CR47
903_CR46
903_CR45
903_CR44
903_CR21
T Mihara (903_CR31) 2016; 8
P Li (903_CR40) 2023; 19
903_CR43
903_CR42
References_xml – volume: 12
  start-page: e0163821
  year: 2021
  ident: 903_CR36
  publication-title: MBio
  doi: 10.1128/mBio.01638-21
– volume: 550
  start-page: 124
  year: 2017
  ident: 903_CR33
  publication-title: Nature
  doi: 10.1038/nature24039
– volume: 82
  start-page: 596
  year: 2008
  ident: 903_CR16
  publication-title: J. Virol.
  doi: 10.1128/JVI.02005-07
– volume: 376
  start-page: 20200358
  year: 2021
  ident: 903_CR2
  publication-title: Philos. Trans. R. Soc. Lond. B Biol. Sci.
  doi: 10.1098/rstb.2020.0358
– ident: 903_CR43
  doi: 10.1038/s41586-023-06617-0
– volume: 19
  start-page: e1011116
  year: 2023
  ident: 903_CR40
  publication-title: PLoS Pathog.
  doi: 10.1371/journal.ppat.1011116
– volume: 16
  start-page: 276
  year: 2000
  ident: 903_CR19
  publication-title: Trends Genet.
  doi: 10.1016/S0168-9525(00)02024-2
– volume: 359
  start-page: 872
  year: 2018
  ident: 903_CR1
  publication-title: Science
  doi: 10.1126/science.aap7463
– volume: 8
  start-page: 66
  year: 2016
  ident: 903_CR31
  publication-title: Viruses
  doi: 10.3390/v8030066
– volume: 61
  start-page: 101346
  year: 2023
  ident: 903_CR13
  publication-title: Curr. Opin. Virol.
  doi: 10.1016/j.coviro.2023.101346
– ident: 903_CR38
  doi: 10.3201/eid3007.240508
– volume: 46
  start-page: D708
  year: 2018
  ident: 903_CR23
  publication-title: Nucleic Acids Res.
  doi: 10.1093/nar/gkx932
– volume: 28
  start-page: 3150
  year: 2012
  ident: 903_CR24
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts565
– volume: 35
  start-page: 518
  year: 2018
  ident: 903_CR29
  publication-title: Mol. Biol. Evol.
  doi: 10.1093/molbev/msx281
– volume: 24
  year: 2023
  ident: 903_CR10
  publication-title: BMC Bioinforma.
  doi: 10.1186/s12859-023-05582-9
– ident: 903_CR39
  doi: 10.1101/2024.05.01.591751
– volume: 22
  start-page: 1658
  year: 2006
  ident: 903_CR25
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btl158
– ident: 903_CR37
  doi: 10.15585/mmwr.mm7321e1
– ident: 903_CR44
  doi: 10.1038/s41564-024-01879-4
– ident: 903_CR48
  doi: 10.5281/zenodo.11103091
– volume: 34
  start-page: 4121
  year: 2018
  ident: 903_CR18
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bty407
– volume: 604
  start-page: 330
  year: 2022
  ident: 903_CR41
  publication-title: Nature
  doi: 10.1038/s41586-022-04532-4
– volume: 244
  start-page: 218
  year: 2018
  ident: 903_CR4
  publication-title: Virus Res.
  doi: 10.1016/j.virusres.2017.10.014
– volume: 37
  start-page: 2112
  year: 2021
  ident: 903_CR20
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btab083
– volume: 24
  start-page: 134
  year: 2016
  ident: 903_CR34
  publication-title: Trends Microbiol
  doi: 10.1016/j.tim.2015.11.002
– ident: 903_CR11
  doi: 10.1109/BIBM58861.2023.10385501
– volume: 43
  start-page: D571
  year: 2015
  ident: 903_CR15
  publication-title: Nucleic Acids Res.
  doi: 10.1093/nar/gku1207
– ident: 903_CR45
  doi: 10.5281/zenodo.11102793
– volume: 30
  start-page: 772
  year: 2013
  ident: 903_CR26
  publication-title: Mol. Biol. Evol.
  doi: 10.1093/molbev/mst010
– volume: 362
  start-page: 577
  year: 2018
  ident: 903_CR32
  publication-title: Science
  doi: 10.1126/science.aap9072
– ident: 903_CR17
  doi: 10.1093/gbe/evac018
– ident: 903_CR21
  doi: 10.1093/bib/bbac204
– ident: 903_CR46
  doi: 10.5281/zenodo.11103056
– volume: 602
  start-page: 142
  year: 2022
  ident: 903_CR5
  publication-title: Nature
  doi: 10.1038/s41586-021-04332-2
– volume: 10
  year: 2009
  ident: 903_CR22
  publication-title: BMC Bioinforma.
  doi: 10.1186/1471-2105-10-421
– volume: 69
  start-page: e96
  year: 2020
  ident: 903_CR30
  publication-title: Curr. Protoc. Bioinforma.
  doi: 10.1002/cpbi.96
– volume: 19
  start-page: e3001135
  year: 2021
  ident: 903_CR14
  publication-title: PLoS Biol.
  doi: 10.1371/journal.pbio.3001135
– volume: 14
  start-page: 587
  year: 2017
  ident: 903_CR28
  publication-title: Nat. Methods
  doi: 10.1038/nmeth.4285
– volume: 66
  start-page: 2517
  year: 2019
  ident: 903_CR7
  publication-title: Transbound. Emerg. Dis.
  doi: 10.1111/tbed.13314
– volume: 37
  start-page: 318
  year: 2020
  ident: 903_CR9
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btaa705
– volume: 19
  start-page: e3001390
  year: 2021
  ident: 903_CR3
  publication-title: PLoS Biol.
  doi: 10.1371/journal.pbio.3001390
– volume: 37
  start-page: 1530
  year: 2020
  ident: 903_CR27
  publication-title: Mol. Biol. Evol.
  doi: 10.1093/molbev/msaa015
– volume: 119
  start-page: e2202871119
  year: 2022
  ident: 903_CR6
  publication-title: Proc. Natl Acad. Sci. USA
  doi: 10.1073/pnas.2202871119
– ident: 903_CR42
  doi: 10.1371/journal.pcbi.1012597
– ident: 903_CR35
– volume: 3
  start-page: lqab004
  year: 2021
  ident: 903_CR8
  publication-title: NAR Genom. Bioinform.
  doi: 10.1093/nargab/lqab004
– ident: 903_CR47
  doi: 10.5281/zenodo.11103079
– ident: 903_CR12
SSID ssj0002769480
Score 2.3056793
Snippet Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity....
Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the...
BackgroundMachine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity....
To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might...
Abstract Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human...
SourceID doaj
pubmedcentral
proquest
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Publisher
StartPage 187
SubjectTerms 45
631/114/2163
631/326/596/2564
Datasets
Genomes
Influenza
Large language models
Machine learning
Medicine
Medicine & Public Health
Metadata
Viruses
Zoonoses
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELZQD4gLopRHSkFG4gZWE799BES1qgQnKvVm_CyLwKmaXSr112M7ydKlRVy4xk40modnJjP-BoBXnjmV8wSHgrAeUW8lMp1XKJ-LBmNHWuMq2ucnvjihx6fs9Nqor9ITNsIDj4w7jKINIcQglDQ0-07LsJE4RJMjf2ZlKKdv9nnXkqlvtZzGFZXtdEumJfJwoESy0m_LUFv-TaDLLU9UAftvizJvNkv-UTGtjujoAbg_RZDw7Uj5LrgT0kNw9-NUI98DXxYFFSRBN49JGeAywRnVO53B4Xz5vXZuwtJXDvsIr_o-9flz8OfyYj3kF0o3_Bn8URstA5wmS-QHZWzO8AicHH34_H6BpjkKyFFFV0go4YX0tusilz4nGKSUz5wP1HEZjaFRmBg57niOtDvJPfXUyGg9DkR5gsljsJP6FJ4C6KQKkplgaRcox222dtFZhVmk1lDTNuD1zFN9PsJl6FrmJlKPEtBZArpKQF824F1h-2ZngbquD7IC6EkB9L8UoAEHs9D0ZH-DJjiHXiUbIg14uVnOllPKISaFfj3uwSQHXLQBT0YZbyihLVG0E7wBckv6W6Rur6Tl14rOXS4jl-J2A97MivKbrr_zYv9_8OIZuIerhrN8-h2AndXFOjzPQdPKvqj28QsbzRX8
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Health & Medical Collection
  dbid: 7X7
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1Lj9MwEB7BIiEuiDdZFmQkbmBtYjuxfUKAWFVIcGKl3oyfpRIkpWlZaX89tut0VV7X2Ikcz4w9nvn8DcAL11oZzwkWe24cZs4IrBsncVwXNSGW1tpmts9P3eycfZi38xJwGwuscloT80LtBpti5KeUxM00-bf09eoHTlWjUna1lNC4DjcSdVmCdPE538dYCO8kE3W5K1NTcToyKtqEum1xnSIU-OJgP8q0_X_zNf-ETP6WN83b0dkduF38SPRmJ_i7cM339-Dmx5Ipvw9fZokbpEd2KpYyomWPJm7vfoHG1fJbxm-ihC5HQ0CXw9AP8XPo53K9HeMLCRO_QN8z3NKjUl8iPkjFc8YHcH72_vO7GS7VFLBlkm0wl9xx4UzThE64eMygKYlmnWe2E0FrFrgOoSNNF_3tRnSOOaZFMI54Kh0l9CEc9UPvHwOyQnrRam9Y41lH6mjzvDGStIEZzXRdwctpTtVqR5qhcrKbCrWTgIoSUFkC6qKCt2na9z0T4XV-MKwXqtiPCrz23gfPpdAsulCmJVoQH3Q8ALZG-ApOJqGpYoWjutKZCp7vm6P9pKSI7v2w3fUhNLpdrIJHOxnvR8JqKlnDuwrEgfQPhnrY0i-_Zo7udCU5pbgreDUpytW4_j0Xx___jSdwi2TdbePqdgJHm_XWP41O0cY8y5r_C-WjDF0
  priority: 102
  providerName: ProQuest
Title Hidden challenges in evaluating spillover risk of zoonotic viruses using machine learning models
URI https://link.springer.com/article/10.1038/s43856-025-00903-w
https://www.ncbi.nlm.nih.gov/pubmed/40394176
https://www.proquest.com/docview/3206330673
https://www.proquest.com/docview/3206236354
https://pubmed.ncbi.nlm.nih.gov/PMC12092720
https://doaj.org/article/f70eeefe798a4976b52a82efa6195b8e
Volume 5
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1bi9QwFD7sBcQX8W7XtUTwTYttkubyODvsMgy4iLowbzVpknFA22U648L-epO0HRldH3xpoUnKIScnObd8B-CNKWvp7YQ6s1ybjBotMlUYmfl9UWFck1zVEe3zks2u6HxRLg4Aj3dhYtJ-hLSM2_SYHfa-o0SUIV22zPLgWshuDuE4QLeHVT1l051fBXMmqciH-zE5EXcM3TuDIlT_Xfrl32mSf8RK4xF08RAeDLojmvTUPoID2zyGex-G6PgT-DoLeCANqscCKR1aNWjE826WqLtefY85myhklKPWodu2bVr_O_Rztd52fkDIg1-iHzHF0qKhpoT_EArmdE_h6uL8y3SWDRUUsppKusm45IYLo4vCMWG8aUFC4Kw2ltZMOKWo48o5hgvmdexCMEMNVcJpgy2RhmDyDI6atrEvANVCWlEqq2lhKcO5l3NeaIlLR7WiKk_g7Tin1XUPlFHFADcRVc-BynOgihyobhI4C9O-6xlAruOHdr2sBqZXjufWWme5FIp6tUmXWAlsnfJGX6mFTeB0ZFo1SF5XEeyVrmAHkQRe75q9zIRAiGpsu-37YOJVLZrA857HO0poTiQtOEtA7HF_j9T9lmb1LeJyh2vIIaydwLtxofym699zcfJ_3V_CfRzXcul3uFM42qy39pVXjDY6hUO-4P4ppkUKx5PJ_PPcv8_OLz9-SqOUpNHlkEaf1i-mkhGW
linkProvider Springer Nature
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwEB5VWwm4IN6kFDASnCBqYjuxc0CIQqstbVcItVJvrh3by0o0WTa7rOBH8RuxnWSr5XXrNXYiZ17-xjOeAXius7JwfkIZG6Z0TLXisUx1ETu7KDEuSSLLUO1zlA9P6Yez7GwDfvZ3YXxaZW8Tg6HWdenPyHcIdpupx7fkzfRr7LtG-ehq30KjFYtD833pXLbm9cF7x98XGO_vnbwbxl1XgbikBZ3HrGCaca3S1OZcO7hNfDCp1IaWObdSUsuktTlOc4c7U55rqqnkVmlsSKGJL3TgTP4mJc6VGcDm7t7o46fVqQ5meUF50t3OSQjfaSjhmc_zzeLEn4nEy7UdMDQK-Bu6_TNJ87dIbdgA92_BzQ65oretqN2GDVPdgWvHXWz-LpwPfTWSCpV9e5YGTSrUVxOvxqiZTr6EjFHk89lRbdGPuq5q9zn0bTJbNO4Fn4U_RhchwdOgrqOFe-Db9TT34PRKKH0fBlVdmYeASl4YnkmjaGpojhNnZViqCpxZqiSVSQQve5qKaVumQ4TwOuGi5YBwHBCBA2IZwa4n-2qmL7EdHtSzseg0VliWGGOsYQWX1IE2lWHJsbHSuZyZ4iaC7Z5potP7RlxKaQTPVsNOY30YRlamXrRzMHFAj0bwoOXxaiU0IQVNWR4BX-P-2lLXR6rJ51AV3F-C9kH1CF71gnK5rn_TYuv_v_EUrg9Pjo_E0cHo8BHcwEGOM2dbt2Ewny3MYwfJ5upJpwcIzq9a9X4BFrlKqA
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwEB5VRaq4IN4EChgJTmBtYjuxc0AIKKsthYoDlfbm2rG9rATJ0uyygp_Gr8N2kq2W163X2Ikcz8OfPZ9nAB6bvCr9PqHClmuDmdECq8yU2PtFRUhFU1XFbJ_HxeSEvZ3m0x34OdyFCbTKwSdGR22aKpyRjyjxi2nAt3TkelrEh4Pxi8VXHCpIhUjrUE6jU5Ej-33tt2_t88MDL-snhIzffHw9wX2FAVyxki0xL7nhwugsc4UwHnrTEFiqjGVVIZxSzHHlXEGywmPQTBSGGaaE04ZYWhoakh5493-J0zwLNsanfHO-Q3hRMpH293RSKkYtoyIPjN8cp-F0BK-31sJYMuBvOPdPuuZvMdu4FI6vwpUew6KXndJdgx1bX4e9932U_gacTkJekhpVQ6GWFs1rNOQVr2eoXcw_R-4oCsx21Dj0o2nqxn8OfZufrVr_QuDjz9CXSPW0qK9t4R-Ewj3tTTi5kHm-Bbt1U9s7gCpRWpErq1lmWUFS7294pkuSO6YVU2kCT4c5lYsuYYeMgXYqZCcB6SUgowTkOoFXYdo3PUOy7figOZvJ3nal46m11lleCsU8fNM5UYJYp_zmM9fCJrA_CE32HqCV5_qawKNNs7fdEJBRtW1WXR9CPeRjCdzuZLwZCUtpyTJeJCC2pL811O2Wev4p5gcP16FDeD2BZ4OinI_r33Nx9_-_8RD2vMHJd4fHR_fgMolqnHsnuw-7y7OVve-x2VI_iEaA4PSire4XwjJNeA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Hidden+challenges+in+evaluating+spillover+risk+of+zoonotic+viruses+using+machine+learning+models&rft.jtitle=Communications+medicine&rft.au=Kawasaki%2C+Junna&rft.au=Suzuki%2C+Tadaki&rft.au=Hamada%2C+Michiaki&rft.date=2025-05-20&rft.pub=Nature+Publishing+Group+UK&rft.eissn=2730-664X&rft.volume=5&rft_id=info:doi/10.1038%2Fs43856-025-00903-w&rft_id=info%3Apmid%2F40394176&rft.externalDocID=PMC12092720
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2730-664X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2730-664X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2730-664X&client=summon