Hidden challenges in evaluating spillover risk of zoonotic viruses using machine learning models

Bibliographic Details
Published in	Communications medicine Vol. 5; no. 1; article 187 (10 pages)
Main Authors	Kawasaki, Junna; Suzuki, Tadaki; Hamada, Michiaki
Format	Journal Article
Language	English
Published	London: Nature Publishing Group UK; Springer Nature B.V.; Nature Portfolio, 20.05.2025
Subjects	45; 631/114/2163; 631/326/596/2564; Datasets; Genomes; Influenza; Large language models; Machine learning; Medicine; Medicine & Public Health; Metadata; Viruses; Zoonoses
Abstract	Background: Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses.
Methods: In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences.
Results: Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2.
Conclusions: Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.
Plain language summary: To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness.
Kawasaki et al. construct a dataset covering 26 viral families and use large language models pre-trained on nucleotide sequences to identify zoonotic viruses with human infectivity potential. High predictive performance was obtained, even with partial viral sequences, but not all zoonotic lineages could be identified.
ArticleNumber	187
Author	Kawasaki, Junna; Suzuki, Tadaki; Hamada, Michiaki
Author_xml	1. Kawasaki, Junna (ORCID: 0000-0002-6609-5300; email: jrt13mpmuk@gmail.com). Faculty of Science and Engineering, Waseda University; Department of Infectious Disease Pathobiology, Graduate School of Medicine, Chiba University
2. Suzuki, Tadaki (ORCID: 0000-0002-3820-9542). Department of Infectious Disease Pathobiology, Graduate School of Medicine, Chiba University; Department of Infectious Disease Pathology, National Institute of Infectious Diseases, Japan Institute for Health Security
3. Hamada, Michiaki (ORCID: 0000-0001-9466-1034; email: mhamada@waseda.jp). Faculty of Science and Engineering, Waseda University; Cellular and Molecular Biotechnology Research Institute (CMB), National Institute of Advanced Industrial Science and Technology (AIST); Graduate School of Medicine, Nippon Medical School
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/40394176$$D View this record in MEDLINE/PubMed
ContentType	Journal Article
Copyright	The Author(s) 2025. Copyright Springer Nature B.V. Dec 2025
DOI	10.1038/s43856-025-00903-w
Discipline	Medicine
EISSN	2730-664X
EndPage	10
ExternalDocumentID	oai_doaj_org_article_f70eeefe798a4976b52a82efa6195b8e PMC12092720 40394176 10_1038_s43856_025_00903_w
Genre	Journal Article
GrantInformation_xml	MEXT | Japan Society for the Promotion of Science (JSPS), grant JP22KJ2901 (funder ID: https://doi.org/10.13039/501100001691); MEXT | JST | Precursory Research for Embryonic Science and Technology (PRESTO), grant JPMJPR23R4 (funder ID: https://doi.org/10.13039/501100009023)
ISSN	2730-664X
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	1
Language	English
License	2025. The Author(s). Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
ORCID	0000-0002-6609-5300 0000-0002-3820-9542 0000-0001-9466-1034
OpenAccessLink	https://www.nature.com/articles/s43856-025-00903-w
PMID	40394176
PageCount	10
PublicationCentury	2000
PublicationDate	2025-05-20
PublicationDecade	2020
PublicationPlace	London
PublicationPlace_xml	– name: London – name: England
PublicationTitle	Communications medicine
PublicationTitleAbbrev	Commun Med
PublicationTitleAlternate	Commun Med (Lond)
PublicationYear	2025
Publisher	Nature Publishing Group UK; Springer Nature B.V.; Nature Portfolio
Snippet	Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity.... To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might...
SourceID	doaj pubmedcentral proquest pubmed crossref springer
SourceType	Open Website Open Access Repository Aggregation Database Index Database Publisher
StartPage	187
SubjectTerms	45 631/114/2163 631/326/596/2564 Datasets Genomes Influenza Large language models Machine learning Medicine Medicine & Public Health Metadata Viruses Zoonoses
Title	Hidden challenges in evaluating spillover risk of zoonotic viruses using machine learning models
URI	https://link.springer.com/article/10.1038/s43856-025-00903-w https://www.ncbi.nlm.nih.gov/pubmed/40394176 https://www.proquest.com/docview/3206330673 https://www.proquest.com/docview/3206236354 https://pubmed.ncbi.nlm.nih.gov/PMC12092720 https://doaj.org/article/f70eeefe798a4976b52a82efa6195b8e
Volume	5
linkProvider	Springer Nature