Hidden challenges in evaluating spillover risk of zoonotic viruses using machine learning models
Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. Methods I...
Saved in:
Published in | Communications medicine Vol. 5; no. 1; pp. 187 - 10 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
London
Nature Publishing Group UK
20.05.2025
Springer Nature B.V Nature Portfolio |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Background
Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses.
Methods
In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences.
Results
Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2.
Conclusions
Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.
Plain language summary
To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness.
Kawasaki et al. construct a dataset covering 26 viral families and use large language models pre-trained on nucleotide sequences to identify zoonotic viruses with human infectivity potential. High predictive performance was obtained, even with partial viral sequences, but not all zoonotic lineages could be identified. |
---|---|
AbstractList | Abstract Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. Methods In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences. Results Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2. Conclusions Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats. Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences. Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model's applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2. Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats. BackgroundMachine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses.MethodsIn this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences.ResultsHere we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2.ConclusionsOur study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.Plain language summaryTo prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness. To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness. Kawasaki et al. construct a dataset covering 26 viral families and use large language models pre-trained on nucleotide sequences to identify zoonotic viruses with human infectivity potential. High predictive performance was obtained, even with partial viral sequences, but not all zoonotic lineages could be identified. Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses.BACKGROUNDMachine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses.In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences.METHODSIn this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences.Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model's applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2.RESULTSHere we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model's applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2.Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats.CONCLUSIONSOur study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats. Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the lack of comprehensive datasets for viral infectivity poses a major challenge, limiting the predictable range of viruses. Methods In this study, we address this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing the BERT-infect model, which leverages large language models pre-trained on extensive nucleotide sequences. Results Here we show that our approach substantially boosts model performance. This enhancement is particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Our model also exhibits high predictive performance even with partial viral sequences, such as high-throughput sequencing reads or contig sequences from de novo sequence assemblies, indicating the model’s applicability for mining zoonotic viruses from virus metagenomic data. Furthermore, models trained on data up to 2018 demonstrate robust predictive capability for most viruses identified post-2018. Nonetheless, high-resolution evaluation based on phylogenetic analysis reveals general limitations in current machine learning models: the difficulty in alerting the human infectious risk in specific zoonotic viral lineages, including SARS-CoV-2. Conclusions Our study provides a comprehensive benchmark for viral infectivity prediction models and highlights unresolved issues in fully exploiting machine learning to prepare for future zoonotic threats. Plain language summary To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might infect humans. We constructed extensive datasets covering information about different viruses, including key human pathogens. We developed computational models using these datasets, which outperformed existing approaches across many virus types. However, we also revealed that current models share the same unresolved challenges when assessing whether specific viruses will infect humans, including SARS-CoV-2. These findings suggest that current models may fail to identify animal viruses that can infect humans, which underscores the urgent need for improved predictive models to strengthen pandemic preparedness. Kawasaki et al. construct a dataset covering 26 viral families and use large language models pre-trained on nucleotide sequences to identify zoonotic viruses with human infectivity potential. High predictive performance was obtained, even with partial viral sequences, but not all zoonotic lineages could be identified. |
ArticleNumber | 187 |
Author | Kawasaki, Junna Suzuki, Tadaki Hamada, Michiaki |
Author_xml | – sequence: 1 givenname: Junna orcidid: 0000-0002-6609-5300 surname: Kawasaki fullname: Kawasaki, Junna email: jrt13mpmuk@gmail.com organization: Faculty of Science and Engineering, Waseda University, Department of Infectious Disease Pathobiology, Graduate School of Medicine, Chiba University – sequence: 2 givenname: Tadaki orcidid: 0000-0002-3820-9542 surname: Suzuki fullname: Suzuki, Tadaki organization: Department of Infectious Disease Pathobiology, Graduate School of Medicine, Chiba University, Department of Infectious Disease Pathology, National Institute of Infectious Diseases, Japan Institute for Health Security – sequence: 3 givenname: Michiaki orcidid: 0000-0001-9466-1034 surname: Hamada fullname: Hamada, Michiaki email: mhamada@waseda.jp organization: Faculty of Science and Engineering, Waseda University, Cellular and Molecular Biotechnology Research Institute (CMB), National Institute of Advanced Industrial Science and Technology (AIST), Graduate School of Medicine, Nippon Medical School |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/40394176$$D View this record in MEDLINE/PubMed |
BookMark | eNp9kktv1DAUhSNUREvpH2CBIrFhE_ArfqwQqoBWqsQGJHbGia9nPHjswU6mgl-POymlZcHK1vV3zr3yPU-bo5giNM1zjF5jROWbwqjseYdI3yGkEO2uHzUnRFDUcc6-Ht27HzdnpWwQQkRwxSR60hwzRBXDgp803y68tRDbcW1CgLiC0vrYwt6E2Uw-rtqy8yGkPeQ2-_K9Ta79lVJMkx_bvc9zqYK53IBbM659hDaAyfFQSBZCedY8diYUOLs9T5svH95_Pr_orj59vDx_d9WNTLGpE0pYIe2AsePS9oJTTCQeLbCRS2cMc8I4xwnmgigsuWWWGekGS4AqSwk9bS4XX5vMRu-y35r8Uyfj9aGQ8kqbXKcOoJ1AAOBAKGmYEnzoiZEEnOFY9YOE6vV28drNwxbsCHHKJjwwffgS_Vqv0l5jghQRBFWHV7cOOf2YoUx668sIIZgIaS6aEsQJ5bRnFX35D7pJc471rw4UpYgLWqkX90e6m-XPJitAFmDMqZQM7g7BSN8kRi-J0TUx-pAYfV1FdBGVCtfl57-9_6P6DU9RxT0 |
Cites_doi | 10.1128/mBio.01638-21 10.1038/nature24039 10.1128/JVI.02005-07 10.1098/rstb.2020.0358 10.1038/s41586-023-06617-0 10.1371/journal.ppat.1011116 10.1016/S0168-9525(00)02024-2 10.1126/science.aap7463 10.3390/v8030066 10.1016/j.coviro.2023.101346 10.3201/eid3007.240508 10.1093/nar/gkx932 10.1093/bioinformatics/bts565 10.1093/molbev/msx281 10.1186/s12859-023-05582-9 10.1101/2024.05.01.591751 10.1093/bioinformatics/btl158 10.15585/mmwr.mm7321e1 10.1038/s41564-024-01879-4 10.5281/zenodo.11103091 10.1093/bioinformatics/bty407 10.1038/s41586-022-04532-4 10.1016/j.virusres.2017.10.014 10.1093/bioinformatics/btab083 10.1016/j.tim.2015.11.002 10.1109/BIBM58861.2023.10385501 10.1093/nar/gku1207 10.5281/zenodo.11102793 10.1093/molbev/mst010 10.1126/science.aap9072 10.1093/gbe/evac018 10.1093/bib/bbac204 10.5281/zenodo.11103056 10.1038/s41586-021-04332-2 10.1186/1471-2105-10-421 10.1002/cpbi.96 10.1371/journal.pbio.3001135 10.1038/nmeth.4285 10.1111/tbed.13314 10.1093/bioinformatics/btaa705 10.1371/journal.pbio.3001390 10.1093/molbev/msaa015 10.1073/pnas.2202871119 10.1371/journal.pcbi.1012597 10.1093/nargab/lqab004 10.5281/zenodo.11103079 |
ContentType | Journal Article |
Copyright | The Author(s) 2025 2025. The Author(s). Copyright Springer Nature B.V. Dec 2025 The Author(s) 2025 2025 |
Copyright_xml | – notice: The Author(s) 2025 – notice: 2025. The Author(s). – notice: Copyright Springer Nature B.V. Dec 2025 – notice: The Author(s) 2025 2025 |
DBID | C6C AAYXX CITATION NPM 3V. 7X7 7XB 88C 88E 8C1 8FI 8FJ 8FK ABUWG AFKRA AZQEC BENPR CCPQU COVID DWQXO FYUFA GHDGH K9. M0S M0T M1P PHGZM PHGZT PIMPY PJZUB PKEHL PPXIY PQEST PQQKQ PQUKI PRINS 7X8 5PM DOA |
DOI | 10.1038/s43856-025-00903-w |
DatabaseName | Springer Nature OA Free Journals CrossRef PubMed ProQuest Central (Corporate) ProQuest Health & Medical Collection ProQuest Central (purchase pre-March 2016) Healthcare Administration Database (Alumni) Medical Database (Alumni Edition) Public Health Database ProQuest Hospital Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials ProQuest Central ProQuest One Coronavirus Research Database ProQuest Central Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Health & Medical Complete (Alumni) ProQuest Health & Medical Collection Healthcare Administration Database Medical Database ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China MEDLINE - Academic PubMed Central (Full Participant titles) DOAJ Directory of Open Access Journals |
DatabaseTitle | CrossRef PubMed Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Central China ProQuest Central ProQuest Health & Medical Research Collection Health Research Premium Collection Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Health & Medical Research Collection ProQuest Central (New) ProQuest Medical Library (Alumni) ProQuest Public Health ProQuest One Academic Eastern Edition ProQuest Health Management Coronavirus Research Database ProQuest Hospital Collection Health Research Premium Collection (Alumni) ProQuest Hospital Collection (Alumni) ProQuest Health & Medical Complete ProQuest Medical Library ProQuest One Academic UKI Edition ProQuest Health Management (Alumni Edition) ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) MEDLINE - Academic |
DatabaseTitleList | PubMed Publicly Available Content Database MEDLINE - Academic |
Database_xml | – sequence: 1 dbid: C6C name: Springer Nature OA Free Journals url: http://www.springeropen.com/ sourceTypes: Publisher – sequence: 2 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 3 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 4 dbid: BENPR name: ProQuest Central url: https://www.proquest.com/central sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine |
EISSN | 2730-664X |
EndPage | 10 |
ExternalDocumentID | oai_doaj_org_article_f70eeefe798a4976b52a82efa6195b8e PMC12092720 40394176 10_1038_s43856_025_00903_w |
Genre | Journal Article |
GrantInformation_xml | – fundername: MEXT | Japan Society for the Promotion of Science (JSPS) grantid: JP22KJ2901 funderid: https://doi.org/10.13039/501100001691 – fundername: MEXT | JST | Precursory Research for Embryonic Science and Technology (PRESTO) grantid: JPMJPR23R4 funderid: https://doi.org/10.13039/501100009023 – fundername: MEXT | JST | Precursory Research for Embryonic Science and Technology (PRESTO) grantid: JPMJPR23R4 – fundername: MEXT | Japan Society for the Promotion of Science (JSPS) grantid: JP22KJ2901 |
GroupedDBID | 0R~ 53G 7X7 88E 8C1 8FI 8FJ AAJSJ AASML ABUWG ACLNF AFKRA ALIPV ALMA_UNASSIGNED_HOLDINGS BENPR C6C CCPQU EBLON FYUFA GROUPED_DOAJ HMCUK M0T M1P M~E NAO OK1 PGMZT PHGZM PHGZT PIMPY PSQYO RPM SNYQT UKHRP AAYXX CITATION NPM PJZUB PPXIY 3V. 7XB 8FK AARCD AZQEC COVID DWQXO K9. PKEHL PQEST PQQKQ PQUKI PRINS 7X8 5PM PUEGO |
ID | FETCH-LOGICAL-c494t-797d78db11f68d57631281cde4c68faa4f7aff6216729186d4d4a8fbd2e39d323 |
IEDL.DBID | C6C |
ISSN | 2730-664X |
IngestDate | Wed Aug 27 01:11:33 EDT 2025 Thu Aug 21 18:30:44 EDT 2025 Fri Jul 11 17:23:57 EDT 2025 Wed Aug 13 11:26:42 EDT 2025 Mon Jul 21 06:07:07 EDT 2025 Sun Jul 06 05:05:44 EDT 2025 Wed May 21 12:01:58 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Language | English |
License | 2025. The Author(s). Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/. |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c494t-797d78db11f68d57631281cde4c68faa4f7aff6216729186d4d4a8fbd2e39d323 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
ORCID | 0000-0002-6609-5300 0000-0002-3820-9542 0000-0001-9466-1034 |
OpenAccessLink | https://www.nature.com/articles/s43856-025-00903-w |
PMID | 40394176 |
PQID | 3206330673 |
PQPubID | 5642959 |
PageCount | 10 |
ParticipantIDs | doaj_primary_oai_doaj_org_article_f70eeefe798a4976b52a82efa6195b8e pubmedcentral_primary_oai_pubmedcentral_nih_gov_12092720 proquest_miscellaneous_3206236354 proquest_journals_3206330673 pubmed_primary_40394176 crossref_primary_10_1038_s43856_025_00903_w springer_journals_10_1038_s43856_025_00903_w |
PublicationCentury | 2000 |
PublicationDate | 2025-05-20 |
PublicationDateYYYYMMDD | 2025-05-20 |
PublicationDate_xml | – month: 05 year: 2025 text: 2025-05-20 day: 20 |
PublicationDecade | 2020 |
PublicationPlace | London |
PublicationPlace_xml | – name: London – name: England |
PublicationTitle | Communications medicine |
PublicationTitleAbbrev | Commun Med |
PublicationTitleAlternate | Commun Med (Lond) |
PublicationYear | 2025 |
Publisher | Nature Publishing Group UK Springer Nature B.V Nature Portfolio |
Publisher_xml | – name: Nature Publishing Group UK – name: Springer Nature B.V – name: Nature Portfolio |
References | D Carroll (903_CR1) 2018; 359 MA Takata (903_CR33) 2017; 550 G Yu (903_CR30) 2020; 69 JM Bartoszewicz (903_CR8) 2021; 3 Y Bao (903_CR16) 2008; 82 L Fu (903_CR24) 2012; 28 N Mollentze (903_CR3) 2021; 19 RC Edgar (903_CR5) 2022; 602 S Temmam (903_CR41) 2022; 604 Z Zhang (903_CR7) 2019; 66 J Hadfield (903_CR18) 2018; 34 J Kawasaki (903_CR36) 2021; 12 F Mock (903_CR9) 2020; 37 N Mollentze (903_CR13) 2023; 61 Z Ming (903_CR10) 2023; 24 903_CR17 SA Babayan (903_CR32) 2018; 362 903_CR39 903_CR38 903_CR37 M Wille (903_CR14) 2021; 19 W Li (903_CR25) 2006; 22 903_CR35 903_CR12 903_CR11 P Rice (903_CR19) 2000; 16 Y Ji (903_CR20) 2021; 37 EJ Lefkowitz (903_CR23) 2018; 46 K Katoh (903_CR26) 2013; 30 MA Martínez (903_CR34) 2016; 24 JR Brister (903_CR15) 2015; 43 C Camacho (903_CR22) 2009; 10 CJ Carlson (903_CR2) 2021; 376 AL Greninger (903_CR4) 2018; 244 GT Keusch (903_CR6) 2022; 119 BQ Minh (903_CR27) 2020; 37 S Kalyaanamoorthy (903_CR28) 2017; 14 DT Hoang (903_CR29) 2018; 35 903_CR48 903_CR47 903_CR46 903_CR45 903_CR44 903_CR21 T Mihara (903_CR31) 2016; 8 P Li (903_CR40) 2023; 19 903_CR43 903_CR42 |
References_xml | – volume: 12 start-page: e0163821 year: 2021 ident: 903_CR36 publication-title: MBio doi: 10.1128/mBio.01638-21 – volume: 550 start-page: 124 year: 2017 ident: 903_CR33 publication-title: Nature doi: 10.1038/nature24039 – volume: 82 start-page: 596 year: 2008 ident: 903_CR16 publication-title: J. Virol. doi: 10.1128/JVI.02005-07 – volume: 376 start-page: 20200358 year: 2021 ident: 903_CR2 publication-title: Philos. Trans. R. Soc. Lond. B Biol. Sci. doi: 10.1098/rstb.2020.0358 – ident: 903_CR43 doi: 10.1038/s41586-023-06617-0 – volume: 19 start-page: e1011116 year: 2023 ident: 903_CR40 publication-title: PLoS Pathog. doi: 10.1371/journal.ppat.1011116 – volume: 16 start-page: 276 year: 2000 ident: 903_CR19 publication-title: Trends Genet. doi: 10.1016/S0168-9525(00)02024-2 – volume: 359 start-page: 872 year: 2018 ident: 903_CR1 publication-title: Science doi: 10.1126/science.aap7463 – volume: 8 start-page: 66 year: 2016 ident: 903_CR31 publication-title: Viruses doi: 10.3390/v8030066 – volume: 61 start-page: 101346 year: 2023 ident: 903_CR13 publication-title: Curr. Opin. Virol. doi: 10.1016/j.coviro.2023.101346 – ident: 903_CR38 doi: 10.3201/eid3007.240508 – volume: 46 start-page: D708 year: 2018 ident: 903_CR23 publication-title: Nucleic Acids Res. doi: 10.1093/nar/gkx932 – volume: 28 start-page: 3150 year: 2012 ident: 903_CR24 publication-title: Bioinformatics doi: 10.1093/bioinformatics/bts565 – volume: 35 start-page: 518 year: 2018 ident: 903_CR29 publication-title: Mol. Biol. Evol. doi: 10.1093/molbev/msx281 – volume: 24 year: 2023 ident: 903_CR10 publication-title: BMC Bioinforma. doi: 10.1186/s12859-023-05582-9 – ident: 903_CR39 doi: 10.1101/2024.05.01.591751 – volume: 22 start-page: 1658 year: 2006 ident: 903_CR25 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btl158 – ident: 903_CR37 doi: 10.15585/mmwr.mm7321e1 – ident: 903_CR44 doi: 10.1038/s41564-024-01879-4 – ident: 903_CR48 doi: 10.5281/zenodo.11103091 – volume: 34 start-page: 4121 year: 2018 ident: 903_CR18 publication-title: Bioinformatics doi: 10.1093/bioinformatics/bty407 – volume: 604 start-page: 330 year: 2022 ident: 903_CR41 publication-title: Nature doi: 10.1038/s41586-022-04532-4 – volume: 244 start-page: 218 year: 2018 ident: 903_CR4 publication-title: Virus Res. doi: 10.1016/j.virusres.2017.10.014 – volume: 37 start-page: 2112 year: 2021 ident: 903_CR20 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btab083 – volume: 24 start-page: 134 year: 2016 ident: 903_CR34 publication-title: Trends Microbiol doi: 10.1016/j.tim.2015.11.002 – ident: 903_CR11 doi: 10.1109/BIBM58861.2023.10385501 – volume: 43 start-page: D571 year: 2015 ident: 903_CR15 publication-title: Nucleic Acids Res. doi: 10.1093/nar/gku1207 – ident: 903_CR45 doi: 10.5281/zenodo.11102793 – volume: 30 start-page: 772 year: 2013 ident: 903_CR26 publication-title: Mol. Biol. Evol. doi: 10.1093/molbev/mst010 – volume: 362 start-page: 577 year: 2018 ident: 903_CR32 publication-title: Science doi: 10.1126/science.aap9072 – ident: 903_CR17 doi: 10.1093/gbe/evac018 – ident: 903_CR21 doi: 10.1093/bib/bbac204 – ident: 903_CR46 doi: 10.5281/zenodo.11103056 – volume: 602 start-page: 142 year: 2022 ident: 903_CR5 publication-title: Nature doi: 10.1038/s41586-021-04332-2 – volume: 10 year: 2009 ident: 903_CR22 publication-title: BMC Bioinforma. doi: 10.1186/1471-2105-10-421 – volume: 69 start-page: e96 year: 2020 ident: 903_CR30 publication-title: Curr. Protoc. Bioinforma. doi: 10.1002/cpbi.96 – volume: 19 start-page: e3001135 year: 2021 ident: 903_CR14 publication-title: PLoS Biol. doi: 10.1371/journal.pbio.3001135 – volume: 14 start-page: 587 year: 2017 ident: 903_CR28 publication-title: Nat. Methods doi: 10.1038/nmeth.4285 – volume: 66 start-page: 2517 year: 2019 ident: 903_CR7 publication-title: Transbound. Emerg. Dis. doi: 10.1111/tbed.13314 – volume: 37 start-page: 318 year: 2020 ident: 903_CR9 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btaa705 – volume: 19 start-page: e3001390 year: 2021 ident: 903_CR3 publication-title: PLoS Biol. doi: 10.1371/journal.pbio.3001390 – volume: 37 start-page: 1530 year: 2020 ident: 903_CR27 publication-title: Mol. Biol. Evol. doi: 10.1093/molbev/msaa015 – volume: 119 start-page: e2202871119 year: 2022 ident: 903_CR6 publication-title: Proc. Natl Acad. Sci. USA doi: 10.1073/pnas.2202871119 – ident: 903_CR42 doi: 10.1371/journal.pcbi.1012597 – ident: 903_CR35 – volume: 3 start-page: lqab004 year: 2021 ident: 903_CR8 publication-title: NAR Genom. Bioinform. doi: 10.1093/nargab/lqab004 – ident: 903_CR47 doi: 10.5281/zenodo.11103079 – ident: 903_CR12 |
SSID | ssj0002769480 |
Score | 2.3056793 |
Snippet | Background
Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity.... Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity. However, the... BackgroundMachine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human infectivity.... To prepare for future pandemics caused by animal-derived viruses, there is a growing need for computational models that can predict whether a virus might... Abstract Background Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their potential for human... |
SourceID | doaj pubmedcentral proquest pubmed crossref springer |
SourceType | Open Website Open Access Repository Aggregation Database Index Database Publisher |
StartPage | 187 |
SubjectTerms | 45 631/114/2163 631/326/596/2564 Datasets Genomes Influenza Large language models Machine learning Medicine Medicine & Public Health Metadata Viruses Zoonoses |
SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELZQD4gLopRHSkFG4gZWE799BES1qgQnKvVm_CyLwKmaXSr112M7ydKlRVy4xk40modnJjP-BoBXnjmV8wSHgrAeUW8lMp1XKJ-LBmNHWuMq2ucnvjihx6fs9Nqor9ITNsIDj4w7jKINIcQglDQ0-07LsJE4RJMjf2ZlKKdv9nnXkqlvtZzGFZXtdEumJfJwoESy0m_LUFv-TaDLLU9UAftvizJvNkv-UTGtjujoAbg_RZDw7Uj5LrgT0kNw9-NUI98DXxYFFSRBN49JGeAywRnVO53B4Xz5vXZuwtJXDvsIr_o-9flz8OfyYj3kF0o3_Bn8URstA5wmS-QHZWzO8AicHH34_H6BpjkKyFFFV0go4YX0tusilz4nGKSUz5wP1HEZjaFRmBg57niOtDvJPfXUyGg9DkR5gsljsJP6FJ4C6KQKkplgaRcox222dtFZhVmk1lDTNuD1zFN9PsJl6FrmJlKPEtBZArpKQF824F1h-2ZngbquD7IC6EkB9L8UoAEHs9D0ZH-DJjiHXiUbIg14uVnOllPKISaFfj3uwSQHXLQBT0YZbyihLVG0E7wBckv6W6Rur6Tl14rOXS4jl-J2A97MivKbrr_zYv9_8OIZuIerhrN8-h2AndXFOjzPQdPKvqj28QsbzRX8 priority: 102 providerName: Directory of Open Access Journals – databaseName: ProQuest Health & Medical Collection dbid: 7X7 link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1Lj9MwEB7BIiEuiDdZFmQkbmBtYjuxfUKAWFVIcGKl3oyfpRIkpWlZaX89tut0VV7X2Ikcz4w9nvn8DcAL11oZzwkWe24cZs4IrBsncVwXNSGW1tpmts9P3eycfZi38xJwGwuscloT80LtBpti5KeUxM00-bf09eoHTlWjUna1lNC4DjcSdVmCdPE538dYCO8kE3W5K1NTcToyKtqEum1xnSIU-OJgP8q0_X_zNf-ETP6WN83b0dkduF38SPRmJ_i7cM339-Dmx5Ipvw9fZokbpEd2KpYyomWPJm7vfoHG1fJbxm-ihC5HQ0CXw9AP8XPo53K9HeMLCRO_QN8z3NKjUl8iPkjFc8YHcH72_vO7GS7VFLBlkm0wl9xx4UzThE64eMygKYlmnWe2E0FrFrgOoSNNF_3tRnSOOaZFMI54Kh0l9CEc9UPvHwOyQnrRam9Y41lH6mjzvDGStIEZzXRdwctpTtVqR5qhcrKbCrWTgIoSUFkC6qKCt2na9z0T4XV-MKwXqtiPCrz23gfPpdAsulCmJVoQH3Q8ALZG-ApOJqGpYoWjutKZCp7vm6P9pKSI7v2w3fUhNLpdrIJHOxnvR8JqKlnDuwrEgfQPhnrY0i-_Zo7udCU5pbgreDUpytW4_j0Xx___jSdwi2TdbePqdgJHm_XWP41O0cY8y5r_C-WjDF0 priority: 102 providerName: ProQuest |
Title | Hidden challenges in evaluating spillover risk of zoonotic viruses using machine learning models |
URI | https://link.springer.com/article/10.1038/s43856-025-00903-w https://www.ncbi.nlm.nih.gov/pubmed/40394176 https://www.proquest.com/docview/3206330673 https://www.proquest.com/docview/3206236354 https://pubmed.ncbi.nlm.nih.gov/PMC12092720 https://doaj.org/article/f70eeefe798a4976b52a82efa6195b8e |
Volume | 5 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1bi9QwFD7sBcQX8W7XtUTwTYttkubyODvsMgy4iLowbzVpknFA22U648L-epO0HRldH3xpoUnKIScnObd8B-CNKWvp7YQ6s1ybjBotMlUYmfl9UWFck1zVEe3zks2u6HxRLg4Aj3dhYtJ-hLSM2_SYHfa-o0SUIV22zPLgWshuDuE4QLeHVT1l051fBXMmqciH-zE5EXcM3TuDIlT_Xfrl32mSf8RK4xF08RAeDLojmvTUPoID2zyGex-G6PgT-DoLeCANqscCKR1aNWjE826WqLtefY85myhklKPWodu2bVr_O_Rztd52fkDIg1-iHzHF0qKhpoT_EArmdE_h6uL8y3SWDRUUsppKusm45IYLo4vCMWG8aUFC4Kw2ltZMOKWo48o5hgvmdexCMEMNVcJpgy2RhmDyDI6atrEvANVCWlEqq2lhKcO5l3NeaIlLR7WiKk_g7Tin1XUPlFHFADcRVc-BynOgihyobhI4C9O-6xlAruOHdr2sBqZXjufWWme5FIp6tUmXWAlsnfJGX6mFTeB0ZFo1SF5XEeyVrmAHkQRe75q9zIRAiGpsu-37YOJVLZrA857HO0poTiQtOEtA7HF_j9T9lmb1LeJyh2vIIaydwLtxofym699zcfJ_3V_CfRzXcul3uFM42qy39pVXjDY6hUO-4P4ppkUKx5PJ_PPcv8_OLz9-SqOUpNHlkEaf1i-mkhGW |
linkProvider | Springer Nature |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwEB5VWwm4IN6kFDASnCBqYjuxc0CIQqstbVcItVJvrh3by0o0WTa7rOBH8RuxnWSr5XXrNXYiZ17-xjOeAXius7JwfkIZG6Z0TLXisUx1ETu7KDEuSSLLUO1zlA9P6Yez7GwDfvZ3YXxaZW8Tg6HWdenPyHcIdpupx7fkzfRr7LtG-ehq30KjFYtD833pXLbm9cF7x98XGO_vnbwbxl1XgbikBZ3HrGCaca3S1OZcO7hNfDCp1IaWObdSUsuktTlOc4c7U55rqqnkVmlsSKGJL3TgTP4mJc6VGcDm7t7o46fVqQ5meUF50t3OSQjfaSjhmc_zzeLEn4nEy7UdMDQK-Bu6_TNJ87dIbdgA92_BzQ65oretqN2GDVPdgWvHXWz-LpwPfTWSCpV9e5YGTSrUVxOvxqiZTr6EjFHk89lRbdGPuq5q9zn0bTJbNO4Fn4U_RhchwdOgrqOFe-Db9TT34PRKKH0fBlVdmYeASl4YnkmjaGpojhNnZViqCpxZqiSVSQQve5qKaVumQ4TwOuGi5YBwHBCBA2IZwa4n-2qmL7EdHtSzseg0VliWGGOsYQWX1IE2lWHJsbHSuZyZ4iaC7Z5potP7RlxKaQTPVsNOY30YRlamXrRzMHFAj0bwoOXxaiU0IQVNWR4BX-P-2lLXR6rJ51AV3F-C9kH1CF71gnK5rn_TYuv_v_EUrg9Pjo_E0cHo8BHcwEGOM2dbt2Ewny3MYwfJ5upJpwcIzq9a9X4BFrlKqA |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwEB5VRaq4IN4EChgJTmBtYjuxc0AIKKsthYoDlfbm2rG9rATJ0uyygp_Gr8N2kq2W163X2Ikcz8OfPZ9nAB6bvCr9PqHClmuDmdECq8yU2PtFRUhFU1XFbJ_HxeSEvZ3m0x34OdyFCbTKwSdGR22aKpyRjyjxi2nAt3TkelrEh4Pxi8VXHCpIhUjrUE6jU5Ej-33tt2_t88MDL-snhIzffHw9wX2FAVyxki0xL7nhwugsc4UwHnrTEFiqjGVVIZxSzHHlXEGywmPQTBSGGaaE04ZYWhoakh5493-J0zwLNsanfHO-Q3hRMpH293RSKkYtoyIPjN8cp-F0BK-31sJYMuBvOPdPuuZvMdu4FI6vwpUew6KXndJdgx1bX4e9932U_gacTkJekhpVQ6GWFs1rNOQVr2eoXcw_R-4oCsx21Dj0o2nqxn8OfZufrVr_QuDjz9CXSPW0qK9t4R-Ewj3tTTi5kHm-Bbt1U9s7gCpRWpErq1lmWUFS7294pkuSO6YVU2kCT4c5lYsuYYeMgXYqZCcB6SUgowTkOoFXYdo3PUOy7figOZvJ3nal46m11lleCsU8fNM5UYJYp_zmM9fCJrA_CE32HqCV5_qawKNNs7fdEJBRtW1WXR9CPeRjCdzuZLwZCUtpyTJeJCC2pL811O2Wev4p5gcP16FDeD2BZ4OinI_r33Nx9_-_8RD2vMHJd4fHR_fgMolqnHsnuw-7y7OVve-x2VI_iEaA4PSire4XwjJNeA |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Hidden+challenges+in+evaluating+spillover+risk+of+zoonotic+viruses+using+machine+learning+models&rft.jtitle=Communications+medicine&rft.au=Kawasaki%2C+Junna&rft.au=Suzuki%2C+Tadaki&rft.au=Hamada%2C+Michiaki&rft.date=2025-05-20&rft.pub=Nature+Publishing+Group+UK&rft.eissn=2730-664X&rft.volume=5&rft_id=info:doi/10.1038%2Fs43856-025-00903-w&rft_id=info%3Apmid%2F40394176&rft.externalDocID=PMC12092720 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2730-664X&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2730-664X&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2730-664X&client=summon |