Virus classification for viral genomic fragments using PhaGCN2

Abstract Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate and comprehensive virus classification, particularly for novel viruses. Here, we present PhaGCN2, which can rapidly classify the taxon...

Full description

Saved in:
Bibliographic Details
Published inBriefings in bioinformatics Vol. 24; no. 1
Main Authors Jiang, Jing-Zhe, Yuan, Wen-Guang, Shang, Jiayu, Shi, Ying-Hui, Yang, Li-Ling, Liu, Min, Zhu, Peng, Jin, Tao, Sun, Yanni, Yuan, Li-Hong
Format Journal Article
LanguageEnglish
Published England Oxford University Press 19.01.2023
Oxford Publishing Limited (England)
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Abstract Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate and comprehensive virus classification, particularly for novel viruses. Here, we present PhaGCN2, which can rapidly classify the taxonomy of viral sequences at the family level and supports the visualization of the associations of all families. We evaluate the performance of PhaGCN2 and compare it with the state-of-the-art virus classification tools, such as vConTACT2, CAT and VPF-Class, using the widely accepted metrics. The results show that PhaGCN2 largely improves the precision and recall of virus classification, increases the number of classifiable virus sequences in the Global Ocean Virome dataset (v2.0) by four times and classifies more than 90% of the Gut Phage Database. PhaGCN2 makes it possible to conduct high-throughput and automatic expansion of the database of the International Committee on Taxonomy of Viruses. The source code is freely available at https://github.com/KennthShang/PhaGCN2.0.
AbstractList Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate and comprehensive virus classification, particularly for novel viruses. Here, we present PhaGCN2, which can rapidly classify the taxonomy of viral sequences at the family level and supports the visualization of the associations of all families. We evaluate the performance of PhaGCN2 and compare it with the state-of-the-art virus classification tools, such as vConTACT2, CAT and VPF-Class, using the widely accepted metrics. The results show that PhaGCN2 largely improves the precision and recall of virus classification, increases the number of classifiable virus sequences in the Global Ocean Virome dataset (v2.0) by four times and classifies more than 90% of the Gut Phage Database. PhaGCN2 makes it possible to conduct high-throughput and automatic expansion of the database of the International Committee on Taxonomy of Viruses. The source code is freely available at https://github.com/KennthShang/PhaGCN2.0.
Abstract Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate and comprehensive virus classification, particularly for novel viruses. Here, we present PhaGCN2, which can rapidly classify the taxonomy of viral sequences at the family level and supports the visualization of the associations of all families. We evaluate the performance of PhaGCN2 and compare it with the state-of-the-art virus classification tools, such as vConTACT2, CAT and VPF-Class, using the widely accepted metrics. The results show that PhaGCN2 largely improves the precision and recall of virus classification, increases the number of classifiable virus sequences in the Global Ocean Virome dataset (v2.0) by four times and classifies more than 90% of the Gut Phage Database. PhaGCN2 makes it possible to conduct high-throughput and automatic expansion of the database of the International Committee on Taxonomy of Viruses. The source code is freely available at https://github.com/KennthShang/PhaGCN2.0.
Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate and comprehensive virus classification, particularly for novel viruses. Here, we present PhaGCN2, which can rapidly classify the taxonomy of viral sequences at the family level and supports the visualization of the associations of all families. We evaluate the performance of PhaGCN2 and compare it with the state-of-the-art virus classification tools, such as vConTACT2, CAT and VPF-Class, using the widely accepted metrics. The results show that PhaGCN2 largely improves the precision and recall of virus classification, increases the number of classifiable virus sequences in the Global Ocean Virome dataset (v2.0) by four times and classifies more than 90% of the Gut Phage Database. PhaGCN2 makes it possible to conduct high-throughput and automatic expansion of the database of the International Committee on Taxonomy of Viruses. The source code is freely available at https://github.com/KennthShang/PhaGCN2.0.Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate and comprehensive virus classification, particularly for novel viruses. Here, we present PhaGCN2, which can rapidly classify the taxonomy of viral sequences at the family level and supports the visualization of the associations of all families. We evaluate the performance of PhaGCN2 and compare it with the state-of-the-art virus classification tools, such as vConTACT2, CAT and VPF-Class, using the widely accepted metrics. The results show that PhaGCN2 largely improves the precision and recall of virus classification, increases the number of classifiable virus sequences in the Global Ocean Virome dataset (v2.0) by four times and classifies more than 90% of the Gut Phage Database. PhaGCN2 makes it possible to conduct high-throughput and automatic expansion of the database of the International Committee on Taxonomy of Viruses. The source code is freely available at https://github.com/KennthShang/PhaGCN2.0.
Author Shang, Jiayu
Yuan, Wen-Guang
Zhu, Peng
Jin, Tao
Sun, Yanni
Jiang, Jing-Zhe
Yang, Li-Ling
Liu, Min
Yuan, Li-Hong
Shi, Ying-Hui
Author_xml – sequence: 1
  givenname: Jing-Zhe
  orcidid: 0000-0001-5260-7822
  surname: Jiang
  fullname: Jiang, Jing-Zhe
  email: jingzhejiang@gmail.com
– sequence: 2
  givenname: Wen-Guang
  orcidid: 0000-0002-0191-642X
  surname: Yuan
  fullname: Yuan, Wen-Guang
  email: 1187670718@qq.com
– sequence: 3
  givenname: Jiayu
  orcidid: 0000-0001-5974-4985
  surname: Shang
  fullname: Shang, Jiayu
  email: jyshang2-c@my.cityu.edu.hk
– sequence: 4
  givenname: Ying-Hui
  surname: Shi
  fullname: Shi, Ying-Hui
  email: 1398294360@qq.com
– sequence: 5
  givenname: Li-Ling
  surname: Yang
  fullname: Yang, Li-Ling
  email: 2534118522@qq.com
– sequence: 6
  givenname: Min
  surname: Liu
  fullname: Liu, Min
  email: 1099535669@qq.com
– sequence: 7
  givenname: Peng
  surname: Zhu
  fullname: Zhu, Peng
  email: 1142093155@qq.com
– sequence: 8
  givenname: Tao
  surname: Jin
  fullname: Jin, Tao
  email: jingzhejiang@gmail.com
– sequence: 9
  givenname: Yanni
  orcidid: 0000-0003-1373-8023
  surname: Sun
  fullname: Sun, Yanni
  email: yannisun@cityu.edu.hk
– sequence: 10
  givenname: Li-Hong
  surname: Yuan
  fullname: Yuan, Li-Hong
  email: ylh@gdpu.edu.cn
BackLink https://www.ncbi.nlm.nih.gov/pubmed/36464489$$D View this record in MEDLINE/PubMed
BookMark eNp90E1LxDAQgOEgirqrJ-9SEESQaj4mSXsRZPELFvUgXkOaJmukbdakFfz3Vne9iHjKHJ4ZwjtBm13oLEIHBJ8RXLLzylfnVaUNx3wD7RKQMgfMYfNrFjLnINgOmqT0ijHFsiDbaIcJEABFuYsunn0cUmYanZJ33ujehy5zIWbvPuomW9gutN5kLupFa7s-ZUPy3SJ7fNE3s3u6h7acbpLdX79T9HR99TS7zecPN3ezy3luGBR9TkpnKaWsYpYRoKzgJZiCGFmA47RmNXfOggFaCi7BAuOi1laKEuuiljWbopPV2WUMb4NNvWp9MrZpdGfDkBSVIDHmAshIj37R1zDEbvycYoQwEBSXMKrDtRqq1tZqGX2r44f6CTMCsgImhpSidcr4_jtOH7VvFMHqK74a46t1_HHn9NfOz9m_9fFKh2H5L_wEEgCRMA
CitedBy_id crossref_primary_10_1038_s41467_023_42125_5
crossref_primary_10_1016_j_fm_2025_104733
crossref_primary_10_1016_j_scitotenv_2024_174531
crossref_primary_10_1093_nar_gkad977
crossref_primary_10_1016_j_envint_2023_108055
crossref_primary_10_1016_j_biortech_2024_131839
crossref_primary_10_1016_j_envint_2025_109363
crossref_primary_10_1038_s41396_023_01414_z
crossref_primary_10_1038_s41467_024_52450_y
crossref_primary_10_1038_s41564_023_01598_2
crossref_primary_10_1128_aem_00296_25
crossref_primary_10_1186_s40168_024_01853_6
crossref_primary_10_1128_aem_00850_24
crossref_primary_10_1007_s00705_024_05986_9
crossref_primary_10_1016_j_envres_2024_119070
crossref_primary_10_1016_j_jare_2024_06_022
crossref_primary_10_1038_s41467_025_57500_7
crossref_primary_10_1093_bib_bbad408
crossref_primary_10_3389_fmolb_2023_1305506
crossref_primary_10_1038_s41467_024_53454_4
crossref_primary_10_1016_j_cej_2025_161877
crossref_primary_10_1038_s41467_024_51101_6
crossref_primary_10_3390_microorganisms12081736
crossref_primary_10_4103_1673_5374_382223
crossref_primary_10_1038_s41467_024_53317_y
crossref_primary_10_1128_mbio_03009_22
crossref_primary_10_1016_j_cej_2024_152448
crossref_primary_10_1186_s12866_025_03854_3
crossref_primary_10_1186_s13059_024_03236_4
crossref_primary_10_1016_j_virol_2024_110015
crossref_primary_10_1038_s41396_023_01404_1
crossref_primary_10_1186_s40793_024_00549_6
crossref_primary_10_1007_s10482_023_01912_2
crossref_primary_10_1038_s41467_024_47214_7
crossref_primary_10_1186_s40168_024_01902_0
crossref_primary_10_3390_v16040590
crossref_primary_10_1016_j_watres_2024_121741
crossref_primary_10_1038_s41467_024_52464_6
crossref_primary_10_1002_imt2_188
crossref_primary_10_1093_bib_bbaf084
crossref_primary_10_1186_s40168_024_01791_3
crossref_primary_10_1038_s43705_023_00307_8
crossref_primary_10_1016_j_jare_2024_09_016
crossref_primary_10_1186_s40643_025_00852_1
crossref_primary_10_1128_aem_00695_24
Cites_doi 10.1016/j.coviro.2021.10.011
10.1038/s41586-018-0012-7
10.1016/j.cell.2019.03.040
10.1038/s41564-021-00928-6
10.1128/br.35.3.235-241.1971
10.1002/gch2.1018
10.1186/s40168-020-00990-y
10.1038/nmeth.3176
10.1038/nmeth.1938
10.1186/1471-2105-11-119
10.1098/rsob.170189
10.1007/978-1-0716-0334-5_4
10.1016/j.cell.2021.01.029
10.1038/nrmicro1750
10.3390/v4113209
10.1038/nature20167
10.1093/bioinformatics/btab026
10.1093/nar/gks1220
10.1093/molbev/msn023
10.1093/bioinformatics/btaa1066
10.1093/bioinformatics/btab293
10.1093/nar/gkj023
10.1038/s41587-019-0100-8
10.1093/nar/gkaa946
10.1016/j.ymeth.2020.05.018
10.1111/j.1751-1097.2007.00266.x
10.1038/nrmicro.2016.177
10.1016/S0022-2836(05)80360-2
ContentType Journal Article
Copyright The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com 2022
The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Copyright_xml – notice: The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com 2022
– notice: The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
– notice: The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7QO
7SC
8FD
FR3
JQ2
K9.
L7M
L~C
L~D
P64
RC3
7X8
DOI 10.1093/bib/bbac505
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Biotechnology Research Abstracts
Computer and Information Systems Abstracts
Technology Research Database
Engineering Research Database
ProQuest Computer Science Collection
ProQuest Health & Medical Complete (Alumni)
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Biotechnology and BioEngineering Abstracts
Genetics Abstracts
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Genetics Abstracts
Biotechnology Research Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
ProQuest Health & Medical Complete (Alumni)
Engineering Research Database
Advanced Technologies Database with Aerospace
Biotechnology and BioEngineering Abstracts
Computer and Information Systems Abstracts Professional
MEDLINE - Academic
DatabaseTitleList Genetics Abstracts
MEDLINE
CrossRef

MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1477-4054
ExternalDocumentID 36464489
10_1093_bib_bbac505
10.1093/bib/bbac505
Genre Research Support, Non-U.S. Gov't
Journal Article
GrantInformation_xml – fundername: Natural Science Foundation of China
  grantid: 31972847
– fundername: Key-Area Research and Development Program of Guangdong Province
  grantid: 2022B0202110001
– fundername: Central Public-Interest Scientific Institution Basal Research Fund
  grantid: 2021SD05
– fundername: Guangdong Provincial Special Fund for Modern Agriculture Industry Technology Innovation Teams
  grantid: 2019KJ141
GroupedDBID ---
-E4
.2P
.I3
0R~
1TH
23N
2WC
36B
4.4
48X
53G
5GY
5VS
6J9
70D
8VB
AAGQS
AAHBH
AAIJN
AAIMJ
AAJKP
AAJQQ
AAMDB
AAMVS
AAOGV
AAPQZ
AAPXW
AARHZ
AAUQX
AAVAP
AAVLN
ABDBF
ABEJV
ABEUO
ABGNP
ABIXL
ABNKS
ABPQP
ABPTD
ABQLI
ABQTQ
ABWST
ABXVV
ABXZS
ABZBJ
ACGFO
ACGFS
ACGOD
ACIWK
ACPRK
ACUFI
ACUHS
ACUXJ
ACYTK
ADBBV
ADEYI
ADFTL
ADGKP
ADGZP
ADHKW
ADHZD
ADOCK
ADPDF
ADQBN
ADRDM
ADRTK
ADVEK
ADYVW
ADZTZ
ADZXQ
AECKG
AEGPL
AEGXH
AEJOX
AEKKA
AEKSI
AELWJ
AEMDU
AEMOZ
AENEX
AENZO
AEPUE
AETBJ
AEWNT
AFFZL
AFGWE
AFIYH
AFOFC
AFRAH
AGINJ
AGKEF
AGQXC
AGSYK
AHMBA
AHQJS
AHXPO
AIAGR
AIJHB
AJEEA
AJEUX
AKHUL
AKVCP
AKWXX
ALMA_UNASSIGNED_HOLDINGS
ALTZX
ALUQC
ALXQX
AMNDL
ANAKG
APIBT
APWMN
ARIXL
AXUDD
AYOIW
AZVOD
BAWUL
BAYMD
BEYMZ
BHONS
BQDIO
BQUQU
BSWAC
BTQHN
C1A
C45
CAG
CDBKE
COF
CS3
CZ4
DAKXR
DIK
DILTD
DU5
D~K
E3Z
EAD
EAP
EAS
EBA
EBC
EBD
EBR
EBS
EBU
EE~
EJD
EMB
EMK
EMOBN
EST
ESX
F5P
F9B
FHSFR
FLIZI
FLUFQ
FOEOM
FQBLK
GAUVT
GJXCC
GROUPED_DOAJ
GX1
H13
H5~
HAR
HW0
HZ~
IOX
J21
JXSIZ
K1G
KBUDW
KOP
KSI
KSN
M-Z
M49
MK~
ML0
N9A
NGC
NLBLG
NMDNZ
NOMLY
NU-
O0~
O9-
OAWHX
ODMLO
OJQWA
OK1
OVD
OVEED
P2P
PAFKI
PEELM
PQQKQ
Q1.
Q5Y
QWB
RD5
RPM
RUSNO
RW1
RXO
SV3
TEORI
TH9
TJP
TLC
TOX
TR2
TUS
W8F
WOQ
X7H
YAYTL
YKOAZ
YXANX
ZKX
ZL0
~91
AAYXX
AHGBF
CITATION
ADRIX
AFXEN
BCRHZ
CGR
CUY
CVF
ECM
EIF
NPM
ROX
7QO
7SC
8FD
FR3
JQ2
K9.
L7M
L~C
L~D
P64
RC3
7X8
ID FETCH-LOGICAL-c348t-19fe2223b3e314238594c81c784f52d3d5ffe4c4296574e4356dae7690a8d7d3
IEDL.DBID TOX
ISSN 1467-5463
1477-4054
IngestDate Fri Jul 11 16:04:31 EDT 2025
Mon Jun 30 11:09:02 EDT 2025
Wed Feb 19 02:25:06 EST 2025
Thu Apr 24 22:55:54 EDT 2025
Tue Jul 01 03:39:44 EDT 2025
Wed Apr 02 06:58:29 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords graph convolutional network
semi-supervised machine learning
virus classification
PhaGCN2
ICTV
Language English
License This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model
The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c348t-19fe2223b3e314238594c81c784f52d3d5ffe4c4296574e4356dae7690a8d7d3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0003-1373-8023
0000-0001-5260-7822
0000-0002-0191-642X
0000-0001-5974-4985
PMID 36464489
PQID 3113462094
PQPubID 26846
ParticipantIDs proquest_miscellaneous_2747005641
proquest_journals_3113462094
pubmed_primary_36464489
crossref_citationtrail_10_1093_bib_bbac505
crossref_primary_10_1093_bib_bbac505
oup_primary_10_1093_bib_bbac505
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2023-01-19
PublicationDateYYYYMMDD 2023-01-19
PublicationDate_xml – month: 01
  year: 2023
  text: 2023-01-19
  day: 19
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
– name: Oxford
PublicationTitle Briefings in bioinformatics
PublicationTitleAlternate Brief Bioinform
PublicationYear 2023
Publisher Oxford University Press
Oxford Publishing Limited (England)
Publisher_xml – name: Oxford University Press
– name: Oxford Publishing Limited (England)
References Nayfach (2023011917141726500_ref28) 2021; 6
Dutilh (2023011917141726500_ref18) 2021; 51
Hyatt (2023011917141726500_ref26) 2010; 11
Suttle (2023011917141726500_ref2) 2007; 5
Simmonds (2023011917141726500_ref16) 2017; 15
Shi (2023011917141726500_ref30) 2018; 556
Asokan (2023011917141726500_ref4) 2013; 2
Roux (2023011917141726500_ref15) 2021; 49
Buchfink (2023011917141726500_ref35) 2015; 12
(2023011917141726500_ref25) 2009
Pickett (2023011917141726500_ref9) 2012; 4
Guo (2023011917141726500_ref36) 2021; 9
Nepusz (2023011917141726500_ref31) 2012; 9
Kudla (2023011917141726500_ref12) 2020; 36
Camarillo-Guerrero (2023011917141726500_ref14) 2021; 184
Bin Jang (2023011917141726500_ref21) 2019; 37
Masson (2023011917141726500_ref11) 2013; 41
(2023011917141726500_ref27) 2021
Shang (2023011917141726500_ref19) 2021; 37
Abu-Mostafa (2023011917141726500_ref20) 2012
Yilin Zhu (2023011917141726500_ref34) 2022
Shi (2023011917141726500_ref29) 2016; 540
Gregory (2023011917141726500_ref13) 2019; 177
Altschul (2023011917141726500_ref33) 1990; 215
Geoghegan (2023011917141726500_ref3) 2017; 7
Baltimore (2023011917141726500_ref6) 1971; 35
Gelderblom (2023011917141726500_ref1) 1996
Paez-Espino (2023011917141726500_ref17) 2017; 45
Meijenfeldt (2023011917141726500_ref22) 2019; 20
Adams (2023011917141726500_ref8) 2006; 34
Bhat (2023011917141726500_ref7) 2020
Elbe (2023011917141726500_ref10) 2017; 1
Grant (2023011917141726500_ref5) 2008; 84
Shang (2023011917141726500_ref24) 2021; 189
Pons (2023011917141726500_ref23) 2021; 37
Lima-Mendez (2023011917141726500_ref32) 2008; 25
References_xml – volume: 51
  start-page: 207
  year: 2021
  ident: 2023011917141726500_ref18
  article-title: Perspective on taxonomic classification of uncultivated viruses
  publication-title: Curr Opin Virol
  doi: 10.1016/j.coviro.2021.10.011
– volume: 556
  start-page: 197
  year: 2018
  ident: 2023011917141726500_ref30
  article-title: The evolutionary history of vertebrate RNA viruses
  publication-title: Nature
  doi: 10.1038/s41586-018-0012-7
– volume: 177
  start-page: 1109
  year: 2019
  ident: 2023011917141726500_ref13
  article-title: Marine DNA viral macro- and microdiversity from pole to pole
  publication-title: Cell
  doi: 10.1016/j.cell.2019.03.040
– year: 2021
  ident: 2023011917141726500_ref27
  article-title: Dataset of oyster virome and the remarkable virus diversity in filter-feeding oysters
  publication-title: Research Square
– volume: 6
  start-page: 960
  year: 2021
  ident: 2023011917141726500_ref28
  article-title: Metagenomic compendium of 189,680 DNA viruses from the human gut microbiome
  publication-title: Nat Microbiol
  doi: 10.1038/s41564-021-00928-6
– volume: 35
  start-page: 235
  year: 1971
  ident: 2023011917141726500_ref6
  article-title: Expression of animal virus genomes
  publication-title: Bacteriol Rev
  doi: 10.1128/br.35.3.235-241.1971
– volume: 45
  start-page: D457
  year: 2017
  ident: 2023011917141726500_ref17
  article-title: IMG/VR: a database of cultured and uncultured DNA viruses and retroviruses
  publication-title: Nucleic Acids Res
– year: 2022
  ident: 2023011917141726500_ref34
  article-title: Phage taxonomic classification: challenges, current tools, and limitations
  publication-title: arXiv
– volume: 1
  start-page: 33
  year: 2017
  ident: 2023011917141726500_ref10
  article-title: Data, disease and diplomacy: GISAID's innovative contribution to global health
  publication-title: Glob Chall
  doi: 10.1002/gch2.1018
– volume: 9
  start-page: 37
  year: 2021
  ident: 2023011917141726500_ref36
  article-title: VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses
  publication-title: Microbiome
  doi: 10.1186/s40168-020-00990-y
– volume: 12
  start-page: 59
  year: 2015
  ident: 2023011917141726500_ref35
  article-title: Fast and sensitive protein alignment using DIAMOND
  publication-title: Nat Methods
  doi: 10.1038/nmeth.3176
– volume: 9
  start-page: 471
  year: 2012
  ident: 2023011917141726500_ref31
  article-title: Detecting overlapping protein complexes in protein-protein interaction networks
  publication-title: Nat Methods
  doi: 10.1038/nmeth.1938
– volume: 11
  start-page: 119
  year: 2010
  ident: 2023011917141726500_ref26
  article-title: Prodigal: prokaryotic gene recognition and translation initiation site identification
  publication-title: BMC Bioinform
  doi: 10.1186/1471-2105-11-119
– volume: 7
  start-page: 170189
  year: 2017
  ident: 2023011917141726500_ref3
  article-title: Predicting virus emergence amid evolutionary noise
  publication-title: Open Biol
  doi: 10.1098/rsob.170189
– start-page: 29
  volume-title: Characterization of Plant Viruses: Methods and Protocols
  year: 2020
  ident: 2023011917141726500_ref7
  doi: 10.1007/978-1-0716-0334-5_4
– volume: 184
  start-page: 1098
  year: 2021
  ident: 2023011917141726500_ref14
  article-title: Massive expansion of human gut bacteriophage diversity
  publication-title: Cell
  doi: 10.1016/j.cell.2021.01.029
– volume: 5
  start-page: 801
  year: 2007
  ident: 2023011917141726500_ref2
  article-title: Marine viruses—major players in the global ecosystem
  publication-title: Nat Rev Microbiol
  doi: 10.1038/nrmicro1750
– volume: 4
  start-page: 3209
  year: 2012
  ident: 2023011917141726500_ref9
  article-title: Virus pathogen database and analysis resource (ViPR): a comprehensive bioinformatics database and analysis resource for the coronavirus research community
  publication-title: Viruses
  doi: 10.3390/v4113209
– start-page: 361
  volume-title: Proceedings of the International AAAI Conference on Web and Social Media
  year: 2009
  ident: 2023011917141726500_ref25
– volume: 540
  start-page: 539
  year: 2016
  ident: 2023011917141726500_ref29
  article-title: Redefining the invertebrate RNA virosphere
  publication-title: Nature
  doi: 10.1038/nature20167
– volume: 37
  start-page: 1805
  year: 2021
  ident: 2023011917141726500_ref23
  article-title: VPF-class: taxonomic assignment and host prediction of uncultivated viruses based on viral protein families
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btab026
– volume: 41
  start-page: D579
  year: 2013
  ident: 2023011917141726500_ref11
  article-title: ViralZone: recent updates to the virus knowledge resource
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gks1220
– volume: 25
  start-page: 762
  year: 2008
  ident: 2023011917141726500_ref32
  article-title: Reticulate representation of evolutionary and functional relationships between phage genomes
  publication-title: Mol Biol Evol
  doi: 10.1093/molbev/msn023
– volume: 36
  start-page: 5507
  year: 2020
  ident: 2023011917141726500_ref12
  article-title: Virxicon: a lexicon of viral sequences
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btaa1066
– volume: 37
  start-page: i25
  year: 2021
  ident: 2023011917141726500_ref19
  article-title: Bacteriophage classification for assembled contigs using graph convolutional network
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btab293
– volume: 20
  start-page: 1
  year: 2019
  ident: 2023011917141726500_ref22
  article-title: Robust taxonomic classification of uncharted microbial sequences and bins with CAT and BAT
  publication-title: Genome Biol
– volume: 34
  start-page: D382
  year: 2006
  ident: 2023011917141726500_ref8
  article-title: DPVweb: a comprehensive database of plant and fungal virus genes and genomes
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkj023
– volume: 2
  start-page: 76
  year: 2013
  ident: 2023011917141726500_ref4
  article-title: Emerging infectious diseases, antimicrobial resistance and millennium development goals: resolving the challenges through one health
  publication-title: Cent Asian J Glob Health
– volume-title: Learning from Data: A Short Course
  year: 2012
  ident: 2023011917141726500_ref20
– volume: 37
  start-page: 632
  year: 2019
  ident: 2023011917141726500_ref21
  article-title: Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks
  publication-title: Nat Biotechnol
  doi: 10.1038/s41587-019-0100-8
– volume: 49
  start-page: D764
  year: 2021
  ident: 2023011917141726500_ref15
  article-title: IMG/VR v3: an integrated ecological and evolutionary framework for interrogating genomes of uncultivated viruses
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkaa946
– volume: 189
  start-page: 95
  year: 2021
  ident: 2023011917141726500_ref24
  article-title: CHEER: HierarCHical taxonomic classification for viral mEtagEnomic data via deep leaRning
  publication-title: Methods
  doi: 10.1016/j.ymeth.2020.05.018
– volume-title: Medical Microbiology
  year: 1996
  ident: 2023011917141726500_ref1
– volume: 84
  start-page: 356
  year: 2008
  ident: 2023011917141726500_ref5
  article-title: Hypothesis—ultraviolet-B irradiance and vitamin D reduce the risk of viral infections and thus their sequelae, including autoimmune diseases and some cancers
  publication-title: Photochem Photobiol
  doi: 10.1111/j.1751-1097.2007.00266.x
– volume: 15
  start-page: 161
  year: 2017
  ident: 2023011917141726500_ref16
  article-title: Consensus statement: virus taxonomy in the age of metagenomics
  publication-title: Nat Rev Microbiol
  doi: 10.1038/nrmicro.2016.177
– volume: 215
  start-page: 403
  year: 1990
  ident: 2023011917141726500_ref33
  article-title: Basic local alignment search tool
  publication-title: J Mol Biol
  doi: 10.1016/S0022-2836(05)80360-2
SSID ssj0020781
Score 2.5921066
Snippet Abstract Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for...
Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate...
SourceID proquest
pubmed
crossref
oup
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
SubjectTerms Classification
Databases, Factual
Genome, Viral
Genomics
Sequences
Software
Source code
State-of-the-art reviews
Taxonomy
Viruses
Viruses - genetics
Title Virus classification for viral genomic fragments using PhaGCN2
URI https://www.ncbi.nlm.nih.gov/pubmed/36464489
https://www.proquest.com/docview/3113462094
https://www.proquest.com/docview/2747005641
Volume 24
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwhV3NS8MwFA8yELyI31anRthJKCNN2rQXQYZzCE4PU3YraZrMweyk3Q7-977XdoXp0EsvTUj7kvB-7-v3COkY-LpIKOFGygSu8IR1Ix8eoJtSZjwW-AqrkZ-GweBVPI79cZ0gW2wI4Ue8m0yTbpIo7ZdUpaB-kSJ_9Dxu7Crkq6mKiKSL7O51Gd6PuWuKZ62Y7RemLHVLf4_s1qCQ3lW7uE-2THZAtqs2kV-H5PZtmi8LqhHoYmZPKUwKaJNihu6MIs_qx1RTm6tJWbFGMZt9Ql_e1UNv6B2RUf9-1Bu4dd8DV3MRLlwWWYNqO0EHJcCdEH5Zh0zLUFjfS3nqW2uEBk0S-FIYADxBqowEO1eFqUz5MWll88ycEmrTUIJJJBXcSoEBPaReSaSR1npWcumQm5VMYl1zgmNrillcxaZ5DAKMawE6pNMM_qyoMDYPuwLh_j2ivRJ8XN-YIuaMcRF4YG065Lp5DWcdAxgqM_NlEaMFjdylgjnkpNqwZh0eCDQ1o7N_lz8nO-jxQccwi9qktciX5gKQxSK5LM_VNzvYyME
linkProvider Oxford University Press
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Virus+classification+for+viral+genomic+fragments+using+PhaGCN2&rft.jtitle=Briefings+in+bioinformatics&rft.au=Jing-Zhe+Jiang&rft.au=Wen-Guang+Yuan&rft.au=Shang%2C+Jiayu&rft.au=Ying-Hui%2C+Shi&rft.date=2023-01-19&rft.pub=Oxford+Publishing+Limited+%28England%29&rft.issn=1467-5463&rft.eissn=1477-4054&rft.volume=24&rft.issue=1&rft_id=info:doi/10.1093%2Fbib%2Fbbac505&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1467-5463&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1467-5463&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1467-5463&client=summon