Chromosome-Scale Assembly of the Bread Wheat Genome Reveals Thousands of Additional Gene Copies

Bibliographic Details
Published in Genetics (Austin) Vol. 216; no. 2; pp. 599-608
Main Authors Alonge, Michael; Shumate, Alaina; Puiu, Daniela; Zimin, Aleksey V; Salzberg, Steven L
Format Journal Article
Language English
Published United States Oxford University Press 01.10.2020
Genetics Society of America
Abstract Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbp more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications, including at the Ppd-B1 photoperiod response locus.
Author_xml – sequence: 1
  givenname: Michael
  surname: Alonge
  fullname: Alonge, Michael
  email: malonge11@gmail.com
  organization: Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218
– sequence: 2
  givenname: Alaina
  surname: Shumate
  fullname: Shumate, Alaina
  organization: Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218
– sequence: 3
  givenname: Daniela
  surname: Puiu
  fullname: Puiu, Daniela
  organization: Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218
– sequence: 4
  givenname: Aleksey V
  surname: Zimin
  fullname: Zimin, Aleksey V
  organization: Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218
– sequence: 5
  givenname: Steven L
  surname: Salzberg
  fullname: Salzberg, Steven L
  email: salzberg@jhu.edu
  organization: Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218
BackLink https://www.ncbi.nlm.nih.gov/pubmed/32796007
ContentType Journal Article
Copyright Copyright © 2020 by the Genetics Society of America
DOI 10.1534/genetics.120.303501
Discipline Biology
EISSN 1943-2631
EndPage 608
ExternalDocumentID PMC7536849
32796007
10_1534_genetics_120_303501
10.1534/genetics.120.303501
Genre Research Support, U.S. Gov't, Non-P.H.S
Journal Article
Research Support, N.I.H., Extramural
GrantInformation_xml – fundername: NIGMS NIH HHS
  grantid: R35 GM130151
– fundername: NHGRI NIH HHS
  grantid: R01 HG006677
ISSN 1943-2631
0016-6731
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords wheat
scaffolding
gene annotation
gene duplication
genome assembly
Language English
License This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
Notes These authors contributed equally to this work.
ORCID 0000-0002-4450-1857
0000-0002-2386-9265
0000-0002-8859-7432
0000-0002-3692-1819
0000-0001-5091-3092
OpenAccessLink https://www.ncbi.nlm.nih.gov/pmc/articles/7536849
PMID 32796007
PageCount 10
PublicationDate 2020-10-01
PublicationDateYYYYMMDD 2020-10-01
PublicationDate_xml – month: 10
  year: 2020
  text: 2020-10-01
  day: 01
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
– name: Bethesda
PublicationTitle Genetics (Austin)
PublicationTitleAlternate Genetics
PublicationYear 2020
Publisher Oxford University Press
Genetics Society of America
Publisher_xml – name: Oxford University Press
– name: Genetics Society of America
  article-title: Liftoff: an accurate gene annotation mapping tool.
  publication-title: bioRxiv
  doi: 10.1101/2020.06.24.169680
– volume: 17
  start-page: 10
  year: 2011
  ident: 2021050609181251700_bib21
  article-title: Cutadapt removes adapter sequences from high-throughput sequencing reads.
  publication-title: EMBnet. J.
  doi: 10.14806/ej.17.1.200
– volume: 20
  start-page: 224
  year: 2019
  ident: 2021050609181251700_bib1
  article-title: RaGOO: fast and accurate reference-guided scaffolding of draft genomes.
  publication-title: Genome Biol.
  doi: 10.1186/s13059-019-1829-6
– volume: 353
  start-page: 31
  year: 1991
  ident: 2021050609181251700_bib9
  article-title: The war of the whorls: genetic interactions controlling flower development.
  publication-title: Nature
  doi: 10.1038/353031a0
– start-page: 1862
  volume-title: Science
  year: 2007
  ident: 2021050609181251700_bib11
  article-title: Genome plasticity a key factor in the success of polyploid wheat under domestication.
  doi: 10.1126/science.1143986
– volume: 115
  start-page: 721
  year: 2007
  ident: 2021050609181251700_bib6
  article-title: A Pseudo-Response Regulator is misexpressed in the photoperiod insensitive Ppd-D1a mutant of wheat (Triticum aestivum L.).
  publication-title: Theor. Appl. Genet.
  doi: 10.1007/s00122-007-0603-4
– volume: 6
  start-page: 1
  year: 2017
  ident: 2021050609181251700_bib35
  article-title: The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum.
  publication-title: Gigascience
  doi: 10.1093/gigascience/gix097
– volume: 31
  start-page: 1674
  year: 2015
  ident: 2021050609181251700_bib19
  article-title: MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph.
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btv033
– volume: 182
  start-page: 145
  year: 2020
  ident: 2021050609181251700_bib2
  article-title: Major impacts of widespread structural variation on gene expression and crop improvement in tomato.
  publication-title: Cell
  doi: 10.1016/j.cell.2020.05.021
– volume: 5
  start-page: 471
  year: 2019
  ident: 2021050609181251700_bib30
  article-title: Duplication of a domestication locus neutralized a cryptic variant that caused a breeding barrier in tomato.
  publication-title: Nat. Plants
  doi: 10.1038/s41477-019-0422-z
– volume: 2
  start-page: 186
  year: 2001
  ident: 2021050609181251700_bib23
  article-title: Function and evolution of the plant MADS-box gene family.
  publication-title: Nat. Rev. Genet.
  doi: 10.1038/35056041
Snippet Abstract Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity...
SourceID pubmedcentral
proquest
pubmed
crossref
oup
SourceType Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 599
SubjectTerms Agricultural research
Annotations
Assembly
Bread
Chromosomes
Chromosomes, Plant - genetics
Contig Mapping - methods
Contig Mapping - standards
Domestication
Flow cytometry
Gene Dosage
Genes
Genetics
Genome, Plant
Genomes
Genomics
Genomics - methods
Genomics - standards
Goat grass
Investigations
Reference Standards
Triticum
Triticum - genetics
Wheat
Title Chromosome-Scale Assembly of the Bread Wheat Genome Reveals Thousands of Additional Gene Copies
URI https://www.ncbi.nlm.nih.gov/pubmed/32796007
https://www.proquest.com/docview/2453198066
https://www.proquest.com/docview/2434476505
https://www.proquest.com/docview/2522618598
https://pubmed.ncbi.nlm.nih.gov/PMC7536849
Volume 216
DOI 10.1534/genetics.120.303501
ISSN 1943-2631