Chromosome-Scale Assembly of the Bread Wheat Genome Reveals Thousands of Additional Gene Copies
Abstract Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference g...
Saved in:
Published in | Genetics (Austin) Vol. 216; no. 2; pp. 599 - 608 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
United States
Oxford University Press
01.10.2020
Genetics Society of America |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Abstract
Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus. |
---|---|
AbstractList | Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered .5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus. Bread wheat ( is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the photoperiod response locus. Bread wheat ( Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus. Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus. Abstract Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus. Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus.Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus. |
Author | Shumate, Alaina Zimin, Aleksey V Alonge, Michael Salzberg, Steven L Puiu, Daniela |
Author_xml | – sequence: 1 givenname: Michael surname: Alonge fullname: Alonge, Michael email: malonge11@gmail.com organization: Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218 – sequence: 2 givenname: Alaina surname: Shumate fullname: Shumate, Alaina organization: Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218 – sequence: 3 givenname: Daniela surname: Puiu fullname: Puiu, Daniela organization: Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218 – sequence: 4 givenname: Aleksey V surname: Zimin fullname: Zimin, Aleksey V organization: Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218 – sequence: 5 givenname: Steven L surname: Salzberg fullname: Salzberg, Steven L email: salzberg@jhu.edu organization: Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218 |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/32796007$$D View this record in MEDLINE/PubMed |
BookMark | eNqNkktr3DAUhUVJaR7tLygUQTfdeKqHJUubwmRok0Kg0KZ0KTT2VaxgS1PJDuTfV2aSkGSTriTQd47u4xyjgxADIPSekhUVvP58BQEm3-YVZWTFCReEvkJHVNe8YpLTg0f3Q3Sc8zUhRGqh3qBDzhotCWmOkNn0KY4xxxGqX60dAK9zhnE73OLo8NQDPk1gO_ynBzvhMwgFxD_hBuyQ8WUf52xDlxd23XV-8jHYYcEAb-LOQ36LXruCwru78wT9_vb1cnNeXfw4-75ZX1StaOhUCabqLVW6JRoU150A1rhtI52Q1GoBruk01ZJJcNZq6rRubW2Z1gyU6hznJ-jL3nc3b0foWghTsoPZJT_adGui9ebpS_C9uYo3phFcqloXg093Bin-nSFPZvS5hWGwAUqXhgnGJFVCq5fRmtd1IwURBf34DL2OcyozWijBqVZEykJ9eFz8Q9X3ayoA3wNtijkncA8IJWYJg7kPgylhMPswFJV-pmr9ZJcdlQn44QXtaq-N8-6_PvsH2abLGA |
CitedBy_id | crossref_primary_10_3389_fpls_2024_1458250 crossref_primary_10_1371_journal_pone_0309944 crossref_primary_10_3390_ijms26020665 crossref_primary_10_1002_tpg2_20191 crossref_primary_10_1007_s11105_024_01522_w crossref_primary_10_3390_ijms242317054 crossref_primary_10_3390_plants13111514 crossref_primary_10_1093_bioinformatics_btaa1016 crossref_primary_10_3390_life13081668 crossref_primary_10_1016_j_molp_2020_09_001 crossref_primary_10_1016_j_molp_2022_01_014 crossref_primary_10_3390_ijms25168614 crossref_primary_10_1016_j_molp_2022_01_013 crossref_primary_10_1007_s00122_023_04383_1 crossref_primary_10_1186_s12864_021_07475_8 crossref_primary_10_3389_fpls_2022_1006409 crossref_primary_10_31857_S0042132423010040 crossref_primary_10_3389_fpls_2022_984825 crossref_primary_10_3389_fgene_2021_818880 crossref_primary_10_1038_s41588_022_01022_1 crossref_primary_10_3390_life14040535 crossref_primary_10_1016_j_pbi_2022_102255 crossref_primary_10_1093_pcp_pcaa152 crossref_primary_10_1111_tpj_15289 crossref_primary_10_1002_tpg2_20212 crossref_primary_10_1093_dnares_dsab008 crossref_primary_10_3389_fpls_2021_774994 crossref_primary_10_1093_g3journal_jkab421 crossref_primary_10_1134_S2079086423020032 |
Cites_doi | 10.1093/bioinformatics/bty191 10.1093/bioinformatics/btx304 10.1038/s41477-019-0577-7 10.1093/bioinformatics/btq033 10.1016/j.ympev.2006.01.023 10.1111/tpj.13424 10.1111/pce.13167 10.1186/s12863-015-0258-0 10.1101/gr.217117.116 10.1371/journal.pcbi.1005944 10.1371/journal.pcbi.1007981 10.1093/bioinformatics/btp324 10.1101/gr.101360.109 10.1371/journal.pone.0033234 10.1126/science.1251788 10.1186/gb-2004-5-2-r12 10.1007/BF02672069 10.1016/j.cell.2020.05.023 10.1371/journal.pgen.1005997 10.12688/f1000research.23297.1 10.1016/S0022-2836(05)80360-2 10.1186/s13059-015-0582-8 10.1093/bioinformatics/btp352 10.1126/science.aar7191 10.1101/2020.06.24.169680 10.14806/ej.17.1.200 10.1186/s13059-019-1829-6 10.1038/353031a0 10.1126/science.1143986 10.1007/s00122-007-0603-4 10.1093/gigascience/gix097 10.1093/bioinformatics/btv033 10.1016/j.cell.2020.05.021 10.1038/s41477-019-0422-z 10.1038/35056041 |
ContentType | Journal Article |
Copyright | Copyright © 2020 by the Genetics Society of America 2020 Copyright © 2020 by the Genetics Society of America. Copyright Genetics Society of America Oct 2020 |
Copyright_xml | – notice: Copyright © 2020 by the Genetics Society of America 2020 – notice: Copyright © 2020 by the Genetics Society of America. – notice: Copyright Genetics Society of America Oct 2020 |
DBID | AAYXX CITATION CGR CUY CVF ECM EIF NPM 3V. 4T- 4U- 7QP 7SS 7TK 7TM 7X2 7X7 7XB 88A 88E 88I 8AO 8C1 8FD 8FE 8FH 8FI 8FJ 8FK 8G5 ABUWG AEUYN AFKRA ATCPS AZQEC BBNVY BENPR BHPHI CCPQU DWQXO FR3 FYUFA GHDGH GNUQQ GUQSH HCIFZ K9- K9. LK8 M0K M0R M0S M1P M2O M2P M7N M7P MBDVC P64 PHGZM PHGZT PJZUB PKEHL PPXIY PQEST PQGLB PQQKQ PQUKI Q9U RC3 7X8 5PM |
DOI | 10.1534/genetics.120.303501 |
DatabaseName | CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed ProQuest Central (Corporate) Docstoc University Readers Calcium & Calcified Tissue Abstracts Entomology Abstracts (Full archive) Neurosciences Abstracts Nucleic Acids Abstracts Agricultural Science Collection Health & Medical Collection ProQuest Central (purchase pre-March 2016) Biology Database (Alumni Edition) Medical Database (Alumni Edition) Science Database (Alumni Edition) ProQuest Pharma Collection Public Health Database Technology Research Database ProQuest SciTech Collection ProQuest Natural Science Journals Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Research Library ProQuest Central (Alumni) ProQuest One Sustainability ProQuest Central UK/Ireland Agricultural & Environmental Science Collection ProQuest Central Essentials Biological Science Collection ProQuest Central Natural Science Collection ProQuest One ProQuest Central Korea Engineering Research Database Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Central Student ProQuest Research Library SciTech Premium Collection Consumer Health Database ProQuest Health & Medical Complete (Alumni) Biological Sciences Agriculture Science Database Consumer Health Database (ProQuest) ProQuest Health & Medical Collection Medical Database Research Library Science Database Algology Mycology and Protozoology Abstracts (Microbiology C) Biological Science Database Research Library (Corporate) Biotechnology and BioEngineering Abstracts ProQuest Central Premium ProQuest One Academic (New) ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central Basic Genetics Abstracts MEDLINE - Academic PubMed Central (Full Participant titles) |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Agricultural Science Database Research Library Prep ProQuest Central Student ProQuest Central Essentials Nucleic Acids Abstracts SciTech Premium Collection ProQuest One Applied & Life Sciences ProQuest One Sustainability Health Research Premium Collection Natural Science Collection Health & Medical Research Collection Biological Science Collection ProQuest Central (New) ProQuest Medical Library (Alumni) ProQuest Science Journals (Alumni Edition) ProQuest Biological Science Collection ProQuest Family Health ProQuest One Academic Eastern Edition Agricultural Science Collection ProQuest Hospital Collection Health Research Premium Collection (Alumni) Biological Science Database Neurosciences Abstracts ProQuest Hospital Collection (Alumni) Biotechnology and BioEngineering Abstracts Entomology Abstracts ProQuest Health & Medical Complete ProQuest One Academic UKI Edition Docstoc Engineering Research Database ProQuest One Academic Calcium & Calcified Tissue Abstracts ProQuest One Academic (New) University Readers Technology Research Database ProQuest One Academic Middle East (New) ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing Research Library (Alumni Edition) ProQuest Natural Science Collection ProQuest Pharma Collection ProQuest Family Health (Alumni Edition) ProQuest Biology Journals (Alumni Edition) ProQuest Central ProQuest Health & Medical Research Collection Genetics Abstracts Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Algology Mycology and Protozoology Abstracts (Microbiology C) Agricultural & Environmental Science Collection ProQuest Research Library ProQuest Public Health ProQuest Central Basic ProQuest Science Journals ProQuest SciTech Collection ProQuest Medical Library ProQuest Central (Alumni) MEDLINE - Academic |
DatabaseTitleList | Agricultural Science Database MEDLINE MEDLINE - Academic CrossRef MEDLINE - Academic |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: BENPR name: ProQuest Central url: https://www.proquest.com/central sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 1943-2631 |
EndPage | 608 |
ExternalDocumentID | PMC7536849 32796007 10_1534_genetics_120_303501 10.1534/genetics.120.303501 |
Genre | Research Support, U.S. Gov't, Non-P.H.S Journal Article Research Support, N.I.H., Extramural |
GrantInformation_xml | – fundername: NIGMS NIH HHS grantid: R35 GM130151 – fundername: NHGRI NIH HHS grantid: R01 HG006677 |
GroupedDBID | --- --Z -DZ -~X .-4 .55 .GJ 0R~ 186 18M 29H 2KS 2WC 34G 36B 39C 3V. 53G 5GY 5RE 5VS 5WD 7X2 7X7 85S 88A 88E 88I 8AO 8C1 8FE 8FH 8FI 8FJ 8G5 8R4 8R5 9M8 A8Z AABZA AACZT AAPXW AARHZ AASNB AAUAY AAUTI AAVAP AAYOK ABDNZ ABJNI ABMNT ABNHQ ABPPZ ABPTD ABTAH ABUWG ABXVV ACFRR ACGOD ACIHN ACIPB ACNCT ACPRK ACPVT ACUTJ ACYGS ADBBV ADIPN ADQBN ADVEK AEAQA AENEX AFFDN AFFNX AFFZL AFGWE AFKRA AFRAH AGMDO AHMBA AJEEA ALIPV ALMA_UNASSIGNED_HOLDINGS AOIJS APEBS ATCPS ATGXG AZQEC BAWUL BBNVY BCRHZ BENPR BES BEYMZ BHPHI BKNYI BKOMP BPHCQ BTFSW BVXVI C1A CCPQU CJ0 CS3 D0L DIK DU5 DWQXO E3Z EBD EBS EJD EMB EMOBN ESTFP F5P F8P F9R FD6 FLUFQ FOEOM FRP FYUFA GNUQQ GUQSH GX1 H13 HCIFZ HMCUK HYE H~9 INIJC K9- KBUDW KOP KQ8 KSI KSN L7B LK8 M0K M0L M0R M1P M2O M2P M7P MV1 MVM NHB NOMLY OBOKY OCZFY OHT OJZSN OK1 OMK OPAEJ OWPYF PQQKQ PROAC PSQYO Q2X QF4 QM4 QM9 QN7 QO4 R0Z RHF RHI ROX RPM RXW SJN SV3 TAE TGS TH9 TN5 TR2 TWZ U5U UHB UKHRP UKR UNMZH UPT VQA W8F WH7 WHG WOQ X7M XOL XSW YHG YKV YSK YYP YYQ YZZ ZCA ZGI ZXP ZY4 ~KM AAYXX ABDFA ABEJV ABGNP ABVGC ABXZS ADGKP AEUYN AGORE AHMMS AJNCP ALXQX CITATION JXSIZ PHGZM PHGZT ACVCV APJGH CGR CUY CVF ECM EIF NPM 4T- 4U- 7QP 7SS 7TK 7TM 7XB 8FD 8FK FR3 K9. M7N MBDVC P64 PJZUB PKEHL PPXIY PQEST PQGLB PQUKI Q9U RC3 7X8 5PM |
ID | FETCH-LOGICAL-c571t-5284b189c09e839d5e27fb76f561a95ef7d919626efaa91f99ca4a2992e88df33 |
IEDL.DBID | 7X7 |
ISSN | 1943-2631 0016-6731 |
IngestDate | Thu Aug 21 18:18:33 EDT 2025 Thu Jul 10 22:00:09 EDT 2025 Fri Jul 11 06:23:33 EDT 2025 Fri Jul 25 19:19:23 EDT 2025 Thu Apr 03 06:52:29 EDT 2025 Tue Jul 01 01:55:43 EDT 2025 Thu Apr 24 23:00:19 EDT 2025 Wed Aug 28 03:17:49 EDT 2024 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 2 |
Keywords | wheat scaffolding gene annotation gene duplication genome assembly |
Language | English |
License | This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model) https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model Copyright © 2020 by the Genetics Society of America. |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c571t-5284b189c09e839d5e27fb76f561a95ef7d919626efaa91f99ca4a2992e88df33 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 These authors contributed equally to this work. |
ORCID | 0000-0002-4450-1857 0000-0002-2386-9265 0000-0002-8859-7432 0000-0002-3692-1819 0000-0001-5091-3092 |
OpenAccessLink | https://www.ncbi.nlm.nih.gov/pmc/articles/7536849 |
PMID | 32796007 |
PQID | 2453198066 |
PQPubID | 23479 |
PageCount | 10 |
ParticipantIDs | pubmedcentral_primary_oai_pubmedcentral_nih_gov_7536849 proquest_miscellaneous_2522618598 proquest_miscellaneous_2434476505 proquest_journals_2453198066 pubmed_primary_32796007 crossref_primary_10_1534_genetics_120_303501 crossref_citationtrail_10_1534_genetics_120_303501 oup_primary_10_1534_genetics_120_303501 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2020-10-01 |
PublicationDateYYYYMMDD | 2020-10-01 |
PublicationDate_xml | – month: 10 year: 2020 text: 2020-10-01 day: 01 |
PublicationDecade | 2020 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States – name: Bethesda |
PublicationTitle | Genetics (Austin) |
PublicationTitleAlternate | Genetics |
PublicationYear | 2020 |
Publisher | Oxford University Press Genetics Society of America |
Publisher_xml | – name: Oxford University Press – name: Genetics Society of America |
References | Quinlan (2021050609181251700_bib26) 2010; 26 Dubcovsky (2021050609181251700_bib11) 2007 Li (2021050609181251700_bib17) 2009; 25 Liu (2021050609181251700_bib20) 2020; 182 Guo (2021050609181251700_bib12) 2016; 12 Soyk (2021050609181251700_bib30) 2019; 5 Marçais (2021050609181251700_bib22) 2018; 14 Würschum (2021050609181251700_bib33) 2018; 41 Alonge (2021050609181251700_bib1) 2019; 20 Díaz (2021050609181251700_bib10) 2012 Kokot (2021050609181251700_bib14) 2017; 33 Arumuganathan (2021050609181251700_bib5) 1991; 9 Coen (2021050609181251700_bib9) 1991; 353 Shumate (2021050609181251700_bib28) 2020 Zimin (2021050609181251700_bib34) 2019; 16 Kurtz (2021050609181251700_bib15) 2004; 5 Li (2021050609181251700_bib16) 2018 Petersen (2021050609181251700_bib25) 2006; 39 Pertea (2021050609181251700_bib24) 2020; 9 Martin (2021050609181251700_bib21) 2011; 17 Würschum (2021050609181251700_bib31) 2015; 16 International Wheat Genome Sequencing Consortium (IWGSC) (2021050609181251700_bib13) 2014 Song (2021050609181251700_bib29) 2020; 6 Zimin (2021050609181251700_bib35) 2017; 6 Chapman (2021050609181251700_bib7) 2015; 16 Altschul (2021050609181251700_bib3) 1990; 215 Li (2021050609181251700_bib18) 2009; 25 Li (2021050609181251700_bib19) 2015; 31 Clavijo (2021050609181251700_bib8) 2017; 27 Alonge (2021050609181251700_bib2) 2020; 182 Ng (2021050609181251700_bib23) 2001; 2 Schatz (2021050609181251700_bib27) 2010; 20 Würschum (2021050609181251700_bib32) 2017; 89 Appels (2021050609181251700_bib4) 2018 Beales (2021050609181251700_bib6) 2007; 115 |
References_xml | – start-page: 3094 volume-title: Bioinformatics year: 2018 ident: 2021050609181251700_bib16 article-title: Minimap2: pairwise alignment for nucleotide sequences. doi: 10.1093/bioinformatics/bty191 – volume: 33 start-page: 2759 year: 2017 ident: 2021050609181251700_bib14 article-title: KMC 3: counting and manipulating k-mer statistics. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btx304 – volume: 6 start-page: 34 year: 2020 ident: 2021050609181251700_bib29 article-title: Eight high-quality genomes reveal pan-genome architecture and ecotype differentiation of Brassica napus. publication-title: Nat. Plants doi: 10.1038/s41477-019-0577-7 – volume: 26 start-page: 841 year: 2010 ident: 2021050609181251700_bib26 article-title: BEDTools: a flexible suite of utilities for comparing genomic features. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btq033 – volume: 39 start-page: 70 year: 2006 ident: 2021050609181251700_bib25 article-title: Phylogenetic relationships of Triticum and Aegilops and evidence for the origin of the A, B, and D genomes of common wheat (Triticum aestivum). publication-title: Mol. Phylogenet. Evol. doi: 10.1016/j.ympev.2006.01.023 – volume: 89 start-page: 764 year: 2017 ident: 2021050609181251700_bib32 article-title: Copy number variations of CBF genes at the Fr-A2 locus are essential components of winter hardiness in wheat. publication-title: Plant J. doi: 10.1111/tpj.13424 – volume: 41 start-page: 1407 year: 2018 ident: 2021050609181251700_bib33 article-title: A three-component system incorporating Ppd-D1, copy number variation at Ppd-B1, and numerous small-effect quantitative trait loci facilitates adaptation of heading time in winter wheat cultivars of worldwide origin. publication-title: Plant Cell Environ. doi: 10.1111/pce.13167 – volume: 16 start-page: 96 year: 2015 ident: 2021050609181251700_bib31 article-title: Multiply to conquer: copy number variations at Ppd-B1 and Vrn-A1 facilitate global adaptation in wheat. publication-title: BMC Genet. doi: 10.1186/s12863-015-0258-0 – volume: 27 start-page: 885 year: 2017 ident: 2021050609181251700_bib8 article-title: An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations. publication-title: Genome Res. doi: 10.1101/gr.217117.116 – volume: 14 year: 2018 ident: 2021050609181251700_bib22 article-title: MUMmer4: a fast and versatile genome alignment system. publication-title: PLOS Comput. Biol. doi: 10.1371/journal.pcbi.1005944 – volume: 16 start-page: e1007981 issue: 6 year: 2019 ident: 2021050609181251700_bib34 article-title: The genome polishing tool POLCA makes fast and accurate corrections in genome assemblies. publication-title: PLoS Comput Biol doi: 10.1371/journal.pcbi.1007981 – volume: 25 start-page: 1754 year: 2009 ident: 2021050609181251700_bib17 article-title: Fast and accurate short read alignment with Burrows-Wheeler transform. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp324 – volume: 20 start-page: 1165 year: 2010 ident: 2021050609181251700_bib27 article-title: Assembly of large genomes using second-generation sequencing. publication-title: Genome Res. doi: 10.1101/gr.101360.109 – start-page: e33234 volume-title: PLoS One year: 2012 ident: 2021050609181251700_bib10 article-title: Copy Number Variation Affecting the Photoperiod-B1 and Vernalization-A1 Genes Is Associated with Altered Flowering Time in Wheat (Triticum aestivum). doi: 10.1371/journal.pone.0033234 – start-page: 1251788 volume-title: Science year: 2014 ident: 2021050609181251700_bib13 article-title: A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. doi: 10.1126/science.1251788 – volume: 5 start-page: R12 year: 2004 ident: 2021050609181251700_bib15 article-title: Versatile and open software for comparing large genomes. publication-title: Genome Biol. doi: 10.1186/gb-2004-5-2-r12 – volume: 9 start-page: 208 year: 1991 ident: 2021050609181251700_bib5 article-title: Nuclear DNA content of some important plant species. publication-title: Plant Mol. Biol. Report. doi: 10.1007/BF02672069 – volume: 182 start-page: 162 year: 2020 ident: 2021050609181251700_bib20 article-title: Pan-genome of wild and cultivated soybeans. publication-title: Cell doi: 10.1016/j.cell.2020.05.023 – volume: 12 year: 2016 ident: 2021050609181251700_bib12 article-title: De novo centromere formation and centromeric sequence expansion in wheat and its wide hybrids. publication-title: PLoS Genet. doi: 10.1371/journal.pgen.1005997 – volume: 9 start-page: 304 year: 2020 ident: 2021050609181251700_bib24 article-title: GFF utilities: GffRead and GffCompare. publication-title: F1000 Res. doi: 10.12688/f1000research.23297.1 – volume: 215 start-page: 403 year: 1990 ident: 2021050609181251700_bib3 article-title: Basic local alignment search tool. publication-title: J. Mol. Biol. doi: 10.1016/S0022-2836(05)80360-2 – volume: 16 start-page: 26 year: 2015 ident: 2021050609181251700_bib7 article-title: A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome. publication-title: Genome Biol. doi: 10.1186/s13059-015-0582-8 – volume: 25 start-page: 2078 year: 2009 ident: 2021050609181251700_bib18 article-title: The sequence alignment/map format and SAMtools. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp352 – start-page: eaar7191 volume-title: Science year: 2018 ident: 2021050609181251700_bib4 article-title: Shifting the limits in wheat research and breeding using a fully annotated reference genome. doi: 10.1126/science.aar7191 – year: 2020 ident: 2021050609181251700_bib28 article-title: Liftoff: an accurate gene annotation mapping tool. publication-title: bioRxiv doi: 10.1101/2020.06.24.169680 – volume: 17 start-page: 10 year: 2011 ident: 2021050609181251700_bib21 article-title: Cutadapt removes adapter sequences from high-throughput sequencing reads. publication-title: EMBnet. J. doi: 10.14806/ej.17.1.200 – volume: 20 start-page: 224 year: 2019 ident: 2021050609181251700_bib1 article-title: RaGOO: fast and accurate reference-guided scaffolding of draft genomes. publication-title: Genome Biol. doi: 10.1186/s13059-019-1829-6 – volume: 353 start-page: 31 year: 1991 ident: 2021050609181251700_bib9 article-title: The war of the whorls: genetic interactions controlling flower development. publication-title: Nature doi: 10.1038/353031a0 – start-page: 1862 volume-title: Science year: 2007 ident: 2021050609181251700_bib11 article-title: Genome plasticity a key factor in the success of polyploid wheat under domestication. doi: 10.1126/science.1143986 – volume: 115 start-page: 721 year: 2007 ident: 2021050609181251700_bib6 article-title: A Pseudo-Response Regulator is misexpressed in the photoperiod insensitive Ppd-D1a mutant of wheat (Triticum aestivum L.). publication-title: Theor. Appl. Genet. doi: 10.1007/s00122-007-0603-4 – volume: 6 start-page: 1 year: 2017 ident: 2021050609181251700_bib35 article-title: The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum. publication-title: Gigascience doi: 10.1093/gigascience/gix097 – volume: 31 start-page: 1674 year: 2015 ident: 2021050609181251700_bib19 article-title: MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btv033 – volume: 182 start-page: 145 year: 2020 ident: 2021050609181251700_bib2 article-title: Major impacts of widespread structural variation on gene expression and crop improvement in tomato. publication-title: Cell doi: 10.1016/j.cell.2020.05.021 – volume: 5 start-page: 471 year: 2019 ident: 2021050609181251700_bib30 article-title: Duplication of a domestication locus neutralized a cryptic variant that caused a breeding barrier in tomato. publication-title: Nat. Plants doi: 10.1038/s41477-019-0422-z – volume: 2 start-page: 186 year: 2001 ident: 2021050609181251700_bib23 article-title: Function and evolution of the plant MADS-box gene family. publication-title: Nat. Rev. Genet. doi: 10.1038/35056041 |
SSID | ssj0006958 |
Score | 2.4784904 |
Snippet | Abstract
Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity... Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size... Bread wheat ( is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its... Bread wheat ( Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size... |
SourceID | pubmedcentral proquest pubmed crossref oup |
SourceType | Open Access Repository Aggregation Database Index Database Enrichment Source Publisher |
StartPage | 599 |
SubjectTerms | Agricultural research Annotations Assembly Bread Chromosomes Chromosomes, Plant - genetics Contig Mapping - methods Contig Mapping - standards Domestication Flow cytometry Gene Dosage Genes Genetics Genome, Plant Genomes Genomics Genomics - methods Genomics - standards Goat grass Investigations Reference Standards Triticum Triticum - genetics Wheat |
Title | Chromosome-Scale Assembly of the Bread Wheat Genome Reveals Thousands of Additional Gene Copies |
URI | https://www.ncbi.nlm.nih.gov/pubmed/32796007 https://www.proquest.com/docview/2453198066 https://www.proquest.com/docview/2434476505 https://www.proquest.com/docview/2522618598 https://pubmed.ncbi.nlm.nih.gov/PMC7536849 |
Volume | 216 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1La9wwEBZtQqGX0necpkGFQi91s3pbp5IsSUOhoaQJ7M3YepBA1t7G20L-fWdsrZvtYenFF41BzIw030ijbwh5z4wzAgJTzpXjuWSO5UWwLHcuKiXqEEX_XOzbmT69lF9napYO3LpUVrnaE_uN2rcOz8gPuERvKSBCfl78zLFrFN6uphYaD8k2UpdhSZeZjQnXRFuVdmKNJe4ssQ4pIQ_AOvhIsPvEOOSt_fXaWmRae-12D3T-Wzt5LxidPCVPEoqkh4PZn5EHoXlOHg19Je9ekBIpb-dt185D_gOMECje7c7rmzvaRgqQjx4BVvS034npl9CAID0PvwE0dvTiqsVKHd-h7KH318NpIYoFOm0XkFq_JJcnxxfT0zx1UsidMmwJ2WYha1ZYN7EBEJFXgZtYGx0BPVVWhWi8haXIdYhVZVm01lWygkjFQ1H4KMQrstW0Tdgh1GjPHSQZLAomaykqr4Ux2jI_0ZWWJiN8pcXSJZpx7HZxU2K6AaovV6ovQfXloPqMfBx_WgwsG5vFP4B5_k9yb2XCMi3OrvzrShl5Nw7DssK7kqoJoGWQQSpEgK9qgwxiV8A7tsjI68ErxjkJbixS_2fErPnLKIC03usjzfVVT-8NCaQupN3dPPU35DHH1L-vK9wjW8vbX-Et4KNlvd8vAvgWU7ZPto-Oz76f_wEzjQ-s |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELaqrRBcEG_SFjASiAuhtePY8QGhtrRsabtCZSv1liZ-qJW6yUIW0P4pfiMzedHlsOLScyaSNTOe-WY8D0JeMWVUBI4p5LHhoWCGhYnTLDTGx3GUOx_V7WLHIzk8FZ_P4rMV8rvrhcGyys4m1obalgZz5JtcoLYk4CE_TL-FuDUKX1e7FRqNWhy6-S8I2ar3Bx9Bvq85398b7w7DdqtAaGLFZhB5JSJniTZb2gE6sLHjyudKekASmY6dV1aDWnLpfJZp5rU2mcjAanOXJNZjAhRM_qqIIJQZkNWdvdGXk972Sx23tl9iUT1r5xzFkdgEfcC2xOod4xAp1w96C75wob_uGsz9t1rzmvvbv0futriVbjeKdp-suOIBudVsspw_JCkO2Z2UVTlx4VcQu6P4mjzJr-a09BRAJt0BdGppbfvpJ1cAIT1xPwGmVnR8UWJtkK2QdtvayyY_iWSO7pZTCOYfkdMb4fJjMijKwj0lVEnLDYQ1zEdM5CLKrIyUkprZLZlJoQLCOy6mph1sjvs1rlIMcID1acf6FFifNqwPyNv-p2kz12M5-RsQz_9RbnQiTFtzUKV_lTcgL_vPcJHxdSYrHHAZaHD4IgDmeAkNomVAWDoJyJNGK_ozRVxpXDYQELWgLz0BDhJf_FJcXtQDxSFklYnQa8uP_oLcHo6Pj9Kjg9HhOrnDMfFQVzVukMHs-w_3DNDZLH_eXglKzm_6Fv4BsOFLYg |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELaqIhAXxJtAASOBuBB27fgRHxAqLUtLoULQSnsLiR9qpW6ykAW0f41fx0xedDmsuPSciWSNv5n5xh7PEPKUaasTCEwxl5bHglkWp96w2NogZVL4kDTPxT4eqr1j8X4qpxvkd_8WBssqe5_YOGpXWTwjH3GBaEkhQo5CVxbxaXfyev4txglSeNPaj9NoIXLgl78gfatf7e_CXj_jfPL2aGcv7iYMxFZqtoAsLBUFS40dGw9MwUnPdSi0CsAqciN90M4ARLnyIc8NC8bYXOTgwblPUxfwMBTc_yWdSIY2pqdDsjdWRnZRQGF5Pes6HslEjAAZ-ECxfsk45MzN1d5KVFx5aXeO8P5bt3kuEE6uk2sdg6XbLeRukA1f3iSX25mWy1skw3a7s6quZj7-AgDwFO-VZ8XZklaBAt2kb4CnOtpEAfrOlyBIP_ufQFhrenRSYZWQq1F227nT9qQSxTzdqeaQ1t8mxxei4ztks6xKf49QrRy3kOCwkDBRiCR3KtFaGebGKldCR4T3Wsxs1-IcJ22cZZjqgOqzXvUZqD5rVR-RF8NP87bDx3rx57A9_ye51W9h1jmGOvsL44g8GT6DSeM9TV560DLIYBtGoM5yjQzyZuBaJo3I3RYVw5oSrg2OHYiIXsHLIIAtxVe_lKcnTWtxSF5VKsz99Ut_TK6A7WUf9g8PHpCrHE8gmvLGLbK5-P7DPwSatigeNfZAydeLNsA_fVdOMg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Chromosome-Scale+Assembly+of+the+Bread+Wheat+Genome+Reveals+Thousands+of+Additional+Gene+Copies&rft.jtitle=Genetics+%28Austin%29&rft.au=Alonge%2C+Michael&rft.au=Shumate%2C+Alaina&rft.au=Puiu%2C+Daniela&rft.au=Zimin%2C+Aleksey+V&rft.date=2020-10-01&rft.issn=1943-2631&rft.eissn=1943-2631&rft.volume=216&rft.issue=2&rft.spage=599&rft_id=info:doi/10.1534%2Fgenetics.120.303501&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1943-2631&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1943-2631&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1943-2631&client=summon |