Origins of de novo genes in human and chimpanzee
The birth of new genes is an important motor of evolutionary innovation. Whereas many new genes arise by gene duplication, others originate at genomic regions that do not contain any gene or gene copy. Some of these newly expressed genes may acquire coding or non-coding functions and be preserved by...
Published in | arXiv.org |
---|---|
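Main Authors | Ruiz-Orera, Jorge; Hernandez-Rodriguez, Jessica; Chiva, Cristina; Sabidó, Eduard; Kondova, Ivanela; Bontrop, Ronald; Marqués-Bonet, Tomàs; Albà, M. Mar |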
Format | Paper |
Language | English |
Published | Ithaca: Cornell University Library, arXiv.org, 09.09.2015 |
Subjects | |
Abstract | The birth of new genes is an important motor of evolutionary innovation. Whereas many new genes arise by gene duplication, others originate at genomic regions that do not contain any gene or gene copy. Some of these newly expressed genes may acquire coding or non-coding functions and be preserved by natural selection. However, it is still unclear how prevalent de novo gene emergence is and which mechanisms underlie it. To obtain a comprehensive view of this process, we performed in-depth sequencing of the transcriptomes of four mammalian species (human, chimpanzee, macaque and mouse) and then compared the assembled transcripts and the corresponding syntenic genomic regions. This resulted in the identification of over five thousand new multiexonic transcriptional events in human and/or chimpanzee that are not observed in the other species. By comparative genomics we show that the expression of these transcripts is associated with the gain of regulatory motifs upstream of the transcription start site (TSS) and of U1 snRNP sites downstream of the TSS. We also find that the coding potential of the new genes is higher than expected by chance, consistent with the presence of protein-coding genes in the dataset. Using available human tissue proteomics and ribosome profiling data, we identify several de novo genes with translation evidence. These genes show significant signatures of purifying selection, indicating that they are probably functional. Taken together, the data support a model in which frequently occurring new transcriptional events in the genome provide the raw material for the evolution of new proteins. |
---|---|
Author | Ruiz-Orera, Jorge; Hernandez-Rodriguez, Jessica; Chiva, Cristina; Sabidó, Eduard; Kondova, Ivanela; Bontrop, Ronald; Marqués-Bonet, Tomàs; M Mar Albà |
ContentType | Paper |
Copyright | 2015. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
Discipline | Physics |
EISSN | 2331-8422 |
Genre | Working Paper/Pre-Print |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
PublicationDate | 20150909 |
PublicationDateYYYYMMDD | 2015-09-09 |
PublicationDecade | 2010 |
PublicationPlace | Ithaca |
PublicationTitle | arXiv.org |
PublicationYear | 2015 |
Publisher | Cornell University Library, arXiv.org |
SubjectTerms | Biological evolution; Gene expression; Genes; Genomics; Human tissues; Innovations; Proteins; Proteomics |
Title | Origins of de novo genes in human and chimpanzee |
URI | https://www.proquest.com/docview/2083218859 |