Systematic discovery of new genes in the Saccharomyces cerevisiae genome

We used genome-wide comparative analysis of predicted protein sequences to identify many novel small genes, named smORFs for small open reading frames, within the budding yeast genome. Further analysis of 117 of these new genes showed that 84 are transcribed. We extended our analysis of one smORF co...

Full description

Saved in:
Bibliographic Details
Published inGenome research Vol. 13; no. 2; pp. 264 - 271
Main Authors Kessler, Marco M, Zeng, Qiandong, Hogan, Sarah, Cook, Robin, Morales, Arturo J, Cottarel, Guillaume
Format Journal Article
LanguageEnglish
Published United States Cold Spring Harbor Laboratory Press 01.02.2003
Subjects
Online AccessGet full text

Cover

Loading…
Abstract We used genome-wide comparative analysis of predicted protein sequences to identify many novel small genes, named smORFs for small open reading frames, within the budding yeast genome. Further analysis of 117 of these new genes showed that 84 are transcribed. We extended our analysis of one smORF conserved from yeast to human. This investigation provides an updated and comprehensive annotation of the yeast genome, validates additional concepts in the study of genomes in silico, and increases the expected numbers of coding sequences in a genome with the corresponding impact on future functional genomics and proteomics studies.
AbstractList We used genome-wide comparative analysis of predicted protein sequences to identify many novel small genes, named smORFs for small open reading frames, within the budding yeast genome. Further analysis of 117 of these new genes showed that 84 are transcribed. We extended our analysis of one smORF conserved from yeast to human. This investigation provides an updated and comprehensive annotation of the yeast genome, validates additional concepts in the study of genomes in silico, and increases the expected numbers of coding sequences in a genome with the corresponding impact on future functional genomics and proteomics studies.
We used genome-wide comparative analysis of predicted protein sequences to identify many novel small genes, named smORFs for sm all o pen r eading f rames, within the budding yeast genome. Further analysis of 117 of these new genes showed that 84 are transcribed. We extended our analysis of one smORF conserved from yeast to human. This investigation provides an updated and comprehensive annotation of the yeast genome, validates additional concepts in the study of genomes in silico, and increases the expected numbers of coding sequences in a genome with the corresponding impact on future functional genomics and proteomics studies.
Author Zeng, Qiandong
Morales, Arturo J
Cottarel, Guillaume
Hogan, Sarah
Kessler, Marco M
Cook, Robin
AuthorAffiliation Genome Therapeutics Corporation, Waltham, Massachusetts 02453, USA
AuthorAffiliation_xml – name: Genome Therapeutics Corporation, Waltham, Massachusetts 02453, USA
Author_xml – sequence: 1
  givenname: Marco M
  surname: Kessler
  fullname: Kessler, Marco M
  organization: Genome Therapeutics Corporation, Waltham, Massachusetts 02453, USA
– sequence: 2
  givenname: Qiandong
  surname: Zeng
  fullname: Zeng, Qiandong
– sequence: 3
  givenname: Sarah
  surname: Hogan
  fullname: Hogan, Sarah
– sequence: 4
  givenname: Robin
  surname: Cook
  fullname: Cook, Robin
– sequence: 5
  givenname: Arturo J
  surname: Morales
  fullname: Morales, Arturo J
– sequence: 6
  givenname: Guillaume
  surname: Cottarel
  fullname: Cottarel, Guillaume
BackLink https://www.ncbi.nlm.nih.gov/pubmed/12566404$$D View this record in MEDLINE/PubMed
BookMark eNpVkMtOwzAQRS1URGlhwQ-grJBYpPiZxAsWqAKKVIlFYW25ziQ1auxip0X5e1K14rGa0cy587gjNHDeAUJXBE8IweSuDhPKqMTsBJ0TwWUqeCYHfY6LIpVYkCEaxfiBMWa8KM7QkFCRZRzzczRbdLGFRrfWJKWNxu8gdImvEgdfSQ0OYmJd0q4gWWhjVjr4pjN90UCAnY1Ww57yDVyg00qvI1we4xi9Pz2-TWfp_PX5ZfowTw3P8jbNmaCaVAKXVGBqTFksCwp5RgUHWGICUmownBlR5ZpxVhJpRIkl45JQLTM2RveHuZvtsoHSgGuDXqtNsI0OnfLaqv8dZ1eq9jvFKWaZ6PU3R33wn1uIrWr6t2G91g78NqqcSkkJkT14ewBN8DEGqH52EKz2tqs6qIPtPXv996hf8ugz-wZvKoBS
CitedBy_id crossref_primary_10_1007_s12045_018_0594_2
crossref_primary_10_1016_j_copbio_2006_01_007
crossref_primary_10_1261_rna_2012310
crossref_primary_10_1534_g3_113_007922
crossref_primary_10_1002_cfg_229
crossref_primary_10_1271_bbb_60251
crossref_primary_10_1016_j_ydbio_2004_04_023
crossref_primary_10_1002_cpbi_68
crossref_primary_10_1016_j_bbrc_2010_05_093
crossref_primary_10_1101_gad_1362105
crossref_primary_10_1002_pmic_201700218
crossref_primary_10_1093_glycob_cwz011
crossref_primary_10_1186_gb_2003_5_1_r2
crossref_primary_10_1534_genetics_107_074468
crossref_primary_10_1016_j_euprot_2014_02_006
crossref_primary_10_1016_j_molp_2018_08_005
crossref_primary_10_1111_febs_15845
crossref_primary_10_1002_pmic_201700274
crossref_primary_10_1002_yea_946
crossref_primary_10_1038_nrm_2017_58
crossref_primary_10_1186_1471_2164_14_648
crossref_primary_10_1016_j_gde_2005_09_005
crossref_primary_10_1146_annurev_cellbio_100616_060516
crossref_primary_10_1186_gb_2011_12_11_r118
crossref_primary_10_1002_yea_1400
crossref_primary_10_3724_SP_J_1206_2011_00290
crossref_primary_10_1016_j_tibs_2016_05_003
crossref_primary_10_1371_journal_pgen_1000299
crossref_primary_10_1101_gr_4976706
crossref_primary_10_1128_mBio_02819_18
crossref_primary_10_1371_journal_pone_0120678
crossref_primary_10_1101_gr_1481104
crossref_primary_10_1101_sqb_2003_68_275
crossref_primary_10_1590_S1415_475738320150009
crossref_primary_10_1186_gb_2003_4_7_r45
crossref_primary_10_1101_gr_1515604
crossref_primary_10_1007_s11274_011_0756_9
crossref_primary_10_1128_EC_3_5_1088_1100_2004
crossref_primary_10_1146_annurev_micro_59_030804_121400
crossref_primary_10_1016_j_yexcr_2020_112057
crossref_primary_10_1111_j_1365_2958_2004_04182_x
crossref_primary_10_1093_hmg_ddi335
crossref_primary_10_1101_gr_4355406
Cites_doi 10.1093/genetics/123.4.725
10.1038/46558
10.1101/gr.182901
10.1038/85913
10.1073/pnas.061628798
10.1093/emboj/21.7.1800
10.1126/science.274.5287.546
10.1006/fgbi.1997.0982
10.1016/S0092-8674(00)81845-0
10.1016/S0959-437X(97)80039-1
10.1006/jmbi.2000.3519
10.1128/MCB.16.10.5744
10.1038/387s087
10.1093/nar/25.22.4619
10.1038/nbt0102-58
10.1093/nar/30.8.1851
10.1038/387s007
10.1002/(SICI)1097-0061(199711)13:14<1363::AID-YEA182>3.0.CO;2-8
10.1101/gr.7.12.1174
10.1128/MCB.19.10.7041
10.1016/S0168-9525(01)02498-2
10.1002/yea.865
10.1016/S0014-5793(00)02275-4
10.1038/85922
ContentType Journal Article
Copyright Copyright © 2003, Cold Spring Harbor Laboratory Press 2003
Copyright_xml – notice: Copyright © 2003, Cold Spring Harbor Laboratory Press 2003
DBID CGR
CUY
CVF
ECM
EIF
NPM
AAYXX
CITATION
7X8
5PM
DOI 10.1101/gr.232903
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
CrossRef
MEDLINE - Academic
PubMed Central (Full Participant titles)
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
CrossRef
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic

CrossRef
MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Anatomy & Physiology
Chemistry
Biology
EISSN 1549-5469
EndPage 271
ExternalDocumentID 10_1101_gr_232903
12566404
Genre Validation Study
Journal Article
Comparative Study
GroupedDBID ---
.GJ
18M
29H
2WC
39C
4.4
53G
5GY
5RE
5VS
AAYOK
AAZTW
ABDIX
ABDNZ
ACGFO
ACYGS
ADBBV
ADNWM
AEILP
AENEX
AI.
ALMA_UNASSIGNED_HOLDINGS
BAWUL
BTFSW
C1A
CGR
CS3
CUY
CVF
DIK
DU5
E3Z
EBS
ECM
EIF
EJD
F5P
FRP
GX1
H13
HYE
IH2
K-O
KQ8
MV1
NPM
R.V
RCX
RHF
RHI
RNS
RPM
RXW
SJN
TAE
TR2
VH1
W8F
WOQ
YKV
ZCG
ZGI
ZXP
AAYXX
ABRJW
CITATION
7X8
5PM
ID FETCH-LOGICAL-c467t-7352a1f50d2502ccd8b82e76254eeb01e99aec43c5f7a343d19c5d0934912a963
IEDL.DBID RPM
ISSN 1088-9051
IngestDate Tue Sep 17 21:01:27 EDT 2024
Sat Oct 26 06:03:01 EDT 2024
Thu Sep 12 16:29:28 EDT 2024
Thu Oct 24 10:01:32 EDT 2024
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c467t-7352a1f50d2502ccd8b82e76254eeb01e99aec43c5f7a343d19c5d0934912a963
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ObjectType-Undefined-3
E-MAIL arturo.morales@genomecorp.com; FAX (781) 398-2476.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.232903.
Corresponding author.
OpenAccessLink https://genome.cshlp.org/content/13/2/264.full.pdf
PMID 12566404
PQID 72992119
PQPubID 23479
PageCount 8
ParticipantIDs pubmedcentral_primary_oai_pubmedcentral_nih_gov_420365
proquest_miscellaneous_72992119
crossref_primary_10_1101_gr_232903
pubmed_primary_12566404
PublicationCentury 2000
PublicationDate 2003-Feb
2003-02-01
20030201
PublicationDateYYYYMMDD 2003-02-01
PublicationDate_xml – month: 02
  year: 2003
  text: 2003-Feb
PublicationDecade 2000
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Genome research
PublicationTitleAlternate Genome Res
PublicationYear 2003
Publisher Cold Spring Harbor Laboratory Press
Publisher_xml – name: Cold Spring Harbor Laboratory Press
References 2021111810475103000_13.2.264.23
2021111810475103000_13.2.264.1
2021111810475103000_13.2.264.21
2021111810475103000_13.2.264.20
2021111810475103000_13.2.264.26
2021111810475103000_13.2.264.25
2021111810475103000_13.2.264.24
Basrai (2021111810475103000_13.2.264.2) 1999; 19
Mewes (2021111810475103000_13.2.264.17) 1997; 387
2021111810475103000_13.2.264.12
Brown (2021111810475103000_13.2.264.4) 1996; 16
2021111810475103000_13.2.264.11
2021111810475103000_13.2.264.10
2021111810475103000_13.2.264.16
2021111810475103000_13.2.264.15
Thomas (2021111810475103000_13.2.264.22) 1989; 123
2021111810475103000_13.2.264.14
2021111810475103000_13.2.264.19
Nicholas (2021111810475103000_13.2.264.18) 1997; 4
Johnston (2021111810475103000_13.2.264.13) 1997; 387
2021111810475103000_13.2.264.7
2021111810475103000_13.2.264.6
2021111810475103000_13.2.264.9
2021111810475103000_13.2.264.8
2021111810475103000_13.2.264.3
2021111810475103000_13.2.264.5
References_xml – volume: 123
  start-page: 725
  year: 1989
  ident: 2021111810475103000_13.2.264.22
  article-title: The genetic control of direct-repeat recombination in Saccharomyces: The effect of rad52 and rad1 on mitotic recombination at GAL10, a transcriptionally regulated gene.
  publication-title: Genetics
  doi: 10.1093/genetics/123.4.725
  contributor:
    fullname: Thomas
– ident: 2021111810475103000_13.2.264.21
  doi: 10.1038/46558
– ident: 2021111810475103000_13.2.264.5
  doi: 10.1101/gr.182901
– ident: 2021111810475103000_13.2.264.20
  doi: 10.1038/85913
– ident: 2021111810475103000_13.2.264.10
– ident: 2021111810475103000_13.2.264.23
  doi: 10.1073/pnas.061628798
– ident: 2021111810475103000_13.2.264.11
  doi: 10.1093/emboj/21.7.1800
– ident: 2021111810475103000_13.2.264.7
  doi: 10.1126/science.274.5287.546
– ident: 2021111810475103000_13.2.264.15
  doi: 10.1006/fgbi.1997.0982
– ident: 2021111810475103000_13.2.264.24
  doi: 10.1016/S0092-8674(00)81845-0
– ident: 2021111810475103000_13.2.264.25
  doi: 10.1016/S0959-437X(97)80039-1
– ident: 2021111810475103000_13.2.264.12
  doi: 10.1006/jmbi.2000.3519
– volume: 16
  start-page: 5744
  year: 1996
  ident: 2021111810475103000_13.2.264.4
  article-title: PAN3 encodes a subunit of the Pab1p-dependent poly(A) nuclease in Saccharomyces cerevisiae.
  publication-title: Mol. Cell. Biol
  doi: 10.1128/MCB.16.10.5744
  contributor:
    fullname: Brown
– volume: 387
  start-page: 87
  year: 1997
  ident: 2021111810475103000_13.2.264.13
  article-title: The nucleotide sequence of Saccharomyces cerevisiae chromosome XII.
  publication-title: Nature
  doi: 10.1038/387s087
  contributor:
    fullname: Johnston
– ident: 2021111810475103000_13.2.264.19
  doi: 10.1093/nar/25.22.4619
– ident: 2021111810475103000_13.2.264.14
  doi: 10.1038/nbt0102-58
– ident: 2021111810475103000_13.2.264.9
  doi: 10.1093/nar/30.8.1851
– volume: 387
  start-page: 7
  year: 1997
  ident: 2021111810475103000_13.2.264.17
  article-title: Overview of the yeast genome.
  publication-title: Nature
  doi: 10.1038/387s007
  contributor:
    fullname: Mewes
– ident: 2021111810475103000_13.2.264.1
  doi: 10.1002/(SICI)1097-0061(199711)13:14<1363::AID-YEA182>3.0.CO;2-8
– ident: 2021111810475103000_13.2.264.6
  doi: 10.1101/gr.7.12.1174
– volume: 4
  start-page: 14
  year: 1997
  ident: 2021111810475103000_13.2.264.18
  article-title: GeneDoc: Analysis and visualization of genetic variation.
  publication-title: EMBnet News
  contributor:
    fullname: Nicholas
– volume: 19
  start-page: 7041
  year: 1999
  ident: 2021111810475103000_13.2.264.2
  article-title: NORF5/HUG1 is a component of the MEC1-mediated checkpoint response to DNA damage and replication arrest in Saccharomyces cerevisiae.
  publication-title: Mol. Cell. Biol.
  doi: 10.1128/MCB.19.10.7041
  contributor:
    fullname: Basrai
– ident: 2021111810475103000_13.2.264.26
  doi: 10.1016/S0168-9525(01)02498-2
– ident: 2021111810475103000_13.2.264.16
  doi: 10.1002/yea.865
– ident: 2021111810475103000_13.2.264.3
  doi: 10.1016/S0014-5793(00)02275-4
– ident: 2021111810475103000_13.2.264.8
  doi: 10.1038/85922
SSID ssj0003488
Score 2.0035393
Snippet We used genome-wide comparative analysis of predicted protein sequences to identify many novel small genes, named smORFs for small open reading frames, within...
We used genome-wide comparative analysis of predicted protein sequences to identify many novel small genes, named smORFs for sm all o pen r eading f rames,...
SourceID pubmedcentral
proquest
crossref
pubmed
SourceType Open Access Repository
Aggregation Database
Index Database
StartPage 264
SubjectTerms Amino Acid Sequence - genetics
Animals
Caenorhabditis elegans Proteins - genetics
Cattle
Chromosome Mapping - methods
Chromosomes, Fungal - genetics
Conserved Sequence - genetics
Drosophila Proteins - genetics
Genes, Fungal - genetics
Genome, Fungal
Humans
Letters
Mice
Molecular Sequence Data
Open Reading Frames - genetics
Proteins - genetics
Saccharomyces cerevisiae - genetics
Sequence Homology, Amino Acid
Title Systematic discovery of new genes in the Saccharomyces cerevisiae genome
URI https://www.ncbi.nlm.nih.gov/pubmed/12566404
https://search.proquest.com/docview/72992119
https://pubmed.ncbi.nlm.nih.gov/PMC420365
Volume 13
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3db9MwED_RSgheELTAykexpom3tIntfPixFKqyqRPSVqlvke04WyTqTk33sP-es5MUBm-8RcrFis7nu5_tu98BnFGMYQlPkoBpVQYcZYJMJEUQsVRzRQ0tlKtGXl0myzU_38SbtiisbtMqrVbVxP7cTmx163Mr77Z62uWJTX-s5tzdnsXTHvRSxrodeut9Gc-a8je0AMc91bIJoeVNb_YTxA8i9L1zMNQnvG3PdgxH_2DMv1Ml_4g9i5fwogWNZNb83Ct4YuwAhjOLG-btA_lMfBqnPx8fwNMv3dOzedfMbQjLqyNjM_la1dolbj6QXUnQyRFHPV2TyhIEg-RKaleJhQOjByHapwHXlTROarc1r2G9-HY9XwZtE4VAow88BCkiLBmVcVgg2KFaF5nKqEEXGHNjVBgZIaTRnOm4TCXjrIiEjotQMC4iKnF5voG-3VlzAkSVAsGUOygqFFfKSIlL2NCwNEpGRVaO4LTTZX7XcGXkfo8RRvnNPm90P4JPnZZz1IG7npDW7O7rHGG-cHxzI3jb6Pz3IO1kjSB5NBtHAceR_fgNmo7nym5M5d3_fvgeniNC8lytlH6A_mF_bz4iCjmoMeLv7xdjb3y_AAe03nU
link.rule.ids 230,315,733,786,790,891,27946,27947,53816,53818
linkProvider National Library of Medicine
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bT9swFD7ioom9jI3bujFqTYi3pLk4afwI3VBhFE2CSrxFtuOUiNVFTfsAv55jJykrPMFbJDtWknNyLvZ3vgNwGKAPi2kcO6EUuUNxjpOwOHP8sCupCFSQCVONPLiM-0N6fhPd1EVhZQ2r1FIUrv43dnVxa7GV92PZaXBinb-DHjWnZ1FnFdYjc-7V5Oi1_Q1pUhXAoQ4Y9qmaTwh1rzOauhhBMM92z0FnH9O6QdvCIb2KMl-CJf_zPqebMGyeuwKd3LnzmXDl4wtKx7e-2Gf4VIej5Lga_QIrSm_B9rHGVHz8QI6IBYjanfct-HDSXG30mjZx29C_WnBBk19FKQ0k9IFMcoLmkxhS65IUmmCYSa64NDVeuDDaJiItwLgsuDKzJmO1A8PT39e9vlO3Z3AkWteZ08XYjft55GUYRgVSZolIAoXGNaJKCc9XjHElaSijvMtDGmY-k1HmsZAyP-D44-_Cmp5o9RWIyBmGaWYLKhNUCMU5GgcVeLkS3M-SvAU_Gxml9xULR2qzF89PR9O0kmkL2o30UvwG5uCDazWZlykmEMww2bVgr5Ll8yK1ErQgXpLyYoJh314eQdlZFu5KVt_ee2MbNvrXg4v04uzyz3f4aDGCFgy-D2uz6Vz9wFhnJg6saj8B6kT_MA
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LT9tAEB4BVYELbXm0KW1ZIcTN77XjPYbQKC0PIQESnKx9GSyIE-VxCL--s2s7FLhxs-TxyvaM5rH7zTcAByHGsIQmiRNJkTsUZZyUJcoJorakItShEqYb-ew86V_TvzfxzRKkTS-MBe1LUbjl48Ati3uLrRwNpNfgxLyLsy41p2exN1K5twwfYhN0mjq99sERTasmOLQDw0BVcwqh_Xl3YxezCObbCToY8BNaD2lbBKU3meZrwOR_Eaj3CW6bd6-AJw_ubCpc-fSK1vE9H_cZNuq0lHQqiS-wpMtN2OqUWJIP5uSQWKCo3YHfhI9HzdVatxkXtwX9ywUnNDkuJtJAQ-dkmBN0o8SQW09IURJMN8kll6bXCxdGH0WkBRpPCq6N1HCgt-G69_uq23fqMQ2ORC87ddqYw_Egj32F6VQopUpFGmp0sjHVWviBZoxrSSMZ520e0UgFTMbKZxFlQcjRAezASjks9TcgImeYrpmtKCWoEJpzdBI69HMteKDSvAX7jZ6yUcXGkdkqxg-yu3FW6bUFe40GM_wH5gCEl3o4m2RYSDDDaNeCr5U-nxepDaEFyQtNLwQMC_fLO6g_y8Zd6ev7ex_cg9WL4152-uf8ZBfWLVTQYsJ_wMp0PNM_MeWZil_Wuv8B7nsBvw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Systematic+discovery+of+new+genes+in+the+Saccharomyces+cerevisiae+genome&rft.jtitle=Genome+research&rft.au=Kessler%2C+Marco+M&rft.au=Zeng%2C+Qiandong&rft.au=Hogan%2C+Sarah&rft.au=Cook%2C+Robin&rft.date=2003-02-01&rft.issn=1088-9051&rft.volume=13&rft.issue=2&rft.spage=264&rft.epage=271&rft_id=info:doi/10.1101%2Fgr.232903&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1088-9051&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1088-9051&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1088-9051&client=summon