Semantic Assembly and Annotation of Draft RNAseq Transcripts without a Reference Genome

Transcriptomes are one of the first sources of high-throughput genomic data that have benefitted from the introduction of Next-Gen Sequencing. As sequencing technology becomes more accessible, transcriptome sequencing is applicable to multiple organisms for which genome sequences are unavailable. Cu...

Full description

Saved in:
Bibliographic Details
Published inPloS one Vol. 10; no. 9; p. e0138006
Main Authors Ptitsyn, Andrey, Temanni, Ramzi, Bouchard, Christelle, Anderson, Peter A V
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 22.09.2015
Public Library of Science (PLoS)
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Transcriptomes are one of the first sources of high-throughput genomic data that have benefitted from the introduction of Next-Gen Sequencing. As sequencing technology becomes more accessible, transcriptome sequencing is applicable to multiple organisms for which genome sequences are unavailable. Currently all methods for de novo assembly are based on the concept of matching the nucleotide context overlapping between short fragments-reads. However, even short reads may still contain biologically relevant information which can be used as hints in guiding the assembly process. We propose a computational workflow for the reconstruction and functional annotation of expressed gene transcripts that does not require a reference genome sequence and can be tolerant to low coverage, high error rates and other issues that often lead to poor results of de novo assembly in studies of non-model organisms. We start with either raw sequences or the output of a context-based de novo transcriptome assembly. Instead of mapping reads to a reference genome or creating a completely unsupervised clustering of reads, we assemble the unknown transcriptome using nearest homologs from a public database as seeds. We consider even distant relations, indirectly linking protein-coding fragments to entire gene families in multiple distantly related genomes. The intended application of the proposed method is an additional step of semantic (based on relations between protein-coding fragments) scaffolding following traditional (i.e. based on sequence overlap) de novo assembly. The method we developed was effective in analysis of the jellyfish Cyanea capillata transcriptome and may be applicable in other studies of gene expression in species lacking a high quality reference genome sequence. Our algorithms are implemented in C and designed for parallel computation using a high-performance computer. The software is available free of charge via an open source license.
AbstractList Transcriptomes are one of the first sources of high-throughput genomic data that have benefitted from the introduction of Next-Gen Sequencing. As sequencing technology becomes more accessible, transcriptome sequencing is applicable to multiple organisms for which genome sequences are unavailable. Currently all methods for de novo assembly are based on the concept of matching the nucleotide context overlapping between short fragments-reads. However, even short reads may still contain biologically relevant information which can be used as hints in guiding the assembly process. We propose a computational workflow for the reconstruction and functional annotation of expressed gene transcripts that does not require a reference genome sequence and can be tolerant to low coverage, high error rates and other issues that often lead to poor results of de novo assembly in studies of non-model organisms. We start with either raw sequences or the output of a context-based de novo transcriptome assembly. Instead of mapping reads to a reference genome or creating a completely unsupervised clustering of reads, we assemble the unknown transcriptome using nearest homologs from a public database as seeds. We consider even distant relations, indirectly linking protein-coding fragments to entire gene families in multiple distantly related genomes. The intended application of the proposed method is an additional step of semantic (based on relations between protein-coding fragments) scaffolding following traditional (i.e. based on sequence overlap) de novo assembly. The method we developed was effective in analysis of the jellyfish Cyanea capillata transcriptome and may be applicable in other studies of gene expression in species lacking a high quality reference genome sequence. Our algorithms are implemented in C and designed for parallel computation using a high-performance computer. The software is available free of charge via an open source license.
Transcriptomes are one of the first sources of high-throughput genomic data that have benefitted from the introduction of Next-Gen Sequencing. As sequencing technology becomes more accessible, transcriptome sequencing is applicable to multiple organisms for which genome sequences are unavailable. Currently all methods for de novo assembly are based on the concept of matching the nucleotide context overlapping between short fragments-reads. However, even short reads may still contain biologically relevant information which can be used as hints in guiding the assembly process. We propose a computational workflow for the reconstruction and functional annotation of expressed gene transcripts that does not require a reference genome sequence and can be tolerant to low coverage, high error rates and other issues that often lead to poor results of de novo assembly in studies of non-model organisms. We start with either raw sequences or the output of a context-based de novo transcriptome assembly. Instead of mapping reads to a reference genome or creating a completely unsupervised clustering of reads, we assemble the unknown transcriptome using nearest homologs from a public database as seeds. We consider even distant relations, indirectly linking protein-coding fragments to entire gene families in multiple distantly related genomes. The intended application of the proposed method is an additional step of semantic (based on relations between protein-coding fragments) scaffolding following traditional (i.e. based on sequence overlap) de novo assembly. The method we developed was effective in analysis of the jellyfish Cyanea capillata transcriptome and may be applicable in other studies of gene expression in species lacking a high quality reference genome sequence. Our algorithms are implemented in C and designed for parallel computation using a high-performance computer. The software is available free of charge via an open source license.
Audience Academic
Author Bouchard, Christelle
Anderson, Peter A V
Temanni, Ramzi
Ptitsyn, Andrey
AuthorAffiliation 2 University of South Florida Sarasota-Manatee, 8350 N. Tamiami Trail, Sarasota, FL, 34243, United States of America
University of North Carolina at Charlotte, UNITED STATES
1 Sidra Medical and Research Center, P.O. Box, 26999, Doha, Qatar
4 Department of Physiology and Functional Genomics, University of Florida; 9505 Ocean Shore Blvd, Saint Augustine, FL, 32080, United States of America
3 Whitney Laboratory for Marine Biosciences, University of Florida; 9505 Ocean Shore Blvd, Saint Augustine, FL, 32080, United States of America
AuthorAffiliation_xml – name: 3 Whitney Laboratory for Marine Biosciences, University of Florida; 9505 Ocean Shore Blvd, Saint Augustine, FL, 32080, United States of America
– name: 1 Sidra Medical and Research Center, P.O. Box, 26999, Doha, Qatar
– name: University of North Carolina at Charlotte, UNITED STATES
– name: 2 University of South Florida Sarasota-Manatee, 8350 N. Tamiami Trail, Sarasota, FL, 34243, United States of America
– name: 4 Department of Physiology and Functional Genomics, University of Florida; 9505 Ocean Shore Blvd, Saint Augustine, FL, 32080, United States of America
Author_xml – sequence: 1
  givenname: Andrey
  surname: Ptitsyn
  fullname: Ptitsyn, Andrey
  organization: Sidra Medical and Research Center, P.O. Box, 26999, Doha, Qatar
– sequence: 2
  givenname: Ramzi
  surname: Temanni
  fullname: Temanni, Ramzi
  organization: Sidra Medical and Research Center, P.O. Box, 26999, Doha, Qatar
– sequence: 3
  givenname: Christelle
  surname: Bouchard
  fullname: Bouchard, Christelle
  organization: University of South Florida Sarasota-Manatee, 8350 N. Tamiami Trail, Sarasota, FL, 34243, United States of America
– sequence: 4
  givenname: Peter A V
  surname: Anderson
  fullname: Anderson, Peter A V
  organization: Whitney Laboratory for Marine Biosciences, University of Florida; 9505 Ocean Shore Blvd, Saint Augustine, FL, 32080, United States of America; Department of Physiology and Functional Genomics, University of Florida; 9505 Ocean Shore Blvd, Saint Augustine, FL, 32080, United States of America
BackLink https://www.ncbi.nlm.nih.gov/pubmed/26393794$$D View this record in MEDLINE/PubMed
BookMark eNqNkt9v0zAQxyM0xH7Af4AgEhKChxY7TuL4BakaMCpNTOoGPFrX-NxmSuzOdoD997hrNjVoD8gPts6f-57v_D1ODow1mCQvKZlSxumHa9s7A-10E8NTQllFSPkkOaKCZZMyI-xg73yYHHt_TUjBqrJ8lhxmJROMi_wo-XmJHZjQ1OnMe-yW7W0KRqUzY2yA0FiTWp1-cqBDuvg283iTXjkwvnbNJvj0dxPWtg8ppAvU6NDUmJ6hsR0-T55qaD2-GPaT5PuXz1enXyfnF2fz09n5pC5FFiYV5XlJaq6Z0lyXpGSwzIqKiIrQghc1FAqXjGeVAqE1FSWo2EGxRAEFglLsJHm909201sthJl5STkUmspzTSMx3hLJwLTeu6cDdSguNvAtYt5Lg4gBalECpyEnFBQGVVzpfZpoTLUBUitNK6aj1cajWLztUNZrgoB2Jjm9Ms5Yr-0vmBa8qkUeBd4OAszc9-iC7xtfYtmDQ9nfv5jnPCsEi-uYf9PHuBmoFsYHGaBvr1ltROYv30QGFIJGaPkLFpbBr6mgg3cT4KOH9KCEyAf-EFfTey_nl4v_Zix9j9u0eu0Zow9rbtt86zY_BfAfWznrvUD8MmRK59f_9NOTW_3Lwf0x7tf9BD0n3hmd_ASdUARM
CitedBy_id crossref_primary_10_1093_bioinformatics_bty378
crossref_primary_10_1371_journal_pone_0218806
crossref_primary_10_3390_metabo13030388
Cites_doi 10.1007/s11427-011-4255-x
10.1155/2010/853916
10.2307/1541225
10.1093/bib/5.1.39
10.1093/database/bar009
10.1242/jeb.133.1.231
10.1186/gb-2009-10-3-r25
10.1038/nprot.2008.211
10.1093/nar/gks804
10.1101/gr.103846.109
10.1101/gr.074492.107
10.1007/s00018-009-0180-6
10.1126/science.1080339
10.1093/bioinformatics/btp336
10.1093/nar/gkj123
10.1016/S0022-2836(05)80360-2
10.1152/jn.1985.53.3.821
10.1093/bioinformatics/bts094
10.1152/jn.1983.50.3.671
10.1038/nbt.1883
10.1038/nmeth.2251
ContentType Journal Article
Copyright COPYRIGHT 2015 Public Library of Science
2015 Ptitsyn et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
2015 Ptitsyn et al 2015 Ptitsyn et al
Copyright_xml – notice: COPYRIGHT 2015 Public Library of Science
– notice: 2015 Ptitsyn et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: 2015 Ptitsyn et al 2015 Ptitsyn et al
DBID CGR
CUY
CVF
ECM
EIF
NPM
AAYXX
CITATION
IOV
ISR
3V.
7QG
7QL
7QO
7RV
7SN
7SS
7T5
7TG
7TM
7U9
7X2
7X7
7XB
88E
8AO
8C1
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABJCF
ABUWG
AFKRA
ARAPS
ATCPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
C1K
CCPQU
D1I
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
H94
HCIFZ
K9.
KB.
KB0
KL.
L6V
LK8
M0K
M0S
M1P
M7N
M7P
M7S
NAPCQ
P5Z
P62
P64
PATMY
PDBOC
PIMPY
PQEST
PQQKQ
PQUKI
PRINS
PTHSS
PYCSY
RC3
7X8
5PM
DOA
DOI 10.1371/journal.pone.0138006
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
CrossRef
Opposing Viewpoints in Context (Gale)
Science in Context
ProQuest Central (Corporate)
Animal Behavior Abstracts
Bacteriology Abstracts (Microbiology B)
Biotechnology Research Abstracts
ProQuest Nursing and Allied Health Journals
Ecology Abstracts
Entomology Abstracts (Full archive)
Immunology Abstracts
Meteorological & Geoastrophysical Abstracts
Nucleic Acids Abstracts
Virology and AIDS Abstracts
Agricultural Science Collection
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
ProQuest Pharma Collection
Public Health Database (Proquest)
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
Materials Science & Engineering Collection
ProQuest Central (Alumni)
ProQuest Central
Advanced Technologies & Aerospace Collection
Agricultural & Environmental Science Collection
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Technology Collection
Natural Science Collection
Environmental Sciences and Pollution Management
ProQuest One Community College
ProQuest Materials Science Collection
ProQuest Central Korea
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
AIDS and Cancer Research Abstracts
SciTech Premium Collection
ProQuest Health & Medical Complete (Alumni)
Materials Science Database
Nursing & Allied Health Database (Alumni Edition)
Meteorological & Geoastrophysical Abstracts - Academic
ProQuest Engineering Collection
Biological Sciences
Agriculture Science Database
Health & Medical Collection (Alumni Edition)
Medical Database
Algology Mycology and Protozoology Abstracts (Microbiology C)
Biological Science Database
Engineering Database
Nursing & Allied Health Premium
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
Environmental Science Database
Materials Science Collection
Publicly Available Content Database (Proquest) (PQ_SDU_P3)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
Engineering Collection
Environmental Science Collection
Genetics Abstracts
MEDLINE - Academic
PubMed Central (Full Participant titles)
Directory of Open Access Journals
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
CrossRef
Agricultural Science Database
Publicly Available Content Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
Nucleic Acids Abstracts
SciTech Premium Collection
ProQuest Central China
Environmental Sciences and Pollution Management
Health Research Premium Collection
Meteorological & Geoastrophysical Abstracts
Natural Science Collection
Biological Science Collection
ProQuest Medical Library (Alumni)
Engineering Collection
Advanced Technologies & Aerospace Collection
Engineering Database
Virology and AIDS Abstracts
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
Agricultural Science Collection
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
Ecology Abstracts
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
Environmental Science Collection
Entomology Abstracts
Nursing & Allied Health Premium
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
Environmental Science Database
ProQuest Nursing & Allied Health Source (Alumni)
Engineering Research Database
ProQuest One Academic
Meteorological & Geoastrophysical Abstracts - Academic
Technology Collection
Technology Research Database
Materials Science Collection
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central
Genetics Abstracts
ProQuest Engineering Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Bacteriology Abstracts (Microbiology B)
Algology Mycology and Protozoology Abstracts (Microbiology C)
Agricultural & Environmental Science Collection
AIDS and Cancer Research Abstracts
Materials Science Database
ProQuest Materials Science Collection
ProQuest Public Health
ProQuest Nursing & Allied Health Source
ProQuest SciTech Collection
Advanced Technologies & Aerospace Database
ProQuest Medical Library
Animal Behavior Abstracts
Materials Science & Engineering Collection
Immunology Abstracts
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList


Agricultural Science Database
MEDLINE - Academic
MEDLINE


Database_xml – sequence: 1
  dbid: DOA
  name: Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 4
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Sciences (General)
Biology
DocumentTitleAlternate Semantic Assembly of Transcriptomes
EISSN 1932-6203
Editor Gibas, Cynthia
Editor_xml – sequence: 1
  givenname: Cynthia
  surname: Gibas
  fullname: Gibas, Cynthia
EndPage e0138006
ExternalDocumentID 1719292471
oai_doaj_org_article_a119408790ad48f4b2f70f9a98d718df
3828218321
A471193590
10_1371_journal_pone_0138006
26393794
Genre Research Support, U.S. Gov't, Non-P.H.S
Journal Article
GeographicLocations United States--US
Sarasota Florida
Qatar
GeographicLocations_xml – name: Qatar
– name: Sarasota Florida
– name: United States--US
GroupedDBID ---
123
29O
2WC
3V.
53G
5VS
7RV
7X2
7X7
7XC
88E
8AO
8C1
8CJ
8FE
8FG
8FH
8FI
8FJ
A8Z
AAFWJ
ABDBF
ABIVO
ABJCF
ABUWG
ACGFO
ACIHN
ACIWK
ACPRK
ADBBV
ADRAZ
AEAQA
AENEX
AFKRA
AFRAH
AHMBA
ALIPV
ALMA_UNASSIGNED_HOLDINGS
AOIJS
APEBS
ARAPS
ATCPS
BAWUL
BBNVY
BBORY
BCNDV
BENPR
BGLVJ
BHPHI
BKEYQ
BPHCQ
BVXVI
BWKFM
CCPQU
CGR
CS3
CUY
CVF
D1I
D1J
D1K
DIK
DU5
E3Z
EAP
EAS
EBD
ECM
EIF
EMOBN
ESTFP
ESX
EX3
F5P
FPL
FYUFA
GROUPED_DOAJ
GX1
HCIFZ
HH5
HMCUK
HYE
IAO
IEA
IHR
IHW
INH
INR
IOV
IPNFZ
IPY
ISE
ISR
ITC
K6-
KB.
KQ8
L6V
LK5
LK8
M0K
M1P
M48
M7P
M7R
M7S
M~E
NAPCQ
NPM
O5R
O5S
OK1
P2P
P62
PATMY
PDBOC
PIMPY
PQQKQ
PROAC
PSQYO
PTHSS
PV9
PYCSY
RIG
RNS
RPM
RZL
SV3
TR2
UKHRP
WOQ
WOW
~02
~KM
AAYXX
CITATION
AFPKN
7QG
7QL
7QO
7SN
7SS
7T5
7TG
7TM
7U9
7XB
8FD
8FK
AZQEC
C1K
DWQXO
FR3
GNUQQ
H94
K9.
KL.
M7N
P64
PQEST
PQUKI
PRINS
RC3
7X8
5PM
ACUHS
-
02
AAPBV
ABPTK
ADACO
BBAFP
KM
ID FETCH-LOGICAL-c692t-817460c7f3df7f6063ab25809801575ca5deb3728da9ff196ad8665be9a5eadd3
IEDL.DBID RPM
ISSN 1932-6203
IngestDate Thu Nov 25 14:37:33 EST 2021
Tue Dec 17 15:19:00 EST 2024
Tue Sep 17 21:09:47 EDT 2024
Mon Oct 21 04:29:32 EDT 2024
Thu Oct 10 15:30:53 EDT 2024
Tue Nov 19 20:47:27 EST 2024
Tue Nov 12 23:01:10 EST 2024
Thu Aug 01 20:26:45 EDT 2024
Thu Aug 01 19:27:15 EDT 2024
Tue Aug 20 21:58:52 EDT 2024
Thu Nov 21 23:52:23 EST 2024
Tue Oct 15 23:52:38 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 9
Language English
License This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
Creative Commons Attribution License
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c692t-817460c7f3df7f6063ab25809801575ca5deb3728da9ff196ad8665be9a5eadd3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Competing Interests: The authors have declared that no competing interests exist.
Conceived and designed the experiments: PAA CB. Performed the experiments: CB. Analyzed the data: AP RT. Contributed reagents/materials/analysis tools: AP. Wrote the paper: AP PAA. Designed the algorithm and implemented the software: AP. Performed additional case studies described in Supplemental Materials: RT.
OpenAccessLink https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4578894/
PMID 26393794
PQID 1719292471
PQPubID 1436336
ParticipantIDs plos_journals_1719292471
doaj_primary_oai_doaj_org_article_a119408790ad48f4b2f70f9a98d718df
pubmedcentral_primary_oai_pubmedcentral_nih_gov_4578894
proquest_miscellaneous_1717472593
proquest_journals_1719292471
gale_infotracmisc_A471193590
gale_infotracacademiconefile_A471193590
gale_incontextgauss_ISR_A471193590
gale_incontextgauss_IOV_A471193590
gale_healthsolutions_A471193590
crossref_primary_10_1371_journal_pone_0138006
pubmed_primary_26393794
PublicationCentury 2000
PublicationDate 2015-09-22
PublicationDateYYYYMMDD 2015-09-22
PublicationDate_xml – month: 09
  year: 2015
  text: 2015-09-22
  day: 22
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
– name: San Francisco
– name: San Francisco, CA USA
PublicationTitle PloS one
PublicationTitleAlternate PLoS One
PublicationYear 2015
Publisher Public Library of Science
Public Library of Science (PLoS)
Publisher_xml – sequence: 0
  name: Public Library of Science (PLoS)
– name: Public Library of Science
References 22965124 - Nucleic Acids Res. 2012 Nov 1;40(20):10084-97
2231712 - J Mol Biol. 1990 Oct 5;215(3):403-10
22227904 - Sci China Life Sci. 2011 Dec;54(12):1121-8
15153305 - Brief Bioinform. 2004 Mar;5(1):39-55
16381887 - Nucleic Acids Res. 2006 Jan 1;34(Database issue):D363-8
20625424 - J Biomed Biotechnol. 2010;2010:853916
19261174 - Genome Biol. 2009;10(3):R25
20693479 - Genome Res. 2010 Oct;20(10):1432-40
19497933 - Bioinformatics. 2009 Aug 1;25(15):1966-7
22368243 - Bioinformatics. 2012 Apr 15;28(8):1086-92
2984356 - J Neurophysiol. 1985 Mar;53(3):821-35
21447597 - Database (Oxford). 2011;2011:bar009
19859660 - Cell Mol Life Sci. 2010 Feb;67(4):569-79
6137520 - J Neurophysiol. 1983 Sep;50(3):671-83
18349386 - Genome Res. 2008 May;18(5):821-9
21572440 - Nat Biotechnol. 2011 Jul;29(7):644-52
12493901 - Science. 2002 Dec 20;298(5602):2339-40
23160280 - Nat Methods. 2013 Jan;10(1):71-3
19131956 - Nat Protoc. 2009;4(1):44-57
DR Zerbino (ref15) 2008; 18
A Bairoch (ref21) 2004; 5
V Costa (ref2) 2010; 2010
B Langmead (ref16) 2009; 10
D Boehning (ref10) 2002; 298
MH Schulz (ref11) 2012; 28
(ref5) 1984; 166
F Chen (ref20) 2006; 34
G Chen (ref9) 2011; 54
Y Surget-Groba (ref13) 2010; 20
S Marguerat (ref1) 2010; 67
W Huang da (ref23) 2009; 4
PA Anderson (ref6) 1985; 53
MG Grabherr (ref12) 2011; 29
M Magrane (ref22) 2011; 2011
W BP Appeltans (ref3) 2012
R Li (ref18) 2009; 25
PAV Anderson (ref8) 1987; 133
A Roberts (ref17) 2013; 10
SF Altschul (ref19) 1990; 215
I Nookaew (ref14) 2012; 40
GaSP Lesh-Laurie (ref4) 1991
PA Anderson (ref7) 1983; 50
References_xml – volume: 54
  start-page: 1121
  issue: 12
  year: 2011
  ident: ref9
  article-title: Overview of available methods for diverse RNA-Seq data analyses
  publication-title: Science China Life sciences
  doi: 10.1007/s11427-011-4255-x
  contributor:
    fullname: G Chen
– volume: 2010
  start-page: 853916
  year: 2010
  ident: ref2
  article-title: Uncovering the complexity of transcriptomes with RNA-Seq
  publication-title: Journal of biomedicine & biotechnology
  doi: 10.1155/2010/853916
  contributor:
    fullname: V Costa
– volume: 166
  start-page: 396
  year: 1984
  ident: ref5
  article-title: An epithelial cell-free preparation of the motor nerve net of (Coelenterata; Scyphozoa)
  publication-title: Biological bulletin
  doi: 10.2307/1541225
– volume: 5
  start-page: 39
  issue: 1
  year: 2004
  ident: ref21
  article-title: Swiss-Prot: juggling between evolution and stability
  publication-title: Briefings in bioinformatics
  doi: 10.1093/bib/5.1.39
  contributor:
    fullname: A Bairoch
– volume: 2011
  start-page: bar009
  year: 2011
  ident: ref22
  article-title: UniProt Knowledgebase: a hub of integrated protein data
  publication-title: Database: the journal of biological databases and curation
  doi: 10.1093/database/bar009
  contributor:
    fullname: M Magrane
– volume: 133
  start-page: 231
  year: 1987
  ident: ref8
  article-title: Properties and pharmacology of a TTX-insensitive Na+ current in neurons of the jellyfish Cyanea capillata
  publication-title: Journal of Experimental Biology
  doi: 10.1242/jeb.133.1.231
  contributor:
    fullname: PAV Anderson
– volume: 10
  start-page: R25
  issue: 3
  year: 2009
  ident: ref16
  article-title: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome
  publication-title: Genome biology
  doi: 10.1186/gb-2009-10-3-r25
  contributor:
    fullname: B Langmead
– volume: 4
  start-page: 44
  issue: 1
  year: 2009
  ident: ref23
  article-title: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources
  publication-title: Nature protocols
  doi: 10.1038/nprot.2008.211
  contributor:
    fullname: W Huang da
– start-page: 185
  year: 1991
  ident: ref4
  article-title: Microscopic Anatomy of the Invertebrates 2
  contributor:
    fullname: GaSP Lesh-Laurie
– volume: 40
  start-page: 10084
  issue: 20
  year: 2012
  ident: ref14
  article-title: A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae
  publication-title: Nucleic acids research
  doi: 10.1093/nar/gks804
  contributor:
    fullname: I Nookaew
– volume: 20
  start-page: 1432
  issue: 10
  year: 2010
  ident: ref13
  article-title: Optimization of de novo transcriptome assembly from next-generation sequencing data
  publication-title: Genome research
  doi: 10.1101/gr.103846.109
  contributor:
    fullname: Y Surget-Groba
– volume: 18
  start-page: 821
  issue: 5
  year: 2008
  ident: ref15
  article-title: Velvet: algorithms for de novo short read assembly using de Bruijn graphs
  publication-title: Genome research
  doi: 10.1101/gr.074492.107
  contributor:
    fullname: DR Zerbino
– volume: 67
  start-page: 569
  issue: 4
  year: 2010
  ident: ref1
  article-title: RNA-seq: from technology to biology
  publication-title: Cellular and molecular life sciences: CMLS
  doi: 10.1007/s00018-009-0180-6
  contributor:
    fullname: S Marguerat
– volume: 298
  start-page: 2339
  issue: 5602
  year: 2002
  ident: ref10
  article-title: Circadian rhythms. Carbon monoxide and clocks
  publication-title: Science
  doi: 10.1126/science.1080339
  contributor:
    fullname: D Boehning
– volume: 25
  start-page: 1966
  issue: 15
  year: 2009
  ident: ref18
  article-title: SOAP2: an improved ultrafast tool for short read alignment
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btp336
  contributor:
    fullname: R Li
– volume: 34
  start-page: D363
  issue: Database issue
  year: 2006
  ident: ref20
  article-title: OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups
  publication-title: Nucleic acids research
  doi: 10.1093/nar/gkj123
  contributor:
    fullname: F Chen
– volume: 215
  start-page: 403
  issue: 3
  year: 1990
  ident: ref19
  article-title: Basic local alignment search tool
  publication-title: Journal of molecular biology
  doi: 10.1016/S0022-2836(05)80360-2
  contributor:
    fullname: SF Altschul
– volume: 53
  start-page: 821
  issue: 3
  year: 1985
  ident: ref6
  article-title: Physiology of a bidirectional, excitatory, chemical synapse
  publication-title: Journal of neurophysiology
  doi: 10.1152/jn.1985.53.3.821
  contributor:
    fullname: PA Anderson
– volume: 28
  start-page: 1086
  issue: 8
  year: 2012
  ident: ref11
  article-title: Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts094
  contributor:
    fullname: MH Schulz
– volume: 50
  start-page: 671
  issue: 3
  year: 1983
  ident: ref7
  article-title: Action potential in neurons of motor nerve net of Cyanea (Coelenterata)
  publication-title: Journal of neurophysiology
  doi: 10.1152/jn.1983.50.3.671
  contributor:
    fullname: PA Anderson
– volume: 29
  start-page: 644
  issue: 7
  year: 2011
  ident: ref12
  article-title: Full-length transcriptome assembly from RNA-Seq data without a reference genome
  publication-title: Nature biotechnology
  doi: 10.1038/nbt.1883
  contributor:
    fullname: MG Grabherr
– year: 2012
  ident: ref3
  article-title: World Register of Marine Species
  contributor:
    fullname: W BP Appeltans
– volume: 10
  start-page: 71
  issue: 1
  year: 2013
  ident: ref17
  article-title: Streaming fragment assignment for real-time analysis of sequencing experiments
  publication-title: Nature methods
  doi: 10.1038/nmeth.2251
  contributor:
    fullname: A Roberts
SSID ssj0053866
Score 2.2362547
Snippet Transcriptomes are one of the first sources of high-throughput genomic data that have benefitted from the introduction of Next-Gen Sequencing. As sequencing...
SourceID plos
doaj
pubmedcentral
proquest
gale
crossref
pubmed
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
StartPage e0138006
SubjectTerms Algorithms
Annotations
Assembly
Bioinformatics
Biology
Clustering
Computer applications
Cyanea capillata
Data processing
Evolution
Fragmentation
Fragments
Freeware
Gene expression
Gene families
Gene mapping
Gene sequencing
Genes
Genome
Genomes
Genomics
Homology
Nervous system
Neurons
Nucleotide sequence
Organisms
Parallel processing
Physiology
Proteins
RNA - genetics
RNA sequencing
Scaffolding
Scyphozoa
Seeds
Sequence Analysis, RNA - methods
Transcriptome
Workflow
Workflow software
SummonAdditionalLinks – databaseName: Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELbQnrggyqsLBQxCAg5pE-dh-7g8qoJEkVoKvUV-QqVusjTZA_-eGccbbVAlOHDJIZ6NsvOeeOYzIS-s1qxUziaVN2lScG8S4SQUK2WurIGQWTCcHf50XB2dFR_Py_Oto76wJ2yABx4Yd6AyKLNTwWWqbCF8oZnnqZdKCgtu1frgfVO2KaYGHwxWXFVxUC7n2UGUy_6qbdw-7s2leMLRViAKeP2jV56tLtvuupTzz87JrVB0eJvcijkkXQzvvkNuuOYO2YlW2tFXEUr69V3y7dQtgXUXhuLm7lJf_qKqsXTRNO2wBU9bT99dKd_Tk-NF537SELuCJ-kofqNt1z1VdISjpfDsdunukbPD91_eHiXxKIXEVJL1iQCmV6nhPreeeyhacgUyEqmEAAUJm1GlhaqaM2GV9B6sUlkEwtNOqhJ0zeb3yawB5u0SqiVUaS7jIHpZSM20gGsmjM-51Uymc5Js-FqvBsSMOmybcag0BgbVKIc6ymFO3iDzR1rEuw43QAvqqAX137RgTp6i6OpheHS02noBsTfD4WN4reeBAjEvGmyq-a7WXVd_-Pz1H4hOTyZELyORb0EJjIqDDPCfEEtrQrk3oQTLNZPlXVS0DVe6OuOQb0NBzDP45Ub5rl9-Ni7jQ7FRrnHtOtBAgQg1bT4nDwZdHTnLKkQ_lMWc8IkWT1g_XWkufgTI8QIcu5DFw_8hq0fkJmSdJTbdMLZHZv3V2j2GzK7XT4IR_waDoUuz
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Technology Collection
  dbid: 8FG
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjR3JbtNAdARBSFwqWpYaCgwICTi4tcfLzJxQWEJBokgthd6s8SylUmOnsXPo3_OePTE1qhCXHDIvlvP2N28j5KUpS5Ypa8Lc6ShMudOhsBKClSxRRoPJTBn2Dn89yPeP0y8n2Ym_cGt8WeVaJ3aK2tQa78j3Yg6-CAQLPH67uAhxaxRmV_0KjZvkVsx4jiV9YvZprYlBlvPct8slPN7z1Nld1JXdxQxdhHuOrpijbmr_oJsni_O6uc7x_Lt-8opBmt0lG96TpNOe9Jvkhq22yO1-t-TlFtn0UtvQ13609Jt75OeRnQMqzzTFZO-8PL-kqjJ0WlV1n5KntaMflsq19PBg2tgL2tmyTrM0FO9s61VLFR3G01J4dj2398nx7OP39_uhX60Q6lyyNhRAhDzS3CXGcQdBTKKAZiKSYLDAgdMqMxBlcyaMks6BlCqDg_FKK1UGvGeSB2RSARq3CS0lRG025sAKMpUlKwV8xkK7hJuSySgg4RrDxaKfoFF0aTQOkUePqgIpUniKBOQdkmGAxfnX3Rf18rTw4lSoOJZpJLiMlEmFS0vmeOSkksKAsTUuIM-QiEXfTDpIcTEF_omxGRle60UHgTMwKiyyOVWrpik-f_vxH0BHhyOgVx7I1cAOWvnGBvhPOFtrBLkzggRJ1qPjbWS5NVaa4g_Pwy_XbHj98fPhGB-KhXOVrVcdDASMEOMmAXnYc-2AWZbjNESZBoSP-HmE-vFJdfarG0GegqIXMn3079d6TO6Af5lheQ1jO2TSLlf2Cfhwbfm0E9Tfb-5GTw
  priority: 102
  providerName: ProQuest
– databaseName: Scholars Portal Open Access Journals
  dbid: M48
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3db9MwELem7oUXxMbHAgMMQgIeUiWOE8cPCJWPaSBtSBuFvUVObI9JbdI1qcT-e-6cDxFUEC99qC9Wcx--u97dz4S80HnOYmW0n9gi8LmwhZ8aCclKHCldgMvkDGeHT06T4zn_fBFf7JD-ztaOgfXW1A7vk5qvF9Of1zdvweDfuFsbRNg_NF1VpZli5c1hcO8y8I3Y5HXCh7oCWLerXmLU4icsiLphur_tMnJWDtN_OLknq0VVbwtL_-yu_M1dHd0ht7s4k85axdgjO6bcJ3udJdf0VQc3_fou-X5ulsDeq4JiAXiZL26oKjWdlWXVlulpZemHtbINPTud1eaaOv_mTpua4v-41aahig6QtRT2rpbmHpkfffz6_tjvrlvwi0Syxk9BMElQCBtpKywkNpECOaaBBCcGQV2hYg2Zt2CpVtJasFylESwvN1LFoI86uk8mJTDvgNBcQiZnQgHqIbnMWZ7CZ5gWNhI6ZzLwiN_zNVu1qBqZK60JyEZaBmUoh6yTg0feIfMHWsTEdl9U68usM7FMhaHkQSpkoDRPLc-ZFYGVSqYaHLC2HnmKosvaAdPBsrMZ-OcQB5ThZz13FIiLUWLjzaXa1HX26cu3_yA6PxsRveyIbAVKUKhu2AHeCfG2RpSHI0qw7mK0fICK1nOlzkIBMTkkzSKEJ3vl2778bFjGTbGZrjTVxtFAEgl5b-SRB62uDpxlCSIkSu4RMdLiEevHK-XVDwdLzuHwTyV_-O8XekRuQcwZY8sNY4dk0qw35jHEdU3-xJnqL6dxSr0
  priority: 102
  providerName: Scholars Portal
Title Semantic Assembly and Annotation of Draft RNAseq Transcripts without a Reference Genome
URI https://www.ncbi.nlm.nih.gov/pubmed/26393794
https://www.proquest.com/docview/1719292471
https://search.proquest.com/docview/1717472593
https://pubmed.ncbi.nlm.nih.gov/PMC4578894
https://doaj.org/article/a119408790ad48f4b2f70f9a98d718df
http://dx.doi.org/10.1371/journal.pone.0138006
Volume 10
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLe2cuGCGF_rGMUgJOCQNt-2j11ZGUgtU8dgt8gf8ajUJqVpD_z3PDtORNAOiIsP8UuUvA-_9-z3fkHojRIiTHiuvFRL34uJlh7NGSQrScSVBJcZh6Z3eDZPL67jzzfJzQFKml4YW7QvxXJYrNbDYvnD1lZu1nLU1ImNLmeTGNSMsnh0iA7B_TYper38ggGnqeuRi0gwciIZbsoiH5pjOVAzgwCcGiQ4FnfckUXtb9fm3mZVVncFnn_XT_7hkKYP0QMXSeJx_cZH6CAvHqEjZ6sVfucApd8_Rt-v8jUwcCmxOeJdi9UvzAuFx0VR1gfxuNT4w5brHV7Mx1X-E1sPZteTCpud2nK_wxy3oLQYnl2u8yfoenr-dXLhuR8qeDJl4c6jwPrUl0RHShMNqUvEQVLUZ-CmIGyTPFGQW5OQKs60BtvkysDhiZzxBDRORU9RrwA-HiMsGORqeUBAAVjMRCgojAGVOiJKhMzvI6_ha7apcTMye3hGIN-oGZQZkWROJH10Zpjf0hrUa3uh3N5mTvYZDwIW-5Qwn6uY6liEmviacUYVuFil--ilEV1Wt5C2tpuNwQMHpgUZXuu1pTDIF4Uprbnl-6rKPn359g9EV4sO0VtHpEtQAsldOwN8k0HU6lCedijBfmVn-tgoWsOVKgsIRN2QFpMA7myU7-7pV-20eagplyvycm9pIE2EzDbqo2e1rracbTS_j0hHizus786AHVrgcWd3J_9953N0HwLOxNTbhOEp6u22-_wFBHU7MQBTviEw0klgxunHAbp3dj6_XAzsNgmMs5gOrKn_Br-SUn8
link.rule.ids 230,314,727,780,784,864,885,2102,2221,12056,12223,12765,21388,24318,27924,27925,31719,31720,33266,33267,33373,33374,33744,33745,43310,43579,43600,43805,53791,53793,73745,74014,74035,74302
linkProvider National Library of Medicine
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9NAEF5BKgQXRMujgUIXhAQc3Pq93hNKX0qhDShtobfVeh-lUmOncXLov2fG3pgaVYhLDtmJ5cx7dne-IeS9zvMwkUZ7qVW-FzOrvMxwKFaSSGoFITMOsXf4eJQOz-Iv58m523Cr3LXKpU-sHbUuFe6RbwcMchEoFljweXrt4dQoPF11IzTukxVETk96ZGVnf_R9vPTFYM1p6hrmIhZsO_lsTcvCbOEZnY-Tjm4FpBq3v_XOvelVWd2Vev59g_JWSDp4Qh67XJIOGuGvknumWCMPmumSN2tk1dltRT86cOlPT8nPEzMBZl4qise9k_zqhspC00FRlM2hPC0t3ZtJO6fj0aAy17SOZrVvqSju2paLOZW0Bail8OxyYp6Rs4P9092h54YreCrl4dzLQAypr5iNtGUWyphIgtQyn0PIghROyURDnc3CTEtuLdip1AiNlxsuE9A-HT0nvQLYuE5ozqFuMwEDZeAxz8M8g88gUzZiOg-53yfeksNi2mBoiPogjUHt0bBKoESEk0if7KAYWlpEwK6_KGcXwhmUkEHAYz9j3Jc6zmych5b5lkueaQi32vbJJgpRNO2krR2LAWhQgO3I8FrvagpEwSjwms2FXFSVOPz24z-ITsYdog-OyJagDkq61gb4T4iu1aHc6FCCLavO8jqq3JIrlfij9fDLpRrevfy2XcaH4tW5wpSLmgZKRqhyoz550Whty9kwRTxEHvcJ6-hzh_XdleLyVw1CHoOrz3j88t-vtUkeDk-Pj8TR4ejrK_IIss0EL9uE4QbpzWcL8xoyunn-xpntb1ZKSqA
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjR1Nb9Mw1IJOIC4TGx_rGMwgJOCQLXE-HJ9Qx1ZtfJSpY2M3y7HjMWlNuqY97N_zXuKGBU2ISw_1q5W-75f3Rchbk2UsVrnxEqt9L-JWe2kuIFiJQ2U0mMyIYe_wt1FyeBp9Po_PXf1T5coqlzqxVtSm1PiOfDfg4ItAsMCDXevKIo73hx-n1x5ukMJMq1uncZ-sgFX0WY-s7B2MjsdLvQySnSSueS6EWxytdqZlke9gvs7HrUe3jFM9w7_V1L3pVVnd5Yb-XU15yzwNH5NV51fSQcMIa-ReXqyTB82myZt1suZkuKLv3aDpD0_Iz5N8Aoi91BRTv5Ps6oaqwtBBUZRNgp6Wlu7PlJ3T8WhQ5de0tmy1nqkovsEtF3OqaDuslsLd5SR_Sk6HBz8-HXpu0YKnE8HmXgokSXzNbWgstxDShAoomPoCzBe4c1rFBmJuzlKjhLUgs8rgmLwsFyoGTjThM9IrAI0bhGYCYrg84MAYIhIZy1L4DFJtQ24yJvw-8ZYYltNmnoask2oc4pAGVRIpIh1F-mQPydDC4jTs-otydiGdcEkVBCLyUy58ZaLURhmz3LdCidSA6TW2T7aRiLJpLW1lWg6AmwJsTYbHelND4ESMAnnrQi2qSh59P_sPoJNxB-idA7IlsINWrs0B_hNO2upAbnUgQa5153gDWW6JlUr-kQD45ZIN7z5-3R7jpVhGV-TlooaB8BEi3rBPnjdc22KWJTgbUUR9wjv83EF996S4_FUPJI9A7aci2vz3Y22ThyCx8uvR6MsL8ggczxjrbhjbIr35bJG_BOdunr1yUvsb6jVOzQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Semantic+Assembly+and+Annotation+of+Draft+RNAseq+Transcripts+without+a+Reference+Genome&rft.jtitle=PloS+one&rft.au=Ptitsyn%2C+Andrey&rft.au=Temanni%2C+Ramzi&rft.au=Bouchard%2C+Christelle&rft.au=Anderson%2C+Peter+A.+V&rft.date=2015-09-22&rft.pub=Public+Library+of+Science&rft.issn=1932-6203&rft.eissn=1932-6203&rft.volume=10&rft.issue=9&rft_id=info:doi/10.1371%2Fjournal.pone.0138006&rft.externalDocID=A471193590
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1932-6203&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1932-6203&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1932-6203&client=summon