SparkBLAST: scalable BLAST processing using in-memory operations

The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient computational systems. In this paper we propose SparkBLAST, a parallelization of a sequence alignment application (BLAST) that employs cloud computing f...

Full description

Saved in:
Bibliographic Details
Published inBMC bioinformatics Vol. 18; no. 1; p. 318
Main Authors de Castro, Marcelo Rodrigo, Tostes, Catherine Dos Santos, Dávila, Alberto M R, Senger, Hermes, da Silva, Fabricio A B
Format Journal Article
LanguageEnglish
Published England BioMed Central Ltd 27.06.2017
BioMed Central
BMC
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient computational systems. In this paper we propose SparkBLAST, a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computational resources and Apache Spark as the coordination framework. As a proof of concept, some radionuclide-resistant bacterial genomes were selected for similarity analysis. Experiments in Google and Microsoft Azure clouds demonstrated that SparkBLAST outperforms an equivalent system implemented on Hadoop in terms of speedup and execution times. The superior performance of SparkBLAST is mainly due to the in-memory operations available through the Spark framework, consequently reducing the number of local I/O operations required for distributed BLAST processing.
AbstractList Background The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient computational systems. In this paper we propose SparkBLAST, a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computational resources and Apache Spark as the coordination framework. As a proof of concept, some radionuclide-resistant bacterial genomes were selected for similarity analysis. Results Experiments in Google and Microsoft Azure clouds demonstrated that SparkBLAST outperforms an equivalent system implemented on Hadoop in terms of speedup and execution times. Conclusions The superior performance of SparkBLAST is mainly due to the in-memory operations available through the Spark framework, consequently reducing the number of local I/O operations required for distributed BLAST processing. Keywords: Cloud computing, Comparative genomics, Scalability, Spark
Background The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient computational systems. In this paper we propose SparkBLAST, a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computational resources and Apache Spark as the coordination framework. As a proof of concept, some radionuclide-resistant bacterial genomes were selected for similarity analysis. Results Experiments in Google and Microsoft Azure clouds demonstrated that SparkBLAST outperforms an equivalent system implemented on Hadoop in terms of speedup and execution times. Conclusions The superior performance of SparkBLAST is mainly due to the in-memory operations available through the Spark framework, consequently reducing the number of local I/O operations required for distributed BLAST processing.
BACKGROUNDThe demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient computational systems. In this paper we propose SparkBLAST, a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computational resources and Apache Spark as the coordination framework. As a proof of concept, some radionuclide-resistant bacterial genomes were selected for similarity analysis.RESULTSExperiments in Google and Microsoft Azure clouds demonstrated that SparkBLAST outperforms an equivalent system implemented on Hadoop in terms of speedup and execution times.CONCLUSIONSThe superior performance of SparkBLAST is mainly due to the in-memory operations available through the Spark framework, consequently reducing the number of local I/O operations required for distributed BLAST processing.
The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient computational systems. In this paper we propose SparkBLAST, a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computational resources and Apache Spark as the coordination framework. As a proof of concept, some radionuclide-resistant bacterial genomes were selected for similarity analysis. Experiments in Google and Microsoft Azure clouds demonstrated that SparkBLAST outperforms an equivalent system implemented on Hadoop in terms of speedup and execution times. The superior performance of SparkBLAST is mainly due to the in-memory operations available through the Spark framework, consequently reducing the number of local I/O operations required for distributed BLAST processing.
The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient computational systems. In this paper we propose SparkBLAST, a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computational resources and Apache Spark as the coordination framework. As a proof of concept, some radionuclide-resistant bacterial genomes were selected for similarity analysis. Experiments in Google and Microsoft Azure clouds demonstrated that SparkBLAST outperforms an equivalent system implemented on Hadoop in terms of speedup and execution times. The superior performance of SparkBLAST is mainly due to the in-memory operations available through the Spark framework, consequently reducing the number of local I/O operations required for distributed BLAST processing.
Abstract Background The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient computational systems. In this paper we propose SparkBLAST, a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computational resources and Apache Spark as the coordination framework. As a proof of concept, some radionuclide-resistant bacterial genomes were selected for similarity analysis. Results Experiments in Google and Microsoft Azure clouds demonstrated that SparkBLAST outperforms an equivalent system implemented on Hadoop in terms of speedup and execution times. Conclusions The superior performance of SparkBLAST is mainly due to the in-memory operations available through the Spark framework, consequently reducing the number of local I/O operations required for distributed BLAST processing.
ArticleNumber 318
Audience Academic
Author Senger, Hermes
de Castro, Marcelo Rodrigo
Dávila, Alberto M R
Tostes, Catherine Dos Santos
da Silva, Fabricio A B
Author_xml – sequence: 1
  givenname: Marcelo Rodrigo
  surname: de Castro
  fullname: de Castro, Marcelo Rodrigo
  organization: Computer Science Department, Federal University of São Carlos, Rod. Washington Luís, Km 235, São Carlos, 21040-900, Brazil
– sequence: 2
  givenname: Catherine Dos Santos
  surname: Tostes
  fullname: Tostes, Catherine Dos Santos
  organization: LBCS-IOC, Oswaldo Cruz Foundation, Av Brasil 4365, Rio de Janeiro, 21040-900, Brazil
– sequence: 3
  givenname: Alberto M R
  surname: Dávila
  fullname: Dávila, Alberto M R
  organization: LBCS-IOC, Oswaldo Cruz Foundation, Av Brasil 4365, Rio de Janeiro, 21040-900, Brazil
– sequence: 4
  givenname: Hermes
  surname: Senger
  fullname: Senger, Hermes
  organization: Computer Science Department, Federal University of São Carlos, Rod. Washington Luís, Km 235, São Carlos, 21040-900, Brazil
– sequence: 5
  givenname: Fabricio A B
  surname: da Silva
  fullname: da Silva, Fabricio A B
  email: fabricio.silva@fiocruz.br
  organization: PROCC, Oswaldo Cruz Foundation, Av. Brasil 4365, Rio de Janeiro, 21040-900, Brazil. fabricio.silva@fiocruz.br
BackLink https://www.ncbi.nlm.nih.gov/pubmed/28655296$$D View this record in MEDLINE/PubMed
BookMark eNptkluL1DAUx4usuBf9AL5IwRd96JqkubQ-iLOLl4EBwVmfQ5qcjBnbZjZpZffbm86sw1YkkMvJ7_zDOfmfZye97yHLXmJ0iXHF30VMKlYXCIsCC1IW1ZPsDFOBC4IRO3m0P83OY9yiBFaIPctOScUZIzU_yz6udyr8ulot1jfv86hVq5oW8v053wWvIUbXb_JxP7u-6KDz4T73OwhqcL6Pz7OnVrURXjysF9mPz59urr8Wq29flteLVaFZTYeCg7KkRLVoEOWGWqpqpoXhDeO6UVxgJKw1YBE0RnNmRIkVrWljCAKDrSkvsuVB13i1lbvgOhXupVdO7gM-bKQKg9MtSIsJr4XAihBLa2JTzQBANZgKNcaUSevDQWs3Nh0YDf0QVDsTnd_07qfc-N-S0aoqxSTw5kEg-NsR4iA7FzW0rerBj1HiGlMmKoaqhL7-B936MfSpVRPFEEc0fd2R2qhUgOutT-_qSVQuGMYlT1I0UZf_odIw0DmdvGFdis8S3s4SEjPA3bBRY4xyuf4-Z_GB1cHHGMAe-4GRnOwmD3aTyUVyspucinv1uJHHjL_-Kv8Av5DPag
CitedBy_id crossref_primary_10_1093_bioinformatics_bty733
crossref_primary_10_3390_sym13122385
crossref_primary_10_1007_s11227_020_03338_3
crossref_primary_10_1016_j_csbj_2022_04_014
crossref_primary_10_1007_s11227_020_03267_1
crossref_primary_10_1186_s12859_023_05245_9
crossref_primary_10_1016_j_imu_2022_101024
crossref_primary_10_1088_1742_6596_2179_1_012038
crossref_primary_10_1016_j_ajhg_2019_09_027
crossref_primary_10_2174_1389202921999200611155418
crossref_primary_10_3934_mbe_2021387
crossref_primary_10_3389_fdata_2021_727216
crossref_primary_10_1371_journal_pone_0249410
crossref_primary_10_1186_s12859_020_03757_2
crossref_primary_10_1016_j_genrep_2020_100869
Cites_doi 10.1016/S0022-2836(05)80360-2
10.1006/jmbi.1998.2144
10.1111/1751-7915.12059
10.1126/science.278.5338.631
10.1186/1471-2105-10-421
10.1093/nar/25.17.3389
10.1016/j.jbi.2013.07.001
10.1186/gb-2010-11-8-r83
10.1186/1471-2105-12-139
10.1002/cpe.3628
10.14778/2831360.2831365
10.1101/gr.1224503
ContentType Journal Article
Copyright COPYRIGHT 2017 BioMed Central Ltd.
Copyright BioMed Central 2017
The Author(s) 2017
Copyright_xml – notice: COPYRIGHT 2017 BioMed Central Ltd.
– notice: Copyright BioMed Central 2017
– notice: The Author(s) 2017
DBID CGR
CUY
CVF
ECM
EIF
NPM
AAYXX
CITATION
ISR
3V.
7QO
7SC
7X7
7XB
88E
8AL
8AO
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABUWG
AFKRA
ARAPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
JQ2
K7-
K9.
L7M
LK8
L~C
L~D
M0N
M0S
M1P
M7P
P5Z
P62
P64
PIMPY
PQEST
PQQKQ
PQUKI
PRINS
Q9U
7X8
5PM
DOA
DOI 10.1186/s12859-017-1723-8
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
CrossRef
Science in Context
ProQuest Central (Corporate)
Biotechnology Research Abstracts
Computer and Information Systems Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
Computing Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest Central
Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Technology Collection
Natural Science Collection
ProQuest One Community College
ProQuest Central Korea
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
ProQuest Health & Medical Complete (Alumni)
Advanced Technologies Database with Aerospace
Biological Sciences
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Computing Database
Health & Medical Collection (Alumni Edition)
Medical Database
Biological Science Database
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
Publicly Available Content Database
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest Central Basic
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
CrossRef
Publicly Available Content Database
Computer Science Database
ProQuest Central Student
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
ProQuest Health & Medical Complete (Alumni)
Computer and Information Systems Abstracts
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central China
ProQuest Central
Health Research Premium Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
Natural Science Collection
ProQuest Central Korea
Biological Science Collection
Advanced Technologies Database with Aerospace
ProQuest Medical Library (Alumni)
Advanced Technologies & Aerospace Collection
ProQuest Computing
ProQuest Biological Science Collection
ProQuest Central Basic
ProQuest Computing (Alumni Edition)
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest SciTech Collection
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest Health & Medical Complete
ProQuest Medical Library
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList
Publicly Available Content Database
MEDLINE - Academic
MEDLINE



Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 4
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1471-2105
EndPage 318
ExternalDocumentID oai_doaj_org_article_f1269771a22f492f805eee4ced80bdd3
A511365084
10_1186_s12859_017_1723_8
28655296
Genre Journal Article
GeographicLocations Brazil
GeographicLocations_xml – name: Brazil
GroupedDBID ---
-A0
0R~
23N
2WC
3V.
53G
5VS
6J9
7X7
88E
8AO
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AAKPC
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACRMQ
ADBBV
ADINQ
ADRAZ
ADUKV
AEAQA
AENEX
AFGXO
AFKRA
AFNRJ
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
ARAPS
AZQEC
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C24
C6C
CCPQU
CGR
CS3
CUY
CVF
DIK
DU5
DWQXO
E3Z
EAD
EAP
EAS
EBD
EBS
ECM
EIF
EJD
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
H13
HCIFZ
HMCUK
HYE
IAO
IHR
INH
INR
ISR
ITC
K6V
K7-
KQ8
LK8
M0N
M1P
M48
M7P
MK~
ML0
M~E
NPM
O5R
O5S
OK1
P2P
P62
PIMPY
PQQKQ
PROAC
PSQYO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XH6
XSB
AAYXX
ALIPV
CITATION
EBLON
PGMZT
ABVAZ
7QO
7SC
7XB
8AL
8FD
8FK
AHSBF
FR3
JQ2
K9.
L7M
L~C
L~D
P64
PQEST
PQUKI
PRINS
Q9U
7X8
5PM
ID FETCH-LOGICAL-c594t-6eaf23097b046d4f4a95c7d6b56cba67107ffdef0ebdc65d731a494bd20ed1fd3
IEDL.DBID RPM
ISSN 1471-2105
IngestDate Thu Jul 04 21:11:00 EDT 2024
Tue Sep 17 21:33:12 EDT 2024
Fri Aug 16 01:47:12 EDT 2024
Fri Sep 13 01:37:19 EDT 2024
Thu Feb 22 23:23:01 EST 2024
Fri Feb 02 04:23:43 EST 2024
Thu Aug 01 20:14:29 EDT 2024
Thu Sep 12 19:57:07 EDT 2024
Thu May 23 23:24:08 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Comparative genomics
Cloud computing
Spark
Scalability
Language English
License Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c594t-6eaf23097b046d4f4a95c7d6b56cba67107ffdef0ebdc65d731a494bd20ed1fd3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5488373/
PMID 28655296
PQID 1915060472
PQPubID 44065
PageCount 1
ParticipantIDs doaj_primary_oai_doaj_org_article_f1269771a22f492f805eee4ced80bdd3
pubmedcentral_primary_oai_pubmedcentral_nih_gov_5488373
proquest_miscellaneous_1914578508
proquest_journals_1915060472
gale_infotracmisc_A511365084
gale_infotracacademiconefile_A511365084
gale_incontextgauss_ISR_A511365084
crossref_primary_10_1186_s12859_017_1723_8
pubmed_primary_28655296
PublicationCentury 2000
PublicationDate 2017-06-27
PublicationDateYYYYMMDD 2017-06-27
PublicationDate_xml – month: 06
  year: 2017
  text: 2017-06-27
  day: 27
PublicationDecade 2010
PublicationPlace England
PublicationPlace_xml – name: England
– name: London
PublicationTitle BMC bioinformatics
PublicationTitleAlternate BMC Bioinformatics
PublicationYear 2017
Publisher BioMed Central Ltd
BioMed Central
BMC
Publisher_xml – name: BioMed Central Ltd
– name: BioMed Central
– name: BMC
References 20003500 - BMC Bioinformatics. 2009 Dec 15;10:421
2231712 - J Mol Biol. 1990 Oct 5;215(3):403-10
20701754 - Genome Biol. 2010;11(8):R83
9254694 - Nucleic Acids Res. 1997 Sep 1;25(17):3389-402
12952885 - Genome Res. 2003 Sep;13(9):2178-89
9790834 - J Mol Biol. 1998 Nov 6;283(4):707-25
9381173 - Science. 1997 Oct 24;278(5338):631-7
21554709 - BMC Bioinformatics. 2011 May 09;12:139
23617701 - Microb Biotechnol. 2013 Jul;6(4):349-60
23872175 - J Biomed Inform. 2013 Oct;46(5):774-81
H Senger (1723_CR5) 2016; 28
S Gopalani (1723_CR17) 2015; 113
C Camacho (1723_CR2) 2009; 10
A OĎriscoll (1723_CR4) 2013; 46
D Prakash (1723_CR16) 2013; 6
L Li (1723_CR20) 2003; 13
SF Altschul (1723_CR3) 1997; 25
P Bork (1723_CR14) 1998; 283
M Ding (1723_CR19) 2011
B Langmead (1723_CR6) 2010; 11
S Leo (1723_CR7) 2009
J Shi (1723_CR18) 2015; 8
M Zaharia (1723_CR12) 2010
A Matsunaga (1723_CR8) 2008
1723_CR10
VK Vavilapalli (1723_CR13) 2013
X Feng (1723_CR9) 2011; 12
M Zaharia (1723_CR11) 2012
RL Tatusov (1723_CR15) 1997; 278
SF Altschul (1723_CR1) 1990; 215
References_xml – volume: 215
  start-page: 403
  issue: 3
  year: 1990
  ident: 1723_CR1
  publication-title: J Mol Biol
  doi: 10.1016/S0022-2836(05)80360-2
  contributor:
    fullname: SF Altschul
– volume-title: Proceedings of the 2011 ACM Symposium on Research in Applied Computation
  year: 2011
  ident: 1723_CR19
  contributor:
    fullname: M Ding
– volume: 283
  start-page: 707
  issue: 4
  year: 1998
  ident: 1723_CR14
  publication-title: J Mol Biol
  doi: 10.1006/jmbi.1998.2144
  contributor:
    fullname: P Bork
– ident: 1723_CR10
– volume-title: Proceedings of the 4th Annual Symposium on Cloud Computing
  year: 2013
  ident: 1723_CR13
  contributor:
    fullname: VK Vavilapalli
– volume-title: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation
  year: 2012
  ident: 1723_CR11
  contributor:
    fullname: M Zaharia
– volume: 6
  start-page: 349
  issue: 4
  year: 2013
  ident: 1723_CR16
  publication-title: Microb Biotechnol
  doi: 10.1111/1751-7915.12059
  contributor:
    fullname: D Prakash
– volume: 278
  start-page: 631
  year: 1997
  ident: 1723_CR15
  publication-title: Science
  doi: 10.1126/science.278.5338.631
  contributor:
    fullname: RL Tatusov
– volume: 10
  start-page: 421
  issue: 1
  year: 2009
  ident: 1723_CR2
  publication-title: BMC Bioinforma
  doi: 10.1186/1471-2105-10-421
  contributor:
    fullname: C Camacho
– volume: 25
  start-page: 3389
  issue: 17
  year: 1997
  ident: 1723_CR3
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/25.17.3389
  contributor:
    fullname: SF Altschul
– volume-title: Intl. Conf. Parallel Processing Workshops ICPPW’09
  year: 2009
  ident: 1723_CR7
  contributor:
    fullname: S Leo
– volume-title: Proceedings of the 5th European Conference on Computer Systems
  year: 2010
  ident: 1723_CR12
  contributor:
    fullname: M Zaharia
– volume: 46
  start-page: 774
  issue: 5
  year: 2013
  ident: 1723_CR4
  publication-title: J Biomed Inform
  doi: 10.1016/j.jbi.2013.07.001
  contributor:
    fullname: A OĎriscoll
– volume: 11
  start-page: 1
  issue: 8
  year: 2010
  ident: 1723_CR6
  publication-title: Genome Biol
  doi: 10.1186/gb-2010-11-8-r83
  contributor:
    fullname: B Langmead
– volume: 12
  start-page: 1
  issue: 1
  year: 2011
  ident: 1723_CR9
  publication-title: BMC Bioinforma
  doi: 10.1186/1471-2105-12-139
  contributor:
    fullname: X Feng
– volume: 28
  start-page: 2503
  issue: 8
  year: 2016
  ident: 1723_CR5
  publication-title: Concurr Comput: Pract Experience
  doi: 10.1002/cpe.3628
  contributor:
    fullname: H Senger
– volume-title: Intl. Conf. on eScience (eScience’08)
  year: 2008
  ident: 1723_CR8
  contributor:
    fullname: A Matsunaga
– volume: 8
  start-page: 2110
  issue: 13
  year: 2015
  ident: 1723_CR18
  publication-title: Proc VLDB Endowment
  doi: 10.14778/2831360.2831365
  contributor:
    fullname: J Shi
– volume: 13
  start-page: 2178
  issue: 9
  year: 2003
  ident: 1723_CR20
  publication-title: Genome Res
  doi: 10.1101/gr.1224503
  contributor:
    fullname: L Li
– volume: 113
  start-page: 8
  issue: 1
  year: 2015
  ident: 1723_CR17
  publication-title: Int J Comput Appl
  contributor:
    fullname: S Gopalani
SSID ssj0017805
Score 2.3906868
Snippet The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient...
Background The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient...
BACKGROUNDThe demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and efficient...
Abstract Background The demand for processing ever increasing amounts of genomic data has raised new challenges for the implementation of highly scalable and...
SourceID doaj
pubmedcentral
proquest
gale
crossref
pubmed
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
StartPage 318
SubjectTerms Algorithms
Alignment
Bacteria
Bioinformatics
Cloud Computing
Comparative Genomic Hybridization
Comparative genomics
Computer applications
Data processing
Databases, Factual
Fault tolerance
Genetic algorithms
Genomes
Genomics
Nucleotide sequence
Parallel processing
Provisioning
Radioisotopes
Scalability
Scheduling
Sequence Alignment
Similarity
Software
Spark
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3Ni9UwEA-yIHgRv62uUkUQhLBNmiapJ9-KyyriwbcLewv51EXse2zfO-x_70zSfbziwYuXQpspNL_JdGbIzC-EvAksgBfRljKrBBWyDdT1ilHFuVfetQFsCqstvsnTc_HlorvYO-oLa8IKPXAB7igxLiFGYZbzJHqedNPFGIWPQTcuhMLzybqbZGraP0Cm_mkPk2l5NDLkaaP4RwaH3VI980KZrP_vX_KeT5rXS-45oJN75O4UOdaL8sX3ya04PCC3y1mS1w_Jh-XaXv06_rpYnr2vR0Aee6LqfF-vSzcAeKl6m6-XA_2NJbbX9WodyxoYH5Hzk09nH0_pdDwC9V0vNlRGmyCB6JWDHDeIJGzfeRWk66R3VkLooFIKMTXRBS-7oFpmRS9c4E0MLIX2MTkYVkN8SmqwQw2phe97YcGIG6uZhTTVisha1ylbkXc3cJl1YcEwOXvQ0hRsDWBrEFujK3KMgO4EkcA6PwC1mkmt5l9qrchrVIdBiooBa2B-2O04ms_L72bR4Tk0EFiKirydhNIKFOPt1FIAk0JWq5nk4UwSbMjPh2-0biYbHg1kspl9UfGKvNoN45tYlzbE1TbLCKQLamDWT8oi2c079_zyXlZEzZbPDJj5yHD5MzN8QxqpW9U--x9IPid3OC78RlKuDsnB5mobX0AgtXEvs838AX7MGYg
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Central
  dbid: BENPR
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3da9RAEF9qi-CL-N1olSiCICyX3Wx2E1_0TlpOkSK9Fvq27FdqEZPzcvfQ_96ZTe5sEHw5yO0c3M7Hzkx25jeEvPXMgxcpDWVGCSpk7qmtFKOKc6eczT3YFFZbnMr5hfh6WVzukfm2FwbLKrdnYjyofevwHfkE8oqIhaf4xFh8C-DWk4_L3xTnR-E96zBM4w454Ezghe3B7Pj0-9nuRgGx-4dbTVbKSccQuY3iGQ0uPKflyC9F-P5_D-lbXmpcQXnLJZ08IPeHWDKd9sJ_SPZC84jc7adL3jwmnxZLs_o5-zZdnH9IO5AFdkml8Tld9v0B4LfSTfy8bugvLLq9Sdtl6LWie0IuTo7PP8_pMDCBuqISayqDqSGlqJSFrNeLWpiqcMpLW0hnjYRgQtW1D3UWrHey8CpnRlTCep4Fz2qfPyX7TduEQ5KCZZaQbLiqEgbMOjMlM5C4GhFYbgtlEvJ-yy697HExdMwnSql73mrgrUbe6jIhM2TojhAhreMX7epKDxaia8YlBKPMcF6LitcgrxCCcMGXmfU-T8gbFIdG0IoGq2KuzKbr9JfFmZ4WOJkGQk2RkHcDUd2ijpihyQA2hThXI8qjESVYlRsvb6WuB6vu9F8dTMjr3TL-EivVmtBuIo1AAKEMdv2sV5LdvmMXMK9kQtRIfUaMGa801z8i5jcklmWu8uf__1svyD2OKp1JytUR2V-vNuElBE1r-2qwhz-rjhWU
  priority: 102
  providerName: ProQuest
– databaseName: Scholars Portal Journals: Open Access
  dbid: M48
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3da9RAEF9qRfBF_DZaJYogCKvZj-wmguhVLFXUB6-Fvi2b_ahFzZ2XO_D-e2c2ubPBPvgSSHYSsvORmSEzvyHkqWcevEhlKbNaUqmEp02tGdWcO-0a4cGmsNriizo8lh9PypMdshlvNTCwuzC1w3lSx4sfL37_Wr8Bg3-dDL5SLzuGKGwUv7fgjgWtLpHLXAqJCv9Z_v2pgPD9qdkI3gUynXL4yXnhI0ZuKqH5__vNPue0xgWV5zzUwXVybQgt80mvCzfITmhvkiv9sMn1LfJ2OreL7_ufJtOjV3kHosGmqTyd5_O-XQDcWL5Kx7OW_sQa3HUO7OiVpLtNjg_eH707pMP8BOrKWi6pCjZChlHrBpJgL6O0dem0V02pXGMVxBY6Rh9iERrvVOm1YFbWsvG8CJ5FL-6Q3XbWhnskB0OtIPdwdS0tWHlhK2Yhj7UyMNGU2mbk-YZdZt7DZJiUXlTK9Lw1wFuDvDVVRvaRoVtCRLhOF2aLUzMYjImMK4hNmeU8yppHkF0IQbrgq6LxXmTkCYrDIIZFi0Uyp3bVdebD9KuZlDioBiJPmZFnA1GcgWCcHXoOYFMIezWi3BtRgpG58fJG6majowZS3QTPqHlGHm-X8U4sXGvDbJVoJOIJFbDru72SbPedmoJ5rTKiR-ozYsx4pT37liDAIc-shBb3_4cJD8hVjopdKMr1HtldLlbhIURSy-ZRso8_OBMXYA
  priority: 102
  providerName: Scholars Portal
Title SparkBLAST: scalable BLAST processing using in-memory operations
URI https://www.ncbi.nlm.nih.gov/pubmed/28655296
https://www.proquest.com/docview/1915060472/abstract/
https://search.proquest.com/docview/1914578508
https://pubmed.ncbi.nlm.nih.gov/PMC5488373
https://doaj.org/article/f1269771a22f492f805eee4ced80bdd3
Volume 18
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3da9swEBdtx2AvY9_11gVvDAYDNZYsS_aelpRmXVhLaVrIm5AluQtbHBMnD_3vd5LtELO3vcjYOoN1H747--4nhD4ZYsCLpAoTJRhmPDY4zwTBglItdB4bsClXbXHFL-7YdJ7MD1DS9cL4on2dL07LP8vTcvHL11ZWSz3s6sSG15dnEGVDXhUPD9GhiOMuRW9_HTiQ_vb3JUn5sCYOog27lzH46hi7Dfp8NyZ1OP17vshD9v_7Yt7zTP2qyT03NHmGnrbxYzhqnvM5OrDlC_S42VHy4SX6NqvU-vf452h2-zWsgf-uMyr052HV9ASArwq3flyUeOkKbR_CVWUbTahfobvJ-e3ZBW43ScA6ydgGc6sKSCMykUOma1jBVJZoYXiecJ0rDgGEKApji8jmRvPEiJgolrHc0MgaUpj4NToqV6U9RiFYYwoJhs4ypsCUI5USBcmqYpbEeSJUgL507JJVg4UhfQ6RctmwWQKbpWOzTAM0dgzdEToYa39htb6XrTBlQSiHAJQoSguW0QJEZ61l2po0yo2JA_TRiUM6oIrSVcLcq21dyx-zGzlK3G40EF6yAH1uiYoVCEartrEAFuWwrXqUJz1KsCTdn-6kLltLriXksx6DUdAAfdhNuztddVppV1tPwxxoUASrftMoyW7dna4FSPTUp8eY_gyovcf5btX87X_f-Q49oU7xI46pOEFHm_XWvocYapMPwHLmAsZ08n2AHo1G09kUjuPzq-ubgf8uAeMlSwfetv4CHwMgVA
link.rule.ids 230,315,733,786,790,870,891,2115,12083,12792,21416,24346,27957,27958,31754,31755,33408,33409,33779,33780,43345,43635,43840,53827,53829
linkProvider National Library of Medicine
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3di9QwEA96Ivoifls9tYogCOGajyatL7onHnu63oO7B_cW0iQ9D7Fdt7sP9987k3bXK4IvhTZTaOYjM5NOfkPIG888eJHCUma1pFIJT6tSM6o5d9pVwoNNYbXFiZqeyi9n-dmw4dYNZZXbNTEu1L51uEd-AHlFxMLT_MPyN8WuUfh3dWihcZ3ckEJILOnTZ7uEiyFe__AnkxXqoGOI1kZxXQa3LWgx8kURsv_fhfmKZxpXTV5xQ0d3yZ0hfkwnvcDvkWuhuU9u9h0lLx-Qj_OlXf08nE3mi_dpB_zHk1FpvE-X_ZkA8FXpJl4vGvoLC20v03YZek3oHpLTo8-LT1M6NEmgLi_lmqpga0gjSl1BputlLW2ZO-1VlStXWQUBhK5rH-osVN6p3GvBrCxl5XkWPKu9eET2mrYJT0gK1lhAguHKUlow5cwWzEKyamVgosq1Tci7LbvMssfCMDGHKJTpeWuAtwZ5a4qEHCJDd4QIYx0ftKtzM1iFqRlXEIAyy3ktS16DvEII0gVfZJX3IiGvURwGgSoarIQ5t5uuM8fz72aSYzcaCC9lQt4ORHULgnF2OFgAk0JsqxHl_ogSLMmNh7dSN4Mld-av3iXk1W4Y38TqtCa0m0gjETQog1k_7pVkN-948peXKiF6pD4jxoxHmosfEecbkslCaPH0_5_1ktyaLr7NzOz45OszcpujemeKcr1P9tarTXgOQdO6ehEt4w_0URNh
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9NAEF5BEagX3g9DAYOQkJCc2Ov1rs2JtBC1UKqKtFLVy2qfbVTiWHFyKL-e2bUdxXDrxZK948N4ZnZm5G-_QeiDTjRkkVxEiWAkIjTVkSxYEjGMFVMy1RBTDm1xRPdPyfez7Gxj1JcH7Ss5HZS_Z4NyeumxldVMDTuc2PD45x5U2dBXpcNK2-FtdAdiFhddo97-QHBU_e1PzCSnwzpxRG2R25IhY6eRG9Pnz2Rix9a_kZE8cf__2_NGfupjJzeS0fgBOu_UaDAoV4PVUg7Un38YHm-k50N0vy1Rw1Ej8gjdMuVjdLcZWnn9BH2ZVGJxtXs4mpx8DmswsTt8Ffr7sGqOHUA6DFf-Oi2jmcPyXofzyjTOVj9Fp-NvJ3v7UTuHIVJZQZYRNcJCp1IwCc20JpaIIlNMU5lRJQWFGoVZq42NjdSKZpqliSAFkRrHRidWp8_QVjkvzQsUQsDn0MOooiACdotY5ImAflgQk6QyYyJAnzpb8Kqh2-C-Tckpb2zIwYbc2ZDnAdp11loLOqZs_2C-uODtd-Q2wRRq3ERgbEmBLfiFMYYoo_NYap0G6L2zNXdcGKUD21yIVV3zg8kvPsrcwBuoYEmAPrZCdg5WV6I9uwBKOfqsnuROTxKCVfWXO5fi7WZRc2iZPc0jwwF6t152bzoAXGnmKy9DHC9RDFo_bzxwrXfnyAFiPd_sfZj-CnicpxJvPezljd98i-4dfx3zw4OjH6_QNnYBFtMIsx20tVyszGuo2JbyjY_NvzdTPhA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=SparkBLAST%3A+scalable+BLAST+processing+using+in-memory+operations&rft.jtitle=BMC+bioinformatics&rft.au=de+Castro%2C+Marcelo+Rodrigo&rft.au=Tostes%2C+Catherine+dos+Santos&rft.au=D%C3%A1vila%2C+Alberto+M.+R&rft.au=Senger%2C+Hermes&rft.date=2017-06-27&rft.pub=BioMed+Central+Ltd&rft.issn=1471-2105&rft.eissn=1471-2105&rft.volume=18&rft.issue=1&rft_id=info:doi/10.1186%2Fs12859-017-1723-8&rft.externalDBID=ISR&rft.externalDocID=A511365084
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon