N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation

Bibliographic Details
Published in Concurrency and computation Vol. 26; no. 4; pp. 972-986
Main Authors Goli, Mehdi; González-Vélez, Horacio
Format Journal Article
Language English
Published Blackwell Publishing Ltd 25.03.2014
ISSN 1532-0626 (print), 1532-0634 (electronic)
DOI 10.1002/cpe.3076

Abstract With the emergence of general-purpose computation on graphics processing units (GPUs), high-level approaches that hide the conceptual complexity of the low-level Compute Unified Device Architecture (CUDA) and Open Computing Language (OpenCL) platforms are the subject of active research. However, these approaches may require a trade-off in terms of achieved performance and utilisation of GPU hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute-force, all-pairs N-body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed-up over the serial version on a Tesla M2050, with lower implementation complexity than low-level CUDA programming. Copyright © 2013 John Wiley & Sons, Ltd.
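To make the brute-force, all-pairs N-body algorithm named in the abstract concrete, the following is a minimal serial sketch in Python, not the authors' code. It expresses the per-body force summation as a map over body indices, which is the kind of independent, data-parallel step that a map-style skeleton could distribute. The function names, the gravitational constant `g` and the `softening` parameter are illustrative assumptions that do not come from the record.

```python
import math


def acceleration_on(i, positions, masses, g=1.0, softening=1e-3):
    """Acceleration on body i by direct summation over all other bodies.

    `g` and `softening` are illustrative parameters (assumptions); softening
    keeps the denominator away from zero for near-coincident bodies.
    """
    xi, yi, zi = positions[i]
    ax = ay = az = 0.0
    for j, (xj, yj, zj) in enumerate(positions):
        if j == i:
            continue  # a body exerts no force on itself
        dx, dy, dz = xj - xi, yj - yi, zj - zi
        dist_sq = dx * dx + dy * dy + dz * dz + softening * softening
        scale = g * masses[j] / (dist_sq * math.sqrt(dist_sq))
        ax += scale * dx
        ay += scale * dy
        az += scale * dz
    return (ax, ay, az)


def all_pairs_accelerations(positions, masses, g=1.0, softening=1e-3):
    """O(N^2) all-pairs step written as a map over body indices.

    Each index is processed independently of the others, so the same
    function could be handed to a parallel map instead of the built-in one.
    """
    return list(
        map(
            lambda i: acceleration_on(i, positions, masses, g, softening),
            range(len(positions)),
        )
    )
```

For example, `all_pairs_accelerations([(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)], [1.0, 1.0], softening=0.0)` yields equal and opposite accelerations along the x axis. Every body interacts with every other body, so the work grows as N squared, which is why this formulation is a common GPU benchmark.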
Authors
Goli, Mehdi. IDEAS Research Institute, Robert Gordon University, Aberdeen, UK. Correspondence to: Mehdi Goli, Robert Gordon University, IDEAS Research Institute, St Andrew Street, Aberdeen AB25 1HG, Scotland, United Kingdom; m.goli@rgu.ac.uk
González-Vélez, Horacio. Cloud Competency Centre, National College of Ireland, Dublin, Ireland
Discipline Computer Science
Grant Information EU's Seventh Framework Programme, European Commission FP7 STREP "ParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore Systems", No. 288570
Peer Reviewed Yes
Scholarly Yes
License http://onlinelibrary.wiley.com/termsAndConditions#vor
Alternate Publication Title Concurrency Computat.: Pract. Exper.
Subject Terms algorithmic skeletons
Algorithms
Complexity
Computation
Devices
general-purpose computing on graphics processing units
GPU
Graphics processing units
Hardware
parallel computing
Performance evaluation
Serials
structured parallelism