N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation

SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches ma...

Full description

Saved in:

Bibliographic Details
Published in	Concurrency and computation Vol. 26; no. 4; pp. 972 - 986
Main Authors	Goli, Mehdi, González-Vélez, Horacio
Format	Journal Article
Language	English
Published	Blackwell Publishing Ltd 25.03.2014
Subjects	algorithmic skeletons Algorithms Complexity Computation Devices general-purpose computing on graphics processing units GPU Graphics processing units Hardware parallel computing Performance evaluation Serials structured parallelism
Online Access	Get full text
ISSN	1532-0626 1532-0634
DOI	10.1002/cpe.3076

Cover

Loading…

Abstract	SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd.
AbstractList	SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd. With the emergence of general-purpose computation on graphics processing units, high-level approaches that hide the conceptual complexity of the low-level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade-off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all-pairs N-body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed-up over serial version with a Tesla M2050 with lower implementation complexity than low-level Compute Unified Device Architecture programming. Copyright copyright 2013 John Wiley & Sons, Ltd. With the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N ‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd.
Author	González-Vélez, Horacio Goli, Mehdi
Author_xml	– sequence: 1 givenname: Mehdi surname: Goli fullname: Goli, Mehdi email: Correspondence to: Mehdi Goli, Robert Gordon University, IDEAS Research Institute, St Andrew Street, Aberdeen AB25 1HG, Scotland, United Kingdom., m.goli@rgu.ac.uk organization: IDEAS Research Institute, Robert Gordon University, Aberdeen, UK – sequence: 2 givenname: Horacio surname: González-Vélez fullname: González-Vélez, Horacio organization: Cloud Competency Centre, National College of Ireland, Dublin, Ireland
BookMark	eNp1kMtO4zAUhi3ESFwGiUfwkk2KHde5sIMKGKADRQKxtBznBEwdO9jOQF-Bpyalo46Qho2PF9__n6NvB21aZwGhfUpGlJD0UHUwYiTPNtA25SxNSMbGm-t_mm2hnRCeCaGUMLqN3q-TytULrFzb9VFG7WzAfdD2EYc5GIjS4MbLFl6dnwfsLG57E7VyHvBkdn_46GX3pFXAnXcKwmewtzpi6dWTjqBi7yEcYWkxtJ32Wg2FHfjG-VZaBRj-SNN_7v2JfjTSBNj7O3fR_dnp3eRXMr05v5gcTxPFWJYlknIo86oCzgrZlLwmPCuKLE_HFS_48DCoOB3XVVoqAnXDVTMeMrmsq4qotGG76GDVO5z80kOIotVBgTHSguuDoDwlZVqUJRvQ0QpV3oXgoRFKryRFL7URlIildDFIF0vp_7rXgc7rVvrF_9Bkhb5qA4tvOTGZnX7ldYjwtualn4ssZzkXD9fnYnbCH4rb35fiin0AIfyl2g
CitedBy_id	crossref_primary_10_1002_cpe_3077 crossref_primary_10_1016_j_future_2013_12_038 crossref_primary_10_1007_s10766_016_0419_4 crossref_primary_10_1007_s10766_017_0511_4
Cites_doi	10.1142/S0129626405002209 10.1145/1863482.1863487 10.1887/0852743920 10.1007/978‐3‐642‐23397‐5_17 10.1063/1.1704156 10.1002/cpe.1860 10.1002/spe.1026 10.1038/324446a0 10.1111/j.1467‐8659.2007.01012.x
ContentType	Journal Article
Copyright	Copyright © 2013 John Wiley & Sons, Ltd.
Copyright_xml	– notice: Copyright © 2013 John Wiley & Sons, Ltd.
DBID	BSCLL AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D
DOI	10.1002/cpe.3076
DatabaseName	Istex CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional
DatabaseTitle	CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional
DatabaseTitleList	Computer and Information Systems Abstracts CrossRef
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISSN	1532-0634
EndPage	986
ExternalDocumentID	10_1002_cpe_3076 CPE3076 ark_67375_WNG_PB5W8QMJ_K
Genre	article
GrantInformation_xml	– fundername: EU's Seventh Framework Programme funderid: 288570 – fundername: European Commission: FP7 STREP ŞParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore SystemsŤ; funderid: 288570
GroupedDBID	.3N .DC .GA .Y3 05W 0R~ 10A 1L6 1OC 33P 3SF 3WU 4.4 50Y 50Z 51W 51X 52M 52N 52O 52P 52S 52T 52U 52W 52X 5GY 5VS 66C 702 7PT 8-0 8-1 8-3 8-4 8-5 8UM 930 A03 AAESR AAEVG AAHHS AANLZ AAONW AASGY AAXRX AAZKR ABCQN ABCUV ABEML ABIJN ACAHQ ACBWZ ACCFJ ACCZN ACPOU ACSCC ACXBN ACXQS ADBBV ADEOM ADIZJ ADKYN ADMGS ADOZA ADXAS ADZMN ADZOD AEEZP AEIGN AEIMD AEQDE AEUQT AEUYR AFBPY AFFPM AFGKR AFPWT AHBTC AITYG AIURR AIWBW AJBDE AJXKR ALMA_UNASSIGNED_HOLDINGS ALUQN AMBMR AMYDB ATUGU AUFTA AZBYB BAFTC BDRZF BFHJK BHBCM BMNLL BROTX BRXPI BSCLL BY8 CS3 D-E D-F DCZOG DPXWK DR2 DRFUL DRSTM EBS EJD F00 F01 F04 F5P G-S G.N GNP GODZA HGLYW HHY HZ~ IX1 JPC KQQ LATKE LAW LC2 LC3 LEEKS LH4 LITHE LOXES LP6 LP7 LUTES LW6 LYRES MEWTI MK4 MRFUL MRSTM MSFUL MSSTM MXFUL MXSTM N04 N05 N9A O66 O9- OIG P2W P2X P4D PQQKQ Q.N Q11 QB0 QRW R.K ROL RWI RX1 SUPJJ TN5 UB1 V2E W8V W99 WBKPD WIH WIK WOHZO WQJ WRC WXSBR WYISQ WZISG XG1 XV2 ~IA ~WT AAHQN AAMNL AANHP AAYCA ACRPL ACYXJ ADNMO AFWVQ ALVPJ AAYXX ADMLS AEYWJ AGQPQ AGYGG CITATION 1OB 7SC 8FD JQ2 L7M L~C L~D
ID	FETCH-LOGICAL-c3366-a15e97bbe538af95d056886724b5854b53eb514db29c0edf5cf4a157adbb0c2f3
IEDL.DBID	DR2
ISSN	1532-0626
IngestDate	Thu Sep 04 19:04:44 EDT 2025 Tue Jul 01 00:33:51 EDT 2025 Thu Apr 24 22:53:16 EDT 2025 Wed Jan 22 16:47:55 EST 2025 Wed Oct 30 09:48:08 EDT 2024
IsPeerReviewed	true
IsScholarly	true
Issue	4
Language	English
License	http://onlinelibrary.wiley.com/termsAndConditions#vor
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c3366-a15e97bbe538af95d056886724b5854b53eb514db29c0edf5cf4a157adbb0c2f3
Notes	istex:16AE1610BB93CFFEE7FDB6DDBD860E707611989F ArticleID:CPE3076 European Commission: FP7 STREP ŞParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore SystemsŤ; - No. 288570 EU's Seventh Framework Programme - No. 288570 ark:/67375/WNG-PB5W8QMJ-K ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23
PQID	1520928993
PQPubID	23500
PageCount	15
ParticipantIDs	proquest_miscellaneous_1520928993 crossref_citationtrail_10_1002_cpe_3076 crossref_primary_10_1002_cpe_3076 wiley_primary_10_1002_cpe_3076_CPE3076 istex_primary_ark_67375_WNG_PB5W8QMJ_K
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	25 March 2014
PublicationDateYYYYMMDD	2014-03-25
PublicationDate_xml	– month: 03 year: 2014 text: 25 March 2014 day: 25
PublicationDecade	2010
PublicationTitle	Concurrency and computation
PublicationTitleAlternate	Concurrency Computat.: Pract. Exper
PublicationYear	2014
Publisher	Blackwell Publishing Ltd
Publisher_xml	– name: Blackwell Publishing Ltd
References	Hockney R, Eastwood J. Computer Simulation Using Particles. Taylor & Francis: Abingdon, 1988. Nyland L, Harris M, Prins J. Fast N-body simulation with CUDA. GPU Gems 2007; 3:677-695. McGuire JB. Study of exactly soluble one-dimensional N-body problems. Journal of Mathematical Physics 1964; 5(5):622-636. DOI: 10.1063/1.1704156. González-Vélez H, Leyton M. A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers. Software-Practice and Experience 2010; 40(12):1135-1160. DOI: 10.1002/spe.1026. Bischof H, Gorlatch S, Leshchinskiy R, Müller J. Data parallelism in C++ template programs: a Barnes-Hut case study. Parallel Processing Letters 2005; 15(3):257-272. DOI: 10.1142/S0129626405002209. Barnes J, Hut P. A hierarchical O(N log N) force-calculation algorithm. Nature 1986; 324:446-449. DOI: 10.1038/324446a0. Malik M, Li T, Sharif U, Shahid R, El-Ghazawi T, Newby G. Productivity of GPUs under different programming paradigms. Concurrency and Computation: Practice and Experience 2012; 24(2):179-191. DOI: 10.1002/cpe.1860. Owens JD, Luebke D, Govindaraju N, Harris M, Krüger J, Lefohn AE, Purcell TJ. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 2007; 26(1):80-113. DOI: 10.1111/j.1467-8659.2007.01012.x. Cole M. Algorithmic Skeletons: Structured Management of Parallel Computation, Research Monographs in Parallel and Distributed Computing. MIT Press/Pitman: London, 1989. 2011; 6853 1986; 324 2012 2011 2010 1964; 5 1987 2009 2007; 3 2005; 15 2012; 24 2010; 40 2007; 26 1989 1988 e_1_2_8_17_1 e_1_2_8_18_1 e_1_2_8_19_1 e_1_2_8_13_1 e_1_2_8_14_1 e_1_2_8_16_1 e_1_2_8_3_1 e_1_2_8_2_1 e_1_2_8_5_1 e_1_2_8_4_1 e_1_2_8_7_1 e_1_2_8_6_1 e_1_2_8_9_1 Bell N (e_1_2_8_11_1) 2011 Cole M (e_1_2_8_8_1) 1989 e_1_2_8_10_1 e_1_2_8_12_1 Nyland L (e_1_2_8_15_1) 2007; 3
References_xml	– reference: Cole M. Algorithmic Skeletons: Structured Management of Parallel Computation, Research Monographs in Parallel and Distributed Computing. MIT Press/Pitman: London, 1989. – reference: Hockney R, Eastwood J. Computer Simulation Using Particles. Taylor & Francis: Abingdon, 1988. – reference: Owens JD, Luebke D, Govindaraju N, Harris M, Krüger J, Lefohn AE, Purcell TJ. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 2007; 26(1):80-113. DOI: 10.1111/j.1467-8659.2007.01012.x. – reference: González-Vélez H, Leyton M. A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers. Software-Practice and Experience 2010; 40(12):1135-1160. DOI: 10.1002/spe.1026. – reference: Malik M, Li T, Sharif U, Shahid R, El-Ghazawi T, Newby G. Productivity of GPUs under different programming paradigms. Concurrency and Computation: Practice and Experience 2012; 24(2):179-191. DOI: 10.1002/cpe.1860. – reference: Bischof H, Gorlatch S, Leshchinskiy R, Müller J. Data parallelism in C++ template programs: a Barnes-Hut case study. Parallel Processing Letters 2005; 15(3):257-272. DOI: 10.1142/S0129626405002209. – reference: Barnes J, Hut P. A hierarchical O(N log N) force-calculation algorithm. Nature 1986; 324:446-449. DOI: 10.1038/324446a0. – reference: Nyland L, Harris M, Prins J. Fast N-body simulation with CUDA. GPU Gems 2007; 3:677-695. – reference: McGuire JB. Study of exactly soluble one-dimensional N-body problems. Journal of Mathematical Physics 1964; 5(5):622-636. DOI: 10.1063/1.1704156. – volume: 324 start-page: 446 year: 1986 end-page: 449 article-title: A hierarchical ( log ) force‐calculation algorithm publication-title: Nature – year: 2011 – year: 2009 – volume: 15 start-page: 257 issue: 3 year: 2005 end-page: 272 article-title: Data parallelism in C++ template programs: a Barnes‐Hut case study publication-title: Parallel Processing Letters – volume: 5 start-page: 622 issue: 5 year: 1964 end-page: 636 article-title: Study of exactly soluble one‐dimensional N‐body problems publication-title: Journal of Mathematical Physics – start-page: 359 year: 2011 end-page: 371 – volume: 6853 start-page: 170 year: 2011 end-page: 181 – volume: 3 start-page: 677 year: 2007 end-page: 695 article-title: Fast N‐body simulation with CUDA publication-title: GPU Gems – year: 1987 – year: 1988 – year: 1989 – volume: 40 start-page: 1135 issue: 12 year: 2010 end-page: 1160 article-title: A survey of algorithmic skeleton frameworks: high‐level structured parallel programming enablers publication-title: Software–Practice and Experience – volume: 24 start-page: 179 issue: 2 year: 2012 end-page: 191 article-title: Productivity of GPUs under different programming paradigms publication-title: Concurrency and Computation: Practice and Experience – volume: 26 start-page: 80 issue: 1 year: 2007 end-page: 113 article-title: A survey of general‐purpose computation on graphics hardware publication-title: Computer Graphics Forum – start-page: 5 year: 2010 end-page: 14 – year: 2012 – start-page: 359 volume-title: GPU Computing Gems year: 2011 ident: e_1_2_8_11_1 – volume: 3 start-page: 677 year: 2007 ident: e_1_2_8_15_1 article-title: Fast N‐body simulation with CUDA publication-title: GPU Gems – ident: e_1_2_8_14_1 doi: 10.1142/S0129626405002209 – ident: e_1_2_8_10_1 doi: 10.1145/1863482.1863487 – ident: e_1_2_8_17_1 – ident: e_1_2_8_19_1 – ident: e_1_2_8_5_1 doi: 10.1887/0852743920 – ident: e_1_2_8_16_1 doi: 10.1007/978‐3‐642‐23397‐5_17 – ident: e_1_2_8_2_1 doi: 10.1063/1.1704156 – ident: e_1_2_8_18_1 – ident: e_1_2_8_12_1 – ident: e_1_2_8_7_1 doi: 10.1002/cpe.1860 – volume-title: Algorithmic Skeletons: Structured Management of Parallel Computation year: 1989 ident: e_1_2_8_8_1 – ident: e_1_2_8_9_1 doi: 10.1002/spe.1026 – ident: e_1_2_8_3_1 doi: 10.1038/324446a0 – ident: e_1_2_8_4_1 – ident: e_1_2_8_13_1 – ident: e_1_2_8_6_1 doi: 10.1111/j.1467‐8659.2007.01012.x
SSID	ssj0011031
Score	2.040867
Snippet	SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the... With the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level... With the emergence of general-purpose computation on graphics processing units, high-level approaches that hide the conceptual complexity of the low-level...
SourceID	proquest crossref wiley istex
SourceType	Aggregation Database Enrichment Source Index Database Publisher
StartPage	972
SubjectTerms	algorithmic skeletons Algorithms Complexity Computation Devices general-purpose computing on graphics processing units GPU Graphics processing units Hardware parallel computing Performance evaluation Serials structured parallelism
Title	N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation
URI	https://api.istex.fr/ark:/67375/WNG-PB5W8QMJ-K/fulltext.pdf https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fcpe.3076 https://www.proquest.com/docview/1520928993
Volume	26
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PT9swFLYQXLgM2A8NxiZPmtgpbeLESbsbq2AItKqbVoG0g2W_2AiVpRFppY0TfwCH_Y37S_aek7QwbRLaJTnEThz7Pft79ufPjL3BNgXrcghweM0wQEGX0qF0QQy9LLO5TCLwap_D9GicHJ_Js4ZVSXthan2IxYQbeYbvr8nBtam6S9FQKG0HDZTUtomqRXjo80I5KqLTC2qpVBGECNpb3dlQdNuM90aiNarU7_dg5l2w6kebww32tS1nTTKZdOYz04HrPyQc_-9HNtmjBoTy_dpqttiKLR6zjfaAB974-xN2O_x189NM8x8c_LN6do8TV_6cVxMcsRC6c9fyuyo-LbhnKJI2Jh-Mxl2vh30BFS_rDQmUcY69CL-7gFG947rg9lt54fVKeLnczMCXauRP2fjw4MvgKGiObwggjtM00JG0_cwYi32qdn2ZI9bq9dJMJAZjFLzE1iBcy43oQ2hzJ8ElmCfTuTEhCBc_Y6vFtLDPGTcawxqHoR3kkCBm6RshNKKT1AEYiOJt9rZtSgWNtjkdsXGpalVmobCSFVXyNnu9SFnWeh5_SbPnrWGRQF9NiP-WSXU6_KBG7-Vp79PHY3WCL2vNRaFX0lKLLux0XqmI2EUUy2LR9nzj__NrajA6oPvOQxO-YOuI2hIiwgm5y1ZnV3P7EpHRzLzyPvAbQLAPeg
linkProvider	Wiley-Blackwell
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELZKe4BLy1OUp5FQOWU3ceJkl55gaVn6WC2oq_aAZNkTu6oWslGzK0FP_AAO_EZ-CTPOZrdFICEuySHjxLFn7G_s8TeMPcc-BetyCHB6zdBBQZPSoXRBDJ0ss7lMIvBsn4O0P0r2TuTJCttuzsLU_BCLBTeyDD9ek4HTgnR7yRoKpW2hhqbX2Bol9CarfPNhwR0VUf6CmixVBCHC9oZ5NhTtpuSVuWiNmvXLFaB5Ga76-WZ3g31salqHmYxbs6lpwcVvJI7_-Ss32foch_JXteLcYiu2uM02mhwPfG7yd9j3wc9vP8wk_8rBP6sX-DiFy5_yaoyTFqJ37poQr4pPCu6DFIkek_eGo7anxD6Dipf1mQQqOMOBhF_ew6hecl1w-7k885QlvFyeZ-BLQvK7bLS7c9TrB_MMDgHEcZoGOpK2mxljcVjVritzhFudTpqJxKCbgpfYGkRsuRFdCG3uJLgEy2Q6NyYE4eJ7bLWYFPY-40ajZ-PQu4McEoQtXSOERoCSOgADUbzJXjR9qWBOb05ZNj6pmphZKGxkRY28yZ4tJMua0uMPMlteHRYC-nxMIXCZVMeDt2r4Wh533h_uqX18WaMvCg2Tdlt0YSezSkUUYETuLFZty_f-X7-mesMduj_4V8Gn7Hr_6PBAHbwb7D9kNxDEJRQXJ-Qjtjo9n9nHCJSm5ok3iF8K4ROT
linkToPdf	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NbtQwELZKKyEulPIjSktrJFRO2U2cOMlyg22X0sJqQaxaiYNlj21ULc1Gza5UOPEAHHhGnoSxk-y2CCTEJTnEThx7xvONPf6GkKc4pmCshgDNa4YOCqqUDLkNYsizzGieRODZPofp4Tg5OuWnTVSlOwtT80MsFtycZvj52il4qW13SRoKpemggKY3yFqShrlzvPbfL6ijIpe-oOZKZUGIqL0lng1Zt615zRStuV69vIYzr6JVb24G6-Rj29A6ymTSmc9UB77-xuH4f39yh9xuUCh9UYvNBlkxxV2y3mZ4oI3C3yPfhz-__VBT_YWCf1Yv71EXLP-JVhM0WYjdqW0DvCo6LagPUXTkmLQ_Gnc9IfYZVLSsTyS4inOcRujVHYzqOZUFNeflmScsoeXyNANd0pHfJ-PBwYf-YdDkbwggjtM0kBE3vUwpg5OqtD2uEWzleZqxRKGTgpfYKMRrWrEehEZbDjbBOpnUSoXAbPyArBbTwjwkVEn0ayz6dqAhQdDSU4xJhCepBVAQxZvkWTuUAhpyc5dj47OoaZmZwE4WrpM3yZNFybIm9PhDmT0vDYsC8mLiAuAyLk6Gr8ToJT_J3709Esf4slZcBKql22uRhZnOKxG58CLnzGLT9vzg__Vroj86cPdH_1pwl9wc7Q_Em9fD4y1yCxFc4oLiGN8mq7OLuXmMKGmmdrw6_ALxixJL
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=N-body+computations+using+skeletal+frameworks+on+multicore+CPU%2Fgraphics+processing+unit+architectures%3A+an+empirical+performance+evaluation&rft.jtitle=Concurrency+and+computation&rft.au=Goli%2C+Mehdi&rft.au=Gonz%C3%A1lez-V%C3%A9lez%2C+Horacio&rft.date=2014-03-25&rft.pub=Blackwell+Publishing+Ltd&rft.issn=1532-0626&rft.eissn=1532-0634&rft.volume=26&rft.issue=4&rft.spage=972&rft.epage=986&rft_id=info:doi/10.1002%2Fcpe.3076&rft.externalDBID=n%2Fa&rft.externalDocID=ark_67375_WNG_PB5W8QMJ_K
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1532-0626&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1532-0626&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1532-0626&client=summon