N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation
SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches ma...
Saved in:
Published in | Concurrency and computation Vol. 26; no. 4; pp. 972 - 986 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Blackwell Publishing Ltd
25.03.2014
|
Subjects | |
Online Access | Get full text |
ISSN | 1532-0626 1532-0634 |
DOI | 10.1002/cpe.3076 |
Cover
Loading…
Abstract | SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd. |
---|---|
AbstractList | SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd. With the emergence of general-purpose computation on graphics processing units, high-level approaches that hide the conceptual complexity of the low-level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade-off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all-pairs N-body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed-up over serial version with a Tesla M2050 with lower implementation complexity than low-level Compute Unified Device Architecture programming. Copyright copyright 2013 John Wiley & Sons, Ltd. With the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N ‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd. |
Author | González-Vélez, Horacio Goli, Mehdi |
Author_xml | – sequence: 1 givenname: Mehdi surname: Goli fullname: Goli, Mehdi email: Correspondence to: Mehdi Goli, Robert Gordon University, IDEAS Research Institute, St Andrew Street, Aberdeen AB25 1HG, Scotland, United Kingdom., m.goli@rgu.ac.uk organization: IDEAS Research Institute, Robert Gordon University, Aberdeen, UK – sequence: 2 givenname: Horacio surname: González-Vélez fullname: González-Vélez, Horacio organization: Cloud Competency Centre, National College of Ireland, Dublin, Ireland |
BookMark | eNp1kMtO4zAUhi3ESFwGiUfwkk2KHde5sIMKGKADRQKxtBznBEwdO9jOQF-Bpyalo46Qho2PF9__n6NvB21aZwGhfUpGlJD0UHUwYiTPNtA25SxNSMbGm-t_mm2hnRCeCaGUMLqN3q-TytULrFzb9VFG7WzAfdD2EYc5GIjS4MbLFl6dnwfsLG57E7VyHvBkdn_46GX3pFXAnXcKwmewtzpi6dWTjqBi7yEcYWkxtJ32Wg2FHfjG-VZaBRj-SNN_7v2JfjTSBNj7O3fR_dnp3eRXMr05v5gcTxPFWJYlknIo86oCzgrZlLwmPCuKLE_HFS_48DCoOB3XVVoqAnXDVTMeMrmsq4qotGG76GDVO5z80kOIotVBgTHSguuDoDwlZVqUJRvQ0QpV3oXgoRFKryRFL7URlIildDFIF0vp_7rXgc7rVvrF_9Bkhb5qA4tvOTGZnX7ldYjwtualn4ssZzkXD9fnYnbCH4rb35fiin0AIfyl2g |
CitedBy_id | crossref_primary_10_1002_cpe_3077 crossref_primary_10_1016_j_future_2013_12_038 crossref_primary_10_1007_s10766_016_0419_4 crossref_primary_10_1007_s10766_017_0511_4 |
Cites_doi | 10.1142/S0129626405002209 10.1145/1863482.1863487 10.1887/0852743920 10.1007/978‐3‐642‐23397‐5_17 10.1063/1.1704156 10.1002/cpe.1860 10.1002/spe.1026 10.1038/324446a0 10.1111/j.1467‐8659.2007.01012.x |
ContentType | Journal Article |
Copyright | Copyright © 2013 John Wiley & Sons, Ltd. |
Copyright_xml | – notice: Copyright © 2013 John Wiley & Sons, Ltd. |
DBID | BSCLL AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D |
DOI | 10.1002/cpe.3076 |
DatabaseName | Istex CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Computer and Information Systems Abstracts CrossRef |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISSN | 1532-0634 |
EndPage | 986 |
ExternalDocumentID | 10_1002_cpe_3076 CPE3076 ark_67375_WNG_PB5W8QMJ_K |
Genre | article |
GrantInformation_xml | – fundername: EU's Seventh Framework Programme funderid: 288570 – fundername: European Commission: FP7 STREP ŞParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore SystemsŤ; funderid: 288570 |
GroupedDBID | .3N .DC .GA .Y3 05W 0R~ 10A 1L6 1OC 33P 3SF 3WU 4.4 50Y 50Z 51W 51X 52M 52N 52O 52P 52S 52T 52U 52W 52X 5GY 5VS 66C 702 7PT 8-0 8-1 8-3 8-4 8-5 8UM 930 A03 AAESR AAEVG AAHHS AANLZ AAONW AASGY AAXRX AAZKR ABCQN ABCUV ABEML ABIJN ACAHQ ACBWZ ACCFJ ACCZN ACPOU ACSCC ACXBN ACXQS ADBBV ADEOM ADIZJ ADKYN ADMGS ADOZA ADXAS ADZMN ADZOD AEEZP AEIGN AEIMD AEQDE AEUQT AEUYR AFBPY AFFPM AFGKR AFPWT AHBTC AITYG AIURR AIWBW AJBDE AJXKR ALMA_UNASSIGNED_HOLDINGS ALUQN AMBMR AMYDB ATUGU AUFTA AZBYB BAFTC BDRZF BFHJK BHBCM BMNLL BROTX BRXPI BSCLL BY8 CS3 D-E D-F DCZOG DPXWK DR2 DRFUL DRSTM EBS EJD F00 F01 F04 F5P G-S G.N GNP GODZA HGLYW HHY HZ~ IX1 JPC KQQ LATKE LAW LC2 LC3 LEEKS LH4 LITHE LOXES LP6 LP7 LUTES LW6 LYRES MEWTI MK4 MRFUL MRSTM MSFUL MSSTM MXFUL MXSTM N04 N05 N9A O66 O9- OIG P2W P2X P4D PQQKQ Q.N Q11 QB0 QRW R.K ROL RWI RX1 SUPJJ TN5 UB1 V2E W8V W99 WBKPD WIH WIK WOHZO WQJ WRC WXSBR WYISQ WZISG XG1 XV2 ~IA ~WT AAHQN AAMNL AANHP AAYCA ACRPL ACYXJ ADNMO AFWVQ ALVPJ AAYXX ADMLS AEYWJ AGQPQ AGYGG CITATION 1OB 7SC 8FD JQ2 L7M L~C L~D |
ID | FETCH-LOGICAL-c3366-a15e97bbe538af95d056886724b5854b53eb514db29c0edf5cf4a157adbb0c2f3 |
IEDL.DBID | DR2 |
ISSN | 1532-0626 |
IngestDate | Thu Sep 04 19:04:44 EDT 2025 Tue Jul 01 00:33:51 EDT 2025 Thu Apr 24 22:53:16 EDT 2025 Wed Jan 22 16:47:55 EST 2025 Wed Oct 30 09:48:08 EDT 2024 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Language | English |
License | http://onlinelibrary.wiley.com/termsAndConditions#vor |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c3366-a15e97bbe538af95d056886724b5854b53eb514db29c0edf5cf4a157adbb0c2f3 |
Notes | istex:16AE1610BB93CFFEE7FDB6DDBD860E707611989F ArticleID:CPE3076 European Commission: FP7 STREP ŞParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore SystemsŤ; - No. 288570 EU's Seventh Framework Programme - No. 288570 ark:/67375/WNG-PB5W8QMJ-K ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
PQID | 1520928993 |
PQPubID | 23500 |
PageCount | 15 |
ParticipantIDs | proquest_miscellaneous_1520928993 crossref_citationtrail_10_1002_cpe_3076 crossref_primary_10_1002_cpe_3076 wiley_primary_10_1002_cpe_3076_CPE3076 istex_primary_ark_67375_WNG_PB5W8QMJ_K |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 25 March 2014 |
PublicationDateYYYYMMDD | 2014-03-25 |
PublicationDate_xml | – month: 03 year: 2014 text: 25 March 2014 day: 25 |
PublicationDecade | 2010 |
PublicationTitle | Concurrency and computation |
PublicationTitleAlternate | Concurrency Computat.: Pract. Exper |
PublicationYear | 2014 |
Publisher | Blackwell Publishing Ltd |
Publisher_xml | – name: Blackwell Publishing Ltd |
References | Hockney R, Eastwood J. Computer Simulation Using Particles. Taylor & Francis: Abingdon, 1988. Nyland L, Harris M, Prins J. Fast N-body simulation with CUDA. GPU Gems 2007; 3:677-695. McGuire JB. Study of exactly soluble one-dimensional N-body problems. Journal of Mathematical Physics 1964; 5(5):622-636. DOI: 10.1063/1.1704156. González-Vélez H, Leyton M. A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers. Software-Practice and Experience 2010; 40(12):1135-1160. DOI: 10.1002/spe.1026. Bischof H, Gorlatch S, Leshchinskiy R, Müller J. Data parallelism in C++ template programs: a Barnes-Hut case study. Parallel Processing Letters 2005; 15(3):257-272. DOI: 10.1142/S0129626405002209. Barnes J, Hut P. A hierarchical O(N log N) force-calculation algorithm. Nature 1986; 324:446-449. DOI: 10.1038/324446a0. Malik M, Li T, Sharif U, Shahid R, El-Ghazawi T, Newby G. Productivity of GPUs under different programming paradigms. Concurrency and Computation: Practice and Experience 2012; 24(2):179-191. DOI: 10.1002/cpe.1860. Owens JD, Luebke D, Govindaraju N, Harris M, Krüger J, Lefohn AE, Purcell TJ. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 2007; 26(1):80-113. DOI: 10.1111/j.1467-8659.2007.01012.x. Cole M. Algorithmic Skeletons: Structured Management of Parallel Computation, Research Monographs in Parallel and Distributed Computing. MIT Press/Pitman: London, 1989. 2011; 6853 1986; 324 2012 2011 2010 1964; 5 1987 2009 2007; 3 2005; 15 2012; 24 2010; 40 2007; 26 1989 1988 e_1_2_8_17_1 e_1_2_8_18_1 e_1_2_8_19_1 e_1_2_8_13_1 e_1_2_8_14_1 e_1_2_8_16_1 e_1_2_8_3_1 e_1_2_8_2_1 e_1_2_8_5_1 e_1_2_8_4_1 e_1_2_8_7_1 e_1_2_8_6_1 e_1_2_8_9_1 Bell N (e_1_2_8_11_1) 2011 Cole M (e_1_2_8_8_1) 1989 e_1_2_8_10_1 e_1_2_8_12_1 Nyland L (e_1_2_8_15_1) 2007; 3 |
References_xml | – reference: Cole M. Algorithmic Skeletons: Structured Management of Parallel Computation, Research Monographs in Parallel and Distributed Computing. MIT Press/Pitman: London, 1989. – reference: Hockney R, Eastwood J. Computer Simulation Using Particles. Taylor & Francis: Abingdon, 1988. – reference: Owens JD, Luebke D, Govindaraju N, Harris M, Krüger J, Lefohn AE, Purcell TJ. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 2007; 26(1):80-113. DOI: 10.1111/j.1467-8659.2007.01012.x. – reference: González-Vélez H, Leyton M. A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers. Software-Practice and Experience 2010; 40(12):1135-1160. DOI: 10.1002/spe.1026. – reference: Malik M, Li T, Sharif U, Shahid R, El-Ghazawi T, Newby G. Productivity of GPUs under different programming paradigms. Concurrency and Computation: Practice and Experience 2012; 24(2):179-191. DOI: 10.1002/cpe.1860. – reference: Bischof H, Gorlatch S, Leshchinskiy R, Müller J. Data parallelism in C++ template programs: a Barnes-Hut case study. Parallel Processing Letters 2005; 15(3):257-272. DOI: 10.1142/S0129626405002209. – reference: Barnes J, Hut P. A hierarchical O(N log N) force-calculation algorithm. Nature 1986; 324:446-449. DOI: 10.1038/324446a0. – reference: Nyland L, Harris M, Prins J. Fast N-body simulation with CUDA. GPU Gems 2007; 3:677-695. – reference: McGuire JB. Study of exactly soluble one-dimensional N-body problems. Journal of Mathematical Physics 1964; 5(5):622-636. DOI: 10.1063/1.1704156. – volume: 324 start-page: 446 year: 1986 end-page: 449 article-title: A hierarchical ( log ) force‐calculation algorithm publication-title: Nature – year: 2011 – year: 2009 – volume: 15 start-page: 257 issue: 3 year: 2005 end-page: 272 article-title: Data parallelism in C++ template programs: a Barnes‐Hut case study publication-title: Parallel Processing Letters – volume: 5 start-page: 622 issue: 5 year: 1964 end-page: 636 article-title: Study of exactly soluble one‐dimensional N‐body problems publication-title: Journal of Mathematical Physics – start-page: 359 year: 2011 end-page: 371 – volume: 6853 start-page: 170 year: 2011 end-page: 181 – volume: 3 start-page: 677 year: 2007 end-page: 695 article-title: Fast N‐body simulation with CUDA publication-title: GPU Gems – year: 1987 – year: 1988 – year: 1989 – volume: 40 start-page: 1135 issue: 12 year: 2010 end-page: 1160 article-title: A survey of algorithmic skeleton frameworks: high‐level structured parallel programming enablers publication-title: Software–Practice and Experience – volume: 24 start-page: 179 issue: 2 year: 2012 end-page: 191 article-title: Productivity of GPUs under different programming paradigms publication-title: Concurrency and Computation: Practice and Experience – volume: 26 start-page: 80 issue: 1 year: 2007 end-page: 113 article-title: A survey of general‐purpose computation on graphics hardware publication-title: Computer Graphics Forum – start-page: 5 year: 2010 end-page: 14 – year: 2012 – start-page: 359 volume-title: GPU Computing Gems year: 2011 ident: e_1_2_8_11_1 – volume: 3 start-page: 677 year: 2007 ident: e_1_2_8_15_1 article-title: Fast N‐body simulation with CUDA publication-title: GPU Gems – ident: e_1_2_8_14_1 doi: 10.1142/S0129626405002209 – ident: e_1_2_8_10_1 doi: 10.1145/1863482.1863487 – ident: e_1_2_8_17_1 – ident: e_1_2_8_19_1 – ident: e_1_2_8_5_1 doi: 10.1887/0852743920 – ident: e_1_2_8_16_1 doi: 10.1007/978‐3‐642‐23397‐5_17 – ident: e_1_2_8_2_1 doi: 10.1063/1.1704156 – ident: e_1_2_8_18_1 – ident: e_1_2_8_12_1 – ident: e_1_2_8_7_1 doi: 10.1002/cpe.1860 – volume-title: Algorithmic Skeletons: Structured Management of Parallel Computation year: 1989 ident: e_1_2_8_8_1 – ident: e_1_2_8_9_1 doi: 10.1002/spe.1026 – ident: e_1_2_8_3_1 doi: 10.1038/324446a0 – ident: e_1_2_8_4_1 – ident: e_1_2_8_13_1 – ident: e_1_2_8_6_1 doi: 10.1111/j.1467‐8659.2007.01012.x |
SSID | ssj0011031 |
Score | 2.040867 |
Snippet | SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the... With the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level... With the emergence of general-purpose computation on graphics processing units, high-level approaches that hide the conceptual complexity of the low-level... |
SourceID | proquest crossref wiley istex |
SourceType | Aggregation Database Enrichment Source Index Database Publisher |
StartPage | 972 |
SubjectTerms | algorithmic skeletons Algorithms Complexity Computation Devices general-purpose computing on graphics processing units GPU Graphics processing units Hardware parallel computing Performance evaluation Serials structured parallelism |
Title | N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation |
URI | https://api.istex.fr/ark:/67375/WNG-PB5W8QMJ-K/fulltext.pdf https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fcpe.3076 https://www.proquest.com/docview/1520928993 |
Volume | 26 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PT9swFLYQXLgM2A8NxiZPmtgpbeLESbsbq2AItKqbVoG0g2W_2AiVpRFppY0TfwCH_Y37S_aek7QwbRLaJTnEThz7Pft79ufPjL3BNgXrcghweM0wQEGX0qF0QQy9LLO5TCLwap_D9GicHJ_Js4ZVSXthan2IxYQbeYbvr8nBtam6S9FQKG0HDZTUtomqRXjo80I5KqLTC2qpVBGECNpb3dlQdNuM90aiNarU7_dg5l2w6kebww32tS1nTTKZdOYz04HrPyQc_-9HNtmjBoTy_dpqttiKLR6zjfaAB974-xN2O_x189NM8x8c_LN6do8TV_6cVxMcsRC6c9fyuyo-LbhnKJI2Jh-Mxl2vh30BFS_rDQmUcY69CL-7gFG947rg9lt54fVKeLnczMCXauRP2fjw4MvgKGiObwggjtM00JG0_cwYi32qdn2ZI9bq9dJMJAZjFLzE1iBcy43oQ2hzJ8ElmCfTuTEhCBc_Y6vFtLDPGTcawxqHoR3kkCBm6RshNKKT1AEYiOJt9rZtSgWNtjkdsXGpalVmobCSFVXyNnu9SFnWeh5_SbPnrWGRQF9NiP-WSXU6_KBG7-Vp79PHY3WCL2vNRaFX0lKLLux0XqmI2EUUy2LR9nzj__NrajA6oPvOQxO-YOuI2hIiwgm5y1ZnV3P7EpHRzLzyPvAbQLAPeg |
linkProvider | Wiley-Blackwell |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELZKe4BLy1OUp5FQOWU3ceJkl55gaVn6WC2oq_aAZNkTu6oWslGzK0FP_AAO_EZ-CTPOZrdFICEuySHjxLFn7G_s8TeMPcc-BetyCHB6zdBBQZPSoXRBDJ0ss7lMIvBsn4O0P0r2TuTJCttuzsLU_BCLBTeyDD9ek4HTgnR7yRoKpW2hhqbX2Bol9CarfPNhwR0VUf6CmixVBCHC9oZ5NhTtpuSVuWiNmvXLFaB5Ga76-WZ3g31salqHmYxbs6lpwcVvJI7_-Ss32foch_JXteLcYiu2uM02mhwPfG7yd9j3wc9vP8wk_8rBP6sX-DiFy5_yaoyTFqJ37poQr4pPCu6DFIkek_eGo7anxD6Dipf1mQQqOMOBhF_ew6hecl1w-7k885QlvFyeZ-BLQvK7bLS7c9TrB_MMDgHEcZoGOpK2mxljcVjVritzhFudTpqJxKCbgpfYGkRsuRFdCG3uJLgEy2Q6NyYE4eJ7bLWYFPY-40ajZ-PQu4McEoQtXSOERoCSOgADUbzJXjR9qWBOb05ZNj6pmphZKGxkRY28yZ4tJMua0uMPMlteHRYC-nxMIXCZVMeDt2r4Wh533h_uqX18WaMvCg2Tdlt0YSezSkUUYETuLFZty_f-X7-mesMduj_4V8Gn7Hr_6PBAHbwb7D9kNxDEJRQXJ-Qjtjo9n9nHCJSm5ok3iF8K4ROT |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NbtQwELZKKyEulPIjSktrJFRO2U2cOMlyg22X0sJqQaxaiYNlj21ULc1Gza5UOPEAHHhGnoSxk-y2CCTEJTnEThx7xvONPf6GkKc4pmCshgDNa4YOCqqUDLkNYsizzGieRODZPofp4Tg5OuWnTVSlOwtT80MsFtycZvj52il4qW13SRoKpemggKY3yFqShrlzvPbfL6ijIpe-oOZKZUGIqL0lng1Zt615zRStuV69vIYzr6JVb24G6-Rj29A6ymTSmc9UB77-xuH4f39yh9xuUCh9UYvNBlkxxV2y3mZ4oI3C3yPfhz-__VBT_YWCf1Yv71EXLP-JVhM0WYjdqW0DvCo6LagPUXTkmLQ_Gnc9IfYZVLSsTyS4inOcRujVHYzqOZUFNeflmScsoeXyNANd0pHfJ-PBwYf-YdDkbwggjtM0kBE3vUwpg5OqtD2uEWzleZqxRKGTgpfYKMRrWrEehEZbDjbBOpnUSoXAbPyArBbTwjwkVEn0ayz6dqAhQdDSU4xJhCepBVAQxZvkWTuUAhpyc5dj47OoaZmZwE4WrpM3yZNFybIm9PhDmT0vDYsC8mLiAuAyLk6Gr8ToJT_J3709Esf4slZcBKql22uRhZnOKxG58CLnzGLT9vzg__Vroj86cPdH_1pwl9wc7Q_Em9fD4y1yCxFc4oLiGN8mq7OLuXmMKGmmdrw6_ALxixJL |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=N-body+computations+using+skeletal+frameworks+on+multicore+CPU%2Fgraphics+processing+unit+architectures%3A+an+empirical+performance+evaluation&rft.jtitle=Concurrency+and+computation&rft.au=Goli%2C+Mehdi&rft.au=Gonz%C3%A1lez-V%C3%A9lez%2C+Horacio&rft.date=2014-03-25&rft.pub=Blackwell+Publishing+Ltd&rft.issn=1532-0626&rft.eissn=1532-0634&rft.volume=26&rft.issue=4&rft.spage=972&rft.epage=986&rft_id=info:doi/10.1002%2Fcpe.3076&rft.externalDBID=n%2Fa&rft.externalDocID=ark_67375_WNG_PB5W8QMJ_K |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1532-0626&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1532-0626&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1532-0626&client=summon |