Performance Portability Strategies for Grid C++ Expression Templates
One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C ++ expression template as a starting point, we report on the progress made with regards to the Grid GPU offl...
Saved in:
Published in | EPJ Web of conferences Vol. 175; p. 9006 |
---|---|
Main Authors | , , , , , |
Format | Journal Article Conference Proceeding |
Language | English |
Published |
Les Ulis
EDP Sciences
01.01.2018
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C
++
expression template as a starting point, we report on the progress made with regards to the Grid GPU offloading strategies. We present both the successes and issues encountered in using CUDA, OpenACC and Just-In-Time compilation. Experimentation and performance on GPUs with a SU(3)×SU(3) streaming test will be reported. We will also report on the challenges of using current OpenMP 4.x for GPU offloading in the same code. |
---|---|
AbstractList | One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C++ expression template as a starting point, we report on the progress made with regards to the Grid GPU offloading strategies. We present both the successes and issues encountered in using CUDA, OpenACC and Just-In-Time compilation. Experimentation and performance on GPUs with a SU(3)×SU(3) streaming test will be reported. We will also report on the challenges of using current OpenMP 4.x for GPU offloading in the same code. One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C ++ expression template as a starting point, we report on the progress made with regards to the Grid GPU offloading strategies. We present both the successes and issues encountered in using CUDA, OpenACC and Just-In-Time compilation. Experimentation and performance on GPUs with a SU(3)×SU(3) streaming test will be reported. We will also report on the challenges of using current OpenMP 4.x for GPU offloading in the same code. |
Author | Clark, M.A. Boyle, Peter A. Lin, Meifeng DeTar, Carleton Rana, Verinder Vaquero Avilés-Casco, Alejandro |
Author_xml | – sequence: 1 givenname: Peter A. surname: Boyle fullname: Boyle, Peter A. – sequence: 2 givenname: M.A. surname: Clark fullname: Clark, M.A. – sequence: 3 givenname: Carleton surname: DeTar fullname: DeTar, Carleton – sequence: 4 givenname: Meifeng surname: Lin fullname: Lin, Meifeng – sequence: 5 givenname: Verinder surname: Rana fullname: Rana, Verinder – sequence: 6 givenname: Alejandro surname: Vaquero Avilés-Casco fullname: Vaquero Avilés-Casco, Alejandro |
BookMark | eNpNkNFLwzAQxoNMUOf-Ax8KPo65S9qkzaPMOYWBAyf4Fq7pZXRsTU06cP-91Yl4L3fc_fi-47tig8Y3xNgNhzsOkk-p3VrfuKkAXvBcggZQZ-xScIAJ8Ox98G--YKMYt9BXqnUq1SV7WFFwPuyxsZSsfOiwrHd1d0xeu4AdbWqKSX9PFqGuktl4nMw_20Ax1r5J1rRvdz0Ur9m5w12k0W8fsrfH-Xr2NFm-LJ5n98uJFVKqSZraUroCe2MhQaisVKoCtBVAmYHjeYWU5UVWcaeFsoJrAZiWOSiLuSKXDtnzSbfyuDVtqPcYjsZjbX4WPmwMhq62OzIOCbSkouBYZRxKLTgKXSGURHluZa91e9Jqg_84UOzM1h9C079vBEgoRKFF2lPZibLBxxjI_blyMN_xm9_4zf_40y-qC3pY |
Cites_doi | 10.1007/978-3-319-45550-1_20 10.1016/B978-0-12-410397-9.00001-9 |
ContentType | Journal Article Conference Proceeding |
Copyright | 2018. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License. |
Copyright_xml | – notice: 2018. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License. |
DBID | AAYXX CITATION 8FE 8FG ABUWG AFKRA ARAPS AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ P5Z P62 PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRINS DOA |
DOI | 10.1051/epjconf/201817509006 |
DatabaseName | CrossRef ProQuest SciTech Collection ProQuest Technology Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Technology Collection ProQuest One Community College ProQuest Central Korea SciTech Premium Collection Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Premium ProQuest One Academic (New) Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China DOAJ Directory of Open Access Journals |
DatabaseTitle | CrossRef Publicly Available Content Database Advanced Technologies & Aerospace Collection Technology Collection ProQuest One Academic Middle East (New) ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central Advanced Technologies & Aerospace Database ProQuest One Applied & Life Sciences ProQuest One Academic UKI Edition ProQuest Central Korea ProQuest Central (New) ProQuest One Academic ProQuest One Academic (New) |
DatabaseTitleList | Publicly Available Content Database CrossRef |
Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Physics |
EISSN | 2100-014X |
ExternalDocumentID | oai_doaj_org_article_fae095e881ad410b921a29da0bee77c5 10_1051_epjconf_201817509006 |
Genre | Conference Proceeding |
GroupedDBID | 4.4 5VS 8FE 8FG AAFWJ AAOGA AAYXX ABDBF ABZDU ACACO ACRPL ACUHS ADBBV ADMLS ADNMO AFKRA AFPKN AGQPQ ALMA_UNASSIGNED_HOLDINGS ARAPS BCNDV BENPR BGLVJ CCPQU CITATION E3Z EBS EJD ESX GI~ GROUPED_DOAJ GX1 HCIFZ HH5 IPNFZ KQ8 M~E OK1 P62 PHGZM PHGZT PIMPY PROAC RIG RNS ABUWG AZQEC DWQXO PKEHL PQEST PQGLB PQQKQ PQUKI PRINS PUEGO |
ID | FETCH-LOGICAL-c2556-33cb5f8a356250264b66d0acd00b40f17dae4784d1f926c21920a3b706ca76ef3 |
IEDL.DBID | DOA |
ISSN | 2100-014X 2101-6275 |
IngestDate | Wed Aug 27 01:31:44 EDT 2025 Mon Jul 14 08:12:23 EDT 2025 Tue Jul 01 01:22:38 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Language | English |
License | http://creativecommons.org/licenses/by/4.0 |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c2556-33cb5f8a356250264b66d0acd00b40f17dae4784d1f926c21920a3b706ca76ef3 |
Notes | ObjectType-Conference Proceeding-1 SourceType-Conference Papers & Proceedings-1 content type line 21 |
OpenAccessLink | https://doaj.org/article/fae095e881ad410b921a29da0bee77c5 |
PQID | 2050828923 |
PQPubID | 1796373 |
ParticipantIDs | doaj_primary_oai_doaj_org_article_fae095e881ad410b921a29da0bee77c5 proquest_journals_2050828923 crossref_primary_10_1051_epjconf_201817509006 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 20180101 |
PublicationDateYYYYMMDD | 2018-01-01 |
PublicationDate_xml | – month: 01 year: 2018 text: 20180101 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | Les Ulis |
PublicationPlace_xml | – name: Les Ulis |
PublicationTitle | EPJ Web of conferences |
PublicationYear | 2018 |
Publisher | EDP Sciences |
Publisher_xml | – name: EDP Sciences |
References | R2 R3 R4 R5 R6 R7 R8 R9 R1 |
References_xml | – ident: R7 – ident: R2 – ident: R3 – ident: R1 – ident: R6 doi: 10.1007/978-3-319-45550-1_20 – ident: R4 – ident: R5 doi: 10.1016/B978-0-12-410397-9.00001-9 – ident: R8 – ident: R9 |
SSID | ssj0000399356 |
Score | 2.130233 |
Snippet | One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple... |
SourceID | doaj proquest crossref |
SourceType | Open Website Aggregation Database Index Database |
StartPage | 9006 |
SubjectTerms | C++ (programming language) Experimentation Portability Quantum chromodynamics |
SummonAdditionalLinks | – databaseName: ProQuest dbid: BENPR link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LSwMxEA7aInjzUbFaJQdvJZjsZl8nobW1eChFWuhtyVMUbWtbQf-9mTTbCoLXzbIsQzIz3-SbbxC6UUo6p2csYVpFhBeCEQEkR2ULxplIbL5hWwzTwYQ_TpNpKLitAq2y8oneUeu5ghq5A-kJqK25fORu8UFgahTcroYRGvuo7lxw7sBXvdMbjp62VRYK8dePcHXQhhGQ5K365xJ2axavDnRa920X5yB0Uhh89Cs-eRn_P17ah57-EWrsmvLwaBtujtGemZ2gA8_gVKtTdD_atQBgzw_1tNdvXOnPmhV26_hh-aJxt93Gva_AgZ3hsXlfvEHS2UCTfm_cHZAwIoEo0A4jcayks6eIAcc4OMVlmmoqlKZUcmpZpoXhWc41s0WUKueeIipimdFUiSw1Nj5Dtdl8Zs4RVoYCeOE2TQ1XWZEnIgZlFslFrHWeNxGpDFMuNkoYpb_BTlgZDFn-NmQTdcB623dBx9o_mC-fy3AsSiuMy_FMnjOhOaOyiJiICi2oNCbLVNJErcr2ZThcq3K3FS7-X75Eh_BHm4pJC9XWy09z5XKItbwOG-UHlyTD2w priority: 102 providerName: ProQuest |
Title | Performance Portability Strategies for Grid C++ Expression Templates |
URI | https://www.proquest.com/docview/2050828923 https://doaj.org/article/fae095e881ad410b921a29da0bee77c5 |
Volume | 175 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1NS8NAEB20IngRP7Fayx68leBustkkR1v7oYdSpIXewn6CorG0FfTib3d3k2jBgxcvOSSBXd4k82ayL28BrqQUNulpExAlw4BmnATciRylyQglPDZpqbYYs9GM3s_j-cZWX04TVtoDl8BdG65tFaDTlHBFCRZZSHiYKY6F1kkivXup5byNZsrnYMe7Mav_lYvJtV482QbT2GbfcpqjSew2OdrgIm_Z_ysje5oZHMB-VR-im3Jeh7CliyPY9TpNuTqG28mP0B95FagXt36g2mVWr5C9jobLR4V6nQ7qv1dK1wJN9cvi2ZWWJzAb9Ke9UVBthBBI5xAWRJEUFjUeuW7FNk1UMKYwlwpjQbEhieKaJilVxGQhkzYJhZhHIsFM8oRpE51Co3gt9BkgqbFrUahhTFOZZGnMI-e_IiiPlErTJgQ1JPmi9LvI_Tp1TPIKwnwTwiZ0HW7f9zq3an_CxjCvYpj_FcMmtGrU8-oVWtlRYmevZwvQ8_8Y4wL23LzLryctaKyXb_rS1hNr0YbtdDBsw063P548tP2DZI_Du88vrY_LQg |
linkProvider | Directory of Open Access Journals |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LTxsxEB5RUNXeKAWRFqgP9ISs2mvv64CQCoTwKOIQJG6unwhEkzShAv4Uv7EeZzdBqtQb1_Vq5R3PeGbsb74B2LbWxE3PB8qdzaisNacaQY421FxynYdqirY4L3qX8uQqv1qA57YWBmGV7Z6YNmo3tHhGHpP0HNnWYjyyN_pNsWsU3q62LTSmanHqnx5iyjbZPT6I6_s1y7qH_f0ebboKUIt0W1QIa-IUtMDQP2Yg0hSFY9o6xoxkgZdOe1lW0vFQZ4WNFp0xLUzJCqvLwgcRv_sGlqQQNVpU1T2anekw9PapYWxMpDhFAuC2Wi_n3_zoNqa4If5J9KroqBm2WXrhDVPTgH98QnJ03WVYnZcAkouZc_sAC36wAm8TXtROPsLBxbzggCQ0agLZPpGW7dZPSBwnR-MbR_Z3dsjhY4O4HZC-_zW6wxB3FS5fRXRrsDgYDvw6EOsZpkoyFIWXtqyrXAvkgTFSC-eqqgO0FYwaTXk3VLovz7lqBKleCrID31F6s3eRNTs9GI6vVWOEKmgfI0pfVVw7yZmpM66z2mlmvC9Lm3dgo5W9akx5ouaK9-n_w1_gXa__40ydHZ-ffob3OLvpWc0GLN6P__jNGL3cm62kMgR-vraO_gV7N_4R |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=EPJ+Web+of+conferences&rft.atitle=Performance+Portability+Strategies+for+Grid+C%2B%2B+Expression+Templates&rft.au=Boyle%2C+Peter+A&rft.au=Clark%2C+MA&rft.au=DeTar%2C+Carleton&rft.au=Lin%2C+Meifeng&rft.date=2018-01-01&rft.pub=EDP+Sciences&rft.issn=2101-6275&rft.eissn=2100-014X&rft.volume=175&rft_id=info:doi/10.1051%2Fepjconf%2F201817509006 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2100-014X&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2100-014X&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2100-014X&client=summon |