Performance Portability Strategies for Grid C++ Expression Templates

One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C ++ expression template as a starting point, we report on the progress made with regards to the Grid GPU offl...

Full description

Saved in:
Bibliographic Details
Published inEPJ Web of conferences Vol. 175; p. 9006
Main Authors Boyle, Peter A., Clark, M.A., DeTar, Carleton, Lin, Meifeng, Rana, Verinder, Vaquero Avilés-Casco, Alejandro
Format Journal Article Conference Proceeding
LanguageEnglish
Published Les Ulis EDP Sciences 01.01.2018
Subjects
Online AccessGet full text

Cover

Loading…
Abstract One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C ++ expression template as a starting point, we report on the progress made with regards to the Grid GPU offloading strategies. We present both the successes and issues encountered in using CUDA, OpenACC and Just-In-Time compilation. Experimentation and performance on GPUs with a SU(3)×SU(3) streaming test will be reported. We will also report on the challenges of using current OpenMP 4.x for GPU offloading in the same code.
AbstractList One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C++ expression template as a starting point, we report on the progress made with regards to the Grid GPU offloading strategies. We present both the successes and issues encountered in using CUDA, OpenACC and Just-In-Time compilation. Experimentation and performance on GPUs with a SU(3)×SU(3) streaming test will be reported. We will also report on the challenges of using current OpenMP 4.x for GPU offloading in the same code.
One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple architectures. Using the Grid C ++ expression template as a starting point, we report on the progress made with regards to the Grid GPU offloading strategies. We present both the successes and issues encountered in using CUDA, OpenACC and Just-In-Time compilation. Experimentation and performance on GPUs with a SU(3)×SU(3) streaming test will be reported. We will also report on the challenges of using current OpenMP 4.x for GPU offloading in the same code.
Author Clark, M.A.
Boyle, Peter A.
Lin, Meifeng
DeTar, Carleton
Rana, Verinder
Vaquero Avilés-Casco, Alejandro
Author_xml – sequence: 1
  givenname: Peter A.
  surname: Boyle
  fullname: Boyle, Peter A.
– sequence: 2
  givenname: M.A.
  surname: Clark
  fullname: Clark, M.A.
– sequence: 3
  givenname: Carleton
  surname: DeTar
  fullname: DeTar, Carleton
– sequence: 4
  givenname: Meifeng
  surname: Lin
  fullname: Lin, Meifeng
– sequence: 5
  givenname: Verinder
  surname: Rana
  fullname: Rana, Verinder
– sequence: 6
  givenname: Alejandro
  surname: Vaquero Avilés-Casco
  fullname: Vaquero Avilés-Casco, Alejandro
BookMark eNpNkNFLwzAQxoNMUOf-Ax8KPo65S9qkzaPMOYWBAyf4Fq7pZXRsTU06cP-91Yl4L3fc_fi-47tig8Y3xNgNhzsOkk-p3VrfuKkAXvBcggZQZ-xScIAJ8Ox98G--YKMYt9BXqnUq1SV7WFFwPuyxsZSsfOiwrHd1d0xeu4AdbWqKSX9PFqGuktl4nMw_20Ax1r5J1rRvdz0Ur9m5w12k0W8fsrfH-Xr2NFm-LJ5n98uJFVKqSZraUroCe2MhQaisVKoCtBVAmYHjeYWU5UVWcaeFsoJrAZiWOSiLuSKXDtnzSbfyuDVtqPcYjsZjbX4WPmwMhq62OzIOCbSkouBYZRxKLTgKXSGURHluZa91e9Jqg_84UOzM1h9C079vBEgoRKFF2lPZibLBxxjI_blyMN_xm9_4zf_40y-qC3pY
Cites_doi 10.1007/978-3-319-45550-1_20
10.1016/B978-0-12-410397-9.00001-9
ContentType Journal Article
Conference Proceeding
Copyright 2018. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: 2018. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License.
DBID AAYXX
CITATION
8FE
8FG
ABUWG
AFKRA
ARAPS
AZQEC
BENPR
BGLVJ
CCPQU
DWQXO
HCIFZ
P5Z
P62
PHGZM
PHGZT
PIMPY
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
DOA
DOI 10.1051/epjconf/201817509006
DatabaseName CrossRef
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest
Technology Collection
ProQuest One Community College
ProQuest Central Korea
SciTech Premium Collection
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Premium
ProQuest One Academic (New)
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
Publicly Available Content Database
Advanced Technologies & Aerospace Collection
Technology Collection
ProQuest One Academic Middle East (New)
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest One Academic Eastern Edition
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest Technology Collection
ProQuest SciTech Collection
ProQuest Central China
ProQuest Central
Advanced Technologies & Aerospace Database
ProQuest One Applied & Life Sciences
ProQuest One Academic UKI Edition
ProQuest Central Korea
ProQuest Central (New)
ProQuest One Academic
ProQuest One Academic (New)
DatabaseTitleList Publicly Available Content Database
CrossRef

Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Physics
EISSN 2100-014X
ExternalDocumentID oai_doaj_org_article_fae095e881ad410b921a29da0bee77c5
10_1051_epjconf_201817509006
Genre Conference Proceeding
GroupedDBID 4.4
5VS
8FE
8FG
AAFWJ
AAOGA
AAYXX
ABDBF
ABZDU
ACACO
ACRPL
ACUHS
ADBBV
ADMLS
ADNMO
AFKRA
AFPKN
AGQPQ
ALMA_UNASSIGNED_HOLDINGS
ARAPS
BCNDV
BENPR
BGLVJ
CCPQU
CITATION
E3Z
EBS
EJD
ESX
GI~
GROUPED_DOAJ
GX1
HCIFZ
HH5
IPNFZ
KQ8
M~E
OK1
P62
PHGZM
PHGZT
PIMPY
PROAC
RIG
RNS
ABUWG
AZQEC
DWQXO
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PUEGO
ID FETCH-LOGICAL-c2556-33cb5f8a356250264b66d0acd00b40f17dae4784d1f926c21920a3b706ca76ef3
IEDL.DBID DOA
ISSN 2100-014X
2101-6275
IngestDate Wed Aug 27 01:31:44 EDT 2025
Mon Jul 14 08:12:23 EDT 2025
Tue Jul 01 01:22:38 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Language English
License http://creativecommons.org/licenses/by/4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c2556-33cb5f8a356250264b66d0acd00b40f17dae4784d1f926c21920a3b706ca76ef3
Notes ObjectType-Conference Proceeding-1
SourceType-Conference Papers & Proceedings-1
content type line 21
OpenAccessLink https://doaj.org/article/fae095e881ad410b921a29da0bee77c5
PQID 2050828923
PQPubID 1796373
ParticipantIDs doaj_primary_oai_doaj_org_article_fae095e881ad410b921a29da0bee77c5
proquest_journals_2050828923
crossref_primary_10_1051_epjconf_201817509006
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 20180101
PublicationDateYYYYMMDD 2018-01-01
PublicationDate_xml – month: 01
  year: 2018
  text: 20180101
  day: 01
PublicationDecade 2010
PublicationPlace Les Ulis
PublicationPlace_xml – name: Les Ulis
PublicationTitle EPJ Web of conferences
PublicationYear 2018
Publisher EDP Sciences
Publisher_xml – name: EDP Sciences
References R2
R3
R4
R5
R6
R7
R8
R9
R1
References_xml – ident: R7
– ident: R2
– ident: R3
– ident: R1
– ident: R6
  doi: 10.1007/978-3-319-45550-1_20
– ident: R4
– ident: R5
  doi: 10.1016/B978-0-12-410397-9.00001-9
– ident: R8
– ident: R9
SSID ssj0000399356
Score 2.130233
Snippet One of the key requirements for the Lattice QCD Application Development as part of the US Exascale Computing Project is performance portability across multiple...
SourceID doaj
proquest
crossref
SourceType Open Website
Aggregation Database
Index Database
StartPage 9006
SubjectTerms C++ (programming language)
Experimentation
Portability
Quantum chromodynamics
SummonAdditionalLinks – databaseName: ProQuest
  dbid: BENPR
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LSwMxEA7aInjzUbFaJQdvJZjsZl8nobW1eChFWuhtyVMUbWtbQf-9mTTbCoLXzbIsQzIz3-SbbxC6UUo6p2csYVpFhBeCEQEkR2ULxplIbL5hWwzTwYQ_TpNpKLitAq2y8oneUeu5ghq5A-kJqK25fORu8UFgahTcroYRGvuo7lxw7sBXvdMbjp62VRYK8dePcHXQhhGQ5K365xJ2axavDnRa920X5yB0Uhh89Cs-eRn_P17ah57-EWrsmvLwaBtujtGemZ2gA8_gVKtTdD_atQBgzw_1tNdvXOnPmhV26_hh-aJxt93Gva_AgZ3hsXlfvEHS2UCTfm_cHZAwIoEo0A4jcayks6eIAcc4OMVlmmoqlKZUcmpZpoXhWc41s0WUKueeIipimdFUiSw1Nj5Dtdl8Zs4RVoYCeOE2TQ1XWZEnIgZlFslFrHWeNxGpDFMuNkoYpb_BTlgZDFn-NmQTdcB623dBx9o_mC-fy3AsSiuMy_FMnjOhOaOyiJiICi2oNCbLVNJErcr2ZThcq3K3FS7-X75Eh_BHm4pJC9XWy09z5XKItbwOG-UHlyTD2w
  priority: 102
  providerName: ProQuest
Title Performance Portability Strategies for Grid C++ Expression Templates
URI https://www.proquest.com/docview/2050828923
https://doaj.org/article/fae095e881ad410b921a29da0bee77c5
Volume 175
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1NS8NAEB20IngRP7Fayx68leBustkkR1v7oYdSpIXewn6CorG0FfTib3d3k2jBgxcvOSSBXd4k82ayL28BrqQUNulpExAlw4BmnATciRylyQglPDZpqbYYs9GM3s_j-cZWX04TVtoDl8BdG65tFaDTlHBFCRZZSHiYKY6F1kkivXup5byNZsrnYMe7Mav_lYvJtV482QbT2GbfcpqjSew2OdrgIm_Z_ysje5oZHMB-VR-im3Jeh7CliyPY9TpNuTqG28mP0B95FagXt36g2mVWr5C9jobLR4V6nQ7qv1dK1wJN9cvi2ZWWJzAb9Ke9UVBthBBI5xAWRJEUFjUeuW7FNk1UMKYwlwpjQbEhieKaJilVxGQhkzYJhZhHIsFM8oRpE51Co3gt9BkgqbFrUahhTFOZZGnMI-e_IiiPlErTJgQ1JPmi9LvI_Tp1TPIKwnwTwiZ0HW7f9zq3an_CxjCvYpj_FcMmtGrU8-oVWtlRYmevZwvQ8_8Y4wL23LzLryctaKyXb_rS1hNr0YbtdDBsw063P548tP2DZI_Du88vrY_LQg
linkProvider Directory of Open Access Journals
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LTxsxEB5RUNXeKAWRFqgP9ISs2mvv64CQCoTwKOIQJG6unwhEkzShAv4Uv7EeZzdBqtQb1_Vq5R3PeGbsb74B2LbWxE3PB8qdzaisNacaQY421FxynYdqirY4L3qX8uQqv1qA57YWBmGV7Z6YNmo3tHhGHpP0HNnWYjyyN_pNsWsU3q62LTSmanHqnx5iyjbZPT6I6_s1y7qH_f0ebboKUIt0W1QIa-IUtMDQP2Yg0hSFY9o6xoxkgZdOe1lW0vFQZ4WNFp0xLUzJCqvLwgcRv_sGlqQQNVpU1T2anekw9PapYWxMpDhFAuC2Wi_n3_zoNqa4If5J9KroqBm2WXrhDVPTgH98QnJ03WVYnZcAkouZc_sAC36wAm8TXtROPsLBxbzggCQ0agLZPpGW7dZPSBwnR-MbR_Z3dsjhY4O4HZC-_zW6wxB3FS5fRXRrsDgYDvw6EOsZpkoyFIWXtqyrXAvkgTFSC-eqqgO0FYwaTXk3VLovz7lqBKleCrID31F6s3eRNTs9GI6vVWOEKmgfI0pfVVw7yZmpM66z2mlmvC9Lm3dgo5W9akx5ouaK9-n_w1_gXa__40ydHZ-ffob3OLvpWc0GLN6P__jNGL3cm62kMgR-vraO_gV7N_4R
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=EPJ+Web+of+conferences&rft.atitle=Performance+Portability+Strategies+for+Grid+C%2B%2B+Expression+Templates&rft.au=Boyle%2C+Peter+A&rft.au=Clark%2C+MA&rft.au=DeTar%2C+Carleton&rft.au=Lin%2C+Meifeng&rft.date=2018-01-01&rft.pub=EDP+Sciences&rft.issn=2101-6275&rft.eissn=2100-014X&rft.volume=175&rft_id=info:doi/10.1051%2Fepjconf%2F201817509006
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2100-014X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2100-014X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2100-014X&client=summon