OpenMP application experiences: Porting to accelerated nodes
As recent enhancements to the OpenMP specification become available in its implementations, there is a need to share the results of experimentation in order to better understand the OpenMP implementation’s behavior in practice, to identify pitfalls, and to learn how the implementations can be effect...
Saved in:
Published in | Parallel computing Vol. 109; no. C; p. 102856 |
---|---|
Main Authors | , , , , , , , , , , , , , , , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
Netherlands
Elsevier B.V
01.03.2022
Elsevier |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | As recent enhancements to the OpenMP specification become available in its implementations, there is a need to share the results of experimentation in order to better understand the OpenMP implementation’s behavior in practice, to identify pitfalls, and to learn how the implementations can be effectively deployed in scientific codes. We report on experiences gained and practices adopted when using OpenMP to port a variety of ECP applications, mini-apps and libraries based on different computational motifs to accelerator-based leadership-class high-performance supercomputer systems at the United States Department of Energy. Additionally, we identify important challenges and open problems related to the deployment of OpenMP. Through our report of experiences, we find that OpenMP implementations are successful on current supercomputing platforms and that OpenMP is a promising programming model to use for applications to be run on emerging and future platforms with accelerated nodes. |
---|---|
AbstractList | As recent enhancements to the OpenMP specification become available in its implementations, there is a need to share the results of experimentation in order to better understand the OpenMP implementation’s behavior in practice, to identify pitfalls, and to learn how the implementations can be effectively deployed in scientific codes. We report on experiences gained and practices adopted when using OpenMP to port a variety of ECP applications, mini-apps and libraries based on different computational motifs to accelerator-based leadership-class high-performance supercomputer systems at the United States Department of Energy. Additionally, we identify important challenges and open problems related to the deployment of OpenMP. Through our report of experiences, we find that OpenMP implementations are successful on current supercomputing platforms and that OpenMP is a promising programming model to use for applications to be run on emerging and future platforms with accelerated nodes. |
ArticleNumber | 102856 |
Author | Scogland, Thomas Kale, Vivek Kwack, JaeHyuk Ravikumar, Kiran Doerfert, Johannes Iwasaki, Shintaro Luszczek, Piotr Tian, Shilei Pophale, Swaroop Budiardja, Reuben Boehm, Swen Hernandez, Oscar Finkel, Hal Chapman, Barbara M. Bak, Seonmyeong Yeung, P.K. Lin, Meifeng Luo, Ye Huber, Joseph Kent, Paul R.C. Bertoni, Colleen Sarkar, Vivek Eisenbach, Markus Pham, Buu |
Author_xml | – sequence: 1 givenname: Seonmyeong surname: Bak fullname: Bak, Seonmyeong email: sbak5@gatech.edu organization: Georgia Institute of Technology, Atlanta, GA 30332, USA – sequence: 2 givenname: Colleen surname: Bertoni fullname: Bertoni, Colleen email: bertoni@anl.gov organization: Argonne National Laboratory, Lemont, IL 60439, USA – sequence: 3 givenname: Swen surname: Boehm fullname: Boehm, Swen email: boehms@ornl.gov organization: Oak Ridge National Laboratory, One Bethel Valley Rd., Oak Ridge, TN 37830, USA – sequence: 4 givenname: Reuben surname: Budiardja fullname: Budiardja, Reuben email: budiardjard@ornl.gov organization: Oak Ridge National Laboratory, One Bethel Valley Rd., Oak Ridge, TN 37830, USA – sequence: 5 givenname: Barbara M. surname: Chapman fullname: Chapman, Barbara M. email: barbara.chapman@stonybrook.edu organization: Brookhaven National Laboratory, Upton, NY 11973, USA – sequence: 6 givenname: Johannes surname: Doerfert fullname: Doerfert, Johannes email: jdoerfert@anl.gov organization: Argonne National Laboratory, Lemont, IL 60439, USA – sequence: 7 givenname: Markus surname: Eisenbach fullname: Eisenbach, Markus email: eisenbachm@ornl.gov organization: Oak Ridge National Laboratory, One Bethel Valley Rd., Oak Ridge, TN 37830, USA – sequence: 8 givenname: Hal surname: Finkel fullname: Finkel, Hal email: hfinkel@anl.gov organization: Argonne National Laboratory, Lemont, IL 60439, USA – sequence: 9 givenname: Oscar surname: Hernandez fullname: Hernandez, Oscar email: oscar@ornl.gov organization: Oak Ridge National Laboratory, One Bethel Valley Rd., Oak Ridge, TN 37830, USA – sequence: 10 givenname: Joseph surname: Huber fullname: Huber, Joseph email: huberjn@ornl.gov organization: Oak Ridge National Laboratory, One Bethel Valley Rd., Oak Ridge, TN 37830, USA – sequence: 11 givenname: Shintaro surname: Iwasaki fullname: Iwasaki, Shintaro email: siwasaki@anl.gov organization: Argonne National Laboratory, Lemont, IL 60439, USA – sequence: 12 givenname: Vivek orcidid: 0000-0003-4687-1226 surname: Kale fullname: Kale, Vivek email: vkale@bnl.gov organization: Brookhaven National Laboratory, Upton, NY 11973, USA – sequence: 13 givenname: Paul R.C. surname: Kent fullname: Kent, Paul R.C. email: kentpr@ornl.gov organization: Oak Ridge National Laboratory, One Bethel Valley Rd., Oak Ridge, TN 37830, USA – sequence: 14 givenname: JaeHyuk surname: Kwack fullname: Kwack, JaeHyuk email: jkwack@anl.gov organization: Argonne National Laboratory, Lemont, IL 60439, USA – sequence: 15 givenname: Meifeng surname: Lin fullname: Lin, Meifeng email: mlin@bnl.gov organization: Brookhaven National Laboratory, Upton, NY 11973, USA – sequence: 16 givenname: Piotr surname: Luszczek fullname: Luszczek, Piotr email: luszczek@icl.utk.edu organization: University of Tennessee, Knoxville TN, 37996, USA – sequence: 17 givenname: Ye surname: Luo fullname: Luo, Ye email: yeluo@anl.gov organization: Argonne National Laboratory, Lemont, IL 60439, USA – sequence: 18 givenname: Buu surname: Pham fullname: Pham, Buu email: buupq@iastate.edu organization: Iowa State University, Ames, IA 50011, USA – sequence: 19 givenname: Swaroop surname: Pophale fullname: Pophale, Swaroop email: pophaless@ornl.gov organization: Oak Ridge National Laboratory, One Bethel Valley Rd., Oak Ridge, TN 37830, USA – sequence: 20 givenname: Kiran surname: Ravikumar fullname: Ravikumar, Kiran email: kiran.r@gatech.edu organization: Georgia Institute of Technology, Atlanta, GA 30332, USA – sequence: 21 givenname: Vivek surname: Sarkar fullname: Sarkar, Vivek email: vsarkar@gatech.edu organization: Georgia Institute of Technology, Atlanta, GA 30332, USA – sequence: 22 givenname: Thomas surname: Scogland fullname: Scogland, Thomas email: scogland1@llnl.gov organization: Lawrence Livermore National Laboratory, Livermore CA 94550, USA – sequence: 23 givenname: Shilei surname: Tian fullname: Tian, Shilei email: shilei.tian@stonybrook.edu organization: Stony Brook University, Stony Brook, NY, 11794, USA – sequence: 24 givenname: P.K. surname: Yeung fullname: Yeung, P.K. email: pk.yeung@ae.gatech.edu organization: Georgia Institute of Technology, Atlanta, GA 30332, USA |
BackLink | https://www.osti.gov/biblio/1829922$$D View this record in Osti.gov |
BookMark | eNqFkMtOwzAQRS1UJErhC9hE7FP8aBIbwQJVvKSidgFry5lMwFWxI9tC8PckDSsWsBppdM_ozjkmE-cdEnLG6JxRVl5s550J4OecctZvuCzKAzJlsuJ5JUQ5IdM-VeWSKXZEjmPcUkrLhaRTcrXu0D1tMtN1OwsmWe8y_OwwWHSA8TLb-JCse82SzwwA7jCYhE3mfIPxhBy2Zhfx9GfOyMvd7fPyIV-t7x-XN6scRFWkXBWypVDX2LIKjKhVw6tWLSoQnGFfrFVYNhSEWZQlYI0oaAtQC0NlzQuUYkbOx7s-Jqsj2ITwBt45hKSZ5Epx3ofEGILgYwzY6i7YdxO-NKN6sKS3em9JD5b0aKmn1C-qv77XkIKxu3_Y65HF_vcPi2GoNmhrbBiaNd7-yX8DaxOGvQ |
CitedBy_id | crossref_primary_10_1016_j_measurement_2024_114782 crossref_primary_10_1021_acs_jctc_1c00820 crossref_primary_10_1002_cpe_8340 crossref_primary_10_1007_s42979_023_02109_0 crossref_primary_10_1007_s11042_022_13635_z crossref_primary_10_1016_j_cpc_2022_108605 crossref_primary_10_1063_5_0144917 crossref_primary_10_3390_computers12120244 crossref_primary_10_1016_j_cpc_2023_108739 crossref_primary_10_1063_5_0198797 crossref_primary_10_32604_cmc_2024_056880 crossref_primary_10_1016_j_cpc_2024_109364 crossref_primary_10_1007_s11227_024_06254_y |
Cites_doi | 10.1145/3126908.3126952 10.1007/978-3-030-49943-3_5 10.1002/jcc.540141112 10.1103/PhysRev.140.A1133 10.1103/PhysRevLett.75.2867 10.1016/j.cpc.2015.06.001 10.1016/j.cpc.2019.05.014 10.1103/PhysRev.136.B864 10.1073/pnas.1517368112 10.1146/annurev.fluid.010908.165203 10.1109/99.660313 10.1016/0031-8914(47)90013-X 10.1016/j.cpc.2017.10.004 10.1103/PhysRev.94.1111 10.1063/5.0004860 10.1088/2041-8205/813/1/L6 10.1145/3416315.3416320 10.1177/1094342017732628 10.1109/PACT.2019.00011 10.1021/acs.jctc.9b00409 10.1016/j.cpc.2016.07.013 10.1016/j.cpc.2016.12.019 |
ContentType | Journal Article |
Copyright | 2021 Elsevier B.V. |
Copyright_xml | – notice: 2021 Elsevier B.V. |
DBID | AAYXX CITATION OTOTI |
DOI | 10.1016/j.parco.2021.102856 |
DatabaseName | CrossRef OSTI.GOV |
DatabaseTitle | CrossRef |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISSN | 1872-7336 |
ExternalDocumentID | 1829922 10_1016_j_parco_2021_102856 S0167819121001009 |
GroupedDBID | --K --M -~X .DC .~1 0R~ 123 1B1 1~. 1~5 29O 4.4 457 4G. 5VS 6OB 7-5 71M 8P~ 9JN AACTN AAEDT AAEDW AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXUO AAYFN ABBOA ABEFU ABFNM ABJNI ABMAC ABXDB ABYKQ ACDAQ ACGFS ACNNM ACRLP ACZNC ADBBV ADEZE ADJOM ADMUD ADTZH AEBSH AECPX AEKER AENEX AFKWA AFTJW AGHFR AGUBO AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD ASPBG AVWKF AXJTR AZFZN BJAXD BKOJK BLXMC CS3 DU5 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-Q G8K GBLVA GBOLZ HLZ HVGLF HZ~ H~9 IHE J1W JJJVA KOM LG9 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. Q38 R2- RIG ROL RPZ SBC SCC SDF SDG SDP SES SEW SPC SPCBC SST SSV SSZ T5K WH7 WUQ XPP ZMT ~G- AATTM AAXKI AAYWO AAYXX ABDPE ABWVN ACRPL ACVFH ADCNI ADNMO AEIPS AEUPX AFJKZ AFPUW AFXIZ AGCQF AGQPQ AGRNS AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP BNPGV CITATION SSH AALMO ABPIF ABPTK OTOTI |
ID | FETCH-LOGICAL-c375t-958f0cbbef17ca3b9d27f947c321e016f9e6d0c3a466cebee30fccb3a08b25e83 |
IEDL.DBID | .~1 |
ISSN | 0167-8191 |
IngestDate | Thu May 18 22:32:37 EDT 2023 Tue Jul 01 02:53:29 EDT 2025 Thu Apr 24 23:03:29 EDT 2025 Fri Feb 23 02:41:56 EST 2024 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | C |
Keywords | RAJA GAMESS QMCPACK Accelerators High performance computing GridQCD Application porting experiences GenASiS SLATE GESTS LSMS OpenMP implementations |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c375t-958f0cbbef17ca3b9d27f947c321e016f9e6d0c3a466cebee30fccb3a08b25e83 |
Notes | USDOE National Nuclear Security Administration (NNSA) AC02-06CH11357; AC05-00OR22725 |
ORCID | 0000-0003-4687-1226 0000000346871226 |
OpenAccessLink | https://www.osti.gov/biblio/1829281 |
ParticipantIDs | osti_scitechconnect_1829922 crossref_primary_10_1016_j_parco_2021_102856 crossref_citationtrail_10_1016_j_parco_2021_102856 elsevier_sciencedirect_doi_10_1016_j_parco_2021_102856 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | March 2022 2022-03-00 2022-03-01 |
PublicationDateYYYYMMDD | 2022-03-01 |
PublicationDate_xml | – month: 03 year: 2022 text: March 2022 |
PublicationDecade | 2020 |
PublicationPlace | Netherlands |
PublicationPlace_xml | – name: Netherlands |
PublicationTitle | Parallel computing |
PublicationYear | 2022 |
Publisher | Elsevier B.V Elsevier |
Publisher_xml | – name: Elsevier B.V – name: Elsevier |
References | Hohenberg, Kohn (b25) 1964; 136 Eisenbach, Larkin, Lutjens, Rennich, Rogers (b24) 2017; 211 Kohn, Sham (b26) 1965; 140 Cardall, Budiardja (b15) 2015; 196 Mathuriya, Luo, Clay, Benali, Shulenburger, Kim (b31) 2017 Ishihara, Gotoh, Kaneda (b19) 2009; 41 Ravikumar, Appelhans, Yeung (b21) 2019 CHAI (b34) 2020 (b1) 2021 (b5) 2020 Wang, Stocks, Shelton, Nicholson, Temmerman, Szotek (b23) 1995; 75 Kim (b29) 2018; 30 J. Schuchart, C. Niethammer, J. Garcia, Fibers are not (P)threads: The case for loose coupling of asynchronous programming models and MPI through continuations, in: 27th European MPI Users’ Group Meeting, EuroMPI/USA ’20, 2020, pp. 39–50. Korringa (b27) 1947; 13 Schmidt, Baldridge, Boatz, Elbert, Gordon, Jensen, Koseki, Matsunaga, Nguyen, Su, Windus, Dupuis, Montgomery (b7) 1993; 14 Kohn, Rostoker (b28) 1954; 94 Pham, Gordon (b10) 2019; 15 Kwack, Bertoni, Pham, Larkin (b11) 2020 Boyle, Cossu, Yamaguchi, Portelli (b22) 2016; LATTICE2015 Endeve, Cardall, Budiardja, Mezzacappa (b12) 2010; 713 Yeung, Zhai, Sreenivasan (b20) 2015; 112 OpenMP (b36) 2020 Cardall, Budiardja (b14) 2015; 813 Beckingsale, Burmark, Hornung, Jones, Killian, Kunen, Pearce, Robinson, Ryujin, Scogland (b33) 2019 Umpire (b35) 2020 S. Iwasaki, A. Amer, K. Taura, S. Seo, P. Balaji, BOLT: Optimizing OpenMP parallel regions with user-level threads, in: 2019 28th International Conference on Parallel Architectures and Compilation Techniques, PACT ’19, 2019, pp. 29–42. Cardall, Budiardja (b18) 2018; 222 Kent (b30) 2020; 152 Dagum, Menon (b2) 1998; 5 Lu, Seo, Balaji (b32) 2015 Endeve, Cardall, Budiardja, Beck, Bejnood, Toedte, Mezzacappa, Blondin (b13) 2012; 751 LLVM Developers (b6) 2020 Mironov, Moskovsky, D’Mello, Alexeev (b9) 2019; 33 Gordon, Schmidt (b8) 2005 Cardall, Budiardja (b16) 2017; 214 Budiardja, Cardall (b17) 2019; 244 Kohn (10.1016/j.parco.2021.102856_b26) 1965; 140 Cardall (10.1016/j.parco.2021.102856_b16) 2017; 214 Kim (10.1016/j.parco.2021.102856_b29) 2018; 30 Boyle (10.1016/j.parco.2021.102856_b22) 2016; LATTICE2015 OpenMP (10.1016/j.parco.2021.102856_b36) 2020 (10.1016/j.parco.2021.102856_b1) 2021 Korringa (10.1016/j.parco.2021.102856_b27) 1947; 13 Kwack (10.1016/j.parco.2021.102856_b11) 2020 Wang (10.1016/j.parco.2021.102856_b23) 1995; 75 Dagum (10.1016/j.parco.2021.102856_b2) 1998; 5 Cardall (10.1016/j.parco.2021.102856_b15) 2015; 196 Schmidt (10.1016/j.parco.2021.102856_b7) 1993; 14 Endeve (10.1016/j.parco.2021.102856_b13) 2012; 751 Mironov (10.1016/j.parco.2021.102856_b9) 2019; 33 Cardall (10.1016/j.parco.2021.102856_b14) 2015; 813 LLVM Developers (10.1016/j.parco.2021.102856_b6) 2020 Cardall (10.1016/j.parco.2021.102856_b18) 2018; 222 10.1016/j.parco.2021.102856_b4 10.1016/j.parco.2021.102856_b3 Yeung (10.1016/j.parco.2021.102856_b20) 2015; 112 Gordon (10.1016/j.parco.2021.102856_b8) 2005 Endeve (10.1016/j.parco.2021.102856_b12) 2010; 713 Ishihara (10.1016/j.parco.2021.102856_b19) 2009; 41 Hohenberg (10.1016/j.parco.2021.102856_b25) 1964; 136 (10.1016/j.parco.2021.102856_b5) 2020 Lu (10.1016/j.parco.2021.102856_b32) 2015 Kent (10.1016/j.parco.2021.102856_b30) 2020; 152 CHAI (10.1016/j.parco.2021.102856_b34) 2020 Eisenbach (10.1016/j.parco.2021.102856_b24) 2017; 211 Mathuriya (10.1016/j.parco.2021.102856_b31) 2017 Umpire (10.1016/j.parco.2021.102856_b35) 2020 Pham (10.1016/j.parco.2021.102856_b10) 2019; 15 Budiardja (10.1016/j.parco.2021.102856_b17) 2019; 244 Beckingsale (10.1016/j.parco.2021.102856_b33) 2019 Ravikumar (10.1016/j.parco.2021.102856_b21) 2019 Kohn (10.1016/j.parco.2021.102856_b28) 1954; 94 |
References_xml | – start-page: 91 year: 2020 end-page: 113 ident: b11 article-title: Performance of the RI-MP2 fortran kernel of GAMESS on GPUs via directive-based offloading with math libraries publication-title: Accelerator Programming using Directives – volume: 713 start-page: 1219 year: 2010 end-page: 1243 ident: b12 article-title: Generation of magnetic fields by the stationary accretion shock instability publication-title: Agron. J. – volume: LATTICE2015 start-page: 023 year: 2016 ident: b22 article-title: Grid: A next generation data parallel C++ QCD library publication-title: PoS – start-page: 71 year: 2019 end-page: 81 ident: b33 article-title: RAJA: portable performance for large-scale scientific applications publication-title: 2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC – year: 2020 ident: b6 article-title: LLVM/Clang openmp support – volume: 33 start-page: 212 year: 2019 end-page: 224 ident: b9 article-title: An efficient MPI/OpenMP parallelization of the Hartree-Fock-Roothaan method for the first generation of Intel(R) Xeon Phi(TM) processor architecture publication-title: Int. J. High Perform. Comput. Appl. – volume: 112 start-page: 12633 year: 2015 end-page: 12638 ident: b20 article-title: Extreme events in computational turbulence publication-title: Proc. Natl. Acad. Sci. – volume: 13 start-page: 392 year: 1947 end-page: 400 ident: b27 article-title: On the calculation of the energy of a Bloch wave in a metal publication-title: Physica – year: 2020 ident: b34 article-title: CHAI – volume: 30 year: 2018 ident: b29 article-title: QMCPACK: an open source ab initio quantum Monte Carlo package for the electronic structure of atoms, molecules and solids publication-title: J. Phys.: Condens. Matter – year: 2020 ident: b36 article-title: OpenMP website – start-page: 38:1 year: 2017 end-page: 38:12 ident: b31 article-title: Embracing a new era of highly efficient and productive quantum Monte Carlo simulations publication-title: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis – reference: S. Iwasaki, A. Amer, K. Taura, S. Seo, P. Balaji, BOLT: Optimizing OpenMP parallel regions with user-level threads, in: 2019 28th International Conference on Parallel Architectures and Compilation Techniques, PACT ’19, 2019, pp. 29–42. – volume: 136 start-page: B864 year: 1964 end-page: B871 ident: b25 article-title: Inhomogeneous electron gas publication-title: Phys. Rev. – volume: 196 start-page: 506 year: 2015 end-page: 534 ident: b15 article-title: GenASiS basics: Object-oriented utilitarian functionality for large-scale physics simulations publication-title: Comput. Phys. Comm. – volume: 41 start-page: 165 year: 2009 end-page: 180 ident: b19 article-title: Study of high Reynolds number isotropic turbulence by direct numerical simulations publication-title: Annu. Rev. Fluid Mech. – volume: 214 start-page: 247 year: 2017 end-page: 248 ident: b16 article-title: GenASiS Basics: Object-oriented utilitarian functionality for large-scale physics simulations (Version 2) publication-title: Comput. Phys. Comm. – volume: 75 start-page: 2867 year: 1995 ident: b23 article-title: Order-N multiple scattering approach to electronic structure calculations publication-title: Phys. Rev. Lett. – volume: 152 year: 2020 ident: b30 article-title: QMCPACK: Advances in the development, efficiency, and application of auxiliary field and real-space variational and diffusion quantum Monte Carlo publication-title: J. Chem. Phys. – volume: 211 start-page: 2 year: 2017 end-page: 7 ident: b24 article-title: GPU acceleration of the locally selfconsistent multiple scattering code for first principles calculation of the ground state and statistical physics of materials publication-title: Comput. Phys. Comm. – year: 2019 ident: b21 article-title: GPU acceleration of extreme scale pseudo-spectral simulations of turbulence using asynchronism publication-title: Proceedings of the International Conference for High Performance Computing, Networking and Storage Analysis – volume: 14 start-page: 1347 year: 1993 end-page: 1363 ident: b7 article-title: General atomic and molecular electronic structure system publication-title: J. Comput. Chem. – volume: 140 start-page: A1133 year: 1965 end-page: A1138 ident: b26 article-title: Self-consistent equations including exchange and correlation effects publication-title: Phys. Rev. – volume: 222 start-page: 384 year: 2018 end-page: 412 ident: b18 article-title: GenASiS Mathematics : Object-oriented manifolds, operations, and solvers for large-scale physics simulations publication-title: Comput. Phys. Comm. – start-page: 1167 year: 2005 end-page: 1189 ident: b8 article-title: Advances in electronic structure theory: GAMESS a decade later publication-title: Theory and Applications of Computational Chemistry – year: 2020 ident: b5 article-title: The LLVM compiler infrastructure – volume: 751 start-page: 26 year: 2012 ident: b13 article-title: Turbulent magnetic field amplification from spiral SASI modes: Implications for core-collapse supernovae and proto-neutron star magnetization publication-title: Agron. J. – year: 2020 ident: b35 article-title: Umpire – volume: 813 start-page: L6 year: 2015 ident: b14 article-title: Stochasticity and efficiency in simplified models of core-collapse supernova explosions publication-title: Astrophys. J. Lett. – year: 2021 ident: b1 article-title: Openmp 5.0 reference guide – volume: 15 start-page: 5252 year: 2019 end-page: 5258 ident: b10 article-title: Hybrid distributed/shared memory model for the RI-MP2 method in the fragment molecular orbital framework publication-title: J. Chem. Theory Comput. – reference: J. Schuchart, C. Niethammer, J. Garcia, Fibers are not (P)threads: The case for loose coupling of asynchronous programming models and MPI through continuations, in: 27th European MPI Users’ Group Meeting, EuroMPI/USA ’20, 2020, pp. 39–50. – volume: 244 start-page: 483 year: 2019 end-page: 486 ident: b17 article-title: GenASiS Basics: Object-oriented utilitarian functionality for large-scale physics simulations (Version 3) publication-title: Comput. Phys. Comm. – volume: 94 start-page: 1111 year: 1954 end-page: 1120 ident: b28 article-title: Solution of the Schrödinger equation in periodic lattices with an application to metallic lithium publication-title: Phys. Rev. – start-page: 444 year: 2015 end-page: 454 ident: b32 article-title: MPI+ULT: overlapping communication and computation with user-level threads publication-title: 2015 IEEE 17th Int. Conf. on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th Int. Conf. on Embedded Software and Systems – volume: 5 year: 1998 ident: b2 article-title: OpenMP: An industry-standard API for shared-memory programming publication-title: IEEE Comput. Sci. Eng. – start-page: 38:1 year: 2017 ident: 10.1016/j.parco.2021.102856_b31 article-title: Embracing a new era of highly efficient and productive quantum Monte Carlo simulations doi: 10.1145/3126908.3126952 – start-page: 91 year: 2020 ident: 10.1016/j.parco.2021.102856_b11 article-title: Performance of the RI-MP2 fortran kernel of GAMESS on GPUs via directive-based offloading with math libraries doi: 10.1007/978-3-030-49943-3_5 – year: 2020 ident: 10.1016/j.parco.2021.102856_b34 – volume: 14 start-page: 1347 issue: 11 year: 1993 ident: 10.1016/j.parco.2021.102856_b7 article-title: General atomic and molecular electronic structure system publication-title: J. Comput. Chem. doi: 10.1002/jcc.540141112 – year: 2020 ident: 10.1016/j.parco.2021.102856_b5 – volume: 140 start-page: A1133 year: 1965 ident: 10.1016/j.parco.2021.102856_b26 article-title: Self-consistent equations including exchange and correlation effects publication-title: Phys. Rev. doi: 10.1103/PhysRev.140.A1133 – volume: 751 start-page: 26 issue: 1 year: 2012 ident: 10.1016/j.parco.2021.102856_b13 article-title: Turbulent magnetic field amplification from spiral SASI modes: Implications for core-collapse supernovae and proto-neutron star magnetization publication-title: Agron. J. – volume: 75 start-page: 2867 year: 1995 ident: 10.1016/j.parco.2021.102856_b23 article-title: Order-N multiple scattering approach to electronic structure calculations publication-title: Phys. Rev. Lett. doi: 10.1103/PhysRevLett.75.2867 – start-page: 1167 year: 2005 ident: 10.1016/j.parco.2021.102856_b8 article-title: Advances in electronic structure theory: GAMESS a decade later – start-page: 444 year: 2015 ident: 10.1016/j.parco.2021.102856_b32 article-title: MPI+ULT: overlapping communication and computation with user-level threads – volume: 196 start-page: 506 issn: 0010-4655 year: 2015 ident: 10.1016/j.parco.2021.102856_b15 article-title: GenASiS basics: Object-oriented utilitarian functionality for large-scale physics simulations publication-title: Comput. Phys. Comm. doi: 10.1016/j.cpc.2015.06.001 – volume: 244 start-page: 483 issn: 0010-4655 year: 2019 ident: 10.1016/j.parco.2021.102856_b17 article-title: GenASiS Basics: Object-oriented utilitarian functionality for large-scale physics simulations (Version 3) publication-title: Comput. Phys. Comm. doi: 10.1016/j.cpc.2019.05.014 – volume: 136 start-page: B864 year: 1964 ident: 10.1016/j.parco.2021.102856_b25 article-title: Inhomogeneous electron gas publication-title: Phys. Rev. doi: 10.1103/PhysRev.136.B864 – volume: 112 start-page: 12633 year: 2015 ident: 10.1016/j.parco.2021.102856_b20 article-title: Extreme events in computational turbulence publication-title: Proc. Natl. Acad. Sci. doi: 10.1073/pnas.1517368112 – volume: 41 start-page: 165 year: 2009 ident: 10.1016/j.parco.2021.102856_b19 article-title: Study of high Reynolds number isotropic turbulence by direct numerical simulations publication-title: Annu. Rev. Fluid Mech. doi: 10.1146/annurev.fluid.010908.165203 – volume: 5 issue: 1 year: 1998 ident: 10.1016/j.parco.2021.102856_b2 article-title: OpenMP: An industry-standard API for shared-memory programming publication-title: IEEE Comput. Sci. Eng. doi: 10.1109/99.660313 – volume: 13 start-page: 392 year: 1947 ident: 10.1016/j.parco.2021.102856_b27 article-title: On the calculation of the energy of a Bloch wave in a metal publication-title: Physica doi: 10.1016/0031-8914(47)90013-X – volume: 222 start-page: 384 issn: 0010-4655 year: 2018 ident: 10.1016/j.parco.2021.102856_b18 article-title: GenASiS Mathematics : Object-oriented manifolds, operations, and solvers for large-scale physics simulations publication-title: Comput. Phys. Comm. doi: 10.1016/j.cpc.2017.10.004 – start-page: 71 year: 2019 ident: 10.1016/j.parco.2021.102856_b33 article-title: RAJA: portable performance for large-scale scientific applications – year: 2019 ident: 10.1016/j.parco.2021.102856_b21 article-title: GPU acceleration of extreme scale pseudo-spectral simulations of turbulence using asynchronism – volume: 94 start-page: 1111 year: 1954 ident: 10.1016/j.parco.2021.102856_b28 article-title: Solution of the Schrödinger equation in periodic lattices with an application to metallic lithium publication-title: Phys. Rev. doi: 10.1103/PhysRev.94.1111 – year: 2020 ident: 10.1016/j.parco.2021.102856_b36 – volume: 152 issue: 17 year: 2020 ident: 10.1016/j.parco.2021.102856_b30 article-title: QMCPACK: Advances in the development, efficiency, and application of auxiliary field and real-space variational and diffusion quantum Monte Carlo publication-title: J. Chem. Phys. doi: 10.1063/5.0004860 – volume: 813 start-page: L6 year: 2015 ident: 10.1016/j.parco.2021.102856_b14 article-title: Stochasticity and efficiency in simplified models of core-collapse supernova explosions publication-title: Astrophys. J. Lett. doi: 10.1088/2041-8205/813/1/L6 – year: 2021 ident: 10.1016/j.parco.2021.102856_b1 – year: 2020 ident: 10.1016/j.parco.2021.102856_b6 – volume: 30 issue: 19 year: 2018 ident: 10.1016/j.parco.2021.102856_b29 article-title: QMCPACK: an open source ab initio quantum Monte Carlo package for the electronic structure of atoms, molecules and solids publication-title: J. Phys.: Condens. Matter – year: 2020 ident: 10.1016/j.parco.2021.102856_b35 – ident: 10.1016/j.parco.2021.102856_b4 doi: 10.1145/3416315.3416320 – volume: 33 start-page: 212 issue: 1 year: 2019 ident: 10.1016/j.parco.2021.102856_b9 article-title: An efficient MPI/OpenMP parallelization of the Hartree-Fock-Roothaan method for the first generation of Intel(R) Xeon Phi(TM) processor architecture publication-title: Int. J. High Perform. Comput. Appl. doi: 10.1177/1094342017732628 – ident: 10.1016/j.parco.2021.102856_b3 doi: 10.1109/PACT.2019.00011 – volume: 15 start-page: 5252 issue: 10 year: 2019 ident: 10.1016/j.parco.2021.102856_b10 article-title: Hybrid distributed/shared memory model for the RI-MP2 method in the fragment molecular orbital framework publication-title: J. Chem. Theory Comput. doi: 10.1021/acs.jctc.9b00409 – volume: 713 start-page: 1219 year: 2010 ident: 10.1016/j.parco.2021.102856_b12 article-title: Generation of magnetic fields by the stationary accretion shock instability publication-title: Agron. J. – volume: LATTICE2015 start-page: 023 year: 2016 ident: 10.1016/j.parco.2021.102856_b22 article-title: Grid: A next generation data parallel C++ QCD library publication-title: PoS – volume: 211 start-page: 2 year: 2017 ident: 10.1016/j.parco.2021.102856_b24 article-title: GPU acceleration of the locally selfconsistent multiple scattering code for first principles calculation of the ground state and statistical physics of materials publication-title: Comput. Phys. Comm. doi: 10.1016/j.cpc.2016.07.013 – volume: 214 start-page: 247 issn: 0010-4655 year: 2017 ident: 10.1016/j.parco.2021.102856_b16 article-title: GenASiS Basics: Object-oriented utilitarian functionality for large-scale physics simulations (Version 2) publication-title: Comput. Phys. Comm. doi: 10.1016/j.cpc.2016.12.019 |
SSID | ssj0006480 |
Score | 2.4701872 |
Snippet | As recent enhancements to the OpenMP specification become available in its implementations, there is a need to share the results of experimentation in order to... |
SourceID | osti crossref elsevier |
SourceType | Open Access Repository Enrichment Source Index Database Publisher |
StartPage | 102856 |
SubjectTerms | Accelerators Application porting experiences GAMESS GenASiS GESTS GridQCD High performance computing LSMS OpenMP implementations QMCPACK RAJA SLATE |
Title | OpenMP application experiences: Porting to accelerated nodes |
URI | https://dx.doi.org/10.1016/j.parco.2021.102856 https://www.osti.gov/biblio/1829922 |
Volume | 109 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PS8MwFA5jXrz4W5zTkYNH69okTVPxMoZjKhuCDnYrzWsKE2mHq1f_dvP6Y06QHbyVklfa7yXvJfR73yPkCozvgwy1kwqjHaET7cSpvVLKw4KVxGYkrHeeTOV4Jh7n_rxFhk0tDNIq69hfxfQyWtd3-jWa_eVi0X9BAj0eNxjKCFVFfEIEOMtvvn5oHlKU3dNKfW8c3SgPlRyvpZ1MWAHIPJQwUNjF-u_s1M7tgttIPKMDslfvGOmgeqlD0jLZEdlvujHQenEekzvkhkye6cYvaWrWQsarW4qkUZuoaJHTGMCmG1SJSGiWJ2Z1Qmaj-9fh2Kl7IzjAA79wQl-lLmhtUi-AmOswYUEaigA484z9uDQ0MnGBx0JKsI4y3E0BNI9dpZlvFD8l7SzPzBmhgRLM9eNYohofpFoxLV3sX8RButrjHcIaTCKohcOxf8V71DDE3qISyAiBjCogO-R6bbSsdDO2D5cN2NEv90c2sm837KJr0AhFbwHZQdbKnppQcPf8v4_tkl2GdQ4l2eyCtIuPT3Npdx-F7pXTq0d2Bg9P4-k322jYzg |
linkProvider | Elsevier |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Na4MwFA_FHrbLvse67iOHHSfVRKOOXUpZsesHg7XQm5iYQMfQsrr_f3kaSwejh91EfKK_vLyXkN_7PYQehPR9wSJuK09y2-MZt1Olr8LQhYKVTGckqHeezli88F6X_rKFBk0tDNAqTeyvY3oVrc2dnkGzt16teu9AoIftBgEZoaqIrw3qVL6F2v3ROJ5tAzLzqgZqlcQ3GDTiQxXNa639CYoAiQsqBiE0sv47QVmFnnM7uWd4go7MohH36-86RS2Zn6HjpiEDNvPzHD0DPWT6hndOpbHcahlvnjDwRnWuwmWBUyF0xgGhiAznRSY3F2gxfJkPYtu0R7AFDfzSjvxQOYJzqdxApJRHGQlU5AWCElfqn1ORZJkjaOoxJvRYSeooIThNnZATX4b0Ell5kcsrhIPQI46fpgwE-YTiIeHMgRZGVDCHu7SDSINJIox2OLSw-EwakthHUgGZAJBJDWQHPW6N1rV0xv7HWQN28ssDEh3c9xt2YWjACHRvBRCEtJXeOIHm7vV_X3uPDuL5dJJMRrNxFx0SKHuouGc3yCq_vuWtXoyU_M442w9aJdt_ |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=OpenMP+application+experiences%3A+Porting+to+accelerated+nodes&rft.jtitle=Parallel+computing&rft.au=Bak%2C+Seonmyeong&rft.au=Bertoni%2C+Colleen&rft.au=Boehm%2C+Swen&rft.au=Budiardja%2C+Reuben&rft.date=2022-03-01&rft.issn=0167-8191&rft.volume=109&rft.spage=102856&rft_id=info:doi/10.1016%2Fj.parco.2021.102856&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_parco_2021_102856 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-8191&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-8191&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-8191&client=summon |