The PetscSF Scalable Communication Layer
Published in | IEEE Transactions on Parallel and Distributed Systems, Vol. 33, no. 4 |
---|---|
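Main Authors | Zhang, Junchao; Brown, Jed; Balay, Satish; Faibussowitsch, Jacob; Knepley, Matthew; Marin, Oana; Mills, Richard Tran; Munson, Todd; Smith, Barry F.; Zampini, Stefano |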
Format | Journal Article |
Language | English |
Published | United States: IEEE, 26.05.2021 |
Subjects | Communication; extreme-scale; GPU; MATHEMATICS AND COMPUTING; MPI; PETSc |
Online Access | https://www.osti.gov/servlets/purl/1837203 |
Abstract | PetscSF, the communication component of the Portable, Extensible Toolkit for Scientific Computation (PETSc), is designed to provide PETSc's communication infrastructure suitable for exascale computers that utilize GPUs and other accelerators. PetscSF provides a simple application programming interface (API) for managing common communication patterns in scientific computations by using a star-forest graph representation. PetscSF supports several implementations based on MPI and NVSHMEM, whose selection is based on the characteristics of the application or the target architecture. An efficient and portable model for network and intra-node communication is essential for implementing large-scale applications. The Message Passing Interface, which has been the de facto standard for distributed memory systems, has developed into a large complex API that does not yet provide high performance on the emerging heterogeneous CPU-GPU-based exascale systems. Here, we discuss the design of PetscSF, how it can overcome some difficulties of working directly with MPI on GPUs, and we demonstrate its performance, scalability, and novel features. |
---|---|
Author | Smith, Barry F.; Faibussowitsch, Jacob; Knepley, Matthew; Marin, Oana; Mills, Richard Tran; Balay, Satish; Zhang, Junchao; Munson, Todd; Brown, Jed; Zampini, Stefano |
Author_xml | 1. Zhang, Junchao (ORCID 0000-0003-0367-2358), Argonne National Lab. (ANL), Argonne, IL (United States); 2. Brown, Jed (ORCID 0000-0002-9945-0639), Univ. of Colorado, Boulder, CO (United States); 3. Balay, Satish (ORCID 0000-0003-4407-205X), Argonne National Lab. (ANL), Argonne, IL (United States); 4. Faibussowitsch, Jacob (ORCID 0000-0002-2114-1457), Univ. of Illinois at Urbana-Champaign, IL (United States); 5. Knepley, Matthew (ORCID 0000-0002-2292-0735), Univ. at Buffalo, NY (United States); 6. Marin, Oana, Argonne National Lab. (ANL), Argonne, IL (United States); 7. Mills, Richard Tran (ORCID 0000-0003-0683-6899), Argonne National Lab. (ANL), Argonne, IL (United States); 8. Munson, Todd (ORCID 0000-0002-0030-3648), Argonne National Lab. (ANL), Argonne, IL (United States); 9. Smith, Barry F. (ORCID 0000-0001-5955-8111), Argonne National Lab. (ANL), Argonne, IL (United States); 10. Zampini, Stefano (ORCID 0000-0002-0435-0433), King Abdullah University of Science and Technology (KAUST), Thuwal (Saudi Arabia) |
BackLink | https://www.osti.gov/servlets/purl/1837203 (view this record at OSTI.GOV) |
ContentType | Journal Article |
CorporateAuthor | Argonne National Lab. (ANL), Argonne, IL (United States) |
Discipline | Engineering; Computer Science |
EISSN | 1558-2183 |
ExternalDocumentID | 1837203 |
ISSN | 1045-9219 |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Language | English |
Notes | AC02-06CH11357; SC0016140; AC02-0000011838; AC05-00OR22725; USDOE Office of Science (SC); USDOE National Nuclear Security Administration (NNSA) |
ORCID | 000000034407205X 0000000200303648 0000000221141457 0000000204350433 0000000159558111 0000000299450639 0000000303672358 0000000222920735 0000000306836899 |
OpenAccessLink | https://www.osti.gov/servlets/purl/1837203 |
PublicationDate | 2021-05-26 |
PublicationPlace | United States |
PublicationTitle | IEEE Transactions on Parallel and Distributed Systems |
PublicationYear | 2021 |
Publisher | IEEE |
SourceType | Open Access Repository |
SubjectTerms | Communication; extreme-scale; GPU; MATHEMATICS AND COMPUTING; MPI; PETSc |
Title | The PetscSF Scalable Communication Layer |
URI | https://www.osti.gov/servlets/purl/1837203 |
Volume | 33 |
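
The star-forest graph representation named in the abstract can be pictured with a small single-process model. A star forest is a set of stars, each with one root and any number of leaves, so every leaf points to exactly one root. The two basic patterns on such a graph are a broadcast (root value copied to its leaves) and a reduction (leaf values combined into their root). The sketch below is illustrative Python, not the PETSc API: the class `StarForest` and its methods `bcast` and `reduce` are hypothetical names chosen for this example, and ranks are simulated with dictionary keys rather than MPI processes.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# (rank, index) addresses one entry in a simulated rank's local array.
Address = Tuple[int, int]


@dataclass
class StarForest:
    """Toy model of a star forest (hypothetical, not the PETSc type)."""

    # Maps each leaf address to the single root it is attached to.
    leaf_to_root: Dict[Address, Address]

    def bcast(self, root_data: Dict[int, List[float]],
              leaf_data: Dict[int, List[float]]) -> None:
        """Copy every root value into each of its leaves."""
        for (lrank, lidx), (rrank, ridx) in self.leaf_to_root.items():
            leaf_data[lrank][lidx] = root_data[rrank][ridx]

    def reduce(self, leaf_data: Dict[int, List[float]],
               root_data: Dict[int, List[float]],
               op: Callable[[float, float], float]) -> None:
        """Combine every leaf value into its root using a binary operator."""
        for (lrank, lidx), (rrank, ridx) in self.leaf_to_root.items():
            root_data[rrank][ridx] = op(root_data[rrank][ridx],
                                        leaf_data[lrank][lidx])


# Two simulated ranks. Rank 0 owns two roots; rank 1 owns none.
sf = StarForest({
    (0, 0): (0, 0),  # leaf on rank 0 attached to root 0 on rank 0
    (1, 0): (0, 0),  # leaf on rank 1 attached to the same root
    (1, 1): (0, 1),  # leaf on rank 1 attached to root 1 on rank 0
})

# Broadcast: push root values out to the leaves.
roots = {0: [10.0, 20.0], 1: []}
leaves = {0: [0.0], 1: [0.0, 0.0]}
sf.bcast(roots, leaves)

# Reduce: sum leaf contributions back into the roots.
partial = {0: [1.0], 1: [2.0, 3.0]}
sums = {0: [0.0, 0.0], 1: []}
sf.reduce(partial, sums, lambda a, b: a + b)
```

In PETSc itself the corresponding operations are split into begin/end pairs such as `PetscSFBcastBegin`/`PetscSFBcastEnd` and `PetscSFReduceBegin`/`PetscSFReduceEnd`, which lets communication overlap with computation; the toy model above performs each step synchronously and ignores that aspect.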