The PetscSF Scalable Communication Layer

PetscSF, the communication component of the Portable, Extensible Toolkit for Scientific Computation (PETSc), is designed to provide PETSc's communication infrastructure suitable for exascale computers that utilize GPUs and other accelerators. PetscSF provides a simple application programming in...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on parallel and distributed systems Vol. 33; no. 4
Main Authors Zhang, Junchao, Brown, Jed, Balay, Satish, Faibussowitsch, Jacob, Knepley, Matthew, Marin, Oana, Mills, Richard Tran, Munson, Todd, Smith, Barry F., Zampini, Stefano
Format Journal Article
LanguageEnglish
Published United States IEEE 26.05.2021
Subjects
Online AccessGet full text

Cover

Loading…
Abstract PetscSF, the communication component of the Portable, Extensible Toolkit for Scientific Computation (PETSc), is designed to provide PETSc's communication infrastructure suitable for exascale computers that utilize GPUs and other accelerators. PetscSF provides a simple application programming interface (API) for managing common communication patterns in scientific computations by using a star-forest graph representation. PetscSF supports several implementations based on MPI and NVSHMEM, whose selection is based on the characteristics of the application or the target architecture. An efficient and portable model for network and intra-node communication is essential for implementing large-scale applications. The Message Passing Interface, which has been the de facto standard for distributed memory systems, has developed into a large complex API that does not yet provide high performance on the emerging heterogeneous CPU-GPU-based exascale systems. Here, we discuss the design of PetscSF, how it can overcome some difficulties of working directly with MPI on GPUs, and we demonstrate its performance, scalability, and novel features.
AbstractList PetscSF, the communication component of the Portable, Extensible Toolkit for Scientific Computation (PETSc), is designed to provide PETSc's communication infrastructure suitable for exascale computers that utilize GPUs and other accelerators. PetscSF provides a simple application programming interface (API) for managing common communication patterns in scientific computations by using a star-forest graph representation. PetscSF supports several implementations based on MPI and NVSHMEM, whose selection is based on the characteristics of the application or the target architecture. An efficient and portable model for network and intra-node communication is essential for implementing large-scale applications. The Message Passing Interface, which has been the de facto standard for distributed memory systems, has developed into a large complex API that does not yet provide high performance on the emerging heterogeneous CPU-GPU-based exascale systems. Here, we discuss the design of PetscSF, how it can overcome some difficulties of working directly with MPI on GPUs, and we demonstrate its performance, scalability, and novel features.
Author Smith, Barry F.
Faibussowitsch, Jacob
Knepley, Matthew
Marin, Oana
Mills, Richard Tran
Balay, Satish
Zhang, Junchao
Munson, Todd
Brown, Jed
Zampini, Stefano
Author_xml – sequence: 1
  orcidid: 0000000303672358
  fullname: Zhang, Junchao
  organization: Argonne National Lab. (ANL), Argonne, IL (United States)] (ORCID:0000000303672358
– sequence: 2
  orcidid: 0000000299450639
  fullname: Brown, Jed
  organization: Univ. of Colorado, Boulder, CO (United States)] (ORCID:0000000299450639
– sequence: 3
  orcidid: 000000034407205X
  fullname: Balay, Satish
  organization: Argonne National Lab. (ANL), Argonne, IL (United States)] (ORCID:000000034407205X
– sequence: 4
  orcidid: 0000000221141457
  fullname: Faibussowitsch, Jacob
  organization: Univ. of Illinois at Urbana-Champaign, IL (United States)] (ORCID:0000000221141457
– sequence: 5
  orcidid: 0000000222920735
  fullname: Knepley, Matthew
  organization: Univ. at Buffalo, NY (United States)] (ORCID:0000000222920735
– sequence: 6
  fullname: Marin, Oana
  organization: Argonne National Lab. (ANL), Argonne, IL (United States)
– sequence: 7
  orcidid: 0000000306836899
  fullname: Mills, Richard Tran
  organization: Argonne National Lab. (ANL), Argonne, IL (United States)] (ORCID:0000000306836899
– sequence: 8
  orcidid: 0000000200303648
  fullname: Munson, Todd
  organization: Argonne National Lab. (ANL), Argonne, IL (United States)] (ORCID:0000000200303648
– sequence: 9
  orcidid: 0000000159558111
  fullname: Smith, Barry F.
  organization: Argonne National Lab. (ANL), Argonne, IL (United States)] (ORCID:0000000159558111
– sequence: 10
  orcidid: 0000000204350433
  fullname: Zampini, Stefano
  organization: King Abdullah University of Science and Technology (KAUST), Thuwal (Saudi Arabia)] (ORCID:0000000204350433
BackLink https://www.osti.gov/servlets/purl/1837203$$D View this record in Osti.gov
BookMark eNrjYmDJy89LZWLgNDQ1tdA1MrQwZgGyDUxMdS2NDC05GLiKi7MMDAxNTA1MOBk0QjJSFQJSS4qTg90UgpMTcxKTclIVnPNzc0vzMpMTSzLz8xR8EitTi3gYWNMSc4pTeaE0N4OSm2uIs4dufnFJZnxxcmZJanJGcn5eXmpySTzQTnMjA2NjohQBAA5QMpA
ContentType Journal Article
CorporateAuthor Argonne National Lab. (ANL), Argonne, IL (United States)
CorporateAuthor_xml – name: Argonne National Lab. (ANL), Argonne, IL (United States)
DBID OIOZB
OTOTI
DatabaseName OSTI.GOV - Hybrid
OSTI.GOV
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISSN 1558-2183
ExternalDocumentID 1837203
GroupedDBID --Z
-~X
.DC
0R~
29I
4.4
5GY
6IK
97E
AAJGR
AASAJ
ABVLG
ACGFO
ACIWK
AENEX
AKJIK
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
EBS
EJD
HZ~
IEDLZ
IFIPE
IPLJI
JAVBF
LAI
M43
MS~
O9-
OCL
OIOZB
OTOTI
P2P
PQQKQ
RIA
RIC
RIE
RIG
RNS
TN5
TWZ
UHB
ID FETCH-osti_scitechconnect_18372033
ISSN 1045-9219
IngestDate Wed May 29 15:15:47 EDT 2024
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 4
Language English
LinkModel OpenURL
MergedId FETCHMERGED-osti_scitechconnect_18372033
Notes AC02-06CH11357; SC0016140; AC02-0000011838; AC05-00OR22725
USDOE Office of Science (SC)
USDOE National Nuclear Security Administration (NNSA)
ORCID 000000034407205X
0000000200303648
0000000221141457
0000000204350433
0000000159558111
0000000299450639
0000000303672358
0000000222920735
0000000306836899
OpenAccessLink https://www.osti.gov/servlets/purl/1837203
ParticipantIDs osti_scitechconnect_1837203
PublicationCentury 2000
PublicationDate 2021-05-26
PublicationDateYYYYMMDD 2021-05-26
PublicationDate_xml – month: 05
  year: 2021
  text: 2021-05-26
  day: 26
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle IEEE transactions on parallel and distributed systems
PublicationYear 2021
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0014504
Score 4.7523475
Snippet PetscSF, the communication component of the Portable, Extensible Toolkit for Scientific Computation (PETSc), is designed to provide PETSc's communication...
SourceID osti
SourceType Open Access Repository
SubjectTerms Communication
extreme-scale
GPU
MATHEMATICS AND COMPUTING
MPI
PETSc
Title The PetscSF Scalable Communication Layer
URI https://www.osti.gov/servlets/purl/1837203
Volume 33
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3fS8MwED7cnvTBH1NRpxLEB0Eqy5ps66OIYwwVwQl7G2mSMaGssFVE_3ovP9p1KkN9KSUtSenXXr5c7rsDOOdScE3lOFBtHgdM4YI1jqgKWNRpsFDosGl9uvcPrd4z6w_5cFEezKpLsvhKfvyoK_kPqtiGuBqV7B-QLTrFBjxHfPGICOPx1xg_6mwun7r4k4rEyqCWJB-Xd-Ldx996BmpWd6YwRF4l3G4XmATgSaJd2gBlcumaMljIReelhOZl93Ifp8OJSL8t5vteK2V9o4lwgWRGQFF4nbsCu8b38vaCzz1xgboyjcvehyY1G-dO4v51RWqsJ_LDIMptoPYWlSNo1FWryU2uy33hPy22mIqKAEG83-wPV6ASUu5UecXeEOO2KGQxGE6mKZrDEi0YbMOm5_Pk2oGzA2t6WoOtvFYG8aazBhulxI-7cIHIEY8cyZEjS8gRi9wenHVvBze9wIw9Qrpicu5KE5wks5F__HAfqtN0qg-AIDNqqU4jHrepYuNOW2jkUWEUNpTmkaLqEOorOjpaebUO6wtojqGazV71CVKlLD61b-4Ta0IeTQ
link.rule.ids 230,315,786,790,891
linkProvider IEEE
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+PetscSF+Scalable+Communication+Layer&rft.jtitle=IEEE+transactions+on+parallel+and+distributed+systems&rft.au=Zhang%2C+Junchao&rft.au=Brown%2C+Jed&rft.au=Balay%2C+Satish&rft.au=Faibussowitsch%2C+Jacob&rft.date=2021-05-26&rft.pub=IEEE&rft.issn=1045-9219&rft.eissn=1558-2183&rft.volume=33&rft.issue=4&rft.externalDocID=1837203
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1045-9219&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1045-9219&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1045-9219&client=summon