The PetscSF Scalable Communication Layer

PetscSF, the communication component of the Portable, Extensible Toolkit for Scientific Computation (PETSc), is designed to provide PETSc's communication infrastructure suitable for exascale computers that utilize GPUs and other accelerators. PetscSF provides a simple application programming in...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on parallel and distributed systems Vol. 33; no. 4
Main Authors Zhang, Junchao, Brown, Jed, Balay, Satish, Faibussowitsch, Jacob, Knepley, Matthew, Marin, Oana, Mills, Richard Tran, Munson, Todd, Smith, Barry F., Zampini, Stefano
Format Journal Article
LanguageEnglish
Published United States IEEE 26.05.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:PetscSF, the communication component of the Portable, Extensible Toolkit for Scientific Computation (PETSc), is designed to provide PETSc's communication infrastructure suitable for exascale computers that utilize GPUs and other accelerators. PetscSF provides a simple application programming interface (API) for managing common communication patterns in scientific computations by using a star-forest graph representation. PetscSF supports several implementations based on MPI and NVSHMEM, whose selection is based on the characteristics of the application or the target architecture. An efficient and portable model for network and intra-node communication is essential for implementing large-scale applications. The Message Passing Interface, which has been the de facto standard for distributed memory systems, has developed into a large complex API that does not yet provide high performance on the emerging heterogeneous CPU-GPU-based exascale systems. Here, we discuss the design of PetscSF, how it can overcome some difficulties of working directly with MPI on GPUs, and we demonstrate its performance, scalability, and novel features.
Bibliography:AC02-06CH11357; SC0016140; AC02-0000011838; AC05-00OR22725
USDOE Office of Science (SC)
USDOE National Nuclear Security Administration (NNSA)
ISSN:1045-9219
1558-2183