MergePath-SpMM: Parallel Sparse Matrix-Matrix Algorithm for Graph Neural Network Acceleration

Bibliographic Details
Published in	2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 145-156
Main Authors	Shan, Mohsin; Gurevin, Deniz; Nye, Jared; Ding, Caiwen; Khan, Omer
Format	Conference Proceeding
Language	English
Published	IEEE 01.04.2023
Subjects	GPU; graph processing; merge-path; multicore; neural networks; parallel algorithm; sparse matrix-matrix
DOI	10.1109/ISPASS57527.2023.00023

Summary:	Graph neural networks have seen tremendous adoption to perform complex predictive analytics on massive and unstructured real-world graphs. The trend in hardware accelerator designs has identified significant challenges with harnessing graph locality and workload imbalance due to ultra-sparse and irregular matrix computations at a massively parallel scale. This paper addresses the load imbalance challenge and identifies that the state of the art either introduces complex specialized hardware to auto-tune for load-balanced execution at runtime or relies on software-only approaches that exploit parallelism. We propose a novel software-only load-balancing sparse matrix-matrix (SpMM) algorithm that unlocks fine-grain parallelism while maintaining controlled, need-based, targeted synchronizations to achieve robust performance scaling. The MergePath-SpMM algorithm achieves superior performance using commercial off-the-shelf GPU processors when compared to state-of-the-art hardware accelerators and software-only implementations.
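
Note: this record does not describe the algorithm itself. As a rough illustration of the general merge-path idea the title refers to, the sketch below splits a CSR sparse-times-dense product into equal shares of "rows plus nonzeros", so that a single very long row cannot overload one worker. It follows the well-known merge-based decomposition used for sparse matrix-vector products, extended to a dense right-hand matrix. It is not the authors' implementation: all function and parameter names are hypothetical, the workers are simulated sequentially in plain Python, and whether the carry fix-up step corresponds to the paper's "targeted synchronizations" is an assumption.

```python
def merge_path_search(diagonal, row_end, nnz):
    """Locate where a diagonal crosses the merge path of
    (row end offsets) x (nonzero indices 0..nnz-1).
    Returns (rows finished, nonzeros consumed) before that point."""
    lo = max(diagonal - nnz, 0)
    hi = min(diagonal, len(row_end))
    while lo < hi:
        mid = (lo + hi) // 2
        if row_end[mid] <= diagonal - mid - 1:
            lo = mid + 1
        else:
            hi = mid
    return lo, diagonal - lo


def merge_path_spmm(indptr, indices, data, dense, num_workers):
    """Compute (CSR matrix) @ dense, with work divided along the merge path.
    Hypothetical helper for illustration only."""
    num_rows = len(indptr) - 1
    nnz = len(data)
    width = len(dense[0])
    row_end = indptr[1:]
    out = [[0.0] * width for _ in range(num_rows)]
    total = num_rows + nnz                  # length of the merge path
    per_worker = -(-total // num_workers)   # ceiling division
    carries = []

    # Each loop iteration is independent and would map to one thread.
    for w in range(num_workers):
        row, k = merge_path_search(min(w * per_worker, total), row_end, nnz)
        end_row, end_k = merge_path_search(
            min((w + 1) * per_worker, total), row_end, nnz)
        acc = [0.0] * width
        # Rows that end inside this worker's share are written directly.
        while row < end_row:
            while k < row_end[row]:
                for j in range(width):
                    acc[j] += data[k] * dense[indices[k]][j]
                k += 1
            out[row] = acc
            acc = [0.0] * width
            row += 1
        # Leftover nonzeros belong to a row that a later worker finishes.
        while k < end_k:
            for j in range(width):
                acc[j] += data[k] * dense[indices[k]][j]
            k += 1
        carries.append((end_row, acc))

    # Fix-up: the only step where one worker's result touches another's row.
    for row, acc in carries:
        if row < num_rows:
            for j in range(width):
                out[row][j] += acc[j]
    return out
```

For example, with `indptr = [0, 2, 2, 4]`, `indices = [0, 2, 1, 2]`, `data = [1, 2, 3, 4]` and `dense = [[1, 2], [3, 4], [5, 6]]`, the function returns the same product for any positive `num_workers`; only the division of work changes.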