GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement

Multi-GPU systems have become popular to cater to the growing demands for high parallelism and large memory capacity. However, the delivered performance is constrained by the non-uniform memory access (NUMA) overhead arising from data sharing and communication across multiple GPUs. Recent multi-GPUs...

Full description

Saved in:
Bibliographic Details
Published in2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA) pp. 1080 - 1094
Main Authors Wang, Yueqi, Li, Bingyao, Jaleel, Aamer, Yang, Jun, Tang, Xulong
Format Conference Proceeding
LanguageEnglish
Published IEEE 02.03.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Multi-GPU systems have become popular to cater to the growing demands for high parallelism and large memory capacity. However, the delivered performance is constrained by the non-uniform memory access (NUMA) overhead arising from data sharing and communication across multiple GPUs. Recent multi-GPUs employ unified virtual memory (UVM) to simplify the programming effort. In UVM-enabled multi-GPUs, three popular page placement schemes are adopted to mitigate the NUMA overheads: i) on-touch page migration, ii) access counter-based migration, and iii) page duplication. However, we observe that the preferred page placement scheme varies across i) different applications, ii) different pages of the same application, and iii) even different execution phases of a single page, making it challenging to find a "one-size-fits-all" page placement scheme. To this end, we propose GRIT, which dynamically and automatically determines the appropriate page placement schemes at runtime in a fine-grained manner to enhance multi-GPU performance and scalability. Experimental results indicate that GRIT achieves an average of 60%, 49%, and 29% performance improvements over uniformly adopting on-touch migration, access counter-based migration, and page duplication, respectively.
ISSN:2378-203X
DOI:10.1109/HPCA57654.2024.00085