Contention Tracking in GPU Last-Level Cache

The Last-level cache (LLC) is one of the main GPU's shared resources that contributes to improve performance but also increases individual kernel's performance variability. This is detrimental in scenarios in which some level of performance predictability is required. While predictability...

Full description

Saved in:
Bibliographic Details
Published in2022 IEEE 40th International Conference on Computer Design (ICCD) pp. 76 - 79
Main Authors Barrera, Javier, Kosmidis, Leonidas, Tabani, Hamid, Abella, Jaume, Cazorla, Francisco J.
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.10.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The Last-level cache (LLC) is one of the main GPU's shared resources that contributes to improve performance but also increases individual kernel's performance variability. This is detrimental in scenarios in which some level of performance predictability is required. While predictability can be regained by deploying cache partitioning (isolation) mechanisms, isolation negatively affects performance efficiency. This work shows that not partitioning the LLC and providing the ability to track the contention that kernels generate on each other allows them to share LLC space, hence increasing efficiency, while the system designer obtains a clear view of how each kernel affects each other in the LLC so as to balance performance and predictability goals. In this line, we propose GPU demotion counters (GDC), a low-overhead hardware mechanism to track contention that kernels generate on each other in the shared LLC.
ISSN:2576-6996
DOI:10.1109/ICCD56317.2022.00021