Contention Tracking in GPU Last-Level Cache

The Last-level cache (LLC) is one of the main GPU's shared resources that contributes to improve performance but also increases individual kernel's performance variability. This is detrimental in scenarios in which some level of performance predictability is required. While predictability...

Full description

Saved in:

Bibliographic Details
Published in	2022 IEEE 40th International Conference on Computer Design (ICCD) pp. 76 - 79
Main Authors	Barrera, Javier, Kosmidis, Leonidas, Tabani, Hamid, Abella, Jaume, Cazorla, Francisco J.
Format	Conference Proceeding
Language	English
Published	IEEE 01.10.2022
Subjects	cache contention Electric breakdown Focusing GPU Graphics processing units Hardware Kernel LLC QoS Quality of service Timing tracking
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The Last-level cache (LLC) is one of the main GPU's shared resources that contributes to improve performance but also increases individual kernel's performance variability. This is detrimental in scenarios in which some level of performance predictability is required. While predictability can be regained by deploying cache partitioning (isolation) mechanisms, isolation negatively affects performance efficiency. This work shows that not partitioning the LLC and providing the ability to track the contention that kernels generate on each other allows them to share LLC space, hence increasing efficiency, while the system designer obtains a clear view of how each kernel affects each other in the LLC so as to balance performance and predictability goals. In this line, we propose GPU demotion counters (GDC), a low-overhead hardware mechanism to track contention that kernels generate on each other in the shared LLC.
ISSN:	2576-6996
DOI:	10.1109/ICCD56317.2022.00021