Address-Value Delta (AVD) Prediction Increasing the Effectiveness of Runahead Execution by Exploiting Regular Memory Allocation Patterns

While runahead execution is effective at parallelizing independent long-latency cache misses, it is unable to parallelize dependent long-latency cache misses. To overcome this limitation, this paper proposes a novel technique, address-value delta (AVD) prediction. An AVD predictor keeps track of the...

Full description

Saved in:
Bibliographic Details
Published in38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'05) pp. 233 - 244
Main Authors Mutlu, Onur, Kim, Hyesoon, Patt, Yale N.
Format Conference Proceeding
LanguageEnglish
Published Washington, DC, USA IEEE Computer Society 12.11.2005
IEEE
SeriesACM Conferences
Subjects
Online AccessGet full text
ISBN9780769524405
0769524400
ISSN1072-4451
DOI10.1109/MICRO.2005.11

Cover

More Information
Summary:While runahead execution is effective at parallelizing independent long-latency cache misses, it is unable to parallelize dependent long-latency cache misses. To overcome this limitation, this paper proposes a novel technique, address-value delta (AVD) prediction. An AVD predictor keeps track of the address (pointer) load instructions for which the arithmetic difference (i.e., delta) between the effective address and the data value is stable. If such a load instruction incurs a long-latency cache miss during runahead execution, its data value is predicted by subtracting the stable delta from its effective address. This prediction enables the pre-execution of dependent instructions, including load instructions that incur long-latency cache misses. We describe how, why, and for what kind of loads AVD prediction works and evaluate the design tradeoffs in an implementable AVD predictor. Our analysis shows that stable AVDs exist because of patterns in the way data structures are allocated in memory. Our results show that augmenting a runahead processor with a simple, 16-entry AVD predictor improves the average execution time of a set of pointer-intensive applications by 12.1%.
Bibliography:SourceType-Conference Papers & Proceedings-1
ObjectType-Conference Paper-1
content type line 25
ISBN:9780769524405
0769524400
ISSN:1072-4451
DOI:10.1109/MICRO.2005.11