Managing Data Placement in Memory Systems with Multiple Memory Controllers

Bibliographic Details
Published in: International Journal of Parallel Programming, Vol. 40, No. 1, pp. 57-83
Main Authors: Awasthi, M., Nellans, D., Sudan, K., Balasubramonian, R., Davis, A.
Format: Journal Article
Language: English
Published: Boston: Springer US, 01.02.2012 (Springer Nature B.V.)
Summary: Modern processors such as Tilera’s Tile64, Intel’s Nehalem, and AMD’s Opteron are migrating memory controllers (MCs) on-chip, while maintaining a large, flat memory address space. This trend to utilize multiple MCs will likely continue, and a core or socket will consequently need to route memory requests to the appropriate MC via an inter- or intra-socket interconnect fabric similar to AMD’s HyperTransport™ or Intel’s QuickPath Interconnect™. Such systems are therefore subject to non-uniform memory access (NUMA) latencies because of the time spent traveling to remote MCs. Each MC will act as the gateway to a particular region of the physical memory. Data placement will therefore become increasingly critical in minimizing memory access latencies. Increased competition for memory resources will also increase the memory access latency variation in future systems. Proper allocation of workload data to the appropriate MC will be important in decreasing the variation and average latency when servicing memory requests. The allocation strategy will need to be aware of queuing delays, on-chip latencies, and row-buffer hit rates for each MC. In this paper, we propose dynamic mechanisms that take these factors into account when placing data in appropriate slices of physical memory. We introduce adaptive first-touch page placement and dynamic page-migration mechanisms to reduce DRAM access delays for multi-MC systems. We also introduce policies that can handle data placement in memory systems that have regions with heterogeneous properties. The proposed policies yield average performance improvements of 6.5% for adaptive first-touch page placement and 8.9% for a dynamic page-migration policy for a system with homogeneous DRAM DIMMs. We also show improvements in systems that contain DIMMs with different performance characteristics.
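To make the allocation criteria in the summary concrete, below is a minimal C sketch of how an adaptive first-touch policy could score candidate memory controllers using the three factors the summary names (queuing delay, on-chip latency, and row-buffer hit rate). The struct fields, weights, and function names are illustrative assumptions, not the paper's actual implementation.

#include <stddef.h>

/* Illustrative sketch: score each memory controller (MC) for a first-touch
 * page placement by combining the factors named in the summary.  Field
 * names and weights are hypothetical. */
struct mc_stats {
    double queue_delay;   /* average queuing delay observed at this MC        */
    double hop_latency;   /* on-chip interconnect latency from the core       */
    double row_hit_rate;  /* fraction of accesses hitting an open row, 0..1   */
};

/* Hypothetical weights; a real policy would tune these empirically. */
#define W_QUEUE 1.0
#define W_HOPS  1.0
#define W_ROW   1.0

static double mc_cost(const struct mc_stats *s)
{
    /* Higher queue delay and distance raise the cost; a higher row-buffer
     * hit rate lowers it, so it enters with a negative weight. */
    return W_QUEUE * s->queue_delay
         + W_HOPS  * s->hop_latency
         - W_ROW   * s->row_hit_rate;
}

/* Return the index of the lowest-cost MC for a newly touched page. */
size_t pick_mc_first_touch(const struct mc_stats *mcs, size_t num_mcs)
{
    size_t best = 0;
    for (size_t i = 1; i < num_mcs; i++)
        if (mc_cost(&mcs[i]) < mc_cost(&mcs[best]))
            best = i;
    return best;
}

A dynamic page-migration policy of the kind the summary describes could periodically re-evaluate such a cost for frequently accessed pages and move a page when another MC's cost falls sufficiently below that of its current home.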
ISSN: 0885-7458; 1573-7640
DOI: 10.1007/s10766-011-0178-1