GPGPU-Accelerated Parallel and Fast Simulation of Thousand-Core Platforms

Bibliographic Details
Published in	2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp. 53-62
Main Authors	Pinto, C., Raghav, S., Marongiu, A., Ruggiero, M., Atienza, D., Benini, L.
Format	Conference Proceeding
Language	English
Published	IEEE 01.05.2011
Subjects	Computational modeling; Computer architecture; CUDA; GPGPU; Graphics processing unit; Hardware; ISS; manycore; NoC; Programming; simulation; Switches

Summary:	The multicore revolution and the ever-increasing complexity of computing systems are dramatically changing the design, analysis and programming of computing platforms. Future architectures will feature hundreds to thousands of simple processors and on-chip memories connected through a network-on-chip. Architectural simulators will remain primary tools for design space exploration, software development and performance evaluation of these massively parallel architectures. However, architectural simulation performance is a serious concern, as current virtual platforms and simulation technology cannot cope with the complexity of future thousand-core scenarios. The main contribution of this paper is a new simulation approach and technology for many-core processors that exploits the enormous parallel processing capability of low-cost and widely available General-Purpose Graphics Processing Units (GPGPUs). The simulation of many-core architectures does exhibit a high level of parallelism and is inherently parallelizable, but GPGPU acceleration of architectural simulation requires an in-depth revision of the data structures and functional partitioning traditionally used in parallel simulation. We demonstrate our GPGPU simulator on a target architecture composed of several ARM ISA-based cores, with instruction and data caches, connected through a Network-on-Chip (NoC). Our experiments confirm the feasibility of our approach.
ISBN:	1457701294 9781457701290
DOI:	10.1109/CCGrid.2011.64