Landing Stencil Code on Godson-T

The advent of multi-core/many-core chip technology offers both an extraordinary opportunity and a profound challenge. In particular, computer architects and system software designers are faced with a unique opportunity to introducing new architecture features as well as adequate compiler technology...

Full description

Saved in:
Bibliographic Details
Published inJournal of computer science and technology Vol. 25; no. 4; pp. 886 - 894
Main Author 崔慧敏 王蕾 范东睿 冯晓兵
Format Journal Article
LanguageEnglish
Published Boston Springer US 01.07.2010
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1000-9000
1860-4749
DOI10.1007/s11390-010-9373-6

Cover

More Information
Summary:The advent of multi-core/many-core chip technology offers both an extraordinary opportunity and a profound challenge. In particular, computer architects and system software designers are faced with a unique opportunity to introducing new architecture features as well as adequate compiler technology -- together they may have profound impact. This paper presents a case study (using the 1-D Jacobi computation) of compiler-amendable performance optimization techniques on a many-core architecture Godson-T. Godson-T architecture has several unique features that are chosen for this study: 1) chip-level global addressable memory in particular the scratchpad memories (SPM) local to the processing cores; 2) fine-grain memory based synchronization (e.g., full-empty bit for fine-grain synchronization). Leveraging state-of-the-art performance optimization methods for 1-D stencil parallelization (e.g., timed tiling and variants), we developed and implement a number of many-core-based optimization for Godson-T. Our experimental study shows good performance in both execution time speedup and scalability, validate the value of globally accessed SPM and fine-grain synchronization mechanism (full-empty bits) under the Godson-T, and provides some useful guidelines for future compiler technology of many-core chip architectures.
Bibliography:11-2296/TP
TP332
many-core, stencil, Jacobi, compiler
SPM, fine-grain synchronization
many-core, stencil, Jacobi, compiler; SPM, fine-grain synchronization
TG76
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Article-2
ObjectType-Feature-1
content type line 23
ISSN:1000-9000
1860-4749
DOI:10.1007/s11390-010-9373-6