Hierarchical compilation of macro dataflow graphs for multiprocessors with local memory
This paper presents a hierarchical approach for compiling macro dataflow graphs for multiprocessors with local memory. Macro dataflow graphs comprise several nodes (or macro operations) that must be executed subject to prespecified precedence constraints. Programs consisting of multiple nested loops...
Saved in:
Published in | IEEE transactions on parallel and distributed systems Vol. 5; no. 7; pp. 720 - 736 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Los Alamitos, CA
IEEE
01.07.1994
IEEE Computer Society |
Subjects | |
Online Access | Get full text |
ISSN | 1045-9219 |
DOI | 10.1109/71.296318 |
Cover
Loading…
Summary: | This paper presents a hierarchical approach for compiling macro dataflow graphs for multiprocessors with local memory. Macro dataflow graphs comprise several nodes (or macro operations) that must be executed subject to prespecified precedence constraints. Programs consisting of multiple nested loops, where the precedence constraints between the loops are known, can be viewed as macro dataflow graphs. The hierarchical compilation approach comprises a processor allocation phase followed by a partitioning phase. In the processor allocation phase, using estimated speedup functions for the macro nodes, computationally efficient techniques establish the sequencing and parallelism of macro operations for close-to-optimal run-times. The second phase partitions the computations in each macro node to maximize communication locality for the level of parallelism determined by the processor allocation phase. The same approach can also be used for programs consisting of multiple loop nests, when each of the nested loops can be characterized by a speedup function. These ideas have been implemented in a prototype structure-driven compiler, SDC, for expressions of matrix operations. The paper presents the performance of the compiler for several matrix expressions on a simulator of the Alewife multiprocessor.< > |
---|---|
Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
ISSN: | 1045-9219 |
DOI: | 10.1109/71.296318 |