Profiling techniques for communication in fine-grained parallel languages
Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the f...
Saved in:
Published in | Software, practice & experience Vol. 29; no. 6; pp. 519 - 550 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Chichester, UK
John Wiley & Sons, Ltd
01.05.1999
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the fine‐grained parallel programming language Split‐C, which provides a simple global address space memory model. One profiler provides a detailed analysis of a program's execution. The other profiler collects cumulative information. As our experience shows, it is quite challenging to profile programs that make use of efficient, low‐overhead communication. We incorporated techniques which minimize profiling effects on the running program, and quantified the profiling overhead. We present several Split‐C applications showing that the profiler is useful in determining performance bottlenecks. Copyright © 1999 John Wiley & Sons, Ltd. |
---|---|
Bibliography: | istex:D81E1AE876868DF071D6356BB6EEFC9DF0D5FEAF ArticleID:SPE247 ark:/67375/WNG-KJN7NN2D-H ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
ISSN: | 0038-0644 1097-024X |
DOI: | 10.1002/(SICI)1097-024X(199905)29:6<519::AID-SPE247>3.0.CO;2-W |