Profiling techniques for communication in fine-grained parallel languages

Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the f...

Full description

Saved in:
Bibliographic Details
Published inSoftware, practice & experience Vol. 29; no. 6; pp. 519 - 550
Main Authors Scheiman, Chris J., Haake, Bjoern, Ibel, Maximilian, Schauser, Klaus E.
Format Journal Article
LanguageEnglish
Published Chichester, UK John Wiley & Sons, Ltd 01.05.1999
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the fine‐grained parallel programming language Split‐C, which provides a simple global address space memory model. One profiler provides a detailed analysis of a program's execution. The other profiler collects cumulative information. As our experience shows, it is quite challenging to profile programs that make use of efficient, low‐overhead communication. We incorporated techniques which minimize profiling effects on the running program, and quantified the profiling overhead. We present several Split‐C applications showing that the profiler is useful in determining performance bottlenecks. Copyright © 1999 John Wiley & Sons, Ltd.
Bibliography:istex:D81E1AE876868DF071D6356BB6EEFC9DF0D5FEAF
ArticleID:SPE247
ark:/67375/WNG-KJN7NN2D-H
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:0038-0644
1097-024X
DOI:10.1002/(SICI)1097-024X(199905)29:6<519::AID-SPE247>3.0.CO;2-W