Pegasus: Mapping Large-Scale Workflows to Distributed Resources

Many scientific advances today are derived from analyzing large amounts of data. The computations themselves can be very complex and consume significant resources. Scientific efforts are also not conducted by individual scientists; rather, they rely on collaborations that encompass many researchers...

Full description

Saved in:
Bibliographic Details
Published inWorkflows for e-Science pp. 376 - 394
Main Authors Deelman, Ewa, Mehta, Gaurang, Singh, Gurmeet, Su, Mei-Hui, Vahi, Karan
Format Book Chapter
LanguageEnglish
Published London Springer London
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Many scientific advances today are derived from analyzing large amounts of data. The computations themselves can be very complex and consume significant resources. Scientific efforts are also not conducted by individual scientists; rather, they rely on collaborations that encompass many researchers from various organizations. The analysis is often composed of several individual application components designed by different scientists. To describe the desired analysis, the components are assembled in a workflow where the dependencies between them are defined and the data needed for the analysis are identified. To support the scale of the applications, many resources are needed in order to provide adequate performance. These resources are often drawn from a heterogeneous pool of geographically distributed compute and data resources. Running large-scale, collaborative applications in such environments has many challenges. Among them are systematic management of the applications, their components, and the data, as well as successful and efficient execution on the distributed resources.
ISBN:9781846285196
1846285194
DOI:10.1007/978-1-84628-757-2_23