Monitoring the Earth System Grid with MDS4

In production Grids for scientific applications, service and resource failures must be detected and addressed quickly. In this paper, we describe the monitoring infrastructure used by the Earth System Grid (ESG) project, a scientific collaboration that supports global climate research. ESG uses the...

Full description

Saved in:
Bibliographic Details
Published in2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06) p. 69
Main Authors Chervenak, Ann, Schopf, Jennifer M., Pearlman, Laura, Su, Mei-hui, Bharathi, Shishir, Cinquini, Luca, D'Arcy, Mike, Miller, Neill, Bernholdt, David
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2006
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In production Grids for scientific applications, service and resource failures must be detected and addressed quickly. In this paper, we describe the monitoring infrastructure used by the Earth System Grid (ESG) project, a scientific collaboration that supports global climate research. ESG uses the Globus Toolkit Monitoring and Discovery System (MDS4) to monitor its resources. We describe how the MDS4 Index Service collects information about ESG resources and how the MDS4 Trigger Service checks specified failure conditions and notifies system administrators when failures occur. We present monitoring statistics for May 2006 and describe our experiences using MDS4 to monitor ESG resources over the last two years.
ISBN:0769527345
9780769527345
DOI:10.1109/E-SCIENCE.2006.261153