AOCMS: An Adaptive and Scalable Monitoring System for Large-Scale Clusters

Bibliographic Details
Published in: 2006 IEEE Asia-Pacific Conference on Services Computing (APSCC'06), pp. 466-472
Main Authors: Zhenghua Xue, Xiaoshe Dong, Weiguo Wu
Format: Conference Proceeding
Language: English
Published: IEEE, 01.12.2006
Summary: In this paper, we present the design and implementation of AOCMS, an adaptive, scalable, and efficient monitoring system for large-scale clusters. We describe the adaptive architecture of AOCMS in detail and focus on the techniques that enhance its adaptability, scalability, and efficiency. These techniques include: a solution for monitoring a heterogeneous cluster; a universal applet-servlet communication controller responsible for communication between the clients and the Web server; adaptive pools that provide threads or database connections to the monitoring tasks on demand; and an AOP-based alarm that decouples the alarming logic from the monitoring logic. We also measured the performance of AOCMS. The results show that AOCMS runs with low overhead and responds to clients quickly.
ISBN: 9780769527512, 0769527515
DOI: 10.1109/APSCC.2006.34