Scalable method of continuous monitoring the remotely accessible resources against node failures for very large clusters
The notion of controlling, using and monitoring remote resources in a distributed data processing system through the use of proxy resource managers and agents is extended to provide failover capability so that resource coverage is preserved and maintained even in the event of either temporary or lon...
Saved in:
Main Authors | , |
---|---|
Format | Patent |
Language | English |
Published |
12.10.2010
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The notion of controlling, using and monitoring remote resources in a distributed data processing system through the use of proxy resource managers and agents is extended to provide failover capability so that resource coverage is preserved and maintained even in the event of either temporary or longer duration node failure. Mechanisms are provided for consistent determination of resource status. Mechanisms are also provided which facilitate the joining of nodes to a group of nodes while still preserving remote resource operations. Additional mechanisms are also provided for the return of remote resource management to the control of a previously failed, but now recovered node, even if the failure had resulted in a node reset. |
---|---|
Bibliography: | Application Number: US20080146008 |