Identifying likely faulty components in a distributed system

In general, techniques are described for automatically identifying likely faulty components in massively distributed complex systems. In some examples, snapshots of component parameters are automatically repeatedly fed to a pre-trained classifier and the classifier indicates whether each received sn...

Full description

Saved in:
Bibliographic Details
Main Authors RANJAN ASHISH, NAKIL HARSHAD BHASKAR, SINGLA ANKUR, AJAY HAMPAPUR, GHOSE TIRTHANKAR, RAMESH ND, MARQUES PEDRO R, REDDY RAJASHEKAR
Format Patent
LanguageEnglish
Published 15.07.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In general, techniques are described for automatically identifying likely faulty components in massively distributed complex systems. In some examples, snapshots of component parameters are automatically repeatedly fed to a pre-trained classifier and the classifier indicates whether each received snapshot is likely to belong to a fault and failure class or to a non-fault/failure class. Components whose snapshots indicate a high likelihood of fault or failure are investigated, restarted or taken off line as a pre-emptive measure. The techniques may be applied in a massively distributed complex system such as a data center.
Bibliography:Application Number: CN201510152921