Computer hardware fault diagnosis

Methods, apparatus, and computer program products are disclosed for computer hardware fault diagnosis carried out in a parallel computer, where the parallel computer includes a plurality of compute nodes. The compute nodes are coupled for data communications by at least two independent data communic...

Full description

Saved in:
Bibliographic Details
Main Authors RATTERMAN, JOSEPH D, ARCHER, CHARLES J, MEGERIAN, MARK G, SMITH, BRIAN E
Format Patent
LanguageChinese
English
Published 16.03.2008
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Methods, apparatus, and computer program products are disclosed for computer hardware fault diagnosis carried out in a parallel computer, where the parallel computer includes a plurality of compute nodes. The compute nodes are coupled for data communications by at least two independent data communications networks, where each data communications network includes data communications links among the computer nodes. Typical embodiments carry out hardware fault diagnosis by executing a collective operation through a first data communications network upon a plurality of the computer nodes of the computer, executing the same collective operation through a second data communications network upon the same plurality of the computer nodes of the computer, and comparing results of the collective operations.
Bibliography:Application Number: TW200796111869