Windows HPC Server 2008 Diagnostics
On large computing clusters, system administrators spend much of their time on diagnostics. Common tasks include:
- Validate the cluster after deployment or a configuration change.
- Troubleshoot failures.
- Measure performance degradation over time.
Windows HPC Server 2008 includes 16 built-in diagnostic tests to help system administrators with these tasks. The tests fall into three categories: infrastructure, configuration report, and performance. Infrastructure tests cover the scheduler, system services, connectivity, and Service Oriented Architecture (the WCF broker model). Configuration report tests cover applications, the network, software updates, and system services. Finally, two MPIPingPong tests measure cluster performance in terms of latency and bandwidth.
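To make the latency and bandwidth figures concrete, here is a minimal sketch of how a ping-pong style measurement is commonly turned into those two numbers. It is an illustration under stated assumptions, not the MPIPingPong implementation: the function name `pingpong_metrics`, its parameters, and the sample timing are hypothetical, and the calculation simply assumes that one-way time is half of the measured round trip.

```python
def pingpong_metrics(message_bytes, round_trip_seconds):
    """Derive one-way latency and bandwidth from one ping-pong exchange.

    Hypothetical helper for illustration only. It assumes a symmetric link,
    so the one-way time is taken as half of the measured round-trip time.
    Returns (latency in microseconds, bandwidth in megabytes per second).
    """
    if message_bytes < 0:
        raise ValueError("message_bytes must be non-negative")
    if round_trip_seconds <= 0:
        raise ValueError("round_trip_seconds must be positive")

    one_way_seconds = round_trip_seconds / 2.0

    # Latency is usually quoted for very small messages, in microseconds.
    latency_us = one_way_seconds * 1e6

    # Bandwidth is usually quoted for large messages: bytes moved per second,
    # expressed here in MB/s (1 MB = 10**6 bytes).
    bandwidth_mb_s = message_bytes / one_way_seconds / 1e6

    return latency_us, bandwidth_mb_s


if __name__ == "__main__":
    # Made-up sample values: a 1 MB message with a 20 ms round trip.
    latency, bandwidth = pingpong_metrics(1_000_000, 0.02)
    print(f"one-way time: {latency:.1f} us, bandwidth: {bandwidth:.1f} MB/s")
```

In practice a latency test uses very small messages and a bandwidth test uses large ones, which is one plausible reason for having two separate tests; tracking both numbers over time is what makes performance degradation visible.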
The diagnostic tests are flexible and easy to run, and their results can be filtered and searched. System administrators can use the test results as a starting point for further diagnosis with built-in tools such as clusrun, Remote Desktop, and the node template features.