Warewulf Node Health Check (NHC)
Warewulf Node Health Check (NHC) is a periodic "node health check" script to be executed on each compute node to verify that the node is working properly. Nodes which are determined to be "unhealthy" can be marked as down or offline so as to prevent jobs from being scheduled or run on them. This helps increase the reliability and throughput of a cluster by reducing preventable job failures due to misconfiguration, hardware failures, etc.
Source Files (show merged sources derived from linked package)