Warewulf Node Health Check (NHC)
Warewulf Node Health Check (NHC) is a periodic "node health check" script to be executed on each compute node to verify that the node is working properly. Nodes which are determined to be "unhealthy" can be marked as down or offline so as to prevent jobs from being scheduled or run on them. This helps increase the reliability and throughput of a cluster by reducing preventable job failures due to misconfiguration, hardware failures, etc.
This uses the fork at https://github.com/UCL-ARC/nhc/tree/ucl#
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout home:ellessar/warewulf4-nhc && cd $_ - Create Badge
Refresh
Source Files
| Filename | Size | Changed |
|---|---|---|
| _service | 0000000487 487 Bytes | |
| gen_tarball.sh | 0000000115 115 Bytes | |
| warewulf4-nhc-1.5.git.1741185699.2082e2e.tar.gz | 0000153713 150 KB | |
| warewulf4-nhc.changes | 0000003334 3.26 KB | |
| warewulf4-nhc.spec | 0000003268 3.19 KB |
Comments 0