After some investigation work, it was found that the script to reboot the cluster nodes did not have enough time delay between each node reboot to stop the network from over-congesting. As a result, was not able to respond to all DHCP requests from the nodes in a timely manner, and not all nodes were rebooting back into the cluster. The node reboot script has therefore been updated with an increased delay between each node rebooting to ensure that the network does not become over-congested during node rebooting, so that all DHCP requests are answered reliably by the head node.
Node Reboot Script Update
Posted in Upgrades