Then the trigger kicks in, but it's set not to act before 20s, so 17:46:51 was the earliest time it should have kicked in, and the latest 15s before, 17:46:36, given Slurm's trigger batching. Barely crosses the DOWN event, but it does.
Apr 07 17:47:11 xc-control slurmctld[4676]: update_node: node xc-node-n1-48 reason set to: recovery
Apr 07 17:47:11 xc-control slurmctld[4676]: update_node: node xc-node-n1-48 state set to DRAINED*
Apr 07 17:47:11 xc-control slurmctld[4676]: update_node: node xc-node-n1-48 reason set to: recovery
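For anyone wanting to re-check the timing arithmetic above, here is a minimal sketch of one reading of it: taking 17:47:11 (the timestamp in the log lines) as the moment the trigger acted, and working backwards by the 20s offset and the 15s batching interval. The function name `event_window` and its parameters are made up for illustration, and the 15s batching figure is taken from the comment above rather than from Slurm documentation.

```python
from datetime import datetime, timedelta

def event_window(action_time, offset_s, batch_s):
    """Return (earliest, latest) times at which the triggering event
    could have occurred, given when the trigger action was observed.

    offset_s: seconds the trigger waits after the event before acting.
    batch_s:  assumed interval at which triggers are processed in batches.
    """
    latest = action_time - timedelta(seconds=offset_s)
    earliest = latest - timedelta(seconds=batch_s)
    return earliest, latest

# Timestamp of the "reason set to: recovery" log lines (date part omitted).
action = datetime.strptime("17:47:11", "%H:%M:%S")
earliest, latest = event_window(action, offset_s=20, batch_s=15)
print(earliest.strftime("%H:%M:%S"), latest.strftime("%H:%M:%S"))
# prints: 17:46:36 17:46:51
```

Under these assumptions the window comes out as 17:46:36 to 17:46:51, matching the two times quoted above.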
BTW: There is not a single timeout of 240s either in slurm.baseconf or in documented defaults in man slurm.conf.
Then SUDDENLY
Glad you've noticed, thanks.
But WHY 2 minutes???
The "can't find address for" continues for a few seconds