Node crash recovery
Last updated
Last updated
To speed up node crash recovery process, edit kube-controller-manager pod:
Add/edit the following parameters:
--pod-eviction-timeout=30s
(default 5m0s)
--node-monitor-period=2s
(default 5s)
--node-monitor-grace-period=16s
(default 40s)
--pod-eviction-timeout=30s
(default 5m)
--node-status-update-frequency
And, of course, you can always have your deployments with replica 2 and service will be up even if one node goes down.