In one environment, we found that a hostmonitor stopped logging after it failed to send a host failure notification.
I noticed that monitor_hosts exits as soon as it catches an exception. So there is a risk: if a host goes down later, no recovery will be triggered.
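
One possible direction, shown only as a minimal sketch and not as the actual hostmonitor code: catch the exception inside the loop, log it, and keep monitoring. The names run_monitor_loop, check_once, max_iterations and sleep are hypothetical and exist only for this illustration.

```python
import logging
import time

LOG = logging.getLogger(__name__)


def run_monitor_loop(check_once, interval=0.0, max_iterations=None,
                     sleep=time.sleep):
    """Call check_once repeatedly, logging and surviving any exception.

    check_once is a hypothetical stand-in for one monitoring pass
    (check host status, send notifications). max_iterations and sleep
    exist only so the loop can be bounded and exercised in a test.
    Returns the number of iterations that raised.
    """
    failures = 0
    iteration = 0
    while max_iterations is None or iteration < max_iterations:
        iteration += 1
        try:
            check_once()
        except Exception:
            # Do not let one failed pass (for example a notification
            # that could not be sent) stop the monitor for good.
            failures += 1
            LOG.exception("Monitoring iteration %d failed; continuing",
                          iteration)
        sleep(interval)
    return failures
```

With a loop like this, a failed notification would be logged with its traceback and the next pass would still run, so a host going down later could still be detected.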