[Issue]
ComputeNodes are managed by pacemaker_remote in my environment.
When one ComputeNode is isolated in the network, masakari-hostmonitors on the other ComputeNodes will send failure notification about the isolated ComputeNode to masakari-api.
At that time, the isolated masakari-hostomonitor will recognize other ComputeNodes as offline. So it sends failure notification about online ComputeNodes.
As a result, masakari-engine runs the recovery procedure to online ComputeNodes.
[Cause]
The current masakari-hostmonitor can't determine whether or not it is isolated in the network if ComputeNodes are managed by pacemaker_remote.
[Solution]
The ComputeNode managed by pacemaker_remote should determine recognize itself as offline when it is isolated.
The state monitoring process should be skipped in that case.
[Issue] hostmonitors on the other ComputeNodes will send failure notification about the isolated ComputeNode to masakari-api. hostomonitor will recognize other ComputeNodes as offline. So it sends failure notification about online ComputeNodes.
ComputeNodes are managed by pacemaker_remote in my environment.
When one ComputeNode is isolated in the network, masakari-
At that time, the isolated masakari-
As a result, masakari-engine runs the recovery procedure to online ComputeNodes.
[Cause] hostmonitor can't determine whether or not it is isolated in the network if ComputeNodes are managed by pacemaker_remote.
The current masakari-
masakari- hostmonitor with pacemaker(not remote) will wait until it is killed if it is isolated in the network. It is implemented in the following code. /github. com/openstack/ masakari- monitors/ blob/master/ masakarimonitor s/hostmonitor/ host_handler/ handle_ host.py# L398-L402>
<https:/
But masakari- hostmonitor with pacemaker_remote won't determine if it is isolated. /github. com/openstack/ masakari- monitors/ blob/master/ masakarimonitor s/hostmonitor/ host_handler/ handle_ host.py# L93-L95>
<https:/
[Solution]
The ComputeNode managed by pacemaker_remote should determine recognize itself as offline when it is isolated.
The state monitoring process should be skipped in that case.