mmm tries to kill_host when replication delays in PASSIVE mode
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
mysql-mmm | Confirmed | High | Unassigned |
Bug Description
In lib/Monitor/
if (!$self-
}
However, send_agent_status() starts out with this check:
# Never send anything to agents if we are in PASSIVE mode
# Never send anything to agents if we have no network connection
return if ($self->is_passive || !$main::have_net);
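A minimal sketch of why that early return matters, under stated assumptions: ToyMonitor, handle_state_change and the killed list are hypothetical stand-ins, the $main::have_net part of the check is left out, and the caller pattern (treating a false return as a failed delivery) is assumed rather than copied from the module. Only is_passive and send_agent_status are names from the report.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the monitor object.
package ToyMonitor;

sub new {
    my ($class, %args) = @_;
    return bless { passive => $args{passive}, killed => [] }, $class;
}

sub is_passive { return $_[0]->{passive} }

# Mirrors the quoted guard: in PASSIVE mode nothing is sent, and the
# bare return yields a false value to the caller.
sub send_agent_status {
    my ($self, $host) = @_;
    return if ($self->is_passive);
    return 1;    # pretend the agent acknowledged the status
}

# Assumed caller pattern: a false return is read as "agent unreachable".
sub handle_state_change {
    my ($self, $host) = @_;
    if (!$self->send_agent_status($host)) {
        push @{ $self->{killed} }, $host;    # stands in for _kill_host()
    }
    return;
}

package main;

my $active  = ToyMonitor->new(passive => 0);
my $passive = ToyMonitor->new(passive => 1);
$_->handle_state_change('db2') for ($active, $passive);

printf "active killed: %d, passive killed: %d\n",
    scalar @{ $active->{killed} }, scalar @{ $passive->{killed} };
# prints "active killed: 0, passive killed: 1"
```

In this toy model the passive monitor never contacts the agent, yet it is the only one that reaches the kill path.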
Because of this combination, when the monitor is in PASSIVE mode, a variety of benign circumstances (such as slave replication falling more than max_backlog seconds behind) cause it to call the _kill_host() routine and, if a kill binary is configured, to STONITH the host unnecessarily. If the kill_host option is not configured, the error log instead fills with the following frequent and confusing message:
"Could not kill host '%s' - there may be some duplicate ips now! (There's no binary configured for killing hosts."
Proposed solution:
Each of those 5 locations which check if(!$self-
(version: current launchpad trunk, 2.2.1)
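One way to avoid the spurious kill is for the caller to consult is_passive itself before interpreting the return value. The sketch below is an illustration under assumptions, not necessarily the change made in the linked branches: GuardedMonitor, handle_state_change, the killed list and the 'unreachable' host name are all hypothetical.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the monitor object.
package GuardedMonitor;

sub new {
    my ($class, %args) = @_;
    return bless { passive => $args{passive}, killed => [] }, $class;
}

sub is_passive { return $_[0]->{passive} }

sub send_agent_status {
    my ($self, $host) = @_;
    return if ($self->is_passive);           # same guard as in the report
    return 0 if ($host eq 'unreachable');    # simulated genuine failure
    return 1;
}

sub handle_state_change {
    my ($self, $host) = @_;
    # Assumed guard: in PASSIVE mode nothing was sent, so a false
    # return must not be treated as a delivery failure.
    return if ($self->is_passive);
    if (!$self->send_agent_status($host)) {
        push @{ $self->{killed} }, $host;    # stands in for _kill_host()
    }
    return;
}

package main;

my $passive = GuardedMonitor->new(passive => 1);
my $active  = GuardedMonitor->new(passive => 0);

$passive->handle_state_change('db2');            # skipped entirely
$active->handle_state_change('db2');             # delivered, no kill
$active->handle_state_change('unreachable');     # real failure, kill

printf "passive killed: %d, active killed: %d\n",
    scalar @{ $passive->{killed} }, scalar @{ $active->{killed} };
# prints "passive killed: 0, active killed: 1"
```

With the guard in place, the kill path is still reached for a genuine delivery failure in ACTIVE mode but never in PASSIVE mode.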
Related branches
- mysql-mmm-core: Pending requested
- Diff: 43 lines (+5/-0), 1 file modified: lib/Monitor/Monitor.pm (+5/-0)
- Kenny Gryp: Approve
- Diff: 13 lines (+2/-1), 1 file modified: lib/Monitor/Monitor.pm (+2/-1)
- mysql-mmm-core: Pending requested
- Diff: 13 lines (+2/-1), 1 file modified: lib/Monitor/Monitor.pm (+2/-1)
Changed in mysql-mmm:
importance: Undecided → High
status: New → Confirmed
Changed in mysql-mmm:
milestone: none → 2.2.2
Changed in mysql-mmm:
milestone: 2.2.2 → none