Pacemaker neutron agent scripts start/stop/migration will fail if management vip moved recently
Affects | Status | Importance | Assigned to | Milestone | ||
---|---|---|---|---|---|---|
Fuel for OpenStack | ||||||
4.1.x |
Fix Committed
|
High
|
Fuel Library (Deprecated) |
Bug Description
{"build_id": "2014-03-
Steps to reproduce:
1 - Deploy Ubuntu HA (Cinder LVM backend, Swift glance backend, Neutron with GRE segmentation) 3 computes - 1 controller - 1 storage
2 - Log into first controller and run crm_resource -r vip__management_old --move --node node-3 (NOTE: replace node-3 where it is the nonprimary controller)
3 - Wait ~60s for keystone and other services to recover
4 - Run neutron agent-list
Results:
# neutron agent-list
+------
| id | agent_type | host | alive | admin_state_up |
+------
| 09699e60-
| 12236192-
| 2c0ec06d-
| ad4c9181-
| bd893993-
| f7451cfd-
# crm resource show | egrep 'l3|dhcp'
p_neutron-
p_neutron-l3-agent (ocf::mirantis:
From l3 agent logs:
p_neutron-
p_neutron-
p_neutron-
p_neutron-
p_neutron-
p_neutron-
(2013, "Lost connection to MySQL server at 'reading initial communication packet', system error: 0") None None (HTTP 500) ]
p_neutron-
p_neutron-
p_neutron-
p_neutron-
p_neutron-
We should tune OCF scripts and/or q-agent-cleanup.py to be more tolerant of keystone being unavailable for up to 2 minutes.
tags: | added: in progress |
tags: | added: backports-4.1.1 |
tags: | added: ha |
Changed in fuel: | |
assignee: | Dmitry Borodaenko (dborodaenko) → Sergey Vasilenko (xenolog) |
summary: |
- Neutron L3/DHCP agents fail when VIP fails over + Pacemaker neutron agent scripts start/stop/migration will fail if + management vip moved recently |
no longer affects: | fuel/5.0.x |
no longer affects: | fuel |
Fix proposed to branch: master /review. openstack. org/77895
Review: https:/