Problem with instance connectivity after l3 agent rescheduling
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
High
|
Vladimir Kuklin | ||
4.1.x |
Fix Committed
|
High
|
Sergey Kolekonov | ||
5.0.x |
In Progress
|
High
|
Vitaly Sedelnik | ||
5.1.x |
Fix Committed
|
High
|
Vladimir Kuklin | ||
6.0.x |
Fix Released
|
High
|
Vladimir Kuklin |
Bug Description
{
"build_id": "2014-11-
"ostf_sha": "720cc1308c3a70
"build_number": "93",
"auth_
"api": "1.0",
"nailgun_sha": "e1e58a78f07d33
"production": "docker",
"fuelmain_sha": "f09f3624e55da3
"astute_sha": "3c374c9f7bfbdb
"feature_
"mirantis"
],
"release": "6.0",
"release_
],
}
}
},
"fuellib_sha": "0b1c4ab5220096
}
Steps:
1. Create next cluster - HA, CentOS, Neutron GRE, 3 controller, 2 compute, 1 cinder
2. Deploy cluster
3. Create instance
4. Manually reschedule router from one l3-agent to another
5. Check network connectivity from instance via dhcp namespace
ip netns exec qdhcp-bf6d240a-
6. Instance connectivity works fine
7. Go to node on which l3 agent that hosts router is running
8. Stop l3-agent resource via pacemaker
pcs resource ban p_neutron-l3-agent NODE
9. Wait until router will be rescheduled to another l3-agent
10. Log in to controller with l3 agent hosting router
11. Check instance connectivity via dhcp namespace
ip netns exec qdhcp-bf6d240a-
Expected - instance has connection to outside world
Actual - instance has no connection to outside world
Seems that there are problems with migration of metadata because after router rescheduling qrouter has no interfaces for internal/external networks
[root@node-1 ~]# ip netns exec qrouter-
35: lo: <LOOPBACK,
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
Also there are errors in neutron-
2014-11-12 12:52:15.980 14417 ERROR neutron.
2014-11-12 12:52:15.980 14417 TRACE neutron.
2014-11-12 12:52:15.980 14417 TRACE neutron.
2014-11-12 12:52:15.980 14417 TRACE neutron.
2014-11-12 12:52:15.980 14417 TRACE neutron.
2014-11-12 12:52:15.980 14417 TRACE neutron.
Logs are attached
Changed in fuel: | |
status: | New → Confirmed |
assignee: | Fuel Library Team (fuel-library) → Sergey Vasilenko (xenolog) |
tags: | added: neutron |
Changed in fuel: | |
assignee: | MOS Neutron (mos-neutron) → Andrey Sledzinskiy (asledzinskiy) |
status: | Confirmed → Incomplete |
Changed in fuel: | |
assignee: | MOS Neutron (mos-neutron) → Andrey Sledzinskiy (asledzinskiy) |
tags: | added: customer-found |
tags: | added: release-notes |
I spend a lot of time for reproduce it on ISO #114
It was not simple.
but...
looks like Neutron and OVS together issue:
2014-11-20 16:07:02.307 25086 INFO oslo.messaging. _drivers. impl_rabbit [-] Delaying reconnect for 5.0 seconds ... _drivers. impl_rabbit [-] Connecting to AMQP server on 127.0.0.1:5673 _drivers. impl_rabbit [-] Connected to AMQP server on 127.0.0.1:5673 agent.linux. utils [req-2d96d235- 0261-4f69- a766-6203cfa3f4 95 None] rootwrap. conf', 'ovs-ofctl', 'dump-flows', 'br-int', 'table=23'] openvswitch/ br-int. mgmt: failed to open socket (Connection refused)\n' agent.linux. ovs_lib [req-2d96d235- 0261-4f69- a766-6203cfa3f4 95 None] Unable to execute ['ovs-ofctl', 'dump-flows', 'br-int', 'table rootwrap. conf', 'ovs-ofctl', 'dump-flows', 'br-int', 'table=23'] openvswitch/ br-int. mgmt: failed to open socket (Connection refused)\n' agent.linux. ovs_lib [req-2d96d235- 0261-4f69- a766-6203cfa3f4 95 None] Found not yet ready openvswitch port: [u'qr-b618e937-de', [u 3e:46:22: 9a'], [u'iface-id', u'b618e937- ded8-41d8- 9ddf-f3bd054a29 62'], [u'iface-status', u'active']]], [u'set', []]] plugins. openvswitch. agent.ovs_ neutron_ agent [req-2d96d235- 0261-4f69- a766-6203cfa3f4 95 None] Device b618e937- ded8-41d8- 9ddf-f3bd0 plugins. openvswitch. agent.ovs_ neutron_ agent [req-2d96d235- 0261-4f69- a766-6203cfa3f4 95 None] process_ ancillary_ network_ ports - iter plugins. openvswitch. agent.ovs_ neutron_ agent Traceback (most recent call last): plugins. openvswitch. agent.ovs_ neutron_ agent File "/usr/lib/ python2. 7/dist- packages/ neutron/ plugins/ openvswitch/ agent/ovs_ neutron ancillary_ network_ ports plugins. openvswitch. agent.ovs_ neutron_ agent self.treat_ ancillary_ devices_ added(port_ info['added' ]) plugins. openvswitch. agent.ovs_ neutron_ agent File "/usr/lib/ python2. 7/dist- packages/ neutron/ plugins/ openvswitch/ agent/ovs_ neutron_ agent.py" , line 1142, in treat_ancillary _devices_ added plugins. openvswitch. agent.ovs_ neutron_ agent raise DeviceListRetri evalError( devices= devices, error=e) plugins. openvswitch. agent.ovs_ neutron_ agent DeviceListRetri evalError: Unable to retrieve port details for devices: set([u' 19bc0777- d655-4fa4- b143-82e9d5b58b 4c', u'a4414f4c- 2192-4995- 9fb1-3d5dae2b9b fd']) because of error: Remote error: DBConnecti...
2014-11-20 16:07:07.353 25086 INFO oslo.messaging.
2014-11-20 16:07:08.702 25086 INFO oslo.messaging.
2014-11-20 16:07:53.908 25086 ERROR neutron.
Command: ['sudo', 'neutron-rootwrap', '/etc/neutron/
Exit code: 1
Stdout: ''
Stderr: 'ovs-ofctl: /var/run/
2014-11-20 16:07:53.909 25086 ERROR neutron.
=23']. Exception:
Command: ['sudo', 'neutron-rootwrap', '/etc/neutron/
Exit code: 1
Stdout: ''
Stderr: 'ovs-ofctl: /var/run/
2014-11-20 16:08:06.310 25086 WARNING neutron.
'map', [[u'attached-mac', u'fa:16:
2014-11-20 16:08:13.343 25086 WARNING neutron.
54a2962 not defined on plugin
2014-11-20 16:08:28.084 25086 ERROR neutron.
ation:28738 - failure while retrieving port details from server
2014-11-20 16:08:28.084 25086 TRACE neutron.
2014-11-20 16:08:28.084 25086 TRACE neutron.
_agent.py", line 1254, in process_
2014-11-20 16:08:28.084 25086 TRACE neutron.
2014-11-20 16:08:28.084 25086 TRACE neutron.
2014-11-20 16:08:28.084 25086 TRACE neutron.
2014-11-20 16:08:28.084 25086 TRACE neutron.