OSTF 'RabbitMQ availability' test fails after network outage and recovery (cluster_status hangs on controllers)
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Committed
|
High
|
Alexey Lebedeff | ||
Mitaka |
Fix Released
|
High
|
Dina Belova |
Bug Description
Detailed bug description:
fuel-9.
Steps to reproduce:
1.Create and deploy next cluster - Neutron Vlan, cinder/swift, 3 controller, 2 compute, 1 cinder nodes
2.Run OSTF
3.Verify network
4.Simulate network outage:
For all networks except "admin":
- Locate bridge associated with needed network: "virsh net-dumpxml <network_name>"
- Find all interfaces attached to the bridge using "brctl show <bridge_name>", remember it
- Destroy network using virsh net-destroy
5. Fix network connection after 5minute pause:
For all networks except "admin":
-Restore network using "virsh net-start <network_name>"
Attach all interfaces to bridges according to data step 6 using "brctl addif <bridge> <iface>"
6.Wait until OSTF 'HA' suite passes (FAIL)
Expected results:
All steps OK
Actual result:
Step #6 fails:
Time limit exceeded
root@node-1:~# haproxy-status.sh | grep DOWN
'crm status' hangs with the following output: http://
Step #5 was executed at 2016-05-
See full lrmd and RabbitMQ logs attached in comment #8
tags: | added: area-library |
description: | updated |
no longer affects: | fuel/newton |
tags: | added: on-verification |
Dump is timed out, but I saved full env