Comment 0 for bug 1399181

Revision history for this message
Kirill Omelchenko (komelchenko) wrote : HA. Some OSTF tests don't pass after connection loss on primary controller management nic

Scenario:
1. Deploy HA env using 5.1.1 48-RC2 iso:
with nova-flat, ceph for images and volumes. 3x Controllers, 2x computes, 2x ceph-storage
2. Run Network verification tests (passes well)
3. Run OSTF (all pass fine)
4. Simulate connection loss on management interface of the primary controller node
by running next command
# brctl delif <br> <if>
where 'if' is the node interface attached to management network
and 'br' is the bridge this 'if' is attached to.
5. Run OSTF

Result we have several tests failed:
 - Create volume and boot instance from it
 - Check network connectivity from instance without floating IP
 - Check network connectivity from instance via floating IP
 - Launch instance, create snapshot, launch instance from snapshot
HA
 - Mysql node detection failed Please refer to OpenStack logs for more details.
 - Check amount of tables in databases is the same on each node
 - Check RabbitMQ is available
though 'Check galera environment state' passes well.

 - Platform tests also are failed.

And almost every test fails because of a time out or alike.

In fact on Horizon most of the actions can be done successfully, but not always and pretty much always takes quite a lot of time.
e.g. to create an instance, while trying to assign floating ip to an instance (also eventually the ip WAS assigned, but an error message is shown 'HTTP 504' and 'Unable to assign...')

Cluster status:
http://paste.openstack.org/show/144536/