[system-tests]Fix fuelweb_tests for RabbitMQ HA full cluster restart
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
High
|
Dennis Dmitriev | ||
5.1.x |
Won't Fix
|
High
|
Fuel QA Team | ||
6.0.x |
Invalid
|
Undecided
|
Fuel QA Team | ||
6.1.x |
Fix Released
|
High
|
Dennis Dmitriev |
Bug Description
{
"build_id": "2014-10-
"ostf_sha": "64cb59c681658a
"build_number": "32",
"auth_
"api": "1.0",
"nailgun_sha": "b9792cb5bbecdd
"production": "docker",
"fuelmain_sha": "7bac3edb976044
"astute_sha": "f5fbd89d1e0e1f
"feature_
"mirantis"
],
"release": "5.1.1",
"release_
],
}
}
},
"fuellib_sha": "4f8414a08316a0
}
Steps:
1. Create next cluster - CentOS, HA, Flat Nova-network, Ceph for volumes and images, 3 controller+ceph, 2 compute+ceph, 1 ceph node
2. Deploy cluster
3. Destroy 1 compute+ceph, 1 ceph node
4. Restart 3 controllers
5. Check cinder services
Expected - cinder services are up during 5 minutes
Actual - 5 minutes isn't enough to cinder services up. After revert snapshot services were up after 5-10 minutes
ERROR: Deploy ceph with in HA mode
-------
Traceback (most recent call last):
File "/usr/lib/
compatabili
File "/usr/lib/
func()
File "/usr/lib/
func(
File "/home/
return func(*args, **kwagrs)
File "/home/
self.
File "/home/
result = func(*args, **kwargs)
File "/home/
raise TimeoutError(
TimeoutError: Cinder services not ready.
Logs are attached
summary: |
- Cinder services are down after cold restart all controllers + RabbitMQ is started for a very long time in HA |
Changed in fuel: | |
assignee: | Fuel QA Team (fuel-qa) → Fuel Library Team (fuel-library) |
Changed in fuel: | |
status: | Incomplete → In Progress |
assignee: | Fuel Library Team (fuel-library) → Dennis Dmitriev (ddmitriev) |
summary: |
- RabbitMQ is started for a very long time in HA + Fix fuelweb_tests for RabbitMQ HA full cluster restart |
no longer affects: | fuel/6.0.x |
summary: |
- Fix fuelweb_tests for RabbitMQ HA full cluster restart + [system-tests]Fix fuelweb_tests for RabbitMQ HA full cluster restart |
tags: | added: non-release |
Reproduced on CI test: http:// jenkins- product. srt.mirantis. net:8080/ view/5. 1_swarm/ job/5.1_ fuelmain. system_ test.centos. thread_ 3/29/console
In fact this issue is connected with rabbitmq long starting:
=============== node-1. test.domain. local/cinder- volume. log ======= ======= ======= == 22T18:19: 02.878091+ 01:00 err: 2014-10-22 17:19:02.850 3403 ERROR oslo.messaging. _drivers. impl_rabbit [req-e2d7f6ca- 5b6a-4838- a4c7-43370fa25b de - -- - -] AMQP server on 127.0.0.1:5673 is unreachable: [Errno 111] ECONNREFUSED. Trying again in 30 seconds. ======= ======= ======= ======= ===
2014-10-
=======
Unfortunately, there is no pacemaker logs in the diagnostic snapshot so it is hard to investigate what happened.