Heat operation failed after controller failover
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Fuel for OpenStack | Fix Released | High | Bogdan Dobrelya |
6.1.x | In Progress | High | Timur Nurlygayanov |
7.0.x | Fix Released | High | Bogdan Dobrelya |
Bug Description
Deleting a stack failed with the following error:
http://
haproxy status for heat (a dash marks a field that is empty for that row type):
# pxname             svname   stot ereq econ eresp status chkfail chkdown downtime iid
heat-api             FRONTEND 0    0    -    -     OPEN   -       -       -        16
heat-api             node-76  0    -    0    0     UP     0       0       0        16
heat-api             node-77  0    -    0    0     UP     0       0       0        16
heat-api             node-79  0    -    0    0     UP     0       0       0        16
heat-api             BACKEND  0    -    0    0     UP     -       0       0        16
heat-api-cfn         FRONTEND 0    0    -    -     OPEN   -       -       -        17
heat-api-cfn         node-76  0    -    0    0     UP     0       0       0        17
heat-api-cfn         node-77  0    -    0    0     UP     0       0       0        17
heat-api-cfn         node-79  0    -    0    0     UP     0       0       0        17
heat-api-cfn         BACKEND  0    -    0    0     UP     -       0       0        17
heat-api-cloudwatch  FRONTEND 0    0    -    -     OPEN   -       -       -        18
heat-api-cloudwatch  node-76  0    -    0    0     UP     0       0       0        18
heat-api-cloudwatch  node-77  0    -    0    0     UP     0       0       0        18
heat-api-cloudwatch  node-79  0    -    0    0     UP     0       0       0        18
heat-api-cloudwatch  BACKEND  0    -    0    0     UP     -       0       0        18
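A listing like the one above can be checked mechanically. The following is a minimal sketch, assuming the statistics are available as HAProxy CSV output (the format produced by the `show stat` command on the stats socket). The function names and the `SAMPLE` text are illustrative only: the sample is a shortened stand-in with the same columns as the report, not the actual output from the affected environment.

```python
import csv
import io


def parse_haproxy_stats(csv_text):
    """Parse HAProxy CSV statistics into a list of dicts keyed by column name."""
    lines = [ln for ln in csv_text.splitlines() if ln.strip()]
    # The header line starts with "# "; strip it so DictReader sees plain names.
    lines[0] = lines[0].lstrip("# ")
    return list(csv.DictReader(io.StringIO("\n".join(lines))))


def not_up(rows, proxy_prefix="heat-api"):
    """Return (pxname, svname, status) for entries that are neither UP nor OPEN."""
    return [
        (r["pxname"], r["svname"], r["status"])
        for r in rows
        if r["pxname"].startswith(proxy_prefix)
        and r["status"] not in ("UP", "OPEN")
    ]


# Illustrative sample only (hypothetical, shaped like the listing in this report).
SAMPLE = """\
# pxname,svname,stot,ereq,econ,eresp,status,chkfail,chkdown,downtime,iid
heat-api,FRONTEND,0,0,,,OPEN,,,,16
heat-api,node-76,0,,0,0,UP,0,0,0,16
heat-api,BACKEND,0,,0,0,UP,,0,0,16
"""

rows = parse_haproxy_stats(SAMPLE)
print(not_up(rows))  # prints [] because every entry is UP or OPEN
```

An empty result matches what the report shows: HAProxy considers every Heat backend healthy, so the health checks alone do not explain the 504 responses.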
crm status
http://
Steps To Reproduce:
OS: CentOS
HA with Neutron GRE:
1 controller + 2 controllers with mongo
1 mongo
1 cinder
2 computes
Ceilometer enabled
1. Deploy the cluster and wait until it is ready.
2. Navigate to the Fuel health tab and run all OSTF tests: the HA, smoke, sanity, and platform tests pass (the configuration tests may fail if you do not change the default SSH credentials on the master node and the user credentials for the OpenStack cluster).
3. Shut down the primary controller.
4. Wait 5 minutes, then run the OSTF HA suite (it passes; if not, wait a little longer and run it again).
5. Run the smoke and sanity OSTF tests: they pass.
6. Run the platform tests. Actual result: all Heat tests (update/create/delete stack) fail with a 504 error.
7. Turn the controller back on and repeat steps 4-6: the result is the same, all Heat tests fail.
SSH to each controller and try to delete the stack: it fails with the error listed above.
{"build_id": "2015-06-
Second env:
OS: Ubuntu
3 controllers with ceph + 1 compute + 2 mongo + 2 ceph
with nova Flat
1. Deploy the cluster.
2. When the cluster is ready, run the OSTF HA, smoke, sanity, and platform tests: the tests pass.
3. Shut down a non-primary controller.
4. Wait about 5-7 minutes and run the OSTF HA tests: passed.
5. Run the sanity and smoke tests: passed.
6. Run the platform tests: the Heat tests fail with a 504 error.
7. SSH to the online controllers and try to create and delete a stack: it fails with a 504 error.
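Because the failure is intermittent, it can help to repeat a request several times and count the 504 responses. The sketch below uses only the Python standard library. The function names, the 70-second timeout, and the number of attempts are assumptions for illustration; the endpoint URL is left as a placeholder. Note also that a plain GET may not exercise the same code path as a stack create or delete, so this is a rough probe rather than a replacement for the OSTF platform tests.

```python
import urllib.error
import urllib.request


def http_status(url, timeout=70):
    """Return the HTTP status code for a GET to url, including error statuses."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # urlopen raises for 4xx/5xx; the status code is kept on the exception.
        return err.code


def count_gateway_timeouts(fetch, url, attempts=5):
    """Call fetch(url) several times; return (number of 504s, all status codes)."""
    codes = [fetch(url) for _ in range(attempts)]
    return codes.count(504), codes
```

A possible invocation is `count_gateway_timeouts(http_status, heat_url)`, where `heat_url` is the Heat API address of the environment under test. Passing the fetch function as a parameter keeps the counting logic testable without network access.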
Info: The snapshot will be added later because it is too big and uploads to Google Drive too slowly.
Info: The issue does not reproduce every time; so far it has reproduced in 2 out of 5 attempts.
description: | updated |
description: | updated |
Changed in fuel: | |
assignee: | MOS Heat (mos-heat) → Sergey Kraynev (skraynev) |
Changed in fuel: | |
status: | Incomplete → Confirmed |
Changed in fuel: | |
assignee: | Sergey Kraynev (skraynev) → Timur Nurlygayanov (tnurlygayanov) |
importance: | High → Critical |
importance: | Critical → High |
Changed in fuel: | |
status: | Confirmed → In Progress |
tags: | added: release-notes |
tags: | added: done |
Changed in fuel: | |
status: | In Progress → Fix Committed |
tags: | added: on-verification |
tags: | added: on-verification |
tags: | added: non-release |
https://drive.google.com/a/mirantis.com/file/d/0B_tSitrwrgvoYjh1ek95S08zTHc/view?usp=sharing