Instance reach ERROR status after resetting compute
Affects | Status | Importance | Assigned to | Milestone | ||
---|---|---|---|---|---|---|
Mirantis OpenStack | Status tracked in 10.0.x | |||||
10.0.x |
Fix Committed
|
Medium
|
Georgy Dyuldin | |||
9.x |
Fix Released
|
Medium
|
Georgy Dyuldin |
Bug Description
Detailed bug description:
After hard resetting compute with booted instance it reach ERROR status
Steps to reproduce:
1. Deploy MOS 9.1 with 3 controllers, 2 computes, 1 cinder nodes
2. Create 2 networks, 2 subnets, router between them
3. Create 2 Cirros instances on computes (1 instance on 1 compute), wait until it will be booted
4. Reset computes (with virsh reset)
5. Wait until computes will be booted
6. Wait some time (about 5 minutes)
7. Check that instances in ACTIVE status and operable
Expected results:
All steps pass without errors, both instances in ACTIVE status at step 7
Actual result:
One of instances in ERROR status on step 7
Reproducibility:
About 30% of cases
Description of the environment:
- Operation system: Ubuntu
- Versions of components: MOS 9.1 snapshot #76
- Reference architecture: 3 controllers, 2 computes, 1 cinder nodes
- Network model: VLAN + DVR
Additional information:
Logs: http://
Changed in mos: | |
status: | New → Confirmed |
importance: | Undecided → High |
assignee: | nobody → MOS Nova (mos-nova) |
tags: | added: regression |
Changed in mos: | |
assignee: | Georgy Dyuldin (g-dyuldin) → Roman Podoliaka (rpodolyaka) |
status: | Confirmed → In Progress |
tags: |
added: area-nova removed: area-linux |
Based on the similar bug reports:
https:/ /bugs.launchpad .net/ubuntu/ +source/ libvirt/ +bug/1384532 /bugs.launchpad .net/ubuntu/ +source/ libvirt/ +bug/1513367
https:/
the error can actually be misleading and does not necessarily mean /usr/bin/kvm was not found, but rather one of the files used by qemu on start.
We should give it a try on a live environment and check dmesg / perf_events output.