nova-compute stops processing compute.$HOSTNAME occasionally on libvirt
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Invalid
|
Medium
|
Unassigned | ||
nova (Ubuntu) |
Confirmed
|
Medium
|
Unassigned |
Bug Description
root@novamanag
compute.nodexyzzy 12
Occasionally on canonistack, we find that a compute node simply stops processing its rabbit queues. A check of the logs will show no nova-compute.log activity for hours, but a restart of nova-compute will cause it to check all the instances and then process all the requests in rabbit (usually lots of duplicates from frustrated users trying to re-send delete requests and get their quota back for another deployment).
In fact, while I was typing this (having restarted nova-compute on nodexyzzy before starting), I re-ran the above command to find it now silent.
This happens often enough (once every couple of days at least) but we're not sure of how to debug this. Is there any information we can get you about a nova-compute process that is in this unhappy state?
For the record, here is the last entry in the example node's nova-compute.log when I bounced things around 09:00Z:
2012-04-19 06:35:35 DEBUG nova.virt.
Changed in nova (Ubuntu): | |
importance: | Undecided → Medium |
summary: |
- nova-compute stops processing compute.$HOSTNAME occasionally + nova-compute stops processing compute.$HOSTNAME occasionally on libvirt |
Changed in nova: | |
assignee: | nobody → Michael H Wilson (geekinutah) |
Changed in nova: | |
assignee: | Michael H Wilson (geekinutah) → wangpan (hzwangpan) |
status: | Confirmed → In Progress |
Changed in nova: | |
assignee: | wangpan (hzwangpan) → nobody |
Changed in nova: | |
assignee: | nobody → wangpan (hzwangpan) |
status: | Confirmed → In Progress |
Changed in nova: | |
assignee: | wangpan (hzwangpan) → nobody |
status: | In Progress → Confirmed |
Changed in nova: | |
status: | Incomplete → Invalid |
The symptoms are similar to what we experienced in LP#903212, however I can confirm that libvirtd seems to be responding correctly in Precise.
Is there further information that we can provide?
$ dpkg-query --show nova-* e4~20120210. 12574-0ubuntu1 hypervisor
nova-api 2012.1~
nova-common 2012.1-0ubuntu2
nova-compute 2012.1-0ubuntu2
nova-compute-
nova-compute-kvm 2012.1-0ubuntu2
$ cat /etc/lsb-release RELEASE= 12.04 CODENAME= precise DESCRIPTION= "Ubuntu precise (development branch)"
DISTRIB_ID=Ubuntu
DISTRIB_
DISTRIB_
DISTRIB_