hypervisor stats aggregates resources from deleted and existing services if they share the same hostname
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Invalid
|
Undecided
|
Unassigned | ||
OpenStack Nova Compute Charm |
Invalid
|
Undecided
|
Unassigned | ||
nova (Ubuntu) |
Invalid
|
Medium
|
Unassigned |
Bug Description
In an environment with 592 physical threads (lscpu |grep '^CPU.s' and openstack hypervisor show -f value -c vcpus both show correct counts) I am seeing 712 vcpus. (likely also seeing inflated memory_mb and other stats due to the issue.)
Querying the nova services DB table, I see: http://
It appears that of the 6 machines showing deleted in the services table, only one is showing as disabled.
Digging through the nova/db/
I'm not exactly certain of the timeline of my uninstall and reinstall of the nova-compute units on the 6 x 24vcpu servers happened (see *-ST-{1,2} nova-compute services) that caused this behavior of the services not getting disabled, but nova api for hypervisor stats might be well served to filter out deleted services as well as disabled services, or if a deleted service should never not be disabled, nova service-delete should also set the disabled flag for the service.
These services and compute_nodes do not show up in openstack hypervisor list.
Site is running up-to-date Xenial/Mitaka on openstack-charmers 17.02.
Changed in nova: | |
assignee: | nobody → Edward Hope-Morley (hopem) |
summary: |
- hypervisor stats issue after charm removal if nova-compute service not - disabled first + hypervisor stats aggregates resources from deleted and existing services + if they share the same hostname |
tags: | added: sts |
Changed in nova (Ubuntu): | |
status: | Confirmed → Triaged |
That's odd as most queries do:
model_ query(context, models.Service, read_deleted="no")