nova-compute state not updated
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Expired
|
Undecided
|
Unassigned | ||
oslo.messaging |
Incomplete
|
Undecided
|
Unassigned |
Bug Description
I'm running 2014.2.1 on CentOS7. 1 controller and 5 compute nodes are deployed using packstack.
Whenever I reboot the controller node, some of nova-compute services report state=XXX even after 60 minutes after reboot completed and controller node is up and running again:
[root@juno1 ~(keystone_admin)]# nova-manage service list
Binary Host Zone Status State Updated_At
nova-consoleauth juno1 internal enabled :-) 2014-12-19 13:17:48
nova-scheduler juno1 internal enabled :-) 2014-12-19 13:17:47
nova-conductor juno1 internal enabled :-) 2014-12-19 13:17:47
nova-cert juno1 internal enabled :-) 2014-12-19 13:17:48
nova-compute juno4 nova enabled XXX 2014-12-19 12:26:56
nova-compute juno5 nova enabled :-) 2014-12-19 13:17:47
nova-compute juno6 nova enabled :-) 2014-12-19 13:17:46
nova-compute juno3 nova enabled :-) 2014-12-19 13:17:46
nova-compute juno2 nova enabled XXX 2014-12-19 12:21:52
Here is the chunk of nova-compute log from juno4:
2014-12-19 15:46:02.082 5193 INFO oslo.messaging.
2014-12-19 15:46:02.083 5193 ERROR oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.083 5193 TRACE oslo.messaging.
2014-12-19 15:46:02.084 5193 INFO oslo.messaging.
2014-12-19 15:46:03.084 5193 INFO oslo.messaging.
2014-12-19 15:46:03.096 5193 INFO oslo.messaging.
2014-12-19 15:46:03.105 5193 ERROR oslo.messaging.
2014-12-19 15:46:03.105 5193 ERROR oslo.messaging.
2014-12-19 15:46:04.106 5193 INFO oslo.messaging.
2014-12-19 15:46:04.106 5193 INFO oslo.messaging.
2014-12-19 15:46:05.106 5193 INFO oslo.messaging.
2014-12-19 15:46:05.116 5193 INFO oslo.messaging.
2014-12-19 15:46:05.157 5193 INFO oslo.messaging.
2014-12-19 15:46:05.159 5193 INFO oslo.messaging.
2014-12-19 15:46:33.229 5193 AUDIT nova.compute.
2014-12-19 15:46:33.781 5193 ERROR oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.781 5193 TRACE oslo.messaging.
2014-12-19 15:46:33.782 5193 INFO oslo.messaging.
2014-12-19 15:46:34.782 5193 INFO oslo.messaging.
2014-12-19 15:46:34.796 5193 INFO oslo.messaging.
I see, that nova-compute lost AMQ connection with controller node because of reboot, but then connection was established again after reboot completed.
I also can manage running instances on that compute node and see correct instance states.
The problem can be solved if I do: systemctl restart openstack-
It sounds like there may be an issue in oslo.messaging with reconnecting