nova-compute unexpected input/output errors on starting instances (NFS + image-cache)
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Fix Released
|
Medium
|
Matt Riedemann | ||
Mitaka |
Won't Fix
|
Undecided
|
Joris S'heeren | ||
Newton |
Fix Committed
|
Low
|
Lee Yarwood |
Bug Description
Our setup consists of multiple controllers and multiple hypervisors. Our shared storage for the instances is on a nfs 4.1 export. Using Ubuntu 16.04 LTS and Openstack Mitaka
When we launch an instance, nova updates the mtime for the _base image to let the image cache manager know the image is actively used. I think this was added here: https:/
Because of this, in our setup, we are seeing unexpected input/output errors:
Stderr: u"/bin/touch: setting times of '/var/lib/
A full trace can be found here: http://
This error particularly shows itself when launching multiple instances at once.
Also, because of this error, the instances are rescheduled. The assigned neutron ports, however, are not deleted. This results in multiple ip's assigned to the instances, with only one of them UP. This also results in attached floating ip's not working ..
This is similar to https:/
Some more info on our environment:
-------
Using libvirt + kvm, neutron with openvswitch L3 HA
# dpkg -l | grep nova
ii nova-common 2:13.0.0-0ubuntu2 all OpenStack Compute - common files
ii nova-compute 2:13.0.0-0ubuntu2 all OpenStack Compute - compute node base
ii nova-compute-kvm 2:13.0.0-0ubuntu2 all OpenStack Compute - compute node (KVM)
ii nova-compute-
ii python-nova 2:13.0.0-0ubuntu2 all OpenStack Compute Python libraries
ii python-novaclient 2:3.3.1-2 all client library for OpenStack Compute API - Python 2.7
# dpkg -l |grep libvirt
ii libvirt-bin 1.3.1-1ubuntu10.1 amd64 programs for the libvirt library
ii libvirt0:amd64 1.3.1-1ubuntu10.1 amd64 library for interfacing with different virtualization systems
ii nova-compute-
ii python-libvirt 1.3.1-1ubuntu1 amd64 libvirt Python bindings
description: | updated |
tags: | added: compute image-cache nfs |
Changed in nova: | |
status: | New → Confirmed |
summary: |
- nova-compute unexpected input/output errors on starting instances + nova-compute unexpected input/output errors on starting instances (NFS + + image-cache) |
Is there any more useful information in dmesg or syslog when this fails? Is this 100% fail or intermittent, i.e. a timing issue?
Does the image cache base directory exist?
/var/lib/ nova/instances/ _base/
And can nova write to it?