machine-types trusty and utopic are not unique (depend on the qemu version)
| Affects | Status | Importance | Assigned to | Milestone |
|---|---|---|---|---|
| Ubuntu Cloud Archive | Fix Released | Undecided | Unassigned | |
| Liberty | Fix Released | Critical | Unassigned | |
| qemu (Ubuntu) | Fix Released | Critical | Unassigned | |
| Xenial | Fix Released | High | Unassigned | |
| Yakkety | Fix Released | High | Unassigned | |
| Zesty | Fix Released | Critical | Unassigned | |
Bug Description
[Impact]
* Guests that were created with the Trusty (or Utopic) machine type do not
have a unique definition: what the type means depends on the qemu
version. Because of that, migrations between the qemu versions of
different Ubuntu releases fail.
* Many migrations work only by luck; one that fails is the Utopic type on
Cloud-Archive Liberty to Xenial.
* The further one migrates such an old guest, the more breakage
accumulates. E.g. a Trusty guest (qemu 2.0) is migrated to Xenial
(which thinks it is a qemu 2.5 type) and from there to Yakkety
(which expects it to be a 2.6 type).
* The fix is minimal and makes the type definitions stable across
releases, as they were intended to be.
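The drift described above can be pictured as each host release resolving the same machine-type name to a different virtual hardware revision. The following Python sketch is only an illustration of that idea: the revisions are the ones named in this report for the Trusty type (2.0 on Trusty, 2.5 on Xenial, 2.6 on Yakkety), and the names `UNFIXED`, `FIXED`, and `migration_path_is_consistent` are hypothetical, not part of qemu or libvirt.

```python
# Illustrative model only, not qemu's actual data structure.
# Keys are host releases; values are the qemu hardware revision that the
# host associates with the Trusty machine type, as stated in this report.
UNFIXED = {"trusty": "2.0", "xenial": "2.5", "yakkety": "2.6"}

# After the fix, every release is meant to agree on the original 2.0 revision.
FIXED = {"trusty": "2.0", "xenial": "2.0", "yakkety": "2.0"}


def migration_path_is_consistent(table, path):
    """Return True if every host on the path agrees on the revision."""
    revisions = {table[host] for host in path}
    return len(revisions) == 1


path = ["trusty", "xenial", "yakkety"]
print(migration_path_is_consistent(UNFIXED, path))  # False
print(migration_path_is_consistent(FIXED, path))    # True
```

The model ignores which individual device properties differ between revisions; it only shows why a type name that means something different on every hop cannot be migrated reliably.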
[Test Case]
* Spawn a guest of type Utopic (the easiest still-supported setup is
Trusty + Cloud Archive Liberty) and then migrate it to Xenial. Without
the patch the migration fails; with the patches it works, as both sides
now agree on what the guest definition means. Please note that both
ends of the migration have to be fixed to get it working.
* Similarly, a guest of type Trusty can, with the fixes applied, be
migrated X->Y->Z and back to X, while without the fix any backward way
(and probably a future forward way) will fail.
* Note: This is complex and there are many potential combinations - the
attached test logs (comment #15) cover various permutations of those.
They have shown that not only the reported Utopic type issue gets
fixed, but several backward migrations as well.
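Before running such a test it helps to confirm which machine type a guest actually uses. The sketch below assumes the guest definition is available as libvirt domain XML (for example from `virsh dumpxml`), where the machine type is the `machine` attribute of the `<os><type>` element. The sample XML is made up for illustration, and `pc-i440fx-trusty` is assumed to follow the same naming pattern as the `pc-i440fx-utopic` type quoted in the original description. The helper names are hypothetical.

```python
import xml.etree.ElementTree as ET

# Machine types this report describes as ambiguous across releases.
# "pc-i440fx-trusty" is an assumption based on the naming pattern.
AFFECTED = {"pc-i440fx-utopic", "pc-i440fx-trusty"}

# Minimal made-up domain XML, reduced to the part that matters here.
SAMPLE_XML = """
<domain type='kvm'>
  <name>example-guest</name>
  <os>
    <type arch='x86_64' machine='pc-i440fx-utopic'>hvm</type>
  </os>
</domain>
"""


def machine_type(domain_xml):
    """Return the machine attribute of <os><type>, or None if absent."""
    root = ET.fromstring(domain_xml)
    type_elem = root.find("./os/type")
    return None if type_elem is None else type_elem.get("machine")


def needs_attention(domain_xml):
    """Return True if the guest uses one of the ambiguous machine types."""
    return machine_type(domain_xml) in AFFECTED


print(machine_type(SAMPLE_XML), needs_attention(SAMPLE_XML))
```

A guest flagged this way is a candidate for the migration test; a guest with a type such as `pc-i440fx-vivid` is reported in the original description as migrating without problems.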
[Regression Potential]
* While it fixes the cases that we know of, and as testing showed also
several cases that we didn't know of before, there are two things we
cannot avoid.
1. People have to restart the source guests so that the new, fixed
definition takes effect.
Also, Trusty guests that were already migrated to a host that has
the error will have to be restarted before they can be migrated
further.
Note: no one has to restart guests on Trusty without Cloud
Archive; there the Trusty type is OK - it is a 2.0 type, which after
the fix Xenial/Yakkety agree on.
2. Restarting the guests after the fix will "downgrade" the virtual
hardware. One can think of the machine types as the HW revision of
the virtual HW. A guest that was created as e.g. Trusty these days
has incorrectly "too new" virtual HW on Xenial. Restarting the
guest will fix that, but as part of that the new attributes that it
incorrectly gained when migrating/moving to the new host will be
taken away (to match the definition the guest had when it was
started).
This is actually a fix, but it might appear as a regression to
somebody who does not know what was going on.
Also, anybody who "wants" the new HW can just upgrade the machine
type to get it, which is actually recommended anyway [1].
[Other Info]
* This is a complex issue, please catch me (cpaelzer) on IRC if you
need/want to go into detail.
For Cloud Archive questions, ask coreycb.
[1]: https:/
--- original description ---
Hi,
I'm currently live-migrating many VMs from an old server to a new one, and some VMs can't be live-migrated.
The source host is trusty with qemu-system-x86 1:2.3+dfsg-
The destination host is xenial with qemu-system-x86 1:2.5+dfsg-
When the issue occurs, the destination host raises an error [1] and stops the migration process.
The only difference I see between VMs where live migration works and those where it doesn't is the machine type.
* migration works when the VM has been created with pc-i440fx-vivid
* migration doesn't work when the VM has been created with pc-i440fx-utopic
[1] the qemu error reported by libvirt on the destination host
2016-11-14 08:25:40.774+0000: starting up libvirt version: 1.3.1, package: 1ubuntu10.5 (Stefan Bader <email address hidden> Thu, 06 Oct 2016 13:07:20 +0200), qemu version: 2.5.0 (Debian 1:2.5+dfsg-
LC_ALL=C PATH=/usr/
2016-11-
2016-11-
2016-11-
Cheers,
no longer affects: | qemu-kvm (Ubuntu) |
no longer affects: | qemu-kvm (Ubuntu Xenial) |
no longer affects: | qemu-kvm (Ubuntu Yakkety) |
no longer affects: | qemu-kvm (Ubuntu Zesty) |
Changed in qemu (Ubuntu Xenial): | |
status: | New → Confirmed |
Changed in qemu (Ubuntu Yakkety): | |
status: | New → Confirmed |
Changed in qemu (Ubuntu Zesty): | |
status: | Confirmed → In Progress |
Changed in qemu (Ubuntu Yakkety): | |
importance: | Undecided → High |
Changed in qemu (Ubuntu Xenial): | |
importance: | Undecided → High |
Changed in cloud-archive: | |
status: | New → Confirmed |
Changed in qemu (Ubuntu Yakkety): | |
status: | Confirmed → In Progress |
Changed in qemu (Ubuntu Xenial): | |
status: | Confirmed → In Progress |
description: | updated |
description: | updated |
Changed in cloud-archive: | |
status: | Confirmed → Fix Committed |
Changed in cloud-archive: | |
status: | Fix Committed → Fix Released |
After more investigation (comparing the machine descriptions between the Ubuntu qemu 2.3 and 2.5), I have seen that in 2.3 the utopic machine is a kvm-2.3 machine, while in 2.5 it is a kvm-2.5 machine when it should be kvm-2.3 too.
I will provide a debdiff once I have validated my change.