qcow2 image corruption on non-extent filesystems (ext3)
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
linux (Ubuntu) | Fix Released | High | Chris J Arges |
linux (Ubuntu) Trusty | Fix Released | High | Chris J Arges |
linux (Ubuntu) Vivid | Fix Released | High | Unassigned |
linux-lts-utopic (Ubuntu) | Invalid | Undecided | Unassigned |
linux-lts-utopic (Ubuntu) Trusty | Fix Released | High | Unassigned |
Bug Description
[Impact]
Users of non-extent ext4 filesystems (ext4 ^extents, or ext3 w/ CONFIG_
[Test Case]
1) Set up an ext4 ^extents, or ext3, filesystem with CONFIG_
2) Create and install a VM using a qcow2 image, storing the image file on that filesystem
3) Snapshot the image with qemu-img
4) Boot the image and perform some disk operations (fio, etc.)
5) Shut down the VM and delete the snapshot
6) Repeat steps 3-5 until the VM no longer boots due to image corruption; this generally takes a few iterations, depending on the disk operations.
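The loop in steps 3-5 can be sketched as a shell script. The image path, snapshot name, and iteration count are illustrative, and the guest-boot step is left as a comment since it depends on the local libvirt setup; the script skips cleanly when qemu-img or the image is unavailable.

```shell
# Sketch of the reproduction loop; IMG, the snapshot name, and MAX_ITER
# are illustrative placeholders.
IMG=${IMG:-./test.qcow2}
MAX_ITER=20
STATUS=skipped
if command -v qemu-img >/dev/null 2>&1 && [ -f "$IMG" ]; then
    i=1
    while [ "$i" -le "$MAX_ITER" ]; do
        qemu-img snapshot -c pristine "$IMG"   # step 3: create internal snapshot
        # step 4: boot the guest here and generate disk I/O (fio, dist-upgrade, ...)
        qemu-img snapshot -d pristine "$IMG"   # step 5: delete the snapshot
        if ! qemu-img check "$IMG" >/dev/null 2>&1; then
            echo "corruption detected after $i iterations"
            break
        fi
        i=$((i + 1))
    done
    STATUS=done
fi
```

Running `qemu-img check` after each delete catches the corruption without having to wait for a failed boot.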
[Fix]
commit 6f30b7e37a8239f
This has been discussed upstream here:
http://
A temporary fix is to disable punch_hole for non-extent filesystems. This is how the plain ext3 module handles it, and it is up to userspace to handle the failure. With this patch applied I was able to run the test case for 600 iterations over 3 days, whereas most failures normally occur within the first 2-20 iterations.
diff --git a/fs/ext4/inode.c b/fs/ext4/inode.c
index 5653fa4..e14cdfe 100644
--- a/fs/ext4/inode.c
+++ b/fs/ext4/inode.c
@@ -3367,6 +3367,10 @@ int ext4_punch_hole(struct inode *inode, loff_t offset, loff_t length)
 	unsigned int credits;
 	int ret = 0;
 
+	/* EXTENTS required */
+	if (!(ext4_test_inode_flag(inode, EXT4_INODE_EXTENTS)))
+		return -EOPNOTSUPP;
+
 	if (!S_ISREG(inode->i_mode))
 		return -EOPNOTSUPP;
--
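With the patch applied, hole punching on a non-extent file fails with EOPNOTSUPP and callers must cope with that. A quick way to probe which result a given filesystem gives is the fallocate(1) utility from util-linux; the scratch file below is just an example.

```shell
# Probe whether the filesystem holding $TMPDIR supports punching holes.
f=$(mktemp)
dd if=/dev/zero of="$f" bs=4096 count=4 2>/dev/null
if fallocate --punch-hole --keep-size --offset 0 --length 4096 "$f" 2>/dev/null; then
    RESULT=punched       # extent-based ext4, xfs, ...
else
    RESULT=unsupported   # e.g. EOPNOTSUPP, as the patched ext4_punch_hole returns
fi
rm -f "$f"
echo "$RESULT"
```

A well-behaved application (as qemu does for qcow2) falls back to writing zeroes when the punch fails, rather than assuming the blocks were discarded.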
The security team uses a tool (http://
qemu-kvm 2.0~git-
$ cat /proc/version_
Ubuntu 3.13.0-
$ qemu-img info ./forhallyn-
image: ./forhallyn-
file format: qcow2
virtual size: 8.0G (8589934592 bytes)
disk size: 4.0G
cluster_size: 65536
Format specific information:
compat: 0.10
Steps to reproduce:
1. create a virtual machine. For a simplified reproducer, I used virt-manager with:
OS type: Linux
Version: Ubuntu 14.04
Memory: 768
CPUs: 1
Select managed or existing (Browse, new volume)
Create a new storage volume:
qcow2
Max capacity: 8192
Allocation: 0
Advanced:
NAT
kvm
x86_64
firmware: default
2. install a VM. I used trusty-
3. Backup the image file somewhere since steps 1 and 2 take a while :)
4. Execute the following commands which are based on what our uvt tool does:
$ virsh snapshot-create-as forhallyn-
$ virsh snapshot-current --name forhallyn-
pristine
$ virsh start forhallyn-
$ virsh snapshot-list forhallyn-
in guest:
sudo apt-get update
sudo apt-get dist-upgrade
780 upgraded...
shutdown -h now
$ virsh snapshot-delete forhallyn-
$ virsh snapshot-create-as forhallyn-
$ virsh start forhallyn-
The idea behind the above is to create a new VM with a pristine snapshot that we could revert later if we wanted. Instead, we boot the VM, run apt-get dist-upgrade, cleanly shutdown and then remove the old 'pristine' snapshot and create a new 'pristine' snapshot. The intention is to update the VM and the pristine snapshot so that when we boot the next time, we boot from the updated VM and can revert back to the updated VM.
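The rotation described above can be sketched end-to-end as follows. The domain and snapshot names are illustrative, and the block is a dry run when virsh or the domain is unavailable; the guest-side update and clean shutdown are left as a comment.

```shell
# Sketch of the uvt-style snapshot rotation; DOM is an illustrative name.
DOM=${DOM:-forhallyn-trusty}
ROTATED=no
if command -v virsh >/dev/null 2>&1 && virsh dominfo "$DOM" >/dev/null 2>&1; then
    virsh start "$DOM"                        # boot from the current image
    # ... update the guest (apt-get dist-upgrade) and shut it down cleanly ...
    virsh snapshot-delete "$DOM" pristine     # drop the stale snapshot
    virsh snapshot-create-as "$DOM" pristine  # re-snapshot the updated image
    ROTATED=yes
fi
echo "rotated: $ROTATED"
```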
After running 'virsh start' after doing snapshot-
This does not seem to be related to the machine type used: pc-i440fx-1.5, pc-i440fx-1.7 and pc-i440fx-2.0 all fail with qemu 2.0; pc-i440fx-1.5 and pc-i440fx-1.7 fail with qemu 1.7; and pc-i440fx-1.5 works fine with qemu 1.5.
The only workaround I know of is to downgrade qemu to 1.5.0+dfsg-
summary: |
- qcow2 image corruption in trusty (qemu 1.7)
+ qcow2 image corruption in trusty (qemu 1.7 and 2.0 candidate)
Changed in qemu (Ubuntu): | |
importance: | Undecided → High |
Changed in qemu (Ubuntu): | |
assignee: | nobody → Serge Hallyn (serge-hallyn) |
tags: | added: qcow2 |
Changed in qemu (Ubuntu): | |
assignee: | Serge Hallyn (serge-hallyn) → Chris J Arges (arges) |
Changed in qemu (Ubuntu): | |
status: | Confirmed → In Progress |
summary: |
- qcow2 image corruption in trusty (qemu 1.7 and 2.0 candidate)
+ qcow2 image corruption on non-extent filesystems (ext3)
no longer affects: | qemu |
Changed in linux (Ubuntu): | |
assignee: | nobody → Chris J Arges (arges) |
importance: | Undecided → High |
status: | New → In Progress |
Changed in qemu (Ubuntu): | |
status: | In Progress → Invalid |
assignee: | Chris J Arges (arges) → nobody |
importance: | High → Undecided |
description: | updated |
description: | updated |
description: | updated |
description: | updated |
description: | updated |
Changed in linux (Ubuntu): | |
status: | Fix Released → Confirmed |
Changed in linux (Ubuntu): | |
status: | Confirmed → Fix Committed |
no longer affects: | qemu (Ubuntu) |
no longer affects: | qemu (Ubuntu Trusty) |
no longer affects: | qemu (Ubuntu Vivid) |
Changed in linux-lts-utopic (Ubuntu): | |
status: | New → Invalid |
Changed in linux (Ubuntu Trusty): | |
assignee: | nobody → Chris J Arges (arges) |
Changed in linux (Ubuntu Vivid): | |
assignee: | nobody → Chris J Arges (arges) |
Changed in linux-lts-utopic (Ubuntu Trusty): | |
assignee: | nobody → Chris J Arges (arges) |
Changed in linux (Ubuntu Trusty): | |
importance: | Undecided → High |
Changed in linux (Ubuntu Vivid): | |
importance: | Undecided → High |
Changed in linux-lts-utopic (Ubuntu Trusty): | |
importance: | Undecided → High |
Changed in linux (Ubuntu Trusty): | |
status: | New → In Progress |
Changed in linux (Ubuntu Vivid): | |
status: | New → In Progress |
Changed in linux-lts-utopic (Ubuntu Trusty): | |
status: | New → In Progress |
Changed in linux (Ubuntu Vivid): | |
assignee: | Chris J Arges (arges) → nobody |
Changed in linux-lts-utopic (Ubuntu Trusty): | |
assignee: | Chris J Arges (arges) → nobody |
Changed in linux (Ubuntu Trusty): | |
status: | In Progress → Fix Committed |
tags: |
added: verification-failed removed: verification-needed-trusty |
tags: |
added: verification-failed-trusty removed: verification-failed |
I have not yet been able to reproduce this. I'm considering adding an upstart job to your image that updates and shuts down, so I can test this in a loop.
Do you know whether (a) the --children option to snapshot-delete or (b) reusing the same name for the new snapshot as the one you just deleted is crucial to reproducing this?