Lost network in KVM VM / virtio_net page allocation failure
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Fix Released
|
Medium
|
Andy Whitcroft | ||
Lucid |
Fix Released
|
Medium
|
Stefan Bader | ||
Maverick |
Fix Released
|
Medium
|
Stefan Bader |
Bug Description
SRU Justification:
Impact: Under heavy network I/O load virtio-net driver crashes making VM guest unusable.
Testcase: I left a current Lucid VM running two concurrent "scp -r" of > 200 GB from NFS read-only source to a physical remote host overnight. VM quickly started emitting "page allocation errors" in the system log. Next morning when I checked the VM I could still ping it but could not establish an SSH connection.
Fix: This patch from Bruce Rogers at Novell
* [PATCH] KVM: add schedule check to napi_enable call
- http://
Binary package hint: 2.6.32-21-server
I'm seeing similar bugs in a VM running Lucid as there is reported in these Redhat bug reports:
https:/
https:/
The network connection drops in a KVM VM under high load.
"ifdown eth0 ; ifup eth0 " returns the connection.
This is the dmesg error:
[714069.829649] swapper: page allocation failure. order:0, mode:0x20
[714069.829653] Pid: 0, comm: swapper Not tainted 2.6.32-21-server #32-Ubuntu
[714069.829655] Call Trace:
[714069.829657] <IRQ> [<ffffffff810f9
[714069.829674] [<ffffffff810f9
[714069.829682] [<ffffffff8112c
[714069.829687] [<ffffffff813d6
[714069.829690] [<ffffffff813d7
[714069.829700] [<ffffffff81039
[714069.829708] [<ffffffff81470
[714069.829713] [<ffffffff8106e
[714069.829717] [<ffffffff810c4
[714069.829722] [<ffffffff81014
[714069.829725] [<ffffffff81015
[714069.829727] [<ffffffff8106e
[714069.829733] [<ffffffff8155c
[714069.829736] [<ffffffff81013
[714069.829737] <EOI> [<ffffffff81038
[714069.829746] [<ffffffff8101b
[714069.829753] [<ffffffff81011
[714069.829757] [<ffffffff8153f
[714069.829763] [<ffffffff8187a
[714069.829766] [<ffffffff8187a
[714069.829768] [<ffffffff8187a
[714069.829770] Mem-Info:
[714069.829772] Node 0 DMA per-cpu:
[714069.829775] CPU 0: hi: 0, btch: 1 usd: 0
[714069.829776] Node 0 DMA32 per-cpu:
[714069.829778] CPU 0: hi: 186, btch: 31 usd: 196
[714069.829783] active_anon:109561 inactive_
[714069.829784] active_file:7041 inactive_file:13781 isolated_file:0
[714069.829785] unevictable:0 dirty:8681 writeback:0 unstable:0
[714069.829786] free:1367 slab_reclaimabl
[714069.829787] mapped:2113 shmem:83 pagetables:1148 bounce:0
[714069.829789] Node 0 DMA free:4000kB min:60kB low:72kB high:88kB active_anon:5508kB inactive_
[714069.829799] lowmem_reserve[]: 0 994 994 994
[714069.829802] Node 0 DMA32 free:1468kB min:4000kB low:5000kB high:6000kB active_
[714069.829811] lowmem_reserve[]: 0 0 0 0
[714069.829814] Node 0 DMA: 0*4kB 2*8kB 5*16kB 6*32kB 4*64kB 3*128kB 2*256kB 1*512kB 0*1024kB 1*2048kB 0*4096kB = 4000kB
[714069.829823] Node 0 DMA32: 69*4kB 1*8kB 0*16kB 1*32kB 0*64kB 1*128kB 0*256kB 0*512kB 1*1024kB 0*2048kB 0*4096kB = 1468kB
[714069.829830] 22928 total pagecache pages
[714069.829832] 2031 pages in swap cache
[714069.829834] Swap cache stats: add 74010, delete 71979, find 90506/93667
[714069.829836] Free swap = 804372kB
[714069.829837] Total swap = 897016kB
[714069.840685] 262139 pages RAM
[714069.840689] 6216 pages reserved
[714069.840690] 22948 pages shared
[714069.840691] 234443 pages non-shared
Version details:
linux-image-
2.6.32-21-server #32-Ubuntu SMP Fri Apr 16 09:17:34 UTC 2010 x86_64 GNU/Linux
Ubuntu 2.6.32-21.32-server 2.6.32.11+drm33.2
---
AlsaDevices: Error: command ['ls', '-l', '/dev/snd/'] failed with exit code 2: ls: cannot access /dev/snd/: No such file or directory
AplayDevices: Error: [Errno 2] No such file or directory
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory
CurrentDmesg:
[ 4.972735] JBD: barrier-based sync failed on vda1-8 - disabling barriers
[ 14.180150] eth0: no IPv6 routers present
DistroRelease: Ubuntu 10.04
InstallationMedia: Ubuntu-Server 10.04 "Lucid Lynx" - Beta amd64 (20100406.1)
Lsusb: Bus 001 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
MachineType: Bochs Bochs
Package: linux (not installed)
PciMultimedia:
ProcCmdLine: BOOT_IMAGE=
ProcEnviron:
PATH=(custom, no user)
LANG=fi_FI.UTF-8
SHELL=/bin/bash
ProcVersionSign
Regression: Yes
Reproducible: Yes
Tags: lucid kconfig regression-release needs-upstream-
Uname: Linux 2.6.32-22-server x86_64
UserGroups: adm admin cdrom dialout lpadmin lpadmin plugdev plugdev sambashare scanner
dmi.bios.date: 01/01/2007
dmi.bios.vendor: Bochs
dmi.bios.version: Bochs
dmi.chassis.type: 1
dmi.chassis.vendor: Bochs
dmi.modalias: dmi:bvnBochs:
dmi.product.name: Bochs
dmi.sys.vendor: Bochs
Changed in linux (Ubuntu): | |
status: | Incomplete → Triaged |
importance: | Undecided → Medium |
tags: |
added: cherry-pick kernel-net removed: needs-kernel-logs needs-upstream-testing |
tags: | added: kernel-needs-review |
Changed in linux (Ubuntu Lucid): | |
assignee: | nobody → Stefan Bader (stefan-bader-canonical) |
importance: | Undecided → Medium |
status: | New → In Progress |
Changed in linux (Ubuntu Maverick): | |
assignee: | nobody → Stefan Bader (stefan-bader-canonical) |
status: | New → In Progress |
importance: | Undecided → Medium |
Changed in linux (Ubuntu Lucid): | |
status: | In Progress → Fix Committed |
Changed in linux (Ubuntu Maverick): | |
status: | In Progress → Fix Committed |
tags: | added: verification-needed-lucid verification-needed-maverick |
upstream bug report: http:// bugs.debian. org/cgi- bin/bugreport. cgi?bug= 576838