[hardy] Under long-running load test, KVM guest freeze and host oops
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Invalid
|
High
|
Stefan Bader | ||
Hardy |
Fix Released
|
High
|
Stefan Bader | ||
Intrepid |
Fix Released
|
High
|
Stefan Bader |
Bug Description
SRU justification:
Impact: The function kvm_mmu_
Fix: Patch backported from upstream to add mmu_lock protection around the list walk.
Testcase: Running the validation suite for a longer period of time (24hrs).
When running a specific software validation suite in a KVM guest (both guest and host running hardy) for over 24 hours, the guest will eventually freeze and the host will have the following oops in dmesg:
75243.174934] Unable to handle kernel paging request at 0000000000100100 RIP:
[75243.174947] [<ffffffff882cc
[75243.174992] PGD 75072d067 PUD 76b738067 PMD 0
[75243.174997] Oops: 0000 [2] SMP
[75243.175001] CPU 4
[75243.175003] Modules linked in: tun bridge af_packet kqemu radeon drm rfcomm l2cap bluetooth kvm_intel kvm ppdev cpufreq_ondemand cpufreq_powersave cpufreq
_conservative cpufreq_userspace cpufreq_stats freq_table sbs sbshc container video output dock battery iptable_filter ip_tables x_tables ac parport_pc lp par
port ipv6 joydev serio_raw evdev psmouse pcspkr i2c_piix4 i2c_core button ext3 jbd mbcache sg sr_mod sd_mod cdrom ata_generic pata_acpi usbhid hid qla2xxx pa
ta_serverworks scsi_transport_fc aacraid ehci_hcd libata scsi_tgt ohci_hcd tg3 scsi_mod usbcore thermal processor fan fbcon tileblit font bitblit softcursor
fuse
[75243.175073] Pid: 7220, comm: kvm Tainted: G D 2.6.24-23-generic #1
[75243.175076] RIP: 0010:[<
[75243.175090] RSP: 0018:ffff81074a
[75243.175092] RAX: 0000000000000000 RBX: ffff81072051c000 RCX: 00007fff48fe3ca0
[75243.175094] RDX: 0000000000100100 RSI: 0000000000000005 RDI: ffff81072051eaf0
[75243.175097] RBP: ffff81074a48be88 R08: 0000000000000000 R09: 0000000000100100
[75243.175099] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000000
[75243.175101] R13: ffff81072051c020 R14: 000000004010ae42 R15: 0000000000000000
[75243.175104] FS: 00007f5240fd26e
[75243.175107] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[75243.175109] CR2: 0000000000100100 CR3: 000000076b640000 CR4: 00000000000026e0
[75243.175112] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[75243.175115] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[75243.175118] Process kvm (pid: 7220, threadinfo ffff81074a48a000, task ffff81081c2087f0)
[75243.175119] Stack: ffffffff882c80f2 0000000000000000 ffff8107fa6177c8 000000011c2087f0
[75243.175126] ffff81072051c000 ffff81074a48be88 000000004010ae42 0000000000000008
[75243.175131] ffffffff882c5771 ffff8106eb3c6168 0000000000000000 0000000000000292
[75243.175135] Call Trace:
[75243.175148] [<ffffffff882c8
[75243.175174] [<ffffffff882c5
[75243.175192] [<ffffffff80240
[75243.175224] [<ffffffff802c0
[75243.175235] [<ffffffff802c0
[75243.175254] [<ffffffff802c0
[75243.175274] [<ffffffff8020c
[75243.175307]
[75243.175308]
[75243.175309] Code: 49 8b 11 49 39 f9 0f 18 0a 75 b9 f3 c3 66 66 66 66 66 2e 0f
[75243.175322] RIP [<ffffffff882cc
[75243.175334] RSP <ffff81074a48be20>
[75243.175336] CR2: 0000000000100100
[75243.175343] ---[ end trace 01e4e553c58023ce ]---
The guest freeze and host oops above is reproducible, with sensibly the same trace in dmesg. Attached two different dmesg output from two different run of the load test on the same machine.
Changed in linux: | |
assignee: | nobody → stefan-bader-canonical |
importance: | Undecided → High |
status: | New → In Progress |
description: | updated |
Commited to Hardy as http:// kernel. ubuntu. com/git? p=ubuntu/ ubuntu- hardy.git; a=commitdiff; h=4caefc5a810ac 9653222bdfe2e4b 807505f4ea32