[Potential Regression] cpuhotplug related tests triggers kernel bug (arch/x86/xen/spinlock.c:62) and kernel panic on AWS cloud c3.xlarge
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
ubuntu-kernel-tests |
New
|
Undecided
|
Unassigned | ||
linux-aws (Ubuntu) |
Invalid
|
Undecided
|
Unassigned | ||
Bionic |
Confirmed
|
Undecided
|
Unassigned | ||
Focal |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
Issue found with 5.4.0-1107.
cpu-hotplug related tests will crash the instance, they are:
* cpuset_hotplug in ubuntu_
* cpuhotplug:
* cpu-hotplug:
Take cpuset_hotplug in ubuntu_
There is no output from the test itself (looks like it has crashed):
START ubuntu_
Persistent state client.
Persistent state client.
Waiting for pid 925631 for 4500 seconds
System python is too old, crash handling disabled
(nothing after this point)
But from the console log you will see a kernel BUG and kernel panic:
[ 3451.829941] kernel BUG at /build/
[ 3451.833383] invalid opcode: 0000 [#1] SMP PTI
[ 3451.835146] CPU: 1 PID: 14 Comm: cpuhp/1 Tainted: G C 5.4.0-1107-aws #115~18.04.1-Ubuntu
[ 3451.838679] Hardware name: Xen HVM domU, BIOS 4.11.amazon 08/24/2006
[ 3451.840965] RIP: 0010:dummy_
[ 3451.842675] Code: 8b 75 e4 74 d6 44 89 e7 e8 39 89 61 00 eb d6 44 89 e7 e8 af ab 61 00 eb cc 0f 1f 00 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 <0f> 0b 66 2e 0f 1f 84 00 00 00 00 00 80 3d 69 d0 9f 01 00 75 02 f3
[ 3451.849042] RSP: 0000:ffffb54b00
[ 3451.851021] RAX: ffffffff92c2e3d0 RBX: 000000000000003b RCX: 0000000000000000
[ 3451.853509] RDX: 0000000000400e00 RSI: 0000000000000000 RDI: 000000000000003b
[ 3451.855996] RBP: ffffb54b0000ee38 R08: ffff8a9de6c01240 R09: ffff8a9de6c01440
[ 3451.858435] R10: 0000000000000000 R11: ffffffff94664da8 R12: 0000000000000000
[ 3451.860896] R13: 0000000000000000 R14: 0000000000000000 R15: ffff8a9de6583200
[ 3451.863313] FS: 000000000000000
[ 3451.899246] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3451.901338] CR2: 0000000000000000 CR3: 000000002040a001 CR4: 00000000001606e0
[ 3451.903757] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 3451.906184] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 3451.908623] Call Trace:
[ 3451.909869] <IRQ>
[ 3451.911014] __handle_
[ 3451.912818] handle_
[ 3451.914578] handle_
[ 3451.916198] generic_
[ 3451.917834] handle_
[ 3451.919493] evtchn_
[ 3451.921298] __xen_evtchn_
[ 3451.923046] xen_evtchn_
[ 3451.924742] xen_hvm_
[ 3451.926484] </IRQ>
[ 3451.927632] RIP: 0010:_raw_
[ 3451.929674] Code: e8 a0 3d 64 ff 4c 29 e0 4c 39 f0 76 cf 80 0b 08 eb 8a 90 90 90 0f 1f 44 00 00 55 48 89 e5 e8 d6 ad 66 ff 66 90 48 89 f7 57 9d <0f> 1f 44 00 00 5d c3 0f 1f 40 00 0f 1f 44 00 00 55 48 89 e5 c6 07
[ 3451.935996] RSP: 0000:ffffb54b00
[ 3451.939023] RAX: 0000000000000001 RBX: ffff8a9de6583200 RCX: 000000000002cc00
[ 3451.941475] RDX: 0000000000000001 RSI: 0000000000000246 RDI: 0000000000000246
[ 3451.943948] RBP: ffffb54b000fbcf8 R08: ffff8a9de6c01240 R09: ffff8a9de6c01440
[ 3451.946382] R10: 0000000000000000 R11: 0000000000000246 R12: 000000000000003b
[ 3451.948849] R13: 0000000000000000 R14: ffff8a9d8e75c600 R15: ffff8a9d8e75c6a4
[ 3451.951297] __setup_
[ 3451.952850] ? kmem_cache_
[ 3451.954661] request_
[ 3451.956376] bind_ipi_
[ 3451.958113] ? xen_qlock_
[ 3451.959723] ? snr_uncore_
[ 3451.961445] xen_init_
[ 3451.963057] ? snr_uncore_
[ 3451.964810] xen_cpu_
[ 3451.966415] cpuhp_invoke_
[ 3451.968144] cpuhp_thread_
[ 3451.969760] smpboot_
[ 3451.971400] kthread+0x121/0x140
[ 3451.972855] ? sort_range+
[ 3451.974378] ? kthread_
[ 3451.975929] ret_from_
[ 3451.977454] Modules linked in: exfat(C) ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs nfsd auth_rpcgss nfs_acl lockd grace sunrpc nls_iso8859_1 binfmt_misc serio_raw sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_
[ 3451.992926] ---[ end trace 4433bc23c8979a4c ]---
[ 3451.994720] RIP: 0010:dummy_
[ 3451.996427] Code: 8b 75 e4 74 d6 44 89 e7 e8 39 89 61 00 eb d6 44 89 e7 e8 af ab 61 00 eb cc 0f 1f 00 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 <0f> 0b 66 2e 0f 1f 84 00 00 00 00 00 80 3d 69 d0 9f 01 00 75 02 f3
[ 3452.002753] RSP: 0000:ffffb54b00
[ 3452.004708] RAX: ffffffff92c2e3d0 RBX: 000000000000003b RCX: 0000000000000000
[ 3452.007130] RDX: 0000000000400e00 RSI: 0000000000000000 RDI: 000000000000003b
[ 3452.009569] RBP: ffffb54b0000ee38 R08: ffff8a9de6c01240 R09: ffff8a9de6c01440
[ 3452.011998] R10: 0000000000000000 R11: ffffffff94664da8 R12: 0000000000000000
[ 3452.014449] R13: 0000000000000000 R14: 0000000000000000 R15: ffff8a9de6583200
[ 3452.016893] FS: 000000000000000
[ 3452.020028] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3452.022109] CR2: 0000000000000000 CR3: 000000002040a001 CR4: 00000000001606e0
[ 3452.024568] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 3452.027003] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 3452.029446] Kernel panic - not syncing: Fatal exception in interrupt
[ 3452.031753] Kernel Offset: 0x11c00000 from 0xffffffff81000000 (relocation range: 0xffffffff80000
tags: | added: sru-20230710 |
summary: |
cpuset_hotplug in ubuntu_ltp_controllers triggers kernel bug - (arch/x86/xen/spinlock.c:62) on AWS cloud c3.xlarge + (arch/x86/xen/spinlock.c:62) and kernel panic on AWS cloud c3.xlarge |
summary: |
- cpuset_hotplug in ubuntu_ltp_controllers triggers kernel bug - (arch/x86/xen/spinlock.c:62) and kernel panic on AWS cloud c3.xlarge + [Potential Regression] cpuset_hotplug in ubuntu_ltp_controllers triggers + kernel bug (arch/x86/xen/spinlock.c:62) and kernel panic on AWS cloud + c3.xlarge |
summary: |
- [Potential Regression] cpuset_hotplug in ubuntu_ltp_controllers triggers - kernel bug (arch/x86/xen/spinlock.c:62) and kernel panic on AWS cloud - c3.xlarge + [Potential Regression] ubuntu_ltp_controllers/cpuset_hotplug and + ubuntu_ltp/cpuhotplug:cpuhotplug02 triggers kernel bug + (arch/x86/xen/spinlock.c:62) and kernel panic on AWS cloud c3.xlarge |
summary: |
- [Potential Regression] ubuntu_ltp_controllers/cpuset_hotplug and - ubuntu_ltp/cpuhotplug:cpuhotplug02 triggers kernel bug + [Potential Regression] cpuhotplug related tests triggers kernel bug (arch/x86/xen/spinlock.c:62) and kernel panic on AWS cloud c3.xlarge |
Tested with older version of LTP (commit ac1a3e40c5b0) with 5.4.0-1107. 115~18. 04.1 Bionic AWS, on AWS cloud c3.xlarge. It's triggering a system crash as well: linux-aws- 5.4-I38rpz/ linux-aws- 5.4-5.4. 0/arch/ x86/xen/ spinlock. c:62!
[20570.648998] kernel BUG at /build/
Tested with 5.4.0-1106. 114~18. 04.1 on the very same system this test can finish without any issue. "cpuset_ hotplug_ test.sh" status> >> status= "ok" type=exited termination_id=0 corefile=no
<<<test_start>>>
tag=cpuset_hotplug stime=1691141679
cmdline=
contacts=""
analysis=exit
<<<test_output>>>
incrementing stop
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-3)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 7 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 9 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 11 TPASS: Cpuset vs CPU hotplug test succeeded.
<<<execution_
initiation_
duration=8 termination_
cutime=80 cstime=656
<<<test_end>>>
INFO: ltp-pan reported all tests PASS
LTP Version: 20220527
Done executing testcases.
####### ####### ####### ####### ####### ####### ####### ####### #######
LTP Version: 20220527