2023-07-10 11:03:37 |
Po-Hsu Lin |
bug |
|
|
added bug |
2023-07-10 11:03:49 |
Po-Hsu Lin |
tags |
4.4 amd64 aws sru-20230612 xenial |
4.4 amd64 aws sru-20230612 ubuntu-ltp-controllers xenial |
|
2023-07-10 11:15:02 |
Po-Hsu Lin |
description |
Issue found on X-aws 4.4.0-1158.173
The cpuset_hotplug test times out after 45 minutes on VMs (it passes on AWS bare-metal instances) because of a task hang.
Test output:
<<<test_start>>>
tag=cpuset_hotplug stime=1688982088
cmdline="cpuset_hotplug_test.sh"
contacts=""
analysis=exit
<<<test_output>>>
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-1)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
dmesg output:
[ 1083.816153] LTP: starting cpuset_hotplug (cpuset_hotplug_test.sh)
[ 1083.948273] Broke affinity for irq 4
[ 1083.948396] Broke affinity for irq 61
[ 1083.948402] Broke affinity for irq 62
[ 1083.948408] Broke affinity for irq 63
[ 1083.948413] Broke affinity for irq 64
[ 1083.949824] smpboot: CPU 1 is now offline
[ 1085.000502] installing Xen timer for CPU 1
[ 1085.020261] x86: Booting SMP configuration:
[ 1085.020263] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.021673] Skipped synchronization checks as TSC is reliable.
[ 1085.021693] cpu 1 spinlock event irq 59
[ 1085.069370] smpboot: CPU 1 is now offline
[ 1085.081932] installing Xen timer for CPU 1
[ 1085.100218] x86: Booting SMP configuration:
[ 1085.100220] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.101588] Skipped synchronization checks as TSC is reliable.
[ 1085.101608] cpu 1 spinlock event irq 59
[ 1086.405418] smpboot: CPU 1 is now offline
[ 1087.467625] installing Xen timer for CPU 1
[ 1087.484259] x86: Booting SMP configuration:
[ 1087.484261] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1087.485700] Skipped synchronization checks as TSC is reliable.
[ 1087.485721] cpu 1 spinlock event irq 59
[ 1320.308124] INFO: task systemd:1 blocked for more than 120 seconds.
[ 1320.311249] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.313959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.318032] systemd D ffff8800ea4f7d28 0 1 0 0x00000000
[ 1320.318037] ffff8800ea4f7d28 ffff8800372adb40 ffffffff81e13500 ffff8800ea4f8000
[ 1320.318042] ffff8800ea4f8000 ffffffff81e605e4 ffff8800ea4f8000 00000000ffffffff
[ 1320.318045] ffffffff81e605e8 ffff8800ea4f7d40 ffffffff81844555 ffffffff81e605e0
[ 1320.318048] Call Trace:
[ 1320.318057] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.318061] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.318065] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.318068] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.318074] [<ffffffff8111f1bc>] proc_cgroup_show+0x4c/0x2e0
[ 1320.318078] [<ffffffff81289ac5>] proc_single_show+0x55/0x90
[ 1320.318083] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.318088] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.318093] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.318096] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.318100] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.318104] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.321669] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.324364] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.328026] kworker/0:3 D ffff8800364cfc68 0 473 2 0x00000000
[ 1320.328031] Workqueue: cgroup_destroy css_killed_work_fn
[ 1320.328033] ffff8800364cfc68 ffff8800d3ff4cc0 ffff880036d94600 ffff880036f9f000
[ 1320.328035] ffff8800364d0000 ffffffff81e42fa4 ffff880036f9f000 00000000ffffffff
[ 1320.328037] ffffffff81e42fa8 ffff8800364cfc80 ffffffff81844555 ffffffff81e42fa0
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.328081] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328085] [<ffffffff81193a96>] __jump_label_update+0x76/0x90
[ 1320.328088] [<ffffffff81193b38>] jump_label_update+0x88/0x90
[ 1320.328092] [<ffffffff81193de1>] __static_key_slow_dec+0x41/0x90
[ 1320.328095] [<ffffffff81193e52>] static_key_slow_dec+0x22/0x60
[ 1320.328098] [<ffffffff8112263f>] cpuset_css_offline+0x3f/0x70
[ 1320.328102] [<ffffffff811195ac>] css_killed_work_fn+0x3c/0xd0
[ 1320.328106] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.328109] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.328116] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.328119] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.328122] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328128] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328131] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.328134] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328142] INFO: task irqbalance:1189 blocked for more than 120 seconds.
[ 1320.331605] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.334291] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.338224] irqbalance D ffff8800e8b07ce8 0 1189 1 0x00000000
[ 1320.338228] ffff8800e8b07ce8 ffff8800e8b07dd0 ffffffff81e13500 ffff8800369dc600
[ 1320.338232] ffff8800e8b08000 ffff8800eb64f7cc ffff8800369dc600 00000000ffffffff
[ 1320.338235] ffff8800eb64f7d0 ffff8800e8b07d00 ffffffff81844555 ffff8800eb64f7c8
[ 1320.338238] Call Trace:
[ 1320.338242] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.338249] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.338256] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.338259] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.338264] [<ffffffff81554500>] online_show+0x20/0x60
[ 1320.338269] [<ffffffff81554a63>] dev_attr_show+0x23/0x60
[ 1320.338272] [<ffffffff818467f2>] ? mutex_lock+0x12/0x30
[ 1320.338279] [<ffffffff8129aeaf>] sysfs_kf_seq_show+0xbf/0x140
[ 1320.338283] [<ffffffff812993a9>] kernfs_seq_show+0x29/0x30
[ 1320.338285] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.338289] [<ffffffff81299b5d>] kernfs_fop_read+0x10d/0x160
[ 1320.338292] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.338296] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.338299] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.338303] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.338306] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.338317] INFO: task kworker/0:2:1448 blocked for more than 120 seconds.
[ 1320.341877] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.344538] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.348296] kworker/0:2 D ffff8800bf097d38 0 1448 2 0x00000000
[ 1320.348305] Workqueue: events vmstat_shepherd
[ 1320.348307] ffff8800bf097d38 ffffffff8141c925 ffffffff81e13500 ffff88008aa32a00
[ 1320.348310] ffff8800bf098000 ffffffff81e42fa4 ffff88008aa32a00 00000000ffffffff
[ 1320.348313] ffffffff81e42fa8 ffff8800bf097d50 ffffffff81844555 ffffffff81e42fa0
[ 1320.348316] Call Trace:
[ 1320.348321] [<ffffffff8141c925>] ? find_next_bit+0x15/0x20
[ 1320.348329] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.348332] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.348336] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.348342] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.348345] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.348350] [<ffffffff811b65c1>] vmstat_shepherd+0x21/0xd0
[ 1320.348354] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.348362] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.348366] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.348369] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.348372] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.348375] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348378] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.348381] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.351463] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.354182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.358193] echo D ffff88008aaf3c28 0 1878 1761 0x00000000
[ 1320.358197] ffff88008aaf3c28 000000000000001b ffffffff81e13500 ffff880036a83800
[ 1320.358201] ffff88008aaf4000 ffffffff81e6aec4 ffff880036a83800 00000000ffffffff
[ 1320.358204] ffffffff81e6aec8 ffff88008aaf3c40 ffffffff81844555 ffffffff81e6aec0
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
[ 1320.358247] [<ffffffff810a659c>] notifier_call_chain+0x4c/0x70
[ 1320.358251] [<ffffffff810a66fe>] __raw_notifier_call_chain+0xe/0x10
[ 1320.358258] [<ffffffff81084913>] cpu_notify+0x23/0x40
[ 1320.358262] [<ffffffff810850c3>] _cpu_up+0x183/0x1b0
[ 1320.358265] [<ffffffff810851a2>] cpu_up+0xb2/0xd0
[ 1320.358268] [<ffffffff8155c5ed>] cpu_subsys_online+0x3d/0x90
[ 1320.358271] [<ffffffff81556f12>] device_online+0x62/0x90
[ 1320.358274] [<ffffffff81556fc9>] online_store+0x89/0xa0
[ 1320.358277] [<ffffffff81553e8b>] dev_attr_store+0x1b/0x30
[ 1320.358281] [<ffffffff8129a89c>] sysfs_kf_write+0x3c/0x50
[ 1320.358284] [<ffffffff8129a403>] kernfs_fop_write+0x143/0x180
[ 1320.358288] [<ffffffff8121833b>] __vfs_write+0x1b/0x40
[ 1320.358291] [<ffffffff81218ad9>] vfs_write+0xa9/0x1a0
[ 1320.358295] [<ffffffff812197bc>] SyS_write+0x5c/0xe0
[ 1320.358297] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0 |
Issue found on X-aws 4.4.0-1158.173
The cpuset_hotplug test times out after 45 minutes on VMs (it passes on AWS bare-metal instances) because of a task hang.
Test output:
<<<test_start>>>
tag=cpuset_hotplug stime=1688982088
cmdline="cpuset_hotplug_test.sh"
contacts=""
analysis=exit
<<<test_output>>>
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-1)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
This issue can be reproduced with X-aws-1128, dmesg output:
[ 1083.816153] LTP: starting cpuset_hotplug (cpuset_hotplug_test.sh)
[ 1083.948273] Broke affinity for irq 4
[ 1083.948396] Broke affinity for irq 61
[ 1083.948402] Broke affinity for irq 62
[ 1083.948408] Broke affinity for irq 63
[ 1083.948413] Broke affinity for irq 64
[ 1083.949824] smpboot: CPU 1 is now offline
[ 1085.000502] installing Xen timer for CPU 1
[ 1085.020261] x86: Booting SMP configuration:
[ 1085.020263] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.021673] Skipped synchronization checks as TSC is reliable.
[ 1085.021693] cpu 1 spinlock event irq 59
[ 1085.069370] smpboot: CPU 1 is now offline
[ 1085.081932] installing Xen timer for CPU 1
[ 1085.100218] x86: Booting SMP configuration:
[ 1085.100220] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.101588] Skipped synchronization checks as TSC is reliable.
[ 1085.101608] cpu 1 spinlock event irq 59
[ 1086.405418] smpboot: CPU 1 is now offline
[ 1087.467625] installing Xen timer for CPU 1
[ 1087.484259] x86: Booting SMP configuration:
[ 1087.484261] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1087.485700] Skipped synchronization checks as TSC is reliable.
[ 1087.485721] cpu 1 spinlock event irq 59
[ 1320.308124] INFO: task systemd:1 blocked for more than 120 seconds.
[ 1320.311249] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.313959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.318032] systemd D ffff8800ea4f7d28 0 1 0 0x00000000
[ 1320.318037] ffff8800ea4f7d28 ffff8800372adb40 ffffffff81e13500 ffff8800ea4f8000
[ 1320.318042] ffff8800ea4f8000 ffffffff81e605e4 ffff8800ea4f8000 00000000ffffffff
[ 1320.318045] ffffffff81e605e8 ffff8800ea4f7d40 ffffffff81844555 ffffffff81e605e0
[ 1320.318048] Call Trace:
[ 1320.318057] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.318061] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.318065] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.318068] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.318074] [<ffffffff8111f1bc>] proc_cgroup_show+0x4c/0x2e0
[ 1320.318078] [<ffffffff81289ac5>] proc_single_show+0x55/0x90
[ 1320.318083] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.318088] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.318093] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.318096] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.318100] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.318104] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.321669] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.324364] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.328026] kworker/0:3 D ffff8800364cfc68 0 473 2 0x00000000
[ 1320.328031] Workqueue: cgroup_destroy css_killed_work_fn
[ 1320.328033] ffff8800364cfc68 ffff8800d3ff4cc0 ffff880036d94600 ffff880036f9f000
[ 1320.328035] ffff8800364d0000 ffffffff81e42fa4 ffff880036f9f000 00000000ffffffff
[ 1320.328037] ffffffff81e42fa8 ffff8800364cfc80 ffffffff81844555 ffffffff81e42fa0
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.328081] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328085] [<ffffffff81193a96>] __jump_label_update+0x76/0x90
[ 1320.328088] [<ffffffff81193b38>] jump_label_update+0x88/0x90
[ 1320.328092] [<ffffffff81193de1>] __static_key_slow_dec+0x41/0x90
[ 1320.328095] [<ffffffff81193e52>] static_key_slow_dec+0x22/0x60
[ 1320.328098] [<ffffffff8112263f>] cpuset_css_offline+0x3f/0x70
[ 1320.328102] [<ffffffff811195ac>] css_killed_work_fn+0x3c/0xd0
[ 1320.328106] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.328109] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.328116] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.328119] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.328122] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328128] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328131] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.328134] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328142] INFO: task irqbalance:1189 blocked for more than 120 seconds.
[ 1320.331605] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.334291] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.338224] irqbalance D ffff8800e8b07ce8 0 1189 1 0x00000000
[ 1320.338228] ffff8800e8b07ce8 ffff8800e8b07dd0 ffffffff81e13500 ffff8800369dc600
[ 1320.338232] ffff8800e8b08000 ffff8800eb64f7cc ffff8800369dc600 00000000ffffffff
[ 1320.338235] ffff8800eb64f7d0 ffff8800e8b07d00 ffffffff81844555 ffff8800eb64f7c8
[ 1320.338238] Call Trace:
[ 1320.338242] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.338249] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.338256] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.338259] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.338264] [<ffffffff81554500>] online_show+0x20/0x60
[ 1320.338269] [<ffffffff81554a63>] dev_attr_show+0x23/0x60
[ 1320.338272] [<ffffffff818467f2>] ? mutex_lock+0x12/0x30
[ 1320.338279] [<ffffffff8129aeaf>] sysfs_kf_seq_show+0xbf/0x140
[ 1320.338283] [<ffffffff812993a9>] kernfs_seq_show+0x29/0x30
[ 1320.338285] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.338289] [<ffffffff81299b5d>] kernfs_fop_read+0x10d/0x160
[ 1320.338292] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.338296] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.338299] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.338303] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.338306] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.338317] INFO: task kworker/0:2:1448 blocked for more than 120 seconds.
[ 1320.341877] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.344538] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.348296] kworker/0:2 D ffff8800bf097d38 0 1448 2 0x00000000
[ 1320.348305] Workqueue: events vmstat_shepherd
[ 1320.348307] ffff8800bf097d38 ffffffff8141c925 ffffffff81e13500 ffff88008aa32a00
[ 1320.348310] ffff8800bf098000 ffffffff81e42fa4 ffff88008aa32a00 00000000ffffffff
[ 1320.348313] ffffffff81e42fa8 ffff8800bf097d50 ffffffff81844555 ffffffff81e42fa0
[ 1320.348316] Call Trace:
[ 1320.348321] [<ffffffff8141c925>] ? find_next_bit+0x15/0x20
[ 1320.348329] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.348332] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.348336] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.348342] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.348345] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.348350] [<ffffffff811b65c1>] vmstat_shepherd+0x21/0xd0
[ 1320.348354] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.348362] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.348366] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.348369] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.348372] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.348375] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348378] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.348381] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.351463] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.354182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.358193] echo D ffff88008aaf3c28 0 1878 1761 0x00000000
[ 1320.358197] ffff88008aaf3c28 000000000000001b ffffffff81e13500 ffff880036a83800
[ 1320.358201] ffff88008aaf4000 ffffffff81e6aec4 ffff880036a83800 00000000ffffffff
[ 1320.358204] ffffffff81e6aec8 ffff88008aaf3c40 ffffffff81844555 ffffffff81e6aec0
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
[ 1320.358247] [<ffffffff810a659c>] notifier_call_chain+0x4c/0x70
[ 1320.358251] [<ffffffff810a66fe>] __raw_notifier_call_chain+0xe/0x10
[ 1320.358258] [<ffffffff81084913>] cpu_notify+0x23/0x40
[ 1320.358262] [<ffffffff810850c3>] _cpu_up+0x183/0x1b0
[ 1320.358265] [<ffffffff810851a2>] cpu_up+0xb2/0xd0
[ 1320.358268] [<ffffffff8155c5ed>] cpu_subsys_online+0x3d/0x90
[ 1320.358271] [<ffffffff81556f12>] device_online+0x62/0x90
[ 1320.358274] [<ffffffff81556fc9>] online_store+0x89/0xa0
[ 1320.358277] [<ffffffff81553e8b>] dev_attr_store+0x1b/0x30
[ 1320.358281] [<ffffffff8129a89c>] sysfs_kf_write+0x3c/0x50
[ 1320.358284] [<ffffffff8129a403>] kernfs_fop_write+0x143/0x180
[ 1320.358288] [<ffffffff8121833b>] __vfs_write+0x1b/0x40
[ 1320.358291] [<ffffffff81218ad9>] vfs_write+0xa9/0x1a0
[ 1320.358295] [<ffffffff812197bc>] SyS_write+0x5c/0xe0
[ 1320.358297] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0 |
|
2023-07-10 13:20:16 |
Po-Hsu Lin |
description |
Issue found on X-aws 4.4.0-1158.173
The cpuset_hotplug test times out after 45 minutes on VMs (it passes on AWS bare-metal instances) because of a task hang.
Test output:
<<<test_start>>>
tag=cpuset_hotplug stime=1688982088
cmdline="cpuset_hotplug_test.sh"
contacts=""
analysis=exit
<<<test_output>>>
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-1)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
This issue can be reproduced with X-aws-1128, dmesg output:
[ 1083.816153] LTP: starting cpuset_hotplug (cpuset_hotplug_test.sh)
[ 1083.948273] Broke affinity for irq 4
[ 1083.948396] Broke affinity for irq 61
[ 1083.948402] Broke affinity for irq 62
[ 1083.948408] Broke affinity for irq 63
[ 1083.948413] Broke affinity for irq 64
[ 1083.949824] smpboot: CPU 1 is now offline
[ 1085.000502] installing Xen timer for CPU 1
[ 1085.020261] x86: Booting SMP configuration:
[ 1085.020263] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.021673] Skipped synchronization checks as TSC is reliable.
[ 1085.021693] cpu 1 spinlock event irq 59
[ 1085.069370] smpboot: CPU 1 is now offline
[ 1085.081932] installing Xen timer for CPU 1
[ 1085.100218] x86: Booting SMP configuration:
[ 1085.100220] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.101588] Skipped synchronization checks as TSC is reliable.
[ 1085.101608] cpu 1 spinlock event irq 59
[ 1086.405418] smpboot: CPU 1 is now offline
[ 1087.467625] installing Xen timer for CPU 1
[ 1087.484259] x86: Booting SMP configuration:
[ 1087.484261] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1087.485700] Skipped synchronization checks as TSC is reliable.
[ 1087.485721] cpu 1 spinlock event irq 59
[ 1320.308124] INFO: task systemd:1 blocked for more than 120 seconds.
[ 1320.311249] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.313959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.318032] systemd D ffff8800ea4f7d28 0 1 0 0x00000000
[ 1320.318037] ffff8800ea4f7d28 ffff8800372adb40 ffffffff81e13500 ffff8800ea4f8000
[ 1320.318042] ffff8800ea4f8000 ffffffff81e605e4 ffff8800ea4f8000 00000000ffffffff
[ 1320.318045] ffffffff81e605e8 ffff8800ea4f7d40 ffffffff81844555 ffffffff81e605e0
[ 1320.318048] Call Trace:
[ 1320.318057] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.318061] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.318065] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.318068] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.318074] [<ffffffff8111f1bc>] proc_cgroup_show+0x4c/0x2e0
[ 1320.318078] [<ffffffff81289ac5>] proc_single_show+0x55/0x90
[ 1320.318083] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.318088] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.318093] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.318096] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.318100] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.318104] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.321669] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.324364] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.328026] kworker/0:3 D ffff8800364cfc68 0 473 2 0x00000000
[ 1320.328031] Workqueue: cgroup_destroy css_killed_work_fn
[ 1320.328033] ffff8800364cfc68 ffff8800d3ff4cc0 ffff880036d94600 ffff880036f9f000
[ 1320.328035] ffff8800364d0000 ffffffff81e42fa4 ffff880036f9f000 00000000ffffffff
[ 1320.328037] ffffffff81e42fa8 ffff8800364cfc80 ffffffff81844555 ffffffff81e42fa0
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.328081] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328085] [<ffffffff81193a96>] __jump_label_update+0x76/0x90
[ 1320.328088] [<ffffffff81193b38>] jump_label_update+0x88/0x90
[ 1320.328092] [<ffffffff81193de1>] __static_key_slow_dec+0x41/0x90
[ 1320.328095] [<ffffffff81193e52>] static_key_slow_dec+0x22/0x60
[ 1320.328098] [<ffffffff8112263f>] cpuset_css_offline+0x3f/0x70
[ 1320.328102] [<ffffffff811195ac>] css_killed_work_fn+0x3c/0xd0
[ 1320.328106] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.328109] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.328116] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.328119] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.328122] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328128] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328131] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.328134] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328142] INFO: task irqbalance:1189 blocked for more than 120 seconds.
[ 1320.331605] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.334291] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.338224] irqbalance D ffff8800e8b07ce8 0 1189 1 0x00000000
[ 1320.338228] ffff8800e8b07ce8 ffff8800e8b07dd0 ffffffff81e13500 ffff8800369dc600
[ 1320.338232] ffff8800e8b08000 ffff8800eb64f7cc ffff8800369dc600 00000000ffffffff
[ 1320.338235] ffff8800eb64f7d0 ffff8800e8b07d00 ffffffff81844555 ffff8800eb64f7c8
[ 1320.338238] Call Trace:
[ 1320.338242] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.338249] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.338256] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.338259] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.338264] [<ffffffff81554500>] online_show+0x20/0x60
[ 1320.338269] [<ffffffff81554a63>] dev_attr_show+0x23/0x60
[ 1320.338272] [<ffffffff818467f2>] ? mutex_lock+0x12/0x30
[ 1320.338279] [<ffffffff8129aeaf>] sysfs_kf_seq_show+0xbf/0x140
[ 1320.338283] [<ffffffff812993a9>] kernfs_seq_show+0x29/0x30
[ 1320.338285] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.338289] [<ffffffff81299b5d>] kernfs_fop_read+0x10d/0x160
[ 1320.338292] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.338296] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.338299] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.338303] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.338306] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.338317] INFO: task kworker/0:2:1448 blocked for more than 120 seconds.
[ 1320.341877] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.344538] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.348296] kworker/0:2 D ffff8800bf097d38 0 1448 2 0x00000000
[ 1320.348305] Workqueue: events vmstat_shepherd
[ 1320.348307] ffff8800bf097d38 ffffffff8141c925 ffffffff81e13500 ffff88008aa32a00
[ 1320.348310] ffff8800bf098000 ffffffff81e42fa4 ffff88008aa32a00 00000000ffffffff
[ 1320.348313] ffffffff81e42fa8 ffff8800bf097d50 ffffffff81844555 ffffffff81e42fa0
[ 1320.348316] Call Trace:
[ 1320.348321] [<ffffffff8141c925>] ? find_next_bit+0x15/0x20
[ 1320.348329] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.348332] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.348336] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.348342] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.348345] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.348350] [<ffffffff811b65c1>] vmstat_shepherd+0x21/0xd0
[ 1320.348354] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.348362] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.348366] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.348369] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.348372] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.348375] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348378] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.348381] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.351463] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.354182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.358193] echo D ffff88008aaf3c28 0 1878 1761 0x00000000
[ 1320.358197] ffff88008aaf3c28 000000000000001b ffffffff81e13500 ffff880036a83800
[ 1320.358201] ffff88008aaf4000 ffffffff81e6aec4 ffff880036a83800 00000000ffffffff
[ 1320.358204] ffffffff81e6aec8 ffff88008aaf3c40 ffffffff81844555 ffffffff81e6aec0
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
[ 1320.358247] [<ffffffff810a659c>] notifier_call_chain+0x4c/0x70
[ 1320.358251] [<ffffffff810a66fe>] __raw_notifier_call_chain+0xe/0x10
[ 1320.358258] [<ffffffff81084913>] cpu_notify+0x23/0x40
[ 1320.358262] [<ffffffff810850c3>] _cpu_up+0x183/0x1b0
[ 1320.358265] [<ffffffff810851a2>] cpu_up+0xb2/0xd0
[ 1320.358268] [<ffffffff8155c5ed>] cpu_subsys_online+0x3d/0x90
[ 1320.358271] [<ffffffff81556f12>] device_online+0x62/0x90
[ 1320.358274] [<ffffffff81556fc9>] online_store+0x89/0xa0
[ 1320.358277] [<ffffffff81553e8b>] dev_attr_store+0x1b/0x30
[ 1320.358281] [<ffffffff8129a89c>] sysfs_kf_write+0x3c/0x50
[ 1320.358284] [<ffffffff8129a403>] kernfs_fop_write+0x143/0x180
[ 1320.358288] [<ffffffff8121833b>] __vfs_write+0x1b/0x40
[ 1320.358291] [<ffffffff81218ad9>] vfs_write+0xa9/0x1a0
[ 1320.358295] [<ffffffff812197bc>] SyS_write+0x5c/0xe0
[ 1320.358297] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0 |
Issue found on X-aws 4.4.0-1158.173
The cpuset_hotplug test times out after 45 minutes on VMs (it passes on AWS bare-metal instances) because of a task hang.
Test output:
<<<test_start>>>
tag=cpuset_hotplug stime=1688982088
cmdline="cpuset_hotplug_test.sh"
contacts=""
analysis=exit
<<<test_output>>>
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-1)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
This issue can be reproduced with X-aws-1128, dmesg output:
[ 1083.816153] LTP: starting cpuset_hotplug (cpuset_hotplug_test.sh)
[ 1083.948273] Broke affinity for irq 4
[ 1083.948396] Broke affinity for irq 61
[ 1083.948402] Broke affinity for irq 62
[ 1083.948408] Broke affinity for irq 63
[ 1083.948413] Broke affinity for irq 64
[ 1083.949824] smpboot: CPU 1 is now offline
[ 1085.000502] installing Xen timer for CPU 1
[ 1085.020261] x86: Booting SMP configuration:
[ 1085.020263] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.021673] Skipped synchronization checks as TSC is reliable.
[ 1085.021693] cpu 1 spinlock event irq 59
[ 1085.069370] smpboot: CPU 1 is now offline
[ 1085.081932] installing Xen timer for CPU 1
[ 1085.100218] x86: Booting SMP configuration:
[ 1085.100220] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.101588] Skipped synchronization checks as TSC is reliable.
[ 1085.101608] cpu 1 spinlock event irq 59
[ 1086.405418] smpboot: CPU 1 is now offline
[ 1087.467625] installing Xen timer for CPU 1
[ 1087.484259] x86: Booting SMP configuration:
[ 1087.484261] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1087.485700] Skipped synchronization checks as TSC is reliable.
[ 1087.485721] cpu 1 spinlock event irq 59
[ 1320.308124] INFO: task systemd:1 blocked for more than 120 seconds.
[ 1320.311249] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.313959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.318032] systemd D ffff8800ea4f7d28 0 1 0 0x00000000
[ 1320.318037] ffff8800ea4f7d28 ffff8800372adb40 ffffffff81e13500 ffff8800ea4f8000
[ 1320.318042] ffff8800ea4f8000 ffffffff81e605e4 ffff8800ea4f8000 00000000ffffffff
[ 1320.318045] ffffffff81e605e8 ffff8800ea4f7d40 ffffffff81844555 ffffffff81e605e0
[ 1320.318048] Call Trace:
[ 1320.318057] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.318061] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.318065] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.318068] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.318074] [<ffffffff8111f1bc>] proc_cgroup_show+0x4c/0x2e0
[ 1320.318078] [<ffffffff81289ac5>] proc_single_show+0x55/0x90
[ 1320.318083] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.318088] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.318093] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.318096] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.318100] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.318104] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.321669] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.324364] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.328026] kworker/0:3 D ffff8800364cfc68 0 473 2 0x00000000
[ 1320.328031] Workqueue: cgroup_destroy css_killed_work_fn
[ 1320.328033] ffff8800364cfc68 ffff8800d3ff4cc0 ffff880036d94600 ffff880036f9f000
[ 1320.328035] ffff8800364d0000 ffffffff81e42fa4 ffff880036f9f000 00000000ffffffff
[ 1320.328037] ffffffff81e42fa8 ffff8800364cfc80 ffffffff81844555 ffffffff81e42fa0
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.328081] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328085] [<ffffffff81193a96>] __jump_label_update+0x76/0x90
[ 1320.328088] [<ffffffff81193b38>] jump_label_update+0x88/0x90
[ 1320.328092] [<ffffffff81193de1>] __static_key_slow_dec+0x41/0x90
[ 1320.328095] [<ffffffff81193e52>] static_key_slow_dec+0x22/0x60
[ 1320.328098] [<ffffffff8112263f>] cpuset_css_offline+0x3f/0x70
[ 1320.328102] [<ffffffff811195ac>] css_killed_work_fn+0x3c/0xd0
[ 1320.328106] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.328109] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.328116] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.328119] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.328122] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328128] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328131] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.328134] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328142] INFO: task irqbalance:1189 blocked for more than 120 seconds.
[ 1320.331605] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.334291] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.338224] irqbalance D ffff8800e8b07ce8 0 1189 1 0x00000000
[ 1320.338228] ffff8800e8b07ce8 ffff8800e8b07dd0 ffffffff81e13500 ffff8800369dc600
[ 1320.338232] ffff8800e8b08000 ffff8800eb64f7cc ffff8800369dc600 00000000ffffffff
[ 1320.338235] ffff8800eb64f7d0 ffff8800e8b07d00 ffffffff81844555 ffff8800eb64f7c8
[ 1320.338238] Call Trace:
[ 1320.338242] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.338249] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.338256] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.338259] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.338264] [<ffffffff81554500>] online_show+0x20/0x60
[ 1320.338269] [<ffffffff81554a63>] dev_attr_show+0x23/0x60
[ 1320.338272] [<ffffffff818467f2>] ? mutex_lock+0x12/0x30
[ 1320.338279] [<ffffffff8129aeaf>] sysfs_kf_seq_show+0xbf/0x140
[ 1320.338283] [<ffffffff812993a9>] kernfs_seq_show+0x29/0x30
[ 1320.338285] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.338289] [<ffffffff81299b5d>] kernfs_fop_read+0x10d/0x160
[ 1320.338292] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.338296] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.338299] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.338303] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.338306] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.338317] INFO: task kworker/0:2:1448 blocked for more than 120 seconds.
[ 1320.341877] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.344538] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.348296] kworker/0:2 D ffff8800bf097d38 0 1448 2 0x00000000
[ 1320.348305] Workqueue: events vmstat_shepherd
[ 1320.348307] ffff8800bf097d38 ffffffff8141c925 ffffffff81e13500 ffff88008aa32a00
[ 1320.348310] ffff8800bf098000 ffffffff81e42fa4 ffff88008aa32a00 00000000ffffffff
[ 1320.348313] ffffffff81e42fa8 ffff8800bf097d50 ffffffff81844555 ffffffff81e42fa0
[ 1320.348316] Call Trace:
[ 1320.348321] [<ffffffff8141c925>] ? find_next_bit+0x15/0x20
[ 1320.348329] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.348332] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.348336] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.348342] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.348345] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.348350] [<ffffffff811b65c1>] vmstat_shepherd+0x21/0xd0
[ 1320.348354] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.348362] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.348366] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.348369] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.348372] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.348375] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348378] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.348381] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.351463] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.354182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.358193] echo D ffff88008aaf3c28 0 1878 1761 0x00000000
[ 1320.358197] ffff88008aaf3c28 000000000000001b ffffffff81e13500 ffff880036a83800
[ 1320.358201] ffff88008aaf4000 ffffffff81e6aec4 ffff880036a83800 00000000ffffffff
[ 1320.358204] ffffffff81e6aec8 ffff88008aaf3c40 ffffffff81844555 ffffffff81e6aec0
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
[ 1320.358247] [<ffffffff810a659c>] notifier_call_chain+0x4c/0x70
[ 1320.358251] [<ffffffff810a66fe>] __raw_notifier_call_chain+0xe/0x10
[ 1320.358258] [<ffffffff81084913>] cpu_notify+0x23/0x40
[ 1320.358262] [<ffffffff810850c3>] _cpu_up+0x183/0x1b0
[ 1320.358265] [<ffffffff810851a2>] cpu_up+0xb2/0xd0
[ 1320.358268] [<ffffffff8155c5ed>] cpu_subsys_online+0x3d/0x90
[ 1320.358271] [<ffffffff81556f12>] device_online+0x62/0x90
[ 1320.358274] [<ffffffff81556fc9>] online_store+0x89/0xa0
[ 1320.358277] [<ffffffff81553e8b>] dev_attr_store+0x1b/0x30
[ 1320.358281] [<ffffffff8129a89c>] sysfs_kf_write+0x3c/0x50
[ 1320.358284] [<ffffffff8129a403>] kernfs_fop_write+0x143/0x180
[ 1320.358288] [<ffffffff8121833b>] __vfs_write+0x1b/0x40
[ 1320.358291] [<ffffffff81218ad9>] vfs_write+0xa9/0x1a0
[ 1320.358295] [<ffffffff812197bc>] SyS_write+0x5c/0xe0
[ 1320.358297] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
This failure causes the subsequent cpuset_memory test to fail and interrupts the test run. |
|
2023-07-12 06:14:21 |
Po-Hsu Lin |
description |
Issue found on X-aws 4.4.0-1158.173
The cpuset_hotplug test times out after 45 minutes on VMs (it passes on AWS bare-metal instances) because of a task hang.
Test output:
<<<test_start>>>
tag=cpuset_hotplug stime=1688982088
cmdline="cpuset_hotplug_test.sh"
contacts=""
analysis=exit
<<<test_output>>>
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-1)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
This issue can also be reproduced with X-aws-1128. dmesg output:
[ 1083.816153] LTP: starting cpuset_hotplug (cpuset_hotplug_test.sh)
[ 1083.948273] Broke affinity for irq 4
[ 1083.948396] Broke affinity for irq 61
[ 1083.948402] Broke affinity for irq 62
[ 1083.948408] Broke affinity for irq 63
[ 1083.948413] Broke affinity for irq 64
[ 1083.949824] smpboot: CPU 1 is now offline
[ 1085.000502] installing Xen timer for CPU 1
[ 1085.020261] x86: Booting SMP configuration:
[ 1085.020263] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.021673] Skipped synchronization checks as TSC is reliable.
[ 1085.021693] cpu 1 spinlock event irq 59
[ 1085.069370] smpboot: CPU 1 is now offline
[ 1085.081932] installing Xen timer for CPU 1
[ 1085.100218] x86: Booting SMP configuration:
[ 1085.100220] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.101588] Skipped synchronization checks as TSC is reliable.
[ 1085.101608] cpu 1 spinlock event irq 59
[ 1086.405418] smpboot: CPU 1 is now offline
[ 1087.467625] installing Xen timer for CPU 1
[ 1087.484259] x86: Booting SMP configuration:
[ 1087.484261] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1087.485700] Skipped synchronization checks as TSC is reliable.
[ 1087.485721] cpu 1 spinlock event irq 59
[ 1320.308124] INFO: task systemd:1 blocked for more than 120 seconds.
[ 1320.311249] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.313959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.318032] systemd D ffff8800ea4f7d28 0 1 0 0x00000000
[ 1320.318037] ffff8800ea4f7d28 ffff8800372adb40 ffffffff81e13500 ffff8800ea4f8000
[ 1320.318042] ffff8800ea4f8000 ffffffff81e605e4 ffff8800ea4f8000 00000000ffffffff
[ 1320.318045] ffffffff81e605e8 ffff8800ea4f7d40 ffffffff81844555 ffffffff81e605e0
[ 1320.318048] Call Trace:
[ 1320.318057] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.318061] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.318065] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.318068] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.318074] [<ffffffff8111f1bc>] proc_cgroup_show+0x4c/0x2e0
[ 1320.318078] [<ffffffff81289ac5>] proc_single_show+0x55/0x90
[ 1320.318083] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.318088] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.318093] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.318096] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.318100] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.318104] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.321669] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.324364] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.328026] kworker/0:3 D ffff8800364cfc68 0 473 2 0x00000000
[ 1320.328031] Workqueue: cgroup_destroy css_killed_work_fn
[ 1320.328033] ffff8800364cfc68 ffff8800d3ff4cc0 ffff880036d94600 ffff880036f9f000
[ 1320.328035] ffff8800364d0000 ffffffff81e42fa4 ffff880036f9f000 00000000ffffffff
[ 1320.328037] ffffffff81e42fa8 ffff8800364cfc80 ffffffff81844555 ffffffff81e42fa0
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.328081] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328085] [<ffffffff81193a96>] __jump_label_update+0x76/0x90
[ 1320.328088] [<ffffffff81193b38>] jump_label_update+0x88/0x90
[ 1320.328092] [<ffffffff81193de1>] __static_key_slow_dec+0x41/0x90
[ 1320.328095] [<ffffffff81193e52>] static_key_slow_dec+0x22/0x60
[ 1320.328098] [<ffffffff8112263f>] cpuset_css_offline+0x3f/0x70
[ 1320.328102] [<ffffffff811195ac>] css_killed_work_fn+0x3c/0xd0
[ 1320.328106] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.328109] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.328116] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.328119] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.328122] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328128] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328131] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.328134] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328142] INFO: task irqbalance:1189 blocked for more than 120 seconds.
[ 1320.331605] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.334291] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.338224] irqbalance D ffff8800e8b07ce8 0 1189 1 0x00000000
[ 1320.338228] ffff8800e8b07ce8 ffff8800e8b07dd0 ffffffff81e13500 ffff8800369dc600
[ 1320.338232] ffff8800e8b08000 ffff8800eb64f7cc ffff8800369dc600 00000000ffffffff
[ 1320.338235] ffff8800eb64f7d0 ffff8800e8b07d00 ffffffff81844555 ffff8800eb64f7c8
[ 1320.338238] Call Trace:
[ 1320.338242] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.338249] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.338256] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.338259] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.338264] [<ffffffff81554500>] online_show+0x20/0x60
[ 1320.338269] [<ffffffff81554a63>] dev_attr_show+0x23/0x60
[ 1320.338272] [<ffffffff818467f2>] ? mutex_lock+0x12/0x30
[ 1320.338279] [<ffffffff8129aeaf>] sysfs_kf_seq_show+0xbf/0x140
[ 1320.338283] [<ffffffff812993a9>] kernfs_seq_show+0x29/0x30
[ 1320.338285] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.338289] [<ffffffff81299b5d>] kernfs_fop_read+0x10d/0x160
[ 1320.338292] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.338296] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.338299] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.338303] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.338306] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.338317] INFO: task kworker/0:2:1448 blocked for more than 120 seconds.
[ 1320.341877] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.344538] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.348296] kworker/0:2 D ffff8800bf097d38 0 1448 2 0x00000000
[ 1320.348305] Workqueue: events vmstat_shepherd
[ 1320.348307] ffff8800bf097d38 ffffffff8141c925 ffffffff81e13500 ffff88008aa32a00
[ 1320.348310] ffff8800bf098000 ffffffff81e42fa4 ffff88008aa32a00 00000000ffffffff
[ 1320.348313] ffffffff81e42fa8 ffff8800bf097d50 ffffffff81844555 ffffffff81e42fa0
[ 1320.348316] Call Trace:
[ 1320.348321] [<ffffffff8141c925>] ? find_next_bit+0x15/0x20
[ 1320.348329] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.348332] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.348336] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.348342] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.348345] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.348350] [<ffffffff811b65c1>] vmstat_shepherd+0x21/0xd0
[ 1320.348354] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.348362] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.348366] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.348369] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.348372] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.348375] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348378] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.348381] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.351463] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.354182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.358193] echo D ffff88008aaf3c28 0 1878 1761 0x00000000
[ 1320.358197] ffff88008aaf3c28 000000000000001b ffffffff81e13500 ffff880036a83800
[ 1320.358201] ffff88008aaf4000 ffffffff81e6aec4 ffff880036a83800 00000000ffffffff
[ 1320.358204] ffffffff81e6aec8 ffff88008aaf3c40 ffffffff81844555 ffffffff81e6aec0
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
[ 1320.358247] [<ffffffff810a659c>] notifier_call_chain+0x4c/0x70
[ 1320.358251] [<ffffffff810a66fe>] __raw_notifier_call_chain+0xe/0x10
[ 1320.358258] [<ffffffff81084913>] cpu_notify+0x23/0x40
[ 1320.358262] [<ffffffff810850c3>] _cpu_up+0x183/0x1b0
[ 1320.358265] [<ffffffff810851a2>] cpu_up+0xb2/0xd0
[ 1320.358268] [<ffffffff8155c5ed>] cpu_subsys_online+0x3d/0x90
[ 1320.358271] [<ffffffff81556f12>] device_online+0x62/0x90
[ 1320.358274] [<ffffffff81556fc9>] online_store+0x89/0xa0
[ 1320.358277] [<ffffffff81553e8b>] dev_attr_store+0x1b/0x30
[ 1320.358281] [<ffffffff8129a89c>] sysfs_kf_write+0x3c/0x50
[ 1320.358284] [<ffffffff8129a403>] kernfs_fop_write+0x143/0x180
[ 1320.358288] [<ffffffff8121833b>] __vfs_write+0x1b/0x40
[ 1320.358291] [<ffffffff81218ad9>] vfs_write+0xa9/0x1a0
[ 1320.358295] [<ffffffff812197bc>] SyS_write+0x5c/0xe0
[ 1320.358297] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
This failure causes the subsequent cpuset_memory test to fail and interrupts the test run. |
Issue found on X-aws 4.4.0-1158.173
The cpuset_hotplug test times out after 45 minutes on specific VMs (it passes on AWS bare-metal instances) because of a task hang. It failed on these instance types:
* c4.large
* c5n.large
* m5a.large
* r5.large
* t3.medium
Passed on these VMs:
* c3.xlarge
* t3a.2xlarge
* x1e.xlarge
Test output:
<<<test_start>>>
tag=cpuset_hotplug stime=1688982088
cmdline="cpuset_hotplug_test.sh"
contacts=""
analysis=exit
<<<test_output>>>
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-1)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
This issue can also be reproduced with X-aws-1128. dmesg output:
[ 1083.816153] LTP: starting cpuset_hotplug (cpuset_hotplug_test.sh)
[ 1083.948273] Broke affinity for irq 4
[ 1083.948396] Broke affinity for irq 61
[ 1083.948402] Broke affinity for irq 62
[ 1083.948408] Broke affinity for irq 63
[ 1083.948413] Broke affinity for irq 64
[ 1083.949824] smpboot: CPU 1 is now offline
[ 1085.000502] installing Xen timer for CPU 1
[ 1085.020261] x86: Booting SMP configuration:
[ 1085.020263] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.021673] Skipped synchronization checks as TSC is reliable.
[ 1085.021693] cpu 1 spinlock event irq 59
[ 1085.069370] smpboot: CPU 1 is now offline
[ 1085.081932] installing Xen timer for CPU 1
[ 1085.100218] x86: Booting SMP configuration:
[ 1085.100220] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.101588] Skipped synchronization checks as TSC is reliable.
[ 1085.101608] cpu 1 spinlock event irq 59
[ 1086.405418] smpboot: CPU 1 is now offline
[ 1087.467625] installing Xen timer for CPU 1
[ 1087.484259] x86: Booting SMP configuration:
[ 1087.484261] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1087.485700] Skipped synchronization checks as TSC is reliable.
[ 1087.485721] cpu 1 spinlock event irq 59
[ 1320.308124] INFO: task systemd:1 blocked for more than 120 seconds.
[ 1320.311249] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.313959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.318032] systemd D ffff8800ea4f7d28 0 1 0 0x00000000
[ 1320.318037] ffff8800ea4f7d28 ffff8800372adb40 ffffffff81e13500 ffff8800ea4f8000
[ 1320.318042] ffff8800ea4f8000 ffffffff81e605e4 ffff8800ea4f8000 00000000ffffffff
[ 1320.318045] ffffffff81e605e8 ffff8800ea4f7d40 ffffffff81844555 ffffffff81e605e0
[ 1320.318048] Call Trace:
[ 1320.318057] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.318061] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.318065] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.318068] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.318074] [<ffffffff8111f1bc>] proc_cgroup_show+0x4c/0x2e0
[ 1320.318078] [<ffffffff81289ac5>] proc_single_show+0x55/0x90
[ 1320.318083] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.318088] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.318093] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.318096] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.318100] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.318104] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.321669] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.324364] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.328026] kworker/0:3 D ffff8800364cfc68 0 473 2 0x00000000
[ 1320.328031] Workqueue: cgroup_destroy css_killed_work_fn
[ 1320.328033] ffff8800364cfc68 ffff8800d3ff4cc0 ffff880036d94600 ffff880036f9f000
[ 1320.328035] ffff8800364d0000 ffffffff81e42fa4 ffff880036f9f000 00000000ffffffff
[ 1320.328037] ffffffff81e42fa8 ffff8800364cfc80 ffffffff81844555 ffffffff81e42fa0
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.328081] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328085] [<ffffffff81193a96>] __jump_label_update+0x76/0x90
[ 1320.328088] [<ffffffff81193b38>] jump_label_update+0x88/0x90
[ 1320.328092] [<ffffffff81193de1>] __static_key_slow_dec+0x41/0x90
[ 1320.328095] [<ffffffff81193e52>] static_key_slow_dec+0x22/0x60
[ 1320.328098] [<ffffffff8112263f>] cpuset_css_offline+0x3f/0x70
[ 1320.328102] [<ffffffff811195ac>] css_killed_work_fn+0x3c/0xd0
[ 1320.328106] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.328109] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.328116] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.328119] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.328122] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328128] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328131] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.328134] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328142] INFO: task irqbalance:1189 blocked for more than 120 seconds.
[ 1320.331605] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.334291] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.338224] irqbalance D ffff8800e8b07ce8 0 1189 1 0x00000000
[ 1320.338228] ffff8800e8b07ce8 ffff8800e8b07dd0 ffffffff81e13500 ffff8800369dc600
[ 1320.338232] ffff8800e8b08000 ffff8800eb64f7cc ffff8800369dc600 00000000ffffffff
[ 1320.338235] ffff8800eb64f7d0 ffff8800e8b07d00 ffffffff81844555 ffff8800eb64f7c8
[ 1320.338238] Call Trace:
[ 1320.338242] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.338249] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.338256] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.338259] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.338264] [<ffffffff81554500>] online_show+0x20/0x60
[ 1320.338269] [<ffffffff81554a63>] dev_attr_show+0x23/0x60
[ 1320.338272] [<ffffffff818467f2>] ? mutex_lock+0x12/0x30
[ 1320.338279] [<ffffffff8129aeaf>] sysfs_kf_seq_show+0xbf/0x140
[ 1320.338283] [<ffffffff812993a9>] kernfs_seq_show+0x29/0x30
[ 1320.338285] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.338289] [<ffffffff81299b5d>] kernfs_fop_read+0x10d/0x160
[ 1320.338292] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.338296] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.338299] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.338303] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.338306] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.338317] INFO: task kworker/0:2:1448 blocked for more than 120 seconds.
[ 1320.341877] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.344538] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.348296] kworker/0:2 D ffff8800bf097d38 0 1448 2 0x00000000
[ 1320.348305] Workqueue: events vmstat_shepherd
[ 1320.348307] ffff8800bf097d38 ffffffff8141c925 ffffffff81e13500 ffff88008aa32a00
[ 1320.348310] ffff8800bf098000 ffffffff81e42fa4 ffff88008aa32a00 00000000ffffffff
[ 1320.348313] ffffffff81e42fa8 ffff8800bf097d50 ffffffff81844555 ffffffff81e42fa0
[ 1320.348316] Call Trace:
[ 1320.348321] [<ffffffff8141c925>] ? find_next_bit+0x15/0x20
[ 1320.348329] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.348332] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.348336] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.348342] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.348345] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.348350] [<ffffffff811b65c1>] vmstat_shepherd+0x21/0xd0
[ 1320.348354] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.348362] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.348366] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.348369] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.348372] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.348375] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348378] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.348381] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.351463] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.354182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.358193] echo D ffff88008aaf3c28 0 1878 1761 0x00000000
[ 1320.358197] ffff88008aaf3c28 000000000000001b ffffffff81e13500 ffff880036a83800
[ 1320.358201] ffff88008aaf4000 ffffffff81e6aec4 ffff880036a83800 00000000ffffffff
[ 1320.358204] ffffffff81e6aec8 ffff88008aaf3c40 ffffffff81844555 ffffffff81e6aec0
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
[ 1320.358247] [<ffffffff810a659c>] notifier_call_chain+0x4c/0x70
[ 1320.358251] [<ffffffff810a66fe>] __raw_notifier_call_chain+0xe/0x10
[ 1320.358258] [<ffffffff81084913>] cpu_notify+0x23/0x40
[ 1320.358262] [<ffffffff810850c3>] _cpu_up+0x183/0x1b0
[ 1320.358265] [<ffffffff810851a2>] cpu_up+0xb2/0xd0
[ 1320.358268] [<ffffffff8155c5ed>] cpu_subsys_online+0x3d/0x90
[ 1320.358271] [<ffffffff81556f12>] device_online+0x62/0x90
[ 1320.358274] [<ffffffff81556fc9>] online_store+0x89/0xa0
[ 1320.358277] [<ffffffff81553e8b>] dev_attr_store+0x1b/0x30
[ 1320.358281] [<ffffffff8129a89c>] sysfs_kf_write+0x3c/0x50
[ 1320.358284] [<ffffffff8129a403>] kernfs_fop_write+0x143/0x180
[ 1320.358288] [<ffffffff8121833b>] __vfs_write+0x1b/0x40
[ 1320.358291] [<ffffffff81218ad9>] vfs_write+0xa9/0x1a0
[ 1320.358295] [<ffffffff812197bc>] SyS_write+0x5c/0xe0
[ 1320.358297] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
This failure causes the subsequent cpuset_memory test to fail and interrupts the test run. |
|
2023-07-12 07:03:24 |
Po-Hsu Lin |
description |
Issue found on X-aws 4.4.0-1158.173
The cpuset_hotplug test times out after 45 minutes on specific VMs (it passes on AWS bare-metal instances) because of a task hang. It failed on these instance types:
* c4.large
* c5n.large
* m5a.large
* r5.large
* t3.medium
Passed on these VMs:
* c3.xlarge
* t3a.2xlarge
* x1e.xlarge
Test output:
<<<test_start>>>
tag=cpuset_hotplug stime=1688982088
cmdline="cpuset_hotplug_test.sh"
contacts=""
analysis=exit
<<<test_output>>>
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-1)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
This issue can also be reproduced with X-aws-1128. dmesg output:
[ 1083.816153] LTP: starting cpuset_hotplug (cpuset_hotplug_test.sh)
[ 1083.948273] Broke affinity for irq 4
[ 1083.948396] Broke affinity for irq 61
[ 1083.948402] Broke affinity for irq 62
[ 1083.948408] Broke affinity for irq 63
[ 1083.948413] Broke affinity for irq 64
[ 1083.949824] smpboot: CPU 1 is now offline
[ 1085.000502] installing Xen timer for CPU 1
[ 1085.020261] x86: Booting SMP configuration:
[ 1085.020263] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.021673] Skipped synchronization checks as TSC is reliable.
[ 1085.021693] cpu 1 spinlock event irq 59
[ 1085.069370] smpboot: CPU 1 is now offline
[ 1085.081932] installing Xen timer for CPU 1
[ 1085.100218] x86: Booting SMP configuration:
[ 1085.100220] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.101588] Skipped synchronization checks as TSC is reliable.
[ 1085.101608] cpu 1 spinlock event irq 59
[ 1086.405418] smpboot: CPU 1 is now offline
[ 1087.467625] installing Xen timer for CPU 1
[ 1087.484259] x86: Booting SMP configuration:
[ 1087.484261] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1087.485700] Skipped synchronization checks as TSC is reliable.
[ 1087.485721] cpu 1 spinlock event irq 59
[ 1320.308124] INFO: task systemd:1 blocked for more than 120 seconds.
[ 1320.311249] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.313959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.318032] systemd D ffff8800ea4f7d28 0 1 0 0x00000000
[ 1320.318037] ffff8800ea4f7d28 ffff8800372adb40 ffffffff81e13500 ffff8800ea4f8000
[ 1320.318042] ffff8800ea4f8000 ffffffff81e605e4 ffff8800ea4f8000 00000000ffffffff
[ 1320.318045] ffffffff81e605e8 ffff8800ea4f7d40 ffffffff81844555 ffffffff81e605e0
[ 1320.318048] Call Trace:
[ 1320.318057] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.318061] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.318065] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.318068] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.318074] [<ffffffff8111f1bc>] proc_cgroup_show+0x4c/0x2e0
[ 1320.318078] [<ffffffff81289ac5>] proc_single_show+0x55/0x90
[ 1320.318083] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.318088] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.318093] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.318096] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.318100] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.318104] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.321669] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.324364] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.328026] kworker/0:3 D ffff8800364cfc68 0 473 2 0x00000000
[ 1320.328031] Workqueue: cgroup_destroy css_killed_work_fn
[ 1320.328033] ffff8800364cfc68 ffff8800d3ff4cc0 ffff880036d94600 ffff880036f9f000
[ 1320.328035] ffff8800364d0000 ffffffff81e42fa4 ffff880036f9f000 00000000ffffffff
[ 1320.328037] ffffffff81e42fa8 ffff8800364cfc80 ffffffff81844555 ffffffff81e42fa0
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.328081] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328085] [<ffffffff81193a96>] __jump_label_update+0x76/0x90
[ 1320.328088] [<ffffffff81193b38>] jump_label_update+0x88/0x90
[ 1320.328092] [<ffffffff81193de1>] __static_key_slow_dec+0x41/0x90
[ 1320.328095] [<ffffffff81193e52>] static_key_slow_dec+0x22/0x60
[ 1320.328098] [<ffffffff8112263f>] cpuset_css_offline+0x3f/0x70
[ 1320.328102] [<ffffffff811195ac>] css_killed_work_fn+0x3c/0xd0
[ 1320.328106] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.328109] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.328116] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.328119] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.328122] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328128] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328131] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.328134] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328142] INFO: task irqbalance:1189 blocked for more than 120 seconds.
[ 1320.331605] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.334291] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.338224] irqbalance D ffff8800e8b07ce8 0 1189 1 0x00000000
[ 1320.338228] ffff8800e8b07ce8 ffff8800e8b07dd0 ffffffff81e13500 ffff8800369dc600
[ 1320.338232] ffff8800e8b08000 ffff8800eb64f7cc ffff8800369dc600 00000000ffffffff
[ 1320.338235] ffff8800eb64f7d0 ffff8800e8b07d00 ffffffff81844555 ffff8800eb64f7c8
[ 1320.338238] Call Trace:
[ 1320.338242] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.338249] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.338256] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.338259] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.338264] [<ffffffff81554500>] online_show+0x20/0x60
[ 1320.338269] [<ffffffff81554a63>] dev_attr_show+0x23/0x60
[ 1320.338272] [<ffffffff818467f2>] ? mutex_lock+0x12/0x30
[ 1320.338279] [<ffffffff8129aeaf>] sysfs_kf_seq_show+0xbf/0x140
[ 1320.338283] [<ffffffff812993a9>] kernfs_seq_show+0x29/0x30
[ 1320.338285] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.338289] [<ffffffff81299b5d>] kernfs_fop_read+0x10d/0x160
[ 1320.338292] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.338296] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.338299] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.338303] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.338306] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.338317] INFO: task kworker/0:2:1448 blocked for more than 120 seconds.
[ 1320.341877] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.344538] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.348296] kworker/0:2 D ffff8800bf097d38 0 1448 2 0x00000000
[ 1320.348305] Workqueue: events vmstat_shepherd
[ 1320.348307] ffff8800bf097d38 ffffffff8141c925 ffffffff81e13500 ffff88008aa32a00
[ 1320.348310] ffff8800bf098000 ffffffff81e42fa4 ffff88008aa32a00 00000000ffffffff
[ 1320.348313] ffffffff81e42fa8 ffff8800bf097d50 ffffffff81844555 ffffffff81e42fa0
[ 1320.348316] Call Trace:
[ 1320.348321] [<ffffffff8141c925>] ? find_next_bit+0x15/0x20
[ 1320.348329] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.348332] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.348336] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.348342] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.348345] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.348350] [<ffffffff811b65c1>] vmstat_shepherd+0x21/0xd0
[ 1320.348354] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.348362] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.348366] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.348369] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.348372] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.348375] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348378] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.348381] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.351463] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.354182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.358193] echo D ffff88008aaf3c28 0 1878 1761 0x00000000
[ 1320.358197] ffff88008aaf3c28 000000000000001b ffffffff81e13500 ffff880036a83800
[ 1320.358201] ffff88008aaf4000 ffffffff81e6aec4 ffff880036a83800 00000000ffffffff
[ 1320.358204] ffffffff81e6aec8 ffff88008aaf3c40 ffffffff81844555 ffffffff81e6aec0
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
[ 1320.358247] [<ffffffff810a659c>] notifier_call_chain+0x4c/0x70
[ 1320.358251] [<ffffffff810a66fe>] __raw_notifier_call_chain+0xe/0x10
[ 1320.358258] [<ffffffff81084913>] cpu_notify+0x23/0x40
[ 1320.358262] [<ffffffff810850c3>] _cpu_up+0x183/0x1b0
[ 1320.358265] [<ffffffff810851a2>] cpu_up+0xb2/0xd0
[ 1320.358268] [<ffffffff8155c5ed>] cpu_subsys_online+0x3d/0x90
[ 1320.358271] [<ffffffff81556f12>] device_online+0x62/0x90
[ 1320.358274] [<ffffffff81556fc9>] online_store+0x89/0xa0
[ 1320.358277] [<ffffffff81553e8b>] dev_attr_store+0x1b/0x30
[ 1320.358281] [<ffffffff8129a89c>] sysfs_kf_write+0x3c/0x50
[ 1320.358284] [<ffffffff8129a403>] kernfs_fop_write+0x143/0x180
[ 1320.358288] [<ffffffff8121833b>] __vfs_write+0x1b/0x40
[ 1320.358291] [<ffffffff81218ad9>] vfs_write+0xa9/0x1a0
[ 1320.358295] [<ffffffff812197bc>] SyS_write+0x5c/0xe0
[ 1320.358297] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
This failure causes the subsequent cpuset_memory test to fail as well, which interrupts the test run. |
Issue found on X-aws 4.4.0-1158.173
The cpuset_hotplug test hits the 45-minute timeout on specific VMs (it passes on AWS bare-metal instances) because of a task hang. It failed on these instances:
* c4.large
- xen / xen-hvm, 2 cpus
- fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm invpcid_single kaiser fsgsbase bmi1 avx2 smep bmi2 erms invpcid xsaveopt
* c5n.large
- kvm, 2 cpus
- fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single kaiser fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f rdseed adx smap clflushopt clwb avx512cd xsaveopt xsavec xgetbv1 ida arat pku
* m5a.large
- kvm, 2 cpus
- fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy cr8_legacy abm sse4a misalignsse 3dnowprefetch topoext vmmcall fsgsbase bmi1 avx2 smep bmi2 rdseed adx smap clflushopt sha_ni xsaveopt xsavec xgetbv1 clzero arat npt nrip_save
* r5.large
- kvm, 2 cpus
- fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single kaiser fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f rdseed adx smap clflushopt clwb avx512cd xsaveopt xsavec xgetbv1 ida arat pku
* t3.medium
- kvm, 2 cpus
- fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single kaiser fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f rdseed adx smap clflushopt clwb avx512cd xsaveopt xsavec xgetbv1 ida arat pku
Passed on these VMs:
* c3.xlarge
- xen / xen-hvm, 4 cpus
- fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx rdtscp lm constant_tsc rep_good nopl xtopology pni pclmulqdq ssse3 cx16 pcid sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm kaiser fsgsbase smep erms xsaveopt
* t3a.2xlarge
- kvm, 8 cpus
- fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid tsc_known_freq pni pclmulqdq ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy cr8_legacy abm sse4a misalignsse 3dnowprefetch topoext vmmcall fsgsbase bmi1 avx2 smep bmi2 rdseed adx smap clflushopt sha_ni xsaveopt xsavec xgetbv1 clzero arat npt nrip_save
* x1e.xlarge
- xen / xen-hvm, 4 cpus
- fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc aperfmperf pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm invpcid_single kaiser fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm xsaveopt
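The two instance lists above can be compared mechanically. The following is a minimal sketch, not part of the original report: the tuples are copied from the lists above (CPU flags omitted for brevity), and the helper name `summarize` is hypothetical. It only shows which attributes differ between the failing and passing groups in this small sample; it does not establish a cause.

```python
# (instance type, hypervisor, CPU count), copied from the lists above.
# "xen" stands for the "xen / xen-hvm" entries in the report.
FAILED = [
    ("c4.large", "xen", 2),
    ("c5n.large", "kvm", 2),
    ("m5a.large", "kvm", 2),
    ("r5.large", "kvm", 2),
    ("t3.medium", "kvm", 2),
]
PASSED = [
    ("c3.xlarge", "xen", 4),
    ("t3a.2xlarge", "kvm", 8),
    ("x1e.xlarge", "xen", 4),
]


def summarize(rows):
    """Return the distinct hypervisors and CPU counts seen in a group."""
    return {
        "hypervisors": sorted({hyp for _, hyp, _ in rows}),
        "cpus": sorted({cpus for _, _, cpus in rows}),
    }


if __name__ == "__main__":
    print("failed:", summarize(FAILED))
    print("passed:", summarize(PASSED))
```

In this sample both groups contain xen and kvm guests, while every failing instance has 2 CPUs and every passing instance has 4 or more. That may be worth checking on further instance types before drawing conclusions.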
Test output:
<<<test_start>>>
tag=cpuset_hotplug stime=1688982088
cmdline="cpuset_hotplug_test.sh"
contacts=""
analysis=exit
<<<test_output>>>
cpuset_hotplug 1 TINFO: CPUs are numbered continuously starting at 0 (0-1)
cpuset_hotplug 1 TINFO: Nodes are numbered continuously starting at 0 (0)
cpuset_hotplug 1 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 3 TPASS: Cpuset vs CPU hotplug test succeeded.
cpuset_hotplug 5 TPASS: Cpuset vs CPU hotplug test succeeded.
This issue can be reproduced with X-aws-1128 (4.4.0-1128-aws). dmesg output:
[ 1083.816153] LTP: starting cpuset_hotplug (cpuset_hotplug_test.sh)
[ 1083.948273] Broke affinity for irq 4
[ 1083.948396] Broke affinity for irq 61
[ 1083.948402] Broke affinity for irq 62
[ 1083.948408] Broke affinity for irq 63
[ 1083.948413] Broke affinity for irq 64
[ 1083.949824] smpboot: CPU 1 is now offline
[ 1085.000502] installing Xen timer for CPU 1
[ 1085.020261] x86: Booting SMP configuration:
[ 1085.020263] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.021673] Skipped synchronization checks as TSC is reliable.
[ 1085.021693] cpu 1 spinlock event irq 59
[ 1085.069370] smpboot: CPU 1 is now offline
[ 1085.081932] installing Xen timer for CPU 1
[ 1085.100218] x86: Booting SMP configuration:
[ 1085.100220] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1085.101588] Skipped synchronization checks as TSC is reliable.
[ 1085.101608] cpu 1 spinlock event irq 59
[ 1086.405418] smpboot: CPU 1 is now offline
[ 1087.467625] installing Xen timer for CPU 1
[ 1087.484259] x86: Booting SMP configuration:
[ 1087.484261] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 1087.485700] Skipped synchronization checks as TSC is reliable.
[ 1087.485721] cpu 1 spinlock event irq 59
[ 1320.308124] INFO: task systemd:1 blocked for more than 120 seconds.
[ 1320.311249] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.313959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.318032] systemd D ffff8800ea4f7d28 0 1 0 0x00000000
[ 1320.318037] ffff8800ea4f7d28 ffff8800372adb40 ffffffff81e13500 ffff8800ea4f8000
[ 1320.318042] ffff8800ea4f8000 ffffffff81e605e4 ffff8800ea4f8000 00000000ffffffff
[ 1320.318045] ffffffff81e605e8 ffff8800ea4f7d40 ffffffff81844555 ffffffff81e605e0
[ 1320.318048] Call Trace:
[ 1320.318057] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.318061] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.318065] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.318068] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.318074] [<ffffffff8111f1bc>] proc_cgroup_show+0x4c/0x2e0
[ 1320.318078] [<ffffffff81289ac5>] proc_single_show+0x55/0x90
[ 1320.318083] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.318088] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.318093] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.318096] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.318100] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.318104] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.321669] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.324364] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.328026] kworker/0:3 D ffff8800364cfc68 0 473 2 0x00000000
[ 1320.328031] Workqueue: cgroup_destroy css_killed_work_fn
[ 1320.328033] ffff8800364cfc68 ffff8800d3ff4cc0 ffff880036d94600 ffff880036f9f000
[ 1320.328035] ffff8800364d0000 ffffffff81e42fa4 ffff880036f9f000 00000000ffffffff
[ 1320.328037] ffffffff81e42fa8 ffff8800364cfc80 ffffffff81844555 ffffffff81e42fa0
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.328081] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328085] [<ffffffff81193a96>] __jump_label_update+0x76/0x90
[ 1320.328088] [<ffffffff81193b38>] jump_label_update+0x88/0x90
[ 1320.328092] [<ffffffff81193de1>] __static_key_slow_dec+0x41/0x90
[ 1320.328095] [<ffffffff81193e52>] static_key_slow_dec+0x22/0x60
[ 1320.328098] [<ffffffff8112263f>] cpuset_css_offline+0x3f/0x70
[ 1320.328102] [<ffffffff811195ac>] css_killed_work_fn+0x3c/0xd0
[ 1320.328106] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.328109] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.328116] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.328119] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.328122] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.328128] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328131] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.328134] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.328142] INFO: task irqbalance:1189 blocked for more than 120 seconds.
[ 1320.331605] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.334291] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.338224] irqbalance D ffff8800e8b07ce8 0 1189 1 0x00000000
[ 1320.338228] ffff8800e8b07ce8 ffff8800e8b07dd0 ffffffff81e13500 ffff8800369dc600
[ 1320.338232] ffff8800e8b08000 ffff8800eb64f7cc ffff8800369dc600 00000000ffffffff
[ 1320.338235] ffff8800eb64f7d0 ffff8800e8b07d00 ffffffff81844555 ffff8800eb64f7c8
[ 1320.338238] Call Trace:
[ 1320.338242] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.338249] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.338256] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.338259] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.338264] [<ffffffff81554500>] online_show+0x20/0x60
[ 1320.338269] [<ffffffff81554a63>] dev_attr_show+0x23/0x60
[ 1320.338272] [<ffffffff818467f2>] ? mutex_lock+0x12/0x30
[ 1320.338279] [<ffffffff8129aeaf>] sysfs_kf_seq_show+0xbf/0x140
[ 1320.338283] [<ffffffff812993a9>] kernfs_seq_show+0x29/0x30
[ 1320.338285] [<ffffffff8123e662>] seq_read+0xf2/0x3d0
[ 1320.338289] [<ffffffff81299b5d>] kernfs_fop_read+0x10d/0x160
[ 1320.338292] [<ffffffff81354d86>] ? security_file_permission+0xa6/0xc0
[ 1320.338296] [<ffffffff8121820b>] __vfs_read+0x1b/0x40
[ 1320.338299] [<ffffffff81218986>] vfs_read+0x86/0x130
[ 1320.338303] [<ffffffff812196dc>] SyS_read+0x5c/0xe0
[ 1320.338306] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
[ 1320.338317] INFO: task kworker/0:2:1448 blocked for more than 120 seconds.
[ 1320.341877] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.344538] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.348296] kworker/0:2 D ffff8800bf097d38 0 1448 2 0x00000000
[ 1320.348305] Workqueue: events vmstat_shepherd
[ 1320.348307] ffff8800bf097d38 ffffffff8141c925 ffffffff81e13500 ffff88008aa32a00
[ 1320.348310] ffff8800bf098000 ffffffff81e42fa4 ffff88008aa32a00 00000000ffffffff
[ 1320.348313] ffffffff81e42fa8 ffff8800bf097d50 ffffffff81844555 ffffffff81e42fa0
[ 1320.348316] Call Trace:
[ 1320.348321] [<ffffffff8141c925>] ? find_next_bit+0x15/0x20
[ 1320.348329] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.348332] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.348336] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.348342] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.348345] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.348350] [<ffffffff811b65c1>] vmstat_shepherd+0x21/0xd0
[ 1320.348354] [<ffffffff8109ec7b>] process_one_work+0x16b/0x4e0
[ 1320.348362] [<ffffffff8109f03e>] worker_thread+0x4e/0x580
[ 1320.348366] [<ffffffff8109eff0>] ? process_one_work+0x4e0/0x4e0
[ 1320.348369] [<ffffffff810a55e7>] kthread+0xe7/0x100
[ 1320.348372] [<ffffffff81844011>] ? __schedule+0x301/0x810
[ 1320.348375] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348378] [<ffffffff818494e5>] ret_from_fork+0x55/0x80
[ 1320.348381] [<ffffffff810a5500>] ? kthread_create_on_node+0x1e0/0x1e0
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.351463] Not tainted 4.4.0-1128-aws #142-Ubuntu
[ 1320.354182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1320.358193] echo D ffff88008aaf3c28 0 1878 1761 0x00000000
[ 1320.358197] ffff88008aaf3c28 000000000000001b ffffffff81e13500 ffff880036a83800
[ 1320.358201] ffff88008aaf4000 ffffffff81e6aec4 ffff880036a83800 00000000ffffffff
[ 1320.358204] ffffffff81e6aec8 ffff88008aaf3c40 ffffffff81844555 ffffffff81e6aec0
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
[ 1320.358247] [<ffffffff810a659c>] notifier_call_chain+0x4c/0x70
[ 1320.358251] [<ffffffff810a66fe>] __raw_notifier_call_chain+0xe/0x10
[ 1320.358258] [<ffffffff81084913>] cpu_notify+0x23/0x40
[ 1320.358262] [<ffffffff810850c3>] _cpu_up+0x183/0x1b0
[ 1320.358265] [<ffffffff810851a2>] cpu_up+0xb2/0xd0
[ 1320.358268] [<ffffffff8155c5ed>] cpu_subsys_online+0x3d/0x90
[ 1320.358271] [<ffffffff81556f12>] device_online+0x62/0x90
[ 1320.358274] [<ffffffff81556fc9>] online_store+0x89/0xa0
[ 1320.358277] [<ffffffff81553e8b>] dev_attr_store+0x1b/0x30
[ 1320.358281] [<ffffffff8129a89c>] sysfs_kf_write+0x3c/0x50
[ 1320.358284] [<ffffffff8129a403>] kernfs_fop_write+0x143/0x180
[ 1320.358288] [<ffffffff8121833b>] __vfs_write+0x1b/0x40
[ 1320.358291] [<ffffffff81218ad9>] vfs_write+0xa9/0x1a0
[ 1320.358295] [<ffffffff812197bc>] SyS_write+0x5c/0xe0
[ 1320.358297] [<ffffffff8184905b>] entry_SYSCALL_64_fastpath+0x22/0xd0
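Each hung task in the dmesg output above is waiting in mutex_lock. The frame listed directly after mutex_lock is the function that requested the lock, which can help when reasoning about which locks are involved. The following is a minimal sketch for extracting that caller per task, assuming the 4.4-style trace format shown above. The function name `mutex_callers` and the shortened `DMESG` excerpt (two of the five tasks) are illustrative only.

```python
import re

# Shortened excerpt of the hung-task report above.
DMESG = """\
[ 1320.318118] INFO: task kworker/0:3:473 blocked for more than 120 seconds.
[ 1320.328039] Call Trace:
[ 1320.328043] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.328045] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.328047] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.328052] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.328072] [<ffffffff8108474c>] get_online_cpus+0x2c/0x50
[ 1320.328078] [<ffffffff81033ace>] arch_jump_label_transform+0x2e/0x120
[ 1320.348386] INFO: task echo:1878 blocked for more than 120 seconds.
[ 1320.358207] Call Trace:
[ 1320.358212] [<ffffffff81844555>] schedule+0x35/0x80
[ 1320.358218] [<ffffffff818448ae>] schedule_preempt_disabled+0xe/0x10
[ 1320.358226] [<ffffffff81846767>] __mutex_lock_slowpath+0xb7/0x130
[ 1320.358230] [<ffffffff810ddf79>] ? vprintk_default+0x29/0x40
[ 1320.358233] [<ffffffff818467ff>] mutex_lock+0x1f/0x30
[ 1320.358239] [<ffffffff81193d70>] static_key_slow_inc+0x70/0xa0
[ 1320.358243] [<ffffffff810ab19d>] sched_cpu_active+0xdd/0xf0
"""

TASK_RE = re.compile(r"INFO: task (.+):(\d+) blocked for more than")
# Group 1 is the "? " marker for unreliable frames, group 2 the symbol name.
FRAME_RE = re.compile(r"\[<[0-9a-f]+>\] (\? )?([\w.]+)\+0x")


def mutex_callers(dmesg):
    """Map each hung task to the function that called mutex_lock."""
    result = {}
    task = None
    saw_mutex_lock = False
    for line in dmesg.splitlines():
        m = TASK_RE.search(line)
        if m:
            task = f"{m.group(1)}:{m.group(2)}"
            saw_mutex_lock = False
            continue
        m = FRAME_RE.search(line)
        if not m or task is None or m.group(1):
            continue  # not a frame, no task yet, or an unreliable "?" frame
        func = m.group(2)
        if saw_mutex_lock and task not in result:
            result[task] = func
        saw_mutex_lock = func == "mutex_lock"
    return result


if __name__ == "__main__":
    for name, caller in mutex_callers(DMESG).items():
        print(f"{name}: mutex_lock called from {caller}")
```

Run against the full log, this gives one lock requester per blocked task. It does not show which task currently holds each mutex, so it is only a starting point for the analysis.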
This failure causes the subsequent cpuset_memory test to fail as well, which interrupts the test run. |
|