kernel io hangs during mdcheck/resync
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned | ||
linux-signed-hwe-5.11 (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned | ||
linux-signed-hwe-5.15 (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned | ||
linux-signed-hwe-5.4 (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
It seems to always occur during an mdcheck/resync, if I am logged in via SSH it is still somewhat responsive and basic utilities like dmesg will work. But it apppears any write I/O will hang the terminal and nothing is written to syslog (presumably because it is blocked).
Below is output of dmesg and cat /proc/mdstat, it appears the data check was interrupted and /proc/mdstat still shows progress, and a whole slew of hung tasks including md1_resync itself.
[756484.534293] md: data-check of RAID array md0
[756484.628039] md: delaying data-check of md1 until md0 has finished (they share one or more physical units)
[756493.808773] md: md0: data-check done.
[756493.829760] md: data-check of RAID array md1
[778112.446410] md: md1: data-check interrupted.
[810654.608102] md: data-check of RAID array md1
[832291.201064] md: md1: data-check interrupted.
[899745.389485] md: data-check of RAID array md1
[921395.835305] md: md1: data-check interrupted.
[921588.558834] INFO: task systemd-journal:376 blocked for more than 120 seconds.
[921588.558846] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921588.558850] "echo 0 > /proc/sys/
[921588.558854] task:systemd-
[921588.558859] Call Trace:
[921588.558864] __schedule+
[921588.558872] schedule+0x4f/0xc0
[921588.558876] md_write_
[921588.558880] ? wait_woken+
[921588.558886] raid5_make_
[921588.558898] ? wait_woken+
[921588.558901] ? mempool_
[921588.558904] md_handle_
[921588.558907] ? __part_
[921588.558912] md_submit_
[921588.558915] submit_
[921588.558918] ? ext4_fc_
[921588.558922] submit_
[921588.558925] ? _cond_resched+
[921588.558928] ? kmem_cache_
[921588.558932] ? ext4_init_
[921588.558936] ext4_io_
[921588.558940] ext4_writepages
[921588.558944] do_writepages+
[921588.558948] ? do_writepages+
[921588.558951] ? fault_dirty_
[921588.558955] __filemap_
[921588.558960] file_write_
[921588.558962] ext4_sync_
[921588.558967] vfs_fsync_
[921588.558970] do_fsync+0x3d/0x70
[921588.558973] __x64_sys_
[921588.558976] do_syscall_
[921588.558980] entry_SYSCALL_
[921588.558984] RIP: 0033:0x7f4c97ee832b
[921588.558987] RSP: 002b:00007ffdce
[921588.558991] RAX: ffffffffffffffda RBX: 000055ced34b0fa0 RCX: 00007f4c97ee832b
[921588.558993] RDX: 00007f4c97fc8000 RSI: 000055ced3487b70 RDI: 0000000000000021
[921588.558995] RBP: 0000000000000001 R08: 0000000000000000 R09: 00007ffdceb29fa8
[921588.558996] R10: 00007f4c97d2c848 R11: 0000000000000293 R12: 00007ffdceb29fa8
[921588.558998] R13: 00007ffdceb29fa0 R14: 000055ced34b0fa0 R15: 000055ced34bcf90
[921588.559014] INFO: task mysqld:1505 blocked for more than 120 seconds.
[921588.559018] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921588.559022] "echo 0 > /proc/sys/
[921588.559025] task:mysqld state:D stack: 0 pid: 1505 ppid: 1 flags:0x00000000
[921588.559030] Call Trace:
[921588.559032] __schedule+
[921588.559036] schedule+0x4f/0xc0
[921588.559040] md_write_
[921588.559044] ? wait_woken+
[921588.559047] raid5_make_
[921588.559056] ? wait_woken+
[921588.559059] ? mempool_
[921588.559062] md_handle_
[921588.559065] ? __part_
[921588.559068] md_submit_
[921588.559071] submit_
[921588.559075] submit_
[921588.559077] ? _cond_resched+
[921588.559081] ? kmem_cache_
[921588.559084] ? ext4_init_
[921588.559088] ext4_io_
[921588.559091] ext4_writepages
[921588.559094] ? __schedule+
[921588.559097] ? hrtimer_
[921588.559100] ? timerqueue_
[921588.559105] ? futex_wait+
[921588.559109] do_writepages+
[921588.559112] ? do_writepages+
[921588.559115] ? futex_wake+
[921588.559118] __filemap_
[921588.559123] file_write_
[921588.559126] ext4_sync_
[921588.559129] vfs_fsync_
[921588.559132] ? __fget_
[921588.559136] do_fsync+0x3d/0x70
[921588.559139] __x64_sys_
[921588.559142] do_syscall_
[921588.559145] entry_SYSCALL_
[921588.559148] RIP: 0033:0x7f36188d48db
[921588.559150] RSP: 002b:00007f35f0
[921588.559154] RAX: ffffffffffffffda RBX: 00007f360502bf50 RCX: 00007f36188d48db
[921588.559156] RDX: 000d239200000000 RSI: 0000000000000000 RDI: 000000000000000a
[921588.559158] RBP: 00007f35f0548a00 R08: 0000000000000000 R09: 00007f360502beb0
[921588.559160] R10: 00007f35f0548ccf R11: 0000000000000293 R12: 00007f36042e9610
[921588.559162] R13: 000000000000000a R14: 0000000000000b00 R15: 0000000000000000
[921588.559175] INFO: task mdcheck:40625 blocked for more than 120 seconds.
[921588.559179] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921588.559182] "echo 0 > /proc/sys/
[921588.559185] task:mdcheck state:D stack: 0 pid:40625 ppid: 1 flags:0x00000000
[921588.559189] Call Trace:
[921588.559191] __schedule+
[921588.559196] schedule+0x4f/0xc0
[921588.559200] schedule_
[921588.559203] ? do_truncate+
[921588.559216] ? try_to_
[921588.559220] wait_for_
[921588.559225] kthread_
[921588.559229] md_unregister_
[921588.559232] md_reap_
[921588.559236] action_
[921588.559239] md_attr_
[921588.559243] sysfs_kf_
[921588.559246] kernfs_
[921588.559249] new_sync_
[921588.559254] vfs_write+
[921588.559257] ksys_write+
[921588.559260] __x64_sys_
[921588.559264] do_syscall_
[921588.559267] entry_SYSCALL_
[921588.559271] RIP: 0033:0x7f8c6d2791e7
[921588.559273] RSP: 002b:00007ffea5
[921588.559276] RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 00007f8c6d2791e7
[921588.559278] RDX: 0000000000000005 RSI: 000055b618a3eb40 RDI: 0000000000000001
[921588.559280] RBP: 000055b618a3eb40 R08: 000000000000000a R09: 0000000000000004
[921588.559282] R10: 000055b616e97017 R11: 0000000000000246 R12: 0000000000000005
[921588.559284] R13: 00007f8c6d3546a0 R14: 00007f8c6d3554a0 R15: 00007f8c6d3548a0
[921588.559287] INFO: task md1_resync:40644 blocked for more than 120 seconds.
[921588.559291] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921588.559294] "echo 0 > /proc/sys/
[921588.559297] task:md1_resync state:D stack: 0 pid:40644 ppid: 2 flags:0x00004000
[921588.559301] Call Trace:
[921588.559303] __schedule+
[921588.559308] schedule+0x4f/0xc0
[921588.559311] md_do_sync.
[921588.559315] ? wait_woken+
[921588.559319] ? kernel_
[921588.559323] md_thread+
[921588.559326] kthread+0x114/0x150
[921588.559329] ? md_start_
[921588.559332] ? kthread_
[921588.559335] ret_from_
[921588.559341] INFO: task kworker/u16:0:41409 blocked for more than 120 seconds.
[921588.559345] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921588.559348] "echo 0 > /proc/sys/
[921588.559353] task:kworker/u16:0 state:D stack: 0 pid:41409 ppid: 2 flags:0x00004000
[921588.559357] Workqueue: writeback wb_workfn (flush-253:1)
[921588.559361] Call Trace:
[921588.559363] __schedule+
[921588.559367] schedule+0x4f/0xc0
[921588.559371] md_write_
[921588.559375] ? wait_woken+
[921588.559378] raid5_make_
[921588.559387] ? wait_woken+
[921588.559390] ? mempool_
[921588.559393] md_handle_
[921588.559396] ? __part_
[921588.559399] md_submit_
[921588.559402] submit_
[921588.559405] submit_
[921588.559408] ? _cond_resched+
[921588.559412] ? kmem_cache_
[921588.559415] ? ext4_init_
[921588.559419] ext4_io_
[921588.559422] ext4_writepages
[921588.559425] ? update_
[921588.559430] do_writepages+
[921588.559433] ? do_writepages+
[921588.559436] __writeback_
[921588.559439] writeback_
[921588.559443] __writeback_
[921588.559446] wb_writeback+
[921588.559450] wb_workfn+
[921588.559453] ? __update_
[921588.559456] process_
[921588.559459] worker_
[921588.559462] kthread+0x114/0x150
[921588.559465] ? process_
[921588.559467] ? kthread_
[921588.559471] ret_from_
[921709.391056] INFO: task jbd2/dm-1-8:319 blocked for more than 120 seconds.
[921709.391068] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921709.391073] "echo 0 > /proc/sys/
[921709.391077] task:jbd2/dm-1-8 state:D stack: 0 pid: 319 ppid: 2 flags:0x00004000
[921709.391083] Call Trace:
[921709.391087] __schedule+
[921709.391094] schedule+0x4f/0xc0
[921709.391098] md_write_
[921709.391103] ? wait_woken+
[921709.391108] raid5_make_
[921709.391120] ? wait_woken+
[921709.391123] ? mempool_
[921709.391127] md_handle_
[921709.391130] ? __part_
[921709.391134] md_submit_
[921709.391137] submit_
[921709.391141] submit_
[921709.391144] ? bio_add_
[921709.391147] submit_
[921709.391151] submit_bh+0x13/0x20
[921709.391154] jbd2_journal_
[921709.391159] ? try_to_
[921709.391164] kjournald2+
[921709.391168] ? wait_woken+
[921709.391170] kthread+0x114/0x150
[921709.391174] ? commit_
[921709.391177] ? kthread_
[921709.391180] ret_from_
[921709.391186] INFO: task systemd-journal:376 blocked for more than 241 seconds.
[921709.391191] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921709.391195] "echo 0 > /proc/sys/
[921709.391199] task:systemd-
[921709.391204] Call Trace:
[921709.391205] __schedule+
[921709.391210] schedule+0x4f/0xc0
[921709.391214] md_write_
[921709.391218] ? wait_woken+
[921709.391221] raid5_make_
[921709.391230] ? wait_woken+
[921709.391233] ? mempool_
[921709.391236] md_handle_
[921709.391239] ? __part_
[921709.391242] md_submit_
[921709.391245] submit_
[921709.391248] ? ext4_fc_
[921709.391252] submit_
[921709.391255] ? _cond_resched+
[921709.391259] ? kmem_cache_
[921709.391262] ? ext4_init_
[921709.391266] ext4_io_
[921709.391270] ext4_writepages
[921709.391274] do_writepages+
[921709.391278] ? do_writepages+
[921709.391281] ? fault_dirty_
[921709.391285] __filemap_
[921709.391290] file_write_
[921709.391293] ext4_sync_
[921709.391297] vfs_fsync_
[921709.391300] do_fsync+0x3d/0x70
[921709.391303] __x64_sys_
[921709.391306] do_syscall_
[921709.391310] entry_SYSCALL_
[921709.391314] RIP: 0033:0x7f4c97ee832b
[921709.391317] RSP: 002b:00007ffdce
[921709.391321] RAX: ffffffffffffffda RBX: 000055ced34b0fa0 RCX: 00007f4c97ee832b
[921709.391323] RDX: 00007f4c97fc8000 RSI: 000055ced3487b70 RDI: 0000000000000021
[921709.391325] RBP: 0000000000000001 R08: 0000000000000000 R09: 00007ffdceb29fa8
[921709.391327] R10: 00007f4c97d2c848 R11: 0000000000000293 R12: 00007ffdceb29fa8
[921709.391329] R13: 00007ffdceb29fa0 R14: 000055ced34b0fa0 R15: 000055ced34bcf90
[921709.391343] INFO: task mysqld:1505 blocked for more than 241 seconds.
[921709.391348] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921709.391351] "echo 0 > /proc/sys/
[921709.391355] task:mysqld state:D stack: 0 pid: 1505 ppid: 1 flags:0x00000000
[921709.391359] Call Trace:
[921709.391361] __schedule+
[921709.391366] schedule+0x4f/0xc0
[921709.391370] md_write_
[921709.391373] ? wait_woken+
[921709.391376] raid5_make_
[921709.391386] ? wait_woken+
[921709.391388] ? mempool_
[921709.391391] md_handle_
[921709.391395] ? __part_
[921709.391398] md_submit_
[921709.391401] submit_
[921709.391404] submit_
[921709.391407] ? _cond_resched+
[921709.391411] ? kmem_cache_
[921709.391414] ? ext4_init_
[921709.391418] ext4_io_
[921709.391421] ext4_writepages
[921709.391423] ? __schedule+
[921709.391427] ? hrtimer_
[921709.391430] ? timerqueue_
[921709.391435] ? futex_wait+
[921709.391439] do_writepages+
[921709.391443] ? do_writepages+
[921709.391445] ? futex_wake+
[921709.391449] __filemap_
[921709.391453] file_write_
[921709.391456] ext4_sync_
[921709.391459] vfs_fsync_
[921709.391463] ? __fget_
[921709.391466] do_fsync+0x3d/0x70
[921709.391469] __x64_sys_
[921709.391472] do_syscall_
[921709.391475] entry_SYSCALL_
[921709.391478] RIP: 0033:0x7f36188d48db
[921709.391481] RSP: 002b:00007f35f0
[921709.391484] RAX: ffffffffffffffda RBX: 00007f360502bf50 RCX: 00007f36188d48db
[921709.391486] RDX: 000d239200000000 RSI: 0000000000000000 RDI: 000000000000000a
[921709.391488] RBP: 00007f35f0548a00 R08: 0000000000000000 R09: 00007f360502beb0
[921709.391490] R10: 00007f35f0548ccf R11: 0000000000000293 R12: 00007f36042e9610
[921709.391491] R13: 000000000000000a R14: 0000000000000b00 R15: 0000000000000000
[921709.391505] INFO: task mdcheck:40625 blocked for more than 241 seconds.
[921709.391510] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921709.391514] "echo 0 > /proc/sys/
[921709.391518] task:mdcheck state:D stack: 0 pid:40625 ppid: 1 flags:0x00000000
[921709.391522] Call Trace:
[921709.391524] __schedule+
[921709.391528] schedule+0x4f/0xc0
[921709.391532] schedule_
[921709.391535] ? do_truncate+
[921709.391538] ? try_to_
[921709.391542] wait_for_
[921709.391547] kthread_
[921709.391550] md_unregister_
[921709.391554] md_reap_
[921709.391557] action_
[921709.391561] md_attr_
[921709.391564] sysfs_kf_
[921709.391567] kernfs_
[921709.391571] new_sync_
[921709.391575] vfs_write+
[921709.391578] ksys_write+
[921709.391581] __x64_sys_
[921709.391584] do_syscall_
[921709.391588] entry_SYSCALL_
[921709.391591] RIP: 0033:0x7f8c6d2791e7
[921709.391593] RSP: 002b:00007ffea5
[921709.391597] RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 00007f8c6d2791e7
[921709.391599] RDX: 0000000000000005 RSI: 000055b618a3eb40 RDI: 0000000000000001
[921709.391600] RBP: 000055b618a3eb40 R08: 000000000000000a R09: 0000000000000004
[921709.391602] R10: 000055b616e97017 R11: 0000000000000246 R12: 0000000000000005
[921709.391604] R13: 00007f8c6d3546a0 R14: 00007f8c6d3554a0 R15: 00007f8c6d3548a0
[921709.391607] INFO: task md1_resync:40644 blocked for more than 241 seconds.
[921709.391611] Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[921709.391615] "echo 0 > /proc/sys/
[921709.391619] task:md1_resync state:D stack: 0 pid:40644 ppid: 2 flags:0x00004000
[921709.391623] Call Trace:
[921709.391625] __schedule+
[921709.391631] schedule+0x4f/0xc0
[921709.391635] md_do_sync.
[921709.391639] ? wait_woken+
[921709.391642] ? kernel_
[921709.391646] md_thread+
[921709.391650] kthread+0x114/0x150
[921709.391652] ? md_start_
[921709.391655] ? kthread_
[921709.391658] ret_from_
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
md1 : active raid5 sda3[0] sdd3[3] sdb3[1] sde3[5] sdc3[6]
15625582592 blocks super 1.2 level 5, 512k chunk, algorithm 2 [5/5] [UUUUU]
[
md0 : active raid1 sda2[0] sdb2[1] sdc2[2](S)
488128 blocks super 1.2 [2/2] [UU]
unused devices: <none>
ProblemType: Bug
DistroRelease: Ubuntu 20.04
Package: linux-image-
ProcVersionSign
Uname: Linux 5.11.0-27-generic x86_64
ApportVersion: 2.20.11-
Architecture: amd64
CasperMD5CheckR
Date: Tue Sep 7 16:35:01 2021
InstallationDate: Installed on 2014-03-12 (2736 days ago)
InstallationMedia: Ubuntu-Server 12.04.4 LTS "Precise Pangolin" - Release amd64 (20140204)
ProcEnviron:
TERM=xterm
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
SourcePackage: linux-signed-
UpgradeStatus: Upgraded to focal on 2020-12-20 (261 days ago)
---
ProblemType: Bug
AlsaVersion: Advanced Linux Sound Architecture Driver Version k5.13.0-16-generic.
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
ApportVersion: 2.20.11-0ubuntu69
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
AudioDevicesInUse: Error: [Errno 2] No such file or directory: 'fuser'
Card0.Amixer.info: Error: [Errno 2] No such file or directory: 'amixer'
Card0.Amixer.
CasperMD5CheckR
DistroRelease: Ubuntu 21.10
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
Lspci: Error: [Errno 2] No such file or directory: 'lspci'
Lspci-vt: Error: [Errno 2] No such file or directory: 'lspci'
Lsusb: Error: [Errno 2] No such file or directory: 'lsusb'
Lsusb-t: Error: [Errno 2] No such file or directory: 'lsusb'
Lsusb-v: Error: [Errno 2] No such file or directory: 'lsusb'
MachineType: innotek GmbH VirtualBox
Package: linux-signed-
PciMultimedia:
ProcEnviron:
TERM=linux
PATH=(custom, no user)
SHELL=/bin/bash
ProcFB: 0 svgadrmfb
ProcKernelCmdLine: BOOT_IMAGE=
ProcVersionSign
RelatedPackageV
linux-
linux-
linux-firmware 1.199
RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
Tags: impish
Uname: Linux 5.13.0-16-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups: N/A
_MarkForUpload: True
dmi.bios.date: 12/01/2006
dmi.bios.vendor: innotek GmbH
dmi.bios.version: VirtualBox
dmi.board.name: VirtualBox
dmi.board.vendor: Oracle Corporation
dmi.board.version: 1.2
dmi.chassis.type: 1
dmi.chassis.vendor: Oracle Corporation
dmi.modalias: dmi:bvninnotekG
dmi.product.family: Virtual Machine
dmi.product.name: VirtualBox
dmi.product.
dmi.sys.vendor: innotek GmbH
tags: | removed: hirsute |
Changed in linux (Ubuntu): | |
status: | Incomplete → Confirmed |
tags: | added: patch |
Hello Chad Wagner,
Thank you for reporting this issue. Could you please try installing the latest 20.04 HWE kernel and check whether the problem persists? The version currently in focal-updates is 5.11.0- 34.36~20. 04.1.