nfs4/krb5 mounts hang on server kernel 2.6.32-27-generic
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Linux |
Confirmed
|
High
|
|||
linux (Ubuntu) |
Incomplete
|
Undecided
|
Unassigned | ||
Lucid |
Won't Fix
|
Undecided
|
Unassigned |
Bug Description
After upgrading my LTS (Lucid) NFS server from 2.6.32-22-generic to 2.6.32-27-generic, kerberized NFS4 mounts hang on the clients (2.6.35-24-generic) while trying to mount. I have booted my server back and forth between the two kernels and this is 100% reproducible.
On the client, here is the stack trace of the hanging mount:
[83664.144250] SysRq : Show Blocked State
[83664.148028] task PC stack pid father
[83664.148038] mount.nfs4 D e2fbfc38 0 20082 20081 0x00000000
[83664.148038] e2fbfc48 00000086 00000002 e2fbfc38 e2fbfc7c c05d99e0 c08c4700 c08c4700
[83664.148038] a6a99df3 00004bfe c08c4700 c08c4700 a6a88065 00004bfe 00000000 c08c4700
[83664.148038] c08c4700 f6ae6580 00000001 e2fbfc7c 00000000 e2fbfc84 e2fbfc50 f823dc0c
[83664.148038] Call Trace:
[83664.148038] [<f823dc0c>] rpc_wait_
[83664.148038] [<c05c823d>] __wait_
[83664.148038] [<f823dbf0>] ? rpc_wait_
[83664.202534] [<f823dbf0>] ? rpc_wait_
[83664.202534] [<c05c830b>] out_of_
[83664.202534] [<c0165f10>] ? wake_bit_
[83664.202534] [<f823e31b>] __rpc_execute+
[83664.202534] [<f823da17>] ? rpc_init_
[83664.202534] [<c0230bef>] ? mntput_
[83664.202534] [<f823e4fe>] rpc_execute+
[83664.202534] [<f82379af>] rpc_run_
[83664.202534] [<f8237abe>] rpc_call_
[83664.202534] [<f83966b2>] _nfs4_call_
[83664.202534] [<f8394795>] nfs4_proc_
[83664.202534] [<f837e6f8>] nfs4_get_
[83664.202534] [<f8380a33>] ? nfs_alloc_
[83664.202534] [<f8378f19>] ? nfs4_init_
[83664.202534] [<f83784b4>] nfs4_server_
[83664.202534] [<f8379062>] nfs4_create_
[83664.202534] [<f83834eb>] nfs4_remote_
[83664.202534] [<c020fadf>] ? __alloc_
[83664.202534] [<c0231629>] ? alloc_vfsmnt+
[83664.202534] [<c021b354>] vfs_kern_
[83664.202534] [<f83848b9>] nfs_do_
[83664.202534] [<f83849bf>] nfs4_try_
[83664.202534] [<f8384ca1>] ? nfs_alloc_
[83664.202534] [<f8384d50>] nfs4_get_
[83664.202534] [<c0231629>] ? alloc_vfsmnt+
[83664.202534] [<c021b354>] vfs_kern_
[83664.202534] [<c022f9b3>] ? get_fs_
[83664.202534] [<c021b4fe>] do_kern_
[83664.202534] [<c0232b2c>] do_mount+
[83664.202534] [<c0232bdb>] sys_mount+0x6b/0xa0
[83664.202534] [<c05c9cc4>] syscall_
[83664.202534] Sched Debug Version: v0.09, 2.6.35-24-generic #42-Ubuntu
[83664.202534] now at 83664203.297770 msecs
[83664.202534] .jiffies : 20841050
[83664.202534] .sysctl_
[83664.202534] .sysctl_
[83664.202534] .sysctl_
[83664.202534] .sysctl_
[83664.202534] .sysctl_
[83664.202534] .sysctl_
[83664.202534]
[83664.202534] cpu#0, 2104.451 MHz
[83664.202534] .nr_running : 1
[83664.202534] .load : 1024
[83664.202534] .nr_switches : 90454305
[83664.202534] .nr_load_updates : 12864728
[83664.202534] .nr_uninterruptible : 1
[83664.202534] .next_balance : 20.841083
[83664.202534] .curr->pid : 9
[83664.202534] .clock : 83664203.054506
[83664.202534] .cpu_load[0] : 0
[83664.202534] .cpu_load[1] : 721
[83664.202534] .cpu_load[2] : 720
[83664.202534] .cpu_load[3] : 539
[83664.202534] .cpu_load[4] : 374
[83664.202534] .yld_count : 333029
[83664.202534] .sched_switch : 0
[83664.202534] .sched_count : 93158766
[83664.202534] .sched_goidle : 34377602
[83664.202534] .avg_idle : 1000000
[83664.202534] .ttwu_count : 51632626
[83664.202534] .ttwu_local : 35602671
[83664.202534] .bkl_count : 277652
[83664.202534]
[83664.202534] cfs_rq[0]:/
[83664.202534] .exec_clock : 28648774.386039
[83664.202534] .MIN_vruntime : 0.000001
[83664.202534] .min_vruntime : 84508846.663281
[83664.202534] .max_vruntime : 0.000001
[83664.202534] .spread : 0.000000
[83664.202534] .spread0 : 0.000000
[83664.202534] .nr_running : 1
[83664.202534] .load : 1024
[83664.202534] .nr_spread_over : 168463
[83664.202534] .shares : 0
[83664.202534]
[83664.202534] rt_rq[0]:/
[83664.202534] .rt_nr_running : 0
[83664.202534] .rt_throttled : 0
[83664.202534] .rt_time : 0.000000
[83664.202534] .rt_runtime : 900.000000
[83664.202534]
[83664.202534] runnable tasks:
[83664.202534] task PID tree-key switches prio exec-runtime sum-exec sum-sleep
[83664.202534] -------
[83664.202534] R events/0 9 84508840.663281 1513547 120 84508840.663281 24835.299583 83493949.900215 /
[83664.202534]
[83664.202534] cpu#1, 2104.451 MHz
[83664.202534] .nr_running : 1
[83664.202534] .load : 1024
[83664.202534] .nr_switches : 90730537
[83664.202534] .nr_load_updates : 12668068
[83664.202534] .nr_uninterruptible : 1
[83664.202534] .next_balance : 20.841046
[83664.202534] .curr->pid : 20262
[83664.202534] .clock : 83664144.033645
[83664.202534] .cpu_load[0] : 0
[83664.202534] .cpu_load[1] : 512
[83664.202534] .cpu_load[2] : 768
[83664.202534] .cpu_load[3] : 896
[83664.202534] .cpu_load[4] : 960
[83664.202534] .yld_count : 376091
[83664.202534] .sched_switch : 0
[83664.202534] .sched_count : 93423215
[83664.202534] .sched_goidle : 33055762
[83664.202534] .avg_idle : 481502
[83664.202534] .ttwu_count : 52337101
[83664.202534] .ttwu_local : 39223807
[83664.202534] .bkl_count : 287227
[83664.202534]
[83664.202534] cfs_rq[1]:/
[83664.202534] .exec_clock : 29027334.434850
[83664.202534] .MIN_vruntime : 0.000001
[83664.202534] .min_vruntime : 77328668.881570
[83664.202534] .max_vruntime : 0.000001
[83664.202534] .spread : 0.000000
[83664.202534] .spread0 : -7180177.781711
[83664.202534] .nr_running : 1
[83664.202534] .load : 1024
[83664.202534] .nr_spread_over : 155060
[83664.202534] .shares : 0
[83664.202534]
[83664.202534] rt_rq[1]:/
[83664.202534] .rt_nr_running : 0
[83664.202534] .rt_throttled : 0
[83664.202534] .rt_time : 0.000000
[83664.202534] .rt_runtime : 1000.000000
[83664.202534]
[83664.202534] runnable tasks:
[83664.202534] task PID tree-key switches prio exec-runtime sum-exec sum-sleep
[83664.202534] -------
[83664.202534] R bash 20262 77328663.096743 180 120 77328663.096743 96.589960 22615.112807 /
[83664.202534]
The mount command on the client:
brian@pc:~$ sudo mount -t nfs4 -o sec=krb5 linux:/usr/local /mnt/test/
And the export entry on the server:
/usr/local pc(sec=
The client doing the mount (i.e. above command) is "pc".
As I say, I can quite easily resolve this problem by simply going back to the -22 kernel.
ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-
Regression: Yes
Reproducible: Yes
ProcVersionSign
Uname: Linux 2.6.32-27-generic i686
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
Architecture: i386
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/dsp', '/dev/snd/by-path', '/dev/snd/
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
Card hw:0 'SB'/'HDA ATI SB at 0xfe020000 irq 16'
Mixer name : 'Realtek ALC883'
Components : 'HDA:10ec0883,
Controls : 37
Simple ctrls : 20
Date: Thu Jan 13 09:35:05 2011
IwConfig:
lo no wireless extensions.
eth0 no wireless extensions.
MachineType: System manufacturer System Product Name
ProcCmdLine: BOOT_IMAGE=
ProcEnviron:
LANG=en_CA.UTF-8
SHELL=/bin/bash
RelatedPackageV
RfKill:
SourcePackage: linux
StagingDrivers: echo
Title: [STAGING]
dmi.bios.date: 03/28/2008
dmi.bios.vendor: Phoenix Technologies, LTD
dmi.bios.version: ASUS M2A-VM ACPI BIOS Revision 1705
dmi.board.name: M2A-VM
dmi.board.vendor: ASUSTeK Computer INC.
dmi.board.version: 1.XX
dmi.chassis.
dmi.chassis.type: 3
dmi.chassis.vendor: Chassis Manufacture
dmi.chassis.
dmi.modalias: dmi:bvnPhoenixT
dmi.product.name: System Product Name
dmi.product.
dmi.sys.vendor: System manufacturer
Changed in linux: | |
importance: | Unknown → High |
status: | Unknown → Confirmed |
Changed in linux (Ubuntu Lucid): | |
milestone: | none → lucid-updates |
Changed in linux (Ubuntu): | |
status: | New → Confirmed |
Is there any reason this bug has not even been triaged yet?
This also happens in the Maverick kernel, and I'm willing to bet it happens in the Natty kernel as well.