Frequent kernel oops related to nvidia / nv_drm_master_set
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux-restricted-modules (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned | ||
nvidia-graphics-drivers-460 (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
After upgrading yesterday from ubuntu 20 to 21.04 my machine now exhibits frequent involuntary restarts, related to a kernel oops of the form
Jun 3 10:48:32 zotac kernel: [ 20.464926] RIP: 0010:nv_
[...]
Jun 3 10:48:32 zotac kernel: [ 20.464958] Call Trace:
Jun 3 10:48:32 zotac kernel: [ 20.464964] drm_new_
Jun 3 10:48:32 zotac kernel: [ 20.464994] drm_master_
Jun 3 10:48:32 zotac kernel: [ 20.465017] drm_open+0xf8/0x250 [drm]
Jun 3 10:48:32 zotac kernel: [ 20.465044] drm_stub_
Jun 3 10:48:32 zotac kernel: [ 20.465070] chrdev_
Jun 3 10:48:32 zotac kernel: [ 20.465075] ? cdev_device_
Jun 3 10:48:32 zotac kernel: [ 20.465078] do_dentry_
Jun 3 10:48:32 zotac kernel: [ 20.465081] vfs_open+0x2d/0x30
Jun 3 10:48:32 zotac kernel: [ 20.465084] do_open+0x1c3/0x340
Jun 3 10:48:32 zotac kernel: [ 20.465087] path_openat+
Jun 3 10:48:32 zotac kernel: [ 20.465090] do_filp_
Jun 3 10:48:32 zotac kernel: [ 20.465093] ? __check_
Jun 3 10:48:32 zotac kernel: [ 20.465096] do_sys_
Jun 3 10:48:32 zotac kernel: [ 20.465099] __x64_sys_
Jun 3 10:48:32 zotac kernel: [ 20.465102] do_syscall_
Jun 3 10:48:32 zotac kernel: [ 20.465105] entry_SYSCALL_
Jun 3 10:48:32 zotac kernel: [ 20.465108] RIP: 0033:0x7fe6ea270954
ProblemType: Bug
DistroRelease: Ubuntu 21.04
Package: xorg 1:7.7+22ubuntu1
ProcVersionSign
Uname: Linux 5.11.0-18-generic x86_64
NonfreeKernelMo
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
NVRM version: NVIDIA UNIX x86_64 Kernel Module 460.80 Fri May 7 06:55:54 UTC 2021
GCC version: gcc version 10.3.0 (Ubuntu 10.3.0-1ubuntu1)
ApportVersion: 2.20.11-0ubuntu65.1
Architecture: amd64
CasperMD5CheckR
Date: Fri Jun 4 17:00:45 2021
DistUpgraded: Fresh install
DistroCodename: hirsute
DistroVariant: ubuntu
DkmsStatus:
nvidia, 460.80, 5.11.0-18-generic, x86_64: installed
nvidia, 460.80, 5.8.0-53-generic, x86_64: installed
ExtraDebuggingI
GraphicsCard:
NVIDIA Corporation GP106 [GeForce GTX 1060 3GB] [10de:1c02] (rev a1) (prog-if 00 [VGA controller])
Subsystem: ZOTAC International (MCO) Ltd. GP106 [GeForce GTX 1060 3GB] [19da:2438]
InstallationDate: Installed on 2020-05-09 (391 days ago)
InstallationMedia: Xubuntu 20.04 LTS "Focal Fossa" - Release amd64 (20200423)
MachineType: NA ZBOX-ER51070
ProcKernelCmdLine: BOOT_IMAGE=
SourcePackage: xorg
Symptom: display
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 04/25/2019
dmi.bios.release: 5.12
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 5.12
dmi.board.
dmi.board.name: ZBOX-ER51070
dmi.board.vendor: NA
dmi.board.version: Default string
dmi.chassis.
dmi.chassis.type: 3
dmi.chassis.vendor: Default string
dmi.chassis.
dmi.modalias: dmi:bvnAmerican
dmi.product.family: Default string
dmi.product.name: ZBOX-ER51070
dmi.product.sku: Default string
dmi.product.
dmi.sys.vendor: NA
version.compiz: compiz N/A
version.libdrm2: libdrm2 2.4.104-1build1
version.
version.
version.
version.
version.
version.
version.
version.
affects: | ubuntu → xorg (Ubuntu) |
tags: | added: nvidia |
affects: | xorg (Ubuntu) → nvidia-graphics-drivers-460 (Ubuntu) |
tags: | added: oem-priority originate-from-1939083 somerville |
no longer affects: | oem-priority |
tags: | removed: oem-priority originate-from-1939083 |
tags: | removed: somerville |
Changed in linux-restricted-modules (Ubuntu): | |
status: | New → Confirmed |
affects: | nvidia-graphics-drivers-470 (Ubuntu) → linux-restricted-modules (Ubuntu) |
I'm somewhat embarrassed to report that these frequent oopses which occured during the first day after upgrading Ubuntu have not reappeared. One possible explanation is that all these crashes triggered an automatic reboot without actually powering off the machine: could this possibly have left the graphic card in an unexpected state that persisted across soft-reboots and was responsible for these problems?
Anyway, after a proper hardware reset and reboot ten+ days ago these problems have not reappeared once. I would propose to close this issue as invalid/ wontfix/ nothing- to-see- here...