radeon 0000:03:00.0: ring 0 stalled for more than 10000msec

Bug #1707695 reported by Christopher Clapp
This bug affects 14 people
Affects           Status      Importance   Assigned to   Milestone
Linux             Won't Fix   Medium
linux (Ubuntu)    Confirmed   Medium       Unassigned

Bug Description

My system locks up intermittently, every few days. The lockups do not seem to correspond to a particular program or action. When one happens, the screen goes black, then returns after a few seconds. At that point the on-screen clock stops updating, which indicates that the display is frozen, but audio continues to play for 30 seconds to a minute. The keyboard and mouse have no effect on what is displayed, but I can restart with Alt+SysRq+REISUB.

Every time (7 crashes and counting), /var/log/kern.log indicates that

kernel: [353692.378886] radeon 0000:03:00.0: ring 0 stalled for more than 10280msec
kernel: [353692.378896] radeon 0000:03:00.0: GPU lockup (current fence id 0x00000000006e96e5 last fence id 0x00000000006e96e9 on ring 0)

just before the system locks. Different ring numbers are reported as stalling at different times.
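
Since different rings are reported as stalling at different times, it may help to tally the stall messages per device and ring. The following is a minimal sketch, not part of the original report: the names `STALL_RE` and `tally_ring_stalls` are hypothetical, and the pattern assumes the message format shown in the two log lines above.

```python
import re
from collections import Counter

# Assumed format, copied from the report:
# "radeon 0000:03:00.0: ring 0 stalled for more than 10280msec"
STALL_RE = re.compile(
    r"radeon (?P<dev>[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-9a-f]): "
    r"ring (?P<ring>\d+) stalled for more than (?P<msec>\d+)msec"
)


def tally_ring_stalls(lines):
    """Count stall messages per (PCI device, ring number)."""
    counts = Counter()
    for line in lines:
        match = STALL_RE.search(line)
        if match:
            counts[(match.group("dev"), int(match.group("ring")))] += 1
    return counts


# The two kern.log lines quoted in the report; only the first is a stall message.
sample = [
    "kernel: [353692.378886] radeon 0000:03:00.0: ring 0 stalled for more than 10280msec",
    "kernel: [353692.378896] radeon 0000:03:00.0: GPU lockup (current fence id "
    "0x00000000006e96e5 last fence id 0x00000000006e96e9 on ring 0)",
]
counts = tally_ring_stalls(sample)
for (dev, ring), n in sorted(counts.items()):
    print(f"{dev} ring {ring}: {n} stall message(s)")
```

To run it against a real log, pass an open file object (for example `/var/log/kern.log`, the path named in the report) to `tally_ring_stalls` instead of the sample list.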

As per the instructions, I posted a question about this issue at:

https://askubuntu.com/questions/937792/system-lockup-radeon-ring-stalled-for-more-than-10280msec?noredirect=1#comment1486875_937792

but I have not received many responses.

I'm running 16.04 LTS.
:~$ lsb_release -rd
Description: Ubuntu 16.04.2 LTS
Release: 16.04

According to the Software Center, I am running Ubuntu Software 3.20.1.

I have a Radeon HD graphics card.

:~$ lspci | grep VGA
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Cedar [Radeon HD 5000/6000/7350/8350 Series]

:~$ lspci -v -s 03:00.0
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Cedar [Radeon HD 5000/6000/7350/8350 Series] (prog-if 00 [VGA controller])
Subsystem: Gigabyte Technology Co., Ltd Cedar [Radeon HD 5000/6000/7350/8350 Series]
Flags: bus master, fast devsel, latency 0, IRQ 53
Memory at c0000000 (64-bit, prefetchable) [size=256M]
Memory at d3d20000 (64-bit, non-prefetchable) [size=128K]
I/O ports at 7000 [size=256]
Expansion ROM at d3d00000 [disabled] [size=128K]
Capabilities: <access denied>
Kernel driver in use: radeon
Kernel modules: radeon

Please let me know if additional information would be useful, and I will be glad to provide it. Thanks in advance.

ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: linux-image-4.4.0-87-generic 4.4.0-87.110
ProcVersionSignature: Ubuntu 4.4.0-87.110-generic 4.4.73
Uname: Linux 4.4.0-87-generic x86_64
ApportVersion: 2.20.1-0ubuntu2.10
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: clapp 3889 F.... pulseaudio
 /dev/snd/controlC2: clapp 3889 F.... pulseaudio
 /dev/snd/controlC1: clapp 3889 F.... pulseaudio
CurrentDesktop: Unity
Date: Mon Jul 31 14:02:27 2017
HibernationDevice: RESUME=UUID=1b30dad6-8c48-42cf-ae26-66ab8b0eb446
InstallationDate: Installed on 2013-09-04 (1425 days ago)
InstallationMedia: Ubuntu 11.10 "Oneiric Ocelot" - Release amd64 (20111012)
MachineType: Dell Inc. Precision T7600
ProcFB: 0 radeondrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.4.0-87-generic root=UUID=33722f76-37fd-4289-b071-d7f48b65dd32 ro quiet splash crashkernel=384M-:128M vt.handoff=7
RelatedPackageVersions:
 linux-restricted-modules-4.4.0-87-generic N/A
 linux-backports-modules-4.4.0-87-generic N/A
 linux-firmware 1.157.11
RfKill:
 0: hci0: Bluetooth
  Soft blocked: no
  Hard blocked: no
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 05/03/2013
dmi.bios.vendor: Dell Inc.
dmi.bios.version: A07
dmi.board.name: 082WXT
dmi.board.vendor: Dell Inc.
dmi.board.version: A01
dmi.chassis.type: 7
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.:bvrA07:bd05/03/2013:svnDellInc.:pnPrecisionT7600:pvr01:rvnDellInc.:rn082WXT:rvrA01:cvnDellInc.:ct7:cvr:
dmi.product.name: Precision T7600
dmi.product.version: 01
dmi.sys.vendor: Dell Inc.

Revision history for this message
Christopher Clapp (christclapp) wrote :
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.13 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.13-rc4

Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
vinibali (vinibali) wrote :

Hello there!
I experienced exactly the same problem with an older Sapphire Radeon HD5450.
The system is a freshly installed Xubuntu 16.04.3 LTS.
I just installed the xserver-xorg-video-ati-lts-xenial package, but this didn't resolve the problem.
Currently the amd64 4.10.0-30-generic kernel is installed, on an older Asus 775 motherboard.
Do we have any other ideas?
I don't think the mainline kernel will solve anything, since both 4.4 and 4.10 seem to be affected. I think it's more likely a DRM/Mesa/Xorg-related bug.
If we can't figure anything out I will move to 17.04, although I'd prefer to stay on the LTS release.
Thank you

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
tags: added: kernel-bug-exists-upstream
Christopher Clapp (christclapp) wrote :

The problem reoccurred while running what was the most recent mainline kernel when I downloaded it (v4.13/ 2017-09-03 22:30). The log file is attached.

Thanks for your help, jsalisbury.

Kai-Heng Feng (kaihengfeng) wrote :

Please file a bug report at https://bugs.freedesktop.org/

In , Christopher Clapp (christclapp) wrote :

My system locks up intermittently, every few days. The lockups do not seem to correspond to a particular program or action. When one happens, the screen goes black, then returns after a few seconds. At that point the on-screen clock stops updating, which indicates that the display is frozen, but audio continues to play for 30 seconds to a minute. The keyboard and mouse have no effect on what is displayed, but I can restart with Alt+SysRq+REISUB.

Every time (12 crashes and counting), /var/log/kern.log indicates that

kernel: [353692.378886] radeon 0000:03:00.0: ring 0 stalled for more than 10280msec
kernel: [353692.378896] radeon 0000:03:00.0: GPU lockup (current fence id 0x00000000006e96e5 last fence id 0x00000000006e96e9 on ring 0)

just before the system locks. Different ring numbers are reported as stalling at different times.

I am posting at bugs.freedesktop.org at the suggestion of a poster from bugs.launchpad.net. For additional info, see:

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1707695


Christopher Clapp (christclapp) wrote :

kaihengfeng, I posted at freedesktop.org as you suggested. Thanks!

See: https://bugs.freedesktop.org/show_bug.cgi?id=102909

Changed in linux:
importance: Unknown → Medium
status: Unknown → Confirmed
rinaldomerlo (rinaldomerlo) wrote :

This also happens to me, intermittently, on Dell PCs with pre-installed Ubuntu 16.04 (I have 6 of them and it will occasionally happen on any of them). I've attached the syslog for one occurrence.

vinibali (vinibali) wrote :

Hello guys!

I have an update for you. At first I had enabled the experimental GPU acceleration flags in Chromium. That period was really problematic: a local email provider's website caused a lot of lockups. Since we disabled the experimental flags in Chromium, the errors have gone away. You could try the same, or even disable the GPU features that are enabled by default.

I hope it helps, cheers.

Pander (pander) wrote :

In Ubuntu (Lubuntu) cosmic 18.10 with Linux 4.18.0-10-generic, this bug is worse than before. How can one disable the features causing this? For now I would prefer a stable screen without acceleration to a machine with GPU problems and multiple freezes per day.

tags: added: cosmic regression-release
Pander (pander) wrote :

$ lspci -k | grep -EA2 'VGA|3D'
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Hemlock [Radeon HD 5970]
        Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Hemlock [Radeon HD 5970]
        Kernel driver in use: radeon
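
Several of the workarounds discussed in this thread only apply when the `radeon` kernel driver is bound to the card, so it can be worth checking that mechanically. The following is a rough sketch for pulling the "Kernel driver in use" value out of `lspci -k` output; `gpu_drivers` is a hypothetical helper, and the parsing assumes the layout shown in the output above (an unindented device line followed by indented detail lines).

```python
import re


def gpu_drivers(lspci_k_output):
    """Map each VGA/3D controller's PCI slot to its kernel driver (None if not listed)."""
    drivers = {}
    slot = None
    for line in lspci_k_output.splitlines():
        if line and not line[0].isspace():
            # Unindented line starts a new device: "<slot> <class>: <name>"
            head = re.match(r"(\S+) (?:VGA compatible controller|3D controller):", line)
            slot = head.group(1) if head else None
            if slot is not None:
                drivers[slot] = None
        elif slot is not None and "Kernel driver in use:" in line:
            drivers[slot] = line.split(":", 1)[1].strip()
    return drivers


# Sample text copied from the lspci output in this comment.
sample = (
    "03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Hemlock [Radeon HD 5970]\n"
    "        Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Hemlock [Radeon HD 5970]\n"
    "        Kernel driver in use: radeon\n"
)
print(gpu_drivers(sample))
```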

Pander (pander) wrote :

I see a lot of different workarounds, but all for older versions of Ubuntu. What would be the best workaround for 18.10?

In , Pander (pander) wrote :
Pander (pander)
tags: added: kernel-bug
tags: added: disco
In , Pander (pander) wrote :

Christopher, are you still experiencing this bug?

In , Christopher Clapp (christclapp) wrote :

(In reply to Pander from comment #3)
> Christopher, are you still experiencing this bug?

Pander, the Radeon graphics card in my computer died, so I replaced it with an Nvidia card. I didn't experience the bug again after that.

In , Michel Dänzer (michel-daenzer) wrote :

Resolving per comment 4, thanks for the report and follow-ups.

Changed in linux:
status: Confirmed → Won't Fix
Craig McQueen (cmcqueen1975) wrote :

One does not simply close a bug report about a Radeon driver because Christopher went out and got an Nvidia card.

urnenfeld (urnenfeld) wrote :

I see the same symptoms as in the description with the following graphics card:

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] RV630 PRO [Radeon HD 2600 PRO AGP]

- IIRC, Ubuntu 16 did not show any problems.
- On Lubuntu 18.04 (kernel 4.15.0-88-generic) the problem shows up very seldom, in no identifiable situation. You can live with it.
- On Lubuntu 20.04 (kernel 5.4.0-39-generic) the problem is deterministically reproducible with certain actions: opening specific websites, or simply opening a video with VLC. You cannot live with it.

reproman (reproman) wrote :

No need to throw away those cards. They are serviceable.

The CentOS forum proposes a /etc/X11/xorg.conf.d/20-radeon.conf file. It appears to be a stable workaround for this bug, as observed on my 20.04.1 system with an inexpensive Dell Radeon graphics card.

Was something forgotten along the way as the basic Radeon driver installation settings were propagated? These cards were working in 18.04...

According to the post https://forums.centos.org/viewtopic.php?t=72792, the following option is the most relevant one. It sets a stable acceleration mode as a remedy for the "ring 0 stalled" kernel errors:

  Option "AccelMethod" "exa"
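
For readers who have not written an Xorg snippet before, here is a sketch of what a complete 20-radeon.conf might look like, rendered by a small helper. Only the file path and the `Option` line come from the comment above; the surrounding `Section "Device"` layout is the usual xorg.conf form, the `Identifier` value is an arbitrary label chosen for illustration, and `radeon_exa_fragment` is a hypothetical name.

```python
def radeon_exa_fragment(identifier="Radeon"):
    """Render a Device section that asks the radeon Xorg driver to use EXA."""
    lines = [
        'Section "Device"',
        f'    Identifier "{identifier}"',  # arbitrary label (assumption)
        '    Driver "radeon"',
        '    Option "AccelMethod" "exa"',  # the option quoted in the comment
        'EndSection',
    ]
    return "\n".join(lines) + "\n"


print(radeon_exa_fragment(), end="")
```

The printed text would be saved as /etc/X11/xorg.conf.d/20-radeon.conf (root is needed to write there), and the X session restarted for it to take effect. Whether EXA actually avoids the lockups on a given card is what the comment reports for one system, not a general guarantee.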

Nemanja V (vooxo) wrote :

Recently I'm also experiencing this on:

Ubuntu 20.20 (Linux 5.8.0-41-generic)
AMD® A10-5750m apu with radeon(tm) hd graphics × 4

This is an older system indeed, but still...

j spam (knowtrash2009) wrote :

This is still an ongoing problem.
It was working fine on Mint 18.3, then I upgraded (downgraded?) to Mint 20.3.
I have been having lockups and freezes; I can still ssh in for a while before the machine locks up completely, etc.
Very similar to the problems mentioned above.

Here is the basic error that happens:
[37664.592883] radeon 0000:01:00.0: ring 0 stalled for more than 14948msec
[37664.592889] radeon 0000:01:00.0: GPU lockup (current fence id 0x00000000003b1a11 last fence id 0x00000000003b1a26 on ring 0)

Here is the system info:
System:
Kernel: 5.4.0-109-generic x86_64 bits: 64 compiler: gcc v: 9.4.0 Desktop: Cinnamon 5.2.7 wm: muffin
dm: LightDM 1.30.0 Distro: Linux Mint 20.3 Una base: Ubuntu 20.04 focal
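
The two fence ids in the "GPU lockup" message can be compared to see how much work was still queued when the ring stalled. The sketch below assumes that "current fence id" is the last fence the GPU signalled and "last fence id" is the last one the driver emitted; that is my reading of the message and is not confirmed anywhere in this thread. `LOCKUP_RE` and `pending_fences` are hypothetical names.

```python
import re

# Assumed message format, copied from the log lines quoted in this thread.
LOCKUP_RE = re.compile(
    r"GPU lockup \(current fence id (0x[0-9a-f]+) "
    r"last fence id (0x[0-9a-f]+) on ring (\d+)\)"
)


def pending_fences(line):
    """Return (ring, last - current) for a lockup line, or None if it does not match."""
    match = LOCKUP_RE.search(line)
    if match is None:
        return None
    current = int(match.group(1), 16)
    last = int(match.group(2), 16)
    return int(match.group(3)), last - current


line = (
    "[37664.592889] radeon 0000:01:00.0: GPU lockup (current fence id "
    "0x00000000003b1a11 last fence id 0x00000000003b1a26 on ring 0)"
)
print(pending_fences(line))  # (0, 21)
```

Applied to the line from the original description (0x6e96e5 and 0x6e96e9), the same function gives a gap of 4 on ring 0.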
