2018-02-09 09:46:56 |
Dr. Jens Harbott |
bug |
|
|
added bug |
2018-02-09 09:47:17 |
Dr. Jens Harbott |
attachment added |
|
bad-allnodes-4.13.0-33-16G-a.png https://bugs.launchpad.net/ubuntu/+source/linux-hwe/+bug/1748408/+attachment/5051741/+files/bad-allnodes-4.13.0-33-16G-a.png |
|
2018-02-09 09:52:38 |
Dr. Jens Harbott |
summary |
Servers going OOM |
Servers going OOM after updating kernel from 4.10 to 4.13 |
|
2018-02-09 12:22:38 |
Joseph Salisbury |
affects |
linux-hwe (Ubuntu) |
linux (Ubuntu) |
|
2018-02-09 12:23:03 |
Joseph Salisbury |
linux (Ubuntu): importance |
Undecided |
High |
|
2018-02-09 12:23:27 |
Joseph Salisbury |
nominated for series |
|
Ubuntu Artful |
|
2018-02-09 12:23:27 |
Joseph Salisbury |
bug task added |
|
linux (Ubuntu Artful) |
|
2018-02-09 12:23:37 |
Joseph Salisbury |
linux (Ubuntu Artful): importance |
Undecided |
Critical |
|
2018-02-09 12:24:12 |
Joseph Salisbury |
linux (Ubuntu Artful): importance |
Critical |
High |
|
2018-02-09 12:24:19 |
Joseph Salisbury |
linux (Ubuntu Artful): assignee |
|
Joseph Salisbury (jsalisbury) |
|
2018-02-09 12:24:24 |
Joseph Salisbury |
linux (Ubuntu): assignee |
|
Joseph Salisbury (jsalisbury) |
|
2018-02-09 12:25:26 |
Joseph Salisbury |
linux (Ubuntu Artful): status |
New |
In Progress |
|
2018-02-09 12:25:29 |
Joseph Salisbury |
linux (Ubuntu): status |
New |
Triaged |
|
2018-02-09 12:25:33 |
Joseph Salisbury |
linux (Ubuntu Artful): status |
In Progress |
Triaged |
|
2018-02-20 07:18:19 |
Dr. Jens Harbott |
tags |
amd64 apport-bug third-party-packages xenial |
amd64 apport-bug kernel-bug-exists-upstream third-party-packages xenial |
|
2018-02-20 14:08:04 |
Dimitri Pappas |
bug |
|
|
added subscriber Dimitri Pappas |
2018-02-26 08:38:48 |
Vivien GUEANT |
attachment added |
|
201802_nperf_memory-week.png https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1748408/+attachment/5063316/+files/201802_nperf_memory-week.png |
|
2018-02-26 08:40:53 |
Vivien GUEANT |
bug |
|
|
added subscriber Vivien GUEANT |
2018-03-08 16:37:31 |
Joseph Salisbury |
linux (Ubuntu): status |
Triaged |
In Progress |
|
2018-03-08 16:37:34 |
Joseph Salisbury |
linux (Ubuntu Artful): status |
Triaged |
In Progress |
|
2018-03-09 08:09:20 |
Joseph Salisbury |
description |
We are seeing this on multiple servers after upgrading from previous 4.10 series HWE kernels to the new 4.13 HWE series. With the new kernel, free memory decreases continuously at a high rate, and the servers start swapping and finally OOM-kill services within days. With the 4.10 kernel, free memory decreases more slowly and stabilizes after a while.
Latest kernel tested is linux-image-4.13.0-32-generic but the issue also affects older kernels from that series, tested back to linux-image-4.13.0-19-generic. No issue with linux-image-4.10.0-42-generic.
The servers are running as OpenStack controller nodes using either Ocata or Pike UCA plus ceph. See attached graph for the memory behaviour.
ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: linux-image-4.13.0-32-generic 4.13.0-32.35~16.04.1
ProcVersionSignature: Ubuntu 4.13.0-32.35~16.04.1-generic 4.13.13
Uname: Linux 4.13.0-32-generic x86_64
ApportVersion: 2.20.1-0ubuntu2.15
Architecture: amd64
Date: Fri Feb 9 09:45:50 2018
ProcEnviron:
 LANGUAGE=en_US:
 TERM=screen
 PATH=(custom, no user)
 LANG=en_US.utf8
 SHELL=/bin/bash
SourcePackage: linux-hwe
UpgradeStatus: No upgrade log present (probably fresh install) |
== SRU Justification ==
We are seeing this on multiple servers after upgrading from previous 4.10 series HWE kernels to the new 4.13 HWE series. With the new kernel, free memory decreases continuously at a high rate, and the servers start swapping and finally OOM-kill services within days. With the 4.10 kernel, free memory decreases more slowly and stabilizes after a while.
Latest kernel tested is linux-image-4.13.0-32-generic but the issue also affects older kernels from that series, tested back to linux-image-4.13.0-19-generic. No issue with linux-image-4.10.0-42-generic.
The servers are running as OpenStack controller nodes using either Ocata or Pike UCA plus ceph. See attached graph for the memory behaviour.
== Fix ==
2b9478ffc550 ("i40e: Fix memory leak related filter programming status")
62b4c6694dfd ("i40e: Add programming descriptors to cleaned_count")
== Regression Potential ==
Low. The changes are limited to the i40e driver and fix an existing regression.
== Test Case ==
A test kernel was built with these patches and tested by the original bug reporter.
The bug reporter states the test kernel resolved the bug.
ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: linux-image-4.13.0-32-generic 4.13.0-32.35~16.04.1
ProcVersionSignature: Ubuntu 4.13.0-32.35~16.04.1-generic 4.13.13
Uname: Linux 4.13.0-32-generic x86_64
ApportVersion: 2.20.1-0ubuntu2.15
Architecture: amd64
Date: Fri Feb 9 09:45:50 2018
ProcEnviron:
 LANGUAGE=en_US:
 TERM=screen
 PATH=(custom, no user)
 LANG=en_US.utf8
 SHELL=/bin/bash
SourcePackage: linux-hwe
UpgradeStatus: No upgrade log present (probably fresh install) |
|
2018-03-13 14:55:31 |
Kleber Sacilotto de Souza |
linux (Ubuntu Artful): status |
In Progress |
Fix Committed |
|
2018-03-19 10:58:33 |
Stefan Bader |
tags |
amd64 apport-bug kernel-bug-exists-upstream third-party-packages xenial |
amd64 apport-bug kernel-bug-exists-upstream third-party-packages verification-needed-artful xenial |
|
2018-03-19 13:12:27 |
Joseph Salisbury |
linux (Ubuntu): status |
In Progress |
Fix Committed |
|
2018-03-22 10:34:36 |
Janåke Rönnblom |
bug |
|
|
added subscriber Janåke Rönnblom |
2018-03-23 15:36:31 |
Dr. Jens Harbott |
tags |
amd64 apport-bug kernel-bug-exists-upstream third-party-packages verification-needed-artful xenial |
amd64 apport-bug kernel-bug-exists-upstream third-party-packages verification-done-artful xenial |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
linux (Ubuntu Artful): status |
Fix Committed |
Fix Released |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-0861 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-1000407 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-15129 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-16994 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-17448 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-17450 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-17741 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-17805 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-17806 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2017-17807 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2018-1000026 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2018-5332 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2018-5333 |
|
2018-04-03 14:10:10 |
Launchpad Janitor |
cve linked |
|
2018-5344 |
|
2019-10-03 07:33:14 |
Po-Hsu Lin |
linux (Ubuntu): status |
Fix Committed |
Fix Released |
|