bind9 segfaults on certain stressful scenarios
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
bind9 (Ubuntu) |
Fix Released
|
Undecided
|
Sergio Durigan Junior | ||
Focal |
Fix Released
|
High
|
Sergio Durigan Junior |
Bug Description
[ Impact ]
On certain scenarios where bind9's resolver is put under stress, a segmentation fault can happen on isc__nm_
[ Test Plan ]
Unfortunately, after several attempts I wasn't able to reproduce the issue in a reliable manner. For that reason, I have been relying on the community to perform tests and determine the right fix for the issue. Some members of the community have deployments where the segmentation fault occurs after some time (typically less than 1 month). Therefore, the test plan for this bug will involve asking these kind community members to help us by installing the bind9 package from focal-proposed and leave it running for some time. The expectation here is that the segmentation fault will not manifest with the new package.
[ Where problems could occur ]
The backported patch is not entirely trivial, although it is well contained within the tcpdns code. The intention is to split tcpdns into a new, asynchronous thread which will ultimately make accessing internal socket fields safe. As is common with general code overhauls, this one also introduces a chance for some bad interaction between tcpdns and its users.
[ Other Info ]
The positive side here is that this code has been incorporated into bind9 upstream 2 years ago, and there have been no regressions reported against it to the best of my knowledge. On top of that, at least 3 community members have extensively tested a PPA with this backport and all of them reported back saying that the issue has been fixed.
It's also important to note that this backport addresses solely the bug experienced by the community users. During the review of the MP to fix the bug, Andreas found another patch that looked like it should be backported as well, but we were not sure. I raised this with upstream here:
https:/
and, as can be seen, their reply was not very encouraging. Having in mind that (a) the backport in question does solve the problems experienced by the community, (b) we have been actively working to get an MRE for bind9 on Jammy and Focal, (c) when the MRE is in place we will be able to update bind9 and get the latest code that fixes this and many other issues, and (d) it'd be very risky and somewhat unfeasible to backport all of the related fixes pointed by upstream, I decided to move forward with this SRU as is.
[ Original Description ]
The server acts as Samba AD DC
ProblemType: Bug
DistroRelease: Ubuntu 20.04
Package: bind9 1:9.16.
ProcVersionSign
Uname: Linux 5.4.0-122-generic x86_64
ApportVersion: 2.20.11-
Architecture: amd64
CasperMD5CheckR
Date: Tue Nov 22 14:05:57 2022
RelatedPackageV
bind9utils N/A
apparmor 2.13.3-7ubuntu5.1
SourcePackage: bind9
UpgradeStatus: No upgrade log present (probably fresh install)
mtime.conffile.
mtime.conffile.
mtime.conffile.
Related branches
- git-ubuntu bot: Approve
- Andreas Hasenack (community): Approve
- Canonical Server Reporter: Pending requested
-
Diff: 238 lines (+216/-0)3 files modifieddebian/changelog (+8/-0)
debian/patches/lp1997375-segfault-isc-nm-tcp-send.patch (+207/-0)
debian/patches/series (+1/-0)
CVE References
description: | updated |
description: | updated |
description: | updated |
Changed in bind9 (Ubuntu Focal): | |
status: | Confirmed → In Progress |
description: | updated |
Thank you for taking the time to file a bug report.
Unfortunately it is not possible to determine the root cause of the issue you've experienced based only on the log file attached to the bug. We will need more information to proceed with the investigation. Can you reliably reproduce the problem? If yes, are you able to obtain a coredump when the problem happens? It would also be extremely valuable if you could provide a step-by-step procedure to trigger the issue.
Since there is not enough information in your report to begin triage or to
differentiate between a local configuration problem and a bug in Ubuntu, I
am marking this bug as "Incomplete". We would be grateful if you would:
provide a more complete description of the problem, explain why you
believe this is a bug in Ubuntu rather than a problem specific to your
system, and then change the bug status back to "New".
For local configuration issues, you can find assistance here: www.ubuntu. com/support/ community
http://