Ubuntu Server x64 Kernel Oops - Random services tainted
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Fix Released
|
Undecided
|
Unassigned | ||
Hardy |
Fix Released
|
Medium
|
Andy Whitcroft |
Bug Description
Binary package hint: linux-image-
Running our LTSP server we have been constantly getting Kernel oops in relation to different tainted services, Bind, SSH, IMAP, etc. it is mostly Bind9 in the logs today was the first we have seen of IMAP. We thought it might have something to do with LTSP but it is not looking that way as even if we have all the users log off the load average still climbs up over the days for example:
19:28:04 up 4 days, 10:33, 2 users, load average: 3.00, 3.00, 3.00
That was our first clue that something was up(loadavg)
So we began surfing the logs and found that random services were crashing in syslog, that directly correlated with kernel oops'es in messages.
I know this is very general I will upload anything requested.
I am attaching excerpts from the syslog and messages that directly correlate to one another.
Changed in linux: | |
milestone: | ubuntu-8.04.2 → ubuntu-8.04.3 |
I'm working on the same physical server as the original bug report. I also noticed this in the kern.log at the end of reboot kernel messages, which may or may not be relevant:
Jun 27 10:52:21 www kernel: [ 145.904725] mtrr: type mismatch for d0000000,1000000 old: write-back new: write-combining
Jun 27 10:52:38 www kernel: [ 163.650905] mtrr: type mismatch for d0000000,1000000 old: write-back new: write-combining
Jun 27 10:53:47 www kernel: [ 232.116870] mtrr: type mismatch for d0000000,1000000 old: write-back new: write-combining
I've also attached two additional Oopses, which occurred before the original bug report's Oops. Note the different affected processes.
These Oopses result in defunct processes which necessitate a reboot to restore services. Unfortunately, the reboot fails and we must physically power-cycle the box from the console - always fun on a production SCSI RAID server.
Any help is most appreciated.