juju.worker.dependency engine.go:539 "mgo-txn-resumer" manifold worker returned unexpected error: cannot resume transactions: cannot find transaction ObjectIdHex("stuff")
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Triaged
|
Low
|
Unassigned |
Bug Description
Hi,
Today a controller agent jujud starting going crazy, using a whole lot of CPU, taking mongodb along with it.
The first sign of madness in machine-0 logs are :
2016-11-22 01:36:34 ERROR juju.worker.
2016-11-22 01:36:37 ERROR juju.worker.
with the last line repeating quite a bit.
We restarted jujud-machine-0 and juju-db on the controller, and apparently they're back up and working, even though jujud is still consuming between 150 and 200% CPU, and complaining about the missing transaction.
This is not an HA-enabled controller, so there is a single, non-replicated mongodb database.
We haven't run mgopurge.
I tried looking for the transaction, and indeed it's gone. The following displays nothing :
juju:PRIMARY> db.txns.
While looking at syslog, I couldn't find where this transaction got removed. I found references to it in the "leases" collection, and the first entry for this collection is quite MASSIVE :
{ "_id" : "3076959e-
Note that this bug appears to be similar to https:/
This is with juju version 2.0.2
Changed in juju: | |
status: | Won't Fix → New |
Changed in juju: | |
status: | New → Triaged |
milestone: | none → 2.1.1 |
tags: | added: cpe-onsite |
tags: | added: 4010 |
Relevant logs at : https:/ /private- fileshare. canonical. com/~axino/ lp1643795
If you need anything, please let me know, thanks !