Juju controller cannot connect to itself
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Fix Released
|
Critical
|
Tim Penhey |
Bug Description
During bundle deployment, a juju controller became unable to connect to itself:
2018-12-07 14:10:26 ERROR juju.worker.
2018-12-07 14:10:30 WARNING juju.mongo open.go:160 mongodb connection failed, will retry: dial tcp 127.0.0.1:37017: connect: connection refused
2018-12-07 14:10:30 WARNING juju.mongo open.go:160 mongodb connection failed, will retry: dial tcp 10.244.
See attached machine log.
Note - these errors/warnings don't necessarily indicate a problem. Only if they happen after a deploy starts:
01:01 < jhobbs> debug_log.
01:01 < jhobbs> debug_log.
01:01 < jhobbs> debug_log.
01:01 < jhobbs> thumper: the controller logs exactly match the bug description; timeout is a secondary result of some other failure and I expect the triager suspected those errors/warnings.
Are those logs red herrings?
01:04 -!- kelvinliu [kelvinliu@
01:05 < thumper> jhobbs: yeah... those are expected
01:05 < thumper> those happened during enable-ha
01:05 < thumper> when you are adding mongo's to the replicaset, it drops all connections
01:05 < thumper> these would only be a problem if we see them after the deploy has started
01:06 < thumper> mongo can take up to two minutes to settle with a new replicaset
Changed in juju: | |
status: | New → Triaged |
importance: | Undecided → High |
milestone: | none → 2.5.1 |
Changed in juju: | |
importance: | High → Critical |
assignee: | nobody → Tim Penhey (thumper) |
tags: | added: sts |
Changed in juju: | |
status: | Triaged → In Progress |
tags: | added: cdo-qa |
tags: | added: foundations-engine |
Changed in juju: | |
status: | Fix Committed → Fix Released |
Looks like the database wasn't reachable so can we get the syslogs on there so that we can see why mongo wasn't connectable? If mongo isn't up then the controller api won't come up.