AllWatcher does not report agents that are down
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Triaged
|
High
|
Unassigned |
Bug Description
The JIMM service relies on the AllModelWatcher to observe changes in status of the model. One of the things it needs to know is when machines and units are down. Unfortunately the allwatcher does not provide this information, even though it's reported on a full status.
For example, here is the reply to a FullStatus API call where the machine is down:
{
request-id: 2
response: {
applications: {
ubuntu: {
can-
charm: "cs:ubuntu-12"
endpoint-
exposed: false
life: ""
meter-
public-
relations: {}
series: "xenial"
status: {
data: {}
info: "ready"
kind: ""
life: ""
since: "2018-08-
status: "active"
version: ""
}
subordinat
units: {
"ubuntu/2": {
data: {}
info: ""
kind: ""
life: ""
since: "2018-08-
status: "idle"
version: "2.4.1"
}
charm: ""
leader: true
machine: "2"
data: {}
info: "ready"
kind: ""
life: ""
since: "2018-08-
status: "active"
version: ""
}
}
}
workload-
}
}
controller-
machines: {
"2": {
agent-status: {
data: {}
info: "agent is not communicating with the server"
kind: ""
life: ""
since: "2018-08-
status: "down"
version: "2.4.1"
}
constraints: "cores=1 mem=1024M"
containers: {}
dns-name: "52.23.229.253"
hardware: "arch=amd64 cores=1 cpu-power=350 mem=3840M root-disk=8192M availability-
has-vote: false
id: "2"
instance-id: "i-0f4045bcef2e
instance-
data: {}
info: "running"
kind: ""
life: ""
since: "2018-08-
status: "running"
version: ""
}
ip-addresses: [
"
"
"252.39.86.1"
]
jobs: [
"
]
network-
eth0: {
gateway: "172.31.32.1"
]
is-up: true
mac-address: "12:ae:04:27:c9:3a"
}
fan-252: {
]
is-up: true
mac-address: "32:86:99:8b:b8:a8"
}
}
series: "xenial"
wants-vote: false
}
}
model: {
available-
cloud-tag: "cloud-aws"
meter-status: {
color: ""
message: ""
}
model-status: {
data: {}
info: ""
kind: ""
life: ""
since: "2018-08-
status: "available"
version: ""
}
name: "jimmmodel"
region: "us-east-1"
sla: "unsupported"
type: "iaas"
version: "2.4.1"
}
offers: {}
relations: null
remote-
}
}
Note that the machine-2 status is reported as "down".
By contrast, here's the machine entry as reported by WatchAll.
The agent status is reported as "started", not "down".
machine 2 04c4615d-
model-uuid: "04c4615d-
id: "2"
instance-id: "i-0f4045bcef2e
agent-status: {
current: "started"
message: ""
since: "2018-08-
version: "2.4.1"
}
instance-status: {
current: "running"
message: "running"
since: "2018-08-
version: ""
}
life: "alive"
series: "xenial"
supported-
"lxd"
]
supported-
hardware-
arch: "amd64"
mem: 3840
root-disk: 8192
cpu-cores: 1
cpu-power: 350
availability
}
jobs: [
"JobHostUnits"
]
summary: |
- AllWatcher does not report down machines + AllWatcher does not report agents that are down |
Changed in juju: | |
status: | New → Triaged |
milestone: | none → 2.4.3 |
importance: | Undecided → High |
This is a duplicate of https:/ /bugs.launchpad .net/juju/ +bug/1453096