[Upgrade Xenial -> Bionic] Some of the OSDs are in blocked state after upgrade due to "Non-pristine devices detected"
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ceph OSD Charm |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
I was trying to upgrade ceph-osd with:
juju set-series ceph-osd bionic
# Skipping dist-upgrade as already done prior
juju config ceph-osd source=distro
Then 6/15 OSDs go into "blocked" state complaining about "Non-pristine devices detected, consult `list-disks`, `zap-disk` and `blacklist-*` actions."
I've checked that OSDs are healthy. Then I tried to use "blacklist-
Detailed info:
1. juju output:
https:/
2. ceph mon status:
https:/
3. list-disks output of ceph-osd/0 for e.g:
https:/
FYI, content of 'osd-devices' is `/dev/disk/ by-dname/ bcache2 /dev/disk/ by-dname/ bcache3 /dev/disk/ by-dname/ bcache4 /dev/disk/ by-dname/ bcache5 /dev/disk/ by-dname/ bcache6 /dev/disk/ by-dname/ bcache7` .
Some logs that maybe interesting from the unit: uniter. remotestate watcher.go:427 got application change uniter. remotestate watcher.go:448 got config change: ok=true, hashes= [26a4c44f835210 54789f1277431b1 abc01b2cb26aa78 3a3dbd32246ec9e 558ea] uniter. operation executor.go:59 running operation run config-changed hook uniter. operation executor.go:90 preparing operation "run config-changed hook" uniter. operation executor.go:90 executing operation "run config-changed hook" uniter. runner runner.go:595 starting jujuc server {unix @/var/lib/ juju/agents/ unit-ceph- osd-0/agent. socket <nil>} d/50-ceph- osd-charm. conf values: {'kernel.pid_max': 2097152, 'vm.max_map_count': 524288, 'kernel. threads- max': 2097152, 'vm.vfs_ cache_pressure' : 100, 'vm.swappiness'
2021-06-29 02:19:28 DEBUG juju.worker.
2021-06-29 02:19:28 DEBUG juju.worker.uniter resolver.go:147 no operations in progress; waiting for changes
2021-06-29 02:20:00 DEBUG juju.worker.
2021-06-29 02:20:00 DEBUG juju.worker.uniter resolver.go:147 no operations in progress; waiting for changes
2021-06-29 02:20:00 DEBUG juju.worker.
2021-06-29 02:20:00 DEBUG juju.machinelock machinelock.go:162 acquire machine lock for uniter (run config-changed hook)
2021-06-29 02:20:00 DEBUG juju.machinelock machinelock.go:172 machine lock acquired for uniter (run config-changed hook)
2021-06-29 02:20:00 DEBUG juju.worker.
2021-06-29 02:20:00 DEBUG juju.worker.
2021-06-29 02:20:00 DEBUG juju.worker.uniter agent.go:20 [AGENT-STATUS] executing: running config-changed hook
2021-06-29 02:20:00 DEBUG juju.worker.
2021-06-29 02:20:01 DEBUG worker.uniter.jujuc server.go:204 running hook tool "juju-log"
2021-06-29 02:20:01 DEBUG juju-log Hardening function 'config_changed'
2021-06-29 02:20:01 DEBUG worker.uniter.jujuc server.go:204 running hook tool "config-get"
2021-06-29 02:20:01 DEBUG worker.uniter.jujuc server.go:204 running hook tool "juju-log"
2021-06-29 02:20:01 DEBUG juju-log No hardening applied to 'config_changed'
2021-06-29 02:20:01 DEBUG worker.uniter.jujuc server.go:204 running hook tool "juju-log"
2021-06-29 02:20:01 INFO juju-log old_version: luminous
2021-06-29 02:20:01 DEBUG worker.uniter.jujuc server.go:204 running hook tool "juju-log"
2021-06-29 02:20:01 INFO juju-log new_version: luminous
2021-06-29 02:20:01 DEBUG worker.uniter.jujuc server.go:204 running hook tool "juju-log"
2021-06-29 02:20:01 ERROR juju-log Invalid upgrade path from luminous to luminous. Valid paths are: ['firefly -> hammer', 'hammer -> jewel', 'jewel -> luminous', 'luminous -> mimic', 'mimic -> nautilus', 'nautilus -> octopus']
2021-06-29 02:20:01 DEBUG worker.uniter.jujuc server.go:204 running hook tool "juju-log"
2021-06-29 02:20:01 DEBUG juju-log Updating sysctl_file: /etc/sysctl.
: 1}
2021-06-29 02:20:02 DEBUG juju-log got journal devs: {'/dev/ disk/by- dname/nvme0n1- part3'}
2021-06-29 02:20:02 DEBUG worker.uniter.jujuc server.go:204 running hook tool "juju-log"
2021-06-29 02:20:02 INFO juju-log Skipping osd devices previously processed by this uni...