pg repair action
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
Ceph Monitor Charm | Fix Committed | Wishlist | Unassigned | |
Bug Description
Whenever pg inconsistencies occur, manual intervention from operators is required. Many inconsistencies can be resolved by running `ceph pg repair <pg ID>`, but this is only safe in some situations. For example (list not necessarily exhaustive):
* read errors
* 0 size shards
* wrong data digest on non-primary shard
* inconsistency in an erasure coded pool
As all of the above can be easily verified programmatically, it would be very useful to have a "safe-pg-repair pgid=<num>" action that runs `ceph pg repair <pg ID>` *only* if the inconsistency falls into one of the known safe situations, and returns a warning otherwise. This would drastically simplify managing a charmed ceph cluster since operators could simply run the action as a first attempt, and spend time digging into the specifics of the inconsistency only in cases that truly require human intervention.
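A minimal sketch of how such a check might look, not the charm's actual implementation. It assumes the inconsistency report comes from `rados list-inconsistent-obj <pg ID> --format=json`, and that the report has an `inconsistents` list whose entries carry per-shard `errors`, `primary`, and `size` fields. The error names `read_error` and `data_digest_mismatch_info`, and the helper names `shard_is_safe`, `is_safe_to_repair`, and `safe_pg_repair`, are assumptions for illustration and should be checked against the Ceph release in use. Whether the pool is erasure coded is passed in by the caller.

```python
import json
import subprocess

# Assumed shard-level error names; verify against your Ceph release.
READ_ERROR = "read_error"
DIGEST_MISMATCH = "data_digest_mismatch_info"


def shard_is_safe(shard):
    """Return True if a shard is healthy or matches a known-safe case."""
    errors = set(shard.get("errors", []))
    if not errors:
        return True  # shard reports no errors
    if errors == {READ_ERROR}:
        return True  # read errors only
    if shard.get("size") == 0:
        return True  # zero-size shard
    if errors == {DIGEST_MISMATCH} and not shard.get("primary", False):
        return True  # wrong data digest on a non-primary shard
    return False


def is_safe_to_repair(report, erasure_coded=False):
    """Decide whether every inconsistency in the report is a known-safe case."""
    if erasure_coded:
        return True  # inconsistency in an erasure coded pool
    objects = report.get("inconsistents", [])
    if not objects:
        return False  # nothing to judge, so do not repair blindly
    return all(
        shard_is_safe(shard)
        for obj in objects
        for shard in obj.get("shards", [])
    )


def safe_pg_repair(pgid, erasure_coded=False):
    """Run `ceph pg repair` only when the inconsistency looks safe."""
    raw = subprocess.check_output(
        ["rados", "list-inconsistent-obj", pgid, "--format=json"]
    )
    if not is_safe_to_repair(json.loads(raw), erasure_coded):
        return "warning: pg {} needs manual inspection".format(pgid)
    subprocess.check_call(["ceph", "pg", "repair", pgid])
    return "repair started on pg {}".format(pgid)
```

Keeping the decision in `is_safe_to_repair`, separate from the commands that talk to the cluster, lets the rules be unit tested against captured reports without a running Ceph cluster.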
Changed in charm-ceph-mon:
importance: Undecided → Wishlist
tags: added: bseng-38
Fix proposed to branch: master
Review: https://review.opendev.org/c/openstack/charm-ceph-mon/+/831001