NTP charm should have tunable alert threshold(s)
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
NTP Charm | Confirmed | Wishlist | Gabriel Cocenza | | |
Bug Description
Sometimes, a customer's corporate NTP servers may be more or less in sync (and likely close enough for the purposes of applications requiring accurate time, e.g. OpenStack), but not close enough to avoid alerts re: clock skew.
It would be good if we had some method to tune at least this threshold, and perhaps others if there are other settings that would be appropriate to tune. Obviously, it would be best for the time servers in question to be brought closer into sync, but if the customer is not able to address the issue, it would be nice to have a way to reduce the occurrence of noise from alerts driven by expected levels of offset skew.
For the sake of this bug, I am interested primarily in the offset warning/critical messages like this:
WARNING: offset is out of range (0.033600) - should be between -0.010000 and 0.010000
CRITICAL: offset is out of range (0.083933) - must be between -0.050000 and 0.050000
This looks like it's controlled by the _metricdefs structure in alert.py (/opt/ntpmon-
In any case, having this type of tunability would reduce the occurrence of borderline alerts for NTP offsets, and would also reduce the likelihood of an engineer ignoring real issues on the assumption that they're just part of the "typical noise".
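To illustrate the request, here is a minimal sketch of an offset check with tunable thresholds. It is not ntpmon's actual code: the function name `classify_offset`, its `warn` and `crit` parameters, and the OK message are all hypothetical. The defaults are taken from the ranges in the messages quoted above, so behaviour would be unchanged unless an operator overrides them.

```python
# Hypothetical sketch only: this is NOT the real _metricdefs structure or
# any existing ntpmon/charm API. Names and the OK message are assumptions.

DEFAULT_WARN = 0.010  # seconds; the WARNING range quoted in this bug
DEFAULT_CRIT = 0.050  # seconds; the CRITICAL range quoted in this bug


def classify_offset(offset, warn=DEFAULT_WARN, crit=DEFAULT_CRIT):
    """Return an alert line for an NTP offset given in seconds."""
    # Reject nonsensical settings rather than silently disabling alerts.
    if not 0 < warn <= crit:
        raise ValueError("thresholds must satisfy 0 < warn <= crit")
    if abs(offset) > crit:
        return ("CRITICAL: offset is out of range (%f) - "
                "must be between %f and %f" % (offset, -crit, crit))
    if abs(offset) > warn:
        return ("WARNING: offset is out of range (%f) - "
                "should be between %f and %f" % (offset, -warn, warn))
    return "OK: offset is within range (%f)" % offset


if __name__ == "__main__":
    # Default thresholds: the first example offset from this bug warns.
    print(classify_offset(0.033600))
    # Relaxed thresholds (illustrative values): the same offset is accepted.
    print(classify_offset(0.033600, warn=0.040, crit=0.100))
```

With something along these lines, a charm config option could supply `warn` and `crit` while the current values stay as the defaults.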
tags: added: bseng-1136
Changed in ntp-charm:
assignee: nobody → Gabriel Cocenza (gabrielcocenza)
If you're able to propose a merge for this we'd be happy to review. I agree we should be careful about how we expose this option, and should keep the current values as default.