capture slow heart beats in ceph logs

Bug #1966616 reported by Linda Guo
10
This bug affects 2 people
Affects Status Importance Assigned to Milestone
prometheus-grok-exporter-charm
New
High
Unassigned

Bug Description

we did not easily detect dropped packets on the ceph network

needs to look into using the grok exporter application to provide counts for slow heartbeats that would indicate a networking issues.

2022-03-21T10:14:17.974322+0000 mon.juju-a79b06-10-lxd-0 (mon.0) 9955308 : cluster [WRN] Health check failed: Slow OSD heartbeats on front (longest 2120.925ms) (OSD_SLOW_PING_TIME_FRONT)
2022-03-21T10:15:17.679289+0000 mon.juju-a79b06-10-lxd-0 (mon.0) 9955357 : cluster [INF] Health check cleared: OSD_SLOW_PING_TIME_FRONT (was: Slow OSD heartbeats on front (longest 2120.925ms))
2022-03-21T12:21:32.712067+0000 mon.juju-a79b06-10-lxd-0 (mon.0) 9961566 : cluster [WRN] Health check failed: Slow OSD heartbeats on back (longest 1130.745ms) (OSD_SLOW_PING_TIME_BACK)

Linda Guo (lihuiguo)
description: updated
Linda Guo (lihuiguo)
Changed in charm-prometheus-grok-exporter:
importance: Undecided → High
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.