Nagios report check_crm fail but it runs fine in host
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
NRPE Charm |
New
|
Undecided
|
Unassigned |
Bug Description
Nagios fail to run crm_mon command on some host even though manually running crm_mon is fine
Nagios alarm:
tcs-preprod-
CRITICAL 2022-10-21 00:13:13 2d 4h 25m 52s 4/4
check_crm CRITICAL - Running /usr/sbin/crm_mon -1 -r -f FAILED
Manually running the command in host is fine
ubuntu@
Cluster Summary:
* Stack: corosync
* Current DC: iadaz02sashcp02-k8s (version 2.0.3-4b1f869f0f) - partition with quorum
* Last updated: Fri Oct 21 00:15:46 2022
* Last change: Tue Oct 18 02:51:27 2022 by root via cibadmin on iadaz01sashcp01-k8s
* 2 nodes configured
* 4 resource instances configured
Node List:
* Online: [ iadaz01sashcp01-k8s iadaz02sashcp02-k8s ]
Full List of Resources:
* Resource Group: grp_kubeapi-
* res_kubeapi-
* res_kubeapi-
* Clone Set: cl_res_nginx_nginx [res_nginx_nginx]:
* Started: [ iadaz01sashcp01-k8s iadaz02sashcp02-k8s ]
Migration Summary:
Environment : Kubernetes 1.23