fluentd not reconnecting to ES on failures
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
kolla-ansible |
Fix Released
|
Medium
|
Doug Szumski | ||
Rocky |
New
|
Medium
|
Unassigned | ||
Stein |
Fix Released
|
Medium
|
Radosław Piliszek | ||
Train |
Fix Released
|
Medium
|
Radosław Piliszek | ||
Ussuri |
Fix Released
|
Medium
|
Doug Szumski |
Bug Description
According to the fluentd-
What I'm currently experiencing in my deployment seems to be related: Once fluentd-es plugin loses connectivity to the ES cluster, it never recovers and logs are no longer being sent:
```
2019-05-22 21:47:32 +0000 [warn]: #0 failed to flush the buffer. retry_time=0 next_retry_
2019-05-23 19:04:44 +0000 [warn]: #0 failed to flush the buffer. retry_time=0 next_retry_
2019-05-23 19:04:45 +0000 [warn]: #0 failed to flush the buffer. retry_time=1 next_retry_
2019-05-23 19:04:46 +0000 [warn]: #0 failed to flush the buffer. retry_time=2 next_retry_
[...]
```
If I wait enough I can see that fluentd gives up on pushing chunks and drops them.
I'll open a review with a proposed configuration change that I've just deployed on one of my controller nodes to see if it helps.
Changed in kolla-ansible: | |
assignee: | Krzysztof Klimonda (kklimonda) → Doug Szumski (dszumski) |
Changed in kolla-ansible: | |
assignee: | Doug Szumski (dszumski) → Krzysztof Klimonda (kklimonda) |
Changed in kolla-ansible: | |
assignee: | Krzysztof Klimonda (kklimonda) → Michal Nasiadka (mnasiadka) |
Changed in kolla-ansible: | |
assignee: | Michal Nasiadka (mnasiadka) → Doug Szumski (dszumski) |
Changed in kolla-ansible: | |
assignee: | Doug Szumski (dszumski) → Michal Nasiadka (mnasiadka) |
Changed in kolla-ansible: | |
assignee: | Michal Nasiadka (mnasiadka) → Radosław Piliszek (yoctozepto) |
Changed in kolla-ansible: | |
assignee: | Radosław Piliszek (yoctozepto) → Doug Szumski (dszumski) |
Fix proposed to branch: master /review. opendev. org/661747
Review: https:/