Systemd unit cannot restart sometimes
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Daisy |
Triaged
|
Undecided
|
Unassigned |
Bug Description
Hello,
Here is a snippet of a production incident debug where a lot of retracer-amd64 service were in a "failed" state.
I looked in some systemd parameters and here are my conclusion:
sudo systemctl show retracer-amd64 | grep -E 'Burst|
Restart=always
RestartUSec
StartLimitI
StartLimitB
So my guess is, the unit restarts 5 times, waiting only 100ms between each and is then prevented to restart, even with Restart=always.
I suggest setting the RestartSec to a higher value and increasing significantly the StartLimitBurst or reducing a LOT the StartLimitInterval (which is 60 secs).
I'm opening the bug here because I think this is related to the charm, not the application itself.
Related branches
- Brian Murray (community): Disapprove
- Junien Fridrick: Approve
-
Diff: 18 lines (+2/-1)1 file modifiedhooks/common (+2/-1)
tags: | added: id-5e2ab61280a0ed81905cff72 |
The hook, located here https:/ /code.launchpad .net/~daisy- pluckers/ charms/ xenial/ daisy-retracer/ trunk, only sets StartLImitInterval and StartLimitBurst - I guess the others are systemd defaults.
Do you have an idea of how often this has been a problem so we can properly prioritize the work?