RadosGW process dies on cluster restart
Bug #1519148 reported by Gregory Elkinbard
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Mirantis OpenStack | Invalid | High | Gregory Elkinbard |
7.0.x | Invalid | High | Gregory Elkinbard |
Bug Description
There is a startup dependency between the Ceph OSDs, the monitors (Mons), and RadosGW.
If RadosGW is started before the OSDs and Mons are online, it exits with an error.
This unnecessarily complicates the cluster startup process.
RadosGW should be registered with Supervisor so that the process is restarted if it exits, and the correct Pacemaker dependencies should be created.
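The restart-on-exit behavior requested above can be illustrated with a small sketch. This is not how Supervisor or Pacemaker are implemented or configured; it only shows the idea of relaunching a process that exits with an error. The function name `supervise` and its parameters (`max_restarts`, `delay`, `run`, `sleep`) are hypothetical and chosen for illustration.

```python
import subprocess
import time


def supervise(cmd, max_restarts=5, delay=2.0, run=subprocess.call, sleep=time.sleep):
    """Run cmd and relaunch it after each non-zero exit.

    Hypothetical helper, not part of Supervisor, Pacemaker, or Ceph.
    Returns (exit_code, launches) once the command exits with 0 or
    after max_restarts relaunches have all failed.
    """
    launches = 0
    while True:
        launches += 1
        code = run(cmd)  # blocks until the process exits
        if code == 0 or launches > max_restarts:
            return code, launches
        sleep(delay)  # pause before relaunching, to avoid a tight crash loop
```

In a real deployment `cmd` would be the foreground RadosGW command line, which is deployment-specific and not given in this report. A fixed delay is the simplest choice; a real supervisor would usually also cap the restart rate.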
Changed in fuel: | |
assignee: | nobody → Fuel Library Team (fuel-library) |
tags: | added: area-library |
Changed in fuel: | |
milestone: | none → 8.0 |
importance: | Undecided → Medium |
status: | New → Confirmed |
tags: | added: area-mos removed: area-library |
no longer affects: | fuel |
no longer affects: | mos/8.0.x |
Changed in mos: | |
assignee: | Kostiantyn Danylov (kdanylov) → Alexei Sheplyakov (asheplyakov) |
Changed in mos: | |
status: | Incomplete → Confirmed |
status: | Confirmed → Incomplete |
After discussion with the bug reporter: this is related to the cluster cold boot process and requires radosgw to be monitored and restarted properly. It could be useful to use an upstart respawn configuration, or systemd control in the future. Nevertheless, the proper fix is to make radosgw retry the connection until it finds running OSDs and can start providing the service. I am marking this bug as High and assigning it to our Ceph team for investigation.
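The retry approach described in this comment might look roughly like the sketch below. It is not radosgw code; `wait_until_ready` and its `probe` callback are hypothetical names, and the probe stands in for whatever check would tell the gateway that the monitors and OSDs are reachable. The exponential backoff with a cap is an assumption, not something the comment specifies.

```python
import time


def wait_until_ready(probe, attempts=10, base_delay=1.0, max_delay=30.0, sleep=time.sleep):
    """Call probe() until it returns True, backing off between attempts.

    Hypothetical helper for illustration only. Returns the number of the
    attempt that succeeded, or raises TimeoutError if every attempt fails.
    """
    delay = base_delay
    for attempt in range(1, attempts + 1):
        if probe():
            return attempt
        if attempt < attempts:
            sleep(delay)
            # Double the wait each time, but never exceed max_delay.
            delay = min(delay * 2, max_delay)
    raise TimeoutError("cluster not ready after %d attempts" % attempts)
```

With this pattern the service waits for its dependencies instead of exiting, so a cold boot no longer depends on the order in which the daemons are started.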