Cinder-Backup stuck in creating after RBD export-diff existed and turned into zombie / <defunct>
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Cinder |
In Progress
|
Medium
|
Unassigned |
Bug Description
I just observed a volume backup (RBD driver) which is in creating for DAYS already.
I went to the host running cinder-backup and looked at the RBD processes there:
```
[...]
cinder 3396548 1 3396548 3396548 0 Jul31 ? 03:40:51 /usr/bin/python3 /usr/bin/
cinder 3396576 3396548 3396548 3396548 0 Jul31 ? 01:05:29 /usr/bin/python3 /usr/bin/
cinder 3098464 3396576 3396548 3396548 0 Aug13 ? 00:00:01 [rbd] <defunct>
cinder 3098465 3396576 3396548 3396548 0 Aug13 ? 00:01:49 rbd import-diff --id cinder-backup --conf /etc/ceph/
root 3398716 1 3396548 3396548 0 Jul31 ? 00:01:14 /usr/bin/python3 /bin/privsep-helper --config-file /etc/cinder/
[...]
```
Looking at the piece of code doing the piping of rbd export-diff into rbd import-idff at https:/
Achieving something the likes of "-o pipefail" seems to be a little more challenging.
See e.g.
summary: |
Cinder-Backup stuck in creating after RBD export-diff existed and turned - into zombie / <defunct> properly + into zombie / <defunct> |
tags: | added: backup ceph drivers rbd |
Changed in cinder: | |
importance: | Undecided → Medium |
Could you run ps to check if that rbd is zombie? And parent process may also be stuck.
Typically, rbd will return, success or fail, but in this case rbd is stuck. Need to find out how.
"-o pipefail" only affects the rc of whole command. I don't think it will help in this case,
where rbd doesn't return at all.