rpc core should abort a call() early if the connection is terminated before the timeout period expires
Bug #1368917 reported by
Chris Friesen
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Invalid
|
Undecided
|
Unassigned | ||
oslo.messaging |
Confirmed
|
Medium
|
Ken Giusti |
Bug Description
As it stands, if a client issuing an RPC call() sends a message to the rabbitmq server, then the rabbitmq server does a switchover/failover the client will wait for the full RPC timeout period (60 seconds) even though new rabbitmq server has come up long before then and some connections have been reestablished.
On a controlled switchover especially the RPC core should notice that the server has gone away and should notify any entities waiting for an RPC call() response so that they can error out early rather than waiting for the full RPC timeout period.
This was detected on Havana, but it seems to apply to all other versions as well.
Changed in oslo.messaging: | |
status: | Incomplete → New |
Changed in oslo.messaging: | |
status: | New → Confirmed |
importance: | Undecided → Low |
To post a comment you must log in.
I think this really is an olso.messaging bug, these things are mostly left up to that lib at this point