[OVN] Lack of AZs awareness in L3 port scheduler
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
neutron |
In Progress
|
Undecided
|
Rodolfo Alonso |
Bug Description
The OVN L3 port scheduler assigns the router ports to gateway chassis. It retrieves the chassis list from nodes configured as gateway (external_
As a result, we have a list of all eligible chassis for gateway ports, in all AZs where it could be scheduled.
Then, both chance and leastloaded scheduler select 5 nodes from this list (hardcoded in common/
In some use cases, where AZs are mapped to “failure domains”, this could be a problem. While in OVS l3_ha mode, router instances where placed by “neutron.
Hello Morice:
The behaviour of the OVN L3 scheduler, with AZ filtering, that you are describing is correct. And this the expected behaviour: if the OVN L3 scheduler returns several ports (in any order depending on the scheduler) and this 5 ports [1] belong to the same AZ, the router will be scheduled to this single AZ. The OVN L3 scheduler won't distribute the ports among the available AZs. In that case, you can create a new OVN L3 scheduler class if that is what you need. The OVN L3 scheduler is configurable.
What does it mean "failure domains"? If you have GW chassis that should be disabled, then you should disabled them manually or remove the AZ tag from them. I would like to know what is the use case you need and what you are expecting from the scheduler.
Regards.
[1] As you correctly commented, there is a hardcoded limit of 5 router ports.