VM loses network connection during live migration
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
neutron | New | Undecided | Unassigned | | |
Bug Description
Description
===========
I found that a VM loses its network connection during live migration.
I enabled the live_migration_
My OpenStack version is Queens, and I use Linux bridge as my L2 plugin. The physical adapters of my host are bonded (bond0 mode).
When the live migration finished, I found that the ping to the VM was failing. I used tcpdump to capture the packets and found that the packets from the switch were still reaching the destination host.
Steps to reproduce
==================
1. enable the live_migration_
2. Create a new VM, then start pinging the VM created in this step.
3. Perform the live migration. (Before the live migration, make sure the VLAN sub-interface of the network the VM is attached to does not exist on the destination host.)
4. While the live migration is in progress, you will find that the ping to the VM fails.
Expected result
===============
The ping should not be interrupted during live migration.
Actual result
=============
The ping was interrupted, and the ping packets from the physical switch were still being sent to the source host.
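One possible reading of this result, sketched as a toy model: a MAC-learning switch keeps forwarding to the port where it last saw the VM's MAC as a source address, so if no frame from the VM reaches the switch via the destination host, traffic keeps going to the source host. The class, port names and MAC address below are hypothetical and only illustrate the general learning behaviour; they are not taken from the reporter's environment.

```python
class LearningSwitch:
    """Toy model of a MAC-learning switch (hypothetical, for illustration only)."""

    def __init__(self):
        self.mac_table = {}  # MAC address -> port name

    def receive(self, src_mac, port):
        # The switch learns (or refreshes) the port for a MAC address from
        # the source address of every frame it receives.
        self.mac_table[src_mac] = port

    def egress_port(self, dst_mac):
        # Known MACs are forwarded to the learned port; unknown ones are flooded.
        return self.mac_table.get(dst_mac, "flood")


switch = LearningSwitch()
vm_mac = "fa:16:3e:00:00:01"  # made-up MAC address

# Before migration the VM sends traffic through the source host.
switch.receive(vm_mac, "port-to-source-host")
before = switch.egress_port(vm_mac)

# Migration completes, but no frame from the VM reaches the switch
# through the destination host, so the table entry is unchanged.
after_lost_announce = switch.egress_port(vm_mac)

# Once any frame from the VM arrives via the destination host, the entry moves.
switch.receive(vm_mac, "port-to-dest-host")
after_announce = switch.egress_port(vm_mac)
```

In this model the ping would recover as soon as the VM sends any frame that actually leaves the destination host, or when the stale entry ages out on the switch.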
Environment
===========
1. Exact version of OpenStack you are running. See the following
list for all releases: http://
queens
2. Which hypervisor did you use?
Libvirt + KVM
3. Which storage type did you use?
ceph
4. Which networking type did you use?
Neutron with Linux Bridge
The network type of the VM is VLAN.
I also found that, during live migration, the 5 RARP packets from the VM are sent before the VLAN sub-interface is added to the Linux bridge. Maybe this will help confirm the problem I found.
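The timing observation above can be checked mechanically once both sets of timestamps are known. The following is a minimal sketch with made-up numbers (the function name and all values are assumptions, not data from this report): it lists the RARP frames that were sent before the VLAN sub-interface joined the bridge and therefore could not have left the host.

```python
def lost_announcements(rarp_times, bridge_attach_time):
    """Return the RARP send times that precede the bridge attach time."""
    return [t for t in rarp_times if t < bridge_attach_time]


# Hypothetical timestamps, in seconds since the start of the migration switchover.
rarp_times = [0.00, 0.05, 0.15, 0.30, 0.50]  # when each of the 5 RARP frames was sent
bridge_attach_time = 0.80                    # when the VLAN sub-interface joined the bridge

lost = lost_announcements(rarp_times, bridge_attach_time)
all_lost = len(lost) == len(rarp_times)
print(f"{len(lost)} of {len(rarp_times)} RARP frames were sent before the attach")
```

If `all_lost` is true for real timestamps (for example from tcpdump and the L2 agent log), that would be consistent with the switch never learning the VM's new location.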
I suspect this is related to https://bugs.launchpad.net/neutron/+bug/1815989
In the case of OVS, we know there is an unavoidable race between OVS, libvirt and the Neutron L2 agent.
Or rather, there is a race that can only be solved via a change to the L2 agent and by removing libvirt from the network configuration of the OVS ports.
I did not believe this race existed for Linux bridge, but it is possible that it does.
I will need to check the code, but could you provide the libvirt version you are using and, ideally,
a sample of the libvirt XML that we generated? If you can provide the nova-compute log for both the source and destination hosts and the Neutron Linux bridge L2 agent log for the time period during the migration, that should contain all the info we need, provided they are running at debug log level.