2021-02-03 14:42:36 |
Chris MacNaughton |
bug |
|
|
added bug |
2021-02-03 14:42:59 |
Chris MacNaughton |
summary |
[SRU] |
[SRU] Recent update broke message handling |
|
2021-02-03 14:43:15 |
Chris MacNaughton |
summary |
[SRU] Recent update broke message handling |
[SRU] MessageTimeout and DuplicateMessage errors after udpate |
|
2021-02-03 15:39:27 |
Corey Bryant |
bug task added |
|
python-oslo.messaging (Ubuntu) |
|
2021-02-03 15:39:38 |
Corey Bryant |
nominated for series |
|
Ubuntu Bionic |
|
2021-02-03 15:39:38 |
Corey Bryant |
bug task added |
|
python-oslo.messaging (Ubuntu Bionic) |
|
2021-02-03 15:39:45 |
Corey Bryant |
python-oslo.messaging (Ubuntu Bionic): status |
New |
Triaged |
|
2021-02-03 15:39:47 |
Corey Bryant |
python-oslo.messaging (Ubuntu Bionic): importance |
Undecided |
Critical |
|
2021-02-03 15:39:50 |
Corey Bryant |
python-oslo.messaging (Ubuntu): status |
New |
Invalid |
|
2021-02-03 15:39:59 |
Corey Bryant |
bug task added |
|
cloud-archive |
|
2021-02-03 15:40:17 |
Corey Bryant |
nominated for series |
|
cloud-archive/rocky |
|
2021-02-03 15:40:17 |
Corey Bryant |
bug task added |
|
cloud-archive/rocky |
|
2021-02-03 15:40:17 |
Corey Bryant |
nominated for series |
|
cloud-archive/stein |
|
2021-02-03 15:40:17 |
Corey Bryant |
bug task added |
|
cloud-archive/stein |
|
2021-02-03 15:40:17 |
Corey Bryant |
nominated for series |
|
cloud-archive/queens |
|
2021-02-03 15:40:17 |
Corey Bryant |
bug task added |
|
cloud-archive/queens |
|
2021-02-03 15:40:34 |
Corey Bryant |
cloud-archive/stein: status |
New |
Triaged |
|
2021-02-03 15:40:37 |
Corey Bryant |
cloud-archive/rocky: status |
New |
Triaged |
|
2021-02-03 15:40:40 |
Corey Bryant |
cloud-archive/queens: status |
New |
Triaged |
|
2021-02-03 15:40:44 |
Corey Bryant |
cloud-archive/stein: importance |
Undecided |
Critical |
|
2021-02-03 15:40:45 |
Corey Bryant |
cloud-archive/rocky: importance |
Undecided |
Critical |
|
2021-02-03 15:40:47 |
Corey Bryant |
cloud-archive/queens: importance |
Undecided |
Critical |
|
2021-02-03 15:40:50 |
Corey Bryant |
cloud-archive: status |
New |
Invalid |
|
2021-02-03 15:41:54 |
Chris MacNaughton |
bug task deleted |
cloud-archive/queens |
|
|
2021-02-03 15:50:07 |
Corey Bryant |
nominated for series |
|
cloud-archive/queens |
|
2021-02-03 15:50:07 |
Corey Bryant |
bug task added |
|
cloud-archive/queens |
|
2021-02-03 15:50:14 |
Corey Bryant |
cloud-archive/queens: status |
New |
Incomplete |
|
2021-02-03 15:50:16 |
Corey Bryant |
cloud-archive/queens: status |
Incomplete |
Triaged |
|
2021-02-03 15:50:18 |
Corey Bryant |
cloud-archive/queens: importance |
Undecided |
Critical |
|
2021-02-03 15:54:32 |
Peter Sabaini |
bug |
|
|
added subscriber Canonical IS BootStack |
2021-02-03 17:17:43 |
Robie Basak |
tags |
regression-updates |
regression-update |
|
2021-02-03 17:18:06 |
Robie Basak |
description |
A recent update to oslo.messaging to resolve #1789177 causes failures.
(Below comments copied from the original bug):
After a partial upgrade (only one side, producers or consumers), there are a lot of MessageTimeout and DuplicateMessage errors in the logs. Downgrading back to 5.35.0-0ubuntu1~cloud0 fixed the problem.
Right after restarted n-ovs-agent, I can see a lot of errors in rabbitmq log[1]
which is the same as the error when rabbitmq failover issue ( the original issue of this LP )
Then after I upgraded oslo.messaging in neutron-api unit and restarted neutron-server, below errors are gone and I was able to create instance again.
After upgrading oslo.messaging in n-ovs only, exchange they communicate didn't match.
As changing exchanges they use depends on publisher-consumer relation.
So I think there are two ways.
1. revert this patch for Q ( original failover problem will be there )
2. upgrade them with maintenance window
Thanks a lot
[1]
################################################################################
=ERROR REPORT==== 3-Feb-2021::03:25:26 ===
Channel error on connection <0.2379.1> (10.0.0.32:60430 -> 10.0.0.34:5672, vhost: 'openstack', user: 'neutron'), channel 1:
{amqp_error,not_found,
"no exchange 'reply_7da3cecc31b34bdeb96c866dc84e3044' in vhost 'openstack'",
'basic.publish'}
10.0.0.32 is neutron-api unit |
A recent update to oslo.messaging to resolve bug 1789177 causes failures.
(Below comments copied from the original bug):
After a partial upgrade (only one side, producers or consumers), there are a lot of MessageTimeout and DuplicateMessage errors in the logs. Downgrading back to 5.35.0-0ubuntu1~cloud0 fixed the problem.
Right after restarted n-ovs-agent, I can see a lot of errors in rabbitmq log[1]
which is the same as the error when rabbitmq failover issue ( the original issue of this LP )
Then after I upgraded oslo.messaging in neutron-api unit and restarted neutron-server, below errors are gone and I was able to create instance again.
After upgrading oslo.messaging in n-ovs only, exchange they communicate didn't match.
As changing exchanges they use depends on publisher-consumer relation.
So I think there are two ways.
1. revert this patch for Q ( original failover problem will be there )
2. upgrade them with maintenance window
Thanks a lot
[1]
################################################################################
=ERROR REPORT==== 3-Feb-2021::03:25:26 ===
Channel error on connection <0.2379.1> (10.0.0.32:60430 -> 10.0.0.34:5672, vhost: 'openstack', user: 'neutron'), channel 1:
{amqp_error,not_found,
"no exchange 'reply_7da3cecc31b34bdeb96c866dc84e3044' in vhost 'openstack'",
'basic.publish'}
10.0.0.32 is neutron-api unit |
|
2021-02-03 17:54:15 |
Robie Basak |
python-oslo.messaging (Ubuntu Bionic): status |
Triaged |
Fix Committed |
|
2021-02-03 17:54:17 |
Robie Basak |
bug |
|
|
added subscriber Ubuntu Stable Release Updates Team |
2021-02-03 17:54:19 |
Robie Basak |
bug |
|
|
added subscriber SRU Verification |
2021-02-03 17:54:22 |
Robie Basak |
tags |
regression-update |
regression-update verification-needed verification-needed-bionic |
|
2021-02-03 18:02:53 |
Corey Bryant |
description |
A recent update to oslo.messaging to resolve bug 1789177 causes failures.
(Below comments copied from the original bug):
After a partial upgrade (only one side, producers or consumers), there are a lot of MessageTimeout and DuplicateMessage errors in the logs. Downgrading back to 5.35.0-0ubuntu1~cloud0 fixed the problem.
Right after restarted n-ovs-agent, I can see a lot of errors in rabbitmq log[1]
which is the same as the error when rabbitmq failover issue ( the original issue of this LP )
Then after I upgraded oslo.messaging in neutron-api unit and restarted neutron-server, below errors are gone and I was able to create instance again.
After upgrading oslo.messaging in n-ovs only, exchange they communicate didn't match.
As changing exchanges they use depends on publisher-consumer relation.
So I think there are two ways.
1. revert this patch for Q ( original failover problem will be there )
2. upgrade them with maintenance window
Thanks a lot
[1]
################################################################################
=ERROR REPORT==== 3-Feb-2021::03:25:26 ===
Channel error on connection <0.2379.1> (10.0.0.32:60430 -> 10.0.0.34:5672, vhost: 'openstack', user: 'neutron'), channel 1:
{amqp_error,not_found,
"no exchange 'reply_7da3cecc31b34bdeb96c866dc84e3044' in vhost 'openstack'",
'basic.publish'}
10.0.0.32 is neutron-api unit |
[Impact]
A recent update to oslo.messaging to resolve #1789177 causes failures.
(Below comments copied from the original bug):
After a partial upgrade (only one side, producers or consumers), there are a lot of MessageTimeout and DuplicateMessage errors in the logs. Downgrading back to 5.35.0-0ubuntu1~cloud0 fixed the problem.
Right after restarted n-ovs-agent, I can see a lot of errors in rabbitmq log[1]
which is the same as the error when rabbitmq failover issue ( the original issue of this LP )
Then after I upgraded oslo.messaging in neutron-api unit and restarted neutron-server, below errors are gone and I was able to create instance again.
After upgrading oslo.messaging in n-ovs only, exchange they communicate didn't match.
As changing exchanges they use depends on publisher-consumer relation.
So I think there are two ways.
1. revert this patch for Q ( original failover problem will be there )
2. upgrade them with maintenance window
Thanks a lot
[1]
################################################################################
=ERROR REPORT==== 3-Feb-2021::03:25:26 ===
Channel error on connection <0.2379.1> (10.0.0.32:60430 -> 10.0.0.34:5672, vhost: 'openstack', user: 'neutron'), channel 1:
{amqp_error,not_found,
"no exchange 'reply_7da3cecc31b34bdeb96c866dc84e3044' in vhost 'openstack'",
'basic.publish'}
10.0.0.32 is neutron-api unit
[Test Case]
This SRU needs the following scenarios tested:
1) partial upgrade of n-ovs at 5.35.0-0ubuntu3 [1] and n-api/n-gateway at 5.35.0-0ubuntu1 - instance creation will be successful
2) partial upgrade of n-api/n-gateway at 5.35.0-0ubuntu3 [1] and n-ovs at 5.35.0-0ubuntu1 - instance creation will be successful
3) partial upgrade of n-ovs at 5.35.0-0ubuntu2 [1] and n-api/n-gateway at 5.35.0-0ubuntu3 - instance creation will fail (see regression potential)
4) partial upgrade of n-api/n-gateway at 5.35.0-0ubuntu3 [1] and n-ovs at 5.35.0-0ubuntu2 - instance creation will fail (see regression potential)
5) test all neutron nodes at 5.35.0-0ubuntu3 - instance creation will be successful
[1] and neutron* services restarted
[Regression Potential]
There is regression potential for clouds that have already upgraded to 5.35.0-0ubuntu2. This needs to be tested, but if a cloud has fully upgraded to 5.35.0-0ubuntu2, the same disruption that this SRU is trying to solve may occur again while some services run 5.35.0-0ubuntu2 and others run 5.35.0-0ubuntu3. Once that cloud is entirely at 5.35.0-0ubuntu3, messages will no longer time out. |
|
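The expected outcomes in the [Test Case] list above can be summarized as a small compatibility check. This is a hypothetical sketch, not part of oslo.messaging or any SRU tooling: the function name and the rule it encodes are assumptions inferred from the scenarios, namely that 5.35.0-0ubuntu2 only interoperates with itself and that 5.35.0-0ubuntu3 behaves like 5.35.0-0ubuntu1.

```python
# Hypothetical sketch of the scenario matrix; not part of oslo.messaging.
# Assumption: 5.35.0-0ubuntu2 is the only version that changed exchange
# handling, so it interoperates only with itself, while 5.35.0-0ubuntu1
# and 5.35.0-0ubuntu3 interoperate with each other.

INCOMPATIBLE_UPDATE = "5.35.0-0ubuntu2"
KNOWN_VERSIONS = {"5.35.0-0ubuntu1", "5.35.0-0ubuntu2", "5.35.0-0ubuntu3"}


def instance_creation_expected(n_ovs: str, n_api_gateway: str) -> bool:
    """Return True if both sides are expected to agree on exchanges."""
    for version in (n_ovs, n_api_gateway):
        if version not in KNOWN_VERSIONS:
            raise ValueError(f"unknown package version: {version}")
    # Both sides must be on the same side of the behaviour change.
    return (n_ovs == INCOMPATIBLE_UPDATE) == (n_api_gateway == INCOMPATIBLE_UPDATE)


# (scenario, n-ovs version, n-api/n-gateway version, expected success),
# transcribed from the test case list. Scenarios 3 and 4 list the same
# version pair as written there.
SCENARIOS = [
    (1, "5.35.0-0ubuntu3", "5.35.0-0ubuntu1", True),
    (2, "5.35.0-0ubuntu1", "5.35.0-0ubuntu3", True),
    (3, "5.35.0-0ubuntu2", "5.35.0-0ubuntu3", False),
    (4, "5.35.0-0ubuntu2", "5.35.0-0ubuntu3", False),
    (5, "5.35.0-0ubuntu3", "5.35.0-0ubuntu3", True),
]

if __name__ == "__main__":
    for number, n_ovs, n_api, expected in SCENARIOS:
        outcome = instance_creation_expected(n_ovs, n_api)
        status = "matches" if outcome == expected else "differs from"
        print(f"scenario {number}: rule {status} the test plan")
```

The rule is only a reading of the test plan; actual behaviour still has to be verified on a deployment with the neutron services restarted after each package change.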
2021-02-03 18:15:18 |
Corey Bryant |
cloud-archive/stein: status |
Triaged |
Fix Committed |
|
2021-02-03 18:15:20 |
Corey Bryant |
tags |
regression-update verification-needed verification-needed-bionic |
regression-update verification-needed verification-needed-bionic verification-stein-needed |
|
2021-02-03 19:31:30 |
Corey Bryant |
cloud-archive/rocky: status |
Triaged |
Fix Committed |
|
2021-02-03 19:31:31 |
Corey Bryant |
tags |
regression-update verification-needed verification-needed-bionic verification-stein-needed |
regression-update verification-needed verification-needed-bionic verification-rocky-needed verification-stein-needed |
|
2021-02-03 21:09:54 |
Corey Bryant |
cloud-archive/queens: status |
Triaged |
Fix Committed |
|
2021-02-03 21:09:55 |
Corey Bryant |
tags |
regression-update verification-needed verification-needed-bionic verification-rocky-needed verification-stein-needed |
regression-update verification-needed verification-needed-bionic verification-queens-needed verification-rocky-needed verification-stein-needed |
|
2021-02-03 22:07:52 |
Corey Bryant |
tags |
regression-update verification-needed verification-needed-bionic verification-queens-needed verification-rocky-needed verification-stein-needed |
regression-update verification-done-bionic verification-needed verification-rocky-done verification-stein-done |
|
2021-02-03 22:08:01 |
Corey Bryant |
tags |
regression-update verification-done-bionic verification-needed verification-rocky-done verification-stein-done |
regression-update verification-done verification-done-bionic verification-rocky-done verification-stein-done |
|
2021-02-03 22:08:50 |
Corey Bryant |
tags |
regression-update verification-done verification-done-bionic verification-rocky-done verification-stein-done |
regression-update verification-done verification-done-bionic verification-queens-needed verification-rocky-done verification-stein-done |
|
2021-02-03 22:09:51 |
Corey Bryant |
cloud-archive/stein: status |
Fix Committed |
Fix Released |
|
2021-02-03 22:10:33 |
Corey Bryant |
cloud-archive/rocky: status |
Fix Committed |
Fix Released |
|
2021-02-03 23:23:05 |
Corey Bryant |
tags |
regression-update verification-done verification-done-bionic verification-queens-needed verification-rocky-done verification-stein-done |
regression-update verification-done verification-done-bionic verification-queens-done verification-rocky-done verification-stein-done |
|
2021-02-04 08:36:24 |
Nikolay Vinogradov |
bug |
|
|
added subscriber Canonical Field Critical |
2021-02-04 13:15:12 |
Launchpad Janitor |
python-oslo.messaging (Ubuntu Bionic): status |
Fix Committed |
Fix Released |
|
2021-02-04 13:17:05 |
Corey Bryant |
cloud-archive/queens: status |
Fix Committed |
Fix Released |
|
2021-02-04 13:38:25 |
Nikolay Vinogradov |
removed subscriber Canonical Field Critical |
|
|
|
2021-02-04 13:43:52 |
Corey Bryant |
oslo.messaging: status |
New |
Invalid |
|
2021-02-04 21:46:23 |
Brian Murray |
summary |
[SRU] MessageTimeout and DuplicateMessage errors after udpate |
[SRU] MessageTimeout and DuplicateMessage errors after update |
|
2021-02-05 02:58:17 |
Brett Milford |
bug |
|
|
added subscriber Brett Milford |
2021-02-18 16:39:24 |
James Page |
nominated for series |
|
cloud-archive/train |
|
2021-02-18 16:39:24 |
James Page |
bug task added |
|
cloud-archive/train |
|
2021-02-19 15:57:34 |
Victor Morales |
bug |
|
|
added subscriber Victor Morales |