R2.0 Centos6.5-havana-build-16: contrail-control core during control node restart
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Juniper Openstack | Status tracked in Trunk | | |
R2.0 | Fix Released | High | Nischal Sheth |
Trunk | Fix Released | High | Nischal Sheth |
Bug Description
Test case : test_control_
sanity console: http://
Complete logs and cores location: /cs-shared/
Bt:
===
Core was generated by `/usr/bin/
Program terminated with signal 11, Segmentation fault.
#0 0x0000000000757d6d in BgpPath:
#0 0x0000000000757d6d in BgpPath:
#1 0x00000000007f594d in BgpXmppMessage:
#2 0x00000000007f483b in BgpXmppMessage:
#3 0x00000000007f4e74 in BgpXmppMessage:
#4 0x00000000007f39f0 in BgpXmppMessage:
#5 0x00000000007f5a17 in BgpXmppMessageB
#6 0x0000000000724c23 in RibOutUpdates:
#7 0x000000000072504a in RibOutUpdates:
#8 0x00000000007b5427 in SchedulingGroup
#9 0x00000000007b8157 in SchedulingGroup
#10 0x0000000000c9fbda in TaskImpl::execute() ()
#11 0x00002b87bd0dc18a in tbb::internal:
child=
at /ecbuilds/
#12 0x00002b87bd0d3033 in tbb::internal:
at /ecbuilds/
#13 0x00002b87bd0d1906 in tbb::internal:
j=...)
at /ecbuilds/
#14 0x00002b87bd0cc4bc in tbb::internal:
this=0x1930600)
at /ecbuilds/
#15 0x00002b87bd0cc362 in tbb::internal:
arg=0x1930600)
at /ecbuilds/
#16 0x00002b87bce939d1 in start_thread () from /lib64/
#17 0x00002b87bdd8fb5d in clone () from /lib64/libc.so.6
description: | updated |
summary: |
- R2.0 Centos6.5-havana-build-16: contrail-contol core during control node restart
+ R2.0 Centos6.5-havana-build-16: contrail-control core during control node restart
tags: | added: blocker |
Changed in juniperopenstack: | |
milestone: | r2.0-fcs → none |
information type: | Proprietary → Public |
Changed in juniperopenstack: | |
assignee: | nobody → Nischal Sheth (nsheth) |
status: | New → In Progress |
no longer affects: | juniperopenstack/r2.1 |
Hi,
From the core analysis, it looks like the control-node crashed while trying to
access a deleted path data structure. It is not clear, though, why that happened.
I have checked with Shaju. Apparently this does not happen consistently, and
many regressions have passed since. We have to continue debugging, but this need
not be a blocker for R2.0, IMO.
I also checked: the recent ribout change is not part of this build, so that
change could not have caused this either. Not much has changed in the BGP code
in the last 2 months, so this is more likely a corner-case bug.
Regards,
Ananth