agent asserts @ vn->GetName()) == 0
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Juniper Openstack | Status tracked in Trunk | | |
R2.20 | Fix Committed | High | Ashok Singh |
R2.21.x | Fix Committed | High | Ashok Singh |
R2.22.x | Fix Committed | High | Ashok Singh |
R3.0 | Fix Committed | High | Ashok Singh |
R3.0.2.x | Fix Committed | High | Ashok Singh |
Trunk | Fix Committed | High | Ashok Singh |
Bug Description
We have 40K VMIs spread over 100 agents, and each VMI has two IPs, each from a different VN.
In total: 80K VNs and 40K VMIs across 100 agents.
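As a rough sense of the per-agent load these figures imply, here is a minimal sketch. It assumes the VMIs are spread evenly across agents and that each VN backs exactly one VMI IP; the report states neither, though the second is suggested by 80K VNs for 40K VMIs with two IPs each. The function name is illustrative only.

```python
# Rough per-agent load implied by the figures in the description.
# Assumption: VMIs are evenly distributed across agents (not stated in the report).
# Assumption: each VN is used by exactly one VMI IP (implied by 80K VNs for 80K IPs).
TOTAL_VMIS = 40_000
TOTAL_AGENTS = 100
IPS_PER_VMI = 2  # each IP comes from a different VN


def per_agent_load(total_vmis, total_agents, ips_per_vmi):
    """Return (VMIs per agent, VNs per agent) under the assumptions above."""
    vmis = total_vmis // total_agents
    vns = vmis * ips_per_vmi
    return vmis, vns


vmis_per_agent, vns_per_agent = per_agent_load(TOTAL_VMIS, TOTAL_AGENTS, IPS_PER_VMI)
print(vmis_per_agent, vns_per_agent)  # prints "400 800"
```

Under those assumptions each agent holds roughly 400 VMIs and 800 VNs, so a restart of both XMPP peers affects on the order of hundreds of VN entries per agent at once.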
At this scale, when the control process was restarted (bringing down both XMPP peers), we observed agent cores on many of the agents. After a while things settle down and the cluster becomes stable.
The backtrace is below, and the core will be copied to /auto/cores/<bugid>
2016-06-27 21:38:32:453061: [root@10.84.25.20] out: [Thread debugging using libthread_db enabled]
2016-06-27 21:38:32:754443: [root@10.84.25.20] out: Using host libthread_db library "/lib/x86_
2016-06-27 21:38:32:754579: [root@10.84.25.20] out: Core was generated by `/usr/bin/
2016-06-27 21:38:33:118962: [root@10.84.25.20] out: Program terminated with signal SIGABRT, Aborted.
2016-06-27 21:38:33:119100: [root@10.84.25.20] out: #0 0x00007f3011a39cc9 in __GI_raise (sig=sig@entry=6) at ../nptl/
2016-06-27 21:38:33:150948: [root@10.84.25.20] out: 56 ../nptl/
2016-06-27 21:38:33:151086: [root@10.84.25.20] out: #0 0x00007f3011a39cc9 in __GI_raise (sig=sig@entry=6) at ../nptl/
2016-06-27 21:38:33:151195: [root@10.84.25.20] out: #1 0x00007f3011a3d0d8 in __GI_abort () at abort.c:89
2016-06-27 21:38:33:151283: [root@10.84.25.20] out: #2 0x00007f3011a32b86 in __assert_fail_base (fmt=0x7f3011b83830 "%s%s%s:%u: %s%sAssertion `%s' failed.\n
%n", assertion=
_base.cc", line=line@
2016-06-27 21:38:33:167045: [root@10.84.25.20] out: #3 0x00007f3011a32c32 in __GI___assert_fail (assertion=
== 0", file=0x127ee78 "controller/
2016-06-27 21:38:33:599229: [root@10.84.25.20] out: #4 0x0000000000e43c37 in VnUveTableBase:
2016-06-27 21:38:33:799814: [root@10.84.25.20] out: #5 0x00000000010a767a in operator() (a1=0x7f2fd4a96928, a0=0x7f2ffc524c20, this=0x7f30094b
2016-06-27 21:38:33:799954: [root@10.84.25.20] out: #6 RunNotify (entry=
2016-06-27 21:38:33:800063: [root@10.84.25.20] out: #7 DBTableBase:
2016-06-27 21:38:33:864033: [root@10.84.25.20] out: #8 0x00000000010a9d68 in DBTablePartBase
2016-06-27 21:38:33:895896: [root@10.84.25.20] out: #9 0x00000000010a5e70 in DBPartition:
2016-06-27 21:38:34:060009: [root@10.84.25.20] out: #10 0x00000000011b886f in TaskImpl::execute (this=0x7f300b2
2016-06-27 21:38:34:060146: [root@10.84.25.20] out: #11 0x00007f3012608b3a in ?? () from /usr/lib/
2016-06-27 21:38:34:060251: [root@10.84.25.20] out: #12 0x00007f3012604816 in ?? () from /usr/lib/
2016-06-27 21:38:34:060337: [root@10.84.25.20] out: #13 0x00007f3012603f4b in ?? () from /usr/lib/
2016-06-27 21:38:34:061749: [root@10.84.25.20] out: #14 0x00007f30126000ff in ?? () from /usr/lib/
2016-06-27 21:38:34:065267: [root@10.84.25.20] out: #15 0x00007f30126002f9 in ?? () from /usr/lib/
2016-06-27 21:38:34:065604: [root@10.84.25.20] out: #16 0x00007f3012824182 in start_thread (arg=0x7f30094b
2016-06-27 21:38:34:072979: [root@10.84.25.20] out: #17 0x00007f3011afd47d in clone () at ../sysdeps/
2016-06-27 21:38:34:073131: [root@10.84.25.20] out:
tags: | added: quench2 |
Review in progress for https://review.opencontrail.org/21749
Submitter: Ashok Singh (<email address hidden>)