Build 42 : Alarms: Due to soaking, process status alarm not getting raised when different process of same type goes down

Bug #1585989 reported by Ankit Jain
14
This bug affects 2 people
Affects Status Importance Assigned to Milestone
Juniper Openstack
Status tracked in Trunk
R3.0
Fix Committed
High
Nikhil Bansal
Trunk
Fix Committed
High
Nikhil Bansal

Bug Description

Steps:
1) stop contrail-snmp-collector -> alarm raised after 10 sec due to soaking as expected
2) Once alarm for the process in step 1 becomes active, start contrail-snmp-collector and stop contrail-topology

Observation: contrail-snmp-collector goes for Soak_clear but after timer expiry, alarm for this does not get cleared, and also alarm for the new process contrail-topology does not get raised.

http://nodeb8:8081/analytics/alarms
{

    "analytics-node": [
        {
            "name": "nodeb8",
            "value": {
                "UVEAlarms": {
                    "alarms": [
                        {
                            "any_of": [
                                {
                                    "all_of": [
                                        {
                                            "json_operand1_value": "\"PROCESS_STATE_STOPPED\"",
                                            "rule": {
                                                "oper": "!=",
                                                "operand1": {
                                                    "keys": [
                                                        "NodeStatus",
                                                        "process_info",
                                                        "process_state"
                                                    ]
                                                },
                                                "operand2": {
                                                    "json_value": "\"PROCESS_STATE_RUNNING\""
                                                }
                                            },
                                            "json_vars": {
                                                "NodeStatus.process_info.process_name": "contrail-topology"
                                            }
                                        }
                                    ]
                                }
                            ],
                            "severity": ​3,
                            "ack": false,
                            "timestamp": ​1464258398170267,
                            "token": "eyJ0aW1lc3RhbXAiOiAxNDY0MjU4Mzk4MTcwMjY3LCAiaHR0cF9wb3J0IjogNTk5NSwgImhvc3RfaXAiOiAiMTAuMjA0LjIxNi4yMDEifQ==",
                            "type": "ProcessStatus"
                        }
                    ]
                }
            }
        }
    ]

}

Because of the same, AnalyticsTestSanity.test_analytics_alarms has failed.

Revision history for this message
OpenContrail Admin (ci-admin-f) wrote : [Review update] R3.0

Review in progress for https://review.opencontrail.org/20666
Submitter: Nikhil Bansal (<email address hidden>)

Changed in juniperopenstack:
importance: Undecided → High
Revision history for this message
OpenContrail Admin (ci-admin-f) wrote :

Review in progress for https://review.opencontrail.org/20689
Submitter: Nikhil Bansal (<email address hidden>)

Revision history for this message
OpenContrail Admin (ci-admin-f) wrote : A change has been merged

Reviewed: https://review.opencontrail.org/20689
Committed: http://github.org/Juniper/contrail-controller/commit/be02b52bf0a46a1bf336a0eb577dfb7421083ab9
Submitter: Zuul
Branch: R3.0

commit be02b52bf0a46a1bf336a0eb577dfb7421083ab9
Author: Nikhil B <email address hidden>
Date: Fri May 27 09:10:30 2016 +0530

Alarm update was not being sent in some cases

There were some cases where alarm update was not being sent from state machine
Closes-Bug: #1585989

Change-Id: Ie35f4334fb80fa302d3c6346c17fda20d246fd6e

Revision history for this message
OpenContrail Admin (ci-admin-f) wrote : [Review update] master

Review in progress for https://review.opencontrail.org/20903
Submitter: Nikhil Bansal (<email address hidden>)

Revision history for this message
OpenContrail Admin (ci-admin-f) wrote : A change has been merged

Reviewed: https://review.opencontrail.org/20903
Committed: http://github.org/Juniper/contrail-controller/commit/f35bf9f17f7621321f3713ef2df9419e72c22a07
Submitter: Zuul
Branch: master

commit f35bf9f17f7621321f3713ef2df9419e72c22a07
Author: Nikhil B <email address hidden>
Date: Fri May 27 09:10:30 2016 +0530

Alarm update was not being sent in some cases

There were some cases where alarm update was not being sent from state machine
Closes-Bug: #1585989

Change-Id: Ie35f4334fb80fa302d3c6346c17fda20d246fd6e
(cherry picked from commit be02b52bf0a46a1bf336a0eb577dfb7421083ab9)

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.