DX Unified Infrastructure Management

Expand all | Collapse all

Probe errors in UIM

  • 1.  Probe errors in UIM

    Posted Aug 16, 2016 10:43 AM

    Hi All,

     

    All of sudden we noticed that alarm_enrichment queue under hub probe queued up and not processing alarms.

     

     

    we have noticed that these probes showing error and these are deployed under primary hub since it is main hub to process all alarms n data ,

     

    what actions we have to take to make all probes green , since it is effecting alarm creation in UIM alarm console. This is one we have noticed but consequences might be more and it may cause UIM application to go down as well , not sure please advise



  • 2.  Re: Probe errors in UIM

    Posted Aug 16, 2016 11:03 AM

    we have restarted NAS probe to make all probes green, but restarting NAS took long time to establish connection with other probes, To avoid this type issues in future we want some alarm from NAS probe or alarm_enrinchment probe saying that "<X> probe got queued up and not able to process even one alarm" so that we can proactively act on these issues.

     

    Can anybody from CA UIM development team from CA have any thoughts on this?

     

    I would like to highlight this CA UIM not that stable as CA SPECTRUM n eHealth tools do



  • 3.  Re: Probe errors in UIM

    Posted Aug 17, 2016 03:13 AM

    You can try the tool: QueueCheck LUA script v2.2

    his includes an option to mail you with a queue threshold

    note: corrected the link



  • 4.  Re: Probe errors in UIM

    Posted Aug 17, 2016 04:12 AM

    H Chris,

     

    Thanks , would like to review the mail option with a queue threshold from this tool , but unfortunately when I click the "QueueCheck LUA Script" link i got the below error, could you please assist how i will get this tool even i checked this google to download the same i didnt get any download link for this

     



  • 5.  Re: Probe errors in UIM

    Posted Aug 17, 2016 04:27 AM

    I updated the link



  • 6.  Re: Probe errors in UIM

    Broadcom Employee
    Posted Aug 16, 2016 12:44 PM

    Hi Rajani,

    The relevant errors are usually reported in nas.log, Furthermore, you may see some errors in data_engine.log if there were problems in synching alarms with the database. 

    Now, with regards to specific alarm you mentioned, it will be a good to raise an IDEA on this.

     

    Thanks.

     

    Kind regards

    -Sayeed



  • 7.  Re: Probe errors in UIM

    Posted Aug 17, 2016 05:08 AM

    Hi Rajani

     

    I would also recommend you redeploy java.jre from the Archive to the primary hub. Make sure it's the latest version (1.72). Then mark all the probes that are in red > right click > Security > Validate > Yes all.

     

    I hope it's helpful.

    regards.

    iulian



  • 8.  Re: Probe errors in UIM

    Posted Aug 17, 2016 11:25 AM

    Rajini,

     

    Please see this knowledge base document since it may help.

    TEC1716358    alarm_enrichment probe will not start

    http://www.ca.com/us/support/ca-support-online/product-content/knowledgebase-articles/tec1716358.aspx

     

    David



  • 9.  Re: Probe errors in UIM

    Posted Aug 19, 2016 11:54 AM

    Rajini,

     

    Did following the steps in the document get alarm_enrichment running again?

     

    David



  • 10.  Re: Probe errors in UIM

    Posted Aug 19, 2016 12:01 PM

    Hi Mike,

     

    Restarting nas and hub probe under primary hub server worked, since it is production environment alarm_enrichment document was not followed yet



  • 11.  Re: Probe errors in UIM

    Posted Aug 19, 2016 12:13 PM

    Rajini,

     

    OK, so when you want to get alarm_enrichment working again just follow the steps.

     

    David



  • 12.  Re: Probe errors in UIM

    Posted Aug 19, 2016 12:16 PM

    yes sure mike!! I will do apply the steps and post the output here for future reference