APM 10 - Is anyone using APM10 in their production environment with 100+ agents?
I saw no answer to this so I am marking as assumed answered. Many customers have or about to convert to APM 10 so there should be "at least one or two" out there. Perhaps your Account Team can help on this as well.
using with 1000 + agents.
I have also converted to discussion for other community users to respond.
The answer to this question relates more to the actual metric load rather than agent load (some agents may be more metric prolific that others) and sizing your cluster correctly i.e. sizing each Collector to handle the required metric/agent load (max of 10 Collectors). Here is the relevant section of the latest 10.7 docops:
CA APM Sizing and Performance - CA Application Performance Management - 10.7 - CA Technologies Documentation
Hope that helps
We had over 500 application agents, about 300 epagent, and 16 IBM MQ agents in our CA APM 10.0 environment. The MQ agents were the most metrics out of the groups with our application agents producing from 2000 to 6000 metrics, our epagent producing around 120 metrics while our MQ agents had a spread of 5000 to 39,000 metrics each. We had about 900k metrics on seven collectors, postgres database, all on virtual servers running SuSE.
The 10.0 environment is slowly being phased out with a 10.5.2 on RHEL and hopefully by the end of the month, we will decommission our 10.0 environment.
The big things that helped was to insure that our servers had direct read/write to the disk, DASD instead of having the OS cache both. This helped with the harvest and smartstor duration.
We had over 150k metrics on each of the collectors with a target goal of no more than 180 to 200k metrics so that if we lose two of our collectors, the five remaining could keep up.
Hope this helps,
We do have many customers using this type of configuration/load
Could you provide more details of your question and configuration please?
Are you facing any issue on the Agents, collectors, cluster? Are you talking about 10.0 or a 10.x release? Can you describe the issues?
We need as much information as possible to be able to help you,
Retrieving data ...