How resilient is your Automation Engine?

Discussion created by Keld_Mollnitz on Jun 9, 2016
Latest reply on Oct 11, 2016 by KarthikMalali604894
Hi all,

If your Automation Engine is a critical component for you, then you have probably set up your AE in a way that minimizes the risk of a total system outage. There are many ways to accomplish this, for instance by distributing the CP and WP Server Processes over several nodes, or by installing the AE in a Cluster. If you have a distributed setup, hopefully you have remembered to double the number of Server Processes, so that one node can handle the load of the other if it fails.

If you have more than one Data Center, you could place one AE node in each Data Center. To further increase availability, you could have two AE nodes in each Data Center, so that the one remaining AE node does not become a single point of failure if one Data Center fails.

In my company we have a Resiliency Framework, and we have defined several Resiliency Levels. Every System and Application is mapped to the Resiliency Level that matches its criticality. If your System or Application is categorized as Mission/Business Critical, then you cannot have a single point of failure: every component (AE, SAN, Network devices, Database...) needs to be not only doubled, but quadrupled!

I wonder if anyone else is working with these kinds of challenges? How resilient is your Automation Engine?

Regards,
Keld