Recently we had an issue in our prod environment where due to a network outage a subset of active directories could not connect to the policy server ,which resulted in policy server queues going up and it reached a point where policy server stopped processing requests all together.
Although the network outage lasted for 90 seconds , policy server did not connect back to the AD's after network recovery and kept on timing out while trying to establish connections to the AD's.
We had to restart the policy server which then restored the connectivity.
With respect the socket connections on policy server we generally hover around 2500 but during that outage it spiked up to 8000.
Need expert guidance on why would policy server not restore connectivity back to the user directories even after network connectivity had restored itself. Is it a capacity issue , although our CPU usage for policy server stays at around 15-20% and our general processing is around
Average Throughput : 95 (request/sec)
Average Transaction Time: 11.555901ms
Policy Server version L R12sp3cr07