Is it possible to delay a failover back to an original server? IE, 'once you've failed over to the secondary, wait at least 2 minutes before failing back'?
We recently had a issue where there was a brief memory spike that caused the primary to stop working. The system immediately failed things over to the secondary (good). 20 seconds later the secondary noticed the primary was OK, so it tried to fail back over to the primary server. Even though it was responding, the primary wasn't quite ready (still investigating why but i think it had to do with copying over changes, etc).
As a result the secondary switched back to a secondary mode, thinking the primary was good. But the primary was stuck in a not synching mode and was't actually working correctly. To fix things we ended up having to reboot the primary server.
So a thought popped up: Can we delay a failback for a minute or two, to prevent sudden changes.
Is this possible?
Thanks