Symptom
The issue is seen when the Gateway Failover check is enabled: regardless of chassis number, the WLC in the standby role goes into standby recovery mode.
Gateway reachability remains intact from both units when the config change is applied, and the unit changes from standby recovery to standby hot as soon as the Gateway reachability check is disabled.
Reachability to the gateway was verified both by initiating ICMP pings while the unit was in standby recovery mode and from a packet capture on the uplink where the gateway is configured.
Conditions
9800 HA SSO deployment (RP+RMI) with the Gateway Failover check enabled
Workaround
Disable the Gateway Failover check
Further Problem Description
Below is the expected sequence:
When the Gateway check is enabled, the controller initiates pings to the gateway to determine its status.
This happens periodically.
If there is no response, the fail count starts increasing; it is reset as soon as any response is received.
If the fail count keeps increasing and there is no response for the configured time (8 seconds by default), the controller marks the gateway as down and sends an update to change the state to Recovery mode.
Once the ping response (reachability to the gateway) resumes, the controller sends another update (internally) to clear the fail count and bring the controller back to the Standby state.
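The expected sequence above can be sketched as a small state model. This is only an illustration, not the controller's actual implementation: the class name, the state strings, and the one-probe-per-second interval are assumptions made for the example; only the 8-second default and the reset-on-any-response behavior come from the description.

```python
from dataclasses import dataclass

# State labels are illustrative names, not controller output.
STANDBY_HOT = "standby-hot"
STANDBY_RECOVERY = "standby-recovery"


@dataclass
class GatewayMonitor:
    """Toy model of the gateway check (hypothetical, for illustration)."""
    failover_interval_s: int = 8  # default taken from the description
    probe_interval_s: int = 1     # assumption: one probe per second
    fail_count: int = 0
    state: str = STANDBY_HOT

    def on_probe_result(self, reply_received: bool) -> str:
        """Process one periodic probe result and return the resulting state."""
        if reply_received:
            # Any response clears the fail count and restores Standby.
            self.fail_count = 0
            self.state = STANDBY_HOT
        else:
            self.fail_count += 1
            # Gateway is marked down once replies have been missing
            # for the whole configured interval.
            missed_s = self.fail_count * self.probe_interval_s
            if missed_s >= self.failover_interval_s:
                self.state = STANDBY_RECOVERY
        return self.state


def simulate(results):
    """Feed a list of probe results (True = reply) to a fresh monitor."""
    monitor = GatewayMonitor()
    return [monitor.on_probe_result(r) for r in results]
```

In this model, a unit whose gateway answers every probe never leaves the Standby state, which is why entering Recovery mode while reachability is intact is unexpected.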
No gateway or RMI reachability issue is seen in the path.