Symptom
C9407-R/sup-1/ 16.6.4
At certain conditions, when standby supervisor is reloaded, then standby sup fails to join the SSO. It gets stuck in "standby cold-config". It reloads automatically after 20 odd minutes with below error and then get stuck in "standby cold-config" again, the cycle repeats unless the active is reloaded.
*Sep 29 14:44:40: %RF-3-NOTIF_TMO: Notification timer Expired for RF Client: NGMOD HMS RF client(10101)
*Sep 29 14:44:40: %RF-3-NOTIF_TMO: Notification timer Expired for RF Client: NGMOD HMS RF client(10101)
*Sep 29 14:44:43: %CMRP-6-RP_SB_RELOAD_REQ: R1/0: cmand: Reloading Standby RP: initiated by RF reload message
*Sep 29 14:44:43: %IOSXE_OIR-6-OFFLINECARD: Card (rp) offline in slot R0
*Sep 29 14:44:44: %REDUNDANCY-3-STANDBY_LOST: Standby processor fault (PEER_NOT_PRESENT)
*Sep 29 14:44:44: %REDUNDANCY-3-STANDBY_LOST: Standby processor fault (PEER_DOWN)
*Sep 29 14:44:44: %REDUNDANCY-3-STANDBY_LOST: Standby processor fault (PEER_REDUNDANCY_STATE_CHANGE)
*Sep 29 14:44:45: %RF-5-RF_RELOAD: Peer reload. Reason: EHSA standby down
*Sep 29 14:44:45: %IOSXE_REDUNDANCY-6-PEER: Active detected switch -1 as standby.
*Sep 29 14:48:08: %IOSXE_OIR-6-ONLINECARD: Card (rp) online in slot R0
Note the followings-
- At this time of issue, all line cards are in okay status (show module & show platform) .
There is no OIR related error or traceback associated.
- The issue is rare and not consistently seen.
Further Problem Description
A reload of Switch clears the issue.