Symptoms
Issue
iDRAC7 and iDRAC8 controllers that are being polled by SNMP community names may stop reporting status for some temperature probe and voltage sensors.Systems Management Applications that leverage SNMP for monitoring system health are showing SNMP response errors collecting Temperature Probe Status updates.There have been multiple reports that when using Nagios Systems Management Consoles for monitoring Dell iDRAC, the controller is showing an "UNKNOWN" status for "Dell Server Temperature Probe Status" and/or "Dell Server Voltage Probe Status".
ERROR: SNMP: No response from remote host
In some instances, iDRAC Web UI may report a RAC0508 Error Message on the landing page Temperature Probes web page after the SNMP failure sighting.
Temperature : RAC0508: An unexpected error occured. Wait for few minutes and refresh the page. If the problem persists, contact service provider.
The command
Racadm getsensorinfo
may not report all temperators sensor values on affected iDRACs when in this state. When walking the SNMP OIDs of the iDRAC, the OIDs for some temperature sensors may be missing.
[R/W] [R/W]
[Key = iDRAC.Embedded.1#CPU1Temp]
CPU1 Temp Ok 56C 3C 88C 8C [N] 83C [N]
Solution
Engineering is aware of this issue. This is not hardware related. iDRAC firmware 2.40.40.40 will contain a code change to resolve this issue. This release will be web-posted in Q3FY17 (October) timeframe. In the meantime, performing a racreset on the iDRAC will temporarily solve the issue.