BugZero | Cisco BugID CSCwb98743 - FN72464: Some DIMMs failing at higher than expecte...

OPERATIONAL DEFECT DATABASE

Cisco - Defect ID: CSCwb98743

FN72464: Some DIMMs failing at higher than expected rate

Last updated on August 20th, 2025

BugZero Risk Score
5.9 Medium

Overall: 5.9

Severity: 6.4

Lifecycle: 4.0

Popularity: 6.9

Bug Details

Description

Symptom

Certain DIMMs from a specific manufacturing lot (specific date codes only) fail at a higher rate than expected. The most common failure symptom is a significant number of single-bit (correctable) errors. If left untreated, the DIMM may be at higher risk of multi-bit (uncorrectable) errors during runtime.

On NXOS devices, single-bit correctable errors are logged with the following messages:

%DEVICE_TEST-3-MCE_24HR_FAIL: Module 1 has exceeded MCE 24 hour correctable threshold of 100 with ##### correctable errors within 24 hours.

or

%DAEMON-3-SYSTEM_MSG: corrected Socket memory error count exceeded threshold: ####### in 24h - mcelog

On ACI devices, the impacted DIMM can be found in /mnt/pss/bootlogs/current/dmesg or in the output of the "dmesg" command. For example, the logs below confirm that a DIMM is bad and show that DIMM-0 is the bad one:

[ 167.751610] sbridge: HANDLING MCE MEMORY ERROR
[ 167.751614] CPU 0: Machine Check Exception: 0 Bank 7: 8c00004000010091
[ 168.415928] EDAC MC0: 1 CE memory read error on CPU_SrcID#0_Channel#1_DIMM#0 (channel:1 slot:0 page:0x53232 offset:0xfc0 grain:32 syndrome:0x0 - area:DRAM err_code:0001:0091 socket:0 channel_mask:2 rank:0)
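The log signatures quoted above can be searched for in collected syslog or dmesg text. The following is a minimal Python sketch, not a Cisco tool. The regular expressions are modelled only on the sample messages in this entry, and they assume the "#####" placeholders stand for decimal counts. The function name scan, the field names (module, threshold, count, mc, errors, src_id, channel, dimm), and the count of 250 in the sample input are all hypothetical.

```python
import re

# Patterns modelled on the sample messages quoted in the Symptom text.
# Assumption: the '#####' placeholders in those messages are decimal counts.
NXOS_MCE = re.compile(
    r"%DEVICE_TEST-3-MCE_24HR_FAIL: Module (?P<module>\d+) has exceeded "
    r"MCE 24 hour correctable threshold of (?P<threshold>\d+) with "
    r"(?P<count>\d+) correctable errors"
)
NXOS_MCELOG = re.compile(
    r"%DAEMON-3-SYSTEM_MSG: corrected Socket memory error count exceeded "
    r"threshold: (?P<count>\d+) in 24h"
)
EDAC_CE = re.compile(
    r"EDAC MC(?P<mc>\d+): (?P<errors>\d+) CE memory read error on "
    r"CPU_SrcID#(?P<src_id>\d+)_Channel#(?P<channel>\d+)_DIMM#(?P<dimm>\d+)"
)

PATTERNS = (
    ("nxos_mce", NXOS_MCE),
    ("nxos_mcelog", NXOS_MCELOG),
    ("edac_ce", EDAC_CE),
)


def scan(lines):
    """Return (kind, fields) for each line that matches a known signature."""
    hits = []
    for line in lines:
        for kind, pattern in PATTERNS:
            match = pattern.search(line)
            if match:
                # Every captured group is a run of digits, so int() is safe.
                fields = {k: int(v) for k, v in match.groupdict().items()}
                hits.append((kind, fields))
                break  # at most one signature per line
    return hits


if __name__ == "__main__":
    sample = [
        # Illustrative NXOS line; the count 250 is made up for this example.
        "%DEVICE_TEST-3-MCE_24HR_FAIL: Module 1 has exceeded MCE 24 hour "
        "correctable threshold of 100 with 250 correctable errors within 24 hours.",
        # EDAC line taken from the dmesg excerpt in this entry (shortened).
        "[ 168.415928] EDAC MC0: 1 CE memory read error on "
        "CPU_SrcID#0_Channel#1_DIMM#0 (channel:1 slot:0 page:0x53232)",
    ]
    for kind, fields in scan(sample):
        print(kind, fields)
```

A match only shows that correctable errors were logged. Whether the DIMM falls in the affected date codes still has to be checked against the field notice.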

Conditions

This issue impacts a subset of DIMMs within a certain date range. Even inside this date range, not all DIMMs are impacted.

Workaround

This is a hardware error. No software workarounds are available to address this issue.

Further Problem Description

Impacted devices:

N9K family of switches running NXOS or ACI

APIC family:
APIC-SERVER-L3
APIC-SERVER-M3

Please see the following document for additional information: https://www.cisco.com/c/en/us/support/docs/field-notices/724/fn72464.html

Change history

2025-08-11 Added: 9.3(9)
