BugZero | Hewlett Packard Enterprise BugID a00129624en_us - Notice: HPE ConvergedSystem 750

Hewlett Packard Enterprise - Defect ID: a00129624en_us

Notice: HPE ConvergedSystem 750 - VMware vSAN 6.7 May Mistakenly Report a Drive in a Disk Group With a Permanent Error

Hewlett Packard Enterprise - Defect ID: a00129624en_us

Notice: HPE ConvergedSystem 750 - VMware vSAN 6.7 May Mistakenly Report a Drive in a Disk Group With a Permanent Error

Last updated on 2/21/2023

Overall: 0N/A

Severity: 0N/A

Community: 0N/A

Lifecycle: 0N/A

What is the BugZero Risk Score?

Vendor details

Priority: Customer Notice

Overall: 0N/A

Severity: 0N/A

Community: 0N/A

Lifecycle: 0N/A

What is the BugZero Risk Score?

Vendor details

Priority: Customer Notice

Info

An issue exists where any SATA/SAS/NVMe SSD drive configured in a VMware All-Flash vSAN disk group may be mistakenly reported as failed and marked by vSAN as having a permanent error. This is due to the Medium Errors being continuously reported after multiple attempts to remap the bad area by the ESXi operating system. The SMART data will be retrieved directly from the SSD device and show the drive has available spare space to remap bad areas on the drive. vSAN 6.7 may not allow the recovery of a single Unrecoverable Read Error (URE), when it occurs in the metadata regions of an all flash vSAN disk group, without removing the disk group from the vSAN first. Depending on the version of ESXi and features enabled, the host may perform an "autoDG" creation operation on the failed disk group in an attempt to repair the bad area on a disk group. As a result, a drive may be reported as failed after multiple attempts to repair the drive using the "autoDG" operation. This may happen because of how vSAN interacts with various vendor drives in the handling of the 5-10% area used for metadata operations. Based on VMware KB 81121 , an autoDG creation feature runs a TRIM utility, and by default TRIM only runs on the first 5-10% of the metadata region. If the bad area is beyond the 5-10% on the drive, the bad area will not be remapped, causing premature replacement of the drive.

Scope

None

Resolution

None

Original Vendor Announcement

Defect ID: a00138717en_us
Advisory: (Revision) HPE Compute Scale-up Server 3200 - System May Encounter an HWERR_BIOS_HALT_DETECTED Condition During the OS Crashdump Process
Defect ID: a00142136en_us
Advisory: (Revision) HPE ProLiant DL20/ML30 Gen10 Plus Servers - Systems Configured with Intel I350-T4 or Broadcom BCM5719 Adapters May Stop Responding During a Reboot or Shutdown if All Four NIC Ports Are Disabled
Defect ID: a00119124en_us
Notice: (Revision) HPE B-series Switches - Accessing HPE B-series SANnav, Fabric OS, and TruFOS Certificates
Defect ID: a00118860en_us
Advisory: HPE InfoSight for Servers - Manually Uploaded Active Health System (AHS) Log to the Analyze Log Page Is Not Displayed After a Successful Upload to the InfoSight Portal
Defect ID: a00146525en_us
Advisory: HPE OneView - OneView May Display the Error, "Unable to Create Volume Template Error Regarding Read-Only Attribute"

Ready to prevent the next vendor outage?

Get a demo

OPERATIONAL DEFECT DATABASE

Hewlett Packard Enterprise - Defect ID: a00129624en_us

Notice: HPE ConvergedSystem 750 - VMware vSAN 6.7 May Mistakenly Report a Drive in a Disk Group With a Permanent Error

Hewlett Packard Enterprise - Defect ID: a00129624en_us

Notice: HPE ConvergedSystem 750 - VMware vSAN 6.7 May Mistakenly Report a Drive in a Disk Group With a Permanent Error

Last updated on 2/21/2023

Vendor details

Vendor details

Description

Info

Scope

Resolution

Links

Top Hewlett Packard Enterprise defects by risk score

Ready to prevent the next vendor outage?