Info
HPE SR Gen10 Plus and Gen11 controllers may stop responding and randomly reboot. On reboot, the controller will display the lockup reason code.
When this issue is encountered, depending on the AMD or Intel platform, the following messages will appear:
On an AMD Server (Note: "Slot 12" may vary)
The controller lockup message is seen in the Integrated Management
Log (IML), and it may or may not accompany with UMCE:
1901-Slot 12 Smart Array - Controller failed
on previous power-up due to lock up code 0x1084.
Action: 1. Update the controller to the latest firmware
version. 2. If the issue persists, replace the controller.
On an Intel Server
The IML message or lockup message may not be seen during POST, but the message below is seen in dmesg of the Linux CVM:
[24.734207] smartpqi 0000:00:08.0: controller is offline: status code 0x1084
Scope
This advisory applies to the following HPE SR controllers:
HPE SR416i-a Gen10 Plus x16 Lanes 4GB Cache NVMe/SAS 24G Controller (P12688-B21)
HPE SR932i-p Gen10 Plus x32 Lanes 8GB Wide Cache NVMe/SAS 24G PCIe4 x16 Controller (P04220-B21)
HPE SR932i-p Gen11 x32 Lanes 8GB Wide Cache PCI SPDM Plug-in Storage Controller (P47184-B21)
HPE SR416ie-m Gen11 x16 Lanes 4GB Cache SPDM Mezzanine Storage Controller (P39959-B21)
Resolution
The issue is resolved by updating to firmware version 3.1.26.036, available at the following URL:
Firmware Package - HPE SR932i-p Gen10 Plus /SR416i-a Gen10 Plus/SR932i-p Gen11/SR416ie-m Gen11 Controllers
Revision History
Document Version
Release Date
Details
2
August 7, 2024
Added SR932i-p Gen11 and SR416ie-m Gen11 controllers to the affected products and body of this document
1
March 28, 2024
Original Document Release