Hewlett Packard Enterprise (HPE) alerts its customers once again that all Serial-Attached SCSI solid-state drives will malfunction after 40,000 hours of operation, unless there is a vital patch.
The business made a similar announcement in November 2019, when a software defect after 32,768 hours of running caused a malfunction.
Drives that were affected
The current issue affects HPE hard drives and storage items such as HPE ProLiant, Synergy, Apollo 4200, Synergy Storage Modules, StoreEasy 1000 Storage, D3000 Storage Enclosure.
|HPE Model Number||HPE SKU||HPE SKU DESCRIPTION||HPE Spare Part SKU||HPE Firmware Fix Date|
|EK0800JVYPN||846430-B21||HPE 800GB 12G SAS WI-1 SFF SC SSD||846622-001||3/20/2020|
|EO1600JVYPP||846432-B21||HPE 1.6TB 12G SAS WI-1 SFF SC SSD||846623-001||3/20/2020|
|MK0800JVYPQ||846432-B21||HPE 800GB 12G SAS MU-1 SFF SC SSD||846624-001||3/20/2020|
|MO1600JVYPR||846436-B21||HPE 1.6TB 12G SAS MU-1 SFF SC SSD||846625-001||3/20/2020|
The company says this is a detailed list of SSDs it makes available that have been affected. The problem, however, is not specific to HPE, and may be present in other manufacturers ‘ drives.
If the SSD runs a firmware version older than HPD7 in the HPE products, they will fail after having been turned on for 40,000 hours; this translates into 4 years, 206 days, 16 hours which is around half a year less than the extended warranty available to others.
Once the failure point is reached, the data can not be retrieved, nor the hard drive. In environments with data backup systems, avoidance of such a catastrophe is possible.
HPE has heard of a SSD manufacturer’s firmware flaw and warns that if SSDs are installed and placed into operation at the same time they are likely to fail almost simultaneously.
“Restoration of data from backup will be required in non-fault tolerance modes (e.g., RAID 0) and in fault tolerance RAID mode if more drives fail than what is supported by the fault tolerance RAID mode logical drive [e.g. RAID 5 logical drive with two failed SSDs]” – HPE advisory
Last month, Dell EMC released new firmware to correct a bug which caused nine SanDisk SSDs to fail in its portfolio “after approximately 40,000 hours of usage.” Dell reported the following models to be affected:
The update corrects a test to log the index value for the circular buffer. “Assert had a poor check to verify the index value of the circular buffer. Instead of checking the max value as N, it searched for N-1, ‘ explains Dell’s advisory.
Customers that were shipped one or more of the affected SSD models were informed about this “potentially critical issue” with the recommendation to apply the update immediately.