Watch Out for Drive Endurance on Crucial (formerly Micron) MX500 Series SATA SSDs

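In 2019 I bought a Crucial MX500 500GB SSD on JD.com and dropped it into the tower server in my rack as the Proxmox system disk. Some of the VMs also used this drive. This year I started tinkering with k3s and used the same drive as its system disk. I did not expect that, once k3s had a pile of things installed on it, disk I/O would reach 5 MB/s, and I did not pay much attention to it at the time.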




SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
  1 Raw_Read_Error_Rate     0x002f   100   100   000    Pre-fail  Always       -       3
  5 Reallocate_NAND_Blk_Cnt 0x0032   100   100   010    Old_age   Always       -       1
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       12066
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       101
171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
173 Ave_Block-Erase_Count   0x0032   003   003   000    Old_age   Always       -       1462
174 Unexpect_Power_Loss_Ct  0x0032   100   100   000    Old_age   Always       -       33
180 Unused_Reserve_NAND_Blk 0x0033   000   000   000    Pre-fail  Always       -       45
183 SATA_Interfac_Downshift 0x0032   100   100   000    Old_age   Always       -       3
184 Error_Correction_Count  0x0032   100   100   000    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   046   031   000    Old_age   Always       -       54 (Min/Max 0/69)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       1
197 Current_Pending_ECC_Cnt 0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       170
202 Percent_Lifetime_Remain 0x0030   003   003   001    Old_age   Offline      -       97
206 Write_Error_Rate        0x000e   100   100   000    Old_age   Always       -       0
210 Success_RAIN_Recov_Cnt  0x0032   100   100   000    Old_age   Always       -       3
246 Total_LBAs_Written      0x0032   100   100   000    Old_age   Always       -       115471620952
247 Host_Program_Page_Count 0x0032   100   100   000    Old_age   Always       -       3129192176
248 FTL_Program_Page_Count  0x0032   100   100   000    Old_age   Always       -       20667757649

SMART Error Log Version: 1
No Errors Logged

The attribute to look at is 202 Percent_Lifetime_Remain 0x0030 003 003 001 Old_age Offline - 97. Intuition told me that the drive should still have 97 of its endurance left.
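Each attribute row in the output above carries two different numbers: a normalized value (003 here) and a raw value (97 here). The following is a minimal sketch for pulling them apart, assuming the usual smartctl column order (ID, name, flag, VALUE, WORST, THRESH, type, updated, when-failed, raw value). The names SmartAttr and parse_attr are hypothetical helpers invented for this illustration, not part of any tool.

```python
from typing import NamedTuple


class SmartAttr(NamedTuple):
    attr_id: int
    name: str
    value: int   # normalized VALUE column
    worst: int   # normalized WORST column
    thresh: int  # THRESH column
    raw: str     # raw value, kept as text because it can hold extras like "54 (Min/Max 0/69)"


def parse_attr(line: str) -> SmartAttr:
    """Split one attribute row into its columns (assumed smartctl layout)."""
    parts = line.split(None, 9)  # at most 10 fields; the raw value keeps its spaces
    if len(parts) < 10:
        raise ValueError(f"not an attribute row: {line!r}")
    return SmartAttr(
        attr_id=int(parts[0]),
        name=parts[1],
        value=int(parts[3]),
        worst=int(parts[4]),
        thresh=int(parts[5]),
        raw=parts[9].strip(),
    )


# Row copied from the output above.
row = "202 Percent_Lifetime_Remain 0x0030   003   003   001    Old_age   Offline      -       97"
attr = parse_attr(row)
print(f"{attr.name}: normalized={attr.value} thresh={attr.thresh} raw={attr.raw}")
```

Reading the normalized column next to the raw one makes it harder to mistake one for the other.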



root@pve:~# msecli -L

Device Name          : /dev/sda
Model No             : CT500MX500SSD1
Serial No            : 1911E1F2550D
FW-Rev               : M3CR023
Total Size           : 500.00GB
Drive Status         : Attention! The Drive is Approaching the end of the Specified Lifetime. Prolonged usage will invalidate the warranty
Sata Link Speed      : Gen3 (6.0 Gbps)
Sata Link Max Speed  : Gen3 (6.0 Gbps)
Temp(C)              : 54




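In the SMART data reported by the official tool msecli, however, Percentage Lifetime Remaining is only 3. That matches the normalized value 003 in the smartctl output, not the raw value 97, and it means the drive could very well be dead the next time it powers on.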

root@pve:~# msecli -S

Device Name  : /dev/sda
 ID  Attribute Name                Attribute Data Units
 1   Raw Read Error Rate           3		Errors/Page
 5   Reallocated NAND Block Count  1		NAND Blocks
 9   Power On Hours Count          12066	Hours
 12  Power Cycle Count             101		Power Cycles
 171 Program Fail Count            0		NAND Page Program Failures
 172 Erase Fail Count              0		NAND Block Erase Failures
 173 Block Wear-Leveling Count     1462		Erases
 174 Unexpected Power Loss Count   33		Unexpected Power Loss events
 180 Unused Reserved Block Count   45		Blocks
 183 SATA Interface Downshift      3		Downshifts
 184 Error Correction Count        0		Correction Events
 187 Reported Uncorrectable Errors 0		ECC Correction Failures
 194 Enclosure Temperature         55		Current Temperature (C)
                                   69		Highest Lifetime Temperature (C)
 196 Reallocation Event Count      1		Events
 197 Current Pending ECC Count     0		ECC Counts
 198 SMART Off-line Scan           0		Errors
     Uncorrectable Errors
 199 Ultra-DMA CRC Error Count     170		Errors
 202 Percentage Lifetime Remaining 3		% Lifetime Remaining
 206 Write Error Rate              0		Program Fails/MB
 210 RAIN Successful Recovery      3		TUs successfully recovered by
     Page Count                                 RAIN
 246 Cumulative Host Write         115472345976	512 Byte Sectors
     Sector Count
 247 Host Program Page Count       3129465248	NAND Page
 248 FTL Program Page Count        20668114065	NAND Page
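To put the write counters above in perspective, here is a rough worked calculation. It assumes 512-byte sectors for attribute 246, as the msecli units column states. The write amplification formula (attribute 247 plus attribute 248, divided by attribute 247) is the one commonly cited for Micron/Crucial drives. It is an assumption brought in for this sketch, not something the original output states.

```python
SECTOR_BYTES = 512  # unit given in the msecli output for attribute 246

# Values copied from the msecli output above.
host_sectors_written = 115_472_345_976  # 246 Cumulative Host Write Sector Count
host_program_pages = 3_129_465_248      # 247 Host Program Page Count
ftl_program_pages = 20_668_114_065      # 248 FTL Program Page Count

# Data the host asked the drive to write, in decimal terabytes.
host_tb = host_sectors_written * SECTOR_BYTES / 1e12

# Assumed formula: NAND pages actually programmed per page the host wrote.
waf = (host_program_pages + ftl_program_pages) / host_program_pages

# Estimated data physically written to NAND.
nand_tb = host_tb * waf

print(f"host writes:         {host_tb:.1f} TB")
print(f"write amplification: {waf:.2f}")
print(f"estimated NAND writes: {nand_tb:.0f} TB")
```

Under these assumptions the drive has taken roughly 59 TB of host writes at a write amplification factor of about 7.6, so the flash itself has absorbed several times more data than the host counter alone suggests.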









