Is my drive dying?

Hi. Woke up to a degraded pool. Is my HDD dying or could this be another issue? 16 drives on a 9300-16i HBA (from AliExpress) with firmware 16.00.12.00.


smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.12.15-production+truenas] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     OOS22000G
Serial Number:    00002YDW
LU WWN Device Id: 5 000c50 0e5c6e9b7
Firmware Version: OOS1
User Capacity:    22,000,969,973,760 bytes [22.0 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Form Factor:      3.5 inches
Device is:        Not in smartctl database 7.3/6083
ATA Version is:   ACS-4 (minor revision not indicated)
SATA Version is:  SATA 3.3, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Feb  3 19:00:22 2026 EET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
DSN feature is:   Disabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Write SCT (Get) Feature Control Command failed: scsi error aborted command
Wt Cache Reorder: Unknown (SCT Feature Control command failed)

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection:                (  559) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (1871) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x70bd) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     POSR--   078   064   044    -    56471664
  3 Spin_Up_Time            PO----   095   091   000    -    0
  4 Start_Stop_Count        -O--CK   100   100   020    -    405
  5 Reallocated_Sector_Ct   PO--CK   100   100   010    -    0
  7 Seek_Error_Rate         POSR--   083   060   045    -    198013900
  9 Power_On_Hours          -O--CK   086   086   000    -    12425
 10 Spin_Retry_Count        PO--C-   100   100   097    -    0
 12 Power_Cycle_Count       -O--CK   100   100   020    -    410
 18 Unknown_Attribute       PO-R--   100   100   050    -    0
187 Reported_Uncorrect      -O--CK   100   100   000    -    0
188 Command_Timeout         -O--CK   100   099   000    -    4
190 Airflow_Temperature_Cel -O---K   066   050   000    -    34 (Min/Max 33/35)
192 Power-Off_Retract_Count -O--CK   100   100   000    -    377
193 Load_Cycle_Count        -O--CK   099   099   000    -    2063
194 Temperature_Celsius     -O---K   034   050   000    -    34 (0 18 0 0 0)
197 Current_Pending_Sector  -O--C-   100   100   000    -    0
198 Offline_Uncorrectable   ----C-   100   100   000    -    0
199 UDMA_CRC_Error_Count    -OSRCK   200   200   000    -    0
200 Multi_Zone_Error_Rate   PO---K   100   100   001    -    0
240 Head_Flying_Hours       ------   100   100   000    -    12173 (237 51 0)
241 Total_LBAs_Written      ------   100   253   000    -    101856818835
242 Total_LBAs_Read         ------   100   253   000    -    899134717721
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x02           SL  R/O      5  Comprehensive SMART error log
0x03       GPL     R/O      5  Ext. Comprehensive SMART error log
0x04       GPL     R/O    256  Device Statistics log
0x04       SL      R/O      8  Device Statistics log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x08       GPL     R/O      2  Power Conditions log
0x09           SL  R/W      1  Selective self-test log
0x0a       GPL     R/W      8  Device Statistics Notification
0x0c       GPL     R/O   2048  Pending Defects log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters log
0x13       GPL     R/O      1  SATA NCQ Send and Receive log
0x21       GPL     R/O      1  Write stream error log
0x22       GPL     R/O      1  Read stream error log
0x24       GPL     R/O    768  Current Device Internal Status Data log
0x2f       GPL     R/O      1  Set Sector Configuration
0x30       GPL,SL  R/O      9  IDENTIFY DEVICE data log
0x80-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xa1       GPL,SL  VS     160  Device vendor specific log
0xa2       GPL     VS   16320  Device vendor specific log
0xa4       GPL,SL  VS     160  Device vendor specific log
0xa6       GPL     VS     192  Device vendor specific log
0xa8-0xa9  GPL,SL  VS     136  Device vendor specific log
0xab       GPL     VS       1  Device vendor specific log
0xad       GPL     VS      16  Device vendor specific log
0xb1       GPL,SL  VS     160  Device vendor specific log
0xb4       GPL,SL  VS      16  Device vendor specific log
0xb6       GPL     VS    1920  Device vendor specific log
0xbe-0xbf  GPL     VS   65535  Device vendor specific log
0xc1       GPL,SL  VS       8  Device vendor specific log
0xc3       GPL,SL  VS      24  Device vendor specific log
0xc6       GPL     VS    5184  Device vendor specific log
0xc7       GPL,SL  VS       8  Device vendor specific log
0xc9       GPL,SL  VS       8  Device vendor specific log
0xca       GPL,SL  VS      16  Device vendor specific log
0xcd       GPL,SL  VS       1  Device vendor specific log
0xce       GPL     VS       1  Device vendor specific log
0xcf       GPL     VS     512  Device vendor specific log
0xd1       GPL     VS     656  Device vendor specific log
0xd2       GPL     VS   10256  Device vendor specific log
0xd4       GPL     VS    2048  Device vendor specific log
0xda       GPL,SL  VS       1  Device vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (5 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Interrupted (host reset)      00%     12410         -
# 2  Short offline       Completed without error       00%     12406         -
# 3  Short offline       Completed without error       00%     12382         -
# 4  Short offline       Completed without error       00%     12358         -
# 5  Short offline       Completed without error       00%     12334         -
# 6  Short offline       Completed without error       00%     12310         -
# 7  Short offline       Completed without error       00%     12286         -
# 8  Extended offline    Aborted by host               80%     12279         -
# 9  Short offline       Completed without error       00%     12238         -
#10  Short offline       Completed without error       00%     12214         -
#11  Short offline       Completed without error       00%     12190         -
#12  Short offline       Completed without error       00%     12166         -
#13  Short offline       Completed without error       00%     12142         -
#14  Extended offline    Interrupted (host reset)      00%     12129         -
#15  Short offline       Completed without error       00%     12070         -
#16  Short offline       Completed without error       00%     12046         -
#17  Short offline       Completed without error       00%     12022         -
#18  Short offline       Completed without error       00%     11998         -
#19  Short offline       Completed without error       00%     11974         -

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
1        0        0  Not_testing
2        0        0  Not_testing
3        0        0  Not_testing
4        0        0  Not_testing
5        0        0  Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       522 (0x020a)
Device State:                        Active (0)
Current Temperature:                    34 Celsius
Power Cycle Min/Max Temperature:     33/35 Celsius
Lifetime    Min/Max Temperature:     17/50 Celsius
Under/Over Temperature Limit Count:   0/0
SMART Status:                        0xc24f (PASSED)
Vendor specific:
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 02 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:     2
Temperature Sampling Period:         4 minutes
Temperature Logging Interval:        59 minutes
Min/Max recommended Temperature:     10/40 Celsius
Min/Max Temperature Limit:            5/60 Celsius
Temperature History Size (Index):    128 (29)

Index    Estimated Time   Temperature Celsius
30    2026-01-29 13:09    33  **************
31    2026-01-29 14:08    34  ***************
32    2026-01-29 15:07    35  ****************
33    2026-01-29 16:06    34  ***************
...    ..(  2 skipped).    ..  ***************
36    2026-01-29 19:03    34  ***************
37    2026-01-29 20:02    33  **************
38    2026-01-29 21:01    34  ***************
39    2026-01-29 22:00    34  ***************
40    2026-01-29 22:59    33  **************
41    2026-01-29 23:58    33  **************
42    2026-01-30 00:57    34  ***************
43    2026-01-30 01:56    33  **************
...    ..(  3 skipped).    ..  **************
47    2026-01-30 05:52    33  **************
48    2026-01-30 06:51    34  ***************
49    2026-01-30 07:50    33  **************
50    2026-01-30 08:49    33  **************
51    2026-01-30 09:48    33  **************
52    2026-01-30 10:47    34  ***************
53    2026-01-30 11:46    33  **************
54    2026-01-30 12:45    34  ***************
55    2026-01-30 13:44    34  ***************
56    2026-01-30 14:43    33  **************
57    2026-01-30 15:42    33  **************
58    2026-01-30 16:41    32  *************
59    2026-01-30 17:40    33  **************
60    2026-01-30 18:39    33  **************
61    2026-01-30 19:38    33  **************
62    2026-01-30 20:37    34  ***************
63    2026-01-30 21:36    33  **************
64    2026-01-30 22:35    34  ***************
65    2026-01-30 23:34    34  ***************
66    2026-01-31 00:33    34  ***************
67    2026-01-31 01:32    33  **************
68    2026-01-31 02:31    33  **************
69    2026-01-31 03:30    34  ***************
70    2026-01-31 04:29    33  **************
71    2026-01-31 05:28    33  **************
72    2026-01-31 06:27    34  ***************
73    2026-01-31 07:26    33  **************
74    2026-01-31 08:25     ?  -
75    2026-01-31 09:24    34  ***************
76    2026-01-31 10:23     ?  -
77    2026-01-31 11:22    34  ***************
78    2026-01-31 12:21    33  **************
79    2026-01-31 13:20    34  ***************
80    2026-01-31 14:19    33  **************
81    2026-01-31 15:18    33  **************
82    2026-01-31 16:17    34  ***************
83    2026-01-31 17:16    34  ***************
84    2026-01-31 18:15    33  **************
85    2026-01-31 19:14    33  **************
86    2026-01-31 20:13    34  ***************
87    2026-01-31 21:12    33  **************
88    2026-01-31 22:11    34  ***************
89    2026-01-31 23:10    33  **************
90    2026-02-01 00:09    33  **************
91    2026-02-01 01:08    33  **************
92    2026-02-01 02:07    32  *************
93    2026-02-01 03:06    32  *************
94    2026-02-01 04:05     ?  -
95    2026-02-01 05:04    32  *************
96    2026-02-01 06:03     ?  -
97    2026-02-01 07:02    32  *************
98    2026-02-01 08:01    32  *************
99    2026-02-01 09:00    33  **************
...    ..(  3 skipped).    ..  **************
103    2026-02-01 12:56    33  **************
104    2026-02-01 13:55    34  ***************
105    2026-02-01 14:54    35  ****************
106    2026-02-01 15:53    35  ****************
107    2026-02-01 16:52    35  ****************
108    2026-02-01 17:51    34  ***************
109    2026-02-01 18:50    35  ****************
110    2026-02-01 19:49    34  ***************
111    2026-02-01 20:48    33  **************
112    2026-02-01 21:47    33  **************
113    2026-02-01 22:46    33  **************
114    2026-02-01 23:45    34  ***************
115    2026-02-02 00:44    33  **************
...    ..(  2 skipped).    ..  **************
118    2026-02-02 03:41    33  **************
119    2026-02-02 04:40    32  *************
120    2026-02-02 05:39    33  **************
...    ..(  3 skipped).    ..  **************
124    2026-02-02 09:35    33  **************
125    2026-02-02 10:34    32  *************
126    2026-02-02 11:33    33  **************
127    2026-02-02 12:32    32  *************
0    2026-02-02 13:31    32  *************
1    2026-02-02 14:30    33  **************
2    2026-02-02 15:29    33  **************
3    2026-02-02 16:28    33  **************
4    2026-02-02 17:27    32  *************
5    2026-02-02 18:26    32  *************
6    2026-02-02 19:25    33  **************
7    2026-02-02 20:24    33  **************
8    2026-02-02 21:23    32  *************
9    2026-02-02 22:22    32  *************
10    2026-02-02 23:21    31  ************
11    2026-02-03 00:20     ?  -
12    2026-02-03 01:19    33  **************
...    ..(  4 skipped).    ..  **************
17    2026-02-03 06:14    33  **************
18    2026-02-03 07:13     ?  -
19    2026-02-03 08:12    34  ***************
...    ..(  3 skipped).    ..  ***************
23    2026-02-03 12:08    34  ***************
24    2026-02-03 13:07    35  ****************
25    2026-02-03 14:06    34  ***************
...    ..(  3 skipped).    ..  ***************
29    2026-02-03 18:02    34  ***************

SCT Error Recovery Control:
Read:    100 (10.0 seconds)
Write:    100 (10.0 seconds)

Device Statistics (GP Log 0x04)
Page  Offset Size        Value Flags Description
0x01  =====  =               =  ===  == General Statistics (rev 1) ==
0x01  0x008  4             410  ---  Lifetime Power-On Resets
0x01  0x010  4           12425  ---  Power-on Hours
0x01  0x018  6     99755924516  ---  Logical Sectors Written
0x01  0x020  6       517647970  ---  Number of Write Commands
0x01  0x028  6    895891087812  ---  Logical Sectors Read
0x01  0x030  6      4508110424  ---  Number of Read Commands
0x01  0x038  6               -  ---  Date and Time TimeStamp
0x03  =====  =               =  ===  == Rotating Media Statistics (rev 1) ==
0x03  0x008  4           12343  ---  Spindle Motor Power-on Hours
0x03  0x010  4           12115  ---  Head Flying Hours
0x03  0x018  4            2063  ---  Head Load Events
0x03  0x020  4               0  ---  Number of Reallocated Logical Sectors
0x03  0x028  4               0  ---  Read Recovery Attempts
0x03  0x030  4               0  ---  Number of Mechanical Start Failures
0x03  0x038  4               0  ---  Number of Realloc. Candidate Logical Sectors
0x03  0x040  4             377  ---  Number of High Priority Unload Events
0x04  =====  =               =  ===  == General Errors Statistics (rev 1) ==
0x04  0x008  4               0  ---  Number of Reported Uncorrectable Errors
0x04  0x010  4               1  ---  Resets Between Cmd Acceptance and Completion
0x04  0x018  4               0  -D-  Physical Element Status Changed
0x05  =====  =               =  ===  == Temperature Statistics (rev 1) ==
0x05  0x008  1              34  ---  Current Temperature
0x05  0x010  1              33  ---  Average Short Term Temperature
0x05  0x018  1              31  ---  Average Long Term Temperature
0x05  0x020  1              50  ---  Highest Temperature
0x05  0x028  1              21  ---  Lowest Temperature
0x05  0x030  1              47  ---  Highest Average Short Term Temperature
0x05  0x038  1              23  ---  Lowest Average Short Term Temperature
0x05  0x040  1              43  ---  Highest Average Long Term Temperature
0x05  0x048  1              28  ---  Lowest Average Long Term Temperature
0x05  0x050  4               0  ---  Time in Over-Temperature
0x05  0x058  1              60  ---  Specified Maximum Operating Temperature
0x05  0x060  4               0  ---  Time in Under-Temperature
0x05  0x068  1               5  ---  Specified Minimum Operating Temperature
0x06  =====  =               =  ===  == Transport Statistics (rev 1) ==
0x06  0x008  4             219  ---  Number of Hardware Resets
0x06  0x010  4             105  ---  Number of ASR Events
0x06  0x018  4               0  ---  Number of Interface CRC Errors
0xff  =====  =               =  ===  == Vendor Specific Statistics (rev 1) ==
0xff  0x008  7               0  ---  Vendor Specific
0xff  0x010  7               0  ---  Vendor Specific
0xff  0x018  7               0  ---  Vendor Specific
                                |||_ C monitored condition met
                                ||__ D supports DSN
                                |___ N normalized value

Pending Defects log (GP Log 0x0c)
No Defects Logged

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x000a  2            1  Device-to-host register FISes sent due to a COMRESET
0x0001  2            0  Command failed due to ICRC error
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
[165213.865020] sd 0:0:8:0: [sdb] tag#3413 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s
[165213.865028] sd 0:0:8:0: [sdb] tag#3413 Sense Key : Not Ready [current] 
[165213.865031] sd 0:0:8:0: [sdb] tag#3413 Add. Sense: Logical unit not ready, cause not reportable
[165213.865035] sd 0:0:8:0: [sdb] tag#3413 CDB: Read(16) 88 00 00 00 00 01 8a 3d a2 68 00 00 00 08 00 00
[165213.865037] I/O error, dev sdb, sector 6614262376 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[165213.865743] sd 0:0:8:0: [sdb] tag#3414 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s
[165213.865746] sd 0:0:8:0: [sdb] tag#3414 Sense Key : Not Ready [current] 
[165213.865749] sd 0:0:8:0: [sdb] tag#3414 Add. Sense: Logical unit not ready, cause not reportable
[165213.865752] sd 0:0:8:0: [sdb] tag#3414 CDB: Read(16) 88 00 00 00 00 01 9b a7 b4 b0 00 00 00 40 00 00
[165213.865755] I/O error, dev sdb, sector 6906426544 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[165213.866475] sd 0:0:8:0: [sdb] tag#3415 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s
[165213.866478] sd 0:0:8:0: [sdb] tag#3415 Sense Key : Not Ready [current] 
[165213.866481] sd 0:0:8:0: [sdb] tag#3415 Add. Sense: Logical unit not ready, cause not reportable
[165213.866484] sd 0:0:8:0: [sdb] tag#3415 CDB: Read(16) 88 00 00 00 00 01 9b a7 b6 b0 00 00 00 40 00 00
[165213.866486] I/O error, dev sdb, sector 6906427056 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[165213.867239] sd 0:0:8:0: [sdb] tag#3417 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s
[165213.867243] sd 0:0:8:0: [sdb] tag#3417 Sense Key : Not Ready [current] 
[165213.867246] sd 0:0:8:0: [sdb] tag#3417 Add. Sense: Logical unit not ready, cause not reportable
[165213.867249] sd 0:0:8:0: [sdb] tag#3417 CDB: Read(16) 88 00 00 00 00 01 9b a7 b6 70 00 00 00 40 00 00
[165213.867251] I/O error, dev sdb, sector 6906426992 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[165213.990055] sd 0:0:8:0: [sdb] tag#3418 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[165213.990061] sd 0:0:8:0: [sdb] tag#3418 Sense Key : Not Ready [current] 
[165213.990063] sd 0:0:8:0: [sdb] tag#3418 Add. Sense: Logical unit not ready, cause not reportable
[165213.990065] sd 0:0:8:0: [sdb] tag#3418 CDB: Read(16) 88 00 00 00 00 00 00 00 0a 10 00 00 00 10 00 00
[165213.990067] I/O error, dev sdb, sector 2576 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[165213.990082] sd 0:0:8:0: [sdb] tag#3422 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[165213.990090] sd 0:0:8:0: [sdb] tag#3424 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[165213.990092] sd 0:0:8:0: [sdb] tag#3424 Sense Key : Not Ready [current] 
[165213.990094] sd 0:0:8:0: [sdb] tag#3424 Add. Sense: Logical unit not ready, cause not reportable
[165213.990096] sd 0:0:8:0: [sdb] tag#3424 CDB: Read(16) 88 00 00 00 00 01 9b a7 bc f0 00 00 01 80 00 00
[165213.990097] I/O error, dev sdb, sector 6906428656 op 0x0:(READ) flags 0x0 phys_seg 6 prio class 0
[165213.990899] sd 0:0:8:0: [sdb] tag#3422 Sense Key : Not Ready [current] 
[165213.991327] sd 0:0:8:0: [sdb] tag#3419 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[165213.991733] sd 0:0:8:0: [sdb] tag#3422 Add. Sense: Logical unit not ready, cause not reportable
[165213.991733] sd 0:0:8:0: [sdb] tag#3419 Sense Key : Not Ready [current] 
[165213.991735] sd 0:0:8:0: [sdb] tag#3419 Add. Sense: Logical unit not ready, cause not reportable
[165213.991736] sd 0:0:8:0: [sdb] tag#3422 CDB: Read(16) 88 00 00 00 00 01 9b a7 b9 b0 00 00 01 c0 00 00
[165213.991737] sd 0:0:8:0: [sdb] tag#3419 CDB: Read(16) 88 00 00 00 00 0a 01 00 04 10 00 00 00 10 00 00
[165213.991737] I/O error, dev sdb, sector 6906427824 op 0x0:(READ) flags 0x0 phys_seg 7 prio class 0
[165213.991738] I/O error, dev sdb, sector 42966451216 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[165213.992920] sd 0:0:8:0: [sdb] tag#3420 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[165213.994224] sd 0:0:8:0: [sdb] tag#3420 Sense Key : Not Ready [current] 
[165213.994227] sd 0:0:8:0: [sdb] tag#3420 Add. Sense: Logical unit not ready, cause not reportable
[165213.994230] sd 0:0:8:0: [sdb] tag#3420 CDB: Read(16) 88 00 00 00 00 0a 01 00 06 10 00 00 00 10 00 00
[165213.994232] I/O error, dev sdb, sector 42966451728 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[165213.995690] sd 0:0:8:0: [sdb] tag#3421 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[165213.995693] sd 0:0:8:0: [sdb] tag#3421 Sense Key : Not Ready [current] 
[165213.995695] sd 0:0:8:0: [sdb] tag#3421 Add. Sense: Logical unit not ready, cause not reportable
[165213.995698] sd 0:0:8:0: [sdb] tag#3421 CDB: Read(16) 88 00 00 00 00 01 9b a7 b6 f0 00 00 01 40 00 00
[165213.995699] I/O error, dev sdb, sector 6906427120 op 0x0:(READ) flags 0x0 phys_seg 5 prio class 0

What does sudo zpool status -v show for that pool?
Your posted data shows no completed SMART Long test, though. I tried searching for the model number online. Is this a refurbished Seagate Exos? I don’t know whether the SMART data would be carried over from the original drive or reset.


At a glance the drive looks ok to me so perhaps check the physical connection.

This looks like the issue:

Sense Key : Not Ready
Add. Sense: Logical unit not ready, cause not reportable
I/O error, dev sdb

The drive seems to be temporarily disappearing.
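
To check whether the errors all belong to one device and one burst (which would fit a drive dropping off the bus better than scattered bad sectors), the kernel log can be summarized. This is only a rough sketch, assuming log lines in the same "I/O error, dev sdX, sector N" format as the ones posted above; the function name count_io_errors is made up for illustration.

```shell
# Hypothetical helper: count "I/O error" lines per device in kernel log text
# read from stdin. Assumes the "I/O error, dev sdX, sector N" format shown above.
count_io_errors() {
    awk '/I\/O error, dev / {
        for (i = 1; i <= NF; i++)
            if ($i == "dev") { dev = $(i + 1); sub(/,$/, "", dev); count[dev]++ }
    }
    END { for (d in count) printf "%s %d\n", d, count[d] }' | sort
}

# Typical use on a live system (needs permission to read the kernel log):
#   dmesg | count_io_errors
```

If only one device shows up and the timestamps sit within a second or two of each other, that points toward a single dropout event rather than a slowly failing surface, though it does not prove it.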


It’s a white label Exos X22.

  pool: Tank
 state: ONLINE
status: One or more devices are faulted in response to persistent errors.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
        repaired.
  scan: resilvered 216K in 00:00:01 with 0 errors on Tue Feb  3 03:38:43 2026
config:

        NAME                                      STATE     READ WRITE CKSUM
        Tank                                      ONLINE       0     0     0
          raidz1-0                                ONLINE       0     0     0
            cb0fa265-44ee-4d45-9512-c78b1dd7ca91  FAULTED    146   128     0  too many errors
            400cc181-526c-4004-9db8-ea7378076622  ONLINE       0     0     0
            1500a789-bd21-4e54-b299-0b926da59763  ONLINE       0     0     0
            ab0e26b7-9a46-437c-8cc4-63d5575658e2  ONLINE       0     0     0
            f2c343a9-fa96-42ea-82f2-8998def66ca1  ONLINE       0     0     0

errors: No known data errors

If it was a physical connection issue wouldn’t there be some CRC errors as well?

I would get a replacement in there as soon as possible. Do you have one more SATA port and power connector to add another drive and do an in-place replacement of the ‘failing’ drive? With RAIDZ1 and drives of this size, that is the safest option.
RAIDZ1 isn’t recommended with large drives because of the long resilver time and the risk of another drive in the vdev failing during it. It can be a fine choice if you have excellent backups and can restore the data for that pool from elsewhere.

CRC errors are counted when a frame is received but fails its check. Your drive is being reset instead, so no frame arrives and there is nothing to fail a CRC check.

Looks like I want to jump in here for a few minutes…

Reading these posts, what I see is that no SMART Extended test has completed; it is always terminated. This has me asking two questions:

  1. Do you know that it takes a minimum of just over 31 hours to run a SMART Long test on this drive? (1871 minutes = 31.2 hours) This is if there is no NAS activity.
  2. Are you rebooting or doing something to possibly sleep the system?
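
For anyone wanting to repeat the conversion for a different drive, here is a tiny sketch. The only input is the "Extended self-test routine recommended polling time" value from the smartctl output; the function name minutes_to_hours is just an illustration.

```shell
# Convert the drive's recommended extended self-test polling time to hours.
minutes_to_hours() {
    awk -v m="$1" 'BEGIN { printf "%.1f\n", m / 60 }'
}

minutes_to_hours 1871   # value from the smartctl output posted above; prints 31.2
```

Treat the result as a lower bound: the drive reports it for an otherwise idle disk.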

These very large capacity drives take a long time to test, and if the drive is somewhat full of data, even at 80%, a RESILVER or SCRUB would take a very long time as well. This is a pitfall of very large capacity drives.

You definitely have a pool issue at a minimum, and it may be nothing except clearing the fault indication, but you should test to figure it out.

My advice: Download the Drive Troubleshooting Flowcharts in my signature. Go through it step by step. It is actually pretty easy. Before actually doing anything, make sure you read through it, the Appendix is important data.

What I feel you will end up doing is (from a SSH window, as root):

  1. Type lsblk -n -o MODEL,NAME,SIZE,LABEL,PARTUUID and press Enter. This will list your drives and relevant data.
  2. Look for the matching cb0fa265-44ee-4d45-9512-c78b1dd7ca91 and verify it is in fact drive sdb.
  3. Now that you have verified the drive ID, run smartctl -a /dev/sdb (for the drive identified, of course) and write down the drive serial number. Always use the serial number, as drive IDs can change, and of course will, just to mess with your day.
  4. If this is the same drive listed above (Serial Number: 00002YDW), then I recommend that you first back up any important data you have, if possible. You are only running RAIDZ1, and with these very large drives, if the data is important, you should be running at least RAIDZ2; just my opinion, of course. This is due to how long it can take to test and RESILVER these drives if they hold a lot of data.
  5. Now you have a choice to make: run a SCRUB to verify all your data is intact, or run a SMART Long/Extended test and ensure it completes. You do have one other choice: if you have a replacement drive, you could REPLACE the suspect drive. But make sure it is the failing drive. I would choose the last option if I had a spare drive and could not back up my data. It is still a risk either way. Again, RAIDZ1 with very large drives is only smart if the data is not that important or has good backups to cover for a failure.
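
Steps 1 to 3 above can be roughly sketched as follows. This is an illustration, not a tested procedure for your system: the helper name name_for_partuuid is made up, and it assumes lsblk is run with -l so that names print without tree characters.

```shell
# Hypothetical helper: given `lsblk -n -l -o NAME,PARTUUID` output on stdin,
# print the partition name that carries the given PARTUUID.
name_for_partuuid() {
    awk -v id="$1" '$NF == id { print $1 }'
}

# On the live system (as root), for the faulted member shown by zpool status:
#   lsblk -n -l -o NAME,PARTUUID | name_for_partuuid cb0fa265-44ee-4d45-9512-c78b1dd7ca91
# Then confirm the serial number of the disk that partition lives on:
#   smartctl -i /dev/sdb | grep 'Serial Number'
```

The serial number printed by the last command is what should be matched against the label on the physical drive before anything is pulled.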

If you Replace the drive, ensure you check the SCRUB status once the drive has been RESILVERED. Ensure it lists zero problems.

If you SCRUB the pool, first run zpool clear Tank to clear the current faults. Then run zpool status Tank to verify those errors are no longer listed. This does not mean the problem is fixed. Then start the scrub with zpool scrub Tank, give it 30 minutes to settle, and run zpool status Tank again to get an estimate of the scrub completion time. You can check on that estimate periodically if you desire. Once it has completed, run the command one last time to check the status; in an ideal world it should show no errors.
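
The scrub sequence, written out as a small script. This is a sketch only: the DRY_RUN switch and the run and scrub_sequence names are my own additions for illustration, and by default the script just prints the commands instead of touching the pool.

```shell
# Sketch of the clear -> check -> scrub sequence. With DRY_RUN=1 (the default
# here) the commands are only printed, so nothing touches the pool.
POOL="${POOL:-Tank}"
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

scrub_sequence() {
    run zpool clear "$POOL"      # clear the current fault counters
    run zpool status "$POOL"     # confirm the errors are no longer listed
    run zpool scrub "$POOL"      # start the scrub
    # Re-check after about 30 minutes for a completion estimate:
    #   zpool status "$POOL"
}

scrub_sequence
```

Setting DRY_RUN=0 (as root) would execute the commands for real; read the printed list first and make sure the pool name is right.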

If you run a SMART Long/Extended Test, realize how long it takes. The system needs to remain powered on. You can run smartctl -a /dev/sdb to get a status of the test periodically. You can check if the test aborted early, then try to relate it to something that might have happened.
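
A quick way to see whether any long test has ever finished is to count the completed entries in the self-test log. A minimal sketch, assuming the log uses the same "Extended offline" and "Completed without error" wording as the output posted above; the function name is made up.

```shell
# Hypothetical helper: count extended self-tests that finished, reading
# self-test log text (as printed by smartctl) on stdin.
completed_extended_tests() {
    # grep -c prints 0 but exits non-zero when nothing matches; ignore that.
    grep -c 'Extended offline.*Completed without error' || true
}

# On the live system (as root):
#   smartctl -t long /dev/sdb        # start the long test
#   smartctl -l selftest /dev/sdb | completed_extended_tests
```

For the log posted in this thread the count would be 0, since every Extended entry shows as interrupted or aborted.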

Good Luck.
