I was able to use my spare, and order an actual 2nd standby drive (I thought I had 2 cold standby drives, but could only find 1) while the vendor was replenishing their stocks. I should get my 2 replacements Soon™, they should ship this week sometime.
The other zPool had a failure after the resilver and the drive was taken offline immediately by TrueNAS, so I pulled it and sent it off for a replacement. I left the NAS powered off during the RMA, since at that point, clearly, the solar winds were affecting it, or something.
I installed the replacement, let it resilver, and ANOTHER (4th) drive had an issue (sdb, sn Y890A08UFFHG). I did a long SMART test on this drive and it passed. It’s only had this one notification, and from this post and others, I’m not sure if this is actually a critical issue or not given it passing the long SMART test. Should I replace the one with the error too?
At this point we’re up to 4 failures on this thread. I’ve never had such bad luck with hard drives before, but we’re also literally at an 11 year peak solar activity, so sunspots are on the table and not just superstition. These are all refurb’d enterprise drives, and these last 2 failures are Toshiba’s, whereas the other 2 on this thread, and 1 or 2 failures I had a long time ago, were all WD. Do you think something else is up, or I just got the lucky in the wrong way?
Critical
Device: /dev/sdb [SAT], ATA error count increased from 0 to 1.
Jan 30, 2025 15:19:16 (America/New_York)
Dismiss
1
sdb Extended offline SUCCESS
Remaining: N/A
Lifetime: 41188
Error: N/A
lsblk -bo NAME,MODEL,ROTA,PTTYPE,TYPE,START,SIZE,PARTTYPENAME,PARTUUID
NAME MODEL ROTA PTTYPE TYPE START SIZE PARTTYPENAME PARTUUID
sda WDC WUH 1 gpt disk 14000519643136
└─sda1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 25ec455c-a629-4300-b16d-09deaa3a88a0
sdb TOSHIBA 1 gpt disk 14000519643136
└─sdb1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS c9e772ad-f118-4e92-b5bd-c872033b71a1
sdc TOSHIBA 1 gpt disk 14000519643136
└─sdc1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 41644b00-0499-4742-9d4e-a8e76f823d61
sdd WDC WUH 1 gpt disk 14000519643136
└─sdd1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 1851ae7d-b04d-4ce7-a718-5a8aefbcbb2b
sde TOSHIBA 1 gpt disk 14000519643136
└─sde1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 0278dd80-3622-46b5-8a8f-4fa4b6b6142b
sdf TOSHIBA 1 gpt disk 14000519643136
└─sdf1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 8544a5e0-4009-463e-90a2-7d937b658e61
sdg WDC WUH 1 gpt disk 14000519643136
└─sdg1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 5d64fc93-6e46-48d4-b69e-d94d9dceb84c
sdh TOSHIBA 1 gpt disk 14000519643136
└─sdh1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 6406fd6a-053c-4195-babe-f088281483f8
sdi WDC WUH 1 gpt disk 14000519643136
└─sdi1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 8c7742ef-a8a0-4dc3-b974-316e43d8a15a
sdj WDC WUH 1 gpt disk 14000519643136
└─sdj1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 3eaf1a0a-317f-402c-b8f3-60a3ac83a3cc
sdk SanDisk 0 gpt disk 128035676160
├─sdk1 0 gpt part 4096 1048576 BIOS boot f1eb4c83-ddc8-45bc-b59a-5909fba158b5
├─sdk2 0 gpt part 6144 536870912 EFI System 84d5455d-9b3d-4d74-a64a-df269a20e05c
├─sdk3 0 gpt part 34609152 110315773440 Solaris /usr & Apple ZFS 6ecbfa39-b644-4021-bcaa-5c7e684f174d
└─sdk4 0 gpt part 1054720 17179869184 Linux swap 646e196c-7554-4fbe-8f45-45f26bdf5938
└─sdk4 0 crypt 17179869184
sdl WDC WUH 1 gpt disk 14000519643136
└─sdl1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 76d05838-4e24-4f1d-a2b5-69687952b39e
sdm WDC WUH 1 gpt disk 14000519643136
└─sdm1 1 gpt part 4096 14000516501504 Solaris /usr & Apple ZFS b6014162-d311-47ac-b63c-6e6a72c4f07f
sdn WDC WUH 1 gpt disk 14000519643136
└─sdn1 1 gpt part 4096 14000516501504 Solaris /usr & Apple ZFS 1a4e3a70-25d0-4c61-91bd-9f76014d1ae3
sdo TOSHIBA 1 gpt disk 14000519643136
└─sdo1 1 gpt part 4096 14000516497920 Solaris /usr & Apple ZFS 5fc7e2f4-0dde-4895-8f1a-432ca652f35e
admin@truenas[~]$
admin@truenas[~]$ lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne Root Complex
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne IOMMU
00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:01.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge
00:01.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge
00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir Internal PCIe GPP Bridge to Bus
00:08.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir Internal PCIe GPP Bridge to Bus
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 51)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 5
00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 6
00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 7
01:00.0 Serial Attached SCSI controller: Broadcom / LSI SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] (rev 03)
02:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse Switch Upstream
03:02.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
03:03.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
03:08.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
03:09.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
03:0a.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
04:00.0 USB controller: ASMedia Technology Inc. Device 3241
05:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8125 2.5GbE Controller (rev 05)
06:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Reserved SPP
06:00.1 USB controller: Advanced Micro Devices, Inc. [AMD] Matisse USB 3.0 Host Controller
06:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Matisse USB 3.0 Host Controller
07:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 51)
08:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 51)
09:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Cezanne [Radeon Vega Series / Radeon Vega Mobile Series] (rev c9)
09:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Renoir Radeon High Definition Audio Controller
09:00.2 Encryption controller: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor
09:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1
09:00.4 USB controller: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1
09:00.6 Audio device: Advanced Micro Devices, Inc. [AMD] Family 17h/19h HD Audio Controller
0a:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 81)
0a:00.1 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 81)
admin@truenas[~]$
admin@truenas[~]$ sudo sas2flash -list
[sudo] password for admin:
LSI Corporation SAS2 Flash Utility
Version 20.00.00.00 (2014.09.18)
Copyright (c) 2008-2014 LSI Corporation. All rights reserved
Adapter Selected is a LSI SAS: SAS2008(B2)
Controller Number : 0
Controller : SAS2008(B2)
PCI Address : 00:01:00:00
SAS Address : 5003005-7-01a8-08b0
NVDATA Version (Default) : 14.01.00.08
NVDATA Version (Persistent) : 14.01.00.08
Firmware Product ID : 0x2213 (IT)
Firmware Version : 20.00.07.00
NVDATA Vendor : LSI
NVDATA Product ID : SAS9211-8i
BIOS Version : 07.39.02.00
UEFI BSD Version : 07.27.01.01
FCODE Version : N/A
Board Name : SAS9211-8i
Board Assembly : ARTofSERVER
Board Tracer Number : N/A
Finished Processing Commands Successfully.
Exiting SAS2Flash.
admin@truenas[~]$
sudo sas3flash -list
Avago Technologies SAS3 Flash Utility
Version 16.00.00.00 (2017.05.02)
Copyright 2008-2017 Avago Technologies. All rights reserved.
No Avago SAS adapters found! Limited Command Set Available!
ERROR: Command Not allowed without an adapter!
ERROR: Couldn't Create Command -list
Exiting Program.
admin@truenas[~]$
sudo zpool status -v
pool: boot-pool
state: ONLINE
scan: scrub repaired 0B in 00:00:08 with 0 errors on Fri Jan 31 03:45:09 2025
config:
NAME STATE READ WRITE CKSUM
boot-pool ONLINE 0 0 0
sdk3 ONLINE 0 0 0
errors: No known data errors
pool: zPool
state: ONLINE
scan: scrub repaired 0B in 04:24:43 with 0 errors on Sun Feb 2 04:24:45 2025
config:
NAME STATE READ WRITE CKSUM
zPool ONLINE 0 0 0
raidz2-0 ONLINE 0 0 0
1851ae7d-b04d-4ce7-a718-5a8aefbcbb2b ONLINE 0 0 0
0278dd80-3622-46b5-8a8f-4fa4b6b6142b ONLINE 0 0 0
25ec455c-a629-4300-b16d-09deaa3a88a0 ONLINE 0 0 0
c9e772ad-f118-4e92-b5bd-c872033b71a1 ONLINE 0 0 0
8544a5e0-4009-463e-90a2-7d937b658e61 ONLINE 0 0 0
8c7742ef-a8a0-4dc3-b974-316e43d8a15a ONLINE 0 0 0
41644b00-0499-4742-9d4e-a8e76f823d61 ONLINE 0 0 0
raidz2-1 ONLINE 0 0 0
5d64fc93-6e46-48d4-b69e-d94d9dceb84c ONLINE 0 0 0
6406fd6a-053c-4195-babe-f088281483f8 ONLINE 0 0 0
5fc7e2f4-0dde-4895-8f1a-432ca652f35e ONLINE 0 0 0
b6014162-d311-47ac-b63c-6e6a72c4f07f ONLINE 0 0 0
1a4e3a70-25d0-4c61-91bd-9f76014d1ae3 ONLINE 0 0 0
76d05838-4e24-4f1d-a2b5-69687952b39e ONLINE 0 0 0
3eaf1a0a-317f-402c-b8f3-60a3ac83a3cc ONLINE 0 0 0
errors: No known data errors
admin@truenas[~]$