Failed disk but spare is still listed as spare

Scott_Guenther · June 12, 2025, 5:45pm

I have no idea what’s going on. I had a disk fail (sdax). I had a spare in the system(sdi).

both sdi and sdax are now listed in one of my vdevs, but sdi is still listed in spare as well. However the system reports I have an unused disk sdax…so they are both spares and in the vdev? The vdev is still listed as degraded, however its 11 wide and with sdi in there it should be whole. I don’t know what my next steps are.

Johnny_Fartpants · June 12, 2025, 5:50pm

The spare (sdi) has done its job and although still listed as a spare it’s crucially ‘unavailable’ as it’s now a pool member proper. The pool is still degraded as you no longer have an available spare.

The original disk (sdax) was faulted by TrueNAS for some reason often because its producing too many errors or not responding in good time.

Next steps are to replace the failed drive (sdax) NOT the spare. Once resilver is complete the spare should automatically return to being a spare again.

neofusion · June 12, 2025, 5:52pm

Sdax is not a spare.
First, confirm that sdax is indeed bad. I recommend running sudo smartctl -a /dev/sdax and posting the output.

If it is bad, the next steps would be:

Take the disk offline.
Detach the failed disk to promote the hot spare.
Refresh the screen.
Recreate the hot spare VDEV.

The steps above are from the official documentation for version 25.04.

PK1048 · June 12, 2025, 6:12pm

That is correct if you want the Hot Spare (sdi) to permanently replace the failed drive.

If you want the Hot Spare (sdi) to go back to being a Hot Spare, then what @Johnny_Fartpants said is correct.

See Replacing Disks | TrueNAS Documentation Hub for a discussion of both approaches.

etorix · June 12, 2025, 6:21pm

Or to confirm the temporary replacement and make sdi a permanent pool member. The you can bring in a new spare.

neofusion · June 12, 2025, 6:37pm

Both options are fine.

Scott_Guenther · June 12, 2025, 6:40pm

thanks for all the help. I am so used to raid systems automatically kicking a drive out and replacing it. I have detached the failed drive (sdax) and everything went green.

I will head out there later today and replace the drive.

From what I understand now, I technically want in a degraded state as all drives in the array were there, but it was reporting because my spare was now missing, as it had been assigned? In other words, I still had the full 2 parity drives in my Z2 vdev, even though it was complaining?

Johnny_Fartpants · June 12, 2025, 7:07pm

Yep you got it.

Stux · June 14, 2025, 12:21pm

I like to order a replacement for the spare and burn it in, moving the spare into active use, otherwise they just get older while idling.

joeschmuck · June 14, 2025, 12:45pm

So far I have not seen any troubleshooting to determine if the driver is bad or if it is the HBA or a cable (data or power)? @neofusion has asked the question to push you into the right direction.

@Scott_Guenther do not assume the drive is bad until you know the drive is bad. Maybe it is bad, but find out.

I have a set of troubleshooting flowcharts in the resources on the forums and linked below for faster access. Prove it before you just replace the drive.

Topic		Replies	Views
Turn a "spare" vdev back into a "mirror" vdev TrueNAS General SCALE , ZFS	15	534	May 6, 2024
Recently upgraded to SCALE and have pool degraded TrueNAS General SCALE	8	373	January 24, 2025
Advise replacing disks with hot spare active TrueNAS General SCALE , ZFS	10	374	January 10, 2025
Hot spare activated when 2 drives failed in RAIDZ2, but now they're back online-- how do I get the pool back to normal? TrueNAS General	2	230	June 11, 2024
Pool degraded, confused about how the spare works TrueNAS General	7	100	December 18, 2025

Failed disk but spare is still listed as spare

Related topics