What's the best way to handle this disk failure?

My TrueNAS CORE system (TrueNAS-13.0-U6.4) is reporting “CRITICAL
Pool <???>-pool state is DEGRADED: One or more devices are faulted in response to persistent errors. Sufficient replicas exist for the pool to continue functioning in a degraded state.
The following devices are not healthy:
Disk ATA TOSHIBA HDWG480 62*****A3H is FAULTED
16 February, 2025 15:38:54 (Europe/London)”

I set it up as a RAID 5 style array, RAID-Z1 as far as I remember.

What is the best way to deal with this? The drive is under warranty, but has to be returned to Toshiba Germany. So I think this will take a couple of weeks.

I have a very good backup, and I’m confident in it.

I guess I need to mark the drive as dead and remove it from the array so I can return it to Germany, but I don’t know how.

Once the replacement drive (the “same” model) is returned to me, I will plug it in, but I then need to get the array to rebuild.

Can anyone offer any guidance?

thanks

Julian

We get a lot of these questions, but we really need to have the documentation vetted by actual users. There have been changes over the years, so it is possible that the docs need updating.

If you have further questions, certainly ask them here in this thread. On the other hand, if you find something in the documentation that could be improved (or outright fixed), there is a “Feedback” button on the right edge of the doc pages.

Here is the documentation for TrueNAS Core, version 13.0 on Disk Replacement:

You should really look at the documentation, but here is the short version.
In the interface, it’s just a matter of selecting the faulted disk from the pool status or layout and using the “Replace” function.
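
If you want to double-check which device is faulted before pulling a disk, `zpool status` run from the TrueNAS shell shows the same information as the alert. Here is a minimal Python sketch that scans that output for devices in a bad state. The sample text, the pool name `tank` and the `gptid/...` labels are made up for illustration, and the parsing assumes the usual NAME/STATE table layout, so treat it as a starting point rather than a tested tool.

```python
# States that usually mean a device needs attention.
BAD_STATES = {"FAULTED", "UNAVAIL", "REMOVED", "OFFLINE"}

# Hypothetical `zpool status` output; names and counters are invented.
SAMPLE_STATUS = """\
  pool: tank
 state: DEGRADED
config:

    NAME                   STATE     READ WRITE CKSUM
    tank                   DEGRADED     0     0     0
      raidz1-0             DEGRADED     0     0     0
        gptid/disk-a       ONLINE       0     0     0
        gptid/disk-b       FAULTED     12   340     0  too many errors
        gptid/disk-c       ONLINE       0     0     0

errors: No known data errors
"""


def find_unhealthy_devices(status_text):
    """Return (name, state) pairs for rows whose state is in BAD_STATES."""
    unhealthy = []
    in_table = False
    for line in status_text.splitlines():
        fields = line.split()
        if fields[:2] == ["NAME", "STATE"]:
            in_table = True  # header row of the device table
            continue
        if not in_table:
            continue
        if not fields:
            break  # assumption: a blank line ends the device table
        if len(fields) >= 2 and fields[1] in BAD_STATES:
            unhealthy.append((fields[0], fields[1]))
    return unhealthy


if __name__ == "__main__":
    for name, state in find_unhealthy_devices(SAMPLE_STATUS):
        print(f"{name} is {state}")
```

On a real system you would feed it the text of `zpool status <your-pool>` instead of the sample, then match the reported label to the physical disk's serial number in the web interface before removing anything.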

In TrueNAS CORE:

In TrueNAS SCALE it’s different, you go to Storage / Manage Devices:

If I can give you one piece of advice: always keep a spare disk around in the future.
You said you have a backup, and that it’s good. I have a backup and a RAID-Z1 pool too, but I prefer not to wait for a replacement when a disk faults, because I am using ZFS replication, and if the RAID-Z1 fails it’s a pain in the ass to restore. You need to:

  • snapshot and replicate the snapshot from BACKUP to PRIMARY
  • restart the snapshotting from PRIMARY to BACKUP again

Since it is a lot of data, it takes a long time, and if you do it wrong, it can turn very very bad.
Obviously this doesn’t apply if you are just rsyncing, which is way slower but safer. Still a pain in the ass.
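
To make the two restore steps above a bit more concrete, here is a rough Python sketch that only builds and prints the kind of `zfs` commands involved, without running anything. The dataset names `backup/data` and `primary/data` and the snapshot name are hypothetical, and on TrueNAS you would normally set this up as replication tasks in the web interface rather than typing commands, so this is just to show the shape of the operation.

```python
import shlex


def restore_commands(backup_dataset, primary_dataset, snapshot_name):
    """Build (but do not run) commands to copy BACKUP back to PRIMARY."""
    snap = f"{backup_dataset}@{snapshot_name}"
    # Recursive snapshot of the backup dataset and its children.
    take_snapshot = ["zfs", "snapshot", "-r", snap]
    # -R sends a replication stream including child datasets and snapshots.
    send = ["zfs", "send", "-R", snap]
    # -F rolls the target back to match the incoming stream (destructive).
    receive = ["zfs", "receive", "-F", primary_dataset]
    return [
        shlex.join(take_snapshot),
        f"{shlex.join(send)} | {shlex.join(receive)}",
    ]


if __name__ == "__main__":
    # Hypothetical names; replace with your own datasets before use.
    for cmd in restore_commands("backup/data", "primary/data", "restore-1"):
        print(cmd)
```

Printing the commands first and reading them before running anything is deliberate: `zfs receive -F` overwrites the target, which is exactly the "if you do it wrong" case.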


Thank you for the advice, it’s very much appreciated. I’ve read through the documentation too.
I have a question about resilvering and powering down, but I’ll post that separately.
Thank you both for your help :slight_smile: