Hi,
Since rebooting my TrueNAS SCALE yesterday, I’m getting these errors:
New alerts:
* Device: /dev/sdb [SAT], 2 Offline uncorrectable sectors.
Current alerts:
* Device: /dev/sdb [SAT], 2 Currently unreadable (pending) sectors.
* Device: /dev/sdb [SAT], 2 Offline uncorrectable sectors.
I have logged in to the dashboard, I see these errors on the bell, but when I’m looking at the storage panel, I don’t see any error. I’ve checked with zpool status - same, no errors. I checked the journal and dmesg - same, no error at all. I also checked using smartctl -H /dev/sdb - same, no errors.
So, what’s up with those messages and how can I “fix” this issue?
If smartctl -a confirms what was reported, you should look into replacing the drives. This is valid ground for RMA. And if the drives are too old for RMA, the recycling bin awaits…
Well, it helps to not look at where the errors would be found.
Relocated sectors do not necessarily lead to SMART test failures (but they would be logged, including in the SMART parameter TrueNAS is reporting to you)
Short tests are not great, which is fine because they’re cheap. Short tests just don’t fail all that often, even on pretty bad disks.
Look at the long results… run a long test if you haven’t.
Pending sectors don’t always cause long failures.
If the drive is in warranty RMA it.
If not, you could try wiping with zeros. May make the problem go away for a bit, but at the end of the day, a pending sector is a sector that contains data that can no longer be read.
It’s safer to replace it before doing that.
But it’s pending being rewritten. Hence the wipe with zeros.