Hi all, I keep getting the below errors which seem to be getting sent from the grave.
After solving my controller overheating issues, new drives etc these errors which I am not convinced are accurate are being emailed to me fairly regularly.
The dead giveaway for me, is that it is complaining that Test2 is offline, which it’s not because the pool was deleted ages ago and doesn’t show up in zpool list.
Also hdd2pool it says is offline, yet it is actually online and running well.
From that I assume other data is also out of date.
Anyone know how to reset this alerting system? It seems to have gotten screwed up somehow.
I would only add that there are some correct alerts being included also, that I have been able to address and clear. So it’s not a total screw up.
Update: Looking at the device names, I can see they are a spread across my three different controllers, Internal Motherboard, LSI 9305 in IT mode and Dell Perc H310 in IT mode. I would expect smart to be working at least on the internal ports, but I understand it also usually works with IT mode.
On the SMART side of things, it does seem that I am unable to perform a quick smart test on at least some drives. But that doesn’t explain the issues with the two pools.
Thanks.
Current alerts:
Device: /dev/sdg, failed to read SMART values.
Device: /dev/sdg, Read SMART Self-Test Log Failed.
Device: /dev/sdm [SAT], failed to read SMART Attribute Data.
Device: /dev/sdm [SAT], Read SMART Self-Test Log Failed.
Device: /dev/sdm [SAT], not capable of SMART self-check.
Device: /dev/sdm [SAT], failed to read SMART Attribute Data.
Device: /dev/sdi, failed to read SMART values.
Device: /dev/sdi, Read SMART Self-Test Log Failed.
Device: /dev/sdq [SAT], failed to read SMART Attribute Data.
Device: /dev/sdq [SAT], Read SMART Self-Test Log Failed.
Device: /dev/sdq [SAT], Read SMART Error Log Failed.
Device: /dev/sds [SAT], not capable of SMART self-check.
Device: /dev/sdk [SAT], not capable of SMART self-check.
Device: /dev/sdab [SAT], not capable of SMART self-check.
Device: /dev/sdl [SAT], FAILED SMART self-check. BACK UP DATA NOW!.
Device: /dev/sdl [SAT], 15625 Currently unreadable (pending) sectors.
Device: /dev/sdl [SAT], 15625 Offline uncorrectable sectors.
Device: /dev/sdl [SAT], Failed SMART usage Attribute: 5 Reallocated_Sector_Ct..
Device: /dev/sde, failed to read SMART values.
Device: /dev/sde, Read SMART Self-Test Log Failed.
Device: /dev/sds [SAT], failed to read SMART Attribute Data.
Device: /dev/sdi [SAT], FAILED SMART self-check. BACK UP DATA NOW!.
Device: /dev/sdi [SAT], 15625 Currently unreadable (pending) sectors.
Device: /dev/sdi [SAT], 15625 Offline uncorrectable sectors.
Device: /dev/sdi [SAT], Failed SMART usage Attribute: 5 Reallocated_Sector_Ct..
Device: /dev/sdj, failed to read SMART values.
Device: /dev/sdj, Read SMART Self-Test Log Failed.
Device: /dev/sdq [SAT], not capable of SMART self-check.
Device: /dev/sdf, failed to read SMART values.
Device: /dev/sdf, Read SMART Self-Test Log Failed.
Device: /dev/sdk, Read SMART Self-Test Log Failed.
Device: /dev/sdn [SAT], Read SMART Self-Test Log Failed.
Device: /dev/sdn [SAT], Read SMART Self-Test Log Failed.
Device: /dev/sdn [SAT], failed to read SMART Attribute Data.
Device: /dev/sdq [SAT], not capable of SMART self-check.
Device: /dev/sdq [SAT], failed to read SMART Attribute Data.
Device: /dev/sdq [SAT], Read SMART Self-Test Log Failed.
Device: /dev/sdq [SAT], Read SMART Error Log Failed.
Device: /dev/sdn [SAT], not capable of SMART self-check.
Device: /dev/sdo [SAT], FAILED SMART self-check. BACK UP DATA NOW!.
Device: /dev/sdo [SAT], 15625 Currently unreadable (pending) sectors.
Device: /dev/sdo [SAT], 15625 Offline uncorrectable sectors.
Device: /dev/sdo [SAT], Failed SMART usage Attribute: 5 Reallocated_Sector_Ct..
Pool Test2 is offline, not running scrub.
Pool hdd2pool is offline, not running scrub.