Hi all,
I recently changed all the drives in my NAS, and I suspect an error occurred during this change.
zpool status -xv
pool: Tank
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: Message ID: ZFS-8000-8A — OpenZFS documentation
scan: scrub repaired 0B in 03:09:38 with 1 errors on Thu May 16 17:52:59 2024
config:
Thanks for the quick answer.
I can’t really destroy the pool (there is too much data that I would need to move, and I have nowhere to put it). Would it be an option to move the content from that folder to a different folder / pool and move it back after I recreate the folder?
I suspect that the 6 errors may actually relate to the scrub finding checksum errors.
I assume that Tank/Media/Filme is a directory - which suggests that the directory itself (which is probably stored in the same way as a file) is corrupt - so you have probably lost access to everything inside it.
I would start by trying to copy the contents off the pool to somewhere else. You might be more successful by looking inside the .zfs subdirectory of the dataset for the snapshots (which may well contain a valid copy of the directory blocks) and copy the files off that.
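A rough sketch of the snapshot route, in case it helps. It assumes the dataset is mounted at /mnt/Tank/Media and that some snapshot still holds a readable copy of the directory. The function name `copy_from_snapshot`, the snapshot name and the destination path are placeholders I made up, so substitute whatever `ls` shows under `.zfs/snapshot`.

```shell
#!/bin/sh
# Copy a directory out of a ZFS snapshot via the hidden .zfs directory.
# Usage: copy_from_snapshot <dataset-mountpoint> <snapshot-name> <subdir> <destination>
copy_from_snapshot() {
    mountpoint=$1
    snapshot=$2
    subdir=$3
    dest=$4
    src="$mountpoint/.zfs/snapshot/$snapshot/$subdir"

    # Bail out early if this snapshot does not contain the directory.
    if [ ! -d "$src" ]; then
        echo "snapshot copy not found: $src" >&2
        return 1
    fi

    mkdir -p "$dest" || return 1
    # -a preserves permissions and timestamps; the trailing /. copies the
    # directory contents (including dotfiles) rather than the directory itself.
    cp -a "$src/." "$dest/"
}

# Example (hypothetical snapshot name and destination):
#   ls /mnt/Tank/Media/.zfs/snapshot/
#   copy_from_snapshot /mnt/Tank/Media auto-2024-02-01 Filme /mnt/OtherPool/Filme-rescue
```

The destination should be on a different pool or disk, since the point is to get the data away from the damaged blocks.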
To remove the error you will need to remove the directory and the contained files (and possibly all the snapshots that contain the corrupted directory) and ensure that the blocks they used are returned to the free pool. I have no idea how to do this when a directory is corrupted.
Then, once you have deleted the directory and files contained, you will need to do a scrub to clear the error. I have, however, read somewhere that scrub > export > import > scrub may be needed to clear errors.
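For what it's worth, the scrub > export > import > scrub sequence could be sketched like this. It is only an illustration of the order, not something I have verified clears this particular error. The `run` and `rescrub_cycle` helpers are my own names, `-w` (wait for the scrub to finish) needs OpenZFS 2.0 or later, and on a NAS appliance the export/import step is usually better done through its own UI than from the shell.

```shell
#!/bin/sh
# Set DRY_RUN=1 to print the commands instead of running them.
run() {
    if [ -n "${DRY_RUN:-}" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# scrub > export > import > scrub, then show what is still flagged.
rescrub_cycle() {
    pool=$1
    # -w makes zpool wait until the scrub has finished.
    run zpool scrub -w "$pool"
    # Stop here if the export fails (for example because the pool is busy).
    run zpool export "$pool" || return 1
    run zpool import "$pool" || return 1
    run zpool scrub -w "$pool"
    # -v lists any files or directories still reported as damaged.
    run zpool status -v "$pool"
}

# Example: preview the commands first, then run them for real.
#   DRY_RUN=1 rescrub_cycle Tank
#   rescrub_cycle Tank
```

Nothing can be using the pool while it is exported, so shares and jobs that touch it would have to be stopped first.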
I have a few issues here. After the last scrub I ended up with fewer errors, but it is still not good enough.
Problem 1 - my last snapshot is from February. Not a big issue.
Problem 2 - I cannot delete the content.
As the error is pointing to the folder, maybe moving the content somewhere else and deleting the folder may work.
root@tatooine[/mnt/Tank]# zpool status -xv
pool: Tank
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: Message ID: ZFS-8000-8A — OpenZFS documentation
scan: scrub repaired 0B in 03:16:26 with 1 errors on Thu May 16 22:16:40 2024
config:
Did you run a zpool clear before and after the scrub?
This is looking like an HBA issue, since there’s no way all drives in your RAIDZ1 vdev have the exact same number of checksum errors at the same time.
Maybe it’s an overheating issue, and ironically the scrub is what causes the HBA to run hot enough to be on the brink of checksum errors for some blocks.
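If you want to check the "same count on every drive" pattern without eyeballing it, here is a small sketch that reads `zpool status` output and compares the CKSUM column of the most deeply indented rows (which I am assuming are the disks). The function name `same_cksum_counts` is made up, and it does not try to handle pools with several vdev types, spares or cache devices.

```shell
#!/bin/sh
# Reads `zpool status` output on stdin and prints one of:
#   none       no leaf device reports checksum errors
#   identical  every leaf device reports the same non-zero CKSUM count
#   different  anything else
same_cksum_counts() {
    awk '
        BEGIN { max = 0 }
        # The device table starts after the header row: NAME STATE READ WRITE CKSUM
        $1 == "NAME" && $5 == "CKSUM" { in_table = 1; next }
        # A blank line or the "errors:" line ends the table.
        in_table && (NF == 0 || $1 == "errors:") { in_table = 0 }
        in_table && NF >= 5 {
            # Remember how far each row is indented; disks are indented the most.
            indent = match($0, /^[ \t]+/) ? RLENGTH : 0
            if (indent > max) max = indent
            n++; depth[n] = indent; cksum[n] = $5
        }
        END {
            for (i = 1; i <= n; i++) {
                if (depth[i] != max) continue
                leaves++
                if (cksum[i] != "0") {
                    nonzero++
                    if (!(cksum[i] in seen)) { seen[cksum[i]] = 1; distinct++ }
                }
            }
            if (nonzero == 0) print "none"
            else if (leaves > 1 && nonzero == leaves && distinct == 1) print "identical"
            else print "different"
        }
    '
}

# Example:
#   zpool status Tank | same_cksum_counts
```

An "identical" result would fit the idea that the errors come from something the drives share (controller, cabling, memory) rather than from the individual disks, though it does not prove it.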
What is the HBA? Has it got the latest IT firmware (assuming it’s an HBA which has IT firmware)? And do you have it actively cooled… assuming it’s an HBA which requires active cooling…
(and it probably does, if it’s a good one, and you’re not using a server chassis)