Ok, so I have been looking and try to remember what I had implemented. Basically at the time I saw some documentation and guides on how to setup TrueNAS Scale.
TrueNAS is installed in one SSD (256GB). I remember that did it so that all the other 4 drives, which are drives for servers, stay dedicated to data.
The DataStore dataset is on the other 4x drives as mirror.
Now I also remember that I had another pool called Apps and inside I had a dataset app_configs. And that pool was selected the Apps. After installing more and more apps I realised the I should put everything together but my time now is short for development…
This latest pool was created on another separated SSD (128GB), and this one is the one that is corrupted..
Now from these logs:
1- Critical Snapshot Task For Dataset “Apps/vm” failed: cannot create snapshot ‘Apps/vm@auto-2026-04-26_00-00’: out of space..
2 - truenas kernel: WARNING: Pool ‘Apps’ has encountered an uncorrectable I/O failure and has been suspended.
3 - truenas kernel: zio pool=Apps vdev=/dev/disk/by-partuuid/82b0844b-98ec-4d66-beb4-a5c1825d0065 error=5 type=2 offset=70635380736 size=94208 flags=1572992
It seems that the disk went out of space or failed and pool got suspended. This was before I noticed that something was wrong in the HA and before I rebooted TrueNAS.
I noticed now that this 128 SSD is corrupted, this one I can’t mount anywhere. On Mac OSX it doesn’t show nothing regarding the device, Windows disk shows as undefined, on TrueNAS only if I install it through a USB drive I can see something, but it says 0 bytes and in dmesg I see the following:
<5>[ 10.764479] sd 8:0:0:0: [sde] Unit Not Ready
<6>[ 10.764612] sd 8:0:0:0: [sde] Sense Key : Hardware Error [current]
<6>[ 10.764617] sd 8:0:0:0: [sde] ASC=0x44 <>ASCQ=0x81
<3>[ 190.774827] sd 8:0:0:0: [sde] tag#0 timing out command, waited 180s
<3>[ 370.790929] sd 8:0:0:0: [sde] tag#0 timing out command, waited 180s
<3>[ 550.806904] sd 8:0:0:0: [sde] tag#0 timing out command, waited 180s
<6>[ 550.807094] sd 8:0:0:0: [sde] Read Capacity(10) failed: Result: hostbyte=DID_OK driverbyte=DRIVER_OK
<6>[ 550.807105] sd 8:0:0:0: [sde] Sense Key : Hardware Error [current]
<6>[ 550.807112] sd 8:0:0:0: [sde] ASC=0x44 <>ASCQ=0x81
<3>[ 730.822880] sd 8:0:0:0: [sde] tag#0 timing out command, waited 180s
<5>[ 730.823074] sd 8:0:0:0: [sde] 0 512-byte logical blocks: (0 B/0 B)
<5>[ 730.823206] sd 8:0:0:0: [sde] 0-byte physical blocks
<3>[ 910.826855] sd 8:0:0:0: [sde] tag#0 timing out command, waited 180s
<4>[ 910.827053] sd 8:0:0:0: [sde] Test WP failed, assume Write Enabled
<3>[ 1090.842829] sd 8:0:0:0: [sde] tag#0 timing out command, waited 180s
<3>[ 1090.843026] sd 8:0:0:0: [sde] Asking for cache data failed
<4>[ 1090.843156] sd 8:0:0:0: [sde] Assuming drive cache: write through
<5>[ 1090.843766] sd 8:0:0:0: [sde] Attached SCSI disk
<3>[ 1120.906851] sd 8:0:0:0: [sde] tag#0 timing out command, waited 30s
<3>[ 1138.706851] sd 8:0:0:0: [sde] tag#0 timing out command, waited 30s
<3>[ 1209.514893] sd 8:0:0:0: [sde] tag#0 timing out command, waited 60s
<3>[ 1209.583020] sd 8:0:0:0: [sde] tag#0 timing out command, waited 60s
<3>[ 1269.546802] sd 8:0:0:0: [sde] tag#0 timing out command, waited 60s
<3>[ 1269.586218] sd 8:0:0:0: [sde] tag#0 timing out command, waited 60s
<3>[ 1313.553952] sd 8:0:0:0: [sde] tag#0 timing out command, waited 15s
<3>[ 1328.570339] sd 8:0:0:0: [sde] tag#0 timing out command, waited 15s
<3>[ 1373.565528] sd 8:0:0:0: [sde] tag#0 timing out command, waited 15s
Already checked for short circuits in the SSD board but everything seems to be fine and actually it doesn’t heat on a specific location of the board.
Do you agree that the data got lost for good?
From the HA, i have a backup with more than a year. I lose some data and changes but it saves me quite some time if I would implement from scratch…
The only App that I want to be careful is Immich to see if I can recover everything and not mess up.
In the DataStore I have the Immich folder with the following datasets:
DataStore/Photos/Immich
DataStore/Photos/Immich/backup
DataStore/Photos/Immich/library
DataStore/Photos/Immich/pgData
DataStore/Photos/Immich/upload
DataStore/Photos/Immich/video
What is the best way to recover? Install the App with the data in a new location, and move/copy everything to the new location and then restore the latest backup? Or just use the same location and restore?
Sorry for the late reply but the weekend was busy.
Thanks