This is why you may need to use a command-line program like ddrescue, or a bootdisk with it available; ddrescue has options to “fail a block and continue”, IIRC, rather than hanging while it stubbornly retries the same LBA.
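For reference, a typical GNU ddrescue run along those lines looks something like this (the device names are placeholders, double-check with `lsblk` before running anything):

```
# First pass: copy the readable areas quickly and skip the slow retries on bad
# sectors (-n), keeping a map file so the run can be resumed or refined later.
# /dev/sdX = failing source disk, /dev/sdY = destination disk (placeholders!).
sudo ddrescue -f -n /dev/sdX /dev/sdY rescue.map
# Optional second pass: go back and retry only the bad areas a few times.
sudo ddrescue -f -r3 /dev/sdX /dev/sdY rescue.map
```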
If it is truly the same size down to the LBAs it may work.
Before trying that method, I’d like to clear up a few doubts and rule out some options.
It doesn’t let me import the pool like the first time, because the pool is already imported. Wouldn’t it be possible to export the pool and then re-import it?
And secondly (sorry if you already answered this and I didn’t see it), if I add the disks to the existing pool, won’t it recognize that the data is already there and make it available again? I remember that this option wasn’t available before (back when the disk failed for the first time and when I first tried to import it).
If the pool is already imported now, what data are you seeing - and is it read-only or read-write?
No - the “add disk to pool” process is for adding a new disk and treating it as blank/overwriteable, which is the opposite of what we want to accomplish here, namely “import a disk with valid data and read from it” - so definitely don’t run any “Add this disk to extend an existing pool” operations.
Okay okay, thanks, and sorry if I ask stupid questions.
I’ll give that a try. A while ago I tried cloning the drive with Clonezilla, but it didn’t work — it finished way too fast to be a proper clone.
Hopefully this time I won’t run into any issues. Right now I don’t have a Linux machine available — the closest thing I have is a MacBook, but if needed, I can try booting Ubuntu from a USB stick on my main PC.
That’s likely just an artifact of the middleware, and your pool is not actually imported then. If you check from the command-line with zpool status it will be clear there.
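Something like the following from the shell would settle it - these are standard ZFS commands, not anything middleware-specific:

```
# What ZFS itself thinks is imported right now.
zpool status
# With no arguments, lists pools that are visible on disk but not imported.
zpool import
```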
“There are naive questions, tedious questions, ill-phrased questions, questions put after inadequate self-criticism. But every question is a cry to understand the world. There is no such thing as a dumb question.” - Carl Sagan
A live-OS like System-Rescue is a good option - just make sure that you disconnect, or at least don’t select for overwriting, any drives that you don’t want touched.
@Cusssy Personally, at this point I think you should accept that your data is gone and create a new pool from scratch, learning from your previous mistakes by NOT using SMR drives and by creating a redundant pool from the start, so that you have a usable NAS again.
Then if you have backups, you can restore from those.
I was thinking about it last night, but if I was able to regain access to the data recently, and it was only lost again because of a reboot, that makes me think it can still be saved.
Well - the actual data is still there on disk so there is always a possibility (however remote) of getting it back. The question is just how vital that data is, and how much effort (and possibly money) you are prepared to spend to recover it.
What you have are 3 disks, with the data for most files spread across all 3 of them, 1 disk of which is missing a whole bunch of the information that would normally be used to determine what each block of data on the disk is part of - like the partition table, the partition details, the zfs labels etc. etc. etc.
TBH, I am surprised that it ever worked.
Do you have the details of the exact command you used to clone the partition? Specifically, that would confirm whether the partition (which would start at 2MB) was written to the same position on the new disk or to the beginning of the disk. (My guess is that you copied the partition to the beginning of the disk, and that ZFS was clever enough on the previous occasions to work things out, but is failing this time.)
But I think that recovery actions now will very much depend on knowing this.
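If it helps to pin that down, something like this (with /dev/sdX standing in for the cloned Barracuda - adjust to the real device name) would show where the partition and the ZFS labels actually ended up:

```
# Print the partition table with start offsets in MiB.
sudo parted /dev/sdX unit MiB print
# Check for ZFS labels on the raw disk versus on the partition; whichever
# one prints valid labels is where the image was actually written.
# (The partition number is only an example - use whatever parted shows.)
sudo zdb -l /dev/sdX
sudo zdb -l /dev/sdX1
```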
I’m not sure if I understood you correctly, but I haven’t used any command to clone the disk so far. With ReclaiMe Pro, I created a .img file of the partition containing the data, and then I used Balena Etcher to burn that image onto the new Barracuda HDD I bought. I connected it to the server and it worked without any issues — until I restarted the server, and then the problem came back.
But this makes me think the issue is no longer with the damaged disk, but rather with how TrueNAS handles the pools (I’m speaking from limited knowledge here, so apologies if I say something wrong). It seems like it’s failing to import the pool because it thinks it’s already imported, just not connecting the disks for some reason. In the disk manager, it does recognize them as part of the “data” pool, and it even gives me the option to add them to it.
So I guess, as a last resort, I might try manually adding the disks (which TrueNAS recognizes as belonging to the “data” pool) back into the pool.
Forgive me for asking - but if cloning the .img made it work, couldn’t you just re-clone said .img onto the same HDD, and have it re-mount the pool, whereupon you then manually copy files off of it to a safe/secure/redundant location?
This will definitely cook your data if it gets the order to wipe your disks and add them to a new pool.
I am concerned that you cloned a partition to an entire disk. That can very easily lead to some of the ZFS label issues we saw. Can you clone like to like (disk to disk or partition to partition) and try again?
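As a rough sketch of what “like to like” could look like - assuming /dev/sdX is the source disk, /dev/sdY the destination, and the data lives on partition 2 (check first, that partition number is only an assumption):

```
# Copy the GPT partition table from the old disk onto the new one, so the
# data partition starts at exactly the same sector on both disks.
sudo sgdisk --replicate=/dev/sdY /dev/sdX
# Then clone the data partition itself, partition to partition.
sudo ddrescue -f /dev/sdX2 /dev/sdY2 clone.map
```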
That’s what I’ve done, but I still have the same problem. In the import pool menu, it no longer lets me import the “data” pool because it’s already imported but without the disk.
That’s what I’m doing now. I hope to reconnect it, but I doubt it will do anything; I think the problem is no longer in the disk, but in TrueNAS.
Please dump the tail end of /proc/spl/kstat/zfs/dbgmsg inside codeblocks, but I wager we are still dealing with “unable to rebuild a stripe with a missing member” as the root cause.
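For example, something like this should capture the relevant part (adjust the line count if needed):

```
# Last few hundred lines of the ZFS debug message buffer.
tail -n 200 /proc/spl/kstat/zfs/dbgmsg
```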