I have lost my only pool and hope someone can help me recover the data.
Equipment
OS = 24.10-RC2
MBO = MSI B550 GEN3 Gaming Motherboard
CPU = AMD Ryzen 7 5700G
RAM = 32 GB non-ECC DDR4 (2 x 16 GB), 3200 MT/s, CL16-18-18-38
Power Supply = Corsair RM750e, 750-watt
Boot Drive = Crucial P3 1TB PCIe NVMe M.2 SSD plugged into MBO slot
Array Drives = 5X WD Red Pro WD6003FFBX 6TB 7200 RPM 256MB Cache SATA 6.0Gb/s 3.5"
NIC = Built-in NIC on MBO and Dual Port PCIe X4 Intel 82576
I built the PC about a week ago and put all 5 WD drives in one pool named “home”. All was working fine. I installed DDNS Updater, Web File Manager, and Frigate (I had not yet got Frigate working). I shut down the unit to install the 2-port NIC in order to set up a bonded LAGG port, and I had to reboot several times to get the bond working properly. Once the NIC was working, I noticed other things were not working and found a problem with storage.
GUI
According to the GUI, the pool panel named home had an offline VDEV, and the ZFS Health panel said the Pool Status was offline. The Disk Health was good. At the top it said, “Disks with exported pools 6”. If you select “Add to Pool” and “Existing Pool”, the drop-down menu for Existing Pool is empty.
CLI
zpool import says “no pools available to import”
lsblk says
NAME        MAJ:MIN RM   SIZE RO TYPE MOUNTPOINTS
sda           8:0    0   5.5T  0 disk
└─sda1        8:1    0   5.5T  0 part
sdb           8:16   0   5.5T  0 disk
└─sdb1        8:17   0   5.5T  0 part
sdc           8:32   0   5.5T  0 disk
└─sdc1        8:33   0   5.5T  0 part
sdd           8:48   0   5.5T  0 disk
└─sdd1        8:49   0   5.5T  0 part
sde           8:64   0   5.5T  0 disk
└─sde1        8:65   0   5.5T  0 part
nvme0n1     259:0    0 931.5G  0 disk
├─nvme0n1p1 259:1    0     1M  0 part
├─nvme0n1p2 259:2    0   512M  0 part
└─nvme0n1p3 259:3    0   931G  0 part
It seems odd to me that the GUI still shows the pool named home albeit offline and yet the pool is not found in the dropdown or in the zpool command.
What can I do?
truenas_admin@truenas[~]$ sudo zpool status -v
  pool: boot-pool
 state: ONLINE
config:
    NAME         STATE   READ WRITE CKSUM
    boot-pool    ONLINE     0     0     0
      nvme0n1p3  ONLINE     0     0     0
I hope I am doing the HTML tag correctly. When I look at my reply to an earlier post about using the tags I get the impression I am not using them properly.
I think I now know what happened but not how to fix it. When I added the new NIC and rebooted, it changed the /dev/sd? order of the drives. I can see the before and after in syslog by comparing the serial numbers. Is there a means of putting them back in the right order? Thanks to the syslog, I know which dev device should go with which HDD.
The drive recognition order, as already stated, is not a factor.
In the TrueNAS GUI, do you have the option to Import Pool? If yes, click the button and hopefully you will see your pool there.
If not, then post the output of zpool import, and this time click the </> icon above the edit box. The edit screen will display what looks like three dots, with another three dots on the line below. They are really the ` (backtick) symbol. Paste your data between the two lines, so the top line is three ` and the bottom line is three ` as well.
If you can’t figure it out after another exchange of messages, I recommend you shut down, remove the NIC you installed, put the hardware back in the exact same configuration it was in previously, and power up. Hopefully your pool will begin to work.
If that fails, well, I don’t have a lot of personal experience with manually forcing a pool to import, especially when it is not listed.
You didn’t do a software update too, did you? If so, maybe roll back to the previous environment after putting the hardware back to the original configuration.
We found the paths for the drives were by name rather than by ID (e.g. /dev/sda1 instead of ‘/dev/disk/by-id/ata-WDC_WD6003FFBX-68MU3N0_V9G5A33L-part1’). To our understanding, this would mean that when the OS changed the sd? entries on the disks during reboot, the pool would not come up.
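For anyone comparing serial numbers against the current /dev/sd? names, here is a minimal sketch. It assumes by-id link names follow the pattern in the example above, with the serial after the last underscore; other drive vendors may name their links differently. serial_from_by_id is a hypothetical helper name, not a system command.

```shell
#!/bin/sh
# Hypothetical helper: pull the serial out of a by-id link name such as
# ata-WDC_WD6003FFBX-68MU3N0_V9G5A33L-part1.
# Assumption: the serial is whatever follows the last underscore.
serial_from_by_id() {
    name=${1##*/}                 # drop any leading directory
    name=${name%-part*}           # drop a trailing -partN suffix, if present
    printf '%s\n' "${name##*_}"   # keep the text after the last underscore
}

# Print "serial -> current kernel device" for each SATA whole-disk link.
for link in /dev/disk/by-id/ata-*; do
    [ -e "$link" ] || continue                 # the glob matched nothing
    case $link in *-part*) continue ;; esac    # skip partition links
    printf '%s -> %s\n' "$(serial_from_by_id "$link")" "$(readlink -f "$link")"
done
```

Running it before and after a hardware change would show whether a given serial moved from one sd? name to another.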
We re-imported the pool with zpool import using those disk IDs. This brought the pool back online. We had to do quite a bit of changing mount points after this to get everything working, as the mount points had all been changed to “/” instead of /mnt. We finally got it all working except one thing: Samba. This was because the System Dataset Pool was pointing at boot-pool. When we changed that to my pool, home, in System/Advanced, it all worked. Yeah!!!
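The exact command is not shown in this thread, so what follows is only a sketch of one common form of an import by disk ID, not necessarily what was run here. It assumes the pool name home from above. DRY_RUN and run are illustrative helpers, and the sketch only prints the commands unless DRY_RUN=0 is set (a real import needs root).

```shell
#!/bin/sh
# Assumptions: pool name "home" (from this thread); DRY_RUN and run() are
# illustrative helpers, not part of ZFS or TrueNAS.
POOL=home
DRY_RUN=${DRY_RUN:-1}

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        printf 'would run: %s\n' "$*"
    else
        "$@"
    fi
}

# List the pools ZFS can see when it scans the stable by-id links.
run zpool import -d /dev/disk/by-id
# Import the named pool through those links, with /mnt as the alternate root.
run zpool import -d /dev/disk/by-id -R /mnt "$POOL"
```

The -R option sets an alternate root for this import only. That may be relevant to the mount points landing under “/” instead of /mnt, but it is not persistent across reboots.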
Until… I did another reboot to get AD working correctly and it went back to pool offline. I looked in syslog for the shutdown and restart. In the shutdown I see it successfully unmount the pool's file systems, but in the restart I see “No ZFS Pool found” when it tries to remount. Running the zpool import command shown above and pointing the System Dataset Pool back to my pool gets it all running again.
Since we don’t know what is causing it to lose the pool at reboot, unless someone here has more suggestions, I guess it’s time to start with a fresh install.
Oh, btw, I did remove the new NIC card.
Even if I don’t get this working, I learned a lot about ZFS and TrueNAS.
Yes, that was confusing. I think the addition of the NIC was a red herring. The real cause was the reboot needed to install the NIC. I think it is very possible that was the first reboot since I built the NAS. As you know, one of the beauties of Linux is that it so rarely needs a reboot.
What I see now is, every time I reboot, the pool goes offline and is invisible in the GUI. Then I need to run that import command using the disk IDs…
I am still trying to figure out exactly why this is happening at reboot, as running that command each time I reboot is not a long-term option. A fresh build is still on my list of options, although it will mean re-copying a lot of files. Thankfully I do have backups.
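One way to narrow this down is to check, while the pool is imported, which device paths it has recorded and what its cachefile property says. This is a hedged sketch, not a confirmed diagnosis. unstable_paths is a hypothetical helper that filters `zpool status -P` output for /dev/sd? style paths, and the pool name home comes from this thread.

```shell
#!/bin/sh
# Hypothetical helper: read `zpool status -P` output on stdin and print any
# vdev paths that use kernel names (/dev/sdX...) instead of by-id links.
unstable_paths() {
    awk '$1 ~ /^\/dev\/sd/ { print $1 }'
}

# Only query ZFS where the tools and the pool are actually present.
if command -v zpool >/dev/null 2>&1 && zpool list home >/dev/null 2>&1; then
    zpool status -P home | unstable_paths   # -P prints full vdev paths
    zpool get cachefile home                # shows where the pool is cached
fi
```

If the first command prints nothing, the pool is already referencing paths other than /dev/sd?. A commonly suggested approach on TrueNAS is to export the pool from the shell and then import it through the GUI, so that the system itself records the pool. Whether that is the cause here is not confirmed by anything in this thread.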