It’s not the first time I have this problem, the issue is solved after a reboot but not after a “restart via truenas” (I have to shutdown then boot the server). What can I do to understand and prevent this problem, please ?
Please add the other hardware to your post and post the output of zpool status
Since you differentiate between the two reboots: are you running truenas virtualized?
Wait for it to fail again and save the output again. Also see what lsblk yields then.
HBAs are not my wheelhouse, maybe it needs cooling and starts overheating?
My best guess would be, judging from your screenshots that your HBA drops out and all disks are gone then. However it seems like your nvme drive for the apps pool, which is not connected to the HBA also drops out.
Would be interesting to scan the logs (potentially dmesg?) for clues.
Only reason it would happen is firmware resetting and OS losing access to the disks. That could be either environmental, hardware or software.
Upgrading would stress the software piece, obviously. Since we are not seeing reports like that its a starting point. Its easier to diagnose and get help on up to date software.
I would not only suggest an uplift to Dragonfish as your CPU is fairly new, but also look into whether or not you might be subject to some of the potential causes of the i9-14900K stability issues - I can’t speak in great detail to these but it may be a contributing factor. Also, ensure that your HBA is receiving sufficient cooling, especially as you seem to be using a closed-loop liquid cooler so your case fans might not be pushing as much air as expected.
Not part of your issue, but IT stands for Initatiator Target in this case, and not Information Technology.
Regarding not updating, pragmatically, you’re going to get better results if you can confirm the issue on the current software. And if you can’t, which is quite likely, then your issue is fixed.
You should at least update to Cobia, which is 1) on the upgrade path to Deagonfish, and 2) quite stable I believe.
And depending on your usage, it’s fairly easy to revert back to a previous boot environment.
Defiantly check the cooling on your HBA. Passively cooled HBAs need strong airflow, make sure you have a blowing over the heatsink.