Just as the title says, TrueNAS will crash randomly.
When I go to the server and check the monitor, it shows a single line along the lines of: “MiddleWare has stopped, press enter to use shell”.
At that point I physically restart the machine, and TrueNAS starts about 90% of the time. The other 10% of the time, the middleware fails to start.
When TrueNAS does start up, I go into Storage and find that my drives have been disconnected from the Pool. The odd part is that the pool remains with 0 drives in it. In order to get back up and running I use the export/disconnect option to get rid of the ghost and then re-import my pool.
This issue was happening multiple times per day. I got rid of my LSI SAS 9207-8i and have plugged all of my drives directly into SATA connections on the motherboard. Since then I have only had one crash and that was about 72 hours after the change.
This pool has 5x10TB drives with a 500GB NVMe cache.
My boot pool is 2x1TB SSDs.
I’m running off of an old ASUS Prime board with a Ryzen 5 3600X.
Worst case I’ll have to buy a new board, but I’m curious if anyone else has had these problems.
It does sound like a hardware issue, and the tricky part now is identifying what exactly. Common things to explore are overheating and power-related issues.
Could you provide your hardware specs in detail? Someone may be able to help further.
PS: can you also share the output of zpool status from the shell?
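If it helps to pull out the key fields before posting, here is a minimal Python sketch. It assumes the usual `key: value` header layout of `zpool status` output; the function names `read_zpool_status` and `parse_zpool_header` are made up for illustration, and it only looks at the first pool in the output.

```python
import subprocess

# Header labels that normally appear before the "config:" device table.
HEADER_KEYS = {"pool", "state", "status", "action", "see", "scan"}


def read_zpool_status():
    """Run `zpool status -v` and return its text (needs ZFS installed)."""
    result = subprocess.run(
        ["zpool", "status", "-v"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def parse_zpool_header(text):
    """Collect the header fields of the first pool in `zpool status` text.

    Wrapped lines (long status/action messages) are appended to the
    field they continue. Parsing stops at "config:" because the device
    table that follows uses a different layout.
    """
    fields = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        key, sep, value = line.partition(":")
        if sep and key == "config":
            break
        if sep and key in HEADER_KEYS:
            current = key
            fields[current] = value.strip()
        elif current is not None:
            # Continuation of the previous field's wrapped text.
            fields[current] = (fields[current] + " " + line).strip()
    return fields
```

From the TrueNAS shell, something like `parse_zpool_header(read_zpool_status())["state"]` would then give the pool state. The full, unparsed output is still the most useful thing to paste here.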
You haven’t given us many things to work with, but you did mention the CPU in use.
If you haven’t already, do this:
Go into your BIOS and set Power Supply Idle Control to Typical.
If you can’t find that option, update your BIOS and look for it again. Mind you, updating the BIOS sometimes resets some settings, so be sure to note any changes you have made before you update it.
If you still can’t find it, disable the C6 sleep state instead.
Your generation of CPU had an issue where it could fall into a deep sleep it couldn’t wake up from; the above addresses that specific issue.
I’ll add that the above may help with the crashes, but not necessarily with having to reimport your pool manually after restarts. So be sure to heed Johnny’s and others’ requests for information to troubleshoot that part further.
pool: Lorge
state: DEGRADED
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
scan: resilvered 135M in 00:00:02 with 0 errors on Tue Jul 1 11:52:47 2025
config: