I’m a near-total noob, so let me get that out of the way first.
I’m wondering if I’ve correctly identified a problem in my personal TrueNAS Scale setup. I believe that my problems were down to motherboard overheating, but I’m not entirely sure.
Right now, everything is working fine, but since I was on the verge of starting a topic here asking for help, I hope you don’t mind me starting it anyway. I got to where I am partly by working from answers in this forum and reading the help topics in the CLI (sudo zfs help…), which are all pretty extensive and well written, btw.
I had some pool health errors that I was troubleshooting. One of my disks would stop working: it would come online at times and then disappear, I’d get scrub errors, and I had no idea what the problem was. It was a new HDD, and I suspected it was faulty, so I relegated it to bare backup duty.
Then I started to get other weird system problems. There were instances where the web interface would report all disks working fine, but I couldn’t connect to one of them from the PC side. My data transfer speeds would get slower with time: transfers would start fast, then usually crawl to a low speed. In certain cases I couldn’t connect to the web interface.
In the end, I isolated the problem as the motherboard (or something) overheating. HDD and disk temperatures all appeared fine, but I noticed that the problems usually became more severe with time, so I started to suspect overheating. When I built the system, I started with case fans, but after a bit of stress testing I disabled them in an attempt to make the system quieter. But that was winter, and now it’s summer. I fitted a single case fan and had it run at 100%, and suddenly all my problems went away.
The NAS is now stable. I just had to spend a day trying to get one offline HDD back into the pool, since I had messed it up by moving the SATA connectors around. Export/Disconnect and a later proper import fixed that. It also spared me a more embarrassing topic of me begging for help.
Anyway, all good for now. Still, could it be anything else that was overheating? The CPU in some way, or something about the HDDs?
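In case it helps narrow this down, here is a minimal sketch of one way to look at sensors other than the drives. It assumes the board exposes its sensors through the standard Linux thermal zone files under /sys/class/thermal, which not every motherboard does. The function name read_thermal_zones is made up for this example and is not part of TrueNAS.

```python
from pathlib import Path


def read_thermal_zones(base="/sys/class/thermal"):
    """Return a list of (zone_type, temp_celsius) for every readable zone.

    Assumption: sensors appear as thermal_zone*/type and thermal_zone*/temp,
    with temp given in millidegrees Celsius (the usual Linux convention).
    """
    readings = []
    for zone in sorted(Path(base).glob("thermal_zone*")):
        try:
            kind = (zone / "type").read_text().strip()
            millideg = int((zone / "temp").read_text().strip())
        except (OSError, ValueError):
            # Some zones cannot be read or return non-numeric data; skip them.
            continue
        readings.append((kind, millideg / 1000.0))
    return readings


if __name__ == "__main__":
    for kind, temp in read_thermal_zones():
        print(f"{kind}: {temp:.1f} C")
```

Running this a few times while a long transfer is going (for example with python3 from the shell) should show whether any zone climbs as the slowdown sets in. If nothing is listed, the board probably does not report sensors this way, and the temperatures would have to be read some other way.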