Motherboard overheating?

I’m a near total noob, so let me get that out first.

I’m wondering if I’ve correctly identified a problem in my personal TrueNAS Scale setup. I believe that my problems were down to motherboard overheating, but I’m not entirely sure.

Right now, everything is working fine, but since I was on the verge of starting a topic here asking for help, I hope you don’t mind me starting it anyway. I got to where I am partly by working from answers in this forum and by reading the help topics in the CLI (sudo zfs help…), which are all pretty extensive and well written, btw.

I had some pool health errors that I was troubleshooting. One of my disks would stop working: it would come online at times and then disappear, I’d get scrub errors, and I had no idea what was wrong with it. It was a new HDD, and I suspected it was faulty, so I relegated it to bare backup duty.

Then I started to get other weird system problems. There were instances where the web interface would report all disks working fine, but I couldn’t connect to one of them from the PC side. In certain cases I couldn’t connect to the web interface at all. Transfers would start fast, then usually crawl down to a low speed, and they got slower the longer the system ran.

In the end, I isolated the problem as the motherboard (or something) overheating. HDD and disk temperatures all appeared fine, but I noticed that the problems usually became more severe with time, so I started to suspect overheating. When I built the system, I started with case fans, but after a bit of stress testing I disabled them in an attempt to make the system quieter. But that was winter, and now it’s summer. I fitted a single case fan and had it run at 100%, and suddenly all my problems went away.

The NAS is now stable. I just had to spend a day trying to get one offline HDD back into the pool, since I had messed it up by moving SATA connectors around. Export/Disconnect followed by a proper import fixed that. It also spared me a more embarrassing topic of me begging for help.

Anyway, all good for now. Still, could it have been anything else that was overheating? The CPU in some way, or something about the HDDs?

Welcome to the forums.

You need to provide a breakdown of your system hardware, including the HDD models. We like to make sure you are not using an SMR drive, because bad things happen when those are used with ZFS.

In the meantime, I highly recommend you run MemTest86+ for 5 complete passes and then if those pass, a CPU stress test for 6 hours. This will test a good part of your hardware out and keep you busy for at least 1 day if not longer.

In the meantime you can list your hardware configuration. Be specific, it matters in most cases.

Things work. Something was overheating, and now it no longer is. That much is certain. I can’t be certain that it was the motherboard; I was just wondering if anyone has experienced something like this before.
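
One way to narrow down which component was running hot is to log every temperature sensor the Linux kernel exposes and see which readings climb over time. The following is only a minimal sketch, assuming the sensors show up under the standard hwmon sysfs tree (/sys/class/hwmon) as temp*_input files holding millidegrees Celsius. Which sensors appear there (CPU, motherboard or chipset, drives) depends on the board and on the loaded kernel drivers, so a missing motherboard sensor is entirely possible. The function name read_hwmon_temps is made up for this example.

```python
from pathlib import Path


def read_hwmon_temps(root="/sys/class/hwmon"):
    """Return {"hwmonN:chip/label": degrees_celsius} for each temp*_input under root.

    Assumes the standard Linux hwmon layout; which sensors exist depends on
    the hardware and the loaded kernel drivers.
    """
    readings = {}
    for chip_dir in sorted(Path(root).glob("hwmon*")):
        name_file = chip_dir / "name"
        chip = name_file.read_text().strip() if name_file.exists() else chip_dir.name
        for input_file in sorted(chip_dir.glob("temp*_input")):
            try:
                # hwmon reports temperatures in millidegrees Celsius
                millideg = int(input_file.read_text().strip())
            except (OSError, ValueError):
                continue  # skip sensors that cannot be read or parsed
            prefix = input_file.name[: -len("_input")]  # e.g. "temp1"
            label_file = chip_dir / f"{prefix}_label"
            label = label_file.read_text().strip() if label_file.exists() else prefix
            readings[f"{chip_dir.name}:{chip}/{label}"] = millideg / 1000.0
    return readings


if __name__ == "__main__":
    for sensor, temp_c in read_hwmon_temps().items():
        print(f"{sensor}: {temp_c:.1f} C")
```

Running this every minute or so during a long transfer, once with the case fan at full speed and once with it slowed down, would show whether any sensor other than the disks keeps rising. If nothing rises, the hot part may simply have no sensor that the kernel can read.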