Problems With High CPU Load & Ceashing

First my trueNAS host machine:

TrueNAS scale version: Dragonfish-24.04.2

Oddball generic x79 motherboard: Amazon.com: Gaming Motherboard Equipped, X79D 2.0 Computer Motherboard, LGA2011 M.2 NVM Micro ATX Gaming Motherboard with Dual Channel DDR3 for Xeon E5 : Electronics

ELUTENG PCI-E SATA Expansion Card

Radeon r7 370 GPU

Xeon E5 2640 v0

16GB DDR3 memory (non ECC)

Thermaltake SMART 600W ATX (80+ white)

256 team group boot ssd

6x 2TB HDDS

Issue:
Using the immich app, the initial CPU Load is expectedly high when the first backup starts, it runs at a max of ~81% completely fine for a few minutes, then the entire PC shuts off. No obvious errors. Pretty sure my first problems would be CPU and/or non ECC memory. Otherwise I’m not very sure (outside of a very peculiar collection of generic parts). Any insight is appreciated.

The SATA expansion card is most likely a problem. Usual advice is LSI HBA cards making sure they are in ‘IT mode’, depending on model. No RAID cards unless they can be put in ‘IT mode’

You need to run all you hardware and software through a burn in test / Stress testing. Test everything.

An extra detail about the sata expansion, only 2 out of the 6 are connected to that HBA. My pools are 2x RAIDZ1 * 3. Does that still sound like the HBA is probably the issue?

It could be. I was thinking something may be overheating. Did you look at CPU temps too?
Cheap SATA expansion cards are usually frowned upon because of unknown or intermittent trouble. That’s why I suggested stress testing since you have an unusual setup of unknown motherboard, non ecc ram (that might work fine with that motherboard) and SATA card.
A bit unusual pool config of 2 Vdevs of 3 wide RAIDZ1.
What was the reason for choosing that layout?

I only even learned what RAID stood for a few months ago, when I started this build I didn’t even know what zfs was. I was mistakenly trying to create a raid 10 equivalent but just fumbled it and now I’m kind of stuck currently trying to come up with a way to reconfigure it.

During heavy workload, the integrated Realtek NIC of my old system do the same that is happening to you, fail badly and crash/reboot.
Check this too

1 Like

Have you tried to run hardware stress tests on your setup? Tests for cpu, ram, etc? Your bios might have it built in or you can search the internet for bootable cd / usb versions. They are usually mentioned on gaming or pc building websites and forums. I am not sure what everyone recommends these days. Choose from well know and trusted websites for downloads.