TrueNAS SCALE Dragonfish 24.04.2.2 (the problem also occurred on 24.04.2)
128 GB memory
boot-pool: 1 SSD
app-pool: 2 SSD mirror
store: 9 x 8 TB HDD
I replaced one 8 TB disk because it reported 8 offline uncorrectable sectors.
During resilvering my system crashes. Nothing in the logs, nothing on screen; it goes straight to a reboot.
This has happened more than 10 times. The resilver sometimes reached 10%, occasionally more than 15%, but most of the time it got to only 6 or 7%.
I have replaced disks before and had no issues with resilvering then.
I pulled half my memory, same issue.
I pulled the rest and put back the half I had removed earlier, same issue.
I swapped my GPU for a simpler one, same issue.
I stopped all VMs and containers, same issue.
I changed /sys/module/zfs/parameters/zfs_scan_checkpoint_intval to 600 in the hope that the resilver would resume after a crash, but it always restarted from the beginning.
I removed the UPS the server was connected to, same issue.
I have an 850 W PSU.
The metering software on the plug shows no values over 250 W.
Nothing feels hot, and all fans are working.
Without resilvering, the server runs for weeks under heavy load without issue.
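For anyone who wants to repeat the checkpoint-interval change mentioned above, here is a minimal Python sketch of how the parameter could be read and set. The sysfs path is the one from this post; the helper names `read_param` and `write_param` are made up for illustration, and I am assuming the value is a plain integer number of seconds.

```python
from pathlib import Path

# Path as given in the post; the helper names below are hypothetical.
PARAM = Path("/sys/module/zfs/parameters/zfs_scan_checkpoint_intval")


def read_param(path: Path = PARAM) -> int:
    """Return the current value of the module parameter as an integer."""
    return int(path.read_text().strip())


def write_param(value: int, path: Path = PARAM) -> int:
    """Write a new value (needs root on a real system) and return the value read back."""
    if value <= 0:
        raise ValueError("interval must be a positive number of seconds")
    path.write_text(f"{value}\n")
    # Read back so the caller can confirm the kernel accepted the value.
    return read_param(path)
```

Run as root, `write_param(600)` should correspond to the change described above. As far as I know, values written under /sys/module do not survive a reboot, so after every crash the parameter would be back at its default unless it is set again.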
I have to say, while I was writing my answer I also thought this could be an issue. The new drive is connected to one of those PCI cards. I will swap it to a port located on the motherboard and try again.
Run the basic stress tests for CPU and RAM. It could be a failing motherboard, so make sure the system is stable. Also, you didn’t specify which motherboard you have. In fact, you didn’t specify any components, which is a shame, as that information could help.
Just to inform you: I installed a new “Inspur 9211-8i 6Gbps Hba Lsi Fw: P20 IT Modus Zfs Freenas Unraid + 2 * SFF-8087 Sata” from AliExpress.
Resilvering was faster than before and finished in one go.