Very slow resilvering speed

Question for the TrueNAS experts. BLUF: I am concerned about the resilvering speed of a replacement drive while in the middle of upgrading one of my vdevs (from 6 TB drives to 18 TB drives). I have a 12-bay Supermicro server with 3 vdevs of 4 drives each in RAIDZ1 (one drive of parity per vdev). I am removing the 6 TB drives from one vdev and replacing them with 18 TB drives. The first two swaps went pretty fast, less than 12 hours. The latest one is taking much longer, with write speeds to the drive of only 40 MB/s and an estimated completion time of 12 more days.

I have individually verified the throughput of the drives and none is less than 160 MB/s. I have upgraded to the latest version of TrueNAS SCALE Community. The SAS controller is a Broadcom / LSI SAS3224 PCI-Express Fusion-MPT SAS-3 on firmware version 13.0.0. The resilver speed is the same regardless of whether there is any other activity (I tried disabling SMB/NFS). Write performance over shares is significantly reduced, from ~180 MB/s to around 20 MB/s.

This is a home NAS and all critical data is backed up off site, so I’ve accepted the cost/redundancy trade-offs. My concern is that I’m going to lose the data on this NAS because there is some issue, hardware or software, that I am not aware of. If I replace the resilvering drive with the one just removed, it resilvers that one in about an hour and everything is back up and running. I greatly appreciate any input or suggestions.

/dev/sda is zfs mirrored boot pool
/dev/sdb-m is the main zfs storage pool
/dev/nvme0 is a cache drive for the main zfs storage pool
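
For a rough sanity check on the numbers above, here is a minimal back-of-envelope sketch. It assumes the worst case, that a full 6 TB of data has to be written to the replacement drive, and that the write rate is sustained; the function name and constants are illustrative only and are not taken from any ZFS tool.

```python
def resilver_hours(data_bytes: float, rate_mb_per_s: float) -> float:
    """Hours needed to write data_bytes at a sustained rate in MB/s (decimal)."""
    if rate_mb_per_s <= 0:
        raise ValueError("rate must be positive")
    seconds = data_bytes / (rate_mb_per_s * 1_000_000)
    return seconds / 3600


TB = 1_000_000_000_000
full_6tb = 6 * TB  # assumption: worst case, the old 6 TB drive was completely full

fast = resilver_hours(full_6tb, 160)  # rate measured on the individual drives
slow = resilver_hours(full_6tb, 40)   # rate observed during the slow resilver

print(f"160 MB/s: {fast:.1f} h")
print(f" 40 MB/s: {slow:.1f} h ({slow / 24:.1f} days)")
```

Under these assumptions, 160 MB/s gives about 10.4 hours, which is in line with the earlier swaps taking less than 12 hours, while a sustained 40 MB/s would still finish in under two days. An estimate of 12 more days therefore suggests the rate being projected over the whole resilver may be well below the 40 MB/s shown, although the estimate could also be based on something other than the data on a single drive.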

What HDDs are you using?

@NugentS Sure thing. The table of the current drives is below. XXX was removed and replaced with what is currently in sdj, YYY is a spare.

| NAME | MODEL | SIZE | ROTA | TYPE | TRAN | SERIAL | VENDOR | Manufacturer | Human-Readable Model Name |
|---|---|---|---|---|---|---|---|---|---|
| sdb | TOSHIBA HDWG51JUZSVA | 16.4T | 1 | disk | sas | 2590A00KFWQH | ATA | Toshiba | Toshiba N300 NAS 16TB 3.5" HDD – SATA 6Gb/s, 7,200RPM, 512MB Cache |
| sdc | TOSHIBA HDWG51JUZSVA | 16.4T | 1 | disk | sas | 25A0A01AFWQH | ATA | Toshiba | Toshiba N300 NAS 16TB 3.5" HDD – SATA 6Gb/s, 7,200RPM, 512MB Cache |
| sdd | TOSHIBA HDWG51JUZSVA | 16.4T | 1 | disk | sas | 15U0A007FWQH | ATA | Toshiba | Toshiba N300 NAS 16TB 3.5" HDD – SATA 6Gb/s, 7,200RPM, 512MB Cache |
| sde | ST6000VN001-2BB186 | 5.5T | 1 | disk | sas | ZR11CZC8 | ATA | Seagate | Seagate IronWolf 6TB NAS HDD – 3.5" SATA 6Gb/s, 5,400RPM, 256MB Cache |
| sdf | ST6000VX001-2BD186 | 5.5T | 1 | disk | sas | ZR11739F | ATA | Seagate | Seagate SkyHawk 6TB Surveillance HDD – 3.5" SATA 6Gb/s, ~5,425RPM, 256MB Cache |
| sdg | ST6000VX001-2BD186 | 5.5T | 1 | disk | sas | ZR118TA7 | ATA | Seagate | Seagate SkyHawk 6TB Surveillance HDD – 3.5" SATA 6Gb/s, ~5,425RPM, 256MB Cache |
| sdh | ST18000NT001-3NF101 | 16.4T | 1 | disk | sas | ZVTDPSXZ | ATA | Seagate | Seagate IronWolf Pro 18TB NAS HDD – 3.5" SATA 6Gb/s, 7,200RPM, 256MB Cache |
| sdi | WDC WD60EFRX-68MYMN1 | 5.5T | 1 | disk | sas | WD-WX11D9435NV6 | ATA | Western Digital | WD Red 6TB NAS HDD – 3.5" SATA 6Gb/s, 5,400–5,700RPM, 64MB Cache |
| sdj | ST8000DM004-2U9188 | 7.3T | 1 | disk | sas | ZR16D0QY | ATA | Seagate | Seagate BarraCuda 8TB Desktop HDD – 3.5" SATA 6Gb/s, 5,400RPM, 256MB Cache |
| sdk | ST18000NT001-3NF101 | 16.4T | 1 | disk | sas | ZVTE1GD9 | ATA | Seagate | Seagate IronWolf Pro 18TB NAS HDD – 3.5" SATA 6Gb/s, 7,200RPM, 256MB Cache |
| sdl | ST18000NT001-3NF101 | 16.4T | 1 | disk | sas | ZVTE1FGT | ATA | Seagate | Seagate IronWolf Pro 18TB NAS HDD – 3.5" SATA 6Gb/s, 7,200RPM, 256MB Cache |
| sdm | ST18000NT001-3NF101 | 16.4T | 1 | disk | sas | ZVTE1FTE | ATA | Seagate | Seagate IronWolf Pro 18TB NAS HDD – 3.5" SATA 6Gb/s, 7,200RPM, 256MB Cache |
| zd0 | | 100G | 0 | disk | - | - | - | | ZFS/Zvol or virtual block device (no physical drive) |
| nvme0n1 | Samsung SSD 970 EVO Plus 2TB | 1.8T | 0 | disk | nvme | S6S2NS0TA28800B | - | Samsung | Samsung 970 EVO Plus 2TB NVMe M.2 SSD – PCIe 3.0 x4, up to 3,500MB/s read, 3,300MB/s write |
| XXX | WUH721818ALE6L4 | 16.4T | 1 | disk | sas | <serial_here> | ATA | Western Digital | WD Ultrastar DC HC550 18TB Enterprise HDD – 3.5" SATA/SAS 6Gb/s, 7,200RPM, 512MB Cache, HelioSeal |
| YYY | WD181KFGX | 16.4T | 1 | disk | sas | <serial_here> | ATA | Western Digital | WD Red Pro 18TB NAS HDD – 3.5" SATA 6Gb/s, 7,200RPM, 512MB Cache, CMR, NASware 3.0 |

I don’t know if this is your issue, but sdj is a BarraCuda, which is probably an SMR drive and may be causing the problem.

SMR vs CMR ServeTheHome

@SmallBarky - yes sir, that certainly looks like it could be the problem. I verified that the drive is SMR. That Seagate drive is on the way back to Amazon to be replaced with Toshiba N300 8TB NAS 3.5-Inch Internal Hard Drive - CMR SATA 6 GB/s 7200 RPM 512 MB Cache - HDWG780XZSTA.

I will post back with the results of that. Much appreciated.

I just checked another 6TB drive I had attempted to resilver to, which was equally slow - WD60EZAZ - and confirmed it is SMR as well. I think you figured out the problem; I can’t believe I didn’t realize these drives were SMR and that it made such a huge difference during resilver. I will know for sure tomorrow when the replacement drive arrives.
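
For anyone wanting to run the same check across a whole pool, here is a minimal sketch that flags drives by model string. It assumes you already have (name, model) pairs, for example copied from the MODEL column of the table above. The list only contains the two models confirmed as SMR in this thread, so it is nowhere near complete, and `flag_smr` is a made-up helper rather than part of any existing tool.

```python
# Models confirmed as SMR in this thread only; this is NOT a complete list.
KNOWN_SMR_MODELS = ("ST8000DM004", "WD60EZAZ")


def flag_smr(drives):
    """Return device names whose model string contains a known SMR model number."""
    flagged = []
    for name, model in drives:
        if any(smr in model.upper() for smr in KNOWN_SMR_MODELS):
            flagged.append(name)
    return flagged


# Sample (name, model) pairs taken from the drive table in this thread.
drives = [
    ("sdh", "ST18000NT001-3NF101"),
    ("sdi", "WDC WD60EFRX-68MYMN1"),
    ("sdj", "ST8000DM004-2U9188"),
]
print(flag_smr(drives))  # ['sdj']
```

A drive that is not flagged is not necessarily CMR; it only means the model is not in this short list, so the manufacturer's published CMR/SMR information is still the thing to check.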

I thought the report in the link was very fair in not condemning SMR drives entirely, and I agree with the conclusion that they are completely unsuitable for use with NAS/ZFS.

I suspected SMR when you mentioned very slow resilver.

:slight_smile:

BTW, I did extensive searching for possible causes of slow resilver with paid accounts set to the highest level of research mode on Grok, Copilot and ChatGPT, and there was never a single mention of SMR. I specified excluding hardware failures; maybe that screwed it up. You’d think at this point those tools would at least be good for what is essentially searching the internet for you, but apparently not.

You trust a dumb pattern matching program?

Mind you, I am fairly surprised it didn’t bring anything up - it’s a fairly well-known issue in the technical press.

I did a Google search for “ZFS Slow Resilver” and there is nothing obvious in the first page of results that mentions SMR - you have to follow up on some of the links. Weird.

No, haha, I don’t trust them at all, but I do know they can run 5000 search queries faster than I can manually. I’ve found them to be reasonably useful for similar search type activities - and it did generate the table of hard drives with the human readable make/model column. I don’t pay for them, I have accounts through work.

Don’t forget to enable “ghost compression” with zfs set compression=ghost.

Bad @winnielinnie

:rofl:

zfs set superresilver=enabled - also known as “not using SMR”. Looks good now, I’ll mark as solved. Thanks again all.

Wow did that ever work, I am over 100% resilvered:
