Over the last two weeks, I tried out the RAIDZ expansion feature with an 18TB drive in my main pool. I encountered a few issues that could be handled better. The current RAIDZ expansion process could give the user better information and put less strain on the system.
Estimated Duration Information:
Clearly inform the user of the estimated duration for the RAIDZ expansion process. This helps users set realistic expectations and plan accordingly (~50MB/s, https://openzfs.org/w/images/5/5e/RAIDZ_Expansion_2023.pdf).
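As a rough illustration of the kind of estimate the UI could show up front, here is a minimal Python sketch. It assumes the rate stays constant and that the T and M suffixes printed by zpool are binary units (TiB and MiB/s). The function name is made up for this example, and the figures are the ones from the zpool status output quoted in this post.

```python
SECONDS_PER_DAY = 86_400
MIB_PER_TIB = 1024 * 1024


def estimate_expansion_days(data_tib: float, rate_mib_per_s: float) -> float:
    """Estimate how many days it takes to copy data_tib at a constant rate.

    Assumption: the rate does not change over the whole run, which is
    optimistic if scrubs or other heavy I/O compete with the expansion.
    """
    if rate_mib_per_s <= 0:
        raise ValueError("rate must be positive")
    seconds = data_tib * MIB_PER_TIB / rate_mib_per_s
    return seconds / SECONDS_PER_DAY


# Figures taken from the status output in this post: 50.9T at 54.8M/s.
days = estimate_expansion_days(50.9, 54.8)
print(f"roughly {days:.1f} days")
```

For these numbers the estimate comes out at a bit over 11 days, so even a crude calculation like this would have been enough to warn me before I started.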
Automatic Deactivation of Scrubs:
During the expansion process, automatically deactivate scrubs for the affected pool. This prevents heavy I/O wait, which can severely impact system performance. Once the expansion is completed, scrubs can be reactivated.
Progress Display in Task Manager:
Integrate detailed progress information from zpool status into the Task Manager. This includes the current data transfer rate, percentage completed, and estimated time to completion. For example:
expand: expansion of raidz1-0 in progress since Wed Jun 19 01:12:14 2024
1.99T / 50.9T copied at 54.8M/s, 3.91% done, 10 days 19:48:35 to go
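To show that these fields are easy to pick out for a progress display, here is a small Python sketch that parses the progress line above. The regular expression is only my guess at the format based on this one sample, and the function and field names are hypothetical, not anything TrueNAS actually uses.

```python
import re
from typing import Optional

# Pattern derived from the single sample line above; other OpenZFS versions
# or states (for example a finished expansion) may print something different.
EXPAND_PROGRESS = re.compile(
    r"(?P<copied>[\d.]+[KMGTP]?) / (?P<total>[\d.]+[KMGTP]?) copied at "
    r"(?P<rate>[\d.]+[KMGTP]?)/s, (?P<percent>[\d.]+)% done, "
    r"(?P<eta>.+?) to go"
)


def parse_expand_progress(status_text: str) -> Optional[dict]:
    """Return the progress fields as a dict, or None if no progress line is found."""
    match = EXPAND_PROGRESS.search(status_text)
    if match is None:
        return None
    fields = match.groupdict()
    fields["percent"] = float(fields["percent"])  # keep the rest as printed
    return fields


sample = (
    "expand: expansion of raidz1-0 in progress since Wed Jun 19 01:12:14 2024\n"
    "1.99T / 50.9T copied at 54.8M/s, 3.91% done, 10 days 19:48:35 to go"
)
progress = parse_expand_progress(sample)
print(progress)
```

The sizes and the rate are left as the strings zpool printed, so the UI could show them unchanged next to a progress bar driven by the percentage.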
I agree. I couldn’t even find the option; I kept trying from Expand and other entry points like the ‘you have 1 disk unused’ tile. I agree with all your points.
Actually, most/all of the data in the vdev gets shuffled around. It stays at the same offset within the vdev, but it gets shuffled around to match the new geometry.
It preserves existing parity, so it keeps the data.
It allows for online expansion, so reads and writes can continue immediately.
But it shuffles the data to free the space, and that takes a while.
It took ~11 days (~55M/s). The Raidz 5 pool was 75% full, 4x 18TB. Now it is 5x 18TB. I’m on the nightly train. It would have taken longer if I hadn’t stopped the scrubbing. In the netdata screenshot you can see what happened right after I stopped the scrubbing.
The stable build came out today. Unfortunately, the user experience has not been improved. I have the feeling that TrueNAS is working on expanding my 4x 4TB Z1 pool to 5x 4TB, but the interface got stuck at the same screen as the OP’s.
Wonder if there is a way to check the intermediate progress manually?
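One way to check manually is to run zpool status for the pool in a shell and look at the expand section. Below is a minimal Python sketch that does the same and keeps only that section. The pool name tank is a placeholder, and the section detection is an assumption based on the output quoted earlier in this thread, so treat it as a starting point rather than a tested tool.

```python
import re
import subprocess
from typing import List

# Assumption: section headers in zpool status look like "  scan:" or "config:".
SECTION_HEADER = re.compile(r"^\s*[a-z]+:")


def expansion_status_lines(status_text: str) -> List[str]:
    """Return the 'expand:' line plus its continuation lines, or [] if absent."""
    collected: List[str] = []
    for line in status_text.splitlines():
        if line.lstrip().startswith("expand:"):
            collected.append(line.strip())
        elif collected:
            # Stop at a blank line or at the next section header.
            if not line.strip() or SECTION_HEADER.match(line):
                break
            collected.append(line.strip())
    return collected


def read_pool_status(pool: str) -> str:
    """Run 'zpool status <pool>' and return its output (needs ZFS installed)."""
    result = subprocess.run(
        ["zpool", "status", pool], capture_output=True, text=True, check=True
    )
    return result.stdout


# Usage on the NAS itself ("tank" is a placeholder pool name):
#   for line in expansion_status_lines(read_pool_status("tank")):
#       print(line)
```

If the list comes back empty, the pool is not reporting an expansion at all; otherwise the second line should carry the copied amount, rate and time remaining.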
I am running a RAIDZ expansion (it says it’s going to take 10 days!) and I have also had a poor UI experience.
First, the job sat all day at 25%. It happened that I needed to restart the server, and since I had no idea how long this was going to take I just bit the bullet and did it.
When it came back the drive showed as unavailable. Ok, my server is kind of weird, I can accept that.
Upon rebooting again, all drives were visible. The pool showed the same capacity as before, but with all 5 drives. There was no indication that the expansion was still happening.
I confirmed via the shell that the expansion is in fact still happening, though it is going to take a very long time >_>