Running tests locally with something like TN-Bench will help you rule out inefficiencies in networking or sharing protocols.
There’s also lots of context in that thread from my own venture into understanding bottlenecks in less-than-ideal NVMe systems.