EDIT: Added results with and without the Optane SLOG.
Results for both:
In async mode this performs much better, as expected, with 8 jobs running, a 128k block size and an iodepth of 16. It’s hitting about 2 GB/sec, which is what I would expect for this layout:
TEST: (groupid=0, jobs=8): err= 0: pid=785181: Sat Sep 7 22:08:48 2024
write: IOPS=16.0k, BW=2002MiB/s (2100MB/s)(235GiB/120008msec); 0 zone resets
slat (nsec): min=1000, max=15458k, avg=4962.07, stdev=15026.34
clat (usec): min=109, max=39507, avg=7971.02, stdev=1088.27
lat (usec): min=117, max=39510, avg=7975.99, stdev=1088.08
clat percentiles (usec):
| 1.00th=[ 2089], 5.00th=[ 7767], 10.00th=[ 7832], 20.00th=[ 7898],
| 30.00th=[ 7898], 40.00th=[ 7963], 50.00th=[ 7963], 60.00th=[ 7963],
| 70.00th=[ 7963], 80.00th=[ 8029], 90.00th=[ 8291], 95.00th=[ 8586],
| 99.00th=[ 9634], 99.50th=[10683], 99.90th=[22152], 99.95th=[25560],
| 99.99th=[28967]
bw ( MiB/s): min= 1806, max= 7010, per=100.00%, avg=2003.37, stdev=41.00, samples=1912
iops : min=14448, max=56082, avg=16026.95, stdev=328.00, samples=1912
lat (usec) : 250=0.01%, 500=0.01%, 750=0.10%, 1000=0.40%
lat (msec) : 2=0.48%, 4=0.24%, 10=98.12%, 20=0.53%, 50=0.13%
cpu : usr=1.70%, sys=0.47%, ctx=969751, majf=17, minf=182
IO depths : 1=0.1%, 2=0.1%, 4=0.1%, 8=50.1%, 16=49.8%, 32=0.0%, >=64=0.0%
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
complete : 0=0.0%, 4=95.7%, 8=1.9%, 16=2.4%, 32=0.0%, 64=0.0%, >=64=0.0%
issued rwts: total=0,1922487,0,0 short=0,0,0,0 dropped=0,0,0,0
latency : target=0, window=0, percentile=100.00%, depth=16
Run status group 0 (all jobs):
WRITE: bw=2002MiB/s (2100MB/s), 2002MiB/s-2002MiB/s (2100MB/s-2100MB/s), io=235GiB (252GB), run=120008-120008msec
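As a quick sanity check on the async numbers, the reported IOPS and bandwidth can be recomputed from the issued write count and the runtime. This is only a rough sketch: it assumes `bs=128k` means 128 KiB per write, and the variable names are made up for illustration.

```python
# Figures copied from the async run's fio output above.
issued_writes = 1_922_487   # from "issued rwts: total=0,1922487,0,0"
runtime_ms = 120_008        # from "run=120008-120008msec"
block_size_kib = 128        # assumption: bs=128k is 128 KiB per write

runtime_s = runtime_ms / 1000
total_mib = issued_writes * block_size_kib / 1024

iops = issued_writes / runtime_s
bandwidth_mib_s = total_mib / runtime_s

print(f"{iops:.0f} IOPS, {bandwidth_mib_s:.0f} MiB/s")
```

That lands on roughly 16.0k IOPS and 2002 MiB/s, which lines up with the summary lines fio printed.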
With sync set to always, it’s getting to about 700 MB/sec with the Optane SLOG:
TEST: (groupid=0, jobs=8): err= 0: pid=787926: Sat Sep 7 22:18:58 2024
write: IOPS=5682, BW=710MiB/s (745MB/s)(83.2GiB/120016msec); 0 zone resets
slat (nsec): min=1641, max=1679.0k, avg=5267.34, stdev=5489.67
clat (usec): min=4472, max=78257, avg=22499.75, stdev=5464.15
lat (usec): min=4479, max=78262, avg=22505.02, stdev=5464.25
clat percentiles (usec):
| 1.00th=[12518], 5.00th=[14091], 10.00th=[16909], 20.00th=[20841],
| 30.00th=[22152], 40.00th=[22676], 50.00th=[22938], 60.00th=[23200],
| 70.00th=[23462], 80.00th=[23987], 90.00th=[24249], 95.00th=[24773],
| 99.00th=[49546], 99.50th=[62653], 99.90th=[72877], 99.95th=[73925],
| 99.99th=[74974]
bw ( KiB/s): min=622592, max=906496, per=100.00%, avg=727323.94, stdev=6124.81, samples=1912
iops : min= 4864, max= 7082, avg=5682.17, stdev=47.85, samples=1912
lat (msec) : 10=0.01%, 20=17.28%, 50=81.78%, 100=0.93%
cpu : usr=0.71%, sys=0.24%, ctx=343281, majf=0, minf=182
IO depths : 1=0.1%, 2=0.1%, 4=0.1%, 8=50.0%, 16=50.0%, 32=0.0%, >=64=0.0%
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
complete : 0=0.0%, 4=95.8%, 8=1.7%, 16=2.5%, 32=0.0%, 64=0.0%, >=64=0.0%
issued rwts: total=0,681974,0,0 short=0,0,0,0 dropped=0,0,0,0
latency : target=0, window=0, percentile=100.00%, depth=16
Run status group 0 (all jobs):
WRITE: bw=710MiB/s (745MB/s), 710MiB/s-710MiB/s (745MB/s-745MB/s), io=83.2GiB (89.4GB), run=120016-120016msec
With sync set to always, it’s getting to about 300 MB/sec **without an SLOG**:
TEST: (groupid=0, jobs=8): err= 0: pid=797163: Sat Sep 7 22:50:22 2024
write: IOPS=2333, BW=292MiB/s (306MB/s)(34.2GiB/120038msec); 0 zone resets
slat (nsec): min=1400, max=151983, avg=7100.93, stdev=3644.86
clat (usec): min=32149, max=75803, avg=54819.66, stdev=1893.37
lat (usec): min=32151, max=75807, avg=54826.76, stdev=1893.41
clat percentiles (usec):
| 1.00th=[45876], 5.00th=[51643], 10.00th=[52167], 20.00th=[54789],
| 30.00th=[54789], 40.00th=[55313], 50.00th=[55313], 60.00th=[55313],
| 70.00th=[55313], 80.00th=[55837], 90.00th=[55837], 95.00th=[55837],
| 99.00th=[58459], 99.50th=[58983], 99.90th=[60556], 99.95th=[62653],
| 99.99th=[65274]
bw ( KiB/s): min=276480, max=360448, per=100.00%, avg=298751.27, stdev=1195.38, samples=1912
iops : min= 2160, max= 2816, avg=2333.99, stdev= 9.34, samples=1912
lat (msec) : 50=2.59%, 100=97.41%
cpu : usr=0.39%, sys=0.10%, ctx=140067, majf=0, minf=174
IO depths : 1=0.1%, 2=0.1%, 4=0.1%, 8=50.0%, 16=50.0%, 32=0.0%, >=64=0.0%
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
complete : 0=0.0%, 4=95.8%, 8=0.1%, 16=4.2%, 32=0.0%, 64=0.0%, >=64=0.0%
issued rwts: total=0,280064,0,0 short=0,0,0,0 dropped=0,0,0,0
latency : target=0, window=0, percentile=100.00%, depth=16
Run status group 0 (all jobs):
WRITE: bw=292MiB/s (306MB/s), 292MiB/s-292MiB/s (306MB/s-306MB/s), io=34.2GiB (36.7GB), run=120038-120038msec
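One way to read the three runs together is through Little's law: with a fixed number of writes in flight, IOPS is roughly the in-flight count divided by the average latency. The sketch below assumes every job keeps its queue full (8 jobs x 16 iodepth = 128 outstanding writes), which is an assumption rather than something fio reports directly. The latencies, IOPS and bandwidth figures are copied from the outputs above; the names are just for illustration.

```python
# Assumption: all 8 jobs keep their full iodepth of 16 outstanding.
IN_FLIGHT = 8 * 16

# (label, avg total latency in usec, reported avg IOPS, reported MiB/s)
runs = [
    ("async", 7975.99, 16026.95, 2002),
    ("sync=always, Optane SLOG", 22505.02, 5682.17, 710),
    ("sync=always, no SLOG", 54826.76, 2333.99, 292),
]


def predicted_iops(avg_lat_usec: float) -> float:
    """Little's law: throughput = in-flight requests / average latency."""
    return IN_FLIGHT / (avg_lat_usec / 1_000_000)


for label, lat_usec, reported_iops, _bw in runs:
    estimate = predicted_iops(lat_usec)
    error_pct = 100 * abs(estimate - reported_iops) / reported_iops
    print(f"{label}: predicted {estimate:.0f} IOPS, "
          f"reported {reported_iops:.0f} ({error_pct:.2f}% off)")

# Relative bandwidth, taken from the WRITE summary lines.
slog_speedup = runs[1][3] / runs[2][3]    # SLOG vs no SLOG, both sync
sync_fraction = runs[1][3] / runs[0][3]   # sync with SLOG vs async
print(f"SLOG speedup over no SLOG: {slog_speedup:.2f}x")
print(f"sync with SLOG as a fraction of async: {sync_fraction:.0%}")
```

The predictions come out within about 1% of the reported IOPS in all three cases, so the throughput differences are explained almost entirely by per-write latency (about 8 ms async, 22.5 ms with the SLOG, 55 ms without). By the bandwidth figures, the SLOG is worth roughly 2.4x for sync writes, while still reaching only around a third of async throughput.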