[L0 v2][CTS] Fix problems reported by SYCL e2e tests #2516

Open: wants to merge 12 commits into base: main

Conversation

igchor
Member

@igchor igchor commented Jan 2, 2025

No description provided.

@github-actions github-actions bot added labels on Jan 2, 2025: common (Changes or additions to common utilities), conformance (Conformance test suite issues), level-zero (L0 adapter specific issues)

github-actions bot commented Jan 9, 2025

Compute Benchmarks level_zero_v2 run (with params: --compare baseline-v2):
https://github.com/oneapi-src/unified-runtime/actions/runs/12696264251


github-actions bot commented Jan 9, 2025

Compute Benchmarks level_zero_v2 run (--compare baseline-v2):
https://github.com/oneapi-src/unified-runtime/actions/runs/12696264251
Job status: success. Test status: success.

Summary

No diffs to calculate performance change

(result is better)

Performance change in benchmark groups

Relative perf in group api (9): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| api_overhead_benchmark_l0 SubmitKernel out of order | 11.515 μs | 11.114000 μs | 11.515 μs | | | |
| api_overhead_benchmark_sycl SubmitKernel out of order | 21.368000 μs | 23.476 μs | 21.642 μs | | | |
| api_overhead_benchmark_sycl SubmitKernel in order | 22.319 μs | 25.185 μs | 22.240000 μs | | | |
| api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 | 1.861 μs | 2.109 μs | 1.826000 μs | | | |
| api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 | 1.862 μs | 1.667000 μs | 1.867 μs | | | |
| api_overhead_benchmark_ur SubmitKernel out of order CPU count | 94844.000000 instr | 101923.000 instr | 94854.000 instr | | | |
| api_overhead_benchmark_ur SubmitKernel out of order | 13.891 μs | 15.629 μs | 13.367000 μs | | | |
| api_overhead_benchmark_ur SubmitKernel in order CPU count | 94844.000000 instr | 107041.000 instr | 94854.000 instr | | | |
| api_overhead_benchmark_ur SubmitKernel in order | 13.244000 μs | 16.305 μs | 13.282 μs | | | |

Relative perf in group memory (4): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 | 201.071 μs | 252.552 μs | 200.450000 μs | | | |
| memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 | 84.278 μs | 133.161 μs | 83.808000 μs | | | |
| memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 | 5.998 μs | 5.545000 μs | 6.025 μs | | | |
| memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 | 2.955 GB/s | 3.172000 GB/s | 2.930 GB/s | | | |

Relative perf in group miscellaneous (1): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| miscellaneous_benchmark_sycl VectorSum | 807.632000 bw GB/s | 807.892 bw GB/s | 858.902 bw GB/s | | | |

Relative perf in group multithread (10): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 | 3623.765 μs | 6913.511 μs | 3606.366000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 | 8462.477 μs | 17276.274 μs | 8337.052000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 | 24989.087000 μs | 47969.092 μs | 25070.404 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 | 1060.893000 μs | 2028.247 μs | 1075.836 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 | 4446.440000 μs | 7352.088 μs | 4546.023 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 | 6422.665000 μs | 8675.380 μs | 6520.838 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 | 25219.702 μs | 25567.887 μs | 24863.849000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 | 1088.372 μs | 1171.666 μs | 1079.399000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events | 28923.325000 μs | 40328.398 μs | 28949.925 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events | 113063.962 μs | 112651.427000 μs | 116651.165 μs | | | |

Relative perf in group Velocity-Bench (9): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| Velocity-Bench Hashtable | 380.986 M keys/sec | 378.053 M keys/sec | 384.547201 M keys/sec | | | |
| Velocity-Bench Bitcracker | 35.196 s | 35.222 s | 35.177000 s | | | |
| Velocity-Bench CudaSift | 200.653000 ms | 202.890 ms | - | | | |
| Velocity-Bench Easywave | 235.000000 ms | 244.000 ms | 238.000 ms | | | |
| Velocity-Bench QuickSilver | 121.570000 MMS/CTT | 118.360 MMS/CTT | 121.090 MMS/CTT | | | |
| Velocity-Bench Sobel Filter | 516.810000 ms | 533.221 ms | 519.769 ms | | | |
| Velocity-Bench dl-cifar | 17.075200 s | 23.238 s | 17.226 s | | | |
| Velocity-Bench dl-mnist | 2.700 s | 2.740 s | 2.690000 s | | | |
| Velocity-Bench svm | - | 0.135900 s | - | | | |

Relative perf in group Runtime (8): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| Runtime_IndependentDAGTaskThroughput_SingleTask | 186.919 ms | 266.787 ms | 175.848000 ms | | | |
| Runtime_IndependentDAGTaskThroughput_BasicParallelFor | 186.263 ms | 281.351 ms | 182.065000 ms | | | |
| Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor | 191.907 ms | 277.904 ms | 182.281000 ms | | | |
| Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor | 187.252 ms | 278.250 ms | 177.759000 ms | | | |
| Runtime_DAGTaskThroughput_SingleTask | 1203.222000 ms | 1689.703 ms | 1226.450 ms | | | |
| Runtime_DAGTaskThroughput_BasicParallelFor | 1266.166000 ms | 1751.814 ms | 1280.960 ms | | | |
| Runtime_DAGTaskThroughput_HierarchicalParallelFor | 1272.352 ms | 1735.799 ms | 1269.362000 ms | | | |
| Runtime_DAGTaskThroughput_NDRangeParallelFor | 1236.965000 ms | 1700.528 ms | 1240.886 ms | | | |

Relative perf in group MicroBench (14): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous | 4.377 ms | 4.346000 ms | 4.363 ms | | | |
| MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous | 4.431000 ms | 4.524 ms | 4.473 ms | | | |
| MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous | 4.487000 ms | 4.507 ms | 4.538 ms | | | |
| MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous | 3.711000 ms | 4.611 ms | 3.741 ms | | | |
| MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous | 618.178 ms | 618.167 ms | 618.119000 ms | | | |
| MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous | 618.175000 ms | 618.207 ms | 618.183 ms | | | |
| MicroBench_HostDeviceBandwidth_1D_H2D_Strided | 4.398 ms | 4.288000 ms | 4.371 ms | | | |
| MicroBench_HostDeviceBandwidth_2D_H2D_Strided | 4.555 ms | 4.543000 ms | 4.558 ms | | | |
| MicroBench_HostDeviceBandwidth_3D_H2D_Strided | 4.573 ms | 4.535000 ms | 4.535 ms | | | |
| MicroBench_HostDeviceBandwidth_1D_D2H_Strided | 3.813 ms | 4.666 ms | 3.760000 ms | | | |
| MicroBench_HostDeviceBandwidth_2D_D2H_Strided | 617.462 ms | 617.486 ms | 617.419000 ms | | | |
| MicroBench_HostDeviceBandwidth_3D_D2H_Strided | 617.434 ms | 617.482 ms | 617.406000 ms | | | |
| MicroBench_LocalMem_int32_4096 | 29.905 ms | 29.867 ms | 29.840000 ms | | | |
| MicroBench_LocalMem_fp32_4096 | 29.873 ms | 29.866000 ms | 29.885 ms | | | |

Relative perf in group Pattern (10): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| Pattern_Reduction_NDRange_int32 | 16.774 ms | 16.646000 ms | 16.750 ms | | | |
| Pattern_Reduction_Hierarchical_int32 | 16.840000 ms | 16.999 ms | 16.868 ms | | | |
| Pattern_SegmentedReduction_NDRange_int16 | 2.250000 ms | 2.270 ms | 2.250 ms | | | |
| Pattern_SegmentedReduction_NDRange_int32 | 2.168 ms | 2.170 ms | 2.167000 ms | | | |
| Pattern_SegmentedReduction_NDRange_int64 | 2.343000 ms | 2.350 ms | 2.343 ms | | | |
| Pattern_SegmentedReduction_NDRange_fp32 | 2.160000 ms | 2.178 ms | 2.164 ms | | | |
| Pattern_SegmentedReduction_Hierarchical_int16 | 11.799 ms | 11.804 ms | 11.794000 ms | | | |
| Pattern_SegmentedReduction_Hierarchical_int32 | 11.601 ms | 11.594000 ms | 11.600 ms | | | |
| Pattern_SegmentedReduction_Hierarchical_int64 | 11.786 ms | 11.795 ms | 11.784000 ms | | | |
| Pattern_SegmentedReduction_Hierarchical_fp32 | 11.605 ms | 11.597000 ms | 11.601 ms | | | |

Relative perf in group ScalarProduct (6): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| ScalarProduct_NDRange_int32 | 3.984 ms | 3.868000 ms | 3.940 ms | | | |
| ScalarProduct_NDRange_int64 | 5.520 ms | 5.463000 ms | 5.521 ms | | | |
| ScalarProduct_NDRange_fp32 | 3.833 ms | 3.784000 ms | 3.935 ms | | | |
| ScalarProduct_Hierarchical_int32 | 10.599 ms | 10.530000 ms | 10.558 ms | | | |
| ScalarProduct_Hierarchical_int64 | 11.563 ms | 11.483000 ms | 11.554 ms | | | |
| ScalarProduct_Hierarchical_fp32 | 10.179 ms | 10.177 ms | 10.160000 ms | | | |

Relative perf in group USM (7): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| USM_Allocation_latency_fp32_host | 37.384 ms | 37.361000 ms | 37.611 ms | | | |
| USM_Allocation_latency_fp32_shared | 0.064000 ms | 0.069 ms | 0.069 ms | | | |
| USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch | 1.353 ms | 1.648 ms | 1.322000 ms | | | |
| USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch | 1.056 ms | 1.034 ms | 1.001000 ms | | | |
| USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch | 1.588 ms | 1.797 ms | 1.579000 ms | | | |
| USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch | 1.214 ms | 1.192 ms | 1.166000 ms | | | |
| USM_Allocation_latency_fp32_device | - | 0.066000 ms | - | | | |

Relative perf in group VectorAddition (3): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| VectorAddition_int32 | 1.492000 ms | 1.593 ms | 1.658 ms | | | |
| VectorAddition_int64 | 3.219 ms | 3.135 ms | 3.115000 ms | | | |
| VectorAddition_fp32 | 1.605 ms | 1.559 ms | 1.491000 ms | | | |

Relative perf in group Polybench (3): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| Polybench_2mm | 1.217000 ms | 1.221 ms | 1.225 ms | | | |
| Polybench_3mm | 1.811 ms | 1.733000 ms | 1.821 ms | | | |
| Polybench_Atax | 6.859 ms | 6.822000 ms | 6.876 ms | | | |

Relative perf in group Kmeans (1): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| Kmeans_fp32 | 16.062 ms | 16.048000 ms | 16.052 ms | | | |

Relative perf in group MolecularDynamics (1): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| MolecularDynamics | 0.029000 ms | 0.031 ms | 0.030 ms | | | |

Relative perf in group llama.cpp (6): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| llama.cpp Prompt Processing Batched 128 | 852.029760 token/s | 791.989 token/s | 821.910 token/s | | | |
| llama.cpp Text Generation Batched 128 | 65.245587 token/s | 62.602 token/s | 65.177 token/s | | | |
| llama.cpp Prompt Processing Batched 256 | 941.225659 token/s | 891.414 token/s | 938.009 token/s | | | |
| llama.cpp Text Generation Batched 256 | 65.141895 token/s | 62.599 token/s | 65.125 token/s | | | |
| llama.cpp Prompt Processing Batched 512 | 479.955972 token/s | 444.416 token/s | 476.255 token/s | | | |
| llama.cpp Text Generation Batched 512 | 65.214514 token/s | 62.639 token/s | 65.195 token/s | | | |

Relative perf in group alloc/max (20): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc | 2635.290 ns | 2464.010000 ns | 2639.300 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc | 715.883000 ns | 724.901 ns | 724.206 ns | | | |
| alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc | 1270.620 ns | 1231.150000 ns | 1256.870 ns | | | |
| alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc | 748.620 ns | 763.166 ns | 748.475000 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc | 888.314 ns | 878.565000 ns | 896.579 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc | 179.599 ns | 176.342 ns | 174.623000 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider | 2020.190 ns | 1980.090000 ns | 1984.420 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider | 186.551000 ns | 186.830 ns | 189.637 ns | | | |
| alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider | 1906.400 ns | 1755.850000 ns | 1823.540 ns | | | |
| alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider | 190.687000 ns | 192.109 ns | 196.367 ns | | | |
| alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> | 4639.740 ns | 4002.640000 ns | 4161.600 ns | | | |
| alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> | 265.629 ns | 253.808000 ns | 257.051 ns | | | |
| alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> | 3196.410 ns | 2995.180000 ns | 3218.080 ns | | | |
| alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> | 294.990 ns | 287.926000 ns | 290.105 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> | 288.779000 ns | 297.851 ns | 312.990 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> | 217.055 ns | 219.901 ns | 215.629000 ns | | | |
| alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> | 262.733 ns | 270.144 ns | 259.951000 ns | | | |
| alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> | 207.134000 ns | 208.751 ns | 212.024 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> | 994.775 ns | 968.236000 ns | 1060.040 ns | | | |
| alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> | 974.129 ns | 971.817000 ns | 990.434 ns | | | |

Relative perf in group multiple (12): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc | 32430.300 ns | 31242.300000 ns | 32028.400 ns | | | |
| multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc | 4181.510 ns | 4136.860 ns | 4128.190000 ns | | | |
| multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc | 138061.000 ns | 137667.000000 ns | 139795.000 ns | | | |
| multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc | 29392.400000 ns | 32264.100 ns | 31031.800 ns | | | |
| multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> | 1178880.000 ns | 1141810.000000 ns | 1152010.000 ns | | | |
| multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> | 155981.000000 ns | 159931.000 ns | 158010.000 ns | | | |
| multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider | 1193550.000 ns | 1160200.000000 ns | 1193680.000 ns | | | |
| multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider | 139330.000000 ns | 140786.000 ns | 140147.000 ns | | | |
| multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> | 42446.900 ns | 42412.100000 ns | 42744.500 ns | | | |
| multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> | 15140.500 ns | 14708.200000 ns | 15111.600 ns | | | |
| multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> | 71557.400000 ns | 73219.900 ns | 73555.400 ns | | | |
| multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> | 25469.900 ns | 28349.100 ns | 25335.300000 ns | | | |

Relative perf in group LinearRegressionCoeff (1): cannot calculate

| Benchmark | This PR | baseline | baseline-v2 | Relative perf | Change | - |
|---|---|---|---|---|---|---|
| LinearRegressionCoeff_fp32 | - | - | 687.077000 ms | | | |
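
Every group above reports "cannot calculate", so the bot gives no relative figure. The sketch below is one way to compare the "This PR" and "baseline" columns by hand. It rests on an assumption the bot does not state: that lower is better for time-like and instruction-count units, and higher is better for everything else. The helper name `relative_perf` and the `LOWER_IS_BETTER` set are hypothetical and not part of the benchmark scripts.

```python
# Hypothetical helper, not taken from the benchmark tooling.
# Assumption: these units measure cost, so a smaller value is better.
LOWER_IS_BETTER = {"μs", "ms", "s", "ns", "instr"}


def relative_perf(pr: float, baseline: float, unit: str) -> float:
    """Return a ratio above 1.0 when the PR result beats the baseline."""
    if pr <= 0 or baseline <= 0:
        raise ValueError("both values must be positive")
    if unit in LOWER_IS_BETTER:
        return baseline / pr
    # Throughput-style units (GB/s, token/s, ...): bigger is better.
    return pr / baseline


# Values copied from the summary rows for
# "api_overhead_benchmark_sycl SubmitKernel out of order" and
# "llama.cpp Prompt Processing Batched 128".
sycl_submit = relative_perf(21.368, 23.476, "μs")
llama_pp128 = relative_perf(852.029760, 791.989, "token/s")
print(f"{sycl_submit:.3f} {llama_pp128:.3f}")
```

A ratio above 1.0 means the PR result is better than the baseline under these assumptions. Rows with a missing value (`-`) cannot be compared this way.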

Details

Benchmark details - environment, command, output...
api_overhead_benchmark_l0 SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.524,11.515,2.68%,10.769,71.831,[CPU],[us]
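
The `--csv --noHeaders` output shown here can be read with a few lines of Python. This is a minimal sketch, not the parser the CI scripts use. The function name `parse_result_row` is hypothetical. It assumes that each result row carries one more field than the printed header (the unit follows `Type`), as the rows in this log do.

```python
import csv
import io


def parse_result_row(line: str) -> dict:
    """Split one result row into named fields (hypothetical helper)."""
    fields = next(csv.reader(io.StringIO(line)))
    if len(fields) < 8:
        raise ValueError(f"expected at least 8 fields, got {len(fields)}")
    return {
        "test_case": fields[0],
        "mean": float(fields[1]),
        "median": float(fields[2]),
        "stddev_pct": float(fields[3].rstrip("%")),  # e.g. "2.68%" -> 2.68
        "min": float(fields[4]),
        "max": float(fields[5]),
        "type": fields[6],   # e.g. "[CPU]"
        "unit": fields[7],   # e.g. "[us]"
    }


# Row copied from the log output above.
row = parse_result_row(
    "SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 "
    "KernelExecTime=1 MeasureCompletion=0),11.524,11.515,2.68%,10.769,71.831,[CPU],[us]"
)
print(row["median"], row["unit"])
```

For this row the parsed median matches the "This PR" value in the summary (11.515 μs), which suggests the summary reports the median rather than the mean.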

api_overhead_benchmark_sycl SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),21.392,21.368,4.04%,20.590,275.582,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),22.338,22.319,3.32%,21.559,233.823,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Device --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Device destinationPlacement=Device size=1KB count=100),200.526,201.071,1.86%,193.029,412.611,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Host --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Host destinationPlacement=Device size=1KB count=100),84.259,84.278,0.95%,82.234,130.003,[CPU],[us]

memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueMemcpy --csv --noHeaders --iterations=10000 --sourcePlacement=Device --destinationPlacement=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueMemcpy(api=sycl sourcePlacement=Device destinationPlacement=Device size=1KB),5.829,5.998,16.71%,4.658,82.907,[CPU],[us]

memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=StreamMemory --csv --noHeaders --iterations=10000 --type=Triad --size=10240 --memoryPlacement=Device --useEvents=0 --contents=Zeros --multiplier=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
StreamMemory(api=sycl type=Triad size=10KB useEvents=0 contents=Zeros memoryPlacement=Device multiplier=1),2.947,2.955,2.67%,0.396,3.301,[CPU],[GB/s]

api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=0 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Device --dst=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Device dst=Device size=1KB ioq=0),1.866,1.861,17.00%,1.662,92.867,[CPU],[us]

api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=1 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Host --dst=Host --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Host dst=Host size=1KB ioq=1),1.865,1.862,6.10%,1.679,21.684,[CPU],[us]

miscellaneous_benchmark_sycl VectorSum

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/miscellaneous_benchmark_sycl --test=VectorSum --csv --noHeaders --iterations=1000 --numberOfElementsX=512 --numberOfElementsY=256 --numberOfElementsZ=256

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
VectorSum(api=sycl numberOfElementsX=512 numberOfElementsY=256 numberOfElementsZ=256),807.231,807.632,0.29%,773.857,812.325,[GPU],bw [GB/s]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),3622.673,3623.765,0.08%,3615.845,3626.290,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),8357.884,8462.477,4.98%,7571.977,8947.449,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),25217.637,24989.087,5.03%,22320.112,32241.347,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),1066.864,1060.893,6.02%,883.722,1571.567,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),4465.930,4446.440,1.35%,4419.575,4610.729,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),6387.800,6422.665,3.75%,5767.144,6674.815,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),25444.953,25219.702,5.21%,22693.470,32161.966,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),1093.253,1088.372,5.80%,899.498,1729.247,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=1 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=1 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),28954.704,28923.325,0.28%,28835.003,29066.016,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=4 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=4 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),113123.985,113063.962,0.51%,112308.061,114131.808,[CPU],[us]

api_overhead_benchmark_ur SubmitKernel out of order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),94884.537,94844.000,0.36%,94844.000,101071.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),13.669,13.505,327.52%,12.627,14167.987,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),94884.541,94844.000,0.36%,94844.000,101068.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),14.092,13.891,405.13%,13.173,18065.775,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),94884.537,94844.000,0.36%,94844.000,101067.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),13.451,13.244,443.19%,12.596,18864.047,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),94884.537,94844.000,0.36%,94844.000,101067.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),13.451,13.244,443.19%,12.596,18864.047,[CPU],time [us]

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.352291 s
380.985501 million keys/second
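
The Velocity-Bench programs print free-form text rather than CSV, so the metric has to be pulled out with a pattern match. The sketch below assumes the two line shapes seen in this log. The regular expressions and the `extract` helper are hypothetical, not the ones used by the CI scripts.

```python
import re

# Patterns modelled on the output lines in this log (assumed shapes).
TOTAL_TIME_RE = re.compile(r"total time for whole calculation: ([0-9.]+) s")
KEYS_RE = re.compile(r"([0-9.]+) million keys/second")


def extract(pattern: re.Pattern, text: str):
    """Return the first captured number as a float, or None if absent."""
    match = pattern.search(text)
    return float(match.group(1)) if match else None


# Output copied from the Hashtable run above.
output = (
    "hashtable - total time for whole calculation: 0.352291 s\n"
    "380.985501 million keys/second\n"
)
total_seconds = extract(TOTAL_TIME_RE, output)
million_keys = extract(KEYS_RE, output)
print(total_seconds, million_keys)
```

The extracted throughput agrees with the rounded summary entry for Velocity-Bench Hashtable (380.986 M keys/sec).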

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00407045 s
bitcracker - total time for whole calculation: 35.196 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1163 1266 31.5775% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1167 1269 31.6861% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1102 1258 29.9213% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1209 1270 32.8265% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1191 1274 32.3378% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1179 1273 32.0119% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1102 1267 29.9213% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1109 1263 30.1113% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1269 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1262 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1242 1281 33.7225% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1069 1251 29.0253% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1108 1263 30.0842% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1257 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1266 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1268 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1100 1267 29.867% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1265 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1265 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1212 1261 32.908% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1245 1278 33.804% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1266 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1093 1265 29.6769% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1183 1262 32.1206% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1260 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1271 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1259 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1213 1277 32.9351% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1240 1277 33.6682% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1255 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1256 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1266 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1257 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1276 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1143 1274 31.0345% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1264 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1265 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1106 1261 30.0299% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1097 1264 29.7855% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1258 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1066 1265 28.9438% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1264 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1241 1272 33.6954% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1171 1258 31.7947% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1142 1258 31.0073% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1202 1268 32.6364% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1244 1276 33.7768% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1083 1262 29.4054% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1067 1257 28.9709% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1174 1269 31.8762% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 200.653 ms
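
In the CudaSift output above, the printed percentage appears to be the first matching-feature count divided by the first original-feature count (for example 1163 / 3683). This is an inference from the numbers, not something the log states. A small sketch that checks the relationship on one pair of lines:

```python
import re

# Matches e.g. "Number of matching features: 1163 1266 31.5775% 1 2"
MATCHING = re.compile(r"Number of matching features: (\d+) (\d+) ([\d.]+)%")


def match_ratio(features_line, matching_line):
    """Return (recomputed percentage, printed percentage) for one iteration.

    Assumes the percentage relates the first matching count to the first
    original-feature count, which is inferred from the log values.
    """
    first_total = int(features_line.split(":")[1].split()[0])
    found = MATCHING.search(matching_line)
    if found is None:
        raise ValueError("not a 'matching features' line")
    first_matched = int(found.group(1))
    printed_pct = float(found.group(3))
    return 100.0 * first_matched / first_total, printed_pct


if __name__ == "__main__":
    computed, printed = match_ratio(
        "Number of original features: 3683 3933",
        "Number of matching features: 1163 1266 31.5775% 1 2",
    )
    print(f"computed={computed:.4f} printed={printed}")
```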

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.266600e-01 5.942150e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.486760e-01 7.274210e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.410000e-01 7.443660e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.762280e-01 8.063110e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.586920e-01 7.703670e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.434910e-01 7.475990e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.438220e-01 7.471400e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.439520e-01 7.663990e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.426390e-01 7.647900e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.400500e-01 7.415110e-01 0.000000e+00

Timer                      Cumulative   Cumulative   Cumulative   Cumulative   Cumulative   Cumulative
Name                           number    microSecs    microSecs    microSecs    microSecs   Efficiency
                             of calls          min          avg          max       stddev       Rating
main                                1    1.098e+07    1.098e+07    1.098e+07    0.000e+00       100.00
cycleInit                          10    3.565e+06    3.565e+06    3.565e+06    0.000e+00       100.00
cycleTracking                      10    7.410e+06    7.410e+06    7.410e+06    0.000e+00       100.00
cycleTracking_Kernel              104    4.911e+06    4.911e+06    4.911e+06    0.000e+00       100.00
cycleTracking_MPI                 117    1.976e+05    1.976e+05    1.976e+05    0.000e+00       100.00
cycleTracking_Test_Done             0    0.000e+00    0.000e+00    0.000e+00    0.000e+00         0.00
cycleFinalize                      20    4.020e+02    4.020e+02    4.020e+02    0.000e+00       100.00
Figure Of Merit 121.57 [Num Mega Segments / Cycle Tracking Time]
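
The Figure Of Merit can be cross-checked against the per-cycle table: summing the `num_seg` column, converting to millions, and dividing by the summed `cycleTracking` column gives the printed value. The sketch below assumes the `cycleTracking` column is in seconds, which is inferred from its sum matching the 7.410e+06 microSecs timer. The column positions are taken from the header line in the output.

```python
# Per-cycle rows copied verbatim from the QuickSilver output above.
ROWS = """\
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.266600e-01 5.942150e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.486760e-01 7.274210e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.410000e-01 7.443660e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.762280e-01 8.063110e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.586920e-01 7.703670e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.434910e-01 7.475990e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.438220e-01 7.471400e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.439520e-01 7.663990e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.426390e-01 7.647900e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.400500e-01 7.415110e-01 0.000000e+00
"""

NUM_SEG_COL = 12         # "num_seg" in the header line
CYCLE_TRACKING_COL = 15  # "cycleTracking" in the header line


def figure_of_merit(rows_text):
    """Return (total segments, tracking seconds, mega segments per second)."""
    total_segments = 0
    tracking_seconds = 0.0
    for line in rows_text.strip().splitlines():
        fields = line.split()
        total_segments += int(fields[NUM_SEG_COL])
        # Assumption: this column is reported in seconds.
        tracking_seconds += float(fields[CYCLE_TRACKING_COL])
    return total_segments, tracking_seconds, total_segments / 1e6 / tracking_seconds


if __name__ == "__main__":
    segments, seconds, fom = figure_of_merit(ROWS)
    print(f"{segments} segments in {seconds:.3f} s -> {fom:.2f}")
```

With these rows the result agrees with the reported 121.57 to two decimal places.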

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.48374 s
sobelfilter - total time for whole calculation: 0.51681 s

Velocity-Bench dl-cifar

Environment Variables:

Command:

/home/pmdk/bench_workdir/dl-cifar/dl-cifar_sycl

Output:

	Welcome to DL-CIFAR workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: DL NW size type: WORKLOAD_DEFAULT_SIZE
WL PARAMS: ==================================================
WL PARAMS:

dataFileReadTimer->getTotalOpTime(): 8.1e-05 s
dl-cifar - total time for whole calculation: 17.0752 s

Velocity-Bench dl-mnist

Environment Variables:

NEOReadDebugKeys=1
DisableScratchPages=0

Command:

/home/pmdk/bench_workdir/dl-mnist/dl-mnist-sycl -conv_algo ONEDNN_AUTO

Output:

	Welcome to DL-MNIST workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: Tensor management policy: per_layer
WL PARAMS: Convolution algorithm: ONEDNN_AUTO
WL PARAMS: Dataset reader format: NCHW
WL PARAMS: Dry run: YES
WL PARAMS: OneDNN Conv PD memory format: ONEDNN_CONVPD_ANY
WL PARAMS: No of iterations for inference: 400
WL PARAMS: ==================================================
WL PARAMS:

dl-mnist - total time for whole calculation: 2.7 s

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.201961', '0.186919', '0.185662', '0.185662 0.186919 0.233303', '0.027150', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']
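
Each sycl-bench result is printed as a Python list literal. Judging from the values, fields 11 to 15 look like mean, median, minimum, the raw samples, and the sample standard deviation. That mapping is an inference from this output, not taken from sycl-bench documentation. A sketch that parses one row and recomputes the statistics from the samples:

```python
import ast
import statistics

# Row copied verbatim from the output above.
ROW = (
    "['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', "
    "'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', "
    "'256', '32768', '0.201961', '0.186919', '0.185662', "
    "'0.185662 0.186919 0.233303', '0.027150', "
    "'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', "
    "'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']"
)


def summarize(row_text):
    """Parse one result row and recompute statistics from its samples.

    The field indices are assumptions inferred from the printed values.
    """
    fields = ast.literal_eval(row_text)
    samples = [float(value) for value in fields[14].split()]
    return {
        "name": fields[0],
        "status": fields[1],
        "printed": {
            "mean": float(fields[11]),
            "median": float(fields[12]),
            "min": float(fields[13]),
            "stddev": float(fields[15]),
        },
        "recomputed": {
            "mean": statistics.mean(samples),
            "median": statistics.median(samples),
            "min": min(samples),
            "stddev": statistics.stdev(samples),  # sample (n - 1) deviation
        },
    }


if __name__ == "__main__":
    result = summarize(ROW)
    for key, printed in result["printed"].items():
        print(key, printed, round(result["recomputed"][key], 6))
```

With only three runs per benchmark, a single slow sample (0.233303 here) moves the mean noticeably, so the median or minimum is the steadier number to compare across runs.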

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.187332', '0.186263', '0.186197', '0.186197 0.186263 0.189536', '0.001909', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.193584', '0.191907', '0.190340', '0.190340 0.191907 0.198504', '0.004333', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.187199', '0.187252', '0.187010', '0.187010 0.187252 0.187336', '0.000170', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.204353', '1.203222', '1.197084', '1.197084 1.203222 1.212754', '0.007896', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.266310', '1.266166', '1.265316', '1.265316 1.266166 1.267449', '0.001074', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.271258', '1.272352', '1.267128', '1.267128 1.272352 1.274295', '0.003707', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.237697', '1.236965', '1.236695', '1.236695 1.236965 1.239432', '0.001508', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004862', '0.004377', '0.004284', '0.004284 0.004377 0.005924', '0.000921', '29.178311', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004440', '0.004431', '0.004414', '0.004414 0.004431 0.004474', '0.000031', '28.320178', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004452', '0.004487', '0.004363', '0.004363 0.004487 0.004505', '0.000077', '28.647667', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.003753', '0.003711', '0.003691', '0.003691 0.003711 0.003858', '0.000091', '33.864500', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618187', '0.618178', '0.618129', '0.618129 0.618178 0.618253', '0.000063', '0.202223', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618167', '0.618175', '0.618125', '0.618125 0.618175 0.618200', '0.000038', '0.202224', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004404', '0.004398', '0.004336', '0.004336 0.004398 0.004479', '0.000072', '28.831013', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004533', '0.004555', '0.004427', '0.004427 0.004555 0.004617', '0.000097', '28.234703', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004546', '0.004573', '0.004476', '0.004476 0.004573 0.004588', '0.000061', '27.924655', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.003825', '0.003813', '0.003811', '0.003811 0.003813 0.003852', '0.000023', '32.803569', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617475', '0.617462', '0.617419', '0.617419 0.617462 0.617543', '0.000063', '0.202456', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617456', '0.617434', '0.617416', '0.617416 0.617434 0.617518', '0.000054', '0.202457', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']
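
In the HostDeviceBandwidth rows above, the throughput field (index 16) appears to equal the last field (0.125000) divided by the minimum run time (index 13). The output does not state the units, and the relationship is inferred from the numbers only. A sketch that checks it on two of the rows:

```python
import ast

# Rows copied verbatim from the output above.
ROW_1D_H2D = (
    "['MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous', 'PASS', "
    "'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', "
    "'256', '512', '0.004862', '0.004377', '0.004284', "
    "'0.004284 0.004377 0.005924', '0.000921', '29.178311', "
    "'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', "
    "'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']"
)
ROW_2D_D2H = (
    "['MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous', 'PASS', "
    "'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', "
    "'256', '512', '0.618187', '0.618178', '0.618129', "
    "'0.618129 0.618178 0.618253', '0.000063', '0.202223', "
    "'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', "
    "'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']"
)


def throughput(row_text):
    """Return (recomputed throughput, printed throughput) for one row.

    Assumes throughput = last field / minimum run time (inferred).
    """
    fields = ast.literal_eval(row_text)
    min_time = float(fields[13])
    transferred = float(fields[-1])
    printed = float(fields[16])
    return transferred / min_time, printed


if __name__ == "__main__":
    for row in (ROW_1D_H2D, ROW_2D_D2H):
        computed, printed = throughput(row)
        print(f"computed={computed:.3f} printed={printed:.3f}")
```

Read this way, the 2D and 3D device-to-host rows in this run report roughly 0.20 against roughly 33 for the 1D device-to-host rows, because their run times are about 0.618 instead of about 0.0037.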

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_int32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029902', '0.029905', '0.029864', '0.029864 0.029905 0.029937', '0.000037', '10447.524046', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_fp32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029856', '0.029873', '0.029781', '0.029781 0.029873 0.029916', '0.000069', '10476.636597', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016789', '0.016774', '0.016651', '0.016651 0.016774 0.016942', '0.000146', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016860', '0.016840', '0.016669', '0.016669 0.016840 0.017071', '0.000202', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003929', '0.003984', '0.003798', '0.003798 0.003984 0.004005', '0.000114', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.005538', '0.005520', '0.005494', '0.005494 0.005520 0.005600', '0.000055', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003866', '0.003833', '0.003785', '0.003785 0.003833 0.003979', '0.000101', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010589', '0.010599', '0.010533', '0.010533 0.010599 0.010636', '0.000052', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011537', '0.011563', '0.011480', '0.011480 0.011563 0.011569', '0.000049', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010179', '0.010179', '0.010167', '0.010167 0.010179 0.010191', '0.000012', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002254', '0.002250', '0.002249', '0.002249 0.002250 0.002265', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002175', '0.002168', '0.002167', '0.002167 0.002168 0.002189', '0.000013', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002344', '0.002343', '0.002341', '0.002341 0.002343 0.002349', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002164', '0.002160', '0.002159', '0.002159 0.002160 0.002174', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011805', '0.011799', '0.011789', '0.011789 0.011799 0.011827', '0.000020', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011596', '0.011601', '0.011576', '0.011576 0.011601 0.011611', '0.000018', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011794', '0.011786', '0.011766', '0.011766 0.011786 0.011830', '0.000033', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011603', '0.011605', '0.011574', '0.011574 0.011605 0.011629', '0.000027', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_host', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.037383', '0.037384', '0.037298', '0.037298 0.037384 0.037468', '0.000085', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_shared', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000061', '0.000064', '0.000051', '0.000051 0.000064 0.000069', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001840', '0.001353', '0.001351', '0.001351 0.001353 0.002818', '0.000846', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001056', '0.001056', '0.001045', '0.001045 0.001056 0.001067', '0.000011', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001604', '0.001588', '0.001586', '0.001586 0.001588 0.001639', '0.000030', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001212', '0.001214', '0.001203', '0.001203 0.001214 0.001217', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001567', '0.001492', '0.001489', '0.001489 0.001492 0.001720', '0.000132', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003187', '0.003219', '0.003116', '0.003116 0.003219 0.003225', '0.000062', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001626', '0.001605', '0.001582', '0.001582 0.001605 0.001691', '0.000058', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_2mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/2mm.csv --size=512

Output:

['Polybench_2mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001215', '0.001217', '0.001207', '0.001207 0.001217 0.001219', '0.000006', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_3mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/3mm.csv --size=512

Output:

['Polybench_3mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001815', '0.001811', '0.001804', '0.001804 0.001811 0.001830', '0.000013', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_Atax

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Atax.csv --size=8192

Output:

['Polybench_Atax', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.006844', '0.006859', '0.006799', '0.006799 0.006859 0.006875', '0.000040', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Kmeans_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Kmeans.csv --size=700000000

Output:

['Kmeans_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '700000000', '0.016070', '0.016062', '0.016059', '0.016059 0.016062 0.016089', '0.000017', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MolecularDynamics

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/MolecularDynamics.csv --size=8196

Output:

['MolecularDynamics', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8196', '0.000037', '0.000029', '0.000026', '0.000026 0.000029 0.000054', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

llama.cpp Prompt Processing Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:53:45Z","603028528","41663342","852.029760","53.955811"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:53:52Z","1965262561","2392518","65.131323","0.079169"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:54:02Z","547296049","1791845","935.516314","3.057912"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:54:05Z","1965987226","954753","65.107251","0.031589"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:54:15Z","1058770512","2259263","483.581528","1.030139"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:54:22Z","1963542073","770085","65.188323","0.025483"
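
A minimal sketch of how the `llama-bench` CSV above might be read, assuming (as the `-p 512 -n 128` flags suggest) that rows with `n_prompt > 0` are prompt-processing runs, rows with `n_gen > 0` are text-generation runs, and `avg_ts` is tokens per second. The `summarize` helper and the `SAMPLE` string are hypothetical and are not part of the benchmark scripts. `SAMPLE` holds the six rows above, trimmed to the columns used here.

```python
import csv
import io

# The six rows from the output above, trimmed to the columns used here.
SAMPLE = '''n_batch,n_prompt,n_gen,avg_ns,avg_ts,stddev_ts
"128","512","0","603028528","852.029760","53.955811"
"128","0","128","1965262561","65.131323","0.079169"
"256","512","0","547296049","935.516314","3.057912"
"256","0","128","1965987226","65.107251","0.031589"
"512","512","0","1058770512","483.581528","1.030139"
"512","0","128","1963542073","65.188323","0.025483"
'''


def summarize(csv_text):
    """Map (kind, n_batch) -> (avg_ts, stddev_ts).

    kind is "prompt" when n_prompt > 0, otherwise "generation"
    (an assumption about how llama-bench labels its two test types).
    """
    result = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        kind = "prompt" if int(row["n_prompt"]) > 0 else "generation"
        key = (kind, int(row["n_batch"]))
        result[key] = (float(row["avg_ts"]), float(row["stddev_ts"]))
    return result


if __name__ == "__main__":
    for (kind, n_batch), (avg, std) in sorted(summarize(SAMPLE).items()):
        print(f"{kind:10s} n_batch={n_batch:3d}  {avg:10.3f} +/- {std:.3f} t/s")
```

Because every entry in this log runs the same command with `-b 128,256,512`, each output block contains all three batch sizes; a per-entry comparison would pick only the row whose `n_batch` and kind match the entry title.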

llama.cpp Text Generation Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:51:26Z","627208594","48407083","820.245204","63.689074"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:51:32Z","1961819905","1856956","65.245587","0.061703"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:51:42Z","551112483","9660354","929.258999","16.334663"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:51:46Z","1963677287","1724201","65.183867","0.057168"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:51:56Z","1070992188","10892746","478.100829","4.848834"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:52:02Z","1956969931","705309","65.407246","0.023578"

llama.cpp Prompt Processing Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:54:32Z","604124050","43913930","850.817560","56.935292"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:54:38Z","1958015942","3299540","65.372445","0.109901"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:54:48Z","544250796","14014501","941.225659","23.441589"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:54:52Z","1960682550","676921","65.283394","0.022495"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:55:01Z","1054608880","4525103","485.495181","2.079768"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:55:08Z","1958116352","695830","65.368951","0.023139"

llama.cpp Text Generation Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:58:28Z","929258624","435170500","645.157069","253.691045"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:58:37Z","1958802741","1190957","65.346058","0.039704"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:58:47Z","542204813","2104250","944.303933","3.646439"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:58:50Z","1964941584","816531","65.141895","0.026992"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:59:00Z","1060489663","5238889","482.805217","2.372364"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:59:07Z","1961836071","411019","65.245005","0.013669"

llama.cpp Prompt Processing Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:55:19Z","600565242","28178463","853.967016","38.295748"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:55:26Z","1973334818","3729424","64.865001","0.122298"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:55:36Z","543158777","2681917","942.652400","4.625722"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:55:39Z","1973595005","395787","64.856267","0.012842"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:55:49Z","1066889464","13000895","479.955972","5.766038"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:55:56Z","1971821677","2539621","64.914678","0.083577"

llama.cpp Text Generation Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:59:17Z","592463232","20062600","864.948020","28.031353"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:59:23Z","1958915782","2054303","65.342325","0.068425"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:59:33Z","569874544","22435147","899.567401","35.698973"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:59:37Z","1961654300","645573","65.251054","0.021422"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-09T19:59:46Z","1059342445","2741901","483.321276","1.250477"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-09T19:59:53Z","1962753400","612825","65.214514","0.020308"

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,
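Note on reading these entries: each umf-benchmark entry prints the full CSV output of the run, not only the row the entry is named after. A minimal sketch for picking out the relevant row, assuming the entry heading has the form `<benchmark path> <allocator>` and the CSV `name` column has the form `<allocator>/<benchmark path>` (this mapping is inferred from the output above, not from the benchmark scripts; `find_row` is a hypothetical helper, not part of the project):

```python
import csv
import io


def find_row(heading, csv_text):
    """Return the CSV row matching an entry heading, or None if absent.

    Assumption (inferred from the log, not confirmed by the scripts):
    heading = '<benchmark path> <allocator>'
    CSV name = '<allocator>/<benchmark path>'
    """
    # The allocator is the last space-separated token; the benchmark path
    # itself may contain spaces (e.g. 'min size:8/max size:65536').
    bench, allocator = heading.rsplit(" ", 1)
    wanted = f"{allocator}/{bench}"
    for row in csv.DictReader(io.StringIO(csv_text)):
        if row["name"] == wanted:
            return row
    return None


# Two rows copied from the output above, with the original header line.
SAMPLE = (
    "name,iterations,real_time,cpu_time,time_unit,bytes_per_second,"
    "items_per_second,label,error_occurred,error_message\n"
    '"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",'
    "800000,2635.29,1942.65,ns,,,,,\n"
    '"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",'
    "200000,715.883,715.879,ns,,,,,\n"
)
HEADING = "alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc"

if __name__ == "__main__":
    row = find_row(HEADING, SAMPLE)
    print(row["real_time"], row["time_unit"])
```

The same helper should work for headings whose path contains spaces, since only the final token is treated as the allocator name.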

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1240.5,1189.71,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.39,748.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,888.314,822.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,171.998,171.999,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2162.09,2160.15,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,191.681,191.676,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2080.32,2080.28,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,190.687,190.682,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4651.68,4642.7,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,265.629,265.577,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3196.41,3189.99,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,294.99,294.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.733,262.165,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,196.147,196.14,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,936.8,930.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,974.129,974.119,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135445,87624.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31946.6,31943.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14501e+06,1.14458e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,156773,156771,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.15579e+06,1.15534e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140514,140510,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40339.4,39690.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14663.2,14662.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71463.9,70756.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,24938.5,24937.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1240.5,1189.71,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.39,748.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,888.314,822.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,171.998,171.999,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2162.09,2160.15,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,191.681,191.676,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2080.32,2080.28,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,190.687,190.682,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4651.68,4642.7,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,265.629,265.577,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3196.41,3189.99,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,294.99,294.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.733,262.165,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,196.147,196.14,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,936.8,930.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,974.129,974.119,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135445,87624.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31946.6,31943.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14501e+06,1.14458e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,156773,156771,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.15579e+06,1.15534e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140514,140510,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40339.4,39690.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14663.2,14662.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71463.9,70756.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,24938.5,24937.9,ns,,,,,

alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1240.5,1189.71,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.39,748.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,888.314,822.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,171.998,171.999,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2162.09,2160.15,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,191.681,191.676,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2080.32,2080.28,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,190.687,190.682,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4651.68,4642.7,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,265.629,265.577,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3196.41,3189.99,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,294.99,294.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.733,262.165,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,196.147,196.14,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,936.8,930.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,974.129,974.119,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135445,87624.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31946.6,31943.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14501e+06,1.14458e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,156773,156771,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.15579e+06,1.15534e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140514,140510,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40339.4,39690.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14663.2,14662.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71463.9,70756.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,24938.5,24937.9,ns,,,,,

alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1240.5,1189.71,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.39,748.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,888.314,822.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,171.998,171.999,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2162.09,2160.15,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,191.681,191.676,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2080.32,2080.28,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,190.687,190.682,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4651.68,4642.7,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,265.629,265.577,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3196.41,3189.99,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,294.99,294.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.733,262.165,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,196.147,196.14,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,936.8,930.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,974.129,974.119,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135445,87624.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31946.6,31943.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14501e+06,1.14458e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,156773,156771,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.15579e+06,1.15534e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140514,140510,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40339.4,39690.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14663.2,14662.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71463.9,70756.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,24938.5,24937.9,ns,,,,,

alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1240.5,1189.71,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.39,748.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,888.314,822.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,171.998,171.999,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2162.09,2160.15,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,191.681,191.676,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2080.32,2080.28,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,190.687,190.682,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4651.68,4642.7,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,265.629,265.577,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3196.41,3189.99,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,294.99,294.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.733,262.165,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,196.147,196.14,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,936.8,930.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,974.129,974.119,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135445,87624.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31946.6,31943.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14501e+06,1.14458e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,156773,156771,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.15579e+06,1.15534e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140514,140510,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40339.4,39690.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14663.2,14662.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71463.9,70756.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,24938.5,24937.9,ns,,,,,

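The CSV output above can be compared across thread counts with a short script. This is a minimal sketch, not part of the benchmark tooling: the helper names `parse_rows` and `thread_scaling` are hypothetical, the sample rows are copied from the output above, and it assumes each benchmark name has the form `allocator/benchmark/key:value/...` as it does in these rows.

```python
import csv
import io

# Header plus four rows copied from the CSV output above.
SAMPLE = '''name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
'''


def parse_rows(text):
    """Parse benchmark CSV text into dicts with the name split into parts."""
    rows = []
    for rec in csv.DictReader(io.StringIO(text)):
        parts = rec["name"].split("/")
        allocator, bench = parts[0], parts[1]
        # Remaining parts look like "size:4096" or "min size:8".
        params = dict(p.split(":", 1) for p in parts[2:])
        rows.append({
            "allocator": allocator,
            "bench": bench,
            "params": params,
            "real_time": float(rec["real_time"]),
            "cpu_time": float(rec["cpu_time"]),
            "time_unit": rec["time_unit"],
        })
    return rows


def thread_scaling(rows):
    """Return real_time(threads:4) / real_time(threads:1) per configuration."""
    by_key = {}
    for r in rows:
        params = dict(r["params"])
        threads = int(params.pop("threads"))
        key = (r["allocator"], r["bench"], tuple(sorted(params.items())))
        by_key.setdefault(key, {})[threads] = r["real_time"]
    return {
        key: times[4] / times[1]
        for key, times in by_key.items()
        if 1 in times and 4 in times
    }


if __name__ == "__main__":
    for (allocator, bench, _), ratio in thread_scaling(parse_rows(SAMPLE)).items():
        print(f"{allocator} {bench}: 4-thread / 1-thread real_time = {ratio:.2f}")
```

A ratio near 1 means the per-operation time barely changes when going from one thread to four. Larger ratios suggest contention, although a single run like this one is too noisy to draw firm conclusions from.
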
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1240.5,1189.71,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.39,748.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,888.314,822.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,171.998,171.999,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2162.09,2160.15,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,191.681,191.676,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2080.32,2080.28,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,190.687,190.682,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4651.68,4642.7,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,265.629,265.577,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3196.41,3189.99,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,294.99,294.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.733,262.165,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,196.147,196.14,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,936.8,930.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,974.129,974.119,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135445,87624.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31946.6,31943.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14501e+06,1.14458e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,156773,156771,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.15579e+06,1.15534e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140514,140510,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40339.4,39690.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14663.2,14662.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71463.9,70756.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,24938.5,24937.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1240.5,1189.71,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.39,748.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,888.314,822.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,171.998,171.999,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2162.09,2160.15,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,191.681,191.676,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2080.32,2080.28,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,190.687,190.682,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4651.68,4642.7,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,265.629,265.577,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3196.41,3189.99,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,294.99,294.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.733,262.165,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,196.147,196.14,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,936.8,930.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,974.129,974.119,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135445,87624.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31946.6,31943.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14501e+06,1.14458e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,156773,156771,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.15579e+06,1.15534e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140514,140510,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40339.4,39690.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14663.2,14662.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71463.9,70756.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,24938.5,24937.9,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.14,1976.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,709.863,709.858,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1240.5,1189.71,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.39,748.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,888.314,822.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,171.998,171.999,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2162.09,2160.15,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,191.681,191.676,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2080.32,2080.28,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,190.687,190.682,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4651.68,4642.7,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,265.629,265.577,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3196.41,3189.99,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,294.99,294.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,327.616,317.923,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,212.825,212.817,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.733,262.165,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,196.147,196.14,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,936.8,930.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,974.129,974.119,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135445,87624.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31946.6,31943.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14501e+06,1.14458e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,156773,156771,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.15579e+06,1.15534e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140514,140510,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40339.4,39690.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14663.2,14662.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71463.9,70756.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,24938.5,24937.9,ns,,,,,

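Each section in this log repeats the full CSV dump of the `umf-benchmark` run, so the row that belongs to a given section title has to be picked out by name. The following is a minimal sketch of that lookup, not part of the benchmark tooling. It assumes that the last space-separated token of the title (for example `glibc`) is the allocator name and that it matches the row-name prefix before any `<...>` suffix. The helper `find_row` is hypothetical; the sample rows are copied from the output above.

```python
import csv
import io


def find_row(csv_text, title):
    """Return the CSV row (as a dict) that matches a section title, or None.

    Assumption: a title has the form '<benchmark path> <allocator>', and the
    matching row is named '<allocator>[<provider>]/<benchmark path>'.
    """
    # Split on the LAST space, because benchmark paths such as
    # 'min size:8/max size:65536' contain spaces themselves.
    bench, _, pool = title.rpartition(" ")
    for row in csv.DictReader(io.StringIO(csv_text)):
        prefix, _, rest = row["name"].partition("/")
        # 'proxy_pool<os_provider>' is treated as allocator 'proxy_pool'.
        if rest == bench and prefix.split("<")[0] == pool:
            return row
    return None


# Header and two rows copied verbatim from the log above.
sample = '''name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32430.3,30850.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4191.32,4191.18,ns,,,,,
'''

row = find_row(
    sample,
    "multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc",
)
print(row["real_time"], row["time_unit"])  # prints: 32430.3 ns
```

The values stay as strings here. Converting them with `float()` also handles the scientific notation that appears in some rows (such as `1.14501e+06`).
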
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2635.29,1942.65,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.883,715.879,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1270.62,1181.05,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,748.62,748.617,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,915.926,845.415,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.599,179.594,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1827.24,1826.22,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.551,186.546,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.59,1737.55,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,189.164,189.158,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4639.74,4631.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,250.843,250.835,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3248.78,3242.43,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,284.976,284.966,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,288.779,278.468,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.924,217.919,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,255.016,253.481,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.134,207.13,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1000.92,995.087,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,985.469,985.457,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32572.3,30968,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4141.01,4140.91,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138061,88252.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29392.4,29392.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20029e+06,1.19964e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,154645,154640,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.24345e+06,1.24292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138146,138142,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42778.8,41633.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15316.1,15315.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72051.8,71946.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26001.4,26000.9,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2570.07,1899.7,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,721.344,721.346,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1300.45,1212.06,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,764.097,764.096,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.078,842.294,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,186.873,186.869,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2020.19,2018.75,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.473,185.467,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1906.4,1905.93,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,202.738,202.734,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3304.3,3298.15,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,268.34,268.3,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2868.62,2862.97,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,300.159,300.152,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,283.777,281.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,217.055,217.053,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,263.402,262.276,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.679,210.671,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,994.775,961.995,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,953.456,953.443,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31252.4,29230,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4181.51,4181.49,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,142283,89799.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27763.7,27763.4,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17888e+06,1.17763e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155981,155979,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,71557.4,71539.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25469.9,25469.5,ns,,,,,
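The CSV above is easier to read when the `threads:4` and `threads:1` rows of the same benchmark are compared directly. A minimal sketch in Python, assuming the standard Google Benchmark CSV columns shown in the header row; the helper name `thread_slowdown` is made up for this illustration and is not part of the benchmark scripts, and only four rows are copied from the output:

```python
import csv
import io

# A few rows copied verbatim from the umf-benchmark CSV output above.
CSV_TEXT = '''name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19355e+06,1.19292e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139330,139328,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42446.9,41733.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15140.5,15140.1,ns,,,,,
'''


def thread_slowdown(csv_text):
    """Map each benchmark name (without its threads suffix) to
    real_time(threads:4) / real_time(threads:1)."""
    times = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Names end in "/threads:N"; split that suffix off.
        base, _, threads = row["name"].rpartition("/threads:")
        times.setdefault(base, {})[int(threads)] = float(row["real_time"])
    # Keep only benchmarks that have both a 1-thread and a 4-thread row.
    return {
        base: t[4] / t[1]
        for base, t in times.items()
        if 1 in t and 4 in t
    }


if __name__ == "__main__":
    for base, ratio in thread_slowdown(CSV_TEXT).items():
        print(f"{base}: {ratio:.2f}x")
```

For the rows copied here, the per-iteration real time at four threads is roughly 8.6 times the single-thread time for `os_provider` and roughly 2.8 times for `scalable_pool<os_provider>`.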

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool

@igchor igchor force-pushed the sycl_e2e_fixes branch 2 times, most recently from 9ec81a9 to 2a15528 Compare January 17, 2025 17:56
@igchor igchor marked this pull request as ready for review January 17, 2025 17:56
@igchor igchor requested review from a team as code owners January 17, 2025 17:56
igchor (Member, Author) commented Jan 17, 2025

Hm, it looks like the change in "[CTS][L0 v2] adjust urEnqueueMemBufferMap" is specific to the L0 adapter (SYCL doesn't expect this behavior from other adapters).

I think that, for now, I will make the test L0-specific, but the proper solution would be to fix the SYCL runtime so that it doesn't rely on this behavior (it's not in the spec).

igchor added 10 commits January 17, 2025 18:20
to match SYCL expectations. When pHost is set in buffer
properties, urEnqueueMemBufferMap should map memory to
that pointer (instead of creating a new allocation).
event should always be signaled (if provided by the user).
as they can be allocated by the user using L0 API directly
(and then UMF will not know about those pointers and umfFree
will do nothing).
To support both legacy and v2 adapter in SYCL tests
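The map behavior described in the first commit message above can be pictured with a toy model. This is only a sketch in Python under stated assumptions: `ToyBuffer` and `toy_map` are hypothetical names, a `bytearray` stands in for device memory, and none of this is the adapter's actual code or the UR API. It shows the one point the commit message makes: when a host pointer was supplied at buffer creation, mapping returns a view into that user memory instead of a fresh allocation.

```python
class ToyBuffer:
    """Hypothetical stand-in for a UR buffer (not the adapter's real type)."""

    def __init__(self, size, host_ptr=None):
        self.device = bytearray(size)  # pretend device-side storage
        self.host_ptr = host_ptr       # user memory given as pHost, or None
        if host_ptr is not None and len(host_ptr) < size:
            raise ValueError("host_ptr is smaller than the buffer")


def toy_map(buf, offset, size):
    """Return a writable host view of buf.device[offset:offset + size]."""
    if buf.host_ptr is not None:
        # pHost was set: map into the user's memory, no new allocation.
        view = memoryview(buf.host_ptr)[offset:offset + size]
    else:
        # No pHost: fall back to a fresh host allocation.
        view = memoryview(bytearray(size))
    # Copy the current device contents into the mapped region.
    view[:] = buf.device[offset:offset + size]
    return view


if __name__ == "__main__":
    user_mem = bytearray(8)
    buf = ToyBuffer(8, host_ptr=user_mem)
    mapped = toy_map(buf, 2, 4)
    mapped[1] = 9
    # The write is visible through the user's own memory.
    print(user_mem[3])
```

In this model a write through the mapped view is visible in the user's original memory, which is the property the SYCL tests are said to expect from the L0 adapter.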
Labels
common Changes or additions to common utilities conformance Conformance test suite issues. level-zero L0 adapter specific issues