
Graph API benchmarks added #2560

Merged
pbalcer merged 1 commit into oneapi-src:main from mateuszpn:add-graph-bench on Jan 20, 2025

@mateuszpn
Contributor

No description provided.

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12768839561

            "--iterations=1000",
            "--numKernels=100",
        ]

No newline at end of file
Contributor

missing newline


    def bin_args(self) -> list[str]:
        return [
            "--iterations=1000",
Contributor

is that enough iterations for the benchmark to be stable?
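
One way to approach the stability question is to look at the StdDev column that the benchmark binaries already print in their CSV output (see the `TestCase,Mean,Median,StdDev,Min,Max,Type` rows under Details). The sketch below is only an illustration: `parse_row`, `looks_stable`, and the 5% cutoff are assumptions made for this example, not part of the benchmark scripts, and the two sample rows come from other benchmarks in this report rather than from the new graph benchmark.

```python
# Sample rows copied from the Details section of this report.
LOW_NOISE_ROW = (
    "SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 "
    "KernelExecTime=1 MeasureCompletion=0),11.234,11.218,4.77%,10.421,157.468,[CPU],[us]"
)
HIGH_NOISE_ROW = (
    "MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 "
    "MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),"
    "1255.315,1166.967,55.91%,866.762,16806.527,[CPU],[us]"
)


def parse_row(line: str) -> dict:
    """Split one CSV result row; the last seven fields are fixed, the rest is the label."""
    label, mean, median, stddev, lo, hi, device, unit = line.strip().rsplit(",", 7)
    return {
        "label": label,
        "mean": float(mean),
        "median": float(median),
        "stddev_pct": float(stddev.rstrip("%")),
        "min": float(lo),
        "max": float(hi),
        "device": device,
        "unit": unit,
    }


def looks_stable(row: dict, max_stddev_pct: float = 5.0) -> bool:
    """Hypothetical rule of thumb: treat a run as stable if StdDev stays under the cutoff."""
    return row["stddev_pct"] <= max_stddev_pct


for raw in (LOW_NOISE_ROW, HIGH_NOISE_ROW):
    row = parse_row(raw)
    verdict = "stable" if looks_stable(row) else "noisy"
    print(f"{row['label'][:40]}... stddev={row['stddev_pct']}% -> {verdict}")
```

If the graph benchmark reports a large StdDev at 1000 iterations, raising `--iterations` and rerunning would be the natural next check.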

    def bin_args(self) -> list[str]:
        return [
            "--iterations=1000",
            "--numKernels=100",
Contributor

should we add more scenarios with different numbers of kernels? e.g., with 1 kernel, to see the cost of the whole machinery end-to-end.
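
A minimal sketch of what such scenarios could look like, assuming the kernel count becomes a parameter of the benchmark class. `GraphScenario` and `SCENARIOS` are hypothetical names made up for this example, and the value 10 is an arbitrary middle point; only the `--iterations` and `--numKernels` flags and the values 1 and 100 come from this PR and the comment above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GraphScenario:
    """Hypothetical stand-in for the benchmark class touched by this PR."""

    num_kernels: int
    iterations: int = 1000

    def bin_args(self) -> list[str]:
        # Same flags as in the diff, with the kernel count parameterized.
        return [
            f"--iterations={self.iterations}",
            f"--numKernels={self.num_kernels}",
        ]


# One scenario per kernel count; 1 kernel isolates the fixed end-to-end cost.
SCENARIOS = [GraphScenario(n) for n in (1, 10, 100)]

for scenario in SCENARIOS:
    print(" ".join(scenario.bin_args()))
```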

@github-actions
Contributor

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/12768839561
Job status: success. Test status: success.

Summary

Total 125 benchmarks in mean.
Geomean 100.642%.
Improved 24 Regressed 8 (threshold 2.00%)
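
The summary figures can be related to the per-benchmark rows with a short sketch. It assumes that relative performance is baseline divided by this PR for lower-is-better metrics (times) and this PR divided by baseline for throughput metrics, which matches the rows checked here, and that a change beyond the 2.00% threshold counts as improved or regressed. The function names are made up for illustration, and the report's own script may compute these values differently.

```python
from statistics import geometric_mean

THRESHOLD_PCT = 2.0  # threshold quoted in the summary


def relative_perf(this_pr: float, baseline: float, lower_is_better: bool = True) -> float:
    """Return relative performance in percent; above 100 means this PR is better."""
    ratio = baseline / this_pr if lower_is_better else this_pr / baseline
    return 100.0 * ratio


def classify(rel_pct: float, threshold_pct: float = THRESHOLD_PCT) -> str:
    """Label a result by comparing its change from 100% against the threshold."""
    change = rel_pct - 100.0
    if change > threshold_pct:
        return "improved"
    if change < -threshold_pct:
        return "regressed"
    return "unchanged"


# (name, this PR, baseline, lower_is_better), values copied from the tables in this report.
ROWS = [
    ("api_overhead_benchmark_sycl SubmitKernel out of order", 23.390, 23.552, True),
    ("Runtime_IndependentDAGTaskThroughput_BasicParallelFor", 266.359, 287.619, True),
    ("Velocity-Bench Hashtable", 379.627909, 375.506, False),
    ("VectorAddition_int32", 1.554, 1.446, True),
]

rels = [relative_perf(pr, base, lower) for _, pr, base, lower in ROWS]
for (name, *_), rel in zip(ROWS, rels):
    print(f"{name}: {rel:.2f}% ({classify(rel)})")
print(f"geomean over these four rows: {geometric_mean(rels):.3f}%")
```

The geomean printed here covers only the four sample rows, so it is not expected to match the 100.642% reported for all 125 benchmarks.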

(The better result in each row was highlighted in the original report.)

Performance change in benchmark groups

Relative perf in group api (9): 99.869%
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_sycl SubmitKernel out of order 23.390000 μs 23.552 μs 100.69% 0.69% .
api_overhead_benchmark_l0 SubmitKernel out of order 11.218000 μs 11.255 μs 100.33% 0.33% .
api_overhead_benchmark_ur SubmitKernel out of order CPU count 101923.000000 instr 101923.000 instr 100.00% 0.00% .
api_overhead_benchmark_ur SubmitKernel in order CPU count 107041.000000 instr 107041.000 instr 100.00% 0.00% .
api_overhead_benchmark_ur SubmitKernel in order 16.323 μs 16.295000 μs 99.83% -0.17% .
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 2.077 μs 2.070000 μs 99.66% -0.34% .
api_overhead_benchmark_ur SubmitKernel out of order 15.576 μs 15.520000 μs 99.64% -0.36% .
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 1.631 μs 1.623000 μs 99.51% -0.49% .
api_overhead_benchmark_sycl SubmitKernel in order 25.159 μs 24.949000 μs 99.17% -0.83% .
Relative perf in group memory (4): 100.001%
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 5.569000 μs 5.605 μs 100.65% 0.65% .
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 3.208000 GB/s 3.201 GB/s 100.22% 0.22% .
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 252.490 μs 252.056000 μs 99.83% -0.17% .
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 133.124 μs 132.214000 μs 99.32% -0.68% .
Relative perf in group miscellaneous (1): 94.154%
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum 858.609 bw GB/s 808.411000 bw GB/s 94.15% -5.85% ---
Relative perf in group multithread (10): 99.538%
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 7382.914000 μs 7495.284 μs 101.52% 1.52% .
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 25681.874000 μs 25995.395 μs 101.22% 1.22% .
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 6922.779000 μs 6974.233 μs 100.74% 0.74% .
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events 40479.072000 μs 40660.910 μs 100.45% 0.45% .
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 17417.035000 μs 17477.781 μs 100.35% 0.35% .
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 1166.967000 μs 1168.649 μs 100.14% 0.14% .
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 8705.550 μs 8634.909000 μs 99.19% -0.81% .
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 2073.159 μs 2020.694000 μs 97.47% -2.53% -
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events 112338.240 μs 109295.935000 μs 97.29% -2.71% --
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 48365.917 μs 46977.855000 μs 97.13% -2.87% --
Relative perf in group Velocity-Bench (9): 100.125%
Benchmark This PR baseline Relative perf Change -
Velocity-Bench Hashtable 379.627909 M keys/sec 375.506 M keys/sec 101.10% 1.10% .
Velocity-Bench Easywave 239.000000 ms 240.000 ms 100.42% 0.42% .
Velocity-Bench dl-mnist 2.730000 s 2.740 s 100.37% 0.37% .
Velocity-Bench svm 0.135200 s 0.136 s 100.30% 0.30% .
Velocity-Bench Sobel Filter 533.583000 ms 533.897 ms 100.06% 0.06% .
Velocity-Bench QuickSilver 118.200000 MMS/CTT 118.140 MMS/CTT 100.05% 0.05% .
Velocity-Bench CudaSift 203.330 ms 202.738000 ms 99.71% -0.29% .
Velocity-Bench dl-cifar 23.437 s 23.339500 s 99.58% -0.42% .
Velocity-Bench Bitcracker 35.287 s 35.129800 s 99.56% -0.44% .
Relative perf in group Runtime (8): 103.487%
Benchmark This PR baseline Relative perf Change -
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 266.359000 ms 287.619 ms 107.98% 7.98% ++++
Runtime_IndependentDAGTaskThroughput_SingleTask 251.902000 ms 268.256 ms 106.49% 6.49% ++++
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 271.741000 ms 280.817 ms 103.34% 3.34% ++
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 271.995000 ms 280.952 ms 103.29% 3.29% ++
Runtime_DAGTaskThroughput_BasicParallelFor 1707.116000 ms 1755.738 ms 102.85% 2.85% ++
Runtime_DAGTaskThroughput_SingleTask 1651.090000 ms 1676.803 ms 101.56% 1.56% .
Runtime_DAGTaskThroughput_HierarchicalParallelFor 1691.933000 ms 1717.167 ms 101.49% 1.49% .
Runtime_DAGTaskThroughput_NDRangeParallelFor 1679.178000 ms 1697.583 ms 101.10% 1.10% .
Relative perf in group MicroBench (14): 100.154%
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_2D_H2D_Strided 4.477000 ms 4.568 ms 102.03% 2.03% +
MicroBench_HostDeviceBandwidth_1D_H2D_Strided 4.346000 ms 4.411 ms 101.50% 1.50% .
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous 4.608000 ms 4.625 ms 100.37% 0.37% .
MicroBench_LocalMem_int32_4096 29.834000 ms 29.871 ms 100.12% 0.12% .
MicroBench_HostDeviceBandwidth_1D_D2H_Strided 4.660000 ms 4.664 ms 100.09% 0.09% .
MicroBench_LocalMem_fp32_4096 29.864000 ms 29.878 ms 100.05% 0.05% .
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous 618.092000 ms 618.158 ms 100.01% 0.01% .
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous 618.124000 ms 618.164 ms 100.01% 0.01% .
MicroBench_HostDeviceBandwidth_3D_D2H_Strided 617.423000 ms 617.455 ms 100.01% 0.01% .
MicroBench_HostDeviceBandwidth_2D_D2H_Strided 617.472 ms 617.470000 ms 100.00% -0.00% .
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous 4.473 ms 4.459000 ms 99.69% -0.31% .
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous 4.463 ms 4.447000 ms 99.64% -0.36% .
MicroBench_HostDeviceBandwidth_3D_H2D_Strided 4.466 ms 4.438000 ms 99.37% -0.63% .
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous 4.500 ms 4.469000 ms 99.31% -0.69% .
Relative perf in group Pattern (10): 100.568%
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_NDRange_int32 16.623000 ms 17.271 ms 103.90% 3.90% ++
Pattern_Reduction_Hierarchical_int32 16.741000 ms 16.966 ms 101.34% 1.34% .
Pattern_SegmentedReduction_NDRange_fp32 2.172000 ms 2.178 ms 100.28% 0.28% .
Pattern_SegmentedReduction_NDRange_int32 2.170000 ms 2.173 ms 100.14% 0.14% .
Pattern_SegmentedReduction_NDRange_int64 2.341000 ms 2.344 ms 100.13% 0.13% .
Pattern_SegmentedReduction_NDRange_int16 2.270000 ms 2.271 ms 100.04% 0.04% .
Pattern_SegmentedReduction_Hierarchical_int16 11.806000 ms 11.811 ms 100.04% 0.04% .
Pattern_SegmentedReduction_Hierarchical_fp32 11.597000 ms 11.597 ms 100.00% 0.00% .
Pattern_SegmentedReduction_Hierarchical_int32 11.598 ms 11.595000 ms 99.97% -0.03% .
Pattern_SegmentedReduction_Hierarchical_int64 11.786 ms 11.774000 ms 99.90% -0.10% .
Relative perf in group ScalarProduct (6): 100.305%
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_fp32 3.757000 ms 3.822 ms 101.73% 1.73% .
ScalarProduct_Hierarchical_fp32 10.152000 ms 10.201 ms 100.48% 0.48% .
ScalarProduct_NDRange_int32 3.865000 ms 3.874 ms 100.23% 0.23% .
ScalarProduct_Hierarchical_int32 10.543000 ms 10.543 ms 100.00% 0.00% .
ScalarProduct_Hierarchical_int64 11.496 ms 11.477000 ms 99.83% -0.17% .
ScalarProduct_NDRange_int64 5.483 ms 5.459000 ms 99.56% -0.44% .
Relative perf in group USM (7): 97.735%
Benchmark This PR baseline Relative perf Change -
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.044000 ms 1.061 ms 101.63% 1.63% .
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 1.788000 ms 1.811 ms 101.29% 1.29% .
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 1.643000 ms 1.658 ms 100.91% 0.91% .
USM_Allocation_latency_fp32_host 37.405000 ms 37.562 ms 100.42% 0.42% .
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.197000 ms 1.198 ms 100.08% 0.08% .
USM_Allocation_latency_fp32_device 0.068 ms 0.067000 ms 98.53% -1.47% .
USM_Allocation_latency_fp32_shared 0.064 ms 0.053000 ms 82.81% -17.19% ----------
Relative perf in group VectorAddition (3): 101.026%
Benchmark This PR baseline Relative perf Change -
VectorAddition_fp32 1.456000 ms 1.556 ms 106.87% 6.87% ++++
VectorAddition_int64 3.064000 ms 3.177 ms 103.69% 3.69% ++
VectorAddition_int32 1.554 ms 1.446000 ms 93.05% -6.95% ----
Relative perf in group Polybench (3): 100.147%
Benchmark This PR baseline Relative perf Change -
Polybench_2mm 1.212000 ms 1.223 ms 100.91% 0.91% .
Polybench_3mm 1.734 ms 1.730000 ms 99.77% -0.23% .
Polybench_Atax 6.882 ms 6.866000 ms 99.77% -0.23% .
Relative perf in group Kmeans (1): 100.025%
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 16.052000 ms 16.056 ms 100.02% 0.02% .
Relative perf in group LinearRegressionCoeff (1): 101.258%
Benchmark This PR baseline Relative perf Change -
LinearRegressionCoeff_fp32 863.524000 ms 874.384 ms 101.26% 1.26% .
Relative perf in group MolecularDynamics (1): 103.571%
Benchmark This PR baseline Relative perf Change -
MolecularDynamics 0.028000 ms 0.029 ms 103.57% 3.57% ++
Relative perf in group llama.cpp (6): 100.287%
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 128 799.566647 token/s 776.176 token/s 103.01% 3.01% ++
llama.cpp Text Generation Batched 128 62.658740 token/s 62.610 token/s 100.08% 0.08% .
llama.cpp Text Generation Batched 256 62.667969 token/s 62.644 token/s 100.04% 0.04% .
llama.cpp Text Generation Batched 512 62.667643 token/s 62.664 token/s 100.01% 0.01% .
llama.cpp Prompt Processing Batched 512 446.621 token/s 447.955779 token/s 99.70% -0.30% .
llama.cpp Prompt Processing Batched 256 888.583 token/s 898.153602 token/s 98.93% -1.07% .
Relative perf in group alloc/max (20): 102.226%
Benchmark This PR baseline Relative perf Change -
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> 3813.870000 ns 4491.160 ns 117.76% 17.76% ++++++++++
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider 1688.180000 ns 1890.160 ns 111.96% 11.96% +++++++
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> 960.228000 ns 1039.730 ns 108.28% 8.28% +++++
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider 2011.990000 ns 2160.870 ns 107.40% 7.40% ++++
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc 850.182000 ns 886.983 ns 104.33% 4.33% ++
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> 956.301000 ns 988.448 ns 103.36% 3.36% ++
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc 712.064000 ns 735.791 ns 103.33% 3.33% ++
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> 285.454000 ns 294.862 ns 103.30% 3.30% ++
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc 753.012000 ns 768.256 ns 102.02% 2.02% +
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> 297.490000 ns 302.838 ns 101.80% 1.80% .
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc 177.480000 ns 179.529 ns 101.15% 1.15% .
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider 186.242000 ns 187.761 ns 100.82% 0.82% .
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc 1230.370000 ns 1238.370 ns 100.65% 0.65% .
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> 215.466000 ns 216.850 ns 100.64% 0.64% .
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> 259.777 ns 258.719000 ns 99.59% -0.41% .
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> 212.979 ns 211.876000 ns 99.48% -0.52% .
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider 193.173 ns 191.832000 ns 99.31% -0.69% .
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> 274.346 ns 270.916000 ns 98.75% -1.25% .
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc 2643.070 ns 2571.500000 ns 97.29% -2.71% --
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> 3853.960 ns 3344.700000 ns 86.79% -13.21% -------
Relative perf in group multiple (12): 101.311%
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> 41338.500000 ns 43389.000 ns 104.96% 4.96% +++
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc 32260.600000 ns 33605.200 ns 104.17% 4.17% ++
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> 26282.000000 ns 27311.800 ns 103.92% 3.92% ++
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc 135820.000000 ns 139597.000 ns 102.78% 2.78% ++
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> 15162.100000 ns 15421.500 ns 101.71% 1.71% .
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc 30751.400000 ns 31247.600 ns 101.61% 1.61% .
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> 158322.000000 ns 159366.000 ns 100.66% 0.66% .
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> 1139900.000 ns 1139210.000000 ns 99.94% -0.06% .
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> 72122.400 ns 71690.300000 ns 99.40% -0.60% .
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider 138284.000 ns 137218.000000 ns 99.23% -0.77% .
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc 4186.440 ns 4139.250000 ns 98.87% -1.13% .
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider 1195530.000 ns 1180540.000000 ns 98.75% -1.25% .

Details

Benchmark details - environment, command, output...
api_overhead_benchmark_l0 SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.234,11.218,4.77%,10.421,157.468,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),23.413,23.390,3.30%,22.722,237.732,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),25.190,25.159,2.87%,24.371,234.161,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Device --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Device destinationPlacement=Device size=1KB count=100),252.642,252.490,1.03%,249.370,472.921,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Host --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Host destinationPlacement=Device size=1KB count=100),133.237,133.124,1.64%,132.206,336.946,[CPU],[us]

memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueMemcpy --csv --noHeaders --iterations=10000 --sourcePlacement=Device --destinationPlacement=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueMemcpy(api=sycl sourcePlacement=Device destinationPlacement=Device size=1KB),5.924,5.569,16.15%,5.067,71.287,[CPU],[us]

memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=StreamMemory --csv --noHeaders --iterations=10000 --type=Triad --size=10240 --memoryPlacement=Device --useEvents=0 --contents=Zeros --multiplier=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
StreamMemory(api=sycl type=Triad size=10KB useEvents=0 contents=Zeros memoryPlacement=Device multiplier=1),3.190,3.208,3.47%,0.462,3.397,[CPU],[GB/s]

api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=0 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Device --dst=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Device dst=Device size=1KB ioq=0),2.081,2.077,6.62%,1.893,35.055,[CPU],[us]

api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=1 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Host --dst=Host --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Host dst=Host size=1KB ioq=1),1.638,1.631,4.00%,1.543,6.177,[CPU],[us]

miscellaneous_benchmark_sycl VectorSum

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/miscellaneous_benchmark_sycl --test=VectorSum --csv --noHeaders --iterations=1000 --numberOfElementsX=512 --numberOfElementsY=256 --numberOfElementsZ=256

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
VectorSum(api=sycl numberOfElementsX=512 numberOfElementsY=256 numberOfElementsZ=256),858.004,858.609,0.51%,815.219,869.887,[GPU],bw [GB/s]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),6940.931,6922.779,0.74%,6895.537,7043.060,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),17400.104,17417.035,2.87%,16484.101,18044.265,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),48635.985,48365.917,2.70%,46020.954,57075.582,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),2138.679,2073.159,29.44%,1554.293,25284.051,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),7396.509,7382.914,1.32%,7295.299,7600.440,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),8839.235,8705.550,3.86%,8516.071,9689.231,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),25895.026,25681.874,2.00%,24980.045,27200.552,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),1255.315,1166.967,55.91%,866.762,16806.527,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=1 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=1 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),40519.355,40479.072,0.36%,40318.849,40776.374,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=4 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=4 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),112240.511,112338.240,0.27%,111592.933,112714.067,[CPU],[us]

api_overhead_benchmark_ur SubmitKernel out of order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),101982.560,101923.000,6.01%,101815.000,2037277.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),15.469,15.297,301.92%,14.659,14783.227,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),101982.557,101923.000,6.01%,101815.000,2037277.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),15.745,15.576,305.01%,14.788,15200.911,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),107107.796,107041.000,3.97%,107041.000,1447430.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.449,16.266,317.64%,15.557,16536.914,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),107107.796,107041.000,3.97%,107041.000,1447432.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.510,16.323,327.49%,15.589,17113.868,[CPU],time [us]

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.353551 s
379.627909 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00390882 s
bitcracker - total time for whole calculation: 35.2866 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1111 1271 30.1656% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1262 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1108 1272 30.0842% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1262 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1106 1278 30.0299% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1274 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1272 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1099 1259 29.8398% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1152 1278 31.2788% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1105 1272 30.0027% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1270 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1262 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1259 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1270 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1110 1267 30.1385% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1039 1255 28.2107% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1257 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1104 1259 29.9756% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1124 1267 30.5186% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1271 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1261 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1272 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1111 1265 30.1656% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1086 1258 29.4868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1262 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1264 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1256 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1260 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1113 1263 30.2199% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1210 1256 32.8537% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1147 1253 31.1431% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1258 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1131 1271 30.7087% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1108 1276 30.0842% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1265 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1199 1256 32.555% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1264 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1123 1268 30.4914% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1272 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1263 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1256 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1263 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1102 1268 29.9213% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1271 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1050 1266 28.5094% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1161 1250 31.5232% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1257 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1094 1273 29.704% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1268 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1242 1276 33.7225% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 203.33 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 3.711540e-01 6.081480e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.446580e-01 7.451310e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.413480e-01 7.599690e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.677370e-01 8.260470e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.373520e-01 7.986320e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.383960e-01 7.821140e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.422090e-01 7.806090e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.439010e-01 7.824050e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.410210e-01 7.819310e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.366740e-01 7.567830e-01 0.000000e+00

Timer                      Cumulative   Cumulative   Cumulative   Cumulative   Cumulative   Cumulative
Name                           number    microSecs    microSecs    microSecs    microSecs   Efficiency
                             of calls          min          avg          max       stddev       Rating
main                                1    1.109e+07    1.109e+07    1.109e+07    0.000e+00       100.00
cycleInit                          10    3.464e+06    3.464e+06    3.464e+06    0.000e+00       100.00
cycleTracking                      10    7.622e+06    7.622e+06    7.622e+06    0.000e+00       100.00
cycleTracking_Kernel              104    4.913e+06    4.913e+06    4.913e+06    0.000e+00       100.00
cycleTracking_MPI                 117    2.069e+05    2.069e+05    2.069e+05    0.000e+00       100.00
cycleTracking_Test_Done             0    0.000e+00    0.000e+00    0.000e+00    0.000e+00         0.00
cycleFinalize                      20    4.000e+02    4.000e+02    4.000e+02    0.000e+00       100.00
Figure Of Merit 118.20 [Num Mega Segments / Cycle Tracking Time]
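The figure of merit can be cross-checked against the per-cycle table above. This is a sketch under one assumption: that the value is the sum of the `num_seg` column, in millions, divided by the sum of the `cycleTracking` column, in seconds. That reading follows from the label "[Num Mega Segments / Cycle Tracking Time]" and has not been confirmed against the QuickSilver source.

```python
# Header and rows copied from the per-cycle table in the output above.
HEADER = ("cycle start source rr split absorb scatter fission produce collisn "
          "escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize").split()

ROWS = """\
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 3.711540e-01 6.081480e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.446580e-01 7.451310e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.413480e-01 7.599690e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.677370e-01 8.260470e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.373520e-01 7.986320e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.383960e-01 7.821140e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.422090e-01 7.806090e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.439010e-01 7.824050e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.410210e-01 7.819310e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.366740e-01 7.567830e-01 0.000000e+00
"""

seg_col = HEADER.index("num_seg")
time_col = HEADER.index("cycleTracking")

total_segments = 0
tracking_seconds = 0.0
for line in ROWS.splitlines():
    fields = line.split()
    total_segments += int(fields[seg_col])
    tracking_seconds += float(fields[time_col])

# Assumed definition: mega segments per second of cycle tracking time.
figure_of_merit = (total_segments / 1e6) / tracking_seconds
print(f"{figure_of_merit:.2f}")
```

Under that assumption the result agrees with the reported 118.20 to two decimal places, and the summed tracking time also matches the `cycleTracking` timer row (7.622e+06 microseconds).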

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.49573 s
sobelfilter - total time for whole calculation: 0.533583 s

Velocity-Bench dl-cifar

Environment Variables:

Command:

/home/pmdk/bench_workdir/dl-cifar/dl-cifar_sycl

Output:

	Welcome to DL-CIFAR workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: DL NW size type: WORKLOAD_DEFAULT_SIZE
WL PARAMS: ==================================================
WL PARAMS:

dataFileReadTimer->getTotalOpTime(): 8.8e-05 s
dl-cifar - total time for whole calculation: 23.4368 s

Velocity-Bench dl-mnist

Environment Variables:

NEOReadDebugKeys=1
DisableScratchPages=0

Command:

/home/pmdk/bench_workdir/dl-mnist/dl-mnist-sycl -conv_algo ONEDNN_AUTO

Output:

	Welcome to DL-MNIST workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: Tensor management policy: per_layer
WL PARAMS: Convolution algorithm: ONEDNN_AUTO
WL PARAMS: Dataset reader format: NCHW
WL PARAMS: Dry run: YES
WL PARAMS: OneDNN Conv PD memory format: ONEDNN_CONVPD_ANY
WL PARAMS: No of iterations for inference: 400
WL PARAMS: ==================================================
WL PARAMS:

dl-mnist - total time for whole calculation: 2.73 s

Velocity-Bench svm

Environment Variables:

Command:

/home/pmdk/bench_workdir/svm/svm_sycl /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a9a /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a.m

Output:

Number of args 3
Using cuSVM (Carpenter)...

Buffering input text file (6989624 B).
Load Done
Starting Training
_C 1.000000
Workgroup Size: 1024
nbrCtas 80
elemsPerCta 1248
threadsPerCta 128
Total run time: 0.064500 seconds
Iter:100
M:97683
N:123
Train done. Calulate Vector counts
Training done

Loading elapsed time : 0.0635 s
Processing elapsed time : 0.0697 s
Storing elapsed time : 0.0021 s
Total elapsed time : 0.1352 s
Result's are correct: 0.0551

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.251353', '0.251902', '0.250124', '0.250124 0.251902 0.252033', '0.001067', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']
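The sycl-bench result rows are printed without column names. The sketch below checks one plausible reading of the timing fields in the row above: mean, median, minimum, the individual run times, and the sample standard deviation, in that order. The field meanings are inferred from the numbers only and are an assumption, not taken from sycl-bench documentation.

```python
import statistics

# Timing fields copied from the SingleTask row above. Their interpretation
# (mean, median, min, samples, stddev) is inferred, not documented here.
reported_mean = 0.251353
reported_median = 0.251902
reported_min = 0.250124
samples = [float(x) for x in "0.250124 0.251902 0.252033".split()]
reported_stddev = 0.001067

checks = {
    "mean": (statistics.mean(samples), reported_mean),
    "median": (statistics.median(samples), reported_median),
    "min": (min(samples), reported_min),
    # Sample (n - 1) standard deviation; the population form does not match.
    "stddev": (statistics.stdev(samples), reported_stddev),
}

for name, (computed, reported) in checks.items():
    # The tolerance allows for the six-decimal rounding in the log.
    ok = abs(computed - reported) < 2e-6
    print(f"{name}: computed={computed:.6f} reported={reported:.6f} match={ok}")
```

With only three runs per benchmark (`--num-runs=3`), the standard deviation is a rough indicator at best.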

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.269660', '0.266359', '0.265110', '0.265110 0.266359 0.277510', '0.006827', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.271930', '0.271995', '0.269891', '0.269891 0.271995 0.273904', '0.002007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.271762', '0.271741', '0.271396', '0.271396 0.271741 0.272150', '0.000377', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.659606', '1.651090', '1.649908', '1.649908 1.651090 1.677822', '0.015786', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.706304', '1.707116', '1.703370', '1.703370 1.707116 1.708426', '0.002624', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.692271', '1.691933', '1.690513', '1.690513 1.691933 1.694369', '0.001950', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.679591', '1.679178', '1.676119', '1.676119 1.679178 1.683477', '0.003696', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.030733', '0.004463', '0.004336', '0.004336 0.004463 0.083399', '0.045611', '28.827782', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004497', '0.004500', '0.004417', '0.004417 0.004500 0.004575', '0.000079', '28.300718', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004442', '0.004473', '0.004378', '0.004378 0.004473 0.004476', '0.000056', '28.554733', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004612', '0.004608', '0.004571', '0.004571 0.004608 0.004658', '0.000044', '27.347570', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618146', '0.618092', '0.618088', '0.618088 0.618092 0.618259', '0.000098', '0.202237', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']
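In the HostDeviceBandwidth rows so far, the reported throughput field appears to equal the final field (0.125000) divided by the minimum run time. The sketch below checks that relationship for two rows; the relationship and the meaning of the final field are inferred from the numbers and are assumptions, so no units are claimed.

```python
# (minimum run time in seconds, reported throughput, final field) copied from
# the 1D H2D and 2D D2H contiguous rows above.
ROWS = {
    "1D_H2D_Contiguous": (0.004336, 28.827782, 0.125),
    "2D_D2H_Contiguous": (0.618088, 0.202237, 0.125),
}

for name, (min_time, reported, amount) in ROWS.items():
    computed = amount / min_time
    # Relative tolerance covers the six-decimal rounding of the log values.
    ok = abs(computed - reported) / reported < 1e-3
    print(f"{name}: computed={computed:.6f} reported={reported:.6f} match={ok}")

# Ratio of the two minimum run times, as logged.
slowdown = ROWS["2D_D2H_Contiguous"][0] / ROWS["1D_H2D_Contiguous"][0]
print(f"2D D2H vs 1D H2D minimum time ratio: {slowdown:.1f}x")
```

The 2D device-to-host copy is more than 100 times slower than the 1D host-to-device copy in this run, which may be worth a separate look when these results are reviewed.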

MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618099', '0.618124', '0.618012', '0.618012 0.618124 0.618160', '0.000077', '0.202261', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004328', '0.004346', '0.004220', '0.004220 0.004346 0.004419', '0.000101', '29.620951', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004505', '0.004477', '0.004385', '0.004385 0.004477 0.004652', '0.000136', '28.508911', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004435', '0.004466', '0.004333', '0.004333 0.004466 0.004506', '0.000090', '28.845317', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004660', '0.004660', '0.004613', '0.004613 0.004660 0.004706', '0.000046', '27.096147', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617480', '0.617472', '0.617443', '0.617443 0.617472 0.617525', '0.000042', '0.202448', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617433', '0.617423', '0.617388', '0.617388 0.617423 0.617488', '0.000051', '0.202466', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_int32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029846', '0.029834', '0.029802', '0.029802 0.029834 0.029902', '0.000051', '10469.184208', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_fp32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029847', '0.029864', '0.029770', '0.029770 0.029864 0.029909', '0.000071', '10480.464465', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016632', '0.016623', '0.016427', '0.016427 0.016623 0.016845', '0.000209', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016797', '0.016741', '0.016633', '0.016633 0.016741 0.017017', '0.000198', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003867', '0.003865', '0.003752', '0.003752 0.003865 0.003983', '0.000116', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.005494', '0.005483', '0.005451', '0.005451 0.005483 0.005548', '0.000049', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003779', '0.003757', '0.003750', '0.003750 0.003757 0.003830', '0.000044', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010543', '0.010543', '0.010535', '0.010535 0.010543 0.010551', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011483', '0.011496', '0.011422', '0.011422 0.011496 0.011532', '0.000056', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010156', '0.010152', '0.010147', '0.010147 0.010152 0.010168', '0.000011', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002275', '0.002270', '0.002269', '0.002269 0.002270 0.002287', '0.000010', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002174', '0.002170', '0.002169', '0.002169 0.002170 0.002184', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002343', '0.002341', '0.002340', '0.002340 0.002341 0.002349', '0.000005', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002172', '0.002172', '0.002167', '0.002167 0.002172 0.002177', '0.000005', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011808', '0.011806', '0.011796', '0.011796 0.011806 0.011823', '0.000014', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011597', '0.011598', '0.011594', '0.011594 0.011598 0.011599', '0.000003', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011785', '0.011786', '0.011781', '0.011781 0.011786 0.011787', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011598', '0.011597', '0.011589', '0.011589 0.011597 0.011608', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_device', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000062', '0.000068', '0.000050', '0.000050 0.000068 0.000069', '0.000011', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_host', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.037472', '0.037405', '0.037312', '0.037312 0.037405 0.037699', '0.000202', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_shared', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000063', '0.000064', '0.000062', '0.000062 0.000064 0.000065', '0.000002', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.021703', '0.001643', '0.001641', '0.001641 0.001643 0.061825', '0.034747', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']
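
In the row above, the first timing column (0.021703) is more than ten times the second (0.001643), because one of the three samples (0.061825) is roughly 37 times larger than the other two. The sketch below recomputes the statistics from the sample field to make that visible. The field positions (index 14 for the space-separated samples, 11, 12 and 15 for what look like mean, median and sample standard deviation) are inferred from the values in this log, not taken from sycl-bench documentation.

```python
import ast
import statistics

# Output row copied from the log above. Field meanings are an assumption
# inferred from the numbers, not from sycl-bench documentation.
row_text = (
    "['USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch', 'PASS', "
    "'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', "
    "'N/A', '256', '8192', '0.021703', '0.001643', '0.001641', "
    "'0.001641 0.001643 0.061825', '0.034747']"  # trailing fields omitted
)

fields = ast.literal_eval(row_text)            # the row is a Python list literal
samples = [float(s) for s in fields[14].split()]

mean = statistics.mean(samples)
median = statistics.median(samples)
stdev = statistics.stdev(samples)              # sample (n-1) standard deviation

print(f"{fields[0]}: mean={mean:.6f} median={median:.6f} stdev={stdev:.6f}")
```

With only three runs, a single slow sample dominates the mean, so the median is probably the more representative number for this benchmark.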

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001045', '0.001044', '0.001038', '0.001038 0.001044 0.001053', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001809', '0.001788', '0.001784', '0.001784 0.001788 0.001855', '0.000040', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001199', '0.001197', '0.001196', '0.001196 0.001197 0.001204', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001543', '0.001554', '0.001448', '0.001448 0.001554 0.001626', '0.000090', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003098', '0.003064', '0.003056', '0.003056 0.003064 0.003173', '0.000066', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001487', '0.001456', '0.001450', '0.001450 0.001456 0.001554', '0.000059', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_2mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/2mm.csv --size=512

Output:

['Polybench_2mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001218', '0.001212', '0.001208', '0.001208 0.001212 0.001233', '0.000013', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_3mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/3mm.csv --size=512

Output:

['Polybench_3mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001731', '0.001734', '0.001722', '0.001722 0.001734 0.001736', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_Atax

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Atax.csv --size=8192

Output:

['Polybench_Atax', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.006885', '0.006882', '0.006867', '0.006867 0.006882 0.006905', '0.000019', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Kmeans_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Kmeans.csv --size=700000000

Output:

['Kmeans_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '700000000', '0.016048', '0.016052', '0.016037', '0.016037 0.016052 0.016054', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

LinearRegressionCoeff_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/lin_reg_coeff --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LinearRegressionCoeff.csv --size=1638400000

Output:

['LinearRegressionCoeff_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1638400000', '0.856914', '0.863524', '0.842108', '0.842108 0.863524 0.865110', '0.012847', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MolecularDynamics

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/MolecularDynamics.csv --size=8196

Output:

['MolecularDynamics', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8196', '0.000035', '0.000028', '0.000025', '0.000025 0.000028 0.000052', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

llama.cpp Prompt Processing Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:16:27Z","642086462","38066168","799.566647","45.704572"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:16:32Z","2043173621","3751192","62.647806","0.114782"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:16:43Z","592328165","14627195","864.811437","21.555690"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:16:46Z","2038162259","1459840","62.801699","0.044935"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:16:57Z","1132254195","39465089","452.629187","15.561680"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:17:03Z","2038127715","2179656","62.802795","0.067109"

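Each llama-bench invocation emits six rows (three batch sizes, each with a prompt row where `n_prompt` is 512 and a generation row where `n_gen` is 128), while each benchmark name refers to one of them. A minimal sketch of picking the matching row follows. The `select` helper and the name-to-row mapping are hypothetical, and the CSV is trimmed to the columns used here.

```python
import csv
import io

# Rows from the "Prompt Processing Batched 128" output above,
# trimmed to the columns this sketch needs.
trimmed_csv = '''n_batch,n_prompt,n_gen,avg_ts,stddev_ts
"128","512","0","799.566647","45.704572"
"128","0","128","62.647806","0.114782"
"256","512","0","864.811437","21.555690"
"256","0","128","62.801699","0.044935"
"512","512","0","452.629187","15.561680"
"512","0","128","62.802795","0.067109"
'''


def select(rows, n_batch, kind):
    """Return avg_ts (tokens/s) for one batch size; kind is 'prompt' or 'gen'.

    Hypothetical helper: assumes prompt rows have n_prompt > 0 and
    generation rows have n_prompt == 0, as in the output above.
    """
    for row in rows:
        is_prompt = int(row["n_prompt"]) > 0
        if int(row["n_batch"]) == n_batch and is_prompt == (kind == "prompt"):
            return float(row["avg_ts"])
    raise KeyError((n_batch, kind))


rows = list(csv.DictReader(io.StringIO(trimmed_csv)))
prompt_128 = select(rows, 128, "prompt")
gen_128 = select(rows, 128, "gen")
print(prompt_128, gen_128)
```
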
llama.cpp Text Generation Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:21:19Z","682033275","93511522","761.605436","99.620094"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:21:23Z","2042812648","1646281","62.658740","0.050419"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:21:34Z","576864200","5922290","887.631244","9.010138"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:21:37Z","2041681924","824087","62.693417","0.025232"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:21:48Z","1135366865","2044981","450.956729","0.811404"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:21:54Z","2040147143","2084860","62.740625","0.064014"

llama.cpp Prompt Processing Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:22:06Z","647466842","41908452","793.321780","49.247843"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:22:12Z","2045062200","3303144","62.589914","0.100928"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:22:22Z","576365816","11130265","888.583237","16.734096"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:22:26Z","2044037850","1042339","62.621163","0.031917"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:22:36Z","1112579801","2202796","460.193152","0.910835"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:22:43Z","2042488113","1997265","62.668711","0.061217"

llama.cpp Text Generation Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:17:15Z","651593736","50778943","789.525394","60.378863"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:17:21Z","2044858818","1888133","62.596051","0.057722"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:17:31Z","613097732","28285016","836.539516","38.936404"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:17:35Z","2042511409","1307687","62.667969","0.040093"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:17:45Z","1152878171","11281940","444.139874","4.339453"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:17:52Z","2045601772","1115072","62.573289","0.034090"

llama.cpp Prompt Processing Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:12:22Z","708134688","201409040","759.026828","158.647962"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:12:29Z","2042038292","3139449","62.682586","0.096224"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:12:39Z","585859884","18966458","874.654792","28.022178"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:12:43Z","2041036468","962031","62.713246","0.029527"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:12:53Z","1146415840","6599093","446.621203","2.580368"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:13:00Z","2041514456","1740718","62.698588","0.053413"

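Reading note: each llama-bench CSV above interleaves prompt-processing rows (`n_prompt=512`, `n_gen=0`) with text-generation rows (`n_prompt=0`, `n_gen=128`) for every `-b` batch size. Below is a minimal sketch of how one value per result name could be pulled out of such output. It assumes the row is selected by `n_batch`, `n_prompt` and `n_gen`, and that `avg_ts` (tokens per second) is the reported metric. The helper name `pick_avg_ts` and the shortened sample (only five of the real columns, values copied from the rows above) are hypothetical and not taken from the benchmark scripts.

```python
import csv
import io

# Shortened sample: only the columns needed here, values copied from the log above.
SAMPLE = '''n_batch,n_prompt,n_gen,avg_ts,stddev_ts
"128","512","0","759.026828","158.647962"
"128","0","128","62.682586","0.096224"
"256","512","0","874.654792","28.022178"
"256","0","128","62.713246","0.029527"
"512","512","0","446.621203","2.580368"
"512","0","128","62.698588","0.053413"
'''


def pick_avg_ts(csv_text, n_batch, prompt_processing):
    """Return avg_ts (tokens/s) for one batch size.

    prompt_processing=True selects the row with n_gen == 0;
    False selects the text-generation row (n_gen != 0).
    """
    for row in csv.DictReader(io.StringIO(csv_text)):
        if int(row["n_batch"]) != n_batch:
            continue
        is_prompt_row = int(row["n_gen"]) == 0
        if is_prompt_row == prompt_processing:
            return float(row["avg_ts"])
    raise KeyError(f"no matching row for n_batch={n_batch}")


if __name__ == "__main__":
    print("prompt processing, batch 512:", pick_avg_ts(SAMPLE, 512, True))
    print("text generation,   batch 512:", pick_avg_ts(SAMPLE, 512, False))
```

With the rows from this run, text generation stays near 62.7 tokens/s for every batch size, while prompt processing varies strongly with `n_batch` and shows a large `stddev_ts` at batch 128.
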
llama.cpp Text Generation Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:14:49Z","638483695","49346796","805.446025","57.395034"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:14:57Z","2042119832","3620395","62.680122","0.110868"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:15:07Z","576072702","11342998","889.052822","17.529744"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:15:10Z","2042826486","2534858","62.658360","0.077729"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-14T14:15:21Z","1126777646","9260011","454.417641","3.731160"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-14T14:15:28Z","2042524620","2888473","62.667643","0.088469"

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,720.452,720.452,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1256.51,1210.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,759.477,759.476,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,899.469,838.579,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.48,177.476,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2064.16,2062.54,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1841.21,1841.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.905,187.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3592.83,3586.22,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,261.294,261.286,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3969.23,3963.81,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.403,285.396,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.039,215.987,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.345,261.416,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.696,207.694,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,947.498,937.967,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,951.551,951.539,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32260.6,30779,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4202.52,4202.42,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135820,87298.3,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31681.2,31680.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.1399e+06,1.13951e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160162,160157,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14093e+06,1.14044e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,136959,136958,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42823.2,41712.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14736.3,14735.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72122.4,72105.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26282,26281.5,ns,,,,,

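Reading note: the result names in this log (for example `alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc`) appear to be the Google Benchmark CSV `name` with its first path component moved to the end. The sketch below shows one way such a mapping could be done; the splitting rule, the choice of `real_time` as the reported value, and the helper name `parse_umf_csv` are assumptions made for illustration, not the harness's confirmed behavior.

```python
import csv
import io

# Three rows copied from the umf-benchmark output above.
SAMPLE = '''name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
'''


def parse_umf_csv(csv_text):
    """Map '<config> <pool>' labels to (real_time, time_unit) tuples."""
    results = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Assumed rule: the first path component is the pool or provider name.
        pool, _, config = row["name"].partition("/")
        results[f"{config} {pool}"] = (float(row["real_time"]), row["time_unit"])
    return results


if __name__ == "__main__":
    for label, (value, unit) in parse_umf_csv(SAMPLE).items():
        print(f"{label}: {value} {unit}")
```

Because every umf-benchmark invocation prints all rows, the same CSV is repeated under each result name in this log; only the row matching the name is relevant to that entry.
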
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,720.452,720.452,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1256.51,1210.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,759.477,759.476,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,899.469,838.579,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.48,177.476,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2064.16,2062.54,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1841.21,1841.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.905,187.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3592.83,3586.22,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,261.294,261.286,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3969.23,3963.81,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.403,285.396,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.039,215.987,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.345,261.416,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.696,207.694,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,947.498,937.967,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,951.551,951.539,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32260.6,30779,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4202.52,4202.42,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135820,87298.3,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31681.2,31680.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.1399e+06,1.13951e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160162,160157,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14093e+06,1.14044e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,136959,136958,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42823.2,41712.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14736.3,14735.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72122.4,72105.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26282,26281.5,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

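The per-benchmark headings in this log (for example `alloc/.../threads:1 proxy_pool`) appear to name one row of the CSV that follows. Below is a minimal sketch of how such a row could be pulled out of `--benchmark_format=csv` output with Python's standard `csv` module. The helper names `parse_benchmark_csv` and `find_row` are hypothetical, and the label-to-row mapping is an assumption inferred from the headings and row names in this log, not taken from the benchmark scripts.

```python
import csv
import io

# Header plus two rows copied from the output above.
SAMPLE = '''name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
'''


def parse_benchmark_csv(text):
    """Map each benchmark name to (real_time, cpu_time, time_unit)."""
    rows = {}
    for row in csv.DictReader(io.StringIO(text)):
        rows[row["name"]] = (
            float(row["real_time"]),
            float(row["cpu_time"]),
            row["time_unit"],
        )
    return rows


def find_row(rows, label):
    """Hypothetical helper: resolve a '<test path> <pool>' heading to a row name.

    Assumes the row name has the form '<pool>[<provider>]/<test path>'.
    """
    path, pool = label.rsplit(" ", 1)  # the pool is the last word of the heading
    for name in rows:
        head, _, rest = name.partition("/")  # head: 'proxy_pool<os_provider>' or 'glibc'
        if rest == path and head.split("<", 1)[0] == pool:
            return name
    return None


rows = parse_benchmark_csv(SAMPLE)
label = "alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool"
name = find_row(rows, label)
print(name, rows[name])
```

Splitting the heading from the right keeps labels such as `min size:8/max size:65536`, which contain spaces, intact.
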
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,720.452,720.452,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1256.51,1210.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,759.477,759.476,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,899.469,838.579,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.48,177.476,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2064.16,2062.54,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1841.21,1841.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.905,187.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3592.83,3586.22,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,261.294,261.286,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3969.23,3963.81,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.403,285.396,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.039,215.987,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.345,261.416,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.696,207.694,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,947.498,937.967,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,951.551,951.539,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32260.6,30779,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4202.52,4202.42,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135820,87298.3,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31681.2,31680.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.1399e+06,1.13951e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160162,160157,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14093e+06,1.14044e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,136959,136958,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42823.2,41712.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14736.3,14735.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72122.4,72105.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26282,26281.5,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

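The same CSV row shows different timings in the outputs above (for instance, the first `glibc/alloc` row reads 2751.51 ns, 2643.07 ns, and 2633.52 ns). Assuming those outputs come from separate invocations of `umf-benchmark`, the sketch below quantifies that variation as a relative spread, (max - min) / mean. The function name `relative_spread` is hypothetical, and the values are copied by hand from this log.

```python
from statistics import mean

# real_time values (ns) for two rows, as they appear in three of the outputs above.
SAMPLES = {
    "glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4": [
        2751.51, 2643.07, 2633.52,
    ],
    "glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1": [
        706.308, 720.452, 712.064,
    ],
}


def relative_spread(values):
    """Return (max - min) / mean for a list of timings."""
    return (max(values) - min(values)) / mean(values)


for name, values in SAMPLES.items():
    print(f"{relative_spread(values):6.2%}  {name}")
```

With these numbers the 4-thread row varies by roughly 4% and the single-thread row by roughly 2%, which is worth keeping in mind when reading the changes reported against the 2.00% threshold in the summary.
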
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,720.452,720.452,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1256.51,1210.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,759.477,759.476,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,899.469,838.579,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.48,177.476,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2064.16,2062.54,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1841.21,1841.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.905,187.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3592.83,3586.22,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,261.294,261.286,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3969.23,3963.81,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.403,285.396,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.039,215.987,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.345,261.416,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.696,207.694,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,947.498,937.967,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,951.551,951.539,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32260.6,30779,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4202.52,4202.42,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135820,87298.3,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31681.2,31680.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.1399e+06,1.13951e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160162,160157,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14093e+06,1.14044e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,136959,136958,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42823.2,41712.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14736.3,14735.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72122.4,72105.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26282,26281.5,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,720.452,720.452,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1256.51,1210.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,759.477,759.476,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,899.469,838.579,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.48,177.476,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2064.16,2062.54,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1841.21,1841.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.905,187.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3592.83,3586.22,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,261.294,261.286,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3969.23,3963.81,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.403,285.396,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.039,215.987,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.345,261.416,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.696,207.694,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,947.498,937.967,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,951.551,951.539,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32260.6,30779,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4202.52,4202.42,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135820,87298.3,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31681.2,31680.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.1399e+06,1.13951e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160162,160157,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14093e+06,1.14044e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,136959,136958,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42823.2,41712.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14736.3,14735.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72122.4,72105.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26282,26281.5,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,720.452,720.452,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1256.51,1210.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,759.477,759.476,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,899.469,838.579,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.48,177.476,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2064.16,2062.54,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1841.21,1841.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.905,187.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3592.83,3586.22,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,261.294,261.286,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3969.23,3963.81,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.403,285.396,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.039,215.987,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.345,261.416,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.696,207.694,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,947.498,937.967,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,951.551,951.539,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32260.6,30779,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4202.52,4202.42,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135820,87298.3,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31681.2,31680.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.1399e+06,1.13951e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160162,160157,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14093e+06,1.14044e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,136959,136958,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42823.2,41712.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14736.3,14735.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72122.4,72105.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26282,26281.5,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2751.51,1855.47,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,706.308,706.307,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1230.37,1147.33,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,746.218,746.193,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,843.969,817.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.108,176.103,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2011.99,2011.91,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.242,186.199,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1688.18,1688.04,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.173,193.167,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4125.78,4117.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,259.777,259.767,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3853.96,3845.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.454,285.448,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,285.704,284.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.444,211.438,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,274.346,266.984,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,223.538,223.531,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,960.228,948.841,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,956.301,956.287,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31513.6,30063.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4186.44,4186.33,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137036,88719.9,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30350.1,30349.8,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.13926e+06,1.13827e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,158322,158321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.2525e+06,1.25198e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,139918,139917,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,41338.5,40870.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,17028.9,17024.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70352.4,69388.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25258.4,25258,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2633.52,1866.41,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,712.064,712.067,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1228.13,1162.79,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,753.012,753.017,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,850.182,823.955,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.742,177.743,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,1807.38,1807.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.396,187.391,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1686.09,1685.9,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.581,195.576,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3813.87,3808.58,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.074,258.027,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2947.2,2940.39,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,295.192,295.182,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,307.914,300.379,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,215.466,215.461,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,279.454,278.303,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,212.979,212.979,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1031.36,994.17,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,961.371,961.359,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32644.2,31113.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4149.35,4149.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135055,87931.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30751.4,30751.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21051e+06,1.20926e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,155379,155376,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19553e+06,1.19505e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,138284,138279,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,40358.2,40264.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15162.1,15161.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72150.9,71810.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28200.9,28200.4,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,720.452,720.452,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1256.51,1210.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,759.477,759.476,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,899.469,838.579,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.48,177.476,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2064.16,2062.54,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1841.21,1841.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.905,187.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3592.83,3586.22,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,261.294,261.286,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3969.23,3963.81,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.403,285.396,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.039,215.987,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.345,261.416,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.696,207.694,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,947.498,937.967,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,951.551,951.539,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32260.6,30779,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4202.52,4202.42,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135820,87298.3,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31681.2,31680.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.1399e+06,1.13951e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160162,160157,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14093e+06,1.14044e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,136959,136958,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42823.2,41712.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14736.3,14735.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72122.4,72105.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26282,26281.5,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2643.07,1769.78,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,720.452,720.452,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1256.51,1210.39,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,759.477,759.476,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,899.469,838.579,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,177.48,177.476,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2064.16,2062.54,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,185.724,185.719,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1841.21,1841.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.905,187.9,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3592.83,3586.22,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,261.294,261.286,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3969.23,3963.81,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,285.403,285.396,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,297.49,283.975,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.039,215.987,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,262.345,261.416,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.696,207.694,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,947.498,937.967,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,951.551,951.539,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32260.6,30779,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4202.52,4202.42,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135820,87298.3,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31681.2,31680.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.1399e+06,1.13951e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160162,160157,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14093e+06,1.14044e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,136959,136958,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,42823.2,41712.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14736.3,14735.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72122.4,72105.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26282,26281.5,ns,,,,,

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12769778913

MemcpyExecute(self, 10, 16, 1024, 10000, 0, 1, 1),
MemcpyExecute(self, 4096, 1, 1024, 10, 0, 1, 0),
MemcpyExecute(self, 4096, 4, 1024, 10, 0, 1, 0),
GraphApiSinKernelSYCL(self, 0, 1),

hm, this might be too much :D

        super().__init__(bench, "graph_api_benchmark_sycl", "SinKernel")

    def name(self):
        return f"graph_api_benchmark_sycl SinKernel"

names need to reflect arguments, otherwise the benchmarks won't be unique in output.
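A minimal sketch of what this could look like, assuming the benchmark takes two scenario parameters. The class below is a hypothetical stand-in, not the code in this PR; the attribute names `with_graphs` and `num_kernels` are assumptions, and the label format simply mirrors the `graphs:0, numKernels:10` style seen in the run output.

```python
class GraphApiSinKernelGraph:
    """Hypothetical stand-in for the benchmark class under review."""

    def __init__(self, with_graphs: int, num_kernels: int):
        # Assumed parameter names; the real constructor may differ.
        self.with_graphs = with_graphs
        self.num_kernels = num_kernels

    def name(self) -> str:
        # Embed every scenario parameter so each instance gets a distinct name.
        return (
            "graph_api_benchmark_sycl SinKernelGraph "
            f"graphs:{self.with_graphs}, numKernels:{self.num_kernels}"
        )


# Three scenarios that differ only in the number of kernels.
scenarios = [GraphApiSinKernelGraph(0, n) for n in (10, 50, 100)]
names = [s.name() for s in scenarios]
```

With the parameters in the name, the three scenarios no longer collapse into a single entry when results are keyed by name.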

@github-actions

This comment was marked as outdated.

@pbalcer left a comment

The graph benchmarks didn't run because compute-benchmarks is too old. You need to update the commit being used (line 25).

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12786535660

@github-actions
Contributor

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/12786535660
Job status: cancelled. Test status: cancelled.

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12786836274

@github-actions

This comment was marked as outdated.

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12790642289

@github-actions

This comment was marked as outdated.

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12792217891

@github-actions

This comment was marked as outdated.

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12793013959

@mateuszpn marked this pull request as ready for review January 16, 2025 12:59
@mateuszpn requested a review from a team as a code owner January 16, 2025 12:59
MemcpyExecute(self, 10, 16, 1024, 10000, 0, 1, 1),
MemcpyExecute(self, 4096, 1, 1024, 10, 0, 1, 0),
MemcpyExecute(self, 4096, 4, 1024, 10, 0, 1, 0),
GraphApiSinKernelGraphSYCL(self, 0, 10),

Please pick at most 2 or 3 scenarios per benchmark. We need to keep the runtime of the whole job reasonable (I'm aiming for under 30 minutes).


    def bin_args(self) -> list[str]:
        return [
            "--iterations=100",

Is this enough iterations for the benchmarks to have reproducible results?
We aim for the stddev between runs (or, rather, the coefficient of variation of all the runs) to be smaller than 2%.
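As a rough illustration of the 2% criterion, the coefficient of variation is the standard deviation of the per-run results divided by their mean. The sketch below assumes the results of repeated runs are already available as a list of floats; the function names and sample values are made up for the example and are not part of the benchmark scripts.

```python
import statistics


def coefficient_of_variation(samples: list[float]) -> float:
    """Return the sample standard deviation divided by the mean."""
    mean = statistics.mean(samples)
    if mean == 0:
        raise ValueError("mean is zero; coefficient of variation is undefined")
    # statistics.stdev needs at least two samples.
    return statistics.stdev(samples) / mean


def is_stable(samples: list[float], threshold: float = 0.02) -> bool:
    """True if the run-to-run variation is below the threshold (2% by default)."""
    return coefficient_of_variation(samples) < threshold


# Made-up timings in microseconds from five hypothetical runs.
steady_runs = [100.0, 101.0, 99.0, 100.5, 99.5]
noisy_runs = [100.0, 110.0, 90.0, 105.0, 95.0]
```

If a scenario fails this check, raising the iteration count is one possible remedy, at the cost of a longer job.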

@pbalcer
Contributor

pbalcer commented Jan 16, 2025

This failed with:

RequestError [HttpError]: Validation Failed: {"resource":"IssueComment","code":"unprocessable","field":"data","message":"Body is too long (maximum is 65536 characters)"}

If this keeps happening after you've reduced the number of scenarios, I suggest we temporarily remove the output (just comment it out) from the markdown. I plan on eventually creating an HTML file per PR and linking to it from the markdown, which will give us the ability to have longer content with all the details.
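A possible stopgap, sketched under assumptions: the script could drop the detailed output only when the combined body would exceed the limit quoted in the error. The function name, the notice text, and the split into `summary` and `details` are hypothetical; only the 65536 figure comes from the error message, and how GitHub counts characters is assumed here to match Python's `len`.

```python
MAX_COMMENT_CHARS = 65536  # limit quoted in the error message above


def fit_comment(summary: str, details: str, limit: int = MAX_COMMENT_CHARS) -> str:
    """Return summary plus details if it fits, otherwise summary plus a notice."""
    full = summary + "\n\n" + details
    if len(full) <= limit:
        return full
    # Hypothetical notice text; the details are dropped entirely.
    notice = "\n\n(Benchmark output omitted: comment body would exceed the size limit.)"
    # Final slice guards against a summary that is itself too long.
    return (summary + notice)[:limit]


short_body = fit_comment("Summary", "details")
long_body = fit_comment("Summary", "x" * 70000)
```

This keeps the summary tables visible in every comment and only sacrifices the raw output in the oversized case.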

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: --filter graph):
https://github.com/oneapi-src/unified-runtime/actions/runs/12809980280

@github-actions
Contributor

Compute Benchmarks level_zero run (--filter graph):
https://github.com/oneapi-src/unified-runtime/actions/runs/12809980280
Job status: success. Test status: success.

Summary

No diffs to calculate performance change

(result is better)

Performance change in benchmark groups

Relative perf in group graph (3): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:10 | 86719.406000 μs | - | | | |
| graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:50 | 248664.883000 μs | - | | | |
| graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:100 | 440612.037000 μs | - | | | |

Relative perf in group api (9): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| api_overhead_benchmark_l0 SubmitKernel out of order | - | 11.528000 μs | | | |
| api_overhead_benchmark_sycl SubmitKernel out of order | - | 23.678000 μs | | | |
| api_overhead_benchmark_sycl SubmitKernel in order | - | 24.844000 μs | | | |
| api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 | - | 2.118000 μs | | | |
| api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 | - | 1.675000 μs | | | |
| api_overhead_benchmark_ur SubmitKernel out of order CPU count | - | 101923.000000 instr | | | |
| api_overhead_benchmark_ur SubmitKernel out of order | - | 15.896000 μs | | | |
| api_overhead_benchmark_ur SubmitKernel in order CPU count | - | 107041.000000 instr | | | |
| api_overhead_benchmark_ur SubmitKernel in order | - | 16.663000 μs | | | |

Relative perf in group memory (4): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 | - | 253.805000 μs | | | |
| memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 | - | 132.929000 μs | | | |
| memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 | - | 5.638000 μs | | | |
| memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 | - | 3.151000 GB/s | | | |

Relative perf in group miscellaneous (1): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| miscellaneous_benchmark_sycl VectorSum | - | 858.609000 bw GB/s | | | |

Relative perf in group multithread (10): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 | - | 6935.535000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 | - | 17316.620000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 | - | 47907.007000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 | - | 2022.915000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 | - | 7452.758000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 | - | 8555.721000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 | - | 25543.132000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 | - | 1157.521000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events | - | 40973.625000 μs | | | |
| multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events | - | 108338.415000 μs | | | |

Relative perf in group Velocity-Bench (9): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| Velocity-Bench Hashtable | - | 362.504819 M keys/sec | | | |
| Velocity-Bench Bitcracker | - | 35.129800 s | | | |
| Velocity-Bench CudaSift | - | 201.142000 ms | | | |
| Velocity-Bench Easywave | - | 229.000000 ms | | | |
| Velocity-Bench QuickSilver | - | 117.490000 MMS/CTT | | | |
| Velocity-Bench Sobel Filter | - | 602.045000 ms | | | |
| Velocity-Bench dl-cifar | - | 23.743900 s | | | |
| Velocity-Bench dl-mnist | - | 2.720000 s | | | |
| Velocity-Bench svm | - | 0.139900 s | | | |

Relative perf in group Runtime (8): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| Runtime_IndependentDAGTaskThroughput_SingleTask | - | 259.395000 ms | | | |
| Runtime_IndependentDAGTaskThroughput_BasicParallelFor | - | 275.382000 ms | | | |
| Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor | - | 278.916000 ms | | | |
| Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor | - | 278.736000 ms | | | |
| Runtime_DAGTaskThroughput_SingleTask | - | 1678.732000 ms | | | |
| Runtime_DAGTaskThroughput_BasicParallelFor | - | 1746.233000 ms | | | |
| Runtime_DAGTaskThroughput_HierarchicalParallelFor | - | 1725.256000 ms | | | |
| Runtime_DAGTaskThroughput_NDRangeParallelFor | - | 1695.816000 ms | | | |

Relative perf in group MicroBench (14): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous | - | 4.238000 ms | | | |
| MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous | - | 4.317000 ms | | | |
| MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous | - | 4.322000 ms | | | |
| MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous | - | 4.414000 ms | | | |
| MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous | - | 617.994000 ms | | | |
| MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous | - | 617.954000 ms | | | |
| MicroBench_HostDeviceBandwidth_1D_H2D_Strided | - | 4.547000 ms | | | |
| MicroBench_HostDeviceBandwidth_2D_H2D_Strided | - | 4.781000 ms | | | |
| MicroBench_HostDeviceBandwidth_3D_H2D_Strided | - | 4.574000 ms | | | |
| MicroBench_HostDeviceBandwidth_1D_D2H_Strided | - | 4.702000 ms | | | |
| MicroBench_HostDeviceBandwidth_2D_D2H_Strided | - | 617.523000 ms | | | |
| MicroBench_HostDeviceBandwidth_3D_D2H_Strided | - | 617.254000 ms | | | |
| MicroBench_LocalMem_int32_4096 | - | 29.866000 ms | | | |
| MicroBench_LocalMem_fp32_4096 | - | 29.833000 ms | | | |

Relative perf in group Pattern (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_NDRange_int32 - 16.163000 ms
Pattern_Reduction_Hierarchical_int32 - 16.411000 ms
Pattern_SegmentedReduction_NDRange_int16 - 2.264000 ms
Pattern_SegmentedReduction_NDRange_int32 - 2.164000 ms
Pattern_SegmentedReduction_NDRange_int64 - 2.336000 ms
Pattern_SegmentedReduction_NDRange_fp32 - 2.163000 ms
Pattern_SegmentedReduction_Hierarchical_int16 - 11.801000 ms
Pattern_SegmentedReduction_Hierarchical_int32 - 11.599000 ms
Pattern_SegmentedReduction_Hierarchical_int64 - 11.779000 ms
Pattern_SegmentedReduction_Hierarchical_fp32 - 11.589000 ms
Relative perf in group ScalarProduct (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_int32 - 3.733000 ms
ScalarProduct_NDRange_int64 - 5.456000 ms
ScalarProduct_NDRange_fp32 - 3.759000 ms
ScalarProduct_Hierarchical_int32 - 10.523000 ms
ScalarProduct_Hierarchical_int64 - 11.490000 ms
ScalarProduct_Hierarchical_fp32 - 10.170000 ms
Relative perf in group USM (7): cannot calculate
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_device - 0.068000 ms
USM_Allocation_latency_fp32_host - 37.899000 ms
USM_Allocation_latency_fp32_shared - 0.066000 ms
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch - 1.661000 ms
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch - 1.046000 ms
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch - 1.814000 ms
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch - 1.195000 ms
Relative perf in group VectorAddition (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
VectorAddition_int32 - 1.448000 ms
VectorAddition_int64 - 3.139000 ms
VectorAddition_fp32 - 1.445000 ms
Relative perf in group Polybench (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
Polybench_2mm - 1.216000 ms
Polybench_3mm - 1.727000 ms
Polybench_Atax - 6.880000 ms
Relative perf in group Kmeans (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 - 16.083000 ms
Relative perf in group MolecularDynamics (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
MolecularDynamics - 0.028000 ms
Relative perf in group llama.cpp (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 128 - 838.869803 token/s
llama.cpp Text Generation Batched 128 - 63.338561 token/s
llama.cpp Prompt Processing Batched 256 - 872.377637 token/s
llama.cpp Text Generation Batched 256 - 63.361520 token/s
llama.cpp Prompt Processing Batched 512 - 434.541716 token/s
llama.cpp Text Generation Batched 512 - 63.295460 token/s
Relative perf in group alloc/max (20): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc - 2589.180000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc - 710.936000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc - 1188.310000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc - 716.901000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc - 861.597000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc - 175.935000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider - 2246.790000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider - 187.819000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider - 1690.250000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider - 189.702000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 4441.700000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 256.696000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3268.220000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 306.439000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 299.852000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 213.534000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 263.904000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 197.833000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> - 1051.720000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> - 952.492000 ns
Relative perf in group multiple (12): cannot calculate
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc - 32574.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc - 4128.530000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc - 138399.000000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc - 28197.400000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> - 1161430.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> - 161766.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider - 1166110.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider - 141737.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> - 42212.800000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> - 14889.200000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> - 72778.500000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> - 27538.700000 ns

Details

Benchmark details - environment, command, output...
graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=10 --withGraphs=0

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=10 withGraphs=0),86469.244,86719.406,3.79%,79486.716,90446.224,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:50

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=50 --withGraphs=0

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=50 withGraphs=0),248093.933,248664.883,2.19%,237445.358,258495.230,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:100

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=100 --withGraphs=0

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=100 withGraphs=0),441720.361,440612.037,1.42%,430127.709,457010.907,[CPU],[us]
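
For reading these result rows programmatically, a minimal Python sketch follows. It assumes the layout seen in the output above: a header with seven names, followed by rows that carry eight comma-separated fields, with the unit (`[us]`) trailing the `Type` field. `BenchResult` and `parse_result_line` are hypothetical names used for illustration; they are not part of the benchmark scripts in this PR.

```python
import csv
from dataclasses import dataclass


@dataclass
class BenchResult:
    test_case: str
    mean: float
    median: float
    stddev_pct: float  # StdDev column with the trailing "%" removed
    minimum: float
    maximum: float
    kind: str          # e.g. "CPU"
    unit: str          # e.g. "us"


def parse_result_line(line: str) -> BenchResult:
    """Parse one CSV result row as printed with --csv --noHeaders."""
    fields = next(csv.reader([line.strip()]))
    if len(fields) != 8:
        raise ValueError(f"expected 8 fields, got {len(fields)}: {line!r}")
    name, mean, median, stddev, lo, hi, kind, unit = fields
    return BenchResult(
        test_case=name,
        mean=float(mean),
        median=float(median),
        stddev_pct=float(stddev.rstrip("%")),
        minimum=float(lo),
        maximum=float(hi),
        kind=kind.strip("[]"),
        unit=unit.strip("[]"),
    )


if __name__ == "__main__":
    row = ("SinKernelGraph(api=sycl numKernels=100 withGraphs=0),"
           "441720.361,440612.037,1.42%,430127.709,457010.907,[CPU],[us]")
    print(parse_result_line(row))
```

The length check is deliberate: if the binary ever emits a different number of columns, the sketch fails loudly instead of shifting values into the wrong fields.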

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12811438249

@pbalcer
Contributor

pbalcer commented Jan 16, 2025

All graph-related benchmarks failed with:

Abort was called at 80 line in file:
./shared/source/command_stream/linear_stream.h

and then I think the GPU crashed:

terminate called after throwing an instance of 'sycl::_V1::exception'
  what():  No device of requested type available.

Have you seen that before?

@github-actions
Contributor

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/12811438249
Job status: failure. Test status: failure.

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: --filter "graph"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12865513647

@github-actions
Contributor

Compute Benchmarks level_zero run (--filter "graph"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12865513647
Job status: success. Test status: success.

Summary

No diffs to calculate performance change

(result is better)

Performance change in benchmark groups

Relative perf in group graph (14): cannot calculate

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:10 | 71750.474000 μs | - | | | |
| graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:10 | 72598.586000 μs | - | | | |
| graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:50 | 196881.249000 μs | - | | | |
| graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:50 | 197215.709000 μs | - | | | |
| graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:200 | 666400.501000 μs | - | | | |
| graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:200 | 733155.088000 μs | - | | | |
| graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:0, numKernels:20 | 11444.061000 μs | - | | | |
| graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:1, numKernels:20 | 98.031000 μs | - | | | |
| graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:0, numKernels:20 | 11444.025000 μs | - | | | |
| graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:1, numKernels:20 | 113.459000 μs | - | | | |
| graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:0, numKernels:200 | 116870.724000 μs | - | | | |
| graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:1, numKernels:200 | 1513.660000 μs | - | | | |
| graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:0, numKernels:200 | 117268.269000 μs | - | | | |
| graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:1, numKernels:200 | 1690.024000 μs | - | | | |
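
One rough way to compare the graph scenarios across kernel counts is to normalize each reported time by `numKernels`. The sketch below does that for two of the SubmitExecGraph rows and also computes the graphs:1 / graphs:0 ratio for SinKernelGraph at 200 kernels. It assumes the reported value covers all kernels of one iteration and that `submit:1` means only the submission is timed; both are guesses from the flag names, not confirmed by this thread. The helper names are hypothetical.

```python
# Times in microseconds, copied from the "graph" group rows above.
SUBMIT_ONLY_US = {20: 98.031, 200: 1513.660}        # SubmitExecGraph ioq:0, submit:1
SIN_KERNEL_200_US = {0: 666400.501, 1: 733155.088}  # SinKernelGraph numKernels:200, keyed by graphs flag


def per_kernel_us(total_us: float, num_kernels: int) -> float:
    """Average time per kernel, assuming the total covers all kernels."""
    if num_kernels <= 0:
        raise ValueError("num_kernels must be positive")
    return total_us / num_kernels


def graphs_ratio(with_graphs_us: float, without_graphs_us: float) -> float:
    """Ratio above 1.0 means the graphs:1 scenario took longer."""
    return with_graphs_us / without_graphs_us


if __name__ == "__main__":
    for n, total in SUBMIT_ONLY_US.items():
        print(f"numKernels={n}: {per_kernel_us(total, n):.3f} us per kernel")
    ratio = graphs_ratio(SIN_KERNEL_200_US[1], SIN_KERNEL_200_US[0])
    print(f"SinKernelGraph numKernels=200, graphs:1 / graphs:0 = {ratio:.3f}")
```

A single run with no baseline is not enough to draw conclusions from these ratios; the sketch only shows the arithmetic.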
Relative perf in group api (12): cannot calculate
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_l0 SubmitKernel out of order - 11.848000 μs
api_overhead_benchmark_l0 SubmitKernel in order - 11.745000 μs
api_overhead_benchmark_sycl SubmitKernel out of order - 23.710000 μs
api_overhead_benchmark_sycl SubmitKernel in order - 24.891000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 2.143000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 1.702000 μs
api_overhead_benchmark_ur SubmitKernel out of order CPU count - 105463.000000 instr
api_overhead_benchmark_ur SubmitKernel out of order - 15.623000 μs
api_overhead_benchmark_ur SubmitKernel in order CPU count - 110815.000000 instr
api_overhead_benchmark_ur SubmitKernel in order - 16.859000 μs
api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count - 123991.000000 instr
api_overhead_benchmark_ur SubmitKernel in order with measure completion - 21.425000 μs
Relative perf in group memory (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 254.865000 μs
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 219.808000 μs
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 5.865000 μs
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 3.043000 GB/s
Relative perf in group miscellaneous (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum - 861.253000 bw GB/s
Relative perf in group multithread (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 - 6931.139000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 - 17007.721000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 - 47383.460000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 - 2073.904000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 - 7868.958000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 - 9035.852000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 - 27237.512000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 - 1194.467000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events - 42860.412000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events - 113343.613000 μs
Relative perf in group Velocity-Bench (9): cannot calculate
Benchmark This PR baseline Relative perf Change -
Velocity-Bench Hashtable - 356.084148 M keys/sec
Velocity-Bench Bitcracker - 35.118800 s
Velocity-Bench CudaSift - 204.342000 ms
Velocity-Bench Easywave - 289.000000 ms
Velocity-Bench QuickSilver - 117.450000 MMS/CTT
Velocity-Bench Sobel Filter - 621.173000 ms
Velocity-Bench dl-cifar - 23.972100 s
Velocity-Bench dl-mnist - 2.380000 s
Velocity-Bench svm - 0.140100 s
Relative perf in group Runtime (8): cannot calculate
Benchmark This PR baseline Relative perf Change -
Runtime_IndependentDAGTaskThroughput_SingleTask - 253.100000 ms
Runtime_IndependentDAGTaskThroughput_BasicParallelFor - 273.484000 ms
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor - 271.662000 ms
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor - 272.505000 ms
Runtime_DAGTaskThroughput_SingleTask - 1691.410000 ms
Runtime_DAGTaskThroughput_BasicParallelFor - 1756.502000 ms
Runtime_DAGTaskThroughput_HierarchicalParallelFor - 1721.262000 ms
Runtime_DAGTaskThroughput_NDRangeParallelFor - 1694.375000 ms
Relative perf in group MicroBench (14): cannot calculate
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous - 5.188000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous - 4.967000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous - 4.769000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous - 4.866000 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous - 618.226000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous - 618.268000 ms
MicroBench_HostDeviceBandwidth_1D_H2D_Strided - 4.919000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Strided - 5.115000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Strided - 5.140000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Strided - 5.113000 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Strided - 617.772000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Strided - 617.628000 ms
MicroBench_LocalMem_int32_4096 - 29.834000 ms
MicroBench_LocalMem_fp32_4096 - 29.857000 ms
Relative perf in group Pattern (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_NDRange_int32 - 16.971000 ms
Pattern_Reduction_Hierarchical_int32 - 17.024000 ms
Pattern_SegmentedReduction_NDRange_int16 - 2.263000 ms
Pattern_SegmentedReduction_NDRange_int32 - 2.164000 ms
Pattern_SegmentedReduction_NDRange_int64 - 2.333000 ms
Pattern_SegmentedReduction_NDRange_fp32 - 2.163000 ms
Pattern_SegmentedReduction_Hierarchical_int16 - 11.801000 ms
Pattern_SegmentedReduction_Hierarchical_int32 - 11.587000 ms
Pattern_SegmentedReduction_Hierarchical_int64 - 11.777000 ms
Pattern_SegmentedReduction_Hierarchical_fp32 - 11.588000 ms
Relative perf in group ScalarProduct (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_int32 - 3.734000 ms
ScalarProduct_NDRange_int64 - 5.456000 ms
ScalarProduct_NDRange_fp32 - 3.767000 ms
ScalarProduct_Hierarchical_int32 - 10.555000 ms
ScalarProduct_Hierarchical_int64 - 11.508000 ms
ScalarProduct_Hierarchical_fp32 - 10.174000 ms
Relative perf in group USM (7): cannot calculate
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_device - 0.068000 ms
USM_Allocation_latency_fp32_host - 37.633000 ms
USM_Allocation_latency_fp32_shared - 0.057000 ms
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch - 1.717000 ms
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch - 1.085000 ms
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch - 1.889000 ms
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch - 1.256000 ms
Relative perf in group VectorAddition (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
VectorAddition_int32 - 1.510000 ms
VectorAddition_int64 - 3.066000 ms
VectorAddition_fp32 - 1.460000 ms
Relative perf in group Polybench (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
Polybench_2mm - 1.221000 ms
Polybench_3mm - 1.730000 ms
Polybench_Atax - 6.855000 ms
Relative perf in group Kmeans (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 - 16.091000 ms
Relative perf in group LinearRegressionCoeff (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
LinearRegressionCoeff_fp32 - 908.423000 ms
Relative perf in group MolecularDynamics (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
MolecularDynamics - 0.030000 ms
Relative perf in group llama.cpp (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 128 - 830.457525 token/s
llama.cpp Text Generation Batched 128 - 62.530663 token/s
llama.cpp Prompt Processing Batched 256 - 872.219855 token/s
llama.cpp Text Generation Batched 256 - 62.524658 token/s
llama.cpp Prompt Processing Batched 512 - 426.427709 token/s
llama.cpp Text Generation Batched 512 - 62.477744 token/s
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:4 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:4 glibc - 2475.310000 ns
alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider - 2120.000000 ns
alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3068.370000 ns
alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool<os_provider> - 283.309000 ns
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:1 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:1 glibc - 706.837000 ns
alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider - 197.281000 ns
alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool<os_provider> - 268.948000 ns
alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool<os_provider> - 213.433000 ns
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:4 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc - 1259.770000 ns
alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider - 1854.120000 ns
alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3771.150000 ns
alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool<os_provider> - 253.839000 ns
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:1 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc - 726.627000 ns
alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider - 195.246000 ns
alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool<os_provider> - 308.264000 ns
alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool<os_provider> - 206.713000 ns
Relative perf in group alloc/min (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc - 803.081000 ns
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc - 177.090000 ns
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool<os_provider> - 978.697000 ns
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool<os_provider> - 975.381000 ns
Relative perf in group multiple (12): cannot calculate
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc - 33503.600000 ns
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc - 4251.600000 ns
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc - 141113.000000 ns
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc - 30214.100000 ns
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool<os_provider> - 1170470.000000 ns
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool<os_provider> - 165011.000000 ns
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider - 1151930.000000 ns
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider - 145356.000000 ns
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool<os_provider> - 42332.700000 ns
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool<os_provider> - 15330.800000 ns
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool<os_provider> - 75942.600000 ns
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool<os_provider> - 25425.600000 ns

Details

Benchmark details - environment, command, output...
graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=10 --withGraphs=0

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=10 withGraphs=0),71758.575,71750.474,0.04%,71732.653,71951.131,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=10 --withGraphs=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=10 withGraphs=1),72598.934,72598.586,0.01%,72581.222,72637.201,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:50

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=50 --withGraphs=0

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=50 withGraphs=0),196911.491,196881.249,0.07%,196833.127,197536.656,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:50

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=50 --withGraphs=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=50 withGraphs=1),197217.485,197215.709,0.01%,197192.795,197272.519,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:200

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=200 --withGraphs=0

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=200 withGraphs=0),666590.683,666400.501,0.09%,666124.440,668706.856,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:200

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=200 --withGraphs=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=200 withGraphs=1),754455.729,733155.088,6.30%,699634.757,890305.977,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:0, numKernels:20

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=0 --ioq=0 --numKernels=20

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=20 ioq=0),11351.586,11444.061,5.22%,8119.365,11818.515,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:1, numKernels:20

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=1 --ioq=0 --numKernels=20

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=20 ioq=0),100.788,98.031,8.03%,94.617,142.010,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:0, numKernels:20

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=0 --ioq=1 --numKernels=20

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=20 ioq=1),11445.321,11444.025,1.19%,11144.824,11786.854,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:1, numKernels:20

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=1 --ioq=1 --numKernels=20

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=20 ioq=1),116.751,113.459,8.27%,111.027,175.612,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:0, numKernels:200

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=0 --ioq=0 --numKernels=200

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=200 ioq=0),116834.608,116870.724,0.55%,115339.126,118249.466,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:1, numKernels:200

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=1 --ioq=0 --numKernels=200

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=200 ioq=0),1520.540,1513.660,1.39%,1483.155,1595.967,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:0, numKernels:200

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=0 --ioq=1 --numKernels=200

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=200 ioq=1),117218.852,117268.269,0.87%,110821.915,119044.318,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:1, numKernels:200

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=1 --ioq=1 --numKernels=200

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=200 ioq=1),1691.352,1690.024,1.20%,1637.617,1735.173,[CPU],[us]
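
The fourteen commands listed in these details form a small parameter matrix. As an illustration, the Python sketch below rebuilds the argument lists from the flags visible in the log (`--test`, `--csv`, `--noHeaders`, `--iterations`, `--numKernels`, `--withGraphs`, `--measureSubmit`, `--ioq`). The function names and the way scenarios are enumerated are assumptions made for this example; the PR's benchmark classes may be organized differently.

```python
from itertools import product

COMMON = ["--csv", "--noHeaders"]


def sin_kernel_graph_args(num_kernels: int, with_graphs: bool,
                          iterations: int = 100) -> list[str]:
    """Arguments for one SinKernelGraph scenario, in the order seen in the log."""
    return [
        "--test=SinKernelGraph",
        *COMMON,
        f"--iterations={iterations}",
        f"--numKernels={num_kernels}",
        f"--withGraphs={int(with_graphs)}",
    ]


def submit_exec_graph_args(num_kernels: int, ioq: bool, measure_submit: bool,
                           iterations: int = 100) -> list[str]:
    """Arguments for one SubmitExecGraph scenario, in the order seen in the log."""
    return [
        "--test=SubmitExecGraph",
        *COMMON,
        f"--iterations={iterations}",
        f"--measureSubmit={int(measure_submit)}",
        f"--ioq={int(ioq)}",
        f"--numKernels={num_kernels}",
    ]


def all_graph_scenarios() -> list[list[str]]:
    """Enumerate the scenario matrix that appears in this run."""
    scenarios = []
    for n, graphs in product((10, 50, 200), (False, True)):
        scenarios.append(sin_kernel_graph_args(n, graphs))
    for n, ioq, submit in product((20, 200), (False, True), (False, True)):
        scenarios.append(submit_exec_graph_args(n, ioq, submit))
    return scenarios


if __name__ == "__main__":
    for args in all_graph_scenarios():
        print("graph_api_benchmark_sycl", " ".join(args))
```

Keeping the matrix in one place makes it easy to add further kernel counts, such as the single-kernel case raised in review.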

@github-actions
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12867191102

@github-actions
Contributor

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/12867191102
Job status: success. Test status: success.

Summary

Total 128 benchmarks in mean.
Geomean 100.080%.
Improved 16 Regressed 13 (threshold 2.00%)
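
The per-row and per-group percentages in this report appear consistent with a simple calculation: for lower-is-better metrics (μs, instr) the relative perf is baseline divided by the PR value, and the group figure is the geometric mean of its rows. The Python sketch below reproduces the api group of this run under that assumption. It is not the report script's actual code, and `relative_perf` and `geomean` are hypothetical helper names.

```python
import math


def relative_perf(this_pr: float, baseline: float,
                  lower_is_better: bool = True) -> float:
    """Relative performance in percent; above 100 means the PR is better."""
    if lower_is_better:
        return baseline / this_pr * 100.0
    return this_pr / baseline * 100.0


def geomean(values: list[float]) -> float:
    """Geometric mean, computed through logarithms for numerical stability."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# (this PR, baseline) pairs for the 12 rows of the api group in this run.
API_ROWS = [
    (11.686, 11.848), (16.647, 16.859), (1.690, 1.702), (23.555, 23.710),
    (105463.0, 105463.0), (110815.0, 110815.0), (123991.0, 123991.0),
    (21.513, 21.425), (24.997, 24.891), (11.827, 11.745),
    (2.177, 2.143), (16.083, 15.623),
]

if __name__ == "__main__":
    rows = [relative_perf(pr, base) for pr, base in API_ROWS]
    for (pr, base), rel in zip(API_ROWS, rows):
        print(f"{pr:>12} vs {base:>12}: {rel:.2f}% ({rel - 100.0:+.2f}%)")
    print(f"group geomean: {geomean(rows):.3f}%")
```

For higher-is-better metrics such as GB/s, pass `lower_is_better=False`; the StreamMemory row of this run (3.066 GB/s against 3.043 GB/s) then yields the reported 100.76%.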

(result is better)

Performance change in benchmark groups

Relative perf in group api (12): 99.833%

| Benchmark | This PR | baseline | Relative perf | Change | - |
|---|---|---|---|---|---|
| api_overhead_benchmark_l0 SubmitKernel out of order | 11.686000 μs | 11.848 μs | 101.39% | 1.39% | . |
| api_overhead_benchmark_ur SubmitKernel in order | 16.647000 μs | 16.859 μs | 101.27% | 1.27% | . |
| api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 | 1.690000 μs | 1.702 μs | 100.71% | 0.71% | . |
| api_overhead_benchmark_sycl SubmitKernel out of order | 23.555000 μs | 23.710 μs | 100.66% | 0.66% | . |
| api_overhead_benchmark_ur SubmitKernel out of order CPU count | 105463.000000 instr | 105463.000 instr | 100.00% | 0.00% | . |
| api_overhead_benchmark_ur SubmitKernel in order CPU count | 110815.000000 instr | 110815.000 instr | 100.00% | 0.00% | . |
| api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count | 123991.000000 instr | 123991.000 instr | 100.00% | 0.00% | . |
| api_overhead_benchmark_ur SubmitKernel in order with measure completion | 21.513 μs | 21.425000 μs | 99.59% | -0.41% | . |
| api_overhead_benchmark_sycl SubmitKernel in order | 24.997 μs | 24.891000 μs | 99.58% | -0.42% | . |
| api_overhead_benchmark_l0 SubmitKernel in order | 11.827 μs | 11.745000 μs | 99.31% | -0.69% | . |
| api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 | 2.177 μs | 2.143000 μs | 98.44% | -1.56% | . |
| api_overhead_benchmark_ur SubmitKernel out of order | 16.083 μs | 15.623000 μs | 97.14% | -2.86% | -- |
Relative perf in group memory (4): 100.317%
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 3.066000 GB/s 3.043 GB/s 100.76% 0.76% .
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 5.823000 μs 5.865 μs 100.72% 0.72% .
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 219.341000 μs 219.808 μs 100.21% 0.21% .
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 255.934 μs 254.865000 μs 99.58% -0.42% .
Relative perf in group miscellaneous (1): 107.050%
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum 804.534000 bw GB/s 861.253 bw GB/s 107.05% 7.05% +++++
Relative perf in group multithread (10): 100.223%
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 26703.389000 μs 27237.512 μs 102.00% 2.00% ++
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 8862.915000 μs 9035.852 μs 101.95% 1.95% .
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 7806.548000 μs 7868.958 μs 100.80% 0.80% .
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 2063.263000 μs 2073.904 μs 100.52% 0.52% .
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 17033.193 μs 17007.721000 μs 99.85% -0.15% .
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events 43030.247 μs 42860.412000 μs 99.61% -0.39% .
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events 113819.124 μs 113343.613000 μs 99.58% -0.42% .
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 6961.545 μs 6931.139000 μs 99.56% -0.44% .
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 47670.697 μs 47383.460000 μs 99.40% -0.60% .
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 1206.371 μs 1194.467000 μs 99.01% -0.99% .
Relative perf in group graph (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:10 71758.766000 μs -
graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:10 72524.970000 μs -
graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:100 353498.172000 μs -
graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:100 353215.904000 μs -
graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:1, numKernels:10 54.135000 μs -
graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:1, numKernels:10 61.707000 μs -
graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:1, numKernels:100 677.085000 μs -
graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:0, numKernels:10 5598.586000 μs -
graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:0, numKernels:10 5599.166000 μs -
graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:0, numKernels:100 56454.825000 μs -
Relative perf in group Velocity-Bench (9): 100.043%
Benchmark This PR baseline Relative perf Change -
Velocity-Bench Sobel Filter 598.763000 ms 621.173 ms 103.74% 3.74% +++
Velocity-Bench dl-cifar 23.931100 s 23.972 s 100.17% 0.17% .
Velocity-Bench svm 0.140100 s 0.140 s 100.00% 0.00% .
Velocity-Bench CudaSift 204.385 ms 204.342000 ms 99.98% -0.02% .
Velocity-Bench QuickSilver 117.190 MMS/CTT 117.450000 MMS/CTT 99.78% -0.22% .
Velocity-Bench dl-mnist 2.390 s 2.380000 s 99.58% -0.42% .
Velocity-Bench Easywave 291.000 ms 289.000000 ms 99.31% -0.69% .
Velocity-Bench Hashtable 352.898 M keys/sec 356.084148 M keys/sec 99.11% -0.89% .
Velocity-Bench Bitcracker 35.546 s 35.118800 s 98.80% -1.20% .
Relative perf in group Runtime (8): 98.949%
Benchmark This PR baseline Relative perf Change -
Runtime_DAGTaskThroughput_NDRangeParallelFor 1689.594000 ms 1694.375 ms 100.28% 0.28% .
Runtime_DAGTaskThroughput_SingleTask 1691.536 ms 1691.410000 ms 99.99% -0.01% .
Runtime_DAGTaskThroughput_HierarchicalParallelFor 1721.582 ms 1721.262000 ms 99.98% -0.02% .
Runtime_DAGTaskThroughput_BasicParallelFor 1760.146 ms 1756.502000 ms 99.79% -0.21% .
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 274.722 ms 272.505000 ms 99.19% -0.81% .
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 276.324 ms 273.484000 ms 98.97% -1.03% .
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 277.470 ms 271.662000 ms 97.91% -2.09% --
Runtime_IndependentDAGTaskThroughput_SingleTask 264.863 ms 253.100000 ms 95.56% -4.44% ---
Relative perf in group MicroBench (14): 101.666%
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous 4.842000 ms 5.188 ms 107.15% 7.15% ++++++
MicroBench_HostDeviceBandwidth_1D_D2H_Strided 4.837000 ms 5.113 ms 105.71% 5.71% ++++
MicroBench_HostDeviceBandwidth_1D_H2D_Strided 4.705000 ms 4.919 ms 104.55% 4.55% ++++
MicroBench_HostDeviceBandwidth_2D_H2D_Strided 5.024000 ms 5.115 ms 101.81% 1.81% .
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous 4.894000 ms 4.967 ms 101.49% 1.49% .
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous 4.713000 ms 4.769 ms 101.19% 1.19% .
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous 4.816000 ms 4.866 ms 101.04% 1.04% .
MicroBench_HostDeviceBandwidth_3D_H2D_Strided 5.102000 ms 5.140 ms 100.74% 0.74% .
MicroBench_HostDeviceBandwidth_2D_D2H_Strided 617.496000 ms 617.772 ms 100.04% 0.04% .
MicroBench_HostDeviceBandwidth_3D_D2H_Strided 617.551000 ms 617.628 ms 100.01% 0.01% .
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous 618.213000 ms 618.268 ms 100.01% 0.01% .
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous 618.194000 ms 618.226 ms 100.01% 0.01% .
MicroBench_LocalMem_fp32_4096 29.856000 ms 29.857 ms 100.00% 0.00% .
MicroBench_LocalMem_int32_4096 29.857 ms 29.834000 ms 99.92% -0.08% .
Relative perf in group Pattern (10): 100.120%
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_NDRange_int32 16.581000 ms 16.971 ms 102.35% 2.35% ++
Pattern_SegmentedReduction_Hierarchical_int64 11.772000 ms 11.777 ms 100.04% 0.04% .
Pattern_SegmentedReduction_Hierarchical_int16 11.803 ms 11.801000 ms 99.98% -0.02% .
Pattern_SegmentedReduction_Hierarchical_fp32 11.594 ms 11.588000 ms 99.95% -0.05% .
Pattern_SegmentedReduction_Hierarchical_int32 11.595 ms 11.587000 ms 99.93% -0.07% .
Pattern_SegmentedReduction_NDRange_int16 2.266 ms 2.263000 ms 99.87% -0.13% .
Pattern_SegmentedReduction_NDRange_int32 2.167 ms 2.164000 ms 99.86% -0.14% .
Pattern_SegmentedReduction_NDRange_fp32 2.166 ms 2.163000 ms 99.86% -0.14% .
Pattern_SegmentedReduction_NDRange_int64 2.340 ms 2.333000 ms 99.70% -0.30% .
Pattern_Reduction_Hierarchical_int32 17.078 ms 17.024000 ms 99.68% -0.32% .
Relative perf in group ScalarProduct (6): 99.984%
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_fp32 3.748000 ms 3.767 ms 100.51% 0.51% .
ScalarProduct_Hierarchical_fp32 10.142000 ms 10.174 ms 100.32% 0.32% .
ScalarProduct_Hierarchical_int32 10.542000 ms 10.555 ms 100.12% 0.12% .
ScalarProduct_Hierarchical_int64 11.500000 ms 11.508 ms 100.07% 0.07% .
ScalarProduct_NDRange_int64 5.457 ms 5.456000 ms 99.98% -0.02% .
ScalarProduct_NDRange_int32 3.775 ms 3.734000 ms 98.91% -1.09% .
Relative perf in group USM (7): 99.828%
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_device 0.067000 ms 0.068 ms 101.49% 1.49% .
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 1.708000 ms 1.717 ms 100.53% 0.53% .
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.082000 ms 1.085 ms 100.28% 0.28% .
USM_Allocation_latency_fp32_host 37.628000 ms 37.633 ms 100.01% 0.01% .
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 1.891 ms 1.889000 ms 99.89% -0.11% .
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.277 ms 1.256000 ms 98.36% -1.64% .
USM_Allocation_latency_fp32_shared 0.058 ms 0.057000 ms 98.28% -1.72% .
Relative perf in group VectorAddition (3): 96.468%
Benchmark This PR baseline Relative perf Change -
VectorAddition_int32 1.523 ms 1.510000 ms 99.15% -0.85% .
VectorAddition_fp32 1.499 ms 1.460000 ms 97.40% -2.60% --
VectorAddition_int64 3.298 ms 3.066000 ms 92.97% -7.03% -----
Relative perf in group Polybench (3): 100.469%
Benchmark This PR baseline Relative perf Change -
Polybench_Atax 6.690000 ms 6.855 ms 102.47% 2.47% ++
Polybench_3mm 1.738 ms 1.730000 ms 99.54% -0.46% .
Polybench_2mm 1.228 ms 1.221000 ms 99.43% -0.57% .
Relative perf in group Kmeans (1): 99.863%
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 16.113 ms 16.091000 ms 99.86% -0.14% .
Relative perf in group LinearRegressionCoeff (1): 99.105%
Benchmark This PR baseline Relative perf Change -
LinearRegressionCoeff_fp32 916.629 ms 908.423000 ms 99.10% -0.90% .
Relative perf in group MolecularDynamics (1): 100.000%
Benchmark This PR baseline Relative perf Change -
MolecularDynamics 0.030000 ms 0.030 ms 100.00% 0.00% .
Relative perf in group llama.cpp (6): 99.590%
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 512 428.682502 token/s 426.428 token/s 100.53% 0.53% .
llama.cpp Prompt Processing Batched 256 874.309970 token/s 872.220 token/s 100.24% 0.24% .
llama.cpp Text Generation Batched 512 62.538520 token/s 62.478 token/s 100.10% 0.10% .
llama.cpp Text Generation Batched 256 62.545637 token/s 62.525 token/s 100.03% 0.03% .
llama.cpp Text Generation Batched 128 62.534186 token/s 62.531 token/s 100.01% 0.01% .
llama.cpp Prompt Processing Batched 128 802.960 token/s 830.457525 token/s 96.69% -3.31% ---
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:4 (4): 96.175%
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider 2115.110000 ns 2120.000 ns 100.23% 0.23% .
alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool<os_provider> 3074.680 ns 3068.370000 ns 99.79% -0.21% .
alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool<os_provider> 288.322 ns 283.309000 ns 98.26% -1.74% .
alloc/size:10000/0/4096/iterations:200000/threads:4 glibc 2843.650 ns 2475.310000 ns 87.05% -12.95% ----------
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:1 (4): 100.875%
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider 189.975000 ns 197.281 ns 103.85% 3.85% +++
alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool<os_provider> 212.024000 ns 213.433 ns 100.66% 0.66% .
alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool<os_provider> 270.051 ns 268.948000 ns 99.59% -0.41% .
alloc/size:10000/0/4096/iterations:200000/threads:1 glibc 710.670 ns 706.837000 ns 99.46% -0.54% .
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:4 (4): 104.520%
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool<os_provider> 3341.340000 ns 3771.150 ns 112.86% 12.86% ++++++++++
alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider 1753.880000 ns 1854.120 ns 105.72% 5.72% ++++
alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc 1234.200000 ns 1259.770 ns 102.07% 2.07% ++
alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool<os_provider> 259.035 ns 253.839000 ns 97.99% -2.01% --
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:1 (4): 98.834%
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider 191.153000 ns 195.246 ns 102.14% 2.14% ++
alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool<os_provider> 206.892 ns 206.713000 ns 99.91% -0.09% .
alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc 750.391 ns 726.627000 ns 96.83% -3.17% --
alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool<os_provider> 319.262 ns 308.264000 ns 96.56% -3.44% ---
Relative perf in group alloc/min (4): 99.819%
Benchmark This PR baseline Relative perf Change -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool<os_provider> 958.483000 ns 975.381 ns 101.76% 1.76% .
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc 175.625000 ns 177.090 ns 100.83% 0.83% .
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc 815.477 ns 803.081000 ns 98.48% -1.52% .
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool<os_provider> 996.175 ns 978.697000 ns 98.25% -1.75% .
Relative perf in group multiple (12): 99.977%
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc 137101.000000 ns 141113.000 ns 102.93% 2.93% ++
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool<os_provider> 73947.800000 ns 75942.600 ns 102.70% 2.70% ++
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool<os_provider> 15022.500000 ns 15330.800 ns 102.05% 2.05% ++
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc 32925.600000 ns 33503.600 ns 101.76% 1.76% .
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool<os_provider> 41963.600000 ns 42332.700 ns 100.88% 0.88% .
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc 29996.200000 ns 30214.100 ns 100.73% 0.73% .
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool<os_provider> 25290.500000 ns 25425.600 ns 100.53% 0.53% .
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc 4275.150 ns 4251.600000 ns 99.45% -0.55% .
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool<os_provider> 166136.000 ns 165011.000000 ns 99.32% -0.68% .
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool<os_provider> 1199480.000 ns 1170470.000000 ns 97.58% -2.42% --
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider 149207.000 ns 145356.000000 ns 97.42% -2.58% --
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider 1216290.000 ns 1151930.000000 ns 94.71% -5.29% ----
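
The "Relative perf" column and the per-group figures above look consistent with a simple ratio followed by a geometric mean. The sketch below reproduces that arithmetic for the memory group. It is an illustration under assumptions, not the benchmark script's actual code: the function names and the `lower_is_better` flag are hypothetical, and the direction chosen for each unit (time: lower is better, GB/s: higher is better) is inferred from the rows in this comment.

```python
import math


def relative_perf(this_pr: float, baseline: float, lower_is_better: bool = True) -> float:
    """Return a ratio above 1.0 when this PR is better than the baseline."""
    if lower_is_better:
        return baseline / this_pr
    return this_pr / baseline


def geomean(values: list[float]) -> float:
    """Geometric mean, computed via logs to avoid overflow on long lists."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# (this PR, baseline, lower_is_better) copied from the memory group rows.
memory_group = [
    (3.066, 3.043, False),     # StreamMemory, GB/s
    (5.823, 5.865, True),      # QueueMemcpy Device to Device, us
    (219.341, 219.808, True),  # QueueInOrderMemcpy Host to Device, us
    (255.934, 254.865, True),  # QueueInOrderMemcpy Device to Device, us
]

ratios = [relative_perf(pr, base, lower) for pr, base, lower in memory_group]
group_geomean = geomean(ratios)
print(f"memory group geomean: {group_geomean * 100:.3f}%")
```

With these four rows the result lands close to the 100.317% reported for the memory group, which suggests the group figure is a geometric mean of the per-row ratios.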

Details

Benchmark details - environment, command, output...
api_overhead_benchmark_l0 SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.736,11.686,2.60%,11.044,22.751,[CPU],[us]
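
Each "Output" block in this section is one header line plus one or more CSV rows. The header names seven columns, but every row carries eight fields, so the sketch below treats the last field as a unit. That interpretation is an assumption, and `BenchRow` and `parse_row` are hypothetical names introduced only for illustration. The median parsed here (11.686) matches the "This PR" value shown for this benchmark in the summary table, so the summary appears to report the Median column.

```python
from dataclasses import dataclass


@dataclass
class BenchRow:
    test_case: str
    mean: float
    median: float
    stddev_pct: float
    minimum: float
    maximum: float
    kind: str  # e.g. "CPU" or "GPU"
    unit: str  # kept verbatim, e.g. "[us]" or "hw instructions [count]"


def parse_row(line: str) -> BenchRow:
    # Split from the right so that a comma inside the test-case name,
    # should one ever appear, cannot shift the numeric fields.
    name, mean, median, stddev, lo, hi, kind, unit = line.strip().rsplit(",", 7)
    return BenchRow(
        test_case=name,
        mean=float(mean),
        median=float(median),
        stddev_pct=float(stddev.rstrip("%")),
        minimum=float(lo),
        maximum=float(hi),
        kind=kind.strip("[]"),
        unit=unit,
    )


sample = (
    "SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 "
    "KernelExecTime=1 MeasureCompletion=0),11.736,11.686,2.60%,11.044,22.751,[CPU],[us]"
)
row = parse_row(sample)
print(row.median, row.unit)
```

Rows from the `api_overhead_benchmark_ur` runs, whose last field reads `hw instructions [count]` or `time [us]`, parse the same way because the unit is stored as-is.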

api_overhead_benchmark_l0 SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.875,11.827,2.51%,11.138,52.407,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),23.711,23.555,3.67%,22.783,248.055,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),25.149,24.997,3.81%,23.977,276.857,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Device --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Device destinationPlacement=Device size=1KB count=100),257.481,255.934,1.87%,251.672,559.542,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Host --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Host destinationPlacement=Device size=1KB count=100),221.876,219.341,5.41%,217.393,495.271,[CPU],[us]

memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueMemcpy --csv --noHeaders --iterations=10000 --sourcePlacement=Device --destinationPlacement=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueMemcpy(api=sycl sourcePlacement=Device destinationPlacement=Device size=1KB),6.190,5.823,18.65%,5.371,86.781,[CPU],[us]

memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=StreamMemory --csv --noHeaders --iterations=10000 --type=Triad --size=10240 --memoryPlacement=Device --useEvents=0 --contents=Zeros --multiplier=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
StreamMemory(api=sycl type=Triad size=10KB useEvents=0 contents=Zeros memoryPlacement=Device multiplier=1),3.009,3.066,6.47%,0.391,3.349,[CPU],[GB/s]

api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=0 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Device --dst=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Device dst=Device size=1KB ioq=0),2.185,2.177,4.58%,1.950,9.472,[CPU],[us]

api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=1 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Host --dst=Host --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Host dst=Host size=1KB ioq=1),1.702,1.690,12.23%,1.581,62.548,[CPU],[us]

miscellaneous_benchmark_sycl VectorSum

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/miscellaneous_benchmark_sycl --test=VectorSum --csv --noHeaders --iterations=1000 --numberOfElementsX=512 --numberOfElementsY=256 --numberOfElementsZ=256

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
VectorSum(api=sycl numberOfElementsX=512 numberOfElementsY=256 numberOfElementsZ=256),804.184,804.534,0.39%,768.188,814.955,[GPU],bw [GB/s]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),6982.009,6961.545,1.01%,6919.134,7163.248,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),17122.490,17033.193,2.81%,16506.742,17993.198,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),47610.015,47670.697,1.88%,44330.582,50714.375,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),2122.286,2063.263,26.70%,1554.760,17892.655,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),7832.878,7806.548,1.36%,7678.085,8012.571,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),9022.873,8862.915,4.24%,8568.164,9878.470,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),26858.437,26703.389,1.90%,25565.374,28478.006,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),1298.584,1206.371,48.36%,898.513,14846.891,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=1 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=1 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),43050.073,43030.247,0.58%,42744.063,43712.055,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=4 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=4 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),113932.980,113819.124,0.30%,113453.412,114696.042,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=10 --withGraphs=0

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=10 withGraphs=0),71768.456,71758.766,0.05%,71744.114,71950.760,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=10 --withGraphs=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=10 withGraphs=1),72524.180,72524.970,0.01%,72508.079,72548.247,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:0, numKernels:100

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=100 --withGraphs=0

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=100 withGraphs=0),353569.119,353498.172,0.08%,353374.564,354664.083,[CPU],[us]

graph_api_benchmark_sycl SinKernelGraph graphs:1, numKernels:100

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SinKernelGraph --csv --noHeaders --iterations=100 --numKernels=100 --withGraphs=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SinKernelGraph(api=sycl numKernels=100 withGraphs=1),353216.020,353215.904,0.02%,353077.465,353365.544,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:1, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=1 --ioq=0 --numKernels=10

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=10 ioq=0),55.614,54.135,10.84%,52.081,96.470,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:1, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=1 --ioq=1 --numKernels=10

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=10 ioq=1),63.415,61.707,10.13%,59.340,111.006,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:1, numKernels:100

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=1 --ioq=1 --numKernels=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=100 ioq=1),679.951,677.085,2.60%,657.384,754.589,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:0, submit:0, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=0 --ioq=0 --numKernels=10

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=10 ioq=0),5607.752,5598.586,0.98%,5505.458,5801.260,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:0, numKernels:10

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=0 --ioq=1 --numKernels=10

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=10 ioq=1),5544.580,5599.166,5.96%,3971.178,5765.667,[CPU],[us]

graph_api_benchmark_sycl SubmitExecGraph ioq:1, submit:0, numKernels:100

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/graph_api_benchmark_sycl --test=SubmitExecGraph --csv --noHeaders --iterations=100 --measureSubmit=0 --ioq=1 --numKernels=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=100 ioq=1),56459.023,56454.825,0.27%,56088.883,56816.253,[CPU],[us]
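
The SubmitExecGraph rows above are easier to compare per kernel. A minimal sketch, assuming the result rows are pasted in verbatim; `parse_rows` and `per_kernel_mean` are hypothetical helper names and not part of the benchmark scripts. Note that each data row has eight fields although the header names seven, because the unit follows `Type` as its own field.

```python
import csv
from io import StringIO

# Result rows copied from the ioq=1 runs above.
RAW = """\
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=10 ioq=1),63.415,61.707,10.13%,59.340,111.006,[CPU],[us]
SubmitExecGraph(api=sycl measureSubmit=1 numKernels=100 ioq=1),679.951,677.085,2.60%,657.384,754.589,[CPU],[us]
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=10 ioq=1),5544.580,5599.166,5.96%,3971.178,5765.667,[CPU],[us]
SubmitExecGraph(api=sycl measureSubmit=0 numKernels=100 ioq=1),56459.023,56454.825,0.27%,56088.883,56816.253,[CPU],[us]
"""

def parse_rows(text):
    """Parse CSV result rows into dictionaries (hypothetical helper)."""
    rows = []
    for fields in csv.reader(StringIO(text)):
        if len(fields) != 8:
            continue  # skip blank or unexpected lines
        name, mean, median, stddev, lo, hi, kind, unit = fields
        # Parameters sit between the parentheses as space-separated key=value pairs.
        inner = name[name.index("(") + 1 : name.rindex(")")]
        params = dict(item.split("=", 1) for item in inner.split())
        rows.append({
            "test": name[: name.index("(")],
            "params": params,
            "mean": float(mean),
            "median": float(median),
            "stddev_pct": float(stddev.rstrip("%")),
            "min": float(lo),
            "max": float(hi),
            "unit": unit.strip("[]"),
        })
    return rows

def per_kernel_mean(row):
    """Mean time divided by the numKernels parameter of the run."""
    return row["mean"] / int(row["params"]["numKernels"])

rows = parse_rows(RAW)
for row in rows:
    p = row["params"]
    print(f"measureSubmit={p['measureSubmit']} numKernels={p['numKernels']:>3}: "
          f"{per_kernel_mean(row):.3f} {row['unit']} per kernel")
```

On these numbers the per-kernel mean stays close between 10 and 100 kernels for both measureSubmit settings (about 6.3 vs 6.8 us, and about 554 vs 565 us), which suggests the measured time grows roughly linearly with graph size in this run.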

api_overhead_benchmark_ur SubmitKernel out of order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),105523.702,105463.000,6.01%,105355.000,2107734.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),15.971,15.705,340.71%,15.093,17222.495,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),105523.702,105463.000,6.01%,105355.000,2107734.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.321,16.083,281.19%,15.478,14527.047,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),110882.265,110815.000,3.98%,110815.000,1500622.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),17.085,16.854,271.31%,16.122,14673.663,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),110882.264,110815.000,3.98%,110815.000,1500622.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.874,16.647,263.42%,16.014,14071.962,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),124415.487,123991.000,3.81%,123364.000,1589304.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),22.126,21.505,359.60%,20.338,25180.849,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),124419.985,123991.000,3.80%,123392.000,1588990.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),22.029,21.513,229.06%,20.358,15975.313,[CPU],time [us]
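
Each api_overhead_benchmark_ur invocation above prints two rows for the same test case, one in hardware instructions and one in microseconds; the "CPU count" entries presumably report the first and the plain entries the second. A sketch of picking a row by its unit label, using the out-of-order output as sample input; `select_metric` is a hypothetical helper, not part of the benchmark scripts.

```python
import csv
from io import StringIO

# Output rows copied from the first "out of order" run above.
OUTPUT = """\
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),105523.702,105463.000,6.01%,105355.000,2107734.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),15.971,15.705,340.71%,15.093,17222.495,[CPU],time [us]
"""

def select_metric(output, unit_label):
    """Return (mean, median, min, max) of the first row whose unit matches."""
    for fields in csv.reader(StringIO(output)):
        if len(fields) == 8 and fields[7] == unit_label:
            mean, median, _stddev, lo, hi = fields[1:6]
            return float(mean), float(median), float(lo), float(hi)
    raise ValueError(f"no row with unit {unit_label!r}")

for label in ("hw instructions [count]", "time [us]"):
    mean, median, lo, hi = select_metric(OUTPUT, label)
    print(f"{label}: median={median}, "
          f"mean/median={mean / median:.4f}, max/median={hi / median:.1f}")
```

For the time rows the maximum is more than a thousand times the median while the mean is within 2% of it, so the very large StdDev values come from a few outlier iterations; the median is likely the more stable figure to compare between runs.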

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.380331 s
352.897570 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00379828 s
bitcracker - total time for whole calculation: 35.546 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1152 1267 31.2788% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1262 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1169 1269 31.7404% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1124 1260 30.5186% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1263 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1240 1275 33.6682% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1271 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1209 1263 32.8265% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1129 1264 30.6544% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1063 1269 28.8623% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1097 1257 29.7855% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1129 1269 30.6544% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1142 1276 31.0073% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1273 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1274 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1094 1260 29.704% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1257 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1264 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1088 1246 29.5411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1260 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1215 1250 32.9894% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1138 1261 30.8987% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1181 1283 32.0662% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1112 1265 30.1928% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1263 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1263 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1261 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1266 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1191 1258 32.3378% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1163 1268 31.5775% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1125 1265 30.5458% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1265 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1124 1263 30.5186% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1268 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1148 1268 31.1702% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1094 1263 29.704% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1098 1257 29.8127% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1112 1268 30.1928% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1189 1268 32.2835% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1118 1260 30.3557% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1273 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1096 1262 29.7583% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1245 1278 33.804% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1241 1276 33.6954% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1061 1266 28.808% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1264 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1060 1259 28.7809% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1261 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1193 1267 32.3921% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1132 1268 30.7358% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 204.385 ms
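
In each CudaSift iteration above, the printed percentage is consistent with the first matching-feature count divided by the first original-feature count (for example, 1152 / 3683 is about 31.2788%). A minimal sketch that checks this on three iterations copied from the log; the reading of the columns is inferred from the numbers, not taken from CudaSift documentation, and `match_ratios` is a hypothetical helper.

```python
import re

# Three iterations copied from the output above.
LOG = """\
Number of original features: 3683 3933
Number of matching features: 1152 1267 31.2788% 1 2
Number of original features: 3683 3933
Number of matching features: 1227 1262 33.3152% 1 2
Number of original features: 3683 3933
Number of matching features: 1094 1260 29.704% 1 2
"""

ORIGINAL = re.compile(r"Number of original features:\s+(\d+)\s+(\d+)")
MATCHING = re.compile(r"Number of matching features:\s+(\d+)\s+(\d+)\s+([\d.]+)%")

def match_ratios(log):
    """Yield (computed_pct, reported_pct) for each iteration in the log."""
    original = None
    for line in log.splitlines():
        if m := ORIGINAL.match(line):
            original = int(m.group(1))
        elif (m := MATCHING.match(line)) and original:
            # Assumption: percentage = first matching count / first original count.
            yield 100.0 * int(m.group(1)) / original, float(m.group(3))

for computed, reported in match_ratios(LOG):
    print(f"computed {computed:.4f}%  reported {reported}%")
```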

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.316810e-01 6.222860e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.639650e-01 7.642930e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.614890e-01 7.844610e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.684410e-01 8.416660e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.602160e-01 7.946330e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.603510e-01 7.694460e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.596770e-01 7.623760e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.331780e-01 7.835380e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.323780e-01 7.897290e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.325050e-01 7.748610e-01 0.000000e+00

Timer                    Cumulative  Cumulative  Cumulative  Cumulative  Cumulative  Cumulative
Name                     number      microSecs   microSecs   microSecs   microSecs   Efficiency
                         of calls    min         avg         max         stddev      Rating
main                     1           1.129e+07   1.129e+07   1.129e+07   0.000e+00   100.00
cycleInit                10          3.604e+06   3.604e+06   3.604e+06   0.000e+00   100.00
cycleTracking            10          7.687e+06   7.687e+06   7.687e+06   0.000e+00   100.00
cycleTracking_Kernel     104         4.919e+06   4.919e+06   4.919e+06   0.000e+00   100.00
cycleTracking_MPI        117         2.132e+05   2.132e+05   2.132e+05   0.000e+00   100.00
cycleTracking_Test_Done  0           0.000e+00   0.000e+00   0.000e+00   0.000e+00   0.00
cycleFinalize            20          4.290e+02   4.290e+02   4.290e+02   0.000e+00   100.00
Figure Of Merit 117.19 [Num Mega Segments / Cycle Tracking Time]
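
The Figure Of Merit can be reproduced from the per-cycle table: the `num_seg` column sums to 900,872,299 segments and the `cycleTracking` column sums to about 7.687, which matches the 7.687e+06 microSecs reported for the cycleTracking timer. A sketch under the assumption that the per-cycle `cycleTracking` values are in seconds and that the figure is mega-segments divided by that time; `figure_of_merit` is a hypothetical helper.

```python
HEADER = ("cycle start source rr split absorb scatter fission produce collisn "
          "escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize")

# Per-cycle rows copied from the QuickSilver output above.
ROWS = """\
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.316810e-01 6.222860e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.639650e-01 7.642930e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.614890e-01 7.844610e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.684410e-01 8.416660e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.602160e-01 7.946330e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.603510e-01 7.694460e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.596770e-01 7.623760e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.331780e-01 7.835380e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.323780e-01 7.897290e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.325050e-01 7.748610e-01 0.000000e+00
"""

def figure_of_merit(header, rows):
    """Mega-segments per second of cycle tracking (assumed formula)."""
    names = header.split()
    seg_col = names.index("num_seg")
    time_col = names.index("cycleTracking")
    segments = 0
    tracking_s = 0.0
    for line in rows.splitlines():
        cells = line.split()
        if len(cells) != len(names):
            continue  # skip anything that is not a full data row
        segments += int(cells[seg_col])
        tracking_s += float(cells[time_col])  # assumed to be seconds
    return (segments / 1e6) / tracking_s

print(f"{figure_of_merit(HEADER, ROWS):.2f}")
```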

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.49288 s
sobelfilter - total time for whole calculation: 0.598763 s
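
Several Velocity-Bench workloads above end with a `<name> - total time for whole calculation: <seconds> s` line, sometimes preceded by `time to subtract from total`. A sketch of extracting both values, with `parse_velocity_times` as a hypothetical helper name. In the Sobel output the subtracted time (7.49 s) exceeds the printed total (0.60 s), so the total presumably already excludes it; that reading is an inference from the numbers, not something the log states.

```python
import re

# Tail of the Sobel Filter output above.
SOBEL_OUTPUT = """\
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.49288 s
sobelfilter - total time for whole calculation: 0.598763 s
"""

TOTAL = re.compile(
    r"^(\S+) - total time for whole calculation:\s*([\d.eE+-]+) s$", re.M)
SUBTRACT = re.compile(
    r"^time to subtract from total:\s*([\d.eE+-]+) s$", re.M)

def parse_velocity_times(output):
    """Extract the workload name, total time and optional subtracted time."""
    total = TOTAL.search(output)
    if total is None:
        raise ValueError("no total-time line found")
    subtract = SUBTRACT.search(output)
    return {
        "name": total.group(1),
        "total_s": float(total.group(2)),
        "subtracted_s": float(subtract.group(1)) if subtract else None,
    }

print(parse_velocity_times(SOBEL_OUTPUT))
```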

Velocity-Bench dl-cifar

Environment Variables:

Command:

/home/pmdk/bench_workdir/dl-cifar/dl-cifar_sycl

Output:

	Welcome to DL-CIFAR workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: DL NW size type: WORKLOAD_DEFAULT_SIZE
WL PARAMS: ==================================================
WL PARAMS:

dataFileReadTimer->getTotalOpTime(): 8.8e-05 s
dl-cifar - total time for whole calculation: 23.9311 s

Velocity-Bench dl-mnist

Environment Variables:

NEOReadDebugKeys=1
DisableScratchPages=0

Command:

/home/pmdk/bench_workdir/dl-mnist/dl-mnist-sycl -conv_algo ONEDNN_AUTO

Output:

	Welcome to DL-MNIST workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: Tensor management policy: per_layer
WL PARAMS: Convolution algorithm: ONEDNN_AUTO
WL PARAMS: Dataset reader format: NCHW
WL PARAMS: Dry run: YES
WL PARAMS: OneDNN Conv PD memory format: ONEDNN_CONVPD_ANY
WL PARAMS: No of iterations for inference: 400
WL PARAMS: ==================================================
WL PARAMS:

dl-mnist - total time for whole calculation: 2.39 s

Velocity-Bench svm

Environment Variables:

Command:

/home/pmdk/bench_workdir/svm/svm_sycl /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a9a /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a.m

Output:

Number of args 3
Using cuSVM (Carpenter)...

Buffering input text file (6989624 B).
Load Done
Starting Training
_C 1.000000
Workgroup Size: 1024
nbrCtas 80
elemsPerCta 1248
threadsPerCta 128
Total run time: 0.069228 seconds
Iter:100
M:97683
N:123
Train done. Calulate Vector counts
Training done

Loading elapsed time : 0.0636 s
Processing elapsed time : 0.0744 s
Storing elapsed time : 0.0021 s
Total elapsed time : 0.1401 s
Result's are correct: 0.0551

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.264934', '0.264863', '0.264761', '0.264761 0.264863 0.265177', '0.000217', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']
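
The sycl-bench result row is printed as a Python-style list without column names. Recomputing from the space-separated run times suggests that the three fields before them are the mean, median and minimum, and the field after them is the sample standard deviation. A sketch that checks this on the row above; the field positions are read off this log and `summarize_row` is a hypothetical helper, so treat both as assumptions.

```python
import ast
import statistics

# Row copied from the SingleTask output above.
ROW = ("['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', "
       "'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', "
       "'256', '32768', '0.264934', '0.264863', '0.264761', "
       "'0.264761 0.264863 0.265177', '0.000217', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', "
       "'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']")

def summarize_row(row_text):
    """Compare the reported statistics with values recomputed from the samples."""
    fields = ast.literal_eval(row_text)
    # Positions 11..15 as observed in this log (assumption, not a documented layout).
    samples = [float(x) for x in fields[14].split()]
    return {
        "name": fields[0],
        "reported": tuple(float(fields[i]) for i in (11, 12, 13, 15)),
        "recomputed": (
            statistics.mean(samples),
            statistics.median(samples),
            min(samples),
            statistics.stdev(samples),  # sample (n - 1) standard deviation
        ),
    }

summary = summarize_row(ROW)
for reported, recomputed in zip(summary["reported"], summary["recomputed"]):
    print(f"reported {reported:.6f}  recomputed {recomputed:.6f}")
```

With only three runs per benchmark (`--num-runs=3`), the standard deviation is a rough indicator at best.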

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.287811', '0.276324', '0.273280', '0.273280 0.276324 0.313828', '0.022583', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.280495', '0.277470', '0.272724', '0.272724 0.277470 0.291290', '0.009645', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.276602', '0.274722', '0.271699', '0.271699 0.274722 0.283384', '0.006065', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.693178', '1.691536', '1.689684', '1.689684 1.691536 1.698313', '0.004543', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.759587', '1.760146', '1.757688', '1.757688 1.760146 1.760926', '0.001690', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.722728', '1.721582', '1.721579', '1.721579 1.721582 1.725022', '0.001987', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.693497', '1.689594', '1.689335', '1.689335 1.689594 1.701561', '0.006985', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005851', '0.004842', '0.004717', '0.004717 0.004842 0.007995', '0.001858', '26.502484', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004901', '0.004894', '0.004788', '0.004788 0.004894 0.005021', '0.000117', '26.106999', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004755', '0.004713', '0.004708', '0.004708 0.004713 0.004844', '0.000077', '26.549977', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004820', '0.004816', '0.004812', '0.004812 0.004816 0.004832', '0.000011', '25.978058', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618200', '0.618194', '0.618193', '0.618193 0.618194 0.618214', '0.000012', '0.202202', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618215', '0.618213', '0.618182', '0.618182 0.618213 0.618249', '0.000034', '0.202206', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004746', '0.004705', '0.004503', '0.004503 0.004705 0.005030', '0.000266', '27.759025', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005072', '0.005024', '0.005005', '0.005005 0.005024 0.005186', '0.000099', '24.975769', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005098', '0.005102', '0.004995', '0.004995 0.005102 0.005198', '0.000101', '25.023647', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004922', '0.004837', '0.004833', '0.004833 0.004837 0.005097', '0.000151', '25.864543', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617523', '0.617496', '0.617495', '0.617495 0.617496 0.617578', '0.000048', '0.202431', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617549', '0.617551', '0.617529', '0.617529 0.617551 0.617569', '0.000020', '0.202420', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']
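
Each result row above is printed as a Python-style list of strings. The sketch below shows one possible way to pull the timing fields out of such a row. The column positions (mean at index 11, median at 12, minimum at 13, per-run samples at 14) are an assumption inferred from the values in this log, not taken from sycl-bench documentation, and `parse_row` is a hypothetical helper rather than part of the benchmark scripts.

```python
import ast
import statistics

# One row copied from the log above (3D_D2H_Strided).
ROW = (
    "['MicroBench_HostDeviceBandwidth_3D_D2H_Strided', 'PASS', "
    "'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', "
    "'256', '512', '0.617549', '0.617551', '0.617529', "
    "'0.617529 0.617551 0.617569', '0.000020', '0.202420', "
    "'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', "
    "'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']"
)


def parse_row(line: str) -> dict:
    """Parse one printed result row (hypothetical helper).

    The index-to-meaning mapping is an assumption inferred from the log.
    """
    fields = ast.literal_eval(line)  # safe evaluation of the list literal
    samples = [float(x) for x in fields[14].split()]
    return {
        "name": fields[0],
        "status": fields[1],
        "mean": float(fields[11]),
        "median": float(fields[12]),
        "min": float(fields[13]),
        "samples": samples,
    }


parsed = parse_row(ROW)
# Sanity-check the assumed column meanings against the raw samples.
consistent = (
    parsed["median"] == statistics.median(parsed["samples"])
    and parsed["min"] == min(parsed["samples"])
    and abs(parsed["mean"] - statistics.mean(parsed["samples"])) < 2e-6
)
print(parsed["name"], parsed["status"], "columns consistent:", consistent)
```

The consistency check is only there to make the assumed column layout visible; if the layout differs in another sycl-bench version, the check would fail rather than silently return wrong numbers.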

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_int32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029873', '0.029857', '0.029833', '0.029833 0.029857 0.029930', '0.000051', '10458.270345', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_fp32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029843', '0.029856', '0.029795', '0.029795 0.029856 0.029877', '0.000043', '10471.612214', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016672', '0.016581', '0.016487', '0.016487 0.016581 0.016947', '0.000243', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.017029', '0.017078', '0.016878', '0.016878 0.017078 0.017133', '0.000134', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003801', '0.003775', '0.003745', '0.003745 0.003775 0.003884', '0.000073', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.005477', '0.005457', '0.005420', '0.005420 0.005457 0.005553', '0.000069', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003747', '0.003748', '0.003736', '0.003736 0.003748 0.003757', '0.000011', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010538', '0.010542', '0.010530', '0.010530 0.010542 0.010543', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011490', '0.011500', '0.011461', '0.011461 0.011500 0.011509', '0.000025', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010136', '0.010142', '0.010125', '0.010125 0.010142 0.010143', '0.000010', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002273', '0.002266', '0.002263', '0.002263 0.002266 0.002291', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002169', '0.002167', '0.002166', '0.002166 0.002167 0.002174', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002344', '0.002340', '0.002340', '0.002340 0.002340 0.002353', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002168', '0.002166', '0.002161', '0.002161 0.002166 0.002179', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011805', '0.011803', '0.011795', '0.011795 0.011803 0.011816', '0.000010', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011598', '0.011595', '0.011588', '0.011588 0.011595 0.011612', '0.000012', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011778', '0.011772', '0.011759', '0.011759 0.011772 0.011802', '0.000022', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011596', '0.011594', '0.011583', '0.011583 0.011594 0.011611', '0.000014', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_device', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000062', '0.000067', '0.000047', '0.000047 0.000067 0.000072', '0.000013', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_host', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.037633', '0.037628', '0.037577', '0.037577 0.037628 0.037693', '0.000058', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_shared', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000061', '0.000058', '0.000055', '0.000055 0.000058 0.000070', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.002254', '0.001708', '0.001688', '0.001688 0.001708 0.003367', '0.000964', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001096', '0.001082', '0.001081', '0.001081 0.001082 0.001124', '0.000025', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001918', '0.001891', '0.001886', '0.001886 0.001891 0.001978', '0.000051', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001276', '0.001277', '0.001264', '0.001264 0.001277 0.001287', '0.000011', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001551', '0.001523', '0.001510', '0.001510 0.001523 0.001619', '0.000059', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003218', '0.003298', '0.003044', '0.003044 0.003298 0.003313', '0.000151', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001527', '0.001499', '0.001474', '0.001474 0.001499 0.001608', '0.000071', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_2mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/2mm.csv --size=512

Output:

['Polybench_2mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001226', '0.001228', '0.001217', '0.001217 0.001228 0.001234', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_3mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/3mm.csv --size=512

Output:

['Polybench_3mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001735', '0.001738', '0.001725', '0.001725 0.001738 0.001743', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_Atax

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Atax.csv --size=8192

Output:

['Polybench_Atax', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.006688', '0.006690', '0.006683', '0.006683 0.006690 0.006692', '0.000005', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Kmeans_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Kmeans.csv --size=700000000

Output:

['Kmeans_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '700000000', '0.016103', '0.016113', '0.016081', '0.016081 0.016113 0.016114', '0.000019', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

LinearRegressionCoeff_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/lin_reg_coeff --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LinearRegressionCoeff.csv --size=1638400000

Output:

['LinearRegressionCoeff_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1638400000', '0.916958', '0.916629', '0.916119', '0.916119 0.916629 0.918127', '0.001044', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MolecularDynamics

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/MolecularDynamics.csv --size=8196

Output:

['MolecularDynamics', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8196', '0.000038', '0.000030', '0.000026', '0.000026 0.000030 0.000059', '0.000018', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

llama.cpp Prompt Processing Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:15:19Z","638544037","27084814","802.960246","33.484216"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:15:25Z","2047974998","3665841","62.500923","0.111733"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:15:36Z","597965186","3206654","856.256888","4.603484"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:15:39Z","2046043435","2369055","62.559834","0.072320"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:15:50Z","1193466149","28538327","429.203422","10.502423"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:15:57Z","2042246061","1836970","62.676132","0.056311"
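
The llama-bench output above is plain CSV, with one row per (batch size, test kind) pair. The sketch below shows how the throughput column could be grouped by batch size. It uses only four of the columns from the header above (`n_batch`, `n_prompt`, `n_gen`, `avg_ts`) and the values from this run. Treating rows with `n_prompt > 0` as prompt processing and the others as text generation is an assumption about how the benchmark titles map to rows, and `throughput_by_batch` is a hypothetical helper.

```python
import csv
import io

# Column subset of the llama-bench CSV above; values copied from this run.
SAMPLE = "\n".join([
    "n_batch,n_prompt,n_gen,avg_ts",
    '"128","512","0","802.960246"',
    '"128","0","128","62.500923"',
    '"256","512","0","856.256888"',
    '"256","0","128","62.559834"',
    '"512","512","0","429.203422"',
    '"512","0","128","62.676132"',
])


def throughput_by_batch(csv_text: str) -> dict:
    """Map (kind, n_batch) to avg_ts, the average throughput column.

    Assumption: n_prompt > 0 marks a prompt-processing row,
    otherwise the row is a text-generation row.
    """
    result = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        kind = "prompt" if int(row["n_prompt"]) > 0 else "generation"
        result[(kind, int(row["n_batch"]))] = float(row["avg_ts"])
    return result


result = throughput_by_batch(SAMPLE)
for (kind, batch), ts in sorted(result.items()):
    print(f"{kind:<10} batch={batch:<4} avg_ts={ts}")
```

Because the same llama-bench command is run for every "Batched N" entry, a lookup keyed on `(kind, n_batch)` is enough to pick the row that a given benchmark title refers to, under the assumption stated above.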

llama.cpp Text Generation Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:13:36Z","615834646","1095602","831.394106","1.478850"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:13:43Z","2046886272","3876177","62.534186","0.118173"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:13:53Z","585367191","2068658","874.673403","3.090092"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:13:57Z","2078557770","72468334","61.638430","2.053265"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:14:07Z","1194374076","5036625","428.682502","1.803741"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:14:14Z","2045966515","1614667","62.562150","0.049352"

llama.cpp Prompt Processing Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:20:56Z","619683237","1793082","826.234112","2.391187"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:21:01Z","2075037706","32885605","61.697897","0.967905"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:21:12Z","585617615","3075174","874.309970","4.601186"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:21:15Z","2202719807","16392964","58.112577","0.436424"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:21:26Z","1193796277","1997961","428.884859","0.718586"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:21:34Z","2110380209","36931028","60.667594","1.072679"

llama.cpp Text Generation Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:14:25Z","1193284066","1294039152","695.673058","307.313588"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:14:37Z","2046921235","3708838","62.533102","0.113044"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:14:47Z","589329038","931186","868.786347","1.372607"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:14:51Z","2046507748","2322485","62.545637","0.070877"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:15:01Z","1204367860","5269334","425.125789","1.858984"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:15:08Z","2046287284","2664779","62.552396","0.081336"

llama.cpp Prompt Processing Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:13:36Z","615834646","1095602","831.394106","1.478850"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:13:43Z","2046886272","3876177","62.534186","0.118173"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:13:53Z","585367191","2068658","874.673403","3.090092"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:13:57Z","2078557770","72468334","61.638430","2.053265"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:14:07Z","1194374076","5036625","428.682502","1.803741"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:14:14Z","2045966515","1614667","62.562150","0.049352"

llama.cpp Text Generation Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:23:25Z","636129397","20192813","805.490678","24.532410"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:23:32Z","2046918875","3264860","62.533138","0.099614"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:23:42Z","590691734","5606143","866.842324","8.160025"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:23:45Z","2047936485","1446730","62.501963","0.044141"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-20T12:23:56Z","1213046386","21137827","422.179603","7.300782"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-20T12:24:03Z","2046739340","1425855","62.538520","0.043559"

alloc/size:10000/0/4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

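Each entry in this log pairs a heading of the form `<test> <allocator>` with the full CSV emitted by `umf-benchmark --benchmark_format=csv`. The sketch below shows one way to read such a CSV and pick out the row that a heading refers to. It is only an illustration: `parse_benchmark_csv` and `find_row` are hypothetical helper names, and the rule for matching a heading to a row (allocator as the name prefix, test as the rest of the name) is an assumption inferred from the rows shown here, not the benchmark scripts' actual logic.

```python
import csv
import io

# Header line plus three rows copied from the CSV output in this log.
SAMPLE = '''name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
'''


def parse_benchmark_csv(text):
    """Map each benchmark name to its iteration count, timings, and unit."""
    results = {}
    for row in csv.DictReader(io.StringIO(text)):
        results[row["name"]] = {
            "iterations": int(row["iterations"]),
            "real_time": float(row["real_time"]),
            "cpu_time": float(row["cpu_time"]),
            "time_unit": row["time_unit"],
        }
    return results


def find_row(results, heading):
    """Return the row for a '<test> <allocator>' heading, or None.

    Assumption: the CSV name is '<allocator...>/<test>', where the first
    path component starts with the allocator named in the heading.
    """
    test, allocator = heading.rsplit(" ", 1)
    for name, row in results.items():
        prefix, _, rest = name.partition("/")
        if rest == test and prefix.startswith(allocator):
            return row
    return None


if __name__ == "__main__":
    parsed = parse_benchmark_csv(SAMPLE)
    heading = "alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider"
    print(find_row(parsed, heading))
```

Matching on the first path component keeps `os_provider` from also selecting `proxy_pool<os_provider>` rows, which a plain substring search would do.
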
alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,
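
Each entry in this log carries the full CSV from one `umf-benchmark --benchmark_format=csv` run, so the row that belongs to a given entry has to be picked out by name. The following is a minimal sketch of that lookup, assuming only the column layout visible in the header line above; `pick_row` is a hypothetical helper for illustration, not part of the benchmark scripts in this PR, and the two sample rows are copied from the output above.

```python
import csv
import io


def pick_row(csv_text: str, name: str) -> dict[str, str]:
    """Return the CSV row whose 'name' column equals `name` (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    for row in reader:
        if row["name"] == name:
            return row
    raise KeyError(name)


# Header plus two rows copied from the log above.
sample = (
    "name,iterations,real_time,cpu_time,time_unit,bytes_per_second,"
    "items_per_second,label,error_occurred,error_message\n"
    '"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",'
    "200000,319.262,319.256,ns,,,,,\n"
    '"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",'
    "200000,203.539,203.536,ns,,,,,\n"
)

row = pick_row(
    sample,
    "proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",
)
print(row["real_time"], row["time_unit"])
```

All values come back as strings, so a caller would convert `real_time` with `float()` before comparing runs.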

alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,
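
The CSV emitted by `--benchmark_format=csv` can be compared across allocators with a few lines of Python. The following is a minimal sketch, not part of the benchmark scripts in this PR: the helper names `parse_benchmark_csv` and `split_name` are hypothetical, the two sample rows are copied from the output above, and it assumes the first path component of each benchmark name is the allocator.

```python
import csv
import io

# Header plus two rows copied from the log above; in practice the whole
# CSV output would be read from a file or from the captured stdout.
SAMPLE = '''name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
'''


def parse_benchmark_csv(text):
    """Map each benchmark name to (real_time, cpu_time, time_unit)."""
    results = {}
    for row in csv.DictReader(io.StringIO(text)):
        results[row["name"]] = (
            float(row["real_time"]),
            float(row["cpu_time"]),
            row["time_unit"],
        )
    return results


def split_name(name):
    """Split 'allocator/case...' at the first slash (assumed naming scheme)."""
    allocator, _, case = name.partition("/")
    return allocator, case


results = parse_benchmark_csv(SAMPLE)

# Group real_time by benchmark case so allocators can be compared directly.
by_case = {}
for name, (real_time, _cpu_time, _unit) in results.items():
    allocator, case = split_name(name)
    by_case.setdefault(case, {})[allocator] = real_time

case = "multiple_malloc_free/size:10000/4096/iterations:2000/threads:1"
ratio = by_case[case]["scalable_pool<os_provider>"] / by_case[case]["glibc"]
print(f"scalable_pool vs glibc real_time ratio: {ratio:.2f}")
```

For the two rows shown, the single-threaded `scalable_pool<os_provider>` case takes roughly 3.5 times as long as `glibc`. The same grouping applies to the full output once all rows are loaded.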

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2809.56,1867.77,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,710.67,710.67,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1328.4,1245.24,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,750.391,750.36,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.477,785.279,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,191.696,191.694,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2070.23,2069.91,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.975,189.929,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1854.43,1853.95,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.153,191.147,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3074.68,3028.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.152,265.145,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3341.34,3293.26,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,321.676,321.67,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,299.579,298.149,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.024,212.02,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.894,272.577,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,206.892,206.888,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1066.76,1047.05,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,958.483,958.441,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32305,30842.4,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4246.42,4246.31,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,134460,86058.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30737.1,30736.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.1819e+06,1.18043e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,171721,171720,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15124e+06,1.15055e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,153318,153317,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41963.6,40507.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15172.3,15171.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,72423.1,72403.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25243.4,25238.7,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2903.72,1885.28,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,721.425,721.423,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.2,1183.98,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,757.62,757.616,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,800.265,769.131,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,174.853,174.819,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2115.11,2112.86,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.566,189.56,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1738.83,1738.81,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.447,187.441,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3138.07,3093.05,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.051,270.045,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3404.55,3355.52,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.196,298.189,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,281.482,278.07,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,209.434,209.378,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.035,258.185,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,210.006,210.004,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,996.175,985.662,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,954.72,954.707,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32925.6,30733.3,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4310.26,4310.13,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137101,87511.7,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29996.2,29995.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21345e+06,1.21277e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163007,163005,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21629e+06,1.21351e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,145603,145598,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42144.3,40994.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14467,14466.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73947.8,73929.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25654.3,25653.8,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1234.17,1176.85,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,729.563,729.565,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.735,765.849,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,175.625,175.624,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2205.17,2203.59,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,195.111,195.065,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1753.88,1753.24,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.296,191.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3016.85,2968.45,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.267,272.258,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3201.31,3152.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,319.262,319.256,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,252.841,249.68,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.539,203.536,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,973.518,953.704,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.323,975.257,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33531,30934.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4275.15,4274.97,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,143398,89269.9,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29288.5,29288.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19948e+06,1.19925e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,166136,166132,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.23512e+06,1.23445e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,149207,149205,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41503.8,41358.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15022.5,15022,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76237.9,76215.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25290.5,25289.9,ns,,,,,

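For anyone reading the raw `umf-benchmark` CSV above, here is a minimal sketch of one way to compare the `threads:4` and `threads:1` rows of the same benchmark. The `thread_scaling` helper and the `SAMPLE` constant are hypothetical names used only for illustration; they are not part of this PR or of the benchmark scripts. The sample rows are copied from the output above.

```python
import csv
import io

# Header plus four rows copied verbatim from the CSV output above.
SAMPLE = '''name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2843.65,1816.27,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,696.701,696.703,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,288.322,287.324,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.311,219.306,ns,,,,,
'''


def thread_scaling(csv_text):
    """Map each benchmark (name without the threads suffix) to
    real_time at 4 threads divided by real_time at 1 thread."""
    times = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        base, sep, threads = row["name"].rpartition("/threads:")
        if not sep:
            # Row name has no "/threads:N" suffix; skip it.
            continue
        times.setdefault(base, {})[int(threads)] = float(row["real_time"])
    # Keep only benchmarks that have both a 1-thread and a 4-thread row.
    return {
        base: by_threads[4] / by_threads[1]
        for base, by_threads in times.items()
        if 1 in by_threads and 4 in by_threads
    }


if __name__ == "__main__":
    for name, ratio in thread_scaling(SAMPLE).items():
        print(f"{name}: {ratio:.2f}x")
```

On the sample rows this gives a ratio of roughly 4.1 for `glibc` and roughly 1.3 for `scalable_pool<os_provider>`. It is only a reading aid for a single run, not a statement about benchmark stability.
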
@pbalcer pbalcer merged commit 64e8089 into oneapi-src:main Jan 20, 2025
10 of 71 checks passed
@mateuszpn mateuszpn deleted the add-graph-bench branch February 5, 2025 12:10
