one-api Intel Core i7-12700H testing with a Intel NUC12SNKi72 (SNADL357.0055.2022.0923.1555 BIOS) and Intel Arctm A770M DG2 16GB on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2305317-NE-ONEAPI56485 .
one-api Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel Core i7-12700H Intel Core i7-12700H @ 4.60GHz (14 Cores / 20 Threads) Intel NUC12SNKi72 (SNADL357.0055.2022.0923.1555 BIOS) Intel Alder Lake PCH 16GB 1024GB SAMSUNG MZVL21T0HCLR-00A00 Intel Arctm A770M DG2 16GB (1400MHz) Realtek ALC274 S27H85x Intel I225-LM + Intel Alder Lake-P PCH CNVi WiFi Ubuntu 22.04 5.19.0-42-generic (x86_64) GNOME Shell 42.5 X Server 1.21.1.3 + Wayland 4.6 Mesa 23.1.0-devel (git-722bcd7973) OpenCL 3.0 + OpenCL 3.0 1.3.238 GCC 11.3.0 ext4 2560x1440 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0x429 - Thermald 2.4.9 - Python 3.10.6 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
one-api embree: Pathtracer - Crown embree: Pathtracer ISPC - Crown embree: Pathtracer - Asian Dragon embree: Pathtracer - Asian Dragon Obj embree: Pathtracer ISPC - Asian Dragon embree: Pathtracer ISPC - Asian Dragon Obj oidn: RT.hdr_alb_nrm.3840x2160 oidn: RT.ldr_alb_nrm.3840x2160 oidn: RTLightmap.hdr.4096x4096 openvkl: vklBenchmark ISPC openvkl: vklBenchmark Scalar ospray: particle_volume/ao/real_time ospray: particle_volume/scivis/real_time ospray: particle_volume/pathtracer/real_time ospray: gravity_spheres_volume/dim_512/ao/real_time ospray: gravity_spheres_volume/dim_512/scivis/real_time ospray: gravity_spheres_volume/dim_512/pathtracer/real_time onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU ospray-studio: 1 - 4K - 1 - Path Tracer ospray-studio: 2 - 4K - 1 - Path Tracer ospray-studio: 3 - 4K - 1 - Path Tracer ospray-studio: 1 - 4K - 16 - Path Tracer ospray-studio: 1 - 4K - 32 - Path Tracer ospray-studio: 2 - 4K - 16 - Path Tracer ospray-studio: 2 - 4K - 32 - Path Tracer ospray-studio: 3 - 4K - 16 - Path Tracer ospray-studio: 3 - 4K - 32 - Path Tracer ospray-studio: 1 - 1080p - 1 - Path Tracer ospray-studio: 2 - 1080p - 1 - Path Tracer ospray-studio: 3 - 1080p - 1 - Path Tracer ospray-studio: 1 - 1080p - 16 - Path Tracer ospray-studio: 1 - 1080p - 32 - Path Tracer ospray-studio: 2 - 1080p - 16 - Path Tracer ospray-studio: 2 - 1080p - 32 - Path Tracer ospray-studio: 3 - 1080p - 16 - Path Tracer ospray-studio: 3 - 1080p - 32 - Path Tracer openvino: Face Detection FP16 - CPU openvino: Face Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP32 - CPU openvino: Person Detection FP32 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU oneapi-level-zero: Peak Integer Compute oneapi-level-zero: Device-To-Host Bandwidth oneapi-level-zero: Device-To-Host Bandwidth oneapi-level-zero: Host-To-Device Bandwidth oneapi-level-zero: Host-To-Device Bandwidth oneapi-level-zero: Peak Kernel Launch Latency oneapi-level-zero: Peak Half-Precision Compute oneapi-level-zero: Peak Single-Precision Compute oneapi-level-zero: Host-To-Device-To-Host Image Copy oneapi-level-zero: Peak Float16 Global Memory Bandwidth oneapi-level-zero: Peak System Memory Copy to Shared Memory Intel Core i7-12700H 12.5294 12.7506 14.3172 13.0284 15.4809 13.5652 0.25 0.25 0.13 166 95 4.85712 4.83244 143.910 2.42430 2.35171 3.40126 4.00860 11.0113 1.40640 2.31913 15.7012 9.24334 7.64000 15.3596 1.89285 3.21723 4363.03 2197.06 4393.36 2189.17 4391.73 2199.05 10442 10690 12550 173296 345018 177509 352951 207484 412408 2672 2738 3219 43014 88776 44021 90719 51537 105865 2.47 2390.66 1.59 3711.78 1.56 3755.55 174.77 34.25 8.87 671.50 457.24 13.07 269.83 74.07 27.96 214.27 916.73 21.80 362.98 16.47 8583.37 2.32 9383.71 2.13 3851.93 11.301711 23751.76 11.300157 23755.02 8.01910 16169.2 8099.49 10.4280 394.608 18.9603 OpenBenchmarking.org
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.0.1 Binary: Pathtracer - Model: Crown Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.03, N = 3 12.53 MIN: 11.98 / MAX: 13.09
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.0.1 Binary: Pathtracer ISPC - Model: Crown Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.03, N = 3 12.75 MIN: 12.22 / MAX: 13.36
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.0.1 Binary: Pathtracer - Model: Asian Dragon Intel Core i7-12700H 4 8 12 16 20 SE +/- 0.06, N = 3 14.32 MIN: 13.69 / MAX: 14.96
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.0.1 Binary: Pathtracer - Model: Asian Dragon Obj Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.06, N = 3 13.03 MIN: 12.54 / MAX: 13.56
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.0.1 Binary: Pathtracer ISPC - Model: Asian Dragon Intel Core i7-12700H 4 8 12 16 20 SE +/- 0.07, N = 3 15.48 MIN: 14.82 / MAX: 16.03
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.0.1 Binary: Pathtracer ISPC - Model: Asian Dragon Obj Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.03, N = 3 13.57 MIN: 13.11 / MAX: 13.99
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.hdr_alb_nrm.3840x2160 Intel Core i7-12700H 0.0563 0.1126 0.1689 0.2252 0.2815 SE +/- 0.00, N = 3 0.25
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.ldr_alb_nrm.3840x2160 Intel Core i7-12700H 0.0563 0.1126 0.1689 0.2252 0.2815 SE +/- 0.00, N = 3 0.25
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RTLightmap.hdr.4096x4096 Intel Core i7-12700H 0.0293 0.0586 0.0879 0.1172 0.1465 SE +/- 0.00, N = 3 0.13
OpenVKL Benchmark: vklBenchmark ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.3.1 Benchmark: vklBenchmark ISPC Intel Core i7-12700H 40 80 120 160 200 SE +/- 0.33, N = 3 166 MIN: 19 / MAX: 2411
OpenVKL Benchmark: vklBenchmark Scalar OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.3.1 Benchmark: vklBenchmark Scalar Intel Core i7-12700H 20 40 60 80 100 SE +/- 0.33, N = 3 95 MIN: 10 / MAX: 1846
OSPRay Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: particle_volume/ao/real_time Intel Core i7-12700H 1.0929 2.1858 3.2787 4.3716 5.4645 SE +/- 0.00142, N = 3 4.85712
OSPRay Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: particle_volume/scivis/real_time Intel Core i7-12700H 1.0873 2.1746 3.2619 4.3492 5.4365 SE +/- 0.00127, N = 3 4.83244
OSPRay Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: particle_volume/pathtracer/real_time Intel Core i7-12700H 30 60 90 120 150 SE +/- 0.05, N = 3 143.91
OSPRay Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/ao/real_time Intel Core i7-12700H 0.5455 1.091 1.6365 2.182 2.7275 SE +/- 0.00353, N = 3 2.42430
OSPRay Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time Intel Core i7-12700H 0.5291 1.0582 1.5873 2.1164 2.6455 SE +/- 0.00164, N = 3 2.35171
OSPRay Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time Intel Core i7-12700H 0.7653 1.5306 2.2959 3.0612 3.8265 SE +/- 0.00746, N = 3 3.40126
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Intel Core i7-12700H 0.9019 1.8038 2.7057 3.6076 4.5095 SE +/- 0.03295, N = 13 4.00860 MIN: 3.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.01, N = 3 11.01 MIN: 10.91 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Intel Core i7-12700H 0.3164 0.6328 0.9492 1.2656 1.582 SE +/- 0.00809, N = 3 1.40640 MIN: 1.31 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Intel Core i7-12700H 0.5218 1.0436 1.5654 2.0872 2.609 SE +/- 0.00399, N = 3 2.31913 MIN: 2.25 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Intel Core i7-12700H 4 8 12 16 20 SE +/- 0.01, N = 3 15.70 MIN: 15.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.08400, N = 3 9.24334 MIN: 4.89 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Intel Core i7-12700H 2 4 6 8 10 SE +/- 0.00121, N = 3 7.64000 MIN: 7.25 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Intel Core i7-12700H 4 8 12 16 20 SE +/- 0.01, N = 3 15.36 MIN: 15.06 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Intel Core i7-12700H 0.4259 0.8518 1.2777 1.7036 2.1295 SE +/- 0.00247, N = 3 1.89285 MIN: 1.72 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Intel Core i7-12700H 0.7239 1.4478 2.1717 2.8956 3.6195 SE +/- 0.00187, N = 3 3.21723 MIN: 3.01 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Intel Core i7-12700H 900 1800 2700 3600 4500 SE +/- 38.13, N = 3 4363.03 MIN: 4260.21 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Intel Core i7-12700H 500 1000 1500 2000 2500 SE +/- 3.14, N = 3 2197.06 MIN: 2145.62 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Intel Core i7-12700H 900 1800 2700 3600 4500 SE +/- 13.35, N = 3 4393.36 MIN: 4257.27 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Intel Core i7-12700H 500 1000 1500 2000 2500 SE +/- 9.16, N = 3 2189.17 MIN: 2143.1 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Intel Core i7-12700H 900 1800 2700 3600 4500 SE +/- 5.34, N = 3 4391.73 MIN: 4264.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.1 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Intel Core i7-12700H 500 1000 1500 2000 2500 SE +/- 3.36, N = 3 2199.05 MIN: 2149.08 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer Intel Core i7-12700H 2K 4K 6K 8K 10K SE +/- 17.75, N = 3 10442 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer Intel Core i7-12700H 2K 4K 6K 8K 10K SE +/- 38.96, N = 3 10690 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer Intel Core i7-12700H 3K 6K 9K 12K 15K SE +/- 31.47, N = 3 12550 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer Intel Core i7-12700H 40K 80K 120K 160K 200K SE +/- 309.91, N = 3 173296 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer Intel Core i7-12700H 70K 140K 210K 280K 350K SE +/- 233.35, N = 3 345018 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer Intel Core i7-12700H 40K 80K 120K 160K 200K SE +/- 131.21, N = 3 177509 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer Intel Core i7-12700H 80K 160K 240K 320K 400K SE +/- 156.54, N = 3 352951 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer Intel Core i7-12700H 40K 80K 120K 160K 200K SE +/- 235.62, N = 3 207484 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer Intel Core i7-12700H 90K 180K 270K 360K 450K SE +/- 117.06, N = 3 412408 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer Intel Core i7-12700H 600 1200 1800 2400 3000 SE +/- 5.29, N = 3 2672 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer Intel Core i7-12700H 600 1200 1800 2400 3000 SE +/- 4.91, N = 3 2738 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer Intel Core i7-12700H 700 1400 2100 2800 3500 SE +/- 16.95, N = 3 3219 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer Intel Core i7-12700H 9K 18K 27K 36K 45K SE +/- 83.76, N = 3 43014 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer Intel Core i7-12700H 20K 40K 60K 80K 100K SE +/- 265.79, N = 3 88776 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer Intel Core i7-12700H 9K 18K 27K 36K 45K SE +/- 41.70, N = 3 44021 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer Intel Core i7-12700H 20K 40K 60K 80K 100K SE +/- 21.85, N = 3 90719 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer Intel Core i7-12700H 11K 22K 33K 44K 55K SE +/- 41.14, N = 3 51537 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer Intel Core i7-12700H 20K 40K 60K 80K 100K SE +/- 208.27, N = 3 105865 1. (CXX) g++ options: -O3 -lm -ldl
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Face Detection FP16 - Device: CPU Intel Core i7-12700H 0.5558 1.1116 1.6674 2.2232 2.779 SE +/- 0.03, N = 3 2.47 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Face Detection FP16 - Device: CPU Intel Core i7-12700H 500 1000 1500 2000 2500 SE +/- 13.39, N = 3 2390.66 MIN: 1174.27 / MAX: 2909.86 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU Intel Core i7-12700H 0.3578 0.7156 1.0734 1.4312 1.789 SE +/- 0.01, N = 3 1.59 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU Intel Core i7-12700H 800 1600 2400 3200 4000 SE +/- 12.84, N = 3 3711.78 MIN: 2982.27 / MAX: 4576.61 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Person Detection FP32 - Device: CPU Intel Core i7-12700H 0.351 0.702 1.053 1.404 1.755 SE +/- 0.02, N = 3 1.56 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Person Detection FP32 - Device: CPU Intel Core i7-12700H 800 1600 2400 3200 4000 SE +/- 24.81, N = 3 3755.55 MIN: 2933.05 / MAX: 4638.91 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Vehicle Detection FP16 - Device: CPU Intel Core i7-12700H 40 80 120 160 200 SE +/- 0.43, N = 3 174.77 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Vehicle Detection FP16 - Device: CPU Intel Core i7-12700H 8 16 24 32 40 SE +/- 0.08, N = 3 34.25 MIN: 15.45 / MAX: 49.51 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU Intel Core i7-12700H 2 4 6 8 10 SE +/- 0.07, N = 3 8.87 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU Intel Core i7-12700H 140 280 420 560 700 SE +/- 3.52, N = 3 671.50 MIN: 512.96 / MAX: 1376.6 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Vehicle Detection FP16-INT8 - Device: CPU Intel Core i7-12700H 100 200 300 400 500 SE +/- 4.44, N = 3 457.24 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Vehicle Detection FP16-INT8 - Device: CPU Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.13, N = 3 13.07 MIN: 7.94 / MAX: 30.53 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16 - Device: CPU Intel Core i7-12700H 60 120 180 240 300 SE +/- 1.90, N = 3 269.83 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16 - Device: CPU Intel Core i7-12700H 16 32 48 64 80 SE +/- 0.52, N = 3 74.07 MIN: 45.14 / MAX: 113.31 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Machine Translation EN To DE FP16 - Device: CPU Intel Core i7-12700H 7 14 21 28 35 SE +/- 0.18, N = 3 27.96 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Machine Translation EN To DE FP16 - Device: CPU Intel Core i7-12700H 50 100 150 200 250 SE +/- 1.27, N = 3 214.27 MIN: 169.29 / MAX: 290.35 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16-INT8 - Device: CPU Intel Core i7-12700H 200 400 600 800 1000 SE +/- 7.44, N = 3 916.73 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16-INT8 - Device: CPU Intel Core i7-12700H 5 10 15 20 25 SE +/- 0.18, N = 3 21.80 MIN: 9.66 / MAX: 58.2 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Person Vehicle Bike Detection FP16 - Device: CPU Intel Core i7-12700H 80 160 240 320 400 SE +/- 1.40, N = 3 362.98 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Person Vehicle Bike Detection FP16 - Device: CPU Intel Core i7-12700H 4 8 12 16 20 SE +/- 0.06, N = 3 16.47 MIN: 8.54 / MAX: 28.41 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU Intel Core i7-12700H 2K 4K 6K 8K 10K SE +/- 71.73, N = 3 8583.37 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU Intel Core i7-12700H 0.522 1.044 1.566 2.088 2.61 SE +/- 0.02, N = 3 2.32 MIN: 0.98 / MAX: 17.46 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU Intel Core i7-12700H 2K 4K 6K 8K 10K SE +/- 58.56, N = 3 9383.71 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU Intel Core i7-12700H 0.4793 0.9586 1.4379 1.9172 2.3965 SE +/- 0.01, N = 3 2.13 MIN: 0.87 / MAX: 14.45 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
oneAPI Level Zero Tests Test: Peak Integer Compute OpenBenchmarking.org GFLOPS, More Is Better oneAPI Level Zero Tests Test: Peak Integer Compute Intel Core i7-12700H 800 1600 2400 3200 4000 SE +/- 7.60, N = 3 3851.93 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Device-To-Host Bandwidth OpenBenchmarking.org GB/s, More Is Better oneAPI Level Zero Tests Test: Device-To-Host Bandwidth Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.00, N = 3 11.30 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Device-To-Host Bandwidth OpenBenchmarking.org usec, Fewer Is Better oneAPI Level Zero Tests Test: Device-To-Host Bandwidth Intel Core i7-12700H 5K 10K 15K 20K 25K SE +/- 6.11, N = 3 23751.76 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Host-To-Device Bandwidth OpenBenchmarking.org GB/s, More Is Better oneAPI Level Zero Tests Test: Host-To-Device Bandwidth Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.00, N = 3 11.30 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Host-To-Device Bandwidth OpenBenchmarking.org usec, Fewer Is Better oneAPI Level Zero Tests Test: Host-To-Device Bandwidth Intel Core i7-12700H 5K 10K 15K 20K 25K SE +/- 6.67, N = 3 23755.02 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Peak Kernel Launch Latency OpenBenchmarking.org us, Fewer Is Better oneAPI Level Zero Tests Test: Peak Kernel Launch Latency Intel Core i7-12700H 2 4 6 8 10 SE +/- 0.02278, N = 3 8.01910 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Peak Half-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better oneAPI Level Zero Tests Test: Peak Half-Precision Compute Intel Core i7-12700H 3K 6K 9K 12K 15K SE +/- 44.10, N = 3 16169.2 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Peak Single-Precision Compute OpenBenchmarking.org GB/s, More Is Better oneAPI Level Zero Tests Test: Peak Single-Precision Compute Intel Core i7-12700H 2K 4K 6K 8K 10K SE +/- 0.03, N = 3 8099.49 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Host-To-Device-To-Host Image Copy OpenBenchmarking.org GB/s, More Is Better oneAPI Level Zero Tests Test: Host-To-Device-To-Host Image Copy Intel Core i7-12700H 3 6 9 12 15 SE +/- 0.03, N = 3 10.43 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Peak Float16 Global Memory Bandwidth OpenBenchmarking.org GB/s, More Is Better oneAPI Level Zero Tests Test: Peak Float16 Global Memory Bandwidth Intel Core i7-12700H 90 180 270 360 450 SE +/- 0.02, N = 3 394.61 1. (CXX) g++ options: -ldl
oneAPI Level Zero Tests Test: Peak System Memory Copy to Shared Memory OpenBenchmarking.org GB/s, More Is Better oneAPI Level Zero Tests Test: Peak System Memory Copy to Shared Memory Intel Core i7-12700H 5 10 15 20 25 SE +/- 0.03, N = 3 18.96 1. (CXX) g++ options: -ldl
Phoronix Test Suite v10.8.5