Tests for a future article. AMD EPYC 9124 16-Core testing with a Supermicro H13SSW (1.1 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.
a Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113eJava Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
b c Processor: 2 x AMD EPYC 9254 24-Core @ 2.90GHz (48 Cores / 96 Threads), Motherboard: Supermicro H13DSH (1.5 BIOS), Memory: 24 x 32 GB DDR5-4800MT/s Samsung M321R4GA3BB6-CQKET, Disk: 2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07, Graphics: astdrmfb
OS: AlmaLinux 9.2, Kernel: 5.14.0-284.25.1.el9_2.x86_64 (x86_64), Compiler: GCC 11.3.1 20221121, File-System: ext4, Screen Resolution: 1024x768
d Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Java Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
e f g Changed Processor to AMD EPYC 9124 16-Core @ 3.00GHz (16 Cores / 32 Threads) .
Changed Motherboard to Supermicro H13SSW (1.1 BIOS) .
Changed Memory to 12 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N .
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
Model: Bumper Beam
a: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
b: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
c: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
d: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
e: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
f: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
g: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
Model: Chrysler Neon 1M
a: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
b: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
c: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
d: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
e: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
f: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
g: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
Model: Cell Phone Drop Test
a: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
b: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
c: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
d: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
e: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
f: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
g: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
Model: Bird Strike on Windshield
a: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
b: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
c: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
d: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
e: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
f: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
g: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
Model: Rubber O-Ring Seal Installation
a: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
b: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
c: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
d: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
e: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
f: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
g: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
Model: INIVOL and Fluid Structure Interaction Drop Container
a: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
b: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
c: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
d: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
e: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
f: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
g: The test run did not produce a result. E: ./engine_linux64_gf_ompi: error while loading shared libraries: libmpi.so.40: cannot open shared object file: No such file or directory
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example c a b f g d e 7 14 21 28 35 16.24 16.35 16.79 30.73 30.75 30.76 30.85 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Mount St. Helens a b c d e f g 7 14 21 28 35 11.02 11.32 11.33 26.74 26.80 26.87 27.70 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Layered Halfspace a c b g e f d 16 32 48 64 80 26.89 27.49 28.65 69.96 70.19 70.54 71.61 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Tomographic Model c b a f d e g 7 14 21 28 35 12.04 12.10 12.31 26.97 27.33 27.46 27.75 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Homogeneous Halfspace b c a e g f d 8 16 24 32 40 14.46 14.81 15.11 35.03 35.38 35.54 35.57 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Water-layered Halfspace a c b f e d g 14 28 42 56 70 26.99 27.06 29.46 61.28 62.33 62.44 62.81 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi
nekRS nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org flops/rank, More Is Better nekRS 23.0 Input: Kershaw b a c g d e f 2000M 4000M 6000M 8000M 10000M 11240300000 11106900000 10826700000 10500600000 10318900000 10264000000 9976450000 1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
OpenBenchmarking.org flops/rank, More Is Better nekRS 23.0 Input: TurboPipe Periodic g f d e a b c 2000M 4000M 6000M 8000M 10000M 7964910000 7955790000 7934570000 7931010000 6767710000 6757360000 6754170000 1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer - Model: Crown c b a f g d e 12 24 36 48 60 55.40 55.39 54.90 21.59 21.58 21.48 21.44 MIN: 53.71 / MAX: 58.99 MIN: 54.02 / MAX: 57.64 MIN: 53.27 / MAX: 57.28 MIN: 21.45 / MAX: 21.84 MIN: 21.43 / MAX: 21.89 MIN: 21.32 / MAX: 21.8 MIN: 21.3 / MAX: 21.78
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Crown c b a g f d e 13 26 39 52 65 56.81 56.46 56.09 22.77 22.66 22.59 22.57 MIN: 55.27 / MAX: 59.91 MIN: 54.53 / MAX: 59.89 MIN: 54.05 / MAX: 59.82 MIN: 22.57 / MAX: 23.16 MIN: 22.45 / MAX: 22.99 MIN: 22.39 / MAX: 22.98 MIN: 22.39 / MAX: 22.93
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer - Model: Asian Dragon a b c g e f d 13 26 39 52 65 60.14 59.91 59.79 24.82 24.73 24.70 24.69 MIN: 58.97 / MAX: 62 MIN: 58.66 / MAX: 61.96 MIN: 58.46 / MAX: 62.03 MIN: 24.74 / MAX: 25 MIN: 24.67 / MAX: 24.86 MIN: 24.63 / MAX: 24.84 MIN: 24.62 / MAX: 24.84
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer - Model: Asian Dragon Obj b c a d g e f 12 24 36 48 60 53.81 53.69 53.57 22.26 22.19 22.16 22.15 MIN: 52.72 / MAX: 55.86 MIN: 52.63 / MAX: 55.24 MIN: 52.17 / MAX: 55.38 MIN: 22.18 / MAX: 22.42 MIN: 22.12 / MAX: 22.33 MIN: 22.08 / MAX: 22.35 MIN: 22.07 / MAX: 22.32
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon c a b g d f e 15 30 45 60 75 67.50 67.34 67.20 28.48 28.36 28.32 28.31 MIN: 65.64 / MAX: 71.17 MIN: 65.61 / MAX: 70.54 MIN: 65.48 / MAX: 70.41 MIN: 28.37 / MAX: 28.69 MIN: 28.26 / MAX: 28.59 MIN: 28.23 / MAX: 28.55 MIN: 28.21 / MAX: 28.56
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon Obj c b a e f g d 13 26 39 52 65 56.93 56.69 56.49 23.94 23.94 23.88 23.87 MIN: 55.56 / MAX: 59.67 MIN: 55.42 / MAX: 58.97 MIN: 55.29 / MAX: 58.38 MIN: 23.84 / MAX: 24.18 MIN: 23.84 / MAX: 24.16 MIN: 23.79 / MAX: 24.08 MIN: 23.78 / MAX: 24.08
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K a b c g f e d 1.1707 2.3414 3.5121 4.6828 5.8535 5.203 5.149 5.049 4.143 4.138 4.114 4.107 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K b a c g e f d 20 40 60 80 100 91.32 90.81 90.42 67.81 67.72 67.39 66.99 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 4K b a d c e f g 40 80 120 160 200 166.38 163.46 163.19 163.06 162.61 161.85 160.32 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 4K b a e d c g f 40 80 120 160 200 166.69 163.01 162.05 161.85 161.50 161.32 160.80 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 1080p c b a g e d f 3 6 9 12 15 12.62 12.59 12.48 11.02 10.98 10.91 10.74 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 1080p c a b e d f g 30 60 90 120 150 143.55 141.22 138.34 119.31 118.95 118.49 118.48 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 1080p g d e f c b a 110 220 330 440 550 528.53 526.22 525.17 521.52 431.90 427.69 422.99 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 1080p d e g f b c a 130 260 390 520 650 604.99 597.01 586.75 585.37 542.61 516.91 510.36 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time c a b g d f e 4 8 12 16 20 15.98720 15.98600 15.97850 5.57553 5.57469 5.57320 5.54107
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time b c a d g e f 4 8 12 16 20 15.98880 15.97780 15.95280 5.57001 5.56539 5.56353 5.55581
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time a c b d f g e 50 100 150 200 250 215.10 214.14 214.07 151.91 151.78 151.68 151.51
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time a b c g e f d 4 8 12 16 20 14.23690 14.17830 14.13990 5.62278 5.62040 5.61454 5.60747
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time a c b g e d f 4 8 12 16 20 13.87390 13.83170 13.76660 5.47725 5.46153 5.45329 5.45227
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time c b a g f d e 4 8 12 16 20 16.53500 16.43650 16.34680 6.60085 6.59563 6.58745 6.58270
Build: allmodconfig
a: The test quit with a non-zero exit status.
b: The test quit with a non-zero exit status.
c: The test quit with a non-zero exit status.
d: The test quit with a non-zero exit status.
e: The test quit with a non-zero exit status.
f: The test quit with a non-zero exit status.
g: The test quit with a non-zero exit status.
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 32 a b c e f g d 8M 16M 24M 32M 40M 39499000 39486000 39453000 35315000 35271000 35236000 35228000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 57 a b c f g e d 13M 26M 39M 52M 65M 59401000 59296000 57519000 52879000 52854000 52827000 52665000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 32 a b c f e g d 17M 34M 51M 68M 85M 77181000 77019000 76924000 68861000 68846000 68678000 67054000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 57 c a b f d e g 30M 60M 90M 120M 150M 118550000 117490000 114010000 105740000 105650000 105480000 104800000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 32 a b c e d f g 30M 60M 90M 120M 150M 153850000 153690000 153670000 138620000 138600000 138580000 138460000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 57 b a c e g f d 40M 80M 120M 160M 200M 196590000 196220000 194510000 191230000 190750000 189880000 188930000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 32 a c b d e g f 70M 140M 210M 280M 350M 307540000 306760000 305110000 278030000 277780000 277410000 276390000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 57 a c b d e g f 80M 160M 240M 320M 400M 369430000 366990000 366930000 363310000 357990000 357810000 350450000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 512 c b a d f e g 3M 6M 9M 12M 15M 14225000 14021000 13909000 12683000 12681000 12366000 12256000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 32 c b a d e f g 130M 260M 390M 520M 650M 603650000 602470000 594230000 545360000 545140000 545020000 543050000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 57 a f e b d g c 150M 300M 450M 600M 750M 699740000 693340000 692920000 692760000 689150000 682070000 674930000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 512 c a b e f d g 6M 12M 18M 24M 30M 28227000 27901000 27736000 25207000 25199000 24627000 22727000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 32 b c a g d e f 300M 600M 900M 1200M 1500M 1190300000 1184800000 1183500000 1047100000 1047100000 1046600000 1041900000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 57 c b a d g e f 300M 600M 900M 1200M 1500M 1254800000 1214200000 1192100000 1035000000 1033400000 1032000000 1024600000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 512 b c a e d f g 12M 24M 36M 48M 60M 55588000 55165000 52911000 50380000 50258000 49977000 49556000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 64 - Buffer Length: 256 - Filter Length: 32 b a c d e f g 500M 1000M 1500M 2000M 2500M 2212100000 2207700000 2206800000 1059500000 1057500000 1057100000 1056200000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 64 - Buffer Length: 256 - Filter Length: 57 c b a g e f d 400M 800M 1200M 1600M 2000M 2010300000 2001900000 1994400000 1099300000 1095400000 1094600000 1093300000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 512 a c b g d f e 20M 40M 60M 80M 100M 109870000 109140000 108080000 100170000 99594000 99441000 97005000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 96 - Buffer Length: 256 - Filter Length: 32 a c b g f d e 600M 1200M 1800M 2400M 3000M 3005800000 2999800000 2995400000 1065700000 1065300000 1065200000 1065100000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 96 - Buffer Length: 256 - Filter Length: 57 b c a d f g e 600M 1200M 1800M 2400M 3000M 2571100000 2564900000 2559800000 1120800000 1120500000 1118200000 1117800000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 512 b a c e g f d 50M 100M 150M 200M 250M 216150000 216080000 214910000 196040000 194670000 194500000 193850000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 512 b a c g d e f 90M 180M 270M 360M 450M 429620000 425810000 424400000 274070000 273760000 273480000 273390000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 64 - Buffer Length: 256 - Filter Length: 512 c a b f d e g 130M 260M 390M 520M 650M 622630000 622560000 610950000 283030000 282920000 281830000 281730000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 96 - Buffer Length: 256 - Filter Length: 512 b c a g d f e 150M 300M 450M 600M 750M 718140000 715030000 711640000 286530000 286250000 285920000 285880000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_point_select - Threads: 1 e f d c b a 1300 2600 3900 5200 6500 5976 5954 5898 4471 4405 4331
Test: oltp_point_select - Threads: 1
g: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_read_write - Threads: 128 b a f e g d 20K 40K 60K 80K 100K 89099 85757 60310 60145 59944 59727
Test: oltp_read_write - Threads: 128
c: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_index - Threads: 1 e f g d a c 300 600 900 1200 1500 1490 1483 1481 1479 1212 1189
Test: oltp_update_index - Threads: 1
b: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_point_select - Threads: 16 e f g b c 15K 30K 45K 60K 75K 70250 70105 69923 67515 65406
Test: oltp_point_select - Threads: 16
a: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
d: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_point_select - Threads: 32 b a d f e g 20K 40K 60K 80K 100K 106180 104627 98149 97368 96907 96840
Test: oltp_point_select - Threads: 32
c: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_point_select - Threads: 64 b a f e g d 30K 60K 90K 120K 150K 130802 127567 119092 118657 118549 115675
Test: oltp_point_select - Threads: 64
c: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_index - Threads: 16 f c g d e a 3K 6K 9K 12K 15K 12692 12681 12627 12622 12567 12558
Test: oltp_update_index - Threads: 16
b: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_index - Threads: 32 a b d c g e 4K 8K 12K 16K 20K 18361 17817 17612 17565 17135 17117
Test: oltp_update_index - Threads: 32
f: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_index - Threads: 64 b c e d f 5K 10K 15K 20K 25K 24371 23324 21271 21108 21067
Test: oltp_update_index - Threads: 64
a: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
g: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_point_select - Threads: 128 b a c f e d 30K 60K 90K 120K 150K 159728 159242 149962 130389 129904 129492
Test: oltp_point_select - Threads: 128
g: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_index - Threads: 128 b a c f e g 6K 12K 18K 24K 30K 27464 27087 26546 24830 24611 24574
Test: oltp_update_index - Threads: 128
d: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_non_index - Threads: 16 g d e a b 4K 8K 12K 16K 20K 18735 18563 18557 18095 18068
Test: oltp_update_non_index - Threads: 16
c: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
f: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_non_index - Threads: 32 b a f e d 6K 12K 18K 24K 30K 28914 28735 26695 26285 26273
Test: oltp_update_non_index - Threads: 32
c: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
g: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_non_index - Threads: 128 c a e g f 11K 22K 33K 44K 55K 52865 51105 42138 41695 41424
Test: oltp_update_non_index - Threads: 128
b: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
d: The test quit with a non-zero exit status. E: FATAL: Thread initialization failed!
Neural Magic DeepSparse OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream a b c f g d e 9 18 27 36 45 39.50 39.47 39.45 13.09 13.07 13.07 12.94
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream a b c d g f e 130 260 390 520 650 605.04 605.73 605.92 606.10 607.16 607.82 607.91
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream c a b e g f d 300 600 900 1200 1500 1418.90 1417.07 1403.07 511.41 509.14 508.21 508.09
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream a b c e f g d 150 300 450 600 750 672.46 672.37 671.26 257.89 257.50 257.28 257.27
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream c a b e d f g 40 80 120 160 200 201.54 201.39 201.25 71.27 71.14 71.04 70.93
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream e d f g a c b 30 60 90 120 150 112.06 112.25 112.41 112.48 118.75 118.78 118.95
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only a b c f e d g 40 80 120 160 200 66.42 66.64 66.72 181.70 182.56 182.99 183.29
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only c a b f g e d 140 280 420 560 700 254.72 254.88 255.30 667.87 669.09 670.64 670.87
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only c a b f e g d 50 100 150 200 250 80.41 80.54 80.76 223.95 224.10 224.12 224.15
OpenVINO OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16 - Device: CPU b c a g f e d 7 14 21 28 35 30.44 30.43 30.41 10.48 10.48 10.47 10.47 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16 - Device: CPU b c a g f e d 160 320 480 640 800 393.23 393.37 393.60 759.92 760.57 761.16 761.59 MIN: 360.87 / MAX: 433.13 MIN: 362.57 / MAX: 433.51 MIN: 363.29 / MAX: 431.61 MIN: 737.63 / MAX: 771.07 MIN: 741.4 / MAX: 770.88 MIN: 741.99 / MAX: 776.56 MIN: 738.34 / MAX: 772.36 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU b c a f e g d 60 120 180 240 300 284.22 282.67 282.55 107.39 107.27 107.04 107.02 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU b c a f e d g 20 40 60 80 100 42.20 42.43 42.44 74.43 74.50 74.71 74.71 MIN: 36.84 / MAX: 61.97 MIN: 36.31 / MAX: 62.36 MIN: 36.14 / MAX: 61.98 MIN: 65.68 / MAX: 83.49 MIN: 66.5 / MAX: 80.32 MIN: 66.12 / MAX: 81.09 MIN: 66.29 / MAX: 79.68 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP32 - Device: CPU b c a g e d f 60 120 180 240 300 284.99 284.31 283.97 107.24 107.24 106.90 106.76 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP32 - Device: CPU b c a e g d f 20 40 60 80 100 42.09 42.19 42.24 74.54 74.58 74.81 74.87 MIN: 37.13 / MAX: 58.71 MIN: 36.21 / MAX: 65.64 MIN: 36.59 / MAX: 61.56 MIN: 65.97 / MAX: 82.9 MIN: 67.63 / MAX: 78.73 MIN: 66.88 / MAX: 80.7 MIN: 66.72 / MAX: 80.96 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16 - Device: CPU a c b d g e f 400 800 1200 1600 2000 2033.17 2029.79 2028.01 797.64 793.90 793.75 791.74 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16 - Device: CPU a c b d e g f 3 6 9 12 15 5.89 5.90 5.91 10.01 10.06 10.06 10.09 MIN: 4.67 / MAX: 18.4 MIN: 4.83 / MAX: 13.4 MIN: 4.84 / MAX: 12.9 MIN: 5.7 / MAX: 19.52 MIN: 5.29 / MAX: 19.07 MIN: 5.2 / MAX: 19.38 MIN: 5.4 / MAX: 19.17 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU b c a g d f e 13 26 39 52 65 56.06 56.02 56.01 20.05 20.03 20.01 20.00 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU b c a g d e f 90 180 270 360 450 213.62 213.79 213.94 398.13 398.52 398.91 399.24 MIN: 197.2 / MAX: 235.23 MIN: 197.29 / MAX: 236.32 MIN: 201.64 / MAX: 242.71 MIN: 379.09 / MAX: 404.71 MIN: 382.1 / MAX: 404.98 MIN: 386.2 / MAX: 407.29 MIN: 387.9 / MAX: 408.93 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16 - Device: CPU a c b d e g f 1300 2600 3900 5200 6500 5882.91 5840.53 5836.27 2564.78 2562.54 2557.66 2539.97 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16 - Device: CPU a b c d e g f 0.7065 1.413 2.1195 2.826 3.5325 2.03 2.05 2.05 3.11 3.11 3.12 3.14 MIN: 1.66 / MAX: 7.51 MIN: 1.6 / MAX: 7 MIN: 1.62 / MAX: 6.96 MIN: 1.94 / MAX: 11.57 MIN: 1.93 / MAX: 9.72 MIN: 1.88 / MAX: 11.92 MIN: 1.93 / MAX: 11.65 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16 - Device: CPU c b a d f e g 160 320 480 640 800 757.38 750.49 748.44 344.67 343.49 342.81 341.36 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16 - Device: CPU c b a d f e g 6 12 18 24 30 15.83 15.98 16.02 23.20 23.28 23.32 23.42 MIN: 12.38 / MAX: 32.97 MIN: 12.74 / MAX: 33.34 MIN: 12.5 / MAX: 33.94 MIN: 15.1 / MAX: 31.6 MIN: 15.73 / MAX: 30.77 MIN: 19.49 / MAX: 30.99 MIN: 20.46 / MAX: 32.43 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU c b a f d g e 600 1200 1800 2400 3000 2881.14 2880.58 2873.24 1180.85 1175.67 1175.58 1174.60 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU b c a f d g e 2 4 6 8 10 4.16 4.16 4.17 6.76 6.79 6.79 6.80 MIN: 3.42 / MAX: 11.2 MIN: 3.43 / MAX: 10.26 MIN: 3.39 / MAX: 10.07 MIN: 4.04 / MAX: 15.47 MIN: 3.8 / MAX: 15.48 MIN: 3.79 / MAX: 15.41 MIN: 4.04 / MAX: 15.37 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16 - Device: CPU c b a e d g f 600 1200 1800 2400 3000 2987.33 2986.46 2945.26 1039.82 1039.61 1039.37 1038.47 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16 - Device: CPU d e g f b c a 4 8 12 16 20 15.36 15.36 15.37 15.38 16.02 16.02 16.26 MIN: 8.08 / MAX: 24.34 MIN: 8.02 / MAX: 23.81 MIN: 7.99 / MAX: 23.98 MIN: 7.99 / MAX: 24 MIN: 14.41 / MAX: 30.55 MIN: 14.63 / MAX: 33.79 MIN: 14.71 / MAX: 28.14 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU b c a f e d g 2K 4K 6K 8K 10K 9849.07 9845.27 9837.58 3548.78 3544.18 3540.88 3533.64 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU f d e g b a c 1.0935 2.187 3.2805 4.374 5.4675 4.50 4.51 4.51 4.52 4.85 4.86 4.86 MIN: 2.98 / MAX: 13.86 MIN: 2.98 / MAX: 13.05 MIN: 2.96 / MAX: 16.06 MIN: 2.77 / MAX: 13.57 MIN: 4.25 / MAX: 12.86 MIN: 4.23 / MAX: 12.81 MIN: 4.34 / MAX: 12.27 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU b c a e g d f 200 400 600 800 1000 854.51 849.30 842.91 373.64 372.26 370.57 369.26 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU b c a e g d f 5 10 15 20 25 14.03 14.12 14.23 21.40 21.47 21.57 21.65 MIN: 11.59 / MAX: 26.04 MIN: 11.51 / MAX: 26.04 MIN: 11.51 / MAX: 25.86 MIN: 19.07 / MAX: 25.3 MIN: 17.62 / MAX: 28.13 MIN: 19.5 / MAX: 24.76 MIN: 19.48 / MAX: 24.27 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU c b a f d e g 70 140 210 280 350 317.33 317.28 317.22 124.30 124.12 123.61 123.41 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU b c a f d e g 14 28 42 56 70 37.79 37.79 37.80 64.31 64.41 64.68 64.77 MIN: 32.97 / MAX: 53.7 MIN: 33.29 / MAX: 54.88 MIN: 33.35 / MAX: 56.45 MIN: 50.85 / MAX: 70.77 MIN: 37.44 / MAX: 73.04 MIN: 38.02 / MAX: 72.52 MIN: 55.8 / MAX: 69.46 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU c b a d e g f 1200 2400 3600 4800 6000 5802.65 5780.44 5776.94 2013.77 2007.53 2006.09 2004.76 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU d e g f c b a 2 4 6 8 10 7.93 7.96 7.96 7.97 8.24 8.27 8.28 MIN: 4.2 / MAX: 16.92 MIN: 4.19 / MAX: 16.59 MIN: 4.19 / MAX: 14.2 MIN: 4.37 / MAX: 16.86 MIN: 7.62 / MAX: 23.32 MIN: 7.37 / MAX: 25.18 MIN: 7.44 / MAX: 23.35 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU c a b f d g e 500 1000 1500 2000 2500 2455.51 2454.09 2450.26 1041.87 1036.99 1031.60 1028.64 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU a c b f d g e 2 4 6 8 10 4.88 4.88 4.89 7.67 7.70 7.74 7.77 MIN: 3.95 / MAX: 16.05 MIN: 3.9 / MAX: 14.94 MIN: 3.93 / MAX: 13.44 MIN: 5.32 / MAX: 16.6 MIN: 5.51 / MAX: 16.06 MIN: 6.06 / MAX: 12.66 MIN: 5.42 / MAX: 16.35 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16 - Device: CPU a c b g f d e 300 600 900 1200 1500 1560.03 1551.63 1546.02 538.01 533.74 532.59 530.99 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16 - Device: CPU g f d e a c b 7 14 21 28 35 29.72 29.95 30.02 30.10 30.72 30.89 31.00 MIN: 19.46 / MAX: 38.99 MIN: 19.01 / MAX: 38.08 MIN: 18.78 / MAX: 38.72 MIN: 22.61 / MAX: 39.15 MIN: 29.51 / MAX: 35.07 MIN: 29.48 / MAX: 36.29 MIN: 29.59 / MAX: 36.33 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU b a c e g d f 20K 40K 60K 80K 100K 87359.23 86884.64 86789.80 32032.06 32008.03 32002.62 31951.64 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU d e f g a b c 0.1215 0.243 0.3645 0.486 0.6075 0.49 0.49 0.49 0.49 0.54 0.54 0.54 MIN: 0.3 / MAX: 9.28 MIN: 0.3 / MAX: 9.07 MIN: 0.3 / MAX: 8.2 MIN: 0.3 / MAX: 8.84 MIN: 0.45 / MAX: 7.64 MIN: 0.45 / MAX: 7.81 MIN: 0.45 / MAX: 5.03 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU a b c e g f d 300 600 900 1200 1500 1244.69 1239.67 1237.29 432.32 432.20 431.94 395.66 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU e g f a b c d 9 18 27 36 45 36.98 36.98 37.01 38.50 38.66 38.75 40.40 MIN: 32.02 / MAX: 44.78 MIN: 32.61 / MAX: 41.91 MIN: 32.25 / MAX: 43.6 MIN: 36.77 / MAX: 44.23 MIN: 37.22 / MAX: 43.52 MIN: 37.46 / MAX: 43.52 MIN: 26.93 / MAX: 74.83 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU c b a g f d e 30K 60K 90K 120K 150K 123484.28 120728.22 120606.38 45097.99 44968.43 44958.07 44933.27 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU a b c d e f g 0.0788 0.1576 0.2364 0.3152 0.394 0.34 0.34 0.34 0.35 0.35 0.35 0.35 MIN: 0.29 / MAX: 7.33 MIN: 0.29 / MAX: 10.87 MIN: 0.29 / MAX: 7.09 MIN: 0.23 / MAX: 9.09 MIN: 0.23 / MAX: 8.84 MIN: 0.23 / MAX: 9.15 MIN: 0.23 / MAX: 8.63 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie -ldl
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: File Status - Threads: 50 - Files: 1000000 a g b d f e c 500K 1000K 1500K 2000K 2500K 2173913 2036660 1941748 1818182 1795332 320924 284252
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: File Status - Threads: 100 - Files: 1000000 g f c a d e b 400K 800K 1200K 1600K 2000K 2049180 1964637 1893939 1886792 600601 235627 161970
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.6 d g f e 50M 100M 150M 200M 250M 240994500 237175700 236591000 236243900 1. (CXX) g++ options: -O3 -fopenmp -ldl
a: The test quit with a non-zero exit status.
b: The test quit with a non-zero exit status.
c: The test quit with a non-zero exit status.
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.36 VGR Performance Metric a b c d e f g 170K 340K 510K 680K 850K 772162 768517 762529 298064 296125 295603 295522 1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
easyWave The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 g e d f 0.3728 0.7456 1.1184 1.4912 1.864 1.648 1.654 1.657 1.657 1. (CXX) g++ options: -O3 -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 g f e d 9 18 27 36 45 37.95 38.02 38.07 38.11 1. (CXX) g++ options: -O3 -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 g f d e 20 40 60 80 100 97.53 97.99 98.98 99.42 1. (CXX) g++ options: -O3 -fopenmp
Embree OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Asian Dragon g f d e 6 12 18 24 30 24.96 24.89 24.85 24.83 MIN: 24.9 / MAX: 25.13 MIN: 24.81 / MAX: 25.06 MIN: 24.78 / MAX: 25 MIN: 24.76 / MAX: 24.96
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Asian Dragon Obj d e f g 5 10 15 20 25 22.35 22.29 22.27 22.26 MIN: 22.28 / MAX: 22.5 MIN: 22.22 / MAX: 22.46 MIN: 22.2 / MAX: 22.44 MIN: 22.18 / MAX: 22.43
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Crown e d g f 5 10 15 20 25 21.99 21.89 21.83 21.77 MIN: 21.84 / MAX: 22.32 MIN: 21.74 / MAX: 22.23 MIN: 21.69 / MAX: 22.17 MIN: 21.63 / MAX: 22.18
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon g e f d 7 14 21 28 35 27.91 27.83 27.83 27.74 MIN: 27.81 / MAX: 28.17 MIN: 27.72 / MAX: 28.1 MIN: 27.73 / MAX: 28.13 MIN: 27.64 / MAX: 27.98
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon Obj g e f d 6 12 18 24 30 23.71 23.53 23.50 23.35 MIN: 23.61 / MAX: 23.93 MIN: 23.43 / MAX: 23.73 MIN: 23.4 / MAX: 23.74 MIN: 23.26 / MAX: 23.57
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown f g d e 5 10 15 20 25 22.44 22.42 22.39 22.34 MIN: 22.25 / MAX: 22.78 MIN: 22.22 / MAX: 22.85 MIN: 22.2 / MAX: 22.85 MIN: 22.15 / MAX: 22.75
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU ISPC g f e d 110 220 330 440 550 489 488 487 487 MIN: 36 / MAX: 6969 MIN: 36 / MAX: 6952 MIN: 36 / MAX: 6956 MIN: 36 / MAX: 6949
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU g e f d 0.48 0.96 1.44 1.92 2.4 2.11813 2.12570 2.13062 2.13332 MIN: 1.99 MIN: 2.01 MIN: 1.97 MIN: 2 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU e g d f 0.3539 0.7078 1.0617 1.4156 1.7695 1.54911 1.55118 1.55824 1.57282 MIN: 1.51 MIN: 1.52 MIN: 1.51 MIN: 1.53 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU g d e f 0.3019 0.6038 0.9057 1.2076 1.5095 1.33564 1.33789 1.33861 1.34183 MIN: 1.31 MIN: 1.31 MIN: 1.31 MIN: 1.31 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU d f g e 0.8649 1.7298 2.5947 3.4596 4.3245 3.81576 3.81823 3.82381 3.84421 MIN: 3.26 MIN: 3.25 MIN: 3.29 MIN: 3.27 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU d g f e 0.1426 0.2852 0.4278 0.5704 0.713 0.628236 0.629108 0.630325 0.633975 MIN: 0.6 MIN: 0.6 MIN: 0.6 MIN: 0.6 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU g f d e 0.6893 1.3786 2.0679 2.7572 3.4465 3.05458 3.05674 3.05991 3.06370 MIN: 2.97 MIN: 2.97 MIN: 2.96 MIN: 2.97 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU d f g e 0.7615 1.523 2.2845 3.046 3.8075 3.37782 3.37956 3.38156 3.38436 MIN: 3.33 MIN: 3.33 MIN: 3.33 MIN: 3.33 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU g e d f 0.1914 0.3828 0.5742 0.7656 0.957 0.843492 0.844434 0.847805 0.850691 MIN: 0.83 MIN: 0.83 MIN: 0.83 MIN: 0.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU g d f e 0.4315 0.863 1.2945 1.726 2.1575 1.91274 1.91374 1.91422 1.91781 MIN: 1.88 MIN: 1.88 MIN: 1.88 MIN: 1.88 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU d f g e 0.5772 1.1544 1.7316 2.3088 2.886 2.49408 2.49714 2.51441 2.56522 MIN: 2.3 MIN: 2.26 MIN: 2.3 MIN: 2.32 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU g d f e 0.148 0.296 0.444 0.592 0.74 0.647700 0.652259 0.653182 0.657610 MIN: 0.57 MIN: 0.57 MIN: 0.57 MIN: 0.57 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU f d g e 0.2575 0.515 0.7725 1.03 1.2875 1.00136 1.03749 1.12723 1.14432 MIN: 0.92 MIN: 0.92 MIN: 0.93 MIN: 1.07 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU f d g e 0.2881 0.5762 0.8643 1.1524 1.4405 1.20653 1.25758 1.27918 1.28043 MIN: 1.18 MIN: 1.21 MIN: 1.24 MIN: 1.24 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU e f d g 0.1378 0.2756 0.4134 0.5512 0.689 0.575794 0.600834 0.603950 0.612320 MIN: 0.52 MIN: 0.53 MIN: 0.53 MIN: 0.53 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU d g e f 0.2388 0.4776 0.7164 0.9552 1.194 1.02875 1.04567 1.05425 1.06144 MIN: 0.96 MIN: 0.98 MIN: 0.97 MIN: 0.98 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU f g e d 400 800 1200 1600 2000 1636.76 1637.37 1641.00 1641.92 MIN: 1585.98 MIN: 1584.58 MIN: 1595.55 MIN: 1584.81 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU g f e d 400 800 1200 1600 2000 1631.99 1636.44 1639.36 1642.51 MIN: 1581.62 MIN: 1585.81 MIN: 1581.93 MIN: 1593.16 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU g f e d 400 800 1200 1600 2000 1641.40 1642.35 1643.97 1643.99 MIN: 1589.91 MIN: 1586.17 MIN: 1590.89 MIN: 1588.03 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU d g e f 200 400 600 800 1000 838.52 848.03 849.71 851.49 MIN: 796.3 MIN: 807.34 MIN: 805.98 MIN: 807.97 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU g d f e 200 400 600 800 1000 837.60 849.16 849.34 851.66 MIN: 796.61 MIN: 806.44 MIN: 805.8 MIN: 809.45 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU e f d g 200 400 600 800 1000 841.08 845.31 847.38 847.42 MIN: 798.46 MIN: 803.78 MIN: 806.33 MIN: 806.72 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
a Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113eJava Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 7 October 2023 01:18 by user .
b Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113eJava Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 7 October 2023 13:27 by user .
c Processor: 2 x AMD EPYC 9254 24-Core @ 2.90GHz (48 Cores / 96 Threads), Motherboard: Supermicro H13DSH (1.5 BIOS), Memory: 24 x 32 GB DDR5-4800MT/s Samsung M321R4GA3BB6-CQKET, Disk: 2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07, Graphics: astdrmfb
OS: AlmaLinux 9.2, Kernel: 5.14.0-284.25.1.el9_2.x86_64 (x86_64), Compiler: GCC 11.3.1 20221121, File-System: ext4, Screen Resolution: 1024x768
Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113eJava Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 7 October 2023 19:27 by user .
d Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Java Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 20 October 2023 12:43 by user .
e Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Java Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 20 October 2023 22:56 by user .
f Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Java Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 21 October 2023 12:57 by user .
g Processor: AMD EPYC 9124 16-Core @ 3.00GHz (16 Cores / 32 Threads), Motherboard: Supermicro H13SSW (1.1 BIOS), Memory: 12 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N, Disk: 2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07, Graphics: astdrmfb
OS: AlmaLinux 9.2, Kernel: 5.14.0-284.25.1.el9_2.x86_64 (x86_64), Compiler: GCC 11.3.1 20221121, File-System: ext4, Screen Resolution: 1024x768
Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-islProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Java Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Notes: Python 3.9.16Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 22 October 2023 13:18 by user .