Tests for a future article. 2 x Intel Xeon Max 9480 testing with a Supermicro X13DEM v1.10 (1.3 BIOS) and ASPEED on Fedora Linux 38 via the Phoronix Test Suite.
a Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driverProcessor Notes: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x2c0001d1Python Notes: Python 3.11.3Security Notes: SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
b Processor: 2 x Intel Xeon Max 9480 @ 3.50GHz (112 Cores / 224 Threads), Motherboard: Supermicro X13DEM v1.10 (1.3 BIOS), Chipset: Intel Device 1bce, Memory: 512GB, Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007, Graphics: ASPEED, Monitor: VE228, Network: 2 x Broadcom BCM57508 NetXtreme-E 10Gb/25Gb/40Gb/50Gb/100Gb/200Gb
OS: Fedora Linux 38, Kernel: 6.2.15-300.fc38.x86_64 (x86_64), Compiler: GCC 13.1.1 20230511 + Clang 16.0.3 + LLVM 16.0.3, File-System: xfs, Screen Resolution: 1920x1080
ff OpenBenchmarking.org Phoronix Test Suite 2 x Intel Xeon Max 9480 @ 3.50GHz (112 Cores / 224 Threads) Supermicro X13DEM v1.10 (1.3 BIOS) Intel Device 1bce 512GB 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007 ASPEED VE228 2 x Broadcom BCM57508 NetXtreme-E 10Gb/25Gb/40Gb/50Gb/100Gb/200Gb Fedora Linux 38 6.2.15-300.fc38.x86_64 (x86_64) GCC 13.1.1 20230511 + Clang 16.0.3 + LLVM 16.0.3 xfs 1920x1080 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Compiler File-System Screen Resolution Ff Benchmarks System Logs - Transparent Huge Pages: madvise - --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver - Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x2c0001d1 - Python 3.11.3 - SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
a vs. b Comparison Phoronix Test Suite Baseline +262.8% +262.8% +525.6% +525.6% +788.4% +788.4% 1051% 25.8% 14.9% 14.6% 13.9% 13.1% 11.7% 10.9% 10.6% 10.4% 9.4% 9.4% 8.5% 7.7% 7.6% 7.4% 6.7% 5.3% 5.2% 5.2% 5.1% 4.7% 4.4% 4.3% 4.2% 3.9% 3.8% 3.6% 2.9% 2.1% 2% 2% Ringcoin r2c - Stock - double-long - 256 c2c - Stock - float-long - 512 25.1% V.D.F - CPU 17.3% V.D.F - CPU 17.3% c2c - FFTW - double-long - 128 r2c - FFTW - double-long - 256 r2c - FFTW - float-long - 256 2 - 4K - 32 - Path Tracer - CPU clover_bm64_short r2c - FFTW - double-long - 512 c2c - Stock - double - 128 c2c - Stock - double-long - 128 10.5% c2c - Stock - double - 512 c2c - Stock - float-long - 256 9.5% 2 - 4K - 1 - Path Tracer - CPU c2c - FFTW - double-long - 256 r2c - FFTW - float - 256 8.7% r2c - Stock - float - 256 8.6% r2c - FFTW - float-long - 512 c2c - FFTW - float - 256 8% r2c - FFTW - double - 256 c2c - FFTW - float - 128 A.G.R.R.0.F - CPU r2c - Stock - double - 128 7.1% r2c - Stock - double-long - 512 2 - 4K - 16 - Path Tracer - CPU 6.5% r2c - FFTW - float - 128 6.5% r2c - FFTW - double - 512 5.8% 1 - 1080p - 32 - Path Tracer - CPU P.D.F - CPU P.D.F - CPU c2c - Stock - double-long - 256 M.T.E.T.D.F - CPU 5% M.T.E.T.D.F - CPU 5% c2c - Stock - float-long - 128 4.9% 1 - 4K - 1 - Path Tracer - CPU 4.8% c2c - FFTW - double - 128 4.7% 3 - 1080p - 32 - Path Tracer - CPU 4.7% c2c - FFTW - float - 512 c2c - FFTW - double - 256 4.6% r2c - Stock - float - 512 4.6% Deepcoin 4.6% scrypt 4.6% O_ae_pyscf_UHF r2c - FFTW - double-long - 128 4.3% 3 - 1080p - 1 - Path Tracer - CPU 4.3% r2c - Stock - float-long - 256 3 - 1080p - 16 - Path Tracer - CPU 4.2% FeCO6_b3lyp_gms 4.2% clover_bm c2c - FFTW - float-long - 512 4.1% c2c - FFTW - float-long - 128 1 - 4K - 32 - Path Tracer - CPU 1 - 1080p - 16 - Path Tracer - CPU 3.6% r2c - Stock - float - 128 H4_ae 3.5% r2c - Stock - double - 256 3.2% R.S.A.F - CPU 2.9% c2c - FFTW - float-long - 256 R.S.A.F - CPU 2.8% r2c - FFTW - float - 512 2.5% clover_bm16 2.4% 3 - 4K - 1 - Path Tracer - CPU 2.4% 2 - 1080p - 1 - Path Tracer - CPU 2.4% LiH_ae_MSD 2.3% Garlicoin 2.3% 1 - 1080p - 1 - Path Tracer - CPU 2.1% 1 - 4K - 16 - Path Tracer - CPU P.D.F - CPU P.D.F - CPU Cpuminer-Opt HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale OpenVINO OpenVINO HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio CloverLeaf HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale OpenVINO HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio OpenVINO OpenVINO HeFFTe - Highly Efficient FFT for Exascale OpenVINO OpenVINO HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale Cpuminer-Opt Cpuminer-Opt QMCPACK HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio QMCPACK CloverLeaf HeFFTe - Highly Efficient FFT for Exascale HeFFTe - Highly Efficient FFT for Exascale OSPRay Studio OSPRay Studio HeFFTe - Highly Efficient FFT for Exascale QMCPACK HeFFTe - Highly Efficient FFT for Exascale OpenVINO HeFFTe - Highly Efficient FFT for Exascale OpenVINO HeFFTe - Highly Efficient FFT for Exascale CloverLeaf OSPRay Studio OSPRay Studio QMCPACK Cpuminer-Opt OSPRay Studio OSPRay Studio OpenVINO OpenVINO a b
ff cloverleaf: clover_bm16 qmcpack: O_ae_pyscf_UHF build-gem5: Time To Compile cpuminer-opt: Garlicoin qmcpack: FeCO6_b3lyp_gms qmcpack: Li2_STO_ae qmcpack: LiH_ae_MSD ospray-studio: 2 - 1080p - 32 - Path Tracer - CPU ospray-studio: 3 - 1080p - 16 - Path Tracer - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer - CPU ospray-studio: 1 - 1080p - 16 - Path Tracer - CPU ospray-studio: 3 - 1080p - 32 - Path Tracer - CPU ospray-studio: 2 - 1080p - 16 - Path Tracer - CPU quantlib: Multi-Threaded ospray-studio: 3 - 4K - 32 - Path Tracer - CPU ospray-studio: 2 - 4K - 32 - Path Tracer - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU ospray-studio: 3 - 4K - 1 - Path Tracer - CPU ospray-studio: 1 - 4K - 1 - Path Tracer - CPU ospray-studio: 2 - 4K - 1 - Path Tracer - CPU ospray-studio: 1 - 4K - 32 - Path Tracer - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU cloverleaf: clover_bm64_short openvino: Face Detection FP16 - CPU openvino: Face Detection FP16 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP32 - CPU openvino: Person Detection FP32 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU ospray-studio: 3 - 1080p - 1 - Path Tracer - CPU ospray-studio: 2 - 1080p - 1 - Path Tracer - CPU ospray-studio: 1 - 1080p - 1 - Path Tracer - CPU ospray-studio: 3 - 4K - 16 - Path Tracer - CPU qmcpack: simple-H2O ospray-studio: 2 - 4K - 16 - Path Tracer - CPU ospray-studio: 1 - 4K - 16 - Path Tracer - CPU cloverleaf: clover_bm cpuminer-opt: Ringcoin cpuminer-opt: Myriad-Groestl cpuminer-opt: Deepcoin cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Triple SHA-256, Onecoin cpuminer-opt: LBC, LBRY Credits cpuminer-opt: scrypt cpuminer-opt: Blake-2 S cpuminer-opt: Skeincoin cpuminer-opt: Magi qmcpack: H4_ae quantlib: Single-Threaded heffte: c2c - Stock - double - 512 heffte: c2c - Stock - double-long - 512 heffte: c2c - FFTW - double-long - 512 heffte: c2c - FFTW - double - 512 heffte: r2c - Stock - double-long - 512 heffte: r2c - FFTW - double - 512 heffte: c2c - Stock - float - 512 heffte: c2c - Stock - float-long - 512 heffte: r2c - Stock - double - 512 heffte: r2c - FFTW - double-long - 512 heffte: c2c - FFTW - float - 512 heffte: c2c - FFTW - float-long - 512 heffte: r2c - Stock - float - 512 heffte: r2c - Stock - float-long - 512 heffte: r2c - FFTW - float - 512 heffte: r2c - FFTW - float-long - 512 heffte: c2c - FFTW - double - 256 heffte: c2c - FFTW - float - 256 heffte: r2c - FFTW - double - 256 heffte: c2c - Stock - double - 256 heffte: r2c - FFTW - double-long - 256 heffte: c2c - FFTW - double-long - 256 heffte: c2c - Stock - double-long - 256 heffte: r2c - FFTW - double - 128 heffte: r2c - Stock - float-long - 256 heffte: r2c - FFTW - double-long - 128 heffte: c2c - FFTW - double - 128 heffte: r2c - Stock - double - 256 heffte: c2c - Stock - float-long - 256 heffte: r2c - FFTW - float-long - 256 heffte: c2c - FFTW - float-long - 256 heffte: r2c - Stock - double-long - 256 heffte: c2c - FFTW - double-long - 128 heffte: r2c - Stock - double-long - 128 heffte: c2c - FFTW - float-long - 128 heffte: c2c - Stock - float - 256 heffte: c2c - Stock - float - 128 heffte: c2c - FFTW - float - 128 heffte: c2c - Stock - double-long - 128 heffte: r2c - Stock - float - 256 heffte: r2c - FFTW - float-long - 128 heffte: c2c - Stock - float-long - 128 heffte: r2c - FFTW - float - 256 heffte: r2c - FFTW - float - 128 heffte: r2c - Stock - double - 128 heffte: r2c - Stock - float - 128 heffte: c2c - Stock - double - 128 heffte: r2c - Stock - float-long - 128 a b 540.77 313.48 271.371 21770 150.81 135.63 118.27 14526 7954 13423 6558 16130 7000 254351.8 68616 66307 344.53 324.39 1995 1672 1856 59552 18.3 6106.89 69.07 331.96 111.23 74.75 1497.11 79.98 461.84 85.38 432.58 9.94 11244.03 22.32 5009.99 134.38 275.1 32.21 1146.17 6.93 16074.67 47.04 2378.91 31.34 3570.4 6.77 16277.65 0.58 76079.37 4.24 25636.44 14.59 2528.2 0.34 113019.68 539 461 435 36981 36.365 32908 32679 38.11 1521.29 34040 24960 205870 293830 44560 864.21 385610 98770 1943.59 17.03 3374.9 66.9899 70.9057 81.8561 82.7703 131.835 152.226 144.988 156.61 138.996 143.889 153.009 166.289 281.35 282.967 314.76 318.074 77.5444 178.389 156.914 93.0989 125.631 74.4546 85.7117 107.367 344.022 112.175 81.6107 180.517 189.416 295.486 172.216 160.38 69.5656 108.822 127.501 178.871 107.218 118.069 80.6584 362.6 164.452 108.79 333.309 162.588 113.205 139.874 72.125 141.025 553.98 300.34 268.693 21290 157.11 135.73 121 14526 8287 12752 6797 16884 6968 251353.6 69170 58605 348.63 320.18 2043 1753 1697 57371 18.45 6061.06 61.86 334.67 110.31 75.2 1487.31 76.05 485.89 83.7 441.39 10.08 11084.65 22.74 4915.16 141.13 261.98 33.15 1114.41 7.02 15816.99 46.87 2387.67 31.4 3563.39 6.85 16076.48 0.54 77488.89 4.25 25515.81 17.12 2154.43 0.34 111459.35 562 472 444 37359 36.561 35051 32018 36.59 17510 33810 23870 202620 289590 45400 826.56 381310 98550 1953.44 17.62 3380.5 73.9725 71.425 83.2926 84.1653 140.671 143.878 142.489 125.233 137.713 159.639 160.125 159.724 268.915 283.078 307.2 345.077 74.1161 165.2 168.986 92.1398 143.953 81.4263 90.1166 106.777 358.685 107.579 77.9418 174.894 173.051 336.543 177.195 201.743 79.9377 107.145 132.462 181.545 105.962 127.051 72.9901 333.949 162.927 103.742 306.553 152.696 105.723 144.928 79.7728 139.99 OpenBenchmarking.org
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: O_ae_pyscf_UHF a b 70 140 210 280 350 313.48 300.34 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
Cpuminer-Opt Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Garlicoin a b 5K 10K 15K 20K 25K 21770 21290 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: FeCO6_b3lyp_gms a b 30 60 90 120 150 150.81 157.11 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: Li2_STO_ae a b 30 60 90 120 150 135.63 135.73 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: LiH_ae_MSD a b 30 60 90 120 150 118.27 121.00 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b 3K 6K 9K 12K 15K 14526 14526
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Multi-Threaded a b 50K 100K 150K 200K 250K 254351.8 251353.6 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b 15K 30K 45K 60K 75K 68616 69170
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU a b 80 160 240 320 400 344.53 348.63 MIN: 244.26 / MAX: 616.07 MIN: 272.63 / MAX: 578.72 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU a b 70 140 210 280 350 324.39 320.18 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b 400 800 1200 1600 2000 1995 2043
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU a b 5 10 15 20 25 18.30 18.45 MIN: 13.06 / MAX: 164.52 MIN: 12.93 / MAX: 150 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU a b 1300 2600 3900 5200 6500 6106.89 6061.06 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU a b 70 140 210 280 350 331.96 334.67 MIN: 187.36 / MAX: 1084 MIN: 183.18 / MAX: 985.34 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU a b 20 40 60 80 100 111.23 110.31 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU a b 20 40 60 80 100 74.75 75.20 MIN: 63.42 / MAX: 277.21 MIN: 57.73 / MAX: 264.58 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU a b 300 600 900 1200 1500 1497.11 1487.31 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU a b 20 40 60 80 100 79.98 76.05 MIN: 48.77 / MAX: 460.59 MIN: 49.34 / MAX: 447.22 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU a b 110 220 330 440 550 461.84 485.89 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU a b 20 40 60 80 100 85.38 83.70 MIN: 46.88 / MAX: 594.38 MIN: 45.91 / MAX: 581.94 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU a b 100 200 300 400 500 432.58 441.39 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU a b 3 6 9 12 15 9.94 10.08 MIN: 7.66 / MAX: 77.8 MIN: 7.51 / MAX: 82.52 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU a b 2K 4K 6K 8K 10K 11244.03 11084.65 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU a b 5 10 15 20 25 22.32 22.74 MIN: 15.03 / MAX: 203.42 MIN: 13.51 / MAX: 170.71 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU a b 1100 2200 3300 4400 5500 5009.99 4915.16 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU a b 30 60 90 120 150 134.38 141.13 MIN: 34.96 / MAX: 477.44 MIN: 41.95 / MAX: 530.13 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU a b 60 120 180 240 300 275.10 261.98 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU a b 8 16 24 32 40 32.21 33.15 MIN: 24.83 / MAX: 157.9 MIN: 24.53 / MAX: 163.42 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU a b 200 400 600 800 1000 1146.17 1114.41 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU a b 2 4 6 8 10 6.93 7.02 MIN: 5.7 / MAX: 46.87 MIN: 5.69 / MAX: 46.54 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU a b 3K 6K 9K 12K 15K 16074.67 15816.99 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU a b 11 22 33 44 55 47.04 46.87 MIN: 38.11 / MAX: 115.43 MIN: 37.56 / MAX: 101.62 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU a b 500 1000 1500 2000 2500 2378.91 2387.67 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU a b 7 14 21 28 35 31.34 31.40 MIN: 25.42 / MAX: 166.09 MIN: 25.73 / MAX: 154.04 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU a b 800 1600 2400 3200 4000 3570.40 3563.39 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU a b 2 4 6 8 10 6.77 6.85 MIN: 4.36 / MAX: 107.93 MIN: 4.32 / MAX: 100.8 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU a b 3K 6K 9K 12K 15K 16277.65 16076.48 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU a b 0.1305 0.261 0.3915 0.522 0.6525 0.58 0.54 MIN: 0.3 / MAX: 65.08 MIN: 0.3 / MAX: 79.98 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU a b 17K 34K 51K 68K 85K 76079.37 77488.89 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU a b 0.9563 1.9126 2.8689 3.8252 4.7815 4.24 4.25 MIN: 2.77 / MAX: 56.99 MIN: 2.75 / MAX: 63.39 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU a b 5K 10K 15K 20K 25K 25636.44 25515.81 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU a b 4 8 12 16 20 14.59 17.12 MIN: 9.44 / MAX: 121 MIN: 9.17 / MAX: 229.82 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU a b 500 1000 1500 2000 2500 2528.20 2154.43 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU a b 0.0765 0.153 0.2295 0.306 0.3825 0.34 0.34 MIN: 0.27 / MAX: 35.34 MIN: 0.27 / MAX: 38.43 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU a b 20K 40K 60K 80K 100K 113019.68 111459.35 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b 120 240 360 480 600 539 562
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: simple-H2O a b 8 16 24 32 40 36.37 36.56 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b 8K 16K 24K 32K 40K 32908 35051
Cpuminer-Opt Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Ringcoin a b 4K 8K 12K 16K 20K 1521.29 17510.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Quad SHA-256, Pyrite a b 40K 80K 120K 160K 200K 205870 202620 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Triple SHA-256, Onecoin a b 60K 120K 180K 240K 300K 293830 289590 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: H4_ae a b 4 8 12 16 20 17.03 17.62 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Single-Threaded a b 700 1400 2100 2800 3500 3374.9 3380.5 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
a Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driverProcessor Notes: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x2c0001d1Python Notes: Python 3.11.3Security Notes: SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 31 October 2023 18:06 by user .
b Processor: 2 x Intel Xeon Max 9480 @ 3.50GHz (112 Cores / 224 Threads), Motherboard: Supermicro X13DEM v1.10 (1.3 BIOS), Chipset: Intel Device 1bce, Memory: 512GB, Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007, Graphics: ASPEED, Monitor: VE228, Network: 2 x Broadcom BCM57508 NetXtreme-E 10Gb/25Gb/40Gb/50Gb/100Gb/200Gb
OS: Fedora Linux 38, Kernel: 6.2.15-300.fc38.x86_64 (x86_64), Compiler: GCC 13.1.1 20230511 + Clang 16.0.3 + LLVM 16.0.3, File-System: xfs, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driverProcessor Notes: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x2c0001d1Python Notes: Python 3.11.3Security Notes: SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 31 October 2023 20:09 by user .