Tests for a future article. 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 22.10 via the Phoronix Test Suite.
a Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0xd000389Java Notes: OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu122.10.1)Python Notes: Python 3.10.7Security Notes: dodt: Mitigation of DOITM + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
b Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads), Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS), Chipset: Intel Ice Lake IEH, Memory: 512GB, Disk: 7682GB INTEL SSDPF2KX076TZ, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP
OS: Ubuntu 22.10, Kernel: 6.2.0-rc5-phx-dodt (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server 1.21.1.3, Vulkan: 1.3.224, Compiler: GCC 12.2.0, File-System: ext4, Screen Resolution: 1920x1080
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 128 a b 400 800 1200 1600 2000 SE +/- 54.55, N = 2 1055.3 1946.7 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 256 b a 130 260 390 520 650 SE +/- 2.65, N = 2 592.5 599.8 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 a b 140 280 420 560 700 SE +/- 2.35, N = 2 633.2 639.3 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 b a 300 600 900 1200 1500 SE +/- 1.25, N = 2 1216.0 1219.9 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example b a 3 6 9 12 15 SE +/- 0.10, N = 2 12.37 12.20 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenBenchmarking.org FPS, More Is Better dav1d 1.2.1 Video Input: Summer Nature 4K a b 60 120 180 240 300 SE +/- 0.08, N = 2 282.53 282.65 1. (CC) gcc options: -pthread -lm
OpenBenchmarking.org FPS, More Is Better dav1d 1.2.1 Video Input: Summer Nature 1080p b a 150 300 450 600 750 SE +/- 0.50, N = 2 699.09 699.97 1. (CC) gcc options: -pthread -lm
OpenBenchmarking.org FPS, More Is Better dav1d 1.2.1 Video Input: Chimera 1080p 10-bit b a 100 200 300 400 500 SE +/- 0.41, N = 2 476.77 476.82 1. (CC) gcc options: -pthread -lm
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer - Model: Crown b a 16 32 48 64 80 SE +/- 0.12, N = 2 70.79 72.04 MIN: 67 / MAX: 79.71 MIN: 68.2 / MAX: 79.55
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Crown a b 20 40 60 80 100 SE +/- 0.10, N = 2 87.93 87.93 MIN: 85.27 / MAX: 92.58 MIN: 84.73 / MAX: 92.37
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time b a 6 12 18 24 30 SE +/- 0.09, N = 2 24.62 24.64
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time b a 5 10 15 20 25 SE +/- 0.22, N = 2 20.95 21.21
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time b a 5 10 15 20 25 SE +/- 0.05, N = 2 20.48 20.81
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time b a 5 10 15 20 25 SE +/- 0.00, N = 2 22.58 22.70
Opus Codec Encoding Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus five times. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.4 WAV To Opus Encode a b 8 16 24 32 40 SE +/- 0.01, N = 2 36.74 36.73 1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 32 a b 200M 400M 600M 800M 1000M SE +/- 2085000.00, N = 2 992540000 993445000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 57 b a 300M 600M 900M 1200M 1500M SE +/- 20600000.00, N = 2 1185500000 1197700000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 64 - Buffer Length: 256 - Filter Length: 32 a b 400M 800M 1200M 1600M 2000M SE +/- 2750000.00, N = 2 1805000000 1825450000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 64 - Buffer Length: 256 - Filter Length: 57 a b 400M 800M 1200M 1600M 2000M SE +/- 13650000.00, N = 2 2069200000 2076650000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 128 - Buffer Length: 256 - Filter Length: 32 b a 600M 1200M 1800M 2400M 3000M SE +/- 6350000.00, N = 2 2945150000 2961100000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 128 - Buffer Length: 256 - Filter Length: 57 b a 500M 1000M 1500M 2000M 2500M SE +/- 13350000.00, N = 2 2426350000 2519200000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 160 - Buffer Length: 256 - Filter Length: 32 b a 700M 1400M 2100M 2800M 3500M SE +/- 9150000.00, N = 2 3381950000 3390700000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 160 - Buffer Length: 256 - Filter Length: 57 a b 600M 1200M 1800M 2400M 3000M SE +/- 17250000.00, N = 2 2602300000 2636450000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 512 b a 90M 180M 270M 360M 450M SE +/- 2775000.00, N = 2 396265000 400730000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 64 - Buffer Length: 256 - Filter Length: 512 a b 160M 320M 480M 640M 800M SE +/- 1250000.00, N = 2 725840000 730310000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 128 - Buffer Length: 256 - Filter Length: 512 b a 200M 400M 600M 800M 1000M SE +/- 2180000.00, N = 2 945190000 949400000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 160 - Buffer Length: 256 - Filter Length: 512 b a 200M 400M 600M 800M 1000M SE +/- 1800000.00, N = 2 1011200000 1013200000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200
a: Test failed to run.
Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500
a: Test failed to run.
Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200
a: Test failed to run.
Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500
a: Test failed to run.
Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200
a: Test failed to run.
Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500
a: Test failed to run.
Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200
a: Test failed to run.
Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500
a: Test failed to run.
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.30 b a 600 1200 1800 2400 3000 SE +/- 1.80, N = 2 SE +/- 2.00, N = 2 2607.6 2622.9 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 b a 20 40 60 80 100 SE +/- 0.95, N = 2 SE +/- 1.11, N = 2 94.34 94.83 1. (CXX) g++ options: -O3
OpenBenchmarking.org Seconds, Fewer Is Better Z3 Theorem Prover 4.12.1 SMT File: 2.smt2 a b 20 40 60 80 100 SE +/- 0.02, N = 2 SE +/- 0.05, N = 2 88.00 87.18 1. (CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: Downlink Processor Benchmark a b 120 240 360 480 600 SE +/- 0.70, N = 2 SE +/- 1.25, N = 2 556.5 556.8 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: PUSCH Processor Benchmark, Throughput Total b a 2K 4K 6K 8K 10K SE +/- 33.95, N = 2 SE +/- 47.35, N = 2 9756.7 9800.5 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: PUSCH Processor Benchmark, Throughput Thread b a 40 80 120 160 200 SE +/- 0.90, N = 2 SE +/- 1.70, N = 2 164.7 164.8 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer - Model: Asian Dragon b a 20 40 60 80 100 SE +/- 0.14, N = 2 SE +/- 0.04, N = 2 85.13 85.24 MIN: 83.65 / MAX: 90.45 MIN: 83.75 / MAX: 89.99
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer - Model: Asian Dragon Obj a b 20 40 60 80 100 SE +/- 0.03, N = 2 SE +/- 0.03, N = 2 76.96 77.26 MIN: 75.53 / MAX: 82.14 MIN: 75.78 / MAX: 81.08
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon a b 20 40 60 80 100 SE +/- 0.32, N = 2 SE +/- 0.24, N = 2 104.41 104.55 MIN: 101.88 / MAX: 109.22 MIN: 102.2 / MAX: 108.91
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon Obj a b 20 40 60 80 100 SE +/- 0.06, N = 2 SE +/- 0.05, N = 2 89.84 90.01 MIN: 87.68 / MAX: 94.71 MIN: 87.6 / MAX: 94.43
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Fast a b 1.2863 2.5726 3.8589 5.1452 6.4315 SE +/- 0.019, N = 2 SE +/- 0.063, N = 2 5.672 5.717 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Faster a b 3 6 9 12 15 SE +/- 0.03, N = 2 SE +/- 0.10, N = 2 10.28 10.43 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 1080p - Video Preset: Fast a b 4 8 12 16 20 SE +/- 0.06, N = 2 SE +/- 0.04, N = 2 15.71 15.72 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 1080p - Video Preset: Faster a b 7 14 21 28 35 SE +/- 0.37, N = 2 SE +/- 0.23, N = 2 29.08 29.18 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 32 b a 7M 14M 21M 28M 35M SE +/- 0.00, N = 2 SE +/- 0.00, N = 2 32267000 32338000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 57 a b 12M 24M 36M 48M 60M SE +/- 500.00, N = 2 SE +/- 1500.00, N = 2 53918500 53926500 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 512 b a 3M 6M 9M 12M 15M SE +/- 34000.00, N = 2 SE +/- 1000.00, N = 2 13291000 13323000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 32 a b 110M 220M 330M 440M 550M SE +/- 1500000.00, N = 2 SE +/- 830000.00, N = 2 493660000 498410000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 57 a b 130M 260M 390M 520M 650M SE +/- 3155000.00, N = 2 SE +/- 11305000.00, N = 2 615105000 623535000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 512 b a 40M 80M 120M 160M 200M SE +/- 650000.00, N = 2 SE +/- 795000.00, N = 2 198790000 201615000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Dragonflydb Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.
Clients Per Thread: 10 - Set To Get Ratio: 1:5
b: The test run did not produce a result. E: Connection error: Connection reset by peer
a: The test run did not produce a result. E: Connection error: Connection reset by peer
Clients Per Thread: 20 - Set To Get Ratio: 1:5
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
Clients Per Thread: 50 - Set To Get Ratio: 1:5
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
Clients Per Thread: 60 - Set To Get Ratio: 1:5
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
Clients Per Thread: 10 - Set To Get Ratio: 1:10
b: The test run did not produce a result. E: Connection error: Connection reset by peer
a: The test run did not produce a result. E: Connection error: Connection reset by peer
Clients Per Thread: 20 - Set To Get Ratio: 1:10
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
Clients Per Thread: 50 - Set To Get Ratio: 1:10
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
Clients Per Thread: 60 - Set To Get Ratio: 1:10
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
Clients Per Thread: 10 - Set To Get Ratio: 1:100
b: The test run did not produce a result. E: Connection error: Connection reset by peer
a: The test run did not produce a result. E: Connection error: Connection reset by peer
Clients Per Thread: 20 - Set To Get Ratio: 1:100
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
Clients Per Thread: 50 - Set To Get Ratio: 1:100
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
Clients Per Thread: 60 - Set To Get Ratio: 1:100
b: The test run did not produce a result. E: Connection error: Connection refused
a: The test run did not produce a result. E: Connection error: Connection refused
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Zlib a b 1500 3000 4500 6000 7500 SE +/- 8.83, N = 2 SE +/- 4.39, N = 2 6879.86 6880.22 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Cloning b a 3K 6K 9K 12K 15K SE +/- 654.66, N = 2 SE +/- 3270.70, N = 2 13172.81 16195.03 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Pthread b a 20K 40K 60K 80K 100K SE +/- 894.90, N = 2 SE +/- 279.15, N = 2 90361.70 92131.54 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Floating Point b a 5K 10K 15K 20K 25K SE +/- 9.14, N = 2 SE +/- 6.86, N = 2 21133.02 21134.81 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Matrix 3D Math b a 3K 6K 9K 12K 15K SE +/- 5.06, N = 2 SE +/- 9.80, N = 2 12742.70 12743.81 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Vector Shuffle a b 10K 20K 30K 40K 50K SE +/- 1.26, N = 2 SE +/- 42.22, N = 2 48054.48 48076.78 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Wide Vector Math a b 500K 1000K 1500K 2000K 2500K SE +/- 1200.16, N = 2 SE +/- 497.69, N = 2 2195391.41 2196242.21 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Fused Multiply-Add a b 40M 80M 120M 160M 200M SE +/- 118686.48, N = 2 SE +/- 92010.25, N = 2 181083180.47 181314757.42 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.10 Test: Vector Floating Point b a 30K 60K 90K 120K 150K SE +/- 235.00, N = 2 SE +/- 872.22, N = 2 131100.25 132479.08 1. (CXX) g++ options: -O2 -std=gnu99 -lc
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GPAW 23.6 Input: Carbon Nanotube a b 10 20 30 40 50 SE +/- 0.02, N = 2 SE +/- 0.03, N = 2 45.82 45.64 1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 b a 2 4 6 8 10 SE +/- 0.02, N = 2 SE +/- 0.13, N = 2 7.94 7.91 MIN: 7.81 / MAX: 10.99 MIN: 7.68 / MAX: 9.6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 b a 2 4 6 8 10 SE +/- 0.03, N = 2 SE +/- 0.14, N = 2 8.77 8.71 MIN: 8.59 / MAX: 32.78 MIN: 8.43 / MAX: 9.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: shufflenet-v2 a b 3 6 9 12 15 SE +/- 0.03, N = 2 SE +/- 0.06, N = 2 9.82 9.75 MIN: 9.6 / MAX: 12.61 MIN: 9.56 / MAX: 13.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mnasnet a b 2 4 6 8 10 SE +/- 0.07, N = 2 SE +/- 0.01, N = 2 7.61 7.43 MIN: 7.33 / MAX: 43.29 MIN: 7.16 / MAX: 15.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: efficientnet-b0 b a 3 6 9 12 15 SE +/- 0.26, N = 2 SE +/- 0.23, N = 2 11.64 11.48 MIN: 10.85 / MAX: 37.54 MIN: 10.9 / MAX: 56.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: blazeface b a 1.0373 2.0746 3.1119 4.1492 5.1865 SE +/- 0.01, N = 2 SE +/- 0.09, N = 2 4.61 4.49 MIN: 4.49 / MAX: 5.33 MIN: 4.31 / MAX: 5.13 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: googlenet a b 4 8 12 16 20 SE +/- 1.05, N = 2 SE +/- 0.37, N = 2 17.06 16.72 MIN: 15.5 / MAX: 66.12 MIN: 15.67 / MAX: 100.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vgg16 a b 6 12 18 24 30 SE +/- 0.84, N = 2 SE +/- 0.34, N = 2 26.27 26.19 MIN: 24.05 / MAX: 301.35 MIN: 24.19 / MAX: 341.6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet18 a b 3 6 9 12 15 SE +/- 1.04, N = 2 SE +/- 0.31, N = 2 10.31 9.63 MIN: 9.03 / MAX: 33.3 MIN: 9.16 / MAX: 26.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: alexnet a b 1.2713 2.5426 3.8139 5.0852 6.3565 SE +/- 0.43, N = 2 SE +/- 0.22, N = 2 5.65 5.44 MIN: 5.03 / MAX: 6.71 MIN: 5.08 / MAX: 7.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet50 b a 4 8 12 16 20 SE +/- 0.83, N = 2 SE +/- 0.36, N = 2 18.15 17.57 MIN: 16.98 / MAX: 42.81 MIN: 16.92 / MAX: 18.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: yolov4-tiny a b 6 12 18 24 30 SE +/- 0.65, N = 2 SE +/- 0.64, N = 2 24.77 24.48 MIN: 22.68 / MAX: 208.18 MIN: 22.66 / MAX: 47.54 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: squeezenet_ssd b a 4 8 12 16 20 SE +/- 0.41, N = 2 SE +/- 0.07, N = 2 16.13 15.78 MIN: 15.35 / MAX: 39.36 MIN: 15.4 / MAX: 43.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: regnety_400m b a 9 18 27 36 45 SE +/- 1.10, N = 2 SE +/- 0.86, N = 2 39.37 38.20 MIN: 37.07 / MAX: 103.97 MIN: 36.18 / MAX: 62.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vision_transformer a b 11 22 33 44 55 SE +/- 2.46, N = 2 SE +/- 1.19, N = 2 46.50 45.56 MIN: 42.6 / MAX: 72.28 MIN: 43.24 / MAX: 70.59 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: FastestDet b a 3 6 9 12 15 SE +/- 0.28, N = 2 SE +/- 0.05, N = 2 10.01 9.62 MIN: 9.4 / MAX: 59.43 MIN: 9.35 / MAX: 10.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only a b 6 12 18 24 30 SE +/- 0.09, N = 2 SE +/- 0.06, N = 2 23.69 23.62
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only a b 50 100 150 200 250 SE +/- 0.32, N = 2 SE +/- 1.32, N = 2 239.55 239.03
a Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0xd000389Java Notes: OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu122.10.1)Python Notes: Python 3.10.7Security Notes: dodt: Mitigation of DOITM + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 6 August 2023 10:22 by user phoronix.
b Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads), Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS), Chipset: Intel Ice Lake IEH, Memory: 512GB, Disk: 7682GB INTEL SSDPF2KX076TZ, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP
OS: Ubuntu 22.10, Kernel: 6.2.0-rc5-phx-dodt (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server 1.21.1.3, Vulkan: 1.3.224, Compiler: GCC 12.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0xd000389Java Notes: OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu122.10.1)Python Notes: Python 3.10.7Security Notes: dodt: Mitigation of DOITM + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 6 August 2023 13:40 by user phoronix.