AMD Ryzen Zen 4 ECC memory benchmarks by Michael Larabel for a future article.
ECC Disabled Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203
Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
ECC Processor: AMD Ryzen 9 7900X 12-Core @ 5.73GHz (12 Cores / 24 Threads), Motherboard: ASRockRack B650D4U-2L2T/BCM (2.09 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32 GB DDR5-4800MT/s MTC20C2085S1EC48BA1, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 0GB Virtual HDisk0 + 0GB Virtual HDisk1 + 0GB Virtual HDisk2 + 0GB Virtual HDisk3, Graphics: ASPEED 512MB, Audio: AMD Device 1640, Monitor: VA2431, Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 22.04, Kernel: 6.6.0-060600rc1daily20230913-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server, Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1920x1200
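The Kernel Notes report Transparent Huge Pages as madvise. As a minimal sketch of how that setting could be checked on a comparable Linux system, the snippet below parses the standard sysfs file `/sys/kernel/mm/transparent_hugepage/enabled`, in which the active mode is the bracketed token. The function name `active_thp_mode` is illustrative and not part of any tool used for these results.

```python
import re
from pathlib import Path

# Standard Linux sysfs location for the THP policy.
THP_PATH = Path("/sys/kernel/mm/transparent_hugepage/enabled")


def active_thp_mode(text: str) -> str:
    """Return the bracketed (active) mode, e.g. 'madvise'
    from the string 'always [madvise] never'."""
    match = re.search(r"\[(\w+)\]", text)
    if match is None:
        raise ValueError(f"no active mode found in {text!r}")
    return match.group(1)


if __name__ == "__main__":
    # Only read the file when it exists (i.e. on Linux with THP support).
    if THP_PATH.exists():
        print(active_thp_mode(THP_PATH.read_text()))
```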
OpenBenchmarking.org: Apache CouchDB 3.3.2 (Seconds, Fewer Is Better)
Bulk Size: 100 - Inserts: 3000 - Rounds: 30 | ECC Disabled: 230.48 (SE +/- 0.80, N = 3) | ECC: 229.28 (SE +/- 1.11, N = 3)
Bulk Size: 300 - Inserts: 1000 - Rounds: 30 | ECC Disabled: 109.15 (SE +/- 0.34, N = 3) | ECC: 109.62 (SE +/- 0.43, N = 3)
Bulk Size: 300 - Inserts: 3000 - Rounds: 30 | ECC Disabled: 359.99 (SE +/- 2.72, N = 3) | ECC: 358.40 (SE +/- 0.37, N = 3)
Bulk Size: 500 - Inserts: 1000 - Rounds: 30 | ECC Disabled: 184.11 (SE +/- 2.32, N = 12) | ECC: 184.75 (SE +/- 1.92, N = 12)
Bulk Size: 500 - Inserts: 3000 - Rounds: 30 | ECC Disabled: 646.61 (SE +/- 5.67, N = 8) | ECC: 638.92 (SE +/- 7.62, N = 9)
1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
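Each result is reported with a standard error (SE) and run count (N), which allows a rough check of whether an ECC versus ECC Disabled gap is larger than run-to-run noise. The sketch below uses the first CouchDB result (Bulk Size: 100 - Inserts: 3000 - Rounds: 30). It assumes the two runs are independent and uses a two-combined-standard-errors threshold, which is a common rule of thumb and not something the results page states. The helper names are hypothetical.

```python
import math


def relative_delta_percent(baseline: float, other: float) -> float:
    """Percent change of `other` relative to `baseline`."""
    return (other - baseline) / baseline * 100.0


def within_noise(a: float, se_a: float, b: float, se_b: float,
                 k: float = 2.0) -> bool:
    """True if |a - b| is at most k combined standard errors.

    Assumption: the runs are independent, so the standard errors
    combine as sqrt(se_a**2 + se_b**2). k = 2 is a rule of thumb.
    """
    return abs(a - b) <= k * math.hypot(se_a, se_b)


# CouchDB, Bulk Size: 100 - Inserts: 3000 - Rounds: 30 (seconds)
ecc_off, se_off = 230.48, 0.80
ecc_on, se_on = 229.28, 1.11

print(f"ECC vs ECC Disabled: {relative_delta_percent(ecc_off, ecc_on):+.2f}%")
print("within noise:", within_noise(ecc_off, se_off, ecc_on, se_on))
```

With only N = 3 runs per configuration in most tests, a check like this is indicative at best.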
OpenBenchmarking.org: Apache Hadoop 3.3.6 (Ops per sec, More Is Better)
Operation: Open - Threads: 50 - Files: 100000 | ECC Disabled: 839379 (SE +/- 10350.72, N = 12) | ECC: 830359 (SE +/- 15897.44, N = 15)
Operation: Open - Threads: 100 - Files: 100000 | ECC Disabled: 805165 (SE +/- 13606.70, N = 15) | ECC: 799264 (SE +/- 13410.36, N = 15)
Operation: Open - Threads: 20 - Files: 1000000 | ECC Disabled: 1767827 (SE +/- 1042.35, N = 3) | ECC: 1807250 (SE +/- 4344.35, N = 3)
Operation: Open - Threads: 50 - Files: 1000000 | ECC Disabled: 1961170 (SE +/- 19536.69, N = 3) | ECC: 1988614 (SE +/- 23302.59, N = 3)
Operation: Delete - Threads: 20 - Files: 100000 | ECC Disabled: 130612 (SE +/- 673.12, N = 3) | ECC: 129429 (SE +/- 660.95, N = 3)
Operation: Delete - Threads: 50 - Files: 100000 | ECC Disabled: 127197 (SE +/- 1147.56, N = 7) | ECC: 125519 (SE +/- 954.30, N = 15)
Operation: Open - Threads: 100 - Files: 1000000 | ECC Disabled: 2065407 (SE +/- 27286.10, N = 3) | ECC: 1989761 (SE +/- 24661.11, N = 15)
Operation: Rename - Threads: 20 - Files: 100000 | ECC Disabled: 101358 (SE +/- 1104.00, N = 15) | ECC: 98948 (SE +/- 1330.48, N = 3)
Operation: Rename - Threads: 50 - Files: 100000 | ECC Disabled: 100202 (SE +/- 323.23, N = 3) | ECC: 99905 (SE +/- 470.34, N = 3)
Operation: Create - Threads: 20 - Files: 1000000 | ECC Disabled: 91144 (SE +/- 362.23, N = 3) | ECC: 90059 (SE +/- 267.90, N = 3)
Operation: Create - Threads: 50 - Files: 1000000 | ECC Disabled: 89838 (SE +/- 267.21, N = 3) | ECC: 90748 (SE +/- 648.45, N = 3)
Operation: Delete - Threads: 100 - Files: 100000 | ECC Disabled: 127189 (SE +/- 1147.47, N = 15) | ECC: 127903 (SE +/- 1255.07, N = 15)
Operation: Delete - Threads: 20 - Files: 1000000 | ECC Disabled: 143370 (SE +/- 736.46, N = 3) | ECC: 143625 (SE +/- 382.11, N = 3)
Operation: Delete - Threads: 50 - Files: 1000000 | ECC Disabled: 141653 (SE +/- 1096.64, N = 3) | ECC: 140418 (SE +/- 283.49, N = 3)
Operation: Rename - Threads: 100 - Files: 100000 | ECC Disabled: 100433 (SE +/- 959.88, N = 15) | ECC: 97496 (SE +/- 915.43, N = 15)
Operation: Rename - Threads: 20 - Files: 1000000 | ECC Disabled: 108282 (SE +/- 378.60, N = 3) | ECC: 108155 (SE +/- 646.22, N = 3)
Operation: Rename - Threads: 50 - Files: 1000000 | ECC Disabled: 106306 (SE +/- 1059.57, N = 3) | ECC: 105277 (SE +/- 354.41, N = 3)
Operation: Create - Threads: 100 - Files: 1000000 | ECC Disabled: 89839 (SE +/- 358.07, N = 3) | ECC: 90122 (SE +/- 455.68, N = 3)
Operation: Delete - Threads: 100 - Files: 1000000 | ECC Disabled: 141149 (SE +/- 1332.20, N = 3) | ECC: 140586 (SE +/- 1323.41, N = 3)
Operation: Rename - Threads: 100 - Files: 1000000 | ECC Disabled: 103357 (SE +/- 223.84, N = 3) | ECC: 105242 (SE +/- 1138.94, N = 4)
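With this many Hadoop results, a single summary figure can be easier to read than twenty pairs. One common way to summarize paired benchmark results is the geometric mean of the per-test ratios. The sketch below does that for the three Hadoop Create results (1000000 files). The dictionary layout and function name are assumptions made for illustration, and the choice of geometric mean is not taken from the results page itself.

```python
from statistics import geometric_mean

# (ECC Disabled, ECC) ops/sec for Operation: Create - Files: 1000000
create_results = {
    "Threads: 20": (91144, 90059),
    "Threads: 50": (89838, 90748),
    "Threads: 100": (89839, 90122),
}


def ecc_ratio_geomean(results: dict) -> float:
    """Geometric mean of ECC / ECC Disabled across tests.

    For a "More Is Better" metric, a value below 1.0 means the ECC
    configuration was slower on average, above 1.0 faster.
    """
    ratios = [ecc / ecc_off for ecc_off, ecc in results.values()]
    return geometric_mean(ratios)


print(f"{ecc_ratio_geomean(create_results):.4f}")
```

For "Fewer Is Better" metrics such as seconds, the ratio would need to be inverted before the two kinds of result are combined.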
Apache IoTDB Apache IoTDB is a time series database and this benchmark is facilitated using the IoT Benchmark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Apache IoTDB 1.2
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 | point/sec, More Is Better | ECC Disabled: 66014976 (SE +/- 427325.61, N = 3) | ECC: 65805942 (SE +/- 351004.04, N = 3)
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 | Average Latency, Fewer Is Better | ECC Disabled: 68.57 (SE +/- 0.56, N = 3, MAX: 13857.17) | ECC: 69.44 (SE +/- 0.44, N = 3, MAX: 11348.56)
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 | point/sec, More Is Better | ECC Disabled: 65712193 (SE +/- 274619.09, N = 3) | ECC: 65758975 (SE +/- 218304.75, N = 3)
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 | Average Latency, Fewer Is Better | ECC Disabled: 242.49 (SE +/- 3.71, N = 3, MAX: 26726.56) | ECC: 241.75 (SE +/- 1.77, N = 3, MAX: 27025.88)
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 | point/sec, More Is Better | ECC Disabled: 72601202 (SE +/- 375866.34, N = 3) | ECC: 72019505 (SE +/- 321408.08, N = 3)
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 | Average Latency, Fewer Is Better | ECC Disabled: 100.88 (SE +/- 0.96, N = 3, MAX: 12636.24) | ECC: 101.64 (SE +/- 0.76, N = 3, MAX: 11366.39)
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 | point/sec, More Is Better | ECC Disabled: 73026782 (SE +/- 402756.47, N = 3) | ECC: 73119059 (SE +/- 157085.43, N = 3)
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 | Average Latency, Fewer Is Better | ECC Disabled: 343.57 (SE +/- 2.67, N = 3, MAX: 27057.97) | ECC: 345.18 (SE +/- 0.99, N = 3, MAX: 26517.23)
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 | point/sec, More Is Better | ECC Disabled: 70558904 (SE +/- 458439.80, N = 3) | ECC: 70732193 (SE +/- 327817.64, N = 3)
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 | Average Latency, Fewer Is Better | ECC Disabled: 66.65 (SE +/- 0.58, N = 3, MAX: 23921.42) | ECC: 66.73 (SE +/- 0.25, N = 3, MAX: 23906.66)
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 | point/sec, More Is Better | ECC Disabled: 71997380 (SE +/- 483767.39, N = 3) | ECC: 73102972 (SE +/- 328990.74, N = 3)
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 | Average Latency, Fewer Is Better | ECC Disabled: 252.53 (SE +/- 3.62, N = 3, MAX: 27096.99) | ECC: 246.82 (SE +/- 2.06, N = 3, MAX: 26952.07)
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 | point/sec, More Is Better | ECC Disabled: 66754764 (SE +/- 93381.16, N = 3) | ECC: 66848344 (SE +/- 302463.46, N = 3)
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 | Average Latency, Fewer Is Better | ECC Disabled: 114.67 (SE +/- 0.16, N = 3, MAX: 23908.06) | ECC: 114.38 (SE +/- 0.32, N = 3, MAX: 23891.9)
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 | point/sec, More Is Better | ECC Disabled: 68475092 (SE +/- 147482.62, N = 3) | ECC: 67564513 (SE +/- 261100.74, N = 3)
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 | Average Latency, Fewer Is Better | ECC Disabled: 431.96 (SE +/- 1.46, N = 3, MAX: 27343.45) | ECC: 431.60 (SE +/- 6.10, N = 3, MAX: 29656.61)
Blender
OpenBenchmarking.org: Blender 3.6 (Seconds, Fewer Is Better)
Blend File: BMW27 - Compute: CPU-Only | ECC Disabled: 67.05 (SE +/- 0.12, N = 3) | ECC: 67.06 (SE +/- 0.10, N = 3)
Blend File: Classroom - Compute: CPU-Only | ECC Disabled: 172.09 (SE +/- 0.19, N = 3) | ECC: 171.67 (SE +/- 0.08, N = 3)
Blend File: Fishy Cat - Compute: CPU-Only | ECC Disabled: 84.19 (SE +/- 0.07, N = 3) | ECC: 84.30 (SE +/- 0.13, N = 3)
Blend File: Barbershop - Compute: CPU-Only | ECC Disabled: 626.30 (SE +/- 0.93, N = 3) | ECC: 626.02 (SE +/- 0.13, N = 3)
Blend File: Pabellon Barcelona - Compute: CPU-Only | ECC Disabled: 207.14 (SE +/- 0.20, N = 3) | ECC: 207.94 (SE +/- 0.40, N = 3)
CockroachDB CockroachDB is a cloud-native, distributed SQL database for data intensive applications. This test profile uses a server-less CockroachDB configuration to test various Cockroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: CockroachDB 22.2 (ops/s, More Is Better)
Workload: MoVR - Concurrency: 128 | ECC Disabled: 1716.5 (SE +/- 0.46, N = 3) | ECC: 1709.8 (SE +/- 2.12, N = 3)
Workload: KV, 10% Reads - Concurrency: 128 | ECC Disabled: 65769.8 (SE +/- 203.95, N = 3) | ECC: 65696.6 (SE +/- 757.41, N = 4)
Workload: KV, 10% Reads - Concurrency: 256 | ECC Disabled: 64158.1 (SE +/- 21.82, N = 3) | ECC: 63955.2 (SE +/- 60.26, N = 3)
Workload: KV, 50% Reads - Concurrency: 128 | ECC Disabled: 78240.7 (SE +/- 990.71, N = 3) | ECC: 77301.1 (SE +/- 206.20, N = 3)
Workload: KV, 50% Reads - Concurrency: 256 | ECC Disabled: 76074.8 (SE +/- 241.90, N = 3) | ECC: 75288.6 (SE +/- 18.97, N = 3)
Workload: KV, 60% Reads - Concurrency: 128 | ECC Disabled: 83011.6 (SE +/- 727.86, N = 3) | ECC: 81837.5 (SE +/- 395.32, N = 3)
Workload: KV, 60% Reads - Concurrency: 256 | ECC Disabled: 79793.7 (SE +/- 156.76, N = 3) | ECC: 79189.8 (SE +/- 113.54, N = 3)
Workload: KV, 95% Reads - Concurrency: 128 | ECC Disabled: 100834.4 (SE +/- 76.68, N = 3) | ECC: 99577.6 (SE +/- 71.75, N = 3)
Workload: KV, 95% Reads - Concurrency: 256 | ECC Disabled: 98267.7 (SE +/- 571.60, N = 3) | ECC: 97540.6 (SE +/- 1100.25, N = 3)
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Embree 4.1 (Frames Per Second, More Is Better)
Binary: Pathtracer ISPC - Model: Crown | ECC Disabled: 24.38 (SE +/- 0.03, N = 3, MIN: 24.07 / MAX: 25.34) | ECC: 24.56 (SE +/- 0.04, N = 3, MIN: 24.27 / MAX: 25.25)
Binary: Pathtracer ISPC - Model: Asian Dragon | ECC Disabled: 26.14 (SE +/- 0.00, N = 3, MIN: 25.95 / MAX: 26.71) | ECC: 26.20 (SE +/- 0.06, N = 3, MIN: 25.89 / MAX: 26.82)
Binary: Pathtracer ISPC - Model: Asian Dragon Obj | ECC Disabled: 22.34 (SE +/- 0.00, N = 3, MIN: 22.15 / MAX: 22.84) | ECC: 22.33 (SE +/- 0.03, N = 3, MIN: 22.12 / MAX: 22.85)
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: GPAW 23.6 (Seconds, Fewer Is Better)
Input: Carbon Nanotube | ECC Disabled: 172.01 (SE +/- 0.11, N = 3) | ECC: 171.77 (SE +/- 0.24, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Graph500 3.0, Scale: 26 (TEPS, More Is Better)
bfs median_TEPS | ECC Disabled: 305288000 | ECC: 282503000
bfs max_TEPS | ECC Disabled: 306781000 | ECC: 283689000
sssp median_TEPS | ECC Disabled: 93647900 | ECC: 91728900
sssp max_TEPS | ECC Disabled: 124887000 | ECC: 120930000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
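TEPS (traversed edges per second) can be turned into an approximate time per search, which makes the Graph500 gap easier to picture. The sketch below assumes the Graph500 default edge factor of 16, so a Scale 26 graph has roughly 16 x 2^26 input edges. The benchmark itself counts only the edges in the traversed component, so these are estimates rather than the run's measured times. The function names are illustrative.

```python
def graph500_edge_count(scale: int, edgefactor: int = 16) -> int:
    """Approximate input edge count: edgefactor * 2**scale.

    Assumption: edgefactor 16 is the Graph500 default; the results
    page does not state the value used.
    """
    return edgefactor * 2 ** scale


def seconds_per_search(teps: float, scale: int, edgefactor: int = 16) -> float:
    """Rough time for one search implied by a TEPS figure."""
    return graph500_edge_count(scale, edgefactor) / teps


# bfs median_TEPS at Scale: 26
for label, teps in (("ECC Disabled", 305288000), ("ECC", 282503000)):
    print(f"{label}: ~{seconds_per_search(teps, 26):.2f} s per BFS")
```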
GROMACS This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: GROMACS 2023 (Ns Per Day, More Is Better)
Implementation: MPI CPU - Input: water_GMX50_bare | ECC Disabled: 2.118 (SE +/- 0.001, N = 3) | ECC: 2.106 (SE +/- 0.002, N = 3)
1. (CXX) g++ options: -O3
OpenBenchmarking.org: John The Ripper 2023.03.14 (Real C/S, More Is Better)
Test: WPA PSK | ECC Disabled: 131245 (SE +/- 15.84, N = 3) | ECC: 131313 (SE +/- 308.39, N = 3)
Test: Blowfish | ECC Disabled: 34438 (SE +/- 11.85, N = 3) | ECC: 34416 (SE +/- 20.78, N = 3)
Test: HMAC-SHA512 | ECC Disabled: 161177000 (SE +/- 1072155.00, N = 3) | ECC: 162879333 (SE +/- 182924.70, N = 3)
Test: MD5 | ECC Disabled: 3685667 (SE +/- 8685.88, N = 3) | ECC: 3666333 (SE +/- 6691.62, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Kvazaar 2.2 (Frames Per Second, More Is Better)
Video Input: Bosphorus 4K - Video Preset: Slow | ECC Disabled: 16.51 (SE +/- 0.01, N = 3) | ECC: 16.55 (SE +/- 0.01, N = 3)
Video Input: Bosphorus 4K - Video Preset: Medium | ECC Disabled: 16.94 (SE +/- 0.01, N = 3) | ECC: 16.95 (SE +/- 0.01, N = 3)
Video Input: Bosphorus 4K - Video Preset: Very Fast | ECC Disabled: 40.34 (SE +/- 0.02, N = 3) | ECC: 40.36 (SE +/- 0.04, N = 3)
Video Input: Bosphorus 4K - Video Preset: Super Fast | ECC Disabled: 51.21 (SE +/- 0.08, N = 3) | ECC: 51.26 (SE +/- 0.01, N = 3)
Video Input: Bosphorus 4K - Video Preset: Ultra Fast | ECC Disabled: 66.03 (SE +/- 0.05, N = 3) | ECC: 65.88 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Laghos 3.1 (Major Kernels Total Rate, More Is Better)
Test: Triple Point Problem | ECC Disabled: 163.07 (SE +/- 0.48, N = 3) | ECC: 163.36 (SE +/- 0.85, N = 3)
Test: Sedov Blast Wave, ube_922_hex.mesh | ECC Disabled: 177.52 (SE +/- 1.21, N = 3) | ECC: 176.27 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenBenchmarking.org: libavif avifenc 1.0 (Seconds, Fewer Is Better)
Encoder Speed: 6 | ECC Disabled: 3.938 (SE +/- 0.018, N = 3) | ECC: 3.837 (SE +/- 0.029, N = 3)
Encoder Speed: 10, Lossless | ECC Disabled: 3.817 (SE +/- 0.041, N = 5) | ECC: 3.723 (SE +/- 0.007, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
M N K: 32 | ECC Disabled: 107.2 (SE +/- 0.03, N = 3) | ECC: 107.4 (SE +/- 0.06, N = 3)
M N K: 64 | ECC Disabled: 215.0 (SE +/- 0.15, N = 3) | ECC: 214.9 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Liquid-DSP 1.6 (samples/s, More Is Better)
Threads: 16 - Buffer Length: 256 - Filter Length: 512 | ECC Disabled: 241173333 (SE +/- 138604.15, N = 3) | ECC: 244130000 (SE +/- 381575.68, N = 3)
Threads: 24 - Buffer Length: 256 - Filter Length: 512 | ECC Disabled: 304490000 (SE +/- 105039.68, N = 3) | ECC: 306083333 (SE +/- 453370.10, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: LuxCoreRender 2.6 (M samples/sec, More Is Better)
Scene: DLSC - Acceleration: CPU | ECC Disabled: 3.98 (SE +/- 0.02, N = 3, MIN: 3.86 / MAX: 4.21) | ECC: 3.97 (SE +/- 0.00, N = 3, MIN: 3.87 / MAX: 4.22)
Scene: Danish Mood - Acceleration: CPU | ECC Disabled: 3.22 (SE +/- 0.04, N = 3, MIN: 1.23 / MAX: 3.72) | ECC: 3.22 (SE +/- 0.04, N = 3, MIN: 1.2 / MAX: 3.72)
Scene: Orange Juice - Acceleration: CPU | ECC Disabled: 6.25 (SE +/- 0.01, N = 3, MIN: 5.72 / MAX: 6.47) | ECC: 6.28 (SE +/- 0.02, N = 3, MIN: 5.72 / MAX: 6.55)
Scene: LuxCore Benchmark - Acceleration: CPU | ECC Disabled: 3.66 (SE +/- 0.01, N = 3, MIN: 1.51 / MAX: 4.18) | ECC: 3.66 (SE +/- 0.01, N = 3, MIN: 1.52 / MAX: 4.17)
Scene: Rainbow Colors and Prism - Acceleration: CPU | ECC Disabled: 14.87 (SE +/- 0.13, N = 7, MIN: 13.14 / MAX: 15.38) | ECC: 14.97 (SE +/- 0.13, N = 3, MIN: 13.47 / MAX: 15.18)
OpenBenchmarking.org: MBW 2018-09-08 (MiB/s, More Is Better)
Test: Memory Copy - Array Size: 8192 MiB | ECC Disabled: 18187.18 (SE +/- 243.33, N = 3) | ECC: 17667.19 (SE +/- 158.15, N = 7)
Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB | ECC Disabled: 15938.54 (SE +/- 161.44, N = 15) | ECC: 15125.32 (SE +/- 85.40, N = 3)
1. (CC) gcc options: -O3 -march=native
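For context, the MBW figures can be set against the theoretical peak of the installed memory. The sketch below assumes the two DDR5-4800 DIMMs are populated one per channel (two channels) and that each channel carries 64 data bits per transfer, excluding ECC bits. Neither detail is stated in the system description, so the resulting fraction is only indicative. The function name is hypothetical.

```python
MIB = 1024 ** 2


def ddr_peak_bytes_per_sec(mega_transfers: float, channels: int = 2,
                           data_bytes_per_transfer: int = 8) -> float:
    """Theoretical peak bandwidth in bytes/sec.

    Assumptions: `channels` populated channels, each moving
    `data_bytes_per_transfer` bytes of data per transfer (64 bits).
    """
    return mega_transfers * 1e6 * data_bytes_per_transfer * channels


peak = ddr_peak_bytes_per_sec(4800)          # DDR5-4800MT/s
copy_off = 18187.18 * MIB                    # MBW Memory Copy, ECC Disabled
copy_on = 17667.19 * MIB                     # MBW Memory Copy, ECC

print(f"peak: {peak / 1e9:.1f} GB/s")
print(f"ECC Disabled: {copy_off / peak:.1%} of peak")
print(f"ECC: {copy_on / peak:.1%} of peak")
```

A memory copy both reads and writes each byte, so the traffic on the memory bus is higher than the copied-bytes rate suggests.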
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: NAMD 2.14 (days/ns, Fewer Is Better)
ATPase Simulation - 327,506 Atoms | ECC Disabled: 1.03624 (SE +/- 0.00078, N = 3) | ECC: 1.03599 (SE +/- 0.00211, N = 3)
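NAMD reports days/ns (fewer is better), while GROMACS reports ns per day (more is better). The two are reciprocals, so a small conversion helper, sketched here with an illustrative name, makes the molecular dynamics figures read the same way.

```python
def ns_per_day(days_per_ns: float) -> float:
    """Convert NAMD-style days/ns into ns/day (the reciprocal)."""
    if days_per_ns <= 0:
        raise ValueError("days_per_ns must be positive")
    return 1.0 / days_per_ns


# NAMD 2.14, ATPase Simulation - 327,506 Atoms
for label, value in (("ECC Disabled", 1.03624), ("ECC", 1.03599)):
    print(f"{label}: {ns_per_day(value):.4f} ns/day")
```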
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Test / Class: BT.C | ECC Disabled: 39142.85 (SE +/- 104.76, N = 3) | ECC: 39530.58 (SE +/- 14.62, N = 3)
Test / Class: CG.C | ECC Disabled: 9766.48 (SE +/- 28.33, N = 3) | ECC: 9818.08 (SE +/- 33.18, N = 3)
Test / Class: EP.C | ECC Disabled: 2342.69 (SE +/- 25.44, N = 4) | ECC: 2353.15 (SE +/- 28.31, N = 4)
Test / Class: FT.C | ECC Disabled: 22407.05 (SE +/- 51.21, N = 3) | ECC: 22585.66 (SE +/- 28.63, N = 3)
Test / Class: IS.D | ECC Disabled: 1300.77 (SE +/- 8.10, N = 3) | ECC: 1276.30 (SE +/- 0.84, N = 3)
Test / Class: LU.C | ECC Disabled: 41776.27 (SE +/- 184.92, N = 3) | ECC: 42129.34 (SE +/- 132.77, N = 3)
Test / Class: MG.C | ECC Disabled: 22628.24 (SE +/- 21.67, N = 3) | ECC: 22482.36 (SE +/- 39.09, N = 3)
Test / Class: SP.B | ECC Disabled: 18369.26 (SE +/- 24.91, N = 3) | ECC: 18505.14 (SE +/- 16.53, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: nginx 1.23.2 (Requests Per Second, More Is Better)
Connections: 500 | ECC Disabled: 118602.17 (SE +/- 118.52, N = 3) | ECC: 118799.08 (SE +/- 155.77, N = 3)
Connections: 1000 | ECC Disabled: 111520.62 (SE +/- 24.09, N = 3) | ECC: 108646.65 (SE +/- 266.94, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
NWChem NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: NWChem 7.0.2 (Seconds, Fewer Is Better)
Input: C240 Buckyball | ECC Disabled: 5290.5 | ECC: 5115.7
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
Input: drivaerFastback, Small Mesh Size
ECC Disabled: The test quit with a non-zero exit status. E: [0] --> FOAM FATAL ERROR:
ECC: The test quit with a non-zero exit status. E: [0] --> FOAM FATAL ERROR:
Input: drivaerFastback, Medium Mesh Size
ECC Disabled: The test quit with a non-zero exit status. E: [0] --> FOAM FATAL ERROR:
ECC: The test quit with a non-zero exit status. E: [0] --> FOAM FATAL ERROR:
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenRadioss 2023.09.15 - Seconds, Fewer Is Better
Model: Bumper Beam - ECC Disabled: 98.55 (SE +/- 0.53, N = 3); ECC: 97.51 (SE +/- 0.17, N = 3)
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenSSL 3.1 - More Is Better
Algorithm: SHA256 (byte/s) - ECC Disabled: 26965794363 (SE +/- 283646194.48, N = 3); ECC: 27345804760 (SE +/- 20175809.49, N = 3)
Algorithm: SHA512 (byte/s) - ECC Disabled: 8644944773 (SE +/- 829361.75, N = 3); ECC: 8675080823 (SE +/- 2949369.22, N = 3)
Algorithm: RSA4096 (sign/s) - ECC Disabled: 11581.9 (SE +/- 10.04, N = 3); ECC: 11651.1 (SE +/- 35.75, N = 3)
Algorithm: RSA4096 (verify/s) - ECC Disabled: 292853.7 (SE +/- 52.39, N = 3); ECC: 293872.1 (SE +/- 76.05, N = 3)
Algorithm: ChaCha20 (byte/s) - ECC Disabled: 102741841967 (SE +/- 19425517.36, N = 3); ECC: 103120195583 (SE +/- 8950166.36, N = 3)
Algorithm: AES-128-GCM (byte/s) - ECC Disabled: 200652491073 (SE +/- 62857109.96, N = 3); ECC: 201399715157 (SE +/- 59771765.00, N = 3)
Algorithm: AES-256-GCM (byte/s) - ECC Disabled: 171468566437 (SE +/- 81118637.19, N = 3); ECC: 172523134883 (SE +/- 52395564.84, N = 3)
Algorithm: ChaCha20-Poly1305 (byte/s) - ECC Disabled: 73055647790 (SE +/- 10402041.03, N = 3); ECC: 73295799273 (SE +/- 5719713.01, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenVINO
OpenVINO 2023.1 - Device: CPU - FPS, More Is Better; ms, Fewer Is Better
Model: Person Detection FP16
FPS - ECC Disabled: 76.87 (SE +/- 0.10, N = 3); ECC: 76.73 (SE +/- 0.09, N = 3)
ms - ECC Disabled: 77.99 (SE +/- 0.11, N = 3, MIN: 54.1 / MAX: 104.16); ECC: 78.12 (SE +/- 0.09, N = 3, MIN: 56.83 / MAX: 103.58)
Model: Face Detection FP16-INT8
FPS - ECC Disabled: 20.76 (SE +/- 0.01, N = 3); ECC: 20.73 (SE +/- 0.02, N = 3)
ms - ECC Disabled: 288.56 (SE +/- 0.11, N = 3, MIN: 267.28 / MAX: 296.44); ECC: 288.95 (SE +/- 0.33, N = 3, MIN: 267.2 / MAX: 297.65)
Model: Vehicle Detection FP16-INT8
FPS - ECC Disabled: 1306.54 (SE +/- 0.79, N = 3); ECC: 1307.43 (SE +/- 1.03, N = 3)
ms - ECC Disabled: 4.57 (SE +/- 0.00, N = 3, MIN: 2.52 / MAX: 13.14); ECC: 4.56 (SE +/- 0.00, N = 3, MIN: 2.54 / MAX: 13.32)
Model: Face Detection Retail FP16-INT8
FPS - ECC Disabled: 3623.19 (SE +/- 9.13, N = 3); ECC: 3630.27 (SE +/- 6.12, N = 3)
ms - ECC Disabled: 3.19 (SE +/- 0.01, N = 3, MIN: 1.75 / MAX: 9.52); ECC: 3.19 (SE +/- 0.01, N = 3, MIN: 1.74 / MAX: 11.21)
Model: Road Segmentation ADAS FP16-INT8
FPS - ECC Disabled: 401.89 (SE +/- 0.37, N = 3); ECC: 402.05 (SE +/- 1.67, N = 3)
ms - ECC Disabled: 14.91 (SE +/- 0.01, N = 3, MIN: 11.06 / MAX: 24.21); ECC: 14.90 (SE +/- 0.06, N = 3, MIN: 11.1 / MAX: 21.3)
Model: Machine Translation EN To DE FP16
FPS - ECC Disabled: 105.91 (SE +/- 0.13, N = 3); ECC: 105.39 (SE +/- 0.06, N = 3)
ms - ECC Disabled: 56.59 (SE +/- 0.08, N = 3, MIN: 45.24 / MAX: 67.38); ECC: 56.88 (SE +/- 0.03, N = 3, MIN: 44.67 / MAX: 72.07)
Model: Weld Porosity Detection FP16-INT8
FPS - ECC Disabled: 2130.59 (SE +/- 3.30, N = 3); ECC: 2128.62 (SE +/- 0.76, N = 3)
ms - ECC Disabled: 5.59 (SE +/- 0.01, N = 3, MIN: 2.76 / MAX: 13.23); ECC: 5.60 (SE +/- 0.00, N = 3, MIN: 2.88 / MAX: 13.28)
Model: Person Vehicle Bike Detection FP16
FPS - ECC Disabled: 1223.55 (SE +/- 5.33, N = 3); ECC: 1229.15 (SE +/- 4.37, N = 3)
ms - ECC Disabled: 4.89 (SE +/- 0.02, N = 3, MIN: 3.29 / MAX: 11.23); ECC: 4.87 (SE +/- 0.02, N = 3, MIN: 3.3 / MAX: 8.95)
Model: Handwritten English Recognition FP16-INT8
FPS - ECC Disabled: 468.29 (SE +/- 0.60, N = 3); ECC: 468.28 (SE +/- 1.10, N = 3)
ms - ECC Disabled: 25.57 (SE +/- 0.04, N = 3, MIN: 16.54 / MAX: 37.23); ECC: 25.58 (SE +/- 0.06, N = 3, MIN: 16.44 / MAX: 40.48)
Model: Age Gender Recognition Retail 0013 FP16-INT8
FPS - ECC Disabled: 39479.81 (SE +/- 91.39, N = 3); ECC: 39510.35 (SE +/- 41.68, N = 3)
ms - ECC Disabled: 0.27 (SE +/- 0.00, N = 3, MIN: 0.15 / MAX: 7.04); ECC: 0.27 (SE +/- 0.00, N = 3, MIN: 0.15 / MAX: 7.01)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OSPRay 2.12 - Items Per Second, More Is Better
Benchmark: particle_volume/ao/real_time - ECC Disabled: 5.93739 (SE +/- 0.00390, N = 3); ECC: 5.96233 (SE +/- 0.00168, N = 3)
Benchmark: particle_volume/scivis/real_time - ECC Disabled: 5.93370 (SE +/- 0.00404, N = 3); ECC: 6.02105 (SE +/- 0.00349, N = 3)
Benchmark: particle_volume/pathtracer/real_time - ECC Disabled: 194.14 (SE +/- 0.46, N = 3); ECC: 193.50 (SE +/- 0.07, N = 3)
Benchmark: gravity_spheres_volume/dim_512/ao/real_time - ECC Disabled: 5.96291 (SE +/- 0.02068, N = 3); ECC: 5.81175 (SE +/- 0.01403, N = 3)
Benchmark: gravity_spheres_volume/dim_512/scivis/real_time - ECC Disabled: 5.79397 (SE +/- 0.01901, N = 3); ECC: 5.63794 (SE +/- 0.00078, N = 3)
Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time - ECC Disabled: 6.94833 (SE +/- 0.00292, N = 3); ECC: 6.95107 (SE +/- 0.00444, N = 3)
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
PostgreSQL 16 - TPS, More Is Better; Average Latency in ms, Fewer Is Better
Scaling Factor: 1 - Clients: 800 - Mode: Read Only
TPS - ECC Disabled: 1057581 (SE +/- 6825.25, N = 3); ECC: 1050229 (SE +/- 1968.16, N = 3)
Average Latency - ECC Disabled: 0.756 (SE +/- 0.005, N = 3); ECC: 0.762 (SE +/- 0.001, N = 3)
Scaling Factor: 1 - Clients: 1000 - Mode: Read Only
TPS - ECC Disabled: 1006054 (SE +/- 5787.70, N = 3); ECC: 995905 (SE +/- 3860.24, N = 3)
Average Latency - ECC Disabled: 0.994 (SE +/- 0.006, N = 3); ECC: 1.004 (SE +/- 0.004, N = 3)
Scaling Factor: 1 - Clients: 800 - Mode: Read Write
TPS - ECC Disabled: 1517 (SE +/- 40.18, N = 12); ECC: 1531 (SE +/- 42.18, N = 9)
Average Latency - ECC Disabled: 531.29 (SE +/- 14.11, N = 12); ECC: 525.98 (SE +/- 14.93, N = 9)
Scaling Factor: 1 - Clients: 1000 - Mode: Read Write
TPS - ECC Disabled: 1445 (SE +/- 40.02, N = 12); ECC: 1388 (SE +/- 47.44, N = 12)
Average Latency - ECC Disabled: 698.43 (SE +/- 20.48, N = 12); ECC: 729.62 (SE +/- 24.99, N = 12)
Scaling Factor: 100 - Clients: 800 - Mode: Read Only
TPS - ECC Disabled: 923799 (SE +/- 9395.63, N = 3); ECC: 915305 (SE +/- 4872.30, N = 3)
Average Latency - ECC Disabled: 0.866 (SE +/- 0.009, N = 3); ECC: 0.874 (SE +/- 0.005, N = 3)
Scaling Factor: 100 - Clients: 1000 - Mode: Read Only
TPS - ECC Disabled: 881612 (SE +/- 2229.74, N = 3); ECC: 856806 (SE +/- 6917.40, N = 3)
Average Latency - ECC Disabled: 1.134 (SE +/- 0.003, N = 3); ECC: 1.167 (SE +/- 0.010, N = 3)
Scaling Factor: 100 - Clients: 800 - Mode: Read Write
TPS - ECC Disabled: 36881 (SE +/- 246.74, N = 3); ECC: 35767 (SE +/- 374.53, N = 3)
Average Latency - ECC Disabled: 21.69 (SE +/- 0.15, N = 3); ECC: 22.37 (SE +/- 0.24, N = 3)
Scaling Factor: 100 - Clients: 1000 - Mode: Read Write
TPS - ECC Disabled: 32685 (SE +/- 312.09, N = 3); ECC: 32176 (SE +/- 418.19, N = 3)
Average Latency - ECC Disabled: 30.60 (SE +/- 0.29, N = 3); ECC: 31.09 (SE +/- 0.40, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
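The TPS and average latency figures in the PostgreSQL results above are closely tied: with every client kept busy, the average latency is approximately the client count divided by the throughput. The sketch below checks that relationship against two of the read-only results. The helper name `average_latency_ms` is illustrative, and the formula is an observation from these numbers under the assumption of fully loaded clients, not a statement of how pgbench computes its report.

```python
def average_latency_ms(clients: int, tps: float) -> float:
    """Approximate per-transaction latency in milliseconds.

    Assumes all clients are continuously busy, so each client completes
    tps / clients transactions per second.
    """
    return clients / tps * 1000.0


# (clients, TPS, reported average latency in ms) for the ECC Disabled
# read-only runs at scaling factor 1, taken from the results above.
samples = [
    (800, 1057581, 0.756),
    (1000, 1006054, 0.994),
]

for clients, tps, reported in samples:
    estimate = average_latency_ms(clients, tps)
    print(f"{clients} clients: estimated {estimate:.3f} ms, reported {reported} ms")
```

Because the two metrics carry nearly the same information, a latency regression with ECC enabled is the mirror image of the corresponding TPS drop rather than an independent finding. Where the results are averaged over many runs, as in the scaling factor 1 read-write tests, the estimate and the reported mean can differ by a few milliseconds.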
RAMspeed SMP 3.5.0 - MB/s, More Is Better
Type: Copy - Benchmark: Integer - ECC Disabled: 63428.87 (SE +/- 333.42, N = 3); ECC: 62763.29 (SE +/- 247.45, N = 3)
Type: Scale - Benchmark: Integer - ECC Disabled: 63014.91 (SE +/- 309.22, N = 3); ECC: 62587.40 (SE +/- 259.25, N = 3)
Type: Triad - Benchmark: Integer - ECC Disabled: 56261.71 (SE +/- 634.29, N = 3); ECC: 55609.57 (SE +/- 523.00, N = 3)
Type: Average - Benchmark: Integer - ECC Disabled: 59817.80 (SE +/- 410.79, N = 3); ECC: 58968.86 (SE +/- 168.42, N = 3)
Type: Add - Benchmark: Floating Point - ECC Disabled: 55931.83 (SE +/- 563.17, N = 3); ECC: 56004.72 (SE +/- 412.39, N = 3)
Type: Copy - Benchmark: Floating Point - ECC Disabled: 63329.35 (SE +/- 349.92, N = 3); ECC: 62547.20 (SE +/- 316.77, N = 3)
Type: Scale - Benchmark: Floating Point - ECC Disabled: 63394.10 (SE +/- 215.92, N = 3); ECC: 63205.02 (SE +/- 76.35, N = 3)
Type: Triad - Benchmark: Floating Point - ECC Disabled: 55640.51 (SE +/- 321.29, N = 3); ECC: 54750.05 (SE +/- 220.23, N = 3)
Type: Average - Benchmark: Floating Point - ECC Disabled: 59227.55 (SE +/- 148.97, N = 3); ECC: 59313.91 (SE +/- 286.79, N = 3)
1. (CC) gcc options: -O3 -march=native
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
Remhos 1.0 - Seconds, Fewer Is Better
Test: Sample Remap Example - ECC Disabled: 32.13 (SE +/- 0.29, N = 3); ECC: 31.71 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
RocksDB 8.0 - Op/s, More Is Better
Test: Random Fill - ECC Disabled: 1215131 (SE +/- 1908.55, N = 3); ECC: 1211432 (SE +/- 1532.71, N = 3)
Test: Random Read - ECC Disabled: 106335519 (SE +/- 218187.96, N = 3); ECC: 106312822 (SE +/- 55540.17, N = 3)
Test: Update Random - ECC Disabled: 780282 (SE +/- 582.27, N = 3); ECC: 776559 (SE +/- 1160.52, N = 3)
Test: Sequential Fill - ECC Disabled: 1324715 (SE +/- 3332.23, N = 3); ECC: 1326951 (SE +/- 3425.42, N = 3)
Test: Random Fill Sync - ECC Disabled: 127674 (SE +/- 394.20, N = 3); ECC: 131422 (SE +/- 2406.06, N = 15)
Test: Read While Writing - ECC Disabled: 3277737 (SE +/- 2700.07, N = 3); ECC: 3293184 (SE +/- 26280.53, N = 3)
Test: Read Random Write Random - ECC Disabled: 2727147 (SE +/- 17250.82, N = 3); ECC: 2715841 (SE +/- 9449.73, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
srsRAN Project 23.5 - Mbps, More Is Better
Test: Downlink Processor Benchmark - ECC Disabled: 1062.9 (SE +/- 6.06, N = 3); ECC: 1064.7 (SE +/- 3.71, N = 3)
Test: PUSCH Processor Benchmark, Throughput Total - ECC Disabled: 3444.6 (SE +/- 1.53, N = 3); ECC: 3457.7 (SE +/- 7.41, N = 3)
Test: PUSCH Processor Benchmark, Throughput Thread - ECC Disabled: 214.4 (SE +/- 0.92, N = 3); ECC: 212.0 (SE +/- 0.35, N = 3)
1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
Stream 2013-01-17 - MB/s, More Is Better
Type: Scale - ECC Disabled: 35867.3 (SE +/- 13.93, N = 5); ECC: 35849.8 (SE +/- 22.50, N = 5)
Type: Triad - ECC Disabled: 39833.5 (SE +/- 5.66, N = 5); ECC: 39769.0 (SE +/- 23.27, N = 5)
Type: Add - ECC Disabled: 39735.4 (SE +/- 25.08, N = 5); ECC: 39666.0 (SE +/- 24.36, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
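For context on the Stream figures above, the measured bandwidth can be set against the theoretical peak of the DDR5-4800 memory listed in the system details. The sketch below is a rough calculation under assumptions that are not stated in the results: that the two DIMMs run in dual-channel mode, that each channel carries 64 data bits (8 bytes) per transfer, and that MB/s means 10^6 bytes per second. The function names are illustrative.

```python
def peak_bandwidth_mb_s(mega_transfers_per_s: int,
                        channels: int = 2,
                        bytes_per_transfer: int = 8) -> int:
    """Theoretical peak memory bandwidth in MB/s (10^6 bytes per second).

    Assumes dual-channel operation with 8 data bytes per channel per
    transfer; ECC check bits are not counted as usable bandwidth.
    """
    return mega_transfers_per_s * bytes_per_transfer * channels


def efficiency(measured_mb_s: float, peak_mb_s: float) -> float:
    """Fraction of the theoretical peak that a benchmark reached."""
    return measured_mb_s / peak_mb_s


peak = peak_bandwidth_mb_s(4800)  # DDR5-4800MT/s from the system details
for label, measured in [("ECC Disabled", 39833.5), ("ECC", 39769.0)]:
    print(f"Stream Triad, {label}: {efficiency(measured, peak):.1%} of {peak} MB/s")
```

Under these assumptions both configurations reach a little over half of the theoretical peak in Stream Triad, and the difference between them is a small fraction of a percentage point.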
SVT-AV1
SVT-AV1 1.7 - Input: Bosphorus 4K - Frames Per Second, More Is Better
Encoder Mode: Preset 4 - ECC Disabled: 5.099 (SE +/- 0.006, N = 3); ECC: 5.123 (SE +/- 0.012, N = 3)
Encoder Mode: Preset 8 - ECC Disabled: 72.09 (SE +/- 0.52, N = 3); ECC: 72.36 (SE +/- 0.40, N = 3)
Encoder Mode: Preset 12 - ECC Disabled: 160.06 (SE +/- 0.18, N = 3); ECC: 161.59 (SE +/- 0.32, N = 3)
Encoder Mode: Preset 13 - ECC Disabled: 159.53 (SE +/- 0.32, N = 3); ECC: 160.91 (SE +/- 0.44, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
SVT-VP9 0.3 - Input: Bosphorus 4K - Frames Per Second, More Is Better
Tuning: VMAF Optimized - ECC Disabled: 92.55 (SE +/- 1.68, N = 12); ECC: 92.71 (SE +/- 1.61, N = 12)
Tuning: PSNR/SSIM Optimized - ECC Disabled: 100.16 (SE +/- 0.20, N = 3); ECC: 99.57 (SE +/- 0.20, N = 3)
Tuning: Visual Quality Optimized - ECC Disabled: 86.92 (SE +/- 0.17, N = 3); ECC: 87.32 (SE +/- 0.26, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note that the Phoronix Test Suite also offers pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if complementary metrics are desired. Learn more via the OpenBenchmarking.org test page.
TensorFlow 2.12 - Device: CPU - Model: ResNet-50 - images/sec, More Is Better
Batch Size: 16 - ECC Disabled: 35.75 (SE +/- 0.06, N = 3); ECC: 35.68 (SE +/- 0.06, N = 3)
Batch Size: 32 - ECC Disabled: 36.66 (SE +/- 0.01, N = 3); ECC: 36.61 (SE +/- 0.01, N = 3)
Batch Size: 64 - ECC Disabled: 36.78 (SE +/- 0.01, N = 3); ECC: 36.74 (SE +/- 0.02, N = 3)
Batch Size: 256 - ECC Disabled: 36.91 (SE +/- 0.01, N = 3); ECC: 36.83 (SE +/- 0.01, N = 3)
uvg266 0.4.1 - Video Input: Bosphorus 4K - Frames Per Second, More Is Better
Video Preset: Medium - ECC Disabled: 12.07 (SE +/- 0.01, N = 3); ECC: 12.09 (SE +/- 0.01, N = 3)
Video Preset: Very Fast - ECC Disabled: 34.95 (SE +/- 0.01, N = 3); ECC: 35.02 (SE +/- 0.01, N = 3)
Video Preset: Super Fast - ECC Disabled: 36.97 (SE +/- 0.03, N = 3); ECC: 37.09 (SE +/- 0.05, N = 3)
Video Preset: Ultra Fast - ECC Disabled: 43.29 (SE +/- 0.01, N = 3); ECC: 43.40 (SE +/- 0.12, N = 3)
WRF WRF, the Weather Research and Forecasting Model, is a "next-generation mesoscale numerical weather prediction system designed for both atmospheric research and operational forecasting applications. It features two dynamical cores, a data assimilation system, and a software architecture supporting parallel computation and system extensibility." Learn more via the OpenBenchmarking.org test page.
Input: conus 2.5km
ECC Disabled: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-1U4LW-C252 exited on signal 9 (Killed).
ECC: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-1U4LW-C252 exited on signal 9 (Killed).
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equations together with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
Xcompact3d Incompact3d 2021-03-11 - Seconds, Fewer Is Better
Input: input.i3d 129 Cells Per Direction - ECC Disabled: 16.63 (SE +/- 0.04, N = 3); ECC: 16.65 (SE +/- 0.02, N = 3)
Input: input.i3d 193 Cells Per Direction - ECC Disabled: 78.22 (SE +/- 0.70, N = 3); ECC: 77.11 (SE +/- 0.60, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Testing initiated at 1 October 2023 19:04 by user root.
ECC Processor: AMD Ryzen 9 7900X 12-Core @ 5.73GHz (12 Cores / 24 Threads), Motherboard: ASRockRack B650D4U-2L2T/BCM (2.09 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32 GB DDR5-4800MT/s MTC20C2085S1EC48BA1, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 0GB Virtual HDisk0 + 0GB Virtual HDisk1 + 0GB Virtual HDisk2 + 0GB Virtual HDisk3, Graphics: ASPEED 512MB, Audio: AMD Device 1640, Monitor: VA2431, Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 22.04, Kernel: 6.6.0-060600rc1daily20230913-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server, Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203
Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 3 October 2023 05:22 by user root.