AMD EPYC 7763 1P spec_rstack_overflow

Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2308109-NE-EPYC7763169

Run Management

Result Identifier        Date Run          Test Duration
off                      August 10 2023    9 Hours, 10 Minutes
safe RET no microcode    August 09 2023    9 Hours, 26 Minutes
safe RET                 August 10 2023    9 Hours, 17 Minutes
Average                                    9 Hours, 18 Minutes


AMD EPYC 7763 1P spec_rstack_overflow Benchmarks

Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)
Motherboard: AMD DAYTONA_X (RYM1009B BIOS)
Chipset: AMD Starship/Matisse
Memory: 256GB
Disk: 800GB INTEL SSDPF21Q800GB
Graphics: ASPEED
Monitor: VE228
Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04
Kernel: 6.5.0-rc5-phx-tues (x86_64)
Desktop: GNOME Shell 42.5
Display Server: X Server 1.21.1.3
Vulkan: 1.3.224
Compiler: GCC 11.3.0 + LLVM 14.0.0
File-System: ext4
Screen Resolution: 1920x1080

System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- NONE / errors=remount-ro,relatime,rw / Block Size: 4096
- off: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
- safe RET no microcode: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
- safe RET: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1
- OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)
- Python 3.10.6
- off: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- safe RET no microcode: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- safe RET: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
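The mitigation states in the system logs above come from the kernel's per-vulnerability status reporting. As a minimal sketch, assuming a Linux system that exposes the standard sysfs directory `/sys/devices/system/cpu/vulnerabilities`, the current spec_rstack_overflow state could be read like this. The helper name `read_vulnerability_status` is hypothetical and not part of the Phoronix Test Suite.

```python
from pathlib import Path
from typing import Optional

# Standard Linux sysfs directory holding one status file per CPU vulnerability.
VULN_DIR = Path("/sys/devices/system/cpu/vulnerabilities")


def read_vulnerability_status(name: str, base: Path = VULN_DIR) -> Optional[str]:
    """Return the kernel-reported status string, or None if the file is unavailable."""
    try:
        return (base / name).read_text().strip()
    except OSError:
        # Older kernels or non-Linux systems will not have this file.
        return None


if __name__ == "__main__":
    status = read_vulnerability_status("spec_rstack_overflow")
    print(f"spec_rstack_overflow: {status}")
```

The exact wording of the status string depends on the kernel version, so it is best treated as free-form text rather than matched against fixed values.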

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 1.3.1 - Benchmark: vklBenchmark ISPC
Items / Sec, More Is Better
  off: 453 (SE +/- 0.58, N = 3; MIN: 84 / MAX: 2528)
  safe RET no microcode: 452 (SE +/- 0.58, N = 3; MIN: 85 / MAX: 2535)
  safe RET: 453 (SE +/- 0.00, N = 3; MIN: 84 / MAX: 2520)

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note that the Phoronix Test Suite also offers pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if complementary metrics are desired. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 64 - Model: ResNet-50
images/sec, More Is Better
  off: 17.78 (SE +/- 0.06, N = 3)
  safe RET no microcode: 15.56 (SE +/- 0.02, N = 3)
  safe RET: 15.65 (SE +/- 0.03, N = 3)

CockroachDB

CockroachDB is a cloud-native, distributed SQL database for data-intensive applications. This test profile uses a server-less CockroachDB configuration to test various CockroachDB workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.

CockroachDB 22.2 - Workload: KV, 50% Reads - Concurrency: 128
ops/s, More Is Better
  off: 103635.0 (SE +/- 275.86, N = 3)
  safe RET no microcode: 100851.4 (SE +/- 719.41, N = 15)
  safe RET: 99601.6 (SE +/- 948.29, N = 15)

CockroachDB 22.2 - Workload: KV, 95% Reads - Concurrency: 128
ops/s, More Is Better
  off: 135187.2 (SE +/- 931.05, N = 3)
  safe RET no microcode: 131487.0 (SE +/- 1043.12, N = 13)
  safe RET: 132046.0 (SE +/- 1387.70, N = 15)

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

MariaDB 11.0.1 - Clients: 8192
Queries Per Second, More Is Better
  off: 355 (SE +/- 3.35, N = 3)
  safe RET no microcode: 301 (SE +/- 0.73, N = 3)
  safe RET: 301 (SE +/- 1.18, N = 3)
  1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 6.1 - Build: allmodconfig
Seconds, Fewer Is Better
  off: 289.06 (SE +/- 0.49, N = 3)
  safe RET no microcode: 344.24 (SE +/- 0.90, N = 3)
  safe RET: 338.16 (SE +/- 0.79, N = 3)
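To put the allmodconfig numbers above in relative terms, the following sketch computes the percentage of performance lost against the "off" run. The function name `percent_slowdown` is illustrative only; the input values are the three means reported above.

```python
def percent_slowdown(baseline: float, mitigated: float, higher_is_better: bool) -> float:
    """Percent of performance lost relative to baseline (positive means the mitigated run is worse)."""
    if higher_is_better:
        return (baseline - mitigated) / baseline * 100.0
    return (mitigated - baseline) / baseline * 100.0


# Timed Linux Kernel Compilation 6.1, allmodconfig (seconds, fewer is better).
off, no_microcode, safe_ret = 289.06, 344.24, 338.16

print(f"safe RET no microcode: {percent_slowdown(off, no_microcode, False):.1f}% slower")
print(f"safe RET: {percent_slowdown(off, safe_ret, False):.1f}% slower")
```

For this build the two mitigated runs come out roughly 19% and 17% slower than "off". The `higher_is_better` flag matters because the result file mixes "More Is Better" and "Fewer Is Better" metrics.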

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.

ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Third Run
Queries Per Minute, Geo Mean, More Is Better
  off: 362.64 (SE +/- 2.21, N = 3; MIN: 31.5 / MAX: 4285.71)
  safe RET no microcode: 329.19 (SE +/- 3.27, N = 5; MIN: 31.32 / MAX: 2857.14)
  safe RET: 337.45 (SE +/- 1.85, N = 3; MIN: 31.46 / MAX: 4000)

ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Second Run
Queries Per Minute, Geo Mean, More Is Better
  off: 361.81 (SE +/- 1.42, N = 3; MIN: 31.46 / MAX: 4000)
  safe RET no microcode: 337.01 (SE +/- 2.16, N = 5; MIN: 30.49 / MAX: 3529.41)
  safe RET: 337.12 (SE +/- 4.86, N = 3; MIN: 30.79 / MAX: 4000)

ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, First Run / Cold Cache
Queries Per Minute, Geo Mean, More Is Better
  off: 349.43 (SE +/- 0.68, N = 3; MIN: 31.06 / MAX: 4285.71)
  safe RET no microcode: 323.42 (SE +/- 3.38, N = 5; MIN: 30.82 / MAX: 5000)
  safe RET: 318.12 (SE +/- 2.94, N = 3; MIN: 30.57 / MAX: 3333.33)
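The ClickHouse description above says the reported value is a geometric mean over the separate queries. The sketch below shows why that choice matters when per-query rates span orders of magnitude, as the MIN/MAX figures suggest. The per-query values are hypothetical, chosen only to fall within the reported MIN/MAX range. They are not taken from the result file, and the helper name `summarize_queries` is illustrative.

```python
from statistics import geometric_mean


def summarize_queries(per_query_qpm):
    """Return (arithmetic mean, geometric mean) of per-query rates in queries per minute."""
    arithmetic = sum(per_query_qpm) / len(per_query_qpm)
    return arithmetic, geometric_mean(per_query_qpm)


# Hypothetical per-query rates, NOT measured data.
sample = [31.5, 120.0, 480.0, 4285.71]
arithmetic, geometric = summarize_queries(sample)
print(f"arithmetic mean: {arithmetic:.2f} QPM")
print(f"geometric mean:  {geometric:.2f} QPM")
```

The arithmetic mean is dominated by the fastest query, while the geometric mean weights relative changes in every query equally, which is the usual reason for aggregating heterogeneous benchmark queries this way.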

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Execution Time
Seconds, Fewer Is Better
  off: 633.52
  safe RET no microcode: 644.36
  safe RET: 643.71
  1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Mesh Time
Seconds, Fewer Is Better
  off: 140.62
  safe RET no microcode: 145.06
  safe RET: 144.02
  1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

MariaDB

MariaDB 11.0.1 - Clients: 4096
Queries Per Second, More Is Better
  off: 590 (SE +/- 5.48, N = 3)
  safe RET no microcode: 412 (SE +/- 2.96, N = 3)
  safe RET: 418 (SE +/- 3.51, N = 3)
  1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay 2.12 - Benchmark: particle_volume/pathtracer/real_time
Items Per Second, More Is Better
  off: 157.83 (SE +/- 0.21, N = 3)
  safe RET no microcode: 155.17 (SE +/- 1.83, N = 3)
  safe RET: 156.42 (SE +/- 0.07, N = 3)

Apache IoTDB

Apache IoTDB 1.1.2 - Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500
Average Latency, Fewer Is Better
  off: 117.97 (SE +/- 0.95, N = 10; MAX: 4652.25)
  safe RET no microcode: 123.61 (SE +/- 1.63, N = 5; MAX: 4533.33)
  safe RET: 120.06 (SE +/- 0.86, N = 15; MAX: 4495.21)

Apache IoTDB 1.1.2 - Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500
point/sec, More Is Better
  off: 39463981.42 (SE +/- 302926.36, N = 10)
  safe RET no microcode: 37720117.40 (SE +/- 394126.89, N = 5)
  safe RET: 38833415.97 (SE +/- 288707.73, N = 15)

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 16.0 - Build System: Ninja
Seconds, Fewer Is Better
  off: 176.37 (SE +/- 0.11, N = 3)
  safe RET no microcode: 182.17 (SE +/- 0.11, N = 3)
  safe RET: 181.53 (SE +/- 0.15, N = 3)

OpenRadioss

OpenRadioss is an open-source, AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test currently uses a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2022.10.13 - Model: INIVOL and Fluid Structure Interaction Drop Container
Seconds, Fewer Is Better
  off: 162.13 (SE +/- 0.16, N = 3)
  safe RET no microcode: 163.02 (SE +/- 0.39, N = 3)
  safe RET: 163.97 (SE +/- 0.50, N = 3)
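Small gaps such as the one above are easier to judge against the reported standard errors. As a rough heuristic only (not a formal significance test, and weak with N = 3), the sketch below expresses the gap between two means in units of their combined standard error. The function name `gap_in_standard_errors` is hypothetical, and combining the errors as the square root of the sum of squares assumes the runs are independent.

```python
import math


def gap_in_standard_errors(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Absolute difference of two means divided by their combined standard error."""
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined == 0.0:
        # No reported spread: any nonzero gap is unbounded in these units.
        return math.inf if mean_a != mean_b else 0.0
    return abs(mean_a - mean_b) / combined


# OpenRadioss INIVOL model: off vs. safe RET no microcode (seconds).
print(f"OpenRadioss INIVOL: {gap_in_standard_errors(162.13, 0.16, 163.02, 0.39):.1f} SE")
# Timed Linux Kernel Compilation, allmodconfig: off vs. safe RET (seconds).
print(f"Kernel allmodconfig: {gap_in_standard_errors(289.06, 0.49, 338.16, 0.79):.1f} SE")
```

The OpenRadioss gap is only a couple of combined standard errors wide, while the kernel build gap is dozens, which matches the impression that the former is close to run-to-run noise and the latter is not.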

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 19.8.1 - Time To Compile
Seconds, Fewer Is Better
  off: 164.27 (SE +/- 0.14, N = 3)
  safe RET no microcode: 172.75 (SE +/- 0.12, N = 3)
  safe RET: 173.06 (SE +/- 0.05, N = 3)

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark
Score, More Is Better
  off: 457.23 (SE +/- 1.76, N = 3)
  safe RET no microcode: 418.95 (SE +/- 2.01, N = 3)
  safe RET: 422.58 (SE +/- 0.84, N = 3)

OpenRadioss

OpenRadioss 2022.10.13 - Model: Bird Strike on Windshield
Seconds, Fewer Is Better
  off: 144.83 (SE +/- 0.07, N = 3)
  safe RET no microcode: 152.91 (SE +/- 0.67, N = 3)
  safe RET: 152.27 (SE +/- 0.73, N = 3)

OSPRay

OSPRay 2.12 - Benchmark: particle_volume/scivis/real_time
Items Per Second, More Is Better
  off: 17.75 (SE +/- 0.01, N = 3)
  safe RET no microcode: 17.73 (SE +/- 0.02, N = 3)
  safe RET: 17.73 (SE +/- 0.01, N = 3)

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream
ms/batch, Fewer Is Better
  off: 678.93 (SE +/- 1.21, N = 3)
  safe RET no microcode: 679.58 (SE +/- 1.47, N = 3)
  safe RET: 682.21 (SE +/- 0.96, N = 3)

Neural Magic DeepSparse 1.5 - Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream
items/sec, More Is Better
  off: 46.70 (SE +/- 0.11, N = 3)
  safe RET no microcode: 46.72 (SE +/- 0.08, N = 3)
  safe RET: 46.57 (SE +/- 0.03, N = 3)

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency
ms, Fewer Is Better
  off: 0.256 (SE +/- 0.000, N = 3)
  safe RET no microcode: 0.296 (SE +/- 0.003, N = 3)
  safe RET: 0.289 (SE +/- 0.004, N = 3)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only
TPS, More Is Better
  off: 3128719 (SE +/- 1705.16, N = 3)
  safe RET no microcode: 2707280 (SE +/- 29158.10, N = 3)
  safe RET: 2768445 (SE +/- 34286.68, N = 3)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency
ms, Fewer Is Better
  off: 12.99 (SE +/- 0.09, N = 3)
  safe RET no microcode: 14.50 (SE +/- 0.02, N = 3)
  safe RET: 14.59 (SE +/- 0.06, N = 3)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write
TPS, More Is Better
  off: 61604 (SE +/- 418.78, N = 3)
  safe RET no microcode: 55175 (SE +/- 66.28, N = 3)
  safe RET: 54837 (SE +/- 207.71, N = 3)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
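The PostgreSQL latency and TPS figures above appear to be two views of the same measurement. Assuming a closed loop in which each of the 800 clients issues its next transaction as soon as the previous one completes, throughput is roughly the client count divided by the average latency. The sketch below checks that assumption against the reported means. The function name `estimated_tps` is illustrative.

```python
def estimated_tps(clients: int, avg_latency_ms: float) -> float:
    """Approximate closed-loop throughput: clients divided by average latency in seconds."""
    return clients / (avg_latency_ms / 1000.0)


CLIENTS = 800
# (mode, run, reported average latency in ms, reported TPS) from the results above.
reported = [
    ("Read Only", "off", 0.256, 3128719),
    ("Read Only", "safe RET no microcode", 0.296, 2707280),
    ("Read Only", "safe RET", 0.289, 2768445),
    ("Read Write", "off", 12.99, 61604),
    ("Read Write", "safe RET no microcode", 14.50, 55175),
    ("Read Write", "safe RET", 14.59, 54837),
]

for mode, run, latency_ms, tps in reported:
    estimate = estimated_tps(CLIENTS, latency_ms)
    deviation = (estimate - tps) / tps * 100.0
    print(f"{mode:10s} {run:22s} estimated {estimate:12.0f} vs reported {tps:9d} ({deviation:+.2f}%)")
```

Because the two metrics track each other this closely, the latency and TPS graphs for a given mode should not be counted as independent evidence of a slowdown.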

Timed MrBayes Analysis

This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis
Seconds, Fewer Is Better
  off: 136.69 (SE +/- 0.85, N = 3)
  safe RET no microcode: 137.52 (SE +/- 0.66, N = 3)
  safe RET: 138.85 (SE +/- 1.05, N = 3)
  1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

Apache Spark

This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmarks Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating test data and running various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.

Apache Spark 3.3 - Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time
Seconds, Fewer Is Better
  off: 1.30 (SE +/- 0.01, N = 15)
  safe RET no microcode: 1.39 (SE +/- 0.02, N = 3)
  safe RET: 1.41 (SE +/- 0.01, N = 3)

Apache Spark 3.3 - Row Count: 1000000 - Partitions: 100 - Inner Join Test Time
Seconds, Fewer Is Better
  off: 1.88 (SE +/- 0.02, N = 15)
  safe RET no microcode: 2.22 (SE +/- 0.05, N = 3)
  safe RET: 2.14 (SE +/- 0.06, N = 3)

Apache Spark 3.3 - Row Count: 1000000 - Partitions: 100 - Repartition Test Time
Seconds, Fewer Is Better
  off: 2.09 (SE +/- 0.04, N = 15)
  safe RET no microcode: 2.38 (SE +/- 0.04, N = 3)
  safe RET: 2.26 (SE +/- 0.04, N = 3)

Apache Spark 3.3 - Row Count: 1000000 - Partitions: 100 - Group By Test Time
Seconds, Fewer Is Better
  off: 4.91 (SE +/- 0.04, N = 15)
  safe RET no microcode: 5.17 (SE +/- 0.08, N = 3)
  safe RET: 5.15 (SE +/- 0.07, N = 3)

Apache Spark 3.3 - Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark
Seconds, Fewer Is Better
  off: 31.84 (SE +/- 0.12, N = 15)
  safe RET no microcode: 32.02 (SE +/- 0.01, N = 3)
  safe RET: 31.43 (SE +/- 0.33, N = 3)

Apache Spark 3.3 - Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time
Seconds, Fewer Is Better
  off: 3.39 (SE +/- 0.03, N = 15)
  safe RET no microcode: 3.42 (SE +/- 0.04, N = 3)
  safe RET: 3.47 (SE +/- 0.04, N = 3)

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

Apache Cassandra 4.1.3 - Test: Writes
Op/s, More Is Better
  off: 238741 (SE +/- 413.74, N = 3)
  safe RET no microcode: 233069 (SE +/- 950.59, N = 3)
  safe RET: 236241 (SE +/- 479.91, N = 3)

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation 4.0 - Time To Compile
Seconds, Fewer Is Better
  off: 121.95 (SE +/- 0.19, N = 3)
  safe RET no microcode: 125.66 (SE +/- 0.33, N = 3)
  safe RET: 125.06 (SE +/- 0.24, N = 3)

Redis 7.0.12 + memtier_benchmark

Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10
Ops/sec, More Is Better
  off: 2195705.51 (SE +/- 30210.22, N = 3)
  safe RET no microcode: 2154339.26 (SE +/- 16754.20, N = 10)
  safe RET: 2157815.69 (SE +/- 792.70, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Apache IoTDB

Apache IoTDB 1.1.2 - Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500
Average Latency, Fewer Is Better
  off: 78.92 (SE +/- 2.14, N = 3; MAX: 1729.94)
  safe RET no microcode: 79.19 (SE +/- 0.81, N = 4; MAX: 5165.86)
  safe RET: 82.24 (SE +/- 1.29, N = 3; MAX: 3625.32)

Apache IoTDB 1.1.2 - Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500
point/sec, More Is Better
  off: 58682618.18 (SE +/- 817020.04, N = 3)
  safe RET no microcode: 58073516.79 (SE +/- 648692.91, N = 4)
  safe RET: 57099408.15 (SE +/- 721225.08, N = 3)

Apache IoTDB 1.1.2 - Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200
Average Latency, Fewer Is Better
  off: 37.70 (SE +/- 0.55, N = 15; MAX: 802.64)
  safe RET no microcode: 35.10 (SE +/- 0.62, N = 3; MAX: 728.37)
  safe RET: 37.53 (SE +/- 0.52, N = 15; MAX: 755.16)

Apache IoTDB 1.1.2 - Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200
point/sec, More Is Better
  off: 43665846.28 (SE +/- 574678.74, N = 15)
  safe RET no microcode: 46538766.01 (SE +/- 614274.26, N = 3)
  safe RET: 44027904.89 (SE +/- 543529.82, N = 15)

OSPRay

OSPRay 2.12 - Benchmark: particle_volume/ao/real_time
Items Per Second, More Is Better
  off: 18.02 (SE +/- 0.02, N = 3)
  safe RET no microcode: 18.03 (SE +/- 0.01, N = 3)
  safe RET: 17.98 (SE +/- 0.04, N = 3)

OpenRadioss

OpenRadioss 2022.10.13 - Model: Bumper Beam
Seconds, Fewer Is Better
  off: 87.72 (SE +/- 0.33, N = 3)
  safe RET no microcode: 93.68 (SE +/- 0.08, N = 3)
  safe RET: 93.90 (SE +/- 0.03, N = 3)

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile makes use of the wrk program for issuing HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

nginx 1.23.2 - Connections: 1000
Requests Per Second, More Is Better
  off: 166499.89 (SE +/- 362.13, N = 3)
  safe RET no microcode: 140555.98 (SE +/- 352.89, N = 3)
  safe RET: 143271.26 (SE +/- 314.03, N = 3)
  1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx 1.23.2 - Connections: 500
Requests Per Second, More Is Better
  off: 169583.15 (SE +/- 284.72, N = 3)
  safe RET no microcode: 144020.03 (SE +/- 284.55, N = 3)
  safe RET: 142619.84 (SE +/- 251.96, N = 3)
  1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream
ms/batch, Fewer Is Better
  off: 55.39 (SE +/- 0.04, N = 3)
  safe RET no microcode: 55.37 (SE +/- 0.04, N = 3)
  safe RET: 55.40 (SE +/- 0.03, N = 3)

Neural Magic DeepSparse 1.5 - Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream
items/sec, More Is Better
  off: 576.97 (SE +/- 0.44, N = 3)
  safe RET no microcode: 577.06 (SE +/- 0.39, N = 3)
  safe RET: 576.82 (SE +/- 0.39, N = 3)

OpenRadioss

OpenRadioss 2022.10.13 - Model: Rubber O-Ring Seal Installation
Seconds, Fewer Is Better
  off: 77.46 (SE +/- 0.23, N = 3)
  safe RET no microcode: 85.04 (SE +/- 0.17, N = 3)
  safe RET: 84.48 (SE +/- 0.24, N = 3)

Blender

Blender 3.6 - Blend File: Pabellon Barcelona - Compute: CPU-Only
Seconds, Fewer Is Better
  off: 84.50 (SE +/- 0.13, N = 3)
  safe RET no microcode: 84.69 (SE +/- 0.02, N = 3)
  safe RET: 84.49 (SE +/- 0.04, N = 3)

OpenVINO

This is a test of Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3 - Model: Person Detection FP16 - Device: CPU
ms, Fewer Is Better
  off: 4092.91 (SE +/- 14.65, N = 3; MIN: 3409.52 / MAX: 4641.43)
  safe RET no microcode: 4124.74 (SE +/- 6.87, N = 3; MIN: 2129.26 / MAX: 5016.36)
  safe RET: 4114.58 (SE +/- 10.77, N = 3; MIN: 2087 / MAX: 5053.62)
  1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Person Detection FP16 - Device: CPU
FPS, More Is Better
  off: 7.68 (SE +/- 0.02, N = 3)
  safe RET no microcode: 7.58 (SE +/- 0.01, N = 3)
  safe RET: 7.60 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Redis 7.0.12 + memtier_benchmark

Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5
Ops/sec, More Is Better
  off: 2204628.92 (SE +/- 11955.97, N = 3)
  safe RET no microcode: 2218601.79 (SE +/- 31351.12, N = 3)
  safe RET: 2145436.26 (SE +/- 1778.76, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5
Ops/sec, More Is Better
  off: 2197287.30 (SE +/- 14704.83, N = 3)
  safe RET no microcode: 2167181.09 (SE +/- 17712.54, N = 3)
  safe RET: 2145052.14 (SE +/- 4916.89, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Timed Linux Kernel Compilation

Timed Linux Kernel Compilation 6.1 - Build: defconfig
Seconds, Fewer Is Better
  off: 31.19 (SE +/- 0.34, N = 5)
  safe RET no microcode: 37.62 (SE +/- 0.35, N = 6)
  safe RET: 37.24 (SE +/- 0.37, N = 6)

Redis 7.0.12 + memtier_benchmark

Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10
Ops/sec, More Is Better
  off: 2177211.80 (SE +/- 14630.02, N = 3)
  safe RET no microcode: 2173694.77 (SE +/- 17754.58, N = 3)
  safe RET: 2172804.71 (SE +/- 2448.62, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

RocksDB 8.0 - Test: Read Random Write Random
Op/s, More Is Better
  off: 2951684 (SE +/- 35283.44, N = 4)
  safe RET no microcode: 2872765 (SE +/- 18652.89, N = 3)
  safe RET: 2839085 (SE +/- 21895.20, N = 3)
  1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

OpenVINO

OpenVINO 2022.3 - Model: Face Detection FP16-INT8 - Device: CPU
ms, Fewer Is Better
  off: 1141.43 (SE +/- 0.32, N = 3; MIN: 998.76 / MAX: 1165.45)
  safe RET no microcode: 1142.29 (SE +/- 0.27, N = 3; MIN: 985.75 / MAX: 1168.76)
  safe RET: 1142.53 (SE +/- 1.09, N = 3; MIN: 999.01 / MAX: 1177.02)
  1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Face Detection FP16-INT8 - Device: CPU
FPS, More Is Better
  off: 27.83 (SE +/- 0.01, N = 3)
  safe RET no microcode: 27.79 (SE +/- 0.02, N = 3)
  safe RET: 27.82 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OSPRay

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/scivis/real_time
Items Per Second, More Is Better
  off: 8.33174 (SE +/- 0.00864, N = 3)
  safe RET no microcode: 8.32749 (SE +/- 0.01659, N = 3)
  safe RET: 8.33813 (SE +/- 0.01059, N = 3)

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/ao/real_time
Items Per Second, More Is Better
  off: 8.96051 (SE +/- 0.02460, N = 3)
  safe RET no microcode: 8.96941 (SE +/- 0.02941, N = 3)
  safe RET: 8.94049 (SE +/- 0.01456, N = 3)

OpenVINO

OpenVINO 2022.3 - Model: Weld Porosity Detection FP16 - Device: CPU (ms, Fewer Is Better)
  off: 28.40 (SE +/- 0.01, N = 3; MIN: 14.74 / MAX: 48.66)
  safe RET no microcode: 28.38 (SE +/- 0.02, N = 3; MIN: 14.89 / MAX: 51.63)
  safe RET: 28.39 (SE +/- 0.00, N = 3; MIN: 14.64 / MAX: 50.33)
  1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Weld Porosity Detection FP16 - Device: CPU (FPS, More Is Better)
  off: 1126.03 (SE +/- 0.17, N = 3)
  safe RET no microcode: 1126.64 (SE +/- 0.72, N = 3)
  safe RET: 1126.14 (SE +/- 0.13, N = 3)
  1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

RocksDB 8.0 - Test: Update Random (Op/s, More Is Better)
  off: 462287 (SE +/- 893.82, N = 3)
  safe RET no microcode: 428112 (SE +/- 426.73, N = 3)
  safe RET: 426947 (SE +/- 185.49, N = 3)
  1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  off: 65.59 (SE +/- 0.15, N = 3)
  safe RET no microcode: 65.89 (SE +/- 0.14, N = 3)
  safe RET: 65.57 (SE +/- 0.16, N = 3)

Neural Magic DeepSparse 1.5 - Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  off: 487.25 (SE +/- 1.15, N = 3)
  safe RET no microcode: 485.01 (SE +/- 0.96, N = 3)
  safe RET: 487.37 (SE +/- 1.01, N = 3)

Neural Magic DeepSparse 1.5 - Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  off: 596.59 (SE +/- 0.18, N = 3)
  safe RET no microcode: 596.56 (SE +/- 0.26, N = 3)
  safe RET: 596.78 (SE +/- 0.11, N = 3)

Neural Magic DeepSparse 1.5 - Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  off: 53.60 (SE +/- 0.01, N = 3)
  safe RET no microcode: 53.60 (SE +/- 0.02, N = 3)
  safe RET: 53.57 (SE +/- 0.02, N = 3)

OSPRay

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time (Items Per Second, More Is Better)
  off: 13.14 (SE +/- 0.13, N = 3)
  safe RET no microcode: 13.26 (SE +/- 0.00, N = 3)
  safe RET: 13.25 (SE +/- 0.01, N = 3)

Apache IoTDB

Apache IoTDB 1.1.2 - Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 (Average Latency, Fewer Is Better)
  off: 36.58 (SE +/- 0.61, N = 3; MAX: 2252.73)
  safe RET no microcode: 38.54 (SE +/- 0.11, N = 3; MAX: 3276.77)
  safe RET: 35.82 (SE +/- 0.49, N = 3; MAX: 3267.55)

Apache IoTDB 1.1.2 - Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 (point/sec, More Is Better)
  off: 49501499.13 (SE +/- 681823.31, N = 3)
  safe RET no microcode: 47445770.18 (SE +/- 147114.88, N = 3)
  safe RET: 50578426.54 (SE +/- 634314.77, N = 3)

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  off: 839.72 (SE +/- 0.50, N = 3)
  safe RET no microcode: 840.77 (SE +/- 0.36, N = 3)
  safe RET: 840.42 (SE +/- 0.34, N = 3)

Neural Magic DeepSparse 1.5 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  off: 37.60 (SE +/- 0.02, N = 3)
  safe RET no microcode: 37.70 (SE +/- 0.07, N = 3)
  safe RET: 37.63 (SE +/- 0.08, N = 3)

Apache IoTDB

Apache IoTDB 1.1.2 - Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 (Average Latency, Fewer Is Better)
  off: 13.83 (SE +/- 0.19, N = 3; MAX: 596.78)
  safe RET no microcode: 14.05 (SE +/- 0.16, N = 12; MAX: 609.96)
  safe RET: 14.73 (SE +/- 0.21, N = 8; MAX: 645.11)

Apache IoTDB 1.1.2 - Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 (point/sec, More Is Better)
  off: 960525.66 (SE +/- 8467.91, N = 3)
  safe RET no microcode: 947741.34 (SE +/- 6730.71, N = 12)
  safe RET: 918691.45 (SE +/- 7998.38, N = 8)

Apache IoTDB 1.1.2 - Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 (Average Latency, Fewer Is Better)
  off: 31.49 (SE +/- 0.29, N = 3; MAX: 939.96)
  safe RET no microcode: 31.93 (SE +/- 0.49, N = 3; MAX: 930.97)
  safe RET: 27.73 (SE +/- 0.22, N = 3; MAX: 938.92)

Apache IoTDB 1.1.2 - Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 (point/sec, More Is Better)
  off: 1415756.33 (SE +/- 4294.81, N = 3)
  safe RET no microcode: 1408658.83 (SE +/- 13029.07, N = 3)
  safe RET: 1583717.62 (SE +/- 5073.96, N = 3)

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test currently uses a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2022.10.13 - Model: Cell Phone Drop Test (Seconds, Fewer Is Better)
  off: 33.10 (SE +/- 0.11, N = 3)
  safe RET no microcode: 36.37 (SE +/- 0.26, N = 3)
  safe RET: 36.40 (SE +/- 0.03, N = 3)

GROMACS

This is the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package, tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2023 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
  off: 5.680 (SE +/- 0.012, N = 3)
  safe RET no microcode: 5.730 (SE +/- 0.006, N = 3)
  safe RET: 5.706 (SE +/- 0.010, N = 3)
  1. (CXX) g++ options: -O3

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  off: 68.15 (SE +/- 0.02, N = 3)
  safe RET no microcode: 68.26 (SE +/- 0.04, N = 3)
  safe RET: 68.23 (SE +/- 0.04, N = 3)

Neural Magic DeepSparse 1.5 - Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  off: 468.83 (SE +/- 0.24, N = 3)
  safe RET no microcode: 468.36 (SE +/- 0.42, N = 3)
  safe RET: 468.22 (SE +/- 0.31, N = 3)

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 22.01 - Test: Decompression Rating (MIPS, More Is Better)
  off: 385585 (SE +/- 845.58, N = 3)
  safe RET no microcode: 383039 (SE +/- 380.69, N = 3)
  safe RET: 383515 (SE +/- 605.48, N = 3)
  1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 22.01 - Test: Compression Rating (MIPS, More Is Better)
  off: 384374 (SE +/- 1018.75, N = 3)
  safe RET no microcode: 334812 (SE +/- 25.38, N = 3)
  safe RET: 335595 (SE +/- 435.27, N = 3)
  1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
  off: 1011799000 (SE +/- 839009.73, N = 3)
  safe RET no microcode: 999645400 (SE +/- 575791.94, N = 3)
  safe RET: 999102100 (SE +/- 367255.40, N = 3)
  1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  off: 8.3078 (SE +/- 0.0095, N = 3)
  safe RET no microcode: 8.3612 (SE +/- 0.0167, N = 3)
  safe RET: 8.3219 (SE +/- 0.0271, N = 3)

Neural Magic DeepSparse 1.5 - Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  off: 3840.63 (SE +/- 4.76, N = 3)
  safe RET no microcode: 3816.77 (SE +/- 7.72, N = 3)
  safe RET: 3834.24 (SE +/- 12.28, N = 3)

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0 - Model: Layered Halfspace (Seconds, Fewer Is Better)
  off: 31.85 (SE +/- 0.35, N = 3)
  safe RET no microcode: 31.83 (SE +/- 0.23, N = 3)
  safe RET: 31.66 (SE +/- 0.18, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D 4.0 - Model: Water-layered Halfspace (Seconds, Fewer Is Better)
  off: 29.77 (SE +/- 0.15, N = 3)
  safe RET no microcode: 30.43 (SE +/- 0.25, N = 3)
  safe RET: 29.59 (SE +/- 0.35, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Apache IoTDB

Apache IoTDB 1.1.2 - Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 (Average Latency, Fewer Is Better)
  off: 14.05 (SE +/- 0.07, N = 3; MAX: 858.17)
  safe RET no microcode: 13.61 (SE +/- 0.14, N = 3; MAX: 854.4)
  safe RET: 13.36 (SE +/- 0.19, N = 3; MAX: 881.3)

Apache IoTDB 1.1.2 - Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 (point/sec, More Is Better)
  off: 1176385.35 (SE +/- 1566.77, N = 3)
  safe RET no microcode: 1202637.36 (SE +/- 6553.14, N = 3)
  safe RET: 1211172.13 (SE +/- 4253.29, N = 3)

Apache IoTDB 1.1.2 - Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 (Average Latency, Fewer Is Better)
  off: 32.36 (SE +/- 0.24, N = 3; MAX: 646.51)
  safe RET no microcode: 30.29 (SE +/- 0.02, N = 3; MAX: 715.01)
  safe RET: 30.14 (SE +/- 0.22, N = 3; MAX: 641.04)

Apache IoTDB 1.1.2 - Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 (point/sec, More Is Better)
  off: 1271946.57 (SE +/- 7578.67, N = 3)
  safe RET no microcode: 1342031.11 (SE +/- 1525.49, N = 3)
  safe RET: 1345598.59 (SE +/- 9180.92, N = 3)

Blender

Blender 3.6 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
  off: 27.34 (SE +/- 0.04, N = 3)
  safe RET no microcode: 27.58 (SE +/- 0.06, N = 3)
  safe RET: 27.46 (SE +/- 0.05, N = 3)

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  off: 0.38130 (SE +/- 0.00017, N = 3)
  safe RET no microcode: 0.38115 (SE +/- 0.00029, N = 3)
  safe RET: 0.38098 (SE +/- 0.00028, N = 3)

SPECFEM3D

SPECFEM3D 4.0 - Model: Homogeneous Halfspace (Seconds, Fewer Is Better)
  off: 17.42 (SE +/- 0.07, N = 3)
  safe RET no microcode: 17.64 (SE +/- 0.20, N = 4)
  safe RET: 17.69 (SE +/- 0.21, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Remhos

Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

Remhos 1.0 - Test: Sample Remap Example (Seconds, Fewer Is Better)
  off: 17.38 (SE +/- 0.23, N = 3)
  safe RET no microcode: 17.79 (SE +/- 0.17, N = 3)
  safe RET: 17.96 (SE +/- 0.19, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM 1.0 - Sustained Floating-Point Rate (GFLOP/s, More Is Better)
  off: 24.20 (SE +/- 0.21, N = 8)
  safe RET no microcode: 24.70 (SE +/- 0.34, N = 3)
  safe RET: 23.70 (SE +/- 0.26, N = 5)
  1. (CC) gcc options: -O3 -march=native -fopenmp

SPECFEM3D

SPECFEM3D 4.0 - Model: Tomographic Model (Seconds, Fewer Is Better)
  off: 14.13 (SE +/- 0.09, N = 3)
  safe RET no microcode: 14.40 (SE +/- 0.15, N = 3)
  safe RET: 14.19 (SE +/- 0.20, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D 4.0 - Model: Mount St. Helens (Seconds, Fewer Is Better)
  off: 11.80 (SE +/- 0.05, N = 3)
  safe RET no microcode: 11.98 (SE +/- 0.07, N = 3)
  safe RET: 12.01 (SE +/- 0.04, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1 - Java Test: Tradebeans (msec, Fewer Is Better)
  off: 3993 (SE +/- 42.66, N = 4)
  safe RET no microcode: 4096 (SE +/- 44.47, N = 4)
  safe RET: 4143 (SE +/- 28.11, N = 4)

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Embree 4.1 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
  off: 57.42 (SE +/- 0.08, N = 3; MIN: 56.59 / MAX: 58.54)
  safe RET no microcode: 57.30 (SE +/- 0.14, N = 3; MIN: 56.26 / MAX: 58.69)
  safe RET: 57.31 (SE +/- 0.15, N = 3; MIN: 56.2 / MAX: 58.59)

Embree 4.1 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
  off: 64.60 (SE +/- 0.02, N = 3; MIN: 64.05 / MAX: 66.13)
  safe RET no microcode: 64.39 (SE +/- 0.09, N = 3; MIN: 63.77 / MAX: 66.16)
  safe RET: 64.67 (SE +/- 0.03, N = 3; MIN: 64.11 / MAX: 66.01)

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database with a variable number of concurrent repetitions -- up to the maximum number of CPU threads available. Learn more via the OpenBenchmarking.org test page.
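
As a rough illustration of what this kind of workload looks like, the sketch below times a fixed number of insertions into an indexed table, repeated across several concurrent copies. It is not the test profile's actual implementation: the table layout, the row counts, the use of in-memory databases, and the function names `insert_rows` and `run_copies` are all assumptions made for the example.

```python
import sqlite3
import time
from concurrent.futures import ThreadPoolExecutor


def insert_rows(n_rows: int) -> int:
    """Insert n_rows into an indexed table and return the resulting row count.

    Each call opens its own in-memory database (an assumption for this sketch;
    a real benchmark would normally write to disk).
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, k INTEGER, v TEXT)")
    conn.execute("CREATE INDEX idx_k ON t (k)")
    with conn:  # commit all insertions as a single transaction
        for i in range(n_rows):
            conn.execute("INSERT INTO t (k, v) VALUES (?, ?)", (i % 97, f"row-{i}"))
    count = conn.execute("SELECT COUNT(*) FROM t").fetchone()[0]
    conn.close()
    return count


def run_copies(copies: int, n_rows: int):
    """Run `copies` concurrent insertion jobs; return (elapsed seconds, row counts)."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=copies) as pool:
        counts = list(pool.map(insert_rows, [n_rows] * copies))
    return time.perf_counter() - start, counts


if __name__ == "__main__":
    elapsed, counts = run_copies(copies=8, n_rows=2000)
    print(f"{len(counts)} copies, {sum(counts)} rows inserted in {elapsed:.3f} s")
```

The "Threads / Copies" value in the results below plays the role of the `copies` argument here; the wall-clock time for all copies to finish is the figure being compared.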

SQLite 3.41.2 - Threads / Copies: 16 (Seconds, Fewer Is Better)
  off: 6.273 (SE +/- 0.020, N = 3)
  safe RET no microcode: 8.834 (SE +/- 0.024, N = 3)
  safe RET: 8.800 (SE +/- 0.052, N = 3)
  1. (CC) gcc options: -O2 -lz -lm

DaCapo Benchmark

DaCapo Benchmark 9.12-MR1 - Java Test: Jython (msec, Fewer Is Better)
  off: 4193 (SE +/- 18.07, N = 4)
  safe RET no microcode: 4191 (SE +/- 47.28, N = 4)
  safe RET: 4241 (SE +/- 49.88, N = 4)

SQLite

SQLite 3.41.2 - Threads / Copies: 8 (Seconds, Fewer Is Better)
  off: 3.755 (SE +/- 0.013, N = 3)
  safe RET no microcode: 4.850 (SE +/- 0.016, N = 3)
  safe RET: 5.006 (SE +/- 0.036, N = 3)
  1. (CC) gcc options: -O2 -lz -lm
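
To put the three configurations on a common footing, it can help to express each result as a percentage change relative to the "off" run. The sketch below does this for the two SQLite results reported above; the helper name `percent_change` and the dictionary layout are illustrative choices, not part of the Phoronix Test Suite.

```python
def percent_change(baseline: float, value: float) -> float:
    """Return the change of `value` relative to `baseline`, in percent."""
    return (value - baseline) / baseline * 100.0


# SQLite 3.41.2 timings in seconds (fewer is better), copied from the results above.
sqlite_seconds = {
    "Threads / Copies: 16": {"off": 6.273, "safe RET no microcode": 8.834, "safe RET": 8.800},
    "Threads / Copies: 8": {"off": 3.755, "safe RET no microcode": 4.850, "safe RET": 5.006},
}

if __name__ == "__main__":
    for test, runs in sqlite_seconds.items():
        for config in ("safe RET no microcode", "safe RET"):
            delta = percent_change(runs["off"], runs[config])
            print(f"{test}: {config} is {delta:+.1f}% vs off")
```

For a "Fewer Is Better" metric such as seconds, a positive percentage means the mitigated run was slower than "off"; for "More Is Better" metrics the sign reads the other way.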

104 Results Shown

OpenVKL
TensorFlow
CockroachDB:
  KV, 50% Reads - 128
  KV, 95% Reads - 128
MariaDB
Timed Linux Kernel Compilation
ClickHouse:
  100M Rows Hits Dataset, Third Run
  100M Rows Hits Dataset, Second Run
  100M Rows Hits Dataset, First Run / Cold Cache
OpenFOAM:
  drivaerFastback, Medium Mesh Size - Execution Time
  drivaerFastback, Medium Mesh Size - Mesh Time
MariaDB
OSPRay
Apache IoTDB:
  200 - 100 - 500:
    Average Latency
    point/sec
Timed LLVM Compilation
OpenRadioss
Timed Node.js Compilation
Numpy Benchmark
OpenRadioss
OSPRay
Neural Magic DeepSparse:
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
    ms/batch
    items/sec
PostgreSQL:
  100 - 800 - Read Only - Average Latency
  100 - 800 - Read Only
  100 - 800 - Read Write - Average Latency
  100 - 800 - Read Write
Timed MrBayes Analysis
Apache Spark:
  1000000 - 100 - Broadcast Inner Join Test Time
  1000000 - 100 - Inner Join Test Time
  1000000 - 100 - Repartition Test Time
  1000000 - 100 - Group By Test Time
  1000000 - 100 - Calculate Pi Benchmark
  1000000 - 100 - SHA-512 Benchmark Time
Apache Cassandra
Timed Godot Game Engine Compilation
Redis 7.0.12 + memtier_benchmark
Apache IoTDB:
  500 - 100 - 500:
    Average Latency
    point/sec
  200 - 100 - 200:
    Average Latency
    point/sec
OSPRay
OpenRadioss
nginx:
  1000
  500
Neural Magic DeepSparse:
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
OpenRadioss
Blender
OpenVINO:
  Person Detection FP16 - CPU:
    ms
    FPS
Redis 7.0.12 + memtier_benchmark:
  Redis - 50 - 1:5
  Redis - 100 - 1:5
Timed Linux Kernel Compilation
Redis 7.0.12 + memtier_benchmark
RocksDB
OpenVINO:
  Face Detection FP16-INT8 - CPU:
    ms
    FPS
OSPRay:
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/ao/real_time
OpenVINO:
  Weld Porosity Detection FP16 - CPU:
    ms
    FPS
RocksDB
Neural Magic DeepSparse:
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
    ms/batch
    items/sec
OSPRay
Apache IoTDB:
  500 - 100 - 200:
    Average Latency
    point/sec
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
    ms/batch
    items/sec
Apache IoTDB:
  200 - 1 - 200:
    Average Latency
    point/sec
  500 - 1 - 500:
    Average Latency
    point/sec
OpenRadioss
GROMACS
Neural Magic DeepSparse:
  ResNet-50, Baseline - Asynchronous Multi-Stream:
    ms/batch
    items/sec
7-Zip Compression:
  Decompression Rating
  Compression Rating
Algebraic Multi-Grid Benchmark
Neural Magic DeepSparse:
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
SPECFEM3D:
  Layered Halfspace
  Water-layered Halfspace
Apache IoTDB:
  500 - 1 - 200:
    Average Latency
    point/sec
  200 - 1 - 500:
    Average Latency
    point/sec
Blender
NAMD
SPECFEM3D
Remhos
ACES DGEMM
SPECFEM3D:
  Tomographic Model
  Mount St. Helens
DaCapo Benchmark
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon
SQLite
DaCapo Benchmark
SQLite