AMD EPYC 7763 1P spec_rstack_overflow

Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2308109-NE-EPYC7763169
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
off
August 10 2023
  9 Hours, 10 Minutes
safe RET no microcode
August 09 2023
  9 Hours, 26 Minutes
safe RET
August 10 2023
  9 Hours, 17 Minutes
Invert Behavior (Only Show Selected Data)
  9 Hours, 18 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC 7763 1P spec_rstack_overflowOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)AMD DAYTONA_X (RYM1009B BIOS)AMD Starship/Matisse256GB800GB INTEL SSDPF21Q800GBASPEEDVE2282 x Mellanox MT27710Ubuntu 22.046.5.0-rc5-phx-tues (x86_64)GNOME Shell 42.5X Server 1.21.1.31.3.224GCC 11.3.0 + LLVM 14.0.0ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionAMD EPYC 7763 1P Spec_rstack_overflow BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NONE / errors=remount-ro,relatime,rw / Block Size: 4096- safe RET no microcode: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 - off: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 - safe RET: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1 - OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)- Python 3.10.6- safe RET no microcode: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - off: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - safe RET: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

safe RET no microcodeoffsafe RETResult OverviewPhoronix Test Suite100%109%118%128%137%SQLiteMariaDBTimed Linux Kernel CompilationnginxTensorFlowPostgreSQLNumpy BenchmarkClickHouse7-Zip CompressionApache SparkOpenRadiossRocksDBTimed Node.js CompilationACES DGEMMRemhosTimed LLVM CompilationCockroachDBTimed Godot Game Engine CompilationOpenFOAMApache CassandraApache IoTDBRedis 7.0.12 + memtier_benchmarkTimed MrBayes AnalysisSPECFEM3DAlgebraic Multi-Grid BenchmarkGROMACSBlenderOpenVINOEmbreeOpenVKLNeural Magic DeepSparseOSPRayNAMDDaCapo Benchmark

AMD EPYC 7763 1P spec_rstack_overflowdacapobench: Jythondacapobench: Tradebeansopenradioss: Bumper Beamopenradioss: Cell Phone Drop Testopenradioss: Bird Strike on Windshieldspecfem3d: Homogeneous Halfspacespecfem3d: Water-layered Halfspacespecfem3d: Layered Halfspacespecfem3d: Tomographic Modelremhos: Sample Remap Examplespecfem3d: Mount St. Helensopenradioss: Rubber O-Ring Seal Installationopenradioss: INIVOL and Fluid Structure Interaction Drop Containermrbayes: Primate Phylogeny Analysistensorflow: CPU - 64 - ResNet-50numpy: deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamgromacs: MPI CPU - water_GMX50_barenamd: ATPase Simulation - 327,506 Atomsopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUmt-dgemm: Sustained Floating-Point Rateamg: openfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timecompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingbuild-llvm: Ninjabuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigblender: BMW27 - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlybuild-godot: Time To Compileembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonopenvkl: vklBenchmark ISPCospray: particle_volume/ao/real_timeospray: particle_volume/scivis/real_timeospray: particle_volume/pathtracer/real_timeospray: gravity_spheres_volume/dim_512/ao/real_timeospray: gravity_spheres_volume/dim_512/scivis/real_timeospray: gravity_spheres_volume/dim_512/pathtracer/real_timebuild-nodejs: Time To Compilenginx: 500nginx: 1000apache-iotdb: 200 - 1 - 200apache-iotdb: 200 - 1 - 200apache-iotdb: 200 - 1 - 500apache-iotdb: 200 - 1 - 500apache-iotdb: 500 - 1 - 200apache-iotdb: 500 - 1 - 200apache-iotdb: 500 - 1 - 500apache-iotdb: 500 - 1 - 500apache-iotdb: 200 - 100 - 200apache-iotdb: 200 - 100 - 200apache-iotdb: 200 - 100 - 500apache-iotdb: 200 - 100 - 500apache-iotdb: 500 - 100 - 200apache-iotdb: 500 - 100 - 200apache-iotdb: 500 - 100 - 500apache-iotdb: 500 - 100 - 500spark: 1000000 - 100 - SHA-512 Benchmark Timespark: 1000000 - 100 - Calculate Pi Benchmarkspark: 1000000 - 100 - Group By Test Timespark: 1000000 - 100 - Repartition Test Timespark: 1000000 - 100 - Inner Join Test Timespark: 1000000 - 100 - Broadcast Inner Join Test Timeclickhouse: 100M Rows Hits Dataset, First Run / Cold Cacheclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, Third Runcockroach: KV, 50% Reads - 128cockroach: KV, 95% Reads - 128memtier-benchmark: Redis - 50 - 1:5memtier-benchmark: Redis - 100 - 1:5memtier-benchmark: Redis - 50 - 1:10memtier-benchmark: Redis - 100 - 1:10sqlite: 8sqlite: 16rocksdb: Update Randrocksdb: Read Rand Write Randcassandra: Writespgbench: 100 - 800 - Read Onlypgbench: 100 - 800 - Read Only - Average Latencypgbench: 100 - 800 - Read Writepgbench: 100 - 800 - Read Write - Average Latencymysqlslap: 4096mysqlslap: 8192safe RET no microcodeoffsafe RET4191409693.6836.37152.9117.64368039730.42753170931.82926187014.40441923817.78811.98216346085.04163.02137.51815.56418.9537.6987840.7721485.006965.8910468.364668.25953816.76768.361246.7239679.576653.5972596.5579577.058955.37475.7300.381157.584124.7427.791142.291126.6428.3824.695818999645400145.06069644.36223334812383039182.16937.623344.24227.5884.69125.66357.295664.391645218.028817.7305155.1658.969418.3274913.2621172.749144020.03140555.98947741.3414.051342031.1130.291202637.3613.611408658.8331.9346538766.0135.1037720117.40123.6147445770.1838.5458073516.7979.193.4232.025.172.382.221.39323.42337.01329.19100851.4131487.02218601.792167181.092173694.772154339.264.8508.834428112287276523306927072800.2965517514.4994123014193399387.7233.10144.8317.41712093329.77253538631.84563042414.13426560617.37511.80123873277.46162.13136.68617.78457.2337.6037839.7201487.245065.5911468.830668.14833840.63078.307846.7020678.933053.5968596.5915576.972255.39225.6800.381307.684092.9127.831141.431126.0328.4024.2005511011799000140.61562633.51902384374385585176.37431.192289.06327.3484.50121.94857.422964.596445318.022617.7511157.8298.960518.3317413.1355164.268169583.15166499.89960525.6613.831271946.5732.361176385.3514.051415756.3331.4943665846.2837.7039463981.42117.9749501499.1336.5858682618.1878.923.3931.844.912.091.881.30349.43361.81362.64103635.0135187.22204628.922197287.302177211.802195705.513.7556.273462287295168423874131287190.2566160412.9885903554241414393.9036.40152.2717.69029865029.59086826031.65994088514.18805896217.95812.01038078184.48163.97138.85115.65422.5837.6319840.4236487.367765.5662468.217068.23253834.24398.321946.5677682.207153.5729596.7822576.816655.40285.7060.380987.604114.5827.821142.531126.1428.3923.702889999102100144.02174643.71316335595383515181.52837.243338.15727.4684.49125.06057.313864.674245317.981717.7305156.4198.940498.3381313.2538173.064142619.84143271.26918691.4514.731345598.5930.141211172.1313.361583717.6227.7344027904.8937.5338833415.97120.0650578426.5435.8257099408.1582.243.4731.435.152.262.141.41318.12337.12337.4599601.6132046.02145436.262145052.142172804.712157815.695.0068.800426947283908523624127684450.2895483714.589418301OpenBenchmarking.org

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jythonsafe RET no microcodeoffsafe RET9001800270036004500SE +/- 47.28, N = 4SE +/- 18.07, N = 4SE +/- 49.88, N = 4419141934241

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradebeanssafe RET no microcodeoffsafe RET9001800270036004500SE +/- 44.47, N = 4SE +/- 42.66, N = 4SE +/- 28.11, N = 4409639934143

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: Bumper Beamsafe RET no microcodeoffsafe RET20406080100SE +/- 0.08, N = 3SE +/- 0.33, N = 3SE +/- 0.03, N = 393.6887.7293.90

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: Cell Phone Drop Testsafe RET no microcodeoffsafe RET816243240SE +/- 0.26, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 336.3733.1036.40

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: Bird Strike on Windshieldsafe RET no microcodeoffsafe RET306090120150SE +/- 0.67, N = 3SE +/- 0.07, N = 3SE +/- 0.73, N = 3152.91144.83152.27

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous Halfspacesafe RET no microcodeoffsafe RET48121620SE +/- 0.20, N = 4SE +/- 0.07, N = 3SE +/- 0.21, N = 317.6417.4217.691. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered Halfspacesafe RET no microcodeoffsafe RET714212835SE +/- 0.25, N = 3SE +/- 0.15, N = 3SE +/- 0.35, N = 330.4329.7729.591. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered Halfspacesafe RET no microcodeoffsafe RET714212835SE +/- 0.23, N = 3SE +/- 0.35, N = 3SE +/- 0.18, N = 331.8331.8531.661. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic Modelsafe RET no microcodeoffsafe RET48121620SE +/- 0.15, N = 3SE +/- 0.09, N = 3SE +/- 0.20, N = 314.4014.1314.191. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Remhos

Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Examplesafe RET no microcodeoffsafe RET48121620SE +/- 0.17, N = 3SE +/- 0.23, N = 3SE +/- 0.19, N = 317.7917.3817.961. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. Helenssafe RET no microcodeoffsafe RET3691215SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 311.9811.8012.011. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: Rubber O-Ring Seal Installationsafe RET no microcodeoffsafe RET20406080100SE +/- 0.17, N = 3SE +/- 0.23, N = 3SE +/- 0.24, N = 385.0477.4684.48

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: INIVOL and Fluid Structure Interaction Drop Containersafe RET no microcodeoffsafe RET4080120160200SE +/- 0.39, N = 3SE +/- 0.16, N = 3SE +/- 0.50, N = 3163.02162.13163.97

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysissafe RET no microcodeoffsafe RET306090120150SE +/- 0.66, N = 3SE +/- 0.85, N = 3SE +/- 1.05, N = 3137.52136.69138.851. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50safe RET no microcodeoffsafe RET48121620SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 315.5617.7815.65

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmarksafe RET no microcodeoffsafe RET100200300400500SE +/- 2.01, N = 3SE +/- 1.76, N = 3SE +/- 0.84, N = 3418.95457.23422.58

Neural Magic DeepSparse

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET918273645SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 337.7037.6037.63

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET2004006008001000SE +/- 0.36, N = 3SE +/- 0.50, N = 3SE +/- 0.34, N = 3840.77839.72840.42

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET110220330440550SE +/- 0.96, N = 3SE +/- 1.15, N = 3SE +/- 1.01, N = 3485.01487.25487.37

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET1530456075SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.16, N = 365.8965.5965.57

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET100200300400500SE +/- 0.42, N = 3SE +/- 0.24, N = 3SE +/- 0.31, N = 3468.36468.83468.22

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET1530456075SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 368.2668.1568.23

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET8001600240032004000SE +/- 7.72, N = 3SE +/- 4.76, N = 3SE +/- 12.28, N = 33816.773840.633834.24

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET246810SE +/- 0.0167, N = 3SE +/- 0.0095, N = 3SE +/- 0.0271, N = 38.36128.30788.3219

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET1122334455SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 346.7246.7046.57

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET150300450600750SE +/- 1.47, N = 3SE +/- 1.21, N = 3SE +/- 0.96, N = 3679.58678.93682.21

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET1224364860SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 353.6053.6053.57

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET130260390520650SE +/- 0.26, N = 3SE +/- 0.18, N = 3SE +/- 0.11, N = 3596.56596.59596.78

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET120240360480600SE +/- 0.39, N = 3SE +/- 0.44, N = 3SE +/- 0.39, N = 3577.06576.97576.82

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Streamsafe RET no microcodeoffsafe RET1224364860SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 355.3755.3955.40

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_baresafe RET no microcodeoffsafe RET1.28932.57863.86795.15726.4465SE +/- 0.006, N = 3SE +/- 0.012, N = 3SE +/- 0.010, N = 35.7305.6805.7061. (CXX) g++ options: -O3

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atomssafe RET no microcodeoffsafe RET0.08580.17160.25740.34320.429SE +/- 0.00029, N = 3SE +/- 0.00017, N = 3SE +/- 0.00028, N = 30.381150.381300.38098

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUsafe RET no microcodeoffsafe RET246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 37.587.687.601. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUsafe RET no microcodeoffsafe RET9001800270036004500SE +/- 6.87, N = 3SE +/- 14.65, N = 3SE +/- 10.77, N = 34124.744092.914114.58MIN: 2129.26 / MAX: 5016.36MIN: 3409.52 / MAX: 4641.43MIN: 2087 / MAX: 5053.621. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUsafe RET no microcodeoffsafe RET714212835SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 327.7927.8327.821. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUsafe RET no microcodeoffsafe RET2004006008001000SE +/- 0.27, N = 3SE +/- 0.32, N = 3SE +/- 1.09, N = 31142.291141.431142.53MIN: 985.75 / MAX: 1168.76MIN: 998.76 / MAX: 1165.45MIN: 999.01 / MAX: 1177.021. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUsafe RET no microcodeoffsafe RET2004006008001000SE +/- 0.72, N = 3SE +/- 0.17, N = 3SE +/- 0.13, N = 31126.641126.031126.141. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUsafe RET no microcodeoffsafe RET714212835SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 328.3828.4028.39MIN: 14.89 / MAX: 51.63MIN: 14.74 / MAX: 48.66MIN: 14.64 / MAX: 50.331. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Ratesafe RET no microcodeoffsafe RET612182430SE +/- 0.34, N = 3SE +/- 0.21, N = 8SE +/- 0.26, N = 524.7024.2023.701. (CC) gcc options: -O3 -march=native -fopenmp

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2safe RET no microcodeoffsafe RET200M400M600M800M1000MSE +/- 575791.94, N = 3SE +/- 839009.73, N = 3SE +/- 367255.40, N = 399964540010117990009991021001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Timesafe RET no microcodeoffsafe RET306090120150145.06140.62144.021. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Timesafe RET no microcodeoffsafe RET140280420560700644.36633.52643.711. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Ratingsafe RET no microcodeoffsafe RET80K160K240K320K400KSE +/- 25.38, N = 3SE +/- 1018.75, N = 3SE +/- 435.27, N = 33348123843743355951. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Ratingsafe RET no microcodeoffsafe RET80K160K240K320K400KSE +/- 380.69, N = 3SE +/- 845.58, N = 3SE +/- 605.48, N = 33830393855853835151. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninjasafe RET no microcodeoffsafe RET4080120160200SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.15, N = 3182.17176.37181.53

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigsafe RET no microcodeoffsafe RET918273645SE +/- 0.35, N = 6SE +/- 0.34, N = 5SE +/- 0.37, N = 637.6231.1937.24

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigsafe RET no microcodeoffsafe RET70140210280350SE +/- 0.90, N = 3SE +/- 0.49, N = 3SE +/- 0.79, N = 3344.24289.06338.16

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlysafe RET no microcodeoffsafe RET612182430SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 327.5827.3427.46

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-Onlysafe RET no microcodeoffsafe RET20406080100SE +/- 0.02, N = 3SE +/- 0.13, N = 3SE +/- 0.04, N = 384.6984.5084.49

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To Compilesafe RET no microcodeoffsafe RET306090120150SE +/- 0.33, N = 3SE +/- 0.19, N = 3SE +/- 0.24, N = 3125.66121.95125.06

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Crownsafe RET no microcodeoffsafe RET1326395265SE +/- 0.14, N = 3SE +/- 0.08, N = 3SE +/- 0.15, N = 357.3057.4257.31MIN: 56.26 / MAX: 58.69MIN: 56.59 / MAX: 58.54MIN: 56.2 / MAX: 58.59

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragonsafe RET no microcodeoffsafe RET1428425670SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 364.3964.6064.67MIN: 63.77 / MAX: 66.16MIN: 64.05 / MAX: 66.13MIN: 64.11 / MAX: 66.01

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPCsafe RET no microcodeoffsafe RET100200300400500SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.00, N = 3452453453MIN: 85 / MAX: 2535MIN: 84 / MAX: 2528MIN: 84 / MAX: 2520

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/ao/real_timesafe RET no microcodeoffsafe RET48121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 318.0318.0217.98

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/scivis/real_timesafe RET no microcodeoffsafe RET48121620SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 317.7317.7517.73

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/pathtracer/real_timesafe RET no microcodeoffsafe RET306090120150SE +/- 1.83, N = 3SE +/- 0.21, N = 3SE +/- 0.07, N = 3155.17157.83156.42

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timesafe RET no microcodeoffsafe RET3691215SE +/- 0.02941, N = 3SE +/- 0.02460, N = 3SE +/- 0.01456, N = 38.969418.960518.94049

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timesafe RET no microcodeoffsafe RET246810SE +/- 0.01659, N = 3SE +/- 0.00864, N = 3SE +/- 0.01059, N = 38.327498.331748.33813

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timesafe RET no microcodeoffsafe RET3691215SE +/- 0.00, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 313.2613.1413.25

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To Compilesafe RET no microcodeoffsafe RET4080120160200SE +/- 0.12, N = 3SE +/- 0.14, N = 3SE +/- 0.05, N = 3172.75164.27173.06

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500safe RET no microcodeoffsafe RET40K80K120K160K200KSE +/- 284.55, N = 3SE +/- 284.72, N = 3SE +/- 251.96, N = 3144020.03169583.15142619.841. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000safe RET no microcodeoffsafe RET40K80K120K160K200KSE +/- 352.89, N = 3SE +/- 362.13, N = 3SE +/- 314.03, N = 3140555.98166499.89143271.261. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache IoTDB

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200safe RET no microcodeoffsafe RET200K400K600K800K1000KSE +/- 6730.71, N = 12SE +/- 8467.91, N = 3SE +/- 7998.38, N = 8947741.34960525.66918691.45

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200safe RET no microcodeoffsafe RET48121620SE +/- 0.16, N = 12SE +/- 0.19, N = 3SE +/- 0.21, N = 814.0513.8314.73MAX: 609.96MAX: 596.78MAX: 645.11

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500safe RET no microcodeoffsafe RET300K600K900K1200K1500KSE +/- 1525.49, N = 3SE +/- 7578.67, N = 3SE +/- 9180.92, N = 31342031.111271946.571345598.59

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500safe RET no microcodeoffsafe RET816243240SE +/- 0.02, N = 3SE +/- 0.24, N = 3SE +/- 0.22, N = 330.2932.3630.14MAX: 715.01MAX: 646.51MAX: 641.04

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200safe RET no microcodeoffsafe RET300K600K900K1200K1500KSE +/- 6553.14, N = 3SE +/- 1566.77, N = 3SE +/- 4253.29, N = 31202637.361176385.351211172.13

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200safe RET no microcodeoffsafe RET48121620SE +/- 0.14, N = 3SE +/- 0.07, N = 3SE +/- 0.19, N = 313.6114.0513.36MAX: 854.4MAX: 858.17MAX: 881.3

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500safe RET no microcodeoffsafe RET300K600K900K1200K1500KSE +/- 13029.07, N = 3SE +/- 4294.81, N = 3SE +/- 5073.96, N = 31408658.831415756.331583717.62

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500safe RET no microcodeoffsafe RET714212835SE +/- 0.49, N = 3SE +/- 0.29, N = 3SE +/- 0.22, N = 331.9331.4927.73MAX: 930.97MAX: 939.96MAX: 938.92

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200safe RET no microcodeoffsafe RET10M20M30M40M50MSE +/- 614274.26, N = 3SE +/- 574678.74, N = 15SE +/- 543529.82, N = 1546538766.0143665846.2844027904.89

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200safe RET no microcodeoffsafe RET918273645SE +/- 0.62, N = 3SE +/- 0.55, N = 15SE +/- 0.52, N = 1535.1037.7037.53MAX: 728.37MAX: 802.64MAX: 755.16

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500safe RET no microcodeoffsafe RET8M16M24M32M40MSE +/- 394126.89, N = 5SE +/- 302926.36, N = 10SE +/- 288707.73, N = 1537720117.4039463981.4238833415.97

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500safe RET no microcodeoffsafe RET306090120150SE +/- 1.63, N = 5SE +/- 0.95, N = 10SE +/- 0.86, N = 15123.61117.97120.06MAX: 4533.33MAX: 4652.25MAX: 4495.21

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200safe RET no microcodeoffsafe RET11M22M33M44M55MSE +/- 147114.88, N = 3SE +/- 681823.31, N = 3SE +/- 634314.77, N = 347445770.1849501499.1350578426.54

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200safe RET no microcodeoffsafe RET918273645SE +/- 0.11, N = 3SE +/- 0.61, N = 3SE +/- 0.49, N = 338.5436.5835.82MAX: 3276.77MAX: 2252.73MAX: 3267.55

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500safe RET no microcodeoffsafe RET13M26M39M52M65MSE +/- 648692.91, N = 4SE +/- 817020.04, N = 3SE +/- 721225.08, N = 358073516.7958682618.1857099408.15

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500safe RET no microcodeoffsafe RET20406080100SE +/- 0.81, N = 4SE +/- 2.14, N = 3SE +/- 1.29, N = 379.1978.9282.24MAX: 5165.86MAX: 1729.94MAX: 3625.32

Apache Spark

This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmars the Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating of test data and various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Timesafe RET no microcodeoffsafe RET0.78081.56162.34243.12323.904SE +/- 0.04, N = 3SE +/- 0.03, N = 15SE +/- 0.04, N = 33.423.393.47

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmarksafe RET no microcodeoffsafe RET714212835SE +/- 0.01, N = 3SE +/- 0.12, N = 15SE +/- 0.33, N = 332.0231.8431.43

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Group By Test Timesafe RET no microcodeoffsafe RET1.16332.32663.48994.65325.8165SE +/- 0.08, N = 3SE +/- 0.04, N = 15SE +/- 0.07, N = 35.174.915.15

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Repartition Test Timesafe RET no microcodeoffsafe RET0.53551.0711.60652.1422.6775SE +/- 0.04, N = 3SE +/- 0.04, N = 15SE +/- 0.04, N = 32.382.092.26

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Inner Join Test Timesafe RET no microcodeoffsafe RET0.49950.9991.49851.9982.4975SE +/- 0.05, N = 3SE +/- 0.02, N = 15SE +/- 0.06, N = 32.221.882.14

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Timesafe RET no microcodeoffsafe RET0.31730.63460.95191.26921.5865SE +/- 0.02, N = 3SE +/- 0.01, N = 15SE +/- 0.01, N = 31.391.301.41

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold Cachesafe RET no microcodeoffsafe RET80160240320400SE +/- 3.38, N = 5SE +/- 0.68, N = 3SE +/- 2.94, N = 3323.42349.43318.12MIN: 30.82 / MAX: 5000MIN: 31.06 / MAX: 4285.71MIN: 30.57 / MAX: 3333.33

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second Runsafe RET no microcodeoffsafe RET80160240320400SE +/- 2.16, N = 5SE +/- 1.42, N = 3SE +/- 4.86, N = 3337.01361.81337.12MIN: 30.49 / MAX: 3529.41MIN: 31.46 / MAX: 4000MIN: 30.79 / MAX: 4000

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third Runsafe RET no microcodeoffsafe RET80160240320400SE +/- 3.27, N = 5SE +/- 2.21, N = 3SE +/- 1.85, N = 3329.19362.64337.45MIN: 31.32 / MAX: 2857.14MIN: 31.5 / MAX: 4285.71MIN: 31.46 / MAX: 4000

CockroachDB

CockroachDB is a cloud-native, distributed SQL database for data intensive applications. This test profile uses a server-less CockroachDB configuration to test various Coackroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 50% Reads - Concurrency: 128safe RET no microcodeoffsafe RET20K40K60K80K100KSE +/- 719.41, N = 15SE +/- 275.86, N = 3SE +/- 948.29, N = 15100851.4103635.099601.6

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 95% Reads - Concurrency: 128safe RET no microcodeoffsafe RET30K60K90K120K150KSE +/- 1043.12, N = 13SE +/- 931.05, N = 3SE +/- 1387.70, N = 15131487.0135187.2132046.0

Redis 7.0.12 + memtier_benchmark

Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterRedis 7.0.12 + memtier_benchmark 2.0Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5safe RET no microcodeoffsafe RET500K1000K1500K2000K2500KSE +/- 31351.12, N = 3SE +/- 11955.97, N = 3SE +/- 1778.76, N = 32218601.792204628.922145436.261. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterRedis 7.0.12 + memtier_benchmark 2.0Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5safe RET no microcodeoffsafe RET500K1000K1500K2000K2500KSE +/- 17712.54, N = 3SE +/- 14704.83, N = 3SE +/- 4916.89, N = 32167181.092197287.302145052.141. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterRedis 7.0.12 + memtier_benchmark 2.0Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10safe RET no microcodeoffsafe RET500K1000K1500K2000K2500KSE +/- 17754.58, N = 3SE +/- 14630.02, N = 3SE +/- 2448.62, N = 32173694.772177211.802172804.711. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterRedis 7.0.12 + memtier_benchmark 2.0Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10safe RET no microcodeoffsafe RET500K1000K1500K2000K2500KSE +/- 16754.20, N = 10SE +/- 30210.22, N = 3SE +/- 792.70, N = 32154339.262195705.512157815.691. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database with a variable number of concurrent repetitions -- up to the maximum number of CPU threads available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 8safe RET no microcodeoffsafe RET1.12642.25283.37924.50565.632SE +/- 0.016, N = 3SE +/- 0.013, N = 3SE +/- 0.036, N = 34.8503.7555.0061. (CC) gcc options: -O2 -lz -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 16safe RET no microcodeoffsafe RET246810SE +/- 0.024, N = 3SE +/- 0.020, N = 3SE +/- 0.052, N = 38.8346.2738.8001. (CC) gcc options: -O2 -lz -lm

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Update Randomsafe RET no microcodeoffsafe RET100K200K300K400K500KSE +/- 426.73, N = 3SE +/- 893.82, N = 3SE +/- 185.49, N = 34281124622874269471. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Read Random Write Randomsafe RET no microcodeoffsafe RET600K1200K1800K2400K3000KSE +/- 18652.89, N = 3SE +/- 35283.44, N = 4SE +/- 21895.20, N = 32872765295168428390851. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Writessafe RET no microcodeoffsafe RET50K100K150K200K250KSE +/- 950.59, N = 3SE +/- 413.74, N = 3SE +/- 479.91, N = 3233069238741236241

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read Onlysafe RET no microcodeoffsafe RET700K1400K2100K2800K3500KSE +/- 29158.10, N = 3SE +/- 1705.16, N = 3SE +/- 34286.68, N = 32707280312871927684451. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latencysafe RET no microcodeoffsafe RET0.06660.13320.19980.26640.333SE +/- 0.003, N = 3SE +/- 0.000, N = 3SE +/- 0.004, N = 30.2960.2560.2891. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read Writesafe RET no microcodeoffsafe RET13K26K39K52K65KSE +/- 66.28, N = 3SE +/- 418.78, N = 3SE +/- 207.71, N = 35517561604548371. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 15Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latencysafe RET no microcodeoffsafe RET48121620SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 314.5012.9914.591. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 4096safe RET no microcodeoffsafe RET130260390520650SE +/- 2.96, N = 3SE +/- 5.48, N = 3SE +/- 3.51, N = 34125904181. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 8192safe RET no microcodeoffsafe RET80160240320400SE +/- 0.73, N = 3SE +/- 3.35, N = 3SE +/- 1.18, N = 33013553011. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

104 Results Shown

DaCapo Benchmark:
  Jython
  Tradebeans
OpenRadioss:
  Bumper Beam
  Cell Phone Drop Test
  Bird Strike on Windshield
SPECFEM3D:
  Homogeneous Halfspace
  Water-layered Halfspace
  Layered Halfspace
  Tomographic Model
Remhos
SPECFEM3D
OpenRadioss:
  Rubber O-Ring Seal Installation
  INIVOL and Fluid Structure Interaction Drop Container
Timed MrBayes Analysis
TensorFlow
Numpy Benchmark
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  ResNet-50, Baseline - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
GROMACS
NAMD
OpenVINO:
  Person Detection FP16 - CPU:
    FPS
    ms
  Face Detection FP16-INT8 - CPU:
    FPS
    ms
  Weld Porosity Detection FP16 - CPU:
    FPS
    ms
ACES DGEMM
Algebraic Multi-Grid Benchmark
OpenFOAM:
  drivaerFastback, Medium Mesh Size - Mesh Time
  drivaerFastback, Medium Mesh Size - Execution Time
7-Zip Compression:
  Compression Rating
  Decompression Rating
Timed LLVM Compilation
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Blender:
  BMW27 - CPU-Only
  Pabellon Barcelona - CPU-Only
Timed Godot Game Engine Compilation
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon
OpenVKL
OSPRay:
  particle_volume/ao/real_time
  particle_volume/scivis/real_time
  particle_volume/pathtracer/real_time
  gravity_spheres_volume/dim_512/ao/real_time
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/pathtracer/real_time
Timed Node.js Compilation
nginx:
  500
  1000
Apache IoTDB:
  200 - 1 - 200:
    point/sec
    Average Latency
  200 - 1 - 500:
    point/sec
    Average Latency
  500 - 1 - 200:
    point/sec
    Average Latency
  500 - 1 - 500:
    point/sec
    Average Latency
  200 - 100 - 200:
    point/sec
    Average Latency
  200 - 100 - 500:
    point/sec
    Average Latency
  500 - 100 - 200:
    point/sec
    Average Latency
  500 - 100 - 500:
    point/sec
    Average Latency
Apache Spark:
  1000000 - 100 - SHA-512 Benchmark Time
  1000000 - 100 - Calculate Pi Benchmark
  1000000 - 100 - Group By Test Time
  1000000 - 100 - Repartition Test Time
  1000000 - 100 - Inner Join Test Time
  1000000 - 100 - Broadcast Inner Join Test Time
ClickHouse:
  100M Rows Hits Dataset, First Run / Cold Cache
  100M Rows Hits Dataset, Second Run
  100M Rows Hits Dataset, Third Run
CockroachDB:
  KV, 50% Reads - 128
  KV, 95% Reads - 128
Redis 7.0.12 + memtier_benchmark:
  Redis - 50 - 1:5
  Redis - 100 - 1:5
  Redis - 50 - 1:10
  Redis - 100 - 1:10
SQLite:
  8
  16
RocksDB:
  Update Rand
  Read Rand Write Rand
Apache Cassandra
PostgreSQL:
  100 - 800 - Read Only
  100 - 800 - Read Only - Average Latency
  100 - 800 - Read Write
  100 - 800 - Read Write - Average Latency
MariaDB:
  4096
  8192