AMD EPYC 7763 1P spec_rstack_overflow

Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2308109-NE-EPYC7763169
Run Management

Result Identifier        Date Run          Test Duration
off                      August 10 2023    9 Hours, 10 Minutes
safe RET no microcode    August 09 2023    9 Hours, 26 Minutes
safe RET                 August 10 2023    9 Hours, 17 Minutes
Average                                    9 Hours, 18 Minutes



AMD EPYC 7763 1P spec_rstack_overflow: System Details (OpenBenchmarking.org, Phoronix Test Suite)

Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)
Motherboard: AMD DAYTONA_X (RYM1009B BIOS)
Chipset: AMD Starship/Matisse
Memory: 256GB
Disk: 800GB INTEL SSDPF21Q800GB
Graphics: ASPEED
Monitor: VE228
Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04
Kernel: 6.5.0-rc5-phx-tues (x86_64)
Desktop: GNOME Shell 42.5
Display Server: X Server 1.21.1.3
Vulkan: 1.3.224
Compiler: GCC 11.3.0 + LLVM 14.0.0
File-System: ext4
Screen Resolution: 1920x1080

System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- NONE / errors=remount-ro,relatime,rw / Block Size: 4096
- off: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
- safe RET no microcode: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
- safe RET: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1
- OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)
- Python 3.10.6
- Security mitigations, identical across the three runs except for spec_rstack_overflow:
    gather_data_sampling: Not affected
    itlb_multihit: Not affected
    l1tf: Not affected
    mds: Not affected
    meltdown: Not affected
    mmio_stale_data: Not affected
    retbleed: Not affected
    spec_store_bypass: Mitigation of SSB disabled via prctl
    spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
    spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
    srbds: Not affected
    tsx_async_abort: Not affected
- spec_rstack_overflow by run:
    off: Vulnerable no microcode
    safe RET no microcode: Mitigation of safe RET no microcode
    safe RET: Mitigation of safe RET
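The mitigation status strings listed in the system logs are the kind of information Linux exposes under /sys/devices/system/cpu/vulnerabilities. The following is a minimal sketch for reading those files on your own system before comparing against this result file. The function name `read_mitigations` is a hypothetical helper, not part of the Phoronix Test Suite, and the exact wording of each status string depends on the kernel version.

```python
from pathlib import Path

# Standard Linux sysfs directory holding one status file per CPU vulnerability.
SYSFS_VULN_DIR = Path("/sys/devices/system/cpu/vulnerabilities")


def read_mitigations(base_dir=SYSFS_VULN_DIR):
    """Return {vulnerability_name: status_string} for each readable file in base_dir.

    Hypothetical helper for illustration; returns an empty dict when the
    directory does not exist (for example on non-Linux systems).
    """
    base = Path(base_dir)
    if not base.is_dir():
        return {}
    status = {}
    for entry in sorted(base.iterdir()):
        if entry.is_file():
            try:
                status[entry.name] = entry.read_text().strip()
            except OSError:
                # Skip files that cannot be read rather than aborting.
                continue
    return status


if __name__ == "__main__":
    for name, value in read_mitigations().items():
        print(f"{name}: {value}")
```

On a kernel that carries the Inception mitigation, the output should include a `spec_rstack_overflow` entry; older kernels may not have that file at all.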

[Figure: Result Overview (Phoronix Test Suite). Relative performance of the off, safe RET no microcode, and safe RET runs per test, on an axis from 100% to 137%. Tests shown: SQLite, MariaDB, Timed Linux Kernel Compilation, nginx, TensorFlow, PostgreSQL, Numpy Benchmark, ClickHouse, 7-Zip Compression, Apache Spark, OpenRadioss, RocksDB, Timed Node.js Compilation, ACES DGEMM, Remhos, Timed LLVM Compilation, CockroachDB, Timed Godot Game Engine Compilation, OpenFOAM, Apache Cassandra, Apache IoTDB, Redis 7.0.12 + memtier_benchmark, Timed MrBayes Analysis, SPECFEM3D, Algebraic Multi-Grid Benchmark, GROMACS, Blender, OpenVINO, Embree, OpenVKL, Neural Magic DeepSparse, OSPRay, NAMD, DaCapo Benchmark.]

[Table: AMD EPYC 7763 1P spec_rstack_overflow. Summary of all results, from compress-7zip: Compression Rating through build-nodejs: Time To Compile, for the off, safe RET no microcode, and safe RET runs. Individual results follow per test.]

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 22.01, Test: Compression Rating (MIPS, More Is Better)
  safe RET no microcode: 334812 (SE +/- 25.38, N = 3)
  safe RET: 335595 (SE +/- 435.27, N = 3)
  off: 384374 (SE +/- 1018.75, N = 3)
  1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 22.01, Test: Decompression Rating (MIPS, More Is Better)
  safe RET no microcode: 383039 (SE +/- 380.69, N = 3)
  safe RET: 383515 (SE +/- 605.48, N = 3)
  off: 385585 (SE +/- 845.58, N = 3)
  1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, More Is Better)
  safe RET: 23.70 (SE +/- 0.26, N = 5)
  off: 24.20 (SE +/- 0.21, N = 8)
  safe RET no microcode: 24.70 (SE +/- 0.34, N = 3)
  1. (CC) gcc options: -O3 -march=native -fopenmp

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
  safe RET: 999102100 (SE +/- 367255.40, N = 3)
  safe RET no microcode: 999645400 (SE +/- 575791.94, N = 3)
  off: 1011799000 (SE +/- 839009.73, N = 3)
  1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

Apache Cassandra 4.1.3, Test: Writes (Op/s, More Is Better)
  safe RET no microcode: 233069 (SE +/- 950.59, N = 3)
  safe RET: 236241 (SE +/- 479.91, N = 3)
  off: 238741 (SE +/- 413.74, N = 3)

Apache IoTDB

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 (point/sec, More Is Better)
  safe RET: 918691.45 (SE +/- 7998.38, N = 8)
  safe RET no microcode: 947741.34 (SE +/- 6730.71, N = 12)
  off: 960525.66 (SE +/- 8467.91, N = 3)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 (Average Latency, Fewer Is Better)
  safe RET: 14.73 (SE +/- 0.21, N = 8, MAX: 645.11)
  safe RET no microcode: 14.05 (SE +/- 0.16, N = 12, MAX: 609.96)
  off: 13.83 (SE +/- 0.19, N = 3, MAX: 596.78)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 (point/sec, More Is Better)
  off: 1271946.57 (SE +/- 7578.67, N = 3)
  safe RET no microcode: 1342031.11 (SE +/- 1525.49, N = 3)
  safe RET: 1345598.59 (SE +/- 9180.92, N = 3)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 (Average Latency, Fewer Is Better)
  off: 32.36 (SE +/- 0.24, N = 3, MAX: 646.51)
  safe RET no microcode: 30.29 (SE +/- 0.02, N = 3, MAX: 715.01)
  safe RET: 30.14 (SE +/- 0.22, N = 3, MAX: 641.04)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 (point/sec, More Is Better)
  off: 1176385.35 (SE +/- 1566.77, N = 3)
  safe RET no microcode: 1202637.36 (SE +/- 6553.14, N = 3)
  safe RET: 1211172.13 (SE +/- 4253.29, N = 3)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 (Average Latency, Fewer Is Better)
  off: 14.05 (SE +/- 0.07, N = 3, MAX: 858.17)
  safe RET no microcode: 13.61 (SE +/- 0.14, N = 3, MAX: 854.4)
  safe RET: 13.36 (SE +/- 0.19, N = 3, MAX: 881.3)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 (point/sec, More Is Better)
  safe RET no microcode: 1408658.83 (SE +/- 13029.07, N = 3)
  off: 1415756.33 (SE +/- 4294.81, N = 3)
  safe RET: 1583717.62 (SE +/- 5073.96, N = 3)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 (Average Latency, Fewer Is Better)
  safe RET no microcode: 31.93 (SE +/- 0.49, N = 3, MAX: 930.97)
  off: 31.49 (SE +/- 0.29, N = 3, MAX: 939.96)
  safe RET: 27.73 (SE +/- 0.22, N = 3, MAX: 938.92)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 (point/sec, More Is Better)
  off: 43665846.28 (SE +/- 574678.74, N = 15)
  safe RET: 44027904.89 (SE +/- 543529.82, N = 15)
  safe RET no microcode: 46538766.01 (SE +/- 614274.26, N = 3)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 (Average Latency, Fewer Is Better)
  off: 37.70 (SE +/- 0.55, N = 15, MAX: 802.64)
  safe RET: 37.53 (SE +/- 0.52, N = 15, MAX: 755.16)
  safe RET no microcode: 35.10 (SE +/- 0.62, N = 3, MAX: 728.37)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 (point/sec, More Is Better)
  safe RET no microcode: 37720117.40 (SE +/- 394126.89, N = 5)
  safe RET: 38833415.97 (SE +/- 288707.73, N = 15)
  off: 39463981.42 (SE +/- 302926.36, N = 10)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 (Average Latency, Fewer Is Better)
  safe RET no microcode: 123.61 (SE +/- 1.63, N = 5, MAX: 4533.33)
  safe RET: 120.06 (SE +/- 0.86, N = 15, MAX: 4495.21)
  off: 117.97 (SE +/- 0.95, N = 10, MAX: 4652.25)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 (point/sec, More Is Better)
  safe RET no microcode: 47445770.18 (SE +/- 147114.88, N = 3)
  off: 49501499.13 (SE +/- 681823.31, N = 3)
  safe RET: 50578426.54 (SE +/- 634314.77, N = 3)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 (Average Latency, Fewer Is Better)
  safe RET no microcode: 38.54 (SE +/- 0.11, N = 3, MAX: 3276.77)
  off: 36.58 (SE +/- 0.61, N = 3, MAX: 2252.73)
  safe RET: 35.82 (SE +/- 0.49, N = 3, MAX: 3267.55)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 (point/sec, More Is Better)
  safe RET: 57099408.15 (SE +/- 721225.08, N = 3)
  safe RET no microcode: 58073516.79 (SE +/- 648692.91, N = 4)
  off: 58682618.18 (SE +/- 817020.04, N = 3)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 (Average Latency, Fewer Is Better)
  safe RET: 82.24 (SE +/- 1.29, N = 3, MAX: 3625.32)
  safe RET no microcode: 79.19 (SE +/- 0.81, N = 4, MAX: 5165.86)
  off: 78.92 (SE +/- 2.14, N = 3, MAX: 1729.94)

Apache Spark

This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmarks Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating test data and various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time (Seconds, Fewer Is Better)
  safe RET: 3.47 (SE +/- 0.04, N = 3)
  safe RET no microcode: 3.42 (SE +/- 0.04, N = 3)
  off: 3.39 (SE +/- 0.03, N = 15)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark (Seconds, Fewer Is Better)
  safe RET no microcode: 32.02 (SE +/- 0.01, N = 3)
  off: 31.84 (SE +/- 0.12, N = 15)
  safe RET: 31.43 (SE +/- 0.33, N = 3)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Group By Test Time (Seconds, Fewer Is Better)
  safe RET no microcode: 5.17 (SE +/- 0.08, N = 3)
  safe RET: 5.15 (SE +/- 0.07, N = 3)
  off: 4.91 (SE +/- 0.04, N = 15)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Repartition Test Time (Seconds, Fewer Is Better)
  safe RET no microcode: 2.38 (SE +/- 0.04, N = 3)
  safe RET: 2.26 (SE +/- 0.04, N = 3)
  off: 2.09 (SE +/- 0.04, N = 15)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Inner Join Test Time (Seconds, Fewer Is Better)
  safe RET no microcode: 2.22 (SE +/- 0.05, N = 3)
  safe RET: 2.14 (SE +/- 0.06, N = 3)
  off: 1.88 (SE +/- 0.02, N = 15)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time (Seconds, Fewer Is Better)
  safe RET: 1.41 (SE +/- 0.01, N = 3)
  safe RET no microcode: 1.39 (SE +/- 0.02, N = 3)
  off: 1.30 (SE +/- 0.01, N = 15)

Blender

Blender 3.6, Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
  safe RET no microcode: 27.58 (SE +/- 0.06, N = 3)
  safe RET: 27.46 (SE +/- 0.05, N = 3)
  off: 27.34 (SE +/- 0.04, N = 3)

Blender 3.6, Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
  safe RET no microcode: 84.69 (SE +/- 0.02, N = 3)
  off: 84.50 (SE +/- 0.13, N = 3)
  safe RET: 84.49 (SE +/- 0.04, N = 3)

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.
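The aggregate above is described as a geometric mean over the individual queries. As a minimal sketch of that calculation (the per-query rates below are hypothetical and are not taken from this result file, and `geometric_mean` is an illustrative helper rather than the test profile's own code):

```python
import math


def geometric_mean(values):
    """Geometric mean of positive numbers, computed through logarithms
    so that long lists of large values do not overflow."""
    if not values:
        raise ValueError("need at least one value")
    if any(v <= 0 for v in values):
        raise ValueError("all values must be positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Hypothetical per-query rates in queries per minute (illustrative only).
example_rates = [30.0, 120.0, 480.0]

print("geometric mean:", round(geometric_mean(example_rates), 2))
print("arithmetic mean:", round(sum(example_rates) / len(example_rates), 2))
```

For these sample rates the geometric mean is about 120 while the arithmetic mean is 210, which illustrates why a geometric mean is less dominated by a few very fast queries. That matters here because the reported MIN and MAX per-query values differ by roughly two orders of magnitude.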

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, First Run / Cold Cache (Queries Per Minute, Geo Mean, More Is Better)
  safe RET: 318.12 (SE +/- 2.94, N = 3, MIN: 30.57 / MAX: 3333.33)
  safe RET no microcode: 323.42 (SE +/- 3.38, N = 5, MIN: 30.82 / MAX: 5000)
  off: 349.43 (SE +/- 0.68, N = 3, MIN: 31.06 / MAX: 4285.71)

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, Second Run (Queries Per Minute, Geo Mean, More Is Better)
  safe RET no microcode: 337.01 (SE +/- 2.16, N = 5, MIN: 30.49 / MAX: 3529.41)
  safe RET: 337.12 (SE +/- 4.86, N = 3, MIN: 30.79 / MAX: 4000)
  off: 361.81 (SE +/- 1.42, N = 3, MIN: 31.46 / MAX: 4000)

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, Third Run (Queries Per Minute, Geo Mean, More Is Better)
  safe RET no microcode: 329.19 (SE +/- 3.27, N = 5, MIN: 31.32 / MAX: 2857.14)
  safe RET: 337.45 (SE +/- 1.85, N = 3, MIN: 31.46 / MAX: 4000)
  off: 362.64 (SE +/- 2.21, N = 3, MIN: 31.5 / MAX: 4285.71)

CockroachDB

CockroachDB is a cloud-native, distributed SQL database for data intensive applications. This test profile uses a server-less CockroachDB configuration to test various Cockroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.

CockroachDB 22.2, Workload: KV, 50% Reads - Concurrency: 128 (ops/s, More Is Better)
  safe RET: 99601.6 (SE +/- 948.29, N = 15)
  safe RET no microcode: 100851.4 (SE +/- 719.41, N = 15)
  off: 103635.0 (SE +/- 275.86, N = 3)

CockroachDB 22.2, Workload: KV, 95% Reads - Concurrency: 128 (ops/s, More Is Better)
  safe RET no microcode: 131487.0 (SE +/- 1043.12, N = 13)
  safe RET: 132046.0 (SE +/- 1387.70, N = 15)
  off: 135187.2 (SE +/- 931.05, N = 3)

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: Jython (msec, Fewer Is Better)
  safe RET: 4241 (SE +/- 49.88, N = 4)
  off: 4193 (SE +/- 18.07, N = 4)
  safe RET no microcode: 4191 (SE +/- 47.28, N = 4)

DaCapo Benchmark 9.12-MR1, Java Test: Tradebeans (msec, Fewer Is Better)
  safe RET: 4143 (SE +/- 28.11, N = 4)
  safe RET no microcode: 4096 (SE +/- 44.47, N = 4)
  off: 3993 (SE +/- 42.66, N = 4)

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Embree 4.1, Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
  safe RET no microcode: 57.30 (SE +/- 0.14, N = 3, MIN: 56.26 / MAX: 58.69)
  safe RET: 57.31 (SE +/- 0.15, N = 3, MIN: 56.2 / MAX: 58.59)
  off: 57.42 (SE +/- 0.08, N = 3, MIN: 56.59 / MAX: 58.54)

Embree 4.1, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
  safe RET no microcode: 64.39 (SE +/- 0.09, N = 3, MIN: 63.77 / MAX: 66.16)
  off: 64.60 (SE +/- 0.02, N = 3, MIN: 64.05 / MAX: 66.13)
  safe RET: 64.67 (SE +/- 0.03, N = 3, MIN: 64.11 / MAX: 66.01)

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2023, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
  off: 5.680 (SE +/- 0.012, N = 3)
  safe RET: 5.706 (SE +/- 0.010, N = 3)
  safe RET no microcode: 5.730 (SE +/- 0.006, N = 3)
  1. (CXX) g++ options: -O3

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

MariaDB 11.0.1, Clients: 4096 (Queries Per Second, More Is Better)
  safe RET no microcode: 412 (SE +/- 2.96, N = 3)
  safe RET: 418 (SE +/- 3.51, N = 3)
  off: 590 (SE +/- 5.48, N = 3)
  1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB 11.0.1, Clients: 8192 (Queries Per Second, More Is Better)
  safe RET no microcode: 301 (SE +/- 0.73, N = 3)
  safe RET: 301 (SE +/- 1.18, N = 3)
  off: 355 (SE +/- 3.35, N = 3)
  1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  off: 0.38130 (SE +/- 0.00017, N = 3)
  safe RET no microcode: 0.38115 (SE +/- 0.00029, N = 3)
  safe RET: 0.38098 (SE +/- 0.00028, N = 3)

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5, Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  off: 37.60 (SE +/- 0.02, N = 3)
  safe RET: 37.63 (SE +/- 0.08, N = 3)
  safe RET no microcode: 37.70 (SE +/- 0.07, N = 3)

Neural Magic DeepSparse 1.5, Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  safe RET no microcode: 840.77 (SE +/- 0.36, N = 3)
  safe RET: 840.42 (SE +/- 0.34, N = 3)
  off: 839.72 (SE +/- 0.50, N = 3)

Neural Magic DeepSparse 1.5, Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  safe RET no microcode: 485.01 (SE +/- 0.96, N = 3)
  off: 487.25 (SE +/- 1.15, N = 3)
  safe RET: 487.37 (SE +/- 1.01, N = 3)

Neural Magic DeepSparse 1.5, Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  safe RET no microcode: 65.89 (SE +/- 0.14, N = 3)
  off: 65.59 (SE +/- 0.15, N = 3)
  safe RET: 65.57 (SE +/- 0.16, N = 3)

Neural Magic DeepSparse 1.5, Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  safe RET: 468.22 (SE +/- 0.31, N = 3)
  safe RET no microcode: 468.36 (SE +/- 0.42, N = 3)
  off: 468.83 (SE +/- 0.24, N = 3)

Neural Magic DeepSparse 1.5, Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  safe RET no microcode: 68.26 (SE +/- 0.04, N = 3)
  safe RET: 68.23 (SE +/- 0.04, N = 3)
  off: 68.15 (SE +/- 0.02, N = 3)

Neural Magic DeepSparse 1.5, Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  safe RET no microcode: 3816.77 (SE +/- 7.72, N = 3)
  safe RET: 3834.24 (SE +/- 12.28, N = 3)
  off: 3840.63 (SE +/- 4.76, N = 3)

Neural Magic DeepSparse 1.5, Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  safe RET no microcode: 8.3612 (SE +/- 0.0167, N = 3)
  safe RET: 8.3219 (SE +/- 0.0271, N = 3)
  off: 8.3078 (SE +/- 0.0095, N = 3)

Neural Magic DeepSparse 1.5, Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  safe RET: 46.57 (SE +/- 0.03, N = 3)
  off: 46.70 (SE +/- 0.11, N = 3)
  safe RET no microcode: 46.72 (SE +/- 0.08, N = 3)

Neural Magic DeepSparse 1.5, Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  safe RET: 682.21 (SE +/- 0.96, N = 3)
  safe RET no microcode: 679.58 (SE +/- 1.47, N = 3)
  off: 678.93 (SE +/- 1.21, N = 3)

Neural Magic DeepSparse 1.5, Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  safe RET: 53.57 (SE +/- 0.02, N = 3)
  off: 53.60 (SE +/- 0.01, N = 3)
  safe RET no microcode: 53.60 (SE +/- 0.02, N = 3)

Neural Magic DeepSparse 1.5, Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  safe RET: 596.78 (SE +/- 0.11, N = 3)
  off: 596.59 (SE +/- 0.18, N = 3)
  safe RET no microcode: 596.56 (SE +/- 0.26, N = 3)

Neural Magic DeepSparse 1.5, Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  safe RET: 576.82 (SE +/- 0.39, N = 3)
  off: 576.97 (SE +/- 0.44, N = 3)
  safe RET no microcode: 577.06 (SE +/- 0.39, N = 3)

Neural Magic DeepSparse 1.5, Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
  safe RET: 55.40 (SE +/- 0.03, N = 3)
  off: 55.39 (SE +/- 0.04, N = 3)
  safe RET no microcode: 55.37 (SE +/- 0.04, N = 3)

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

nginx 1.23.2, Connections: 500 (Requests Per Second, More Is Better)
  safe RET: 142619.84 (SE +/- 251.96, N = 3)
  safe RET no microcode: 144020.03 (SE +/- 284.55, N = 3)
  off: 169583.15 (SE +/- 284.72, N = 3)
  1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx 1.23.2, Connections: 1000 (Requests Per Second, More Is Better)
  safe RET no microcode: 140555.98 (SE +/- 352.89, N = 3)
  safe RET: 143271.26 (SE +/- 314.03, N = 3)
  off: 166499.89 (SE +/- 362.13, N = 3)
  1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
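
To put the nginx figures into relative terms, the sketch below computes the percent change of each mitigated run against the "off" run. The requests-per-second values are copied from the two nginx results above; the helper name `percent_change` is an illustrative choice and not part of the Phoronix Test Suite.

```python
def percent_change(baseline, value):
    """Signed percent change of value relative to baseline."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (value - baseline) / baseline * 100.0


# Requests per second from the nginx 1.23.2 results, keyed by connection count.
nginx = {
    500: {
        "off": 169583.15,
        "safe RET no microcode": 144020.03,
        "safe RET": 142619.84,
    },
    1000: {
        "off": 166499.89,
        "safe RET no microcode": 140555.98,
        "safe RET": 143271.26,
    },
}

for connections, runs in nginx.items():
    for name in ("safe RET no microcode", "safe RET"):
        delta = percent_change(runs["off"], runs[name])
        print(f"{connections} connections, {name}: {delta:+.1f}% vs off")
```

Because more requests per second is better, a negative result means the mitigated run was slower than "off". For "Fewer Is Better" metrics such as seconds or latency, the sign reads the other way.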

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark (Score, More Is Better)
  safe RET no microcode: 418.95 (SE +/- 2.01, N = 3)
  safe RET: 422.58 (SE +/- 0.84, N = 3)
  off: 457.23 (SE +/- 1.76, N = 3)

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Mesh Time (Seconds, Fewer Is Better)
  safe RET no microcode: 145.06
  safe RET: 144.02
  off: 140.62
  (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Execution Time (Seconds, Fewer Is Better)
  safe RET no microcode: 644.36
  safe RET: 643.71
  off: 633.52
  (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenRadioss

OpenRadioss is an open-source, AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test currently uses a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2022.10.13 - Model: Bumper Beam (Seconds, Fewer Is Better)
  safe RET: 93.90 (SE +/- 0.03, N = 3)
  safe RET no microcode: 93.68 (SE +/- 0.08, N = 3)
  off: 87.72 (SE +/- 0.33, N = 3)

OpenRadioss 2022.10.13 - Model: Cell Phone Drop Test (Seconds, Fewer Is Better)
  safe RET: 36.40 (SE +/- 0.03, N = 3)
  safe RET no microcode: 36.37 (SE +/- 0.26, N = 3)
  off: 33.10 (SE +/- 0.11, N = 3)

OpenRadioss 2022.10.13 - Model: Bird Strike on Windshield (Seconds, Fewer Is Better)
  safe RET no microcode: 152.91 (SE +/- 0.67, N = 3)
  safe RET: 152.27 (SE +/- 0.73, N = 3)
  off: 144.83 (SE +/- 0.07, N = 3)

OpenRadioss 2022.10.13 - Model: Rubber O-Ring Seal Installation (Seconds, Fewer Is Better)
  safe RET no microcode: 85.04 (SE +/- 0.17, N = 3)
  safe RET: 84.48 (SE +/- 0.24, N = 3)
  off: 77.46 (SE +/- 0.23, N = 3)

OpenRadioss 2022.10.13 - Model: INIVOL and Fluid Structure Interaction Drop Container (Seconds, Fewer Is Better)
  safe RET: 163.97 (SE +/- 0.50, N = 3)
  safe RET no microcode: 163.02 (SE +/- 0.39, N = 3)
  off: 162.13 (SE +/- 0.16, N = 3)

OpenVINO

This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3 - Model: Person Detection FP16 - Device: CPU (FPS, More Is Better)
  safe RET no microcode: 7.58 (SE +/- 0.01, N = 3)
  safe RET: 7.60 (SE +/- 0.01, N = 3)
  off: 7.68 (SE +/- 0.02, N = 3)
  (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Person Detection FP16 - Device: CPU (ms, Fewer Is Better)
  safe RET no microcode: 4124.74 (SE +/- 6.87, N = 3; MIN: 2129.26 / MAX: 5016.36)
  safe RET: 4114.58 (SE +/- 10.77, N = 3; MIN: 2087 / MAX: 5053.62)
  off: 4092.91 (SE +/- 14.65, N = 3; MIN: 3409.52 / MAX: 4641.43)
  (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Face Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
  safe RET no microcode: 27.79 (SE +/- 0.02, N = 3)
  safe RET: 27.82 (SE +/- 0.00, N = 3)
  off: 27.83 (SE +/- 0.01, N = 3)
  (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
  safe RET: 1142.53 (SE +/- 1.09, N = 3; MIN: 999.01 / MAX: 1177.02)
  safe RET no microcode: 1142.29 (SE +/- 0.27, N = 3; MIN: 985.75 / MAX: 1168.76)
  off: 1141.43 (SE +/- 0.32, N = 3; MIN: 998.76 / MAX: 1165.45)
  (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Weld Porosity Detection FP16 - Device: CPU (FPS, More Is Better)
  off: 1126.03 (SE +/- 0.17, N = 3)
  safe RET: 1126.14 (SE +/- 0.13, N = 3)
  safe RET no microcode: 1126.64 (SE +/- 0.72, N = 3)
  (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Weld Porosity Detection FP16 - Device: CPU (ms, Fewer Is Better)
  off: 28.40 (SE +/- 0.01, N = 3; MIN: 14.74 / MAX: 48.66)
  safe RET: 28.39 (SE +/- 0.00, N = 3; MIN: 14.64 / MAX: 50.33)
  safe RET no microcode: 28.38 (SE +/- 0.02, N = 3; MIN: 14.89 / MAX: 51.63)
  (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 1.3.1 - Benchmark: vklBenchmark ISPC (Items / Sec, More Is Better)
  safe RET no microcode: 452 (SE +/- 0.58, N = 3; MIN: 85 / MAX: 2535)
  off: 453 (SE +/- 0.58, N = 3; MIN: 84 / MAX: 2528)
  safe RET: 453 (SE +/- 0.00, N = 3; MIN: 84 / MAX: 2520)

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay 2.12 - Benchmark: particle_volume/ao/real_time (Items Per Second, More Is Better)
  safe RET: 17.98 (SE +/- 0.04, N = 3)
  off: 18.02 (SE +/- 0.02, N = 3)
  safe RET no microcode: 18.03 (SE +/- 0.01, N = 3)

OSPRay 2.12 - Benchmark: particle_volume/scivis/real_time (Items Per Second, More Is Better)
  safe RET no microcode: 17.73 (SE +/- 0.02, N = 3)
  safe RET: 17.73 (SE +/- 0.01, N = 3)
  off: 17.75 (SE +/- 0.01, N = 3)

OSPRay 2.12 - Benchmark: particle_volume/pathtracer/real_time (Items Per Second, More Is Better)
  safe RET no microcode: 155.17 (SE +/- 1.83, N = 3)
  safe RET: 156.42 (SE +/- 0.07, N = 3)
  off: 157.83 (SE +/- 0.21, N = 3)

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/ao/real_time (Items Per Second, More Is Better)
  safe RET: 8.94049 (SE +/- 0.01456, N = 3)
  off: 8.96051 (SE +/- 0.02460, N = 3)
  safe RET no microcode: 8.96941 (SE +/- 0.02941, N = 3)

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/scivis/real_time (Items Per Second, More Is Better)
  safe RET no microcode: 8.32749 (SE +/- 0.01659, N = 3)
  off: 8.33174 (SE +/- 0.00864, N = 3)
  safe RET: 8.33813 (SE +/- 0.01059, N = 3)

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time (Items Per Second, More Is Better)
  off: 13.14 (SE +/- 0.13, N = 3)
  safe RET: 13.25 (SE +/- 0.01, N = 3)
  safe RET no microcode: 13.26 (SE +/- 0.00, N = 3)

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only (TPS, More Is Better)
  safe RET no microcode: 2707280 (SE +/- 29158.10, N = 3)
  safe RET: 2768445 (SE +/- 34286.68, N = 3)
  off: 3128719 (SE +/- 1705.16, N = 3)
  (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
  safe RET no microcode: 0.296 (SE +/- 0.003, N = 3)
  safe RET: 0.289 (SE +/- 0.004, N = 3)
  off: 0.256 (SE +/- 0.000, N = 3)
  (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write (TPS, More Is Better)
  safe RET: 54837 (SE +/- 207.71, N = 3)
  safe RET no microcode: 55175 (SE +/- 66.28, N = 3)
  off: 61604 (SE +/- 418.78, N = 3)
  (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
  safe RET: 14.59 (SE +/- 0.06, N = 3)
  safe RET no microcode: 14.50 (SE +/- 0.02, N = 3)
  off: 12.99 (SE +/- 0.09, N = 3)
  (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
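
The PostgreSQL throughput and latency figures appear to be two views of the same measurement: with a fixed number of clients, the average latency is approximately the client count divided by the transactions per second. The sketch below checks that relationship against the means reported above. It is an illustration only; the names `PGBENCH_RESULTS` and `latency_ms_from_tps` are hypothetical, and small differences are expected because each reported value is itself an average over several runs.

```python
# (TPS, reported average latency in ms) copied from the PostgreSQL results in this file.
# PGBENCH_RESULTS and latency_ms_from_tps are illustrative names, not pgbench APIs.
CLIENTS = 800

PGBENCH_RESULTS = {
    "Read Only": {
        "safe RET no microcode": (2707280, 0.296),
        "safe RET": (2768445, 0.289),
        "off": (3128719, 0.256),
    },
    "Read Write": {
        "safe RET": (54837, 14.59),
        "safe RET no microcode": (55175, 14.50),
        "off": (61604, 12.99),
    },
}


def latency_ms_from_tps(clients, tps):
    """Estimate average latency in milliseconds as clients / throughput."""
    return clients / tps * 1000.0


for mode, results in PGBENCH_RESULTS.items():
    for config, (tps, reported_ms) in results.items():
        estimated_ms = latency_ms_from_tps(CLIENTS, tps)
        print(f"{mode}, {config}: estimated {estimated_ms:.3f} ms, reported {reported_ms} ms")
```

Because the two metrics track each other this closely, the latency graphs add little information beyond the TPS graphs for this workload.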

Redis 7.0.12 + memtier_benchmark

Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 (Ops/sec, More Is Better)
  safe RET: 2145436.26 (SE +/- 1778.76, N = 3)
  off: 2204628.92 (SE +/- 11955.97, N = 3)
  safe RET no microcode: 2218601.79 (SE +/- 31351.12, N = 3)
  (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 (Ops/sec, More Is Better)
  safe RET: 2145052.14 (SE +/- 4916.89, N = 3)
  safe RET no microcode: 2167181.09 (SE +/- 17712.54, N = 3)
  off: 2197287.30 (SE +/- 14704.83, N = 3)
  (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 (Ops/sec, More Is Better)
  safe RET: 2172804.71 (SE +/- 2448.62, N = 3)
  safe RET no microcode: 2173694.77 (SE +/- 17754.58, N = 3)
  off: 2177211.80 (SE +/- 14630.02, N = 3)
  (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 (Ops/sec, More Is Better)
  safe RET no microcode: 2154339.26 (SE +/- 16754.20, N = 10)
  safe RET: 2157815.69 (SE +/- 792.70, N = 3)
  off: 2195705.51 (SE +/- 30210.22, N = 3)
  (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Remhos

Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

Remhos 1.0 - Test: Sample Remap Example (Seconds, Fewer Is Better)
  safe RET: 17.96 (SE +/- 0.19, N = 3)
  safe RET no microcode: 17.79 (SE +/- 0.17, N = 3)
  off: 17.38 (SE +/- 0.23, N = 3)
  (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

RocksDB 8.0 - Test: Update Random (Op/s, More Is Better)
  safe RET: 426947 (SE +/- 185.49, N = 3)
  safe RET no microcode: 428112 (SE +/- 426.73, N = 3)
  off: 462287 (SE +/- 893.82, N = 3)
  (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB 8.0 - Test: Read Random Write Random (Op/s, More Is Better)
  safe RET: 2839085 (SE +/- 21895.20, N = 3)
  safe RET no microcode: 2872765 (SE +/- 18652.89, N = 3)
  off: 2951684 (SE +/- 35283.44, N = 4)
  (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0 - Model: Mount St. Helens (Seconds, Fewer Is Better)
  safe RET: 12.01 (SE +/- 0.04, N = 3)
  safe RET no microcode: 11.98 (SE +/- 0.07, N = 3)
  off: 11.80 (SE +/- 0.05, N = 3)
  (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D 4.0 - Model: Layered Halfspace (Seconds, Fewer Is Better)
  off: 31.85 (SE +/- 0.35, N = 3)
  safe RET no microcode: 31.83 (SE +/- 0.23, N = 3)
  safe RET: 31.66 (SE +/- 0.18, N = 3)
  (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D 4.0 - Model: Tomographic Model (Seconds, Fewer Is Better)
  safe RET no microcode: 14.40 (SE +/- 0.15, N = 3)
  safe RET: 14.19 (SE +/- 0.20, N = 3)
  off: 14.13 (SE +/- 0.09, N = 3)
  (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D 4.0 - Model: Homogeneous Halfspace (Seconds, Fewer Is Better)
  safe RET: 17.69 (SE +/- 0.21, N = 3)
  safe RET no microcode: 17.64 (SE +/- 0.20, N = 4)
  off: 17.42 (SE +/- 0.07, N = 3)
  (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D 4.0 - Model: Water-layered Halfspace (Seconds, Fewer Is Better)
  safe RET no microcode: 30.43 (SE +/- 0.25, N = 3)
  off: 29.77 (SE +/- 0.15, N = 3)
  safe RET: 29.59 (SE +/- 0.35, N = 3)
  (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database with a variable number of concurrent repetitions -- up to the maximum number of CPU threads available. Learn more via the OpenBenchmarking.org test page.

SQLite 3.41.2 - Threads / Copies: 8 (Seconds, Fewer Is Better)
  safe RET: 5.006 (SE +/- 0.036, N = 3)
  safe RET no microcode: 4.850 (SE +/- 0.016, N = 3)
  off: 3.755 (SE +/- 0.013, N = 3)
  (CC) gcc options: -O2 -lz -lm

SQLite 3.41.2 - Threads / Copies: 16 (Seconds, Fewer Is Better)
  safe RET no microcode: 8.834 (SE +/- 0.024, N = 3)
  safe RET: 8.800 (SE +/- 0.052, N = 3)
  off: 6.273 (SE +/- 0.020, N = 3)
  (CC) gcc options: -O2 -lz -lm
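
For readers unfamiliar with what "timing insertions on an indexed database" means in practice, here is a rough single-threaded sketch using Python's standard `sqlite3` module. It is not the test profile's actual workload: the table name `samples`, the index name, the row count and the in-memory database are all assumptions chosen for illustration, so its timings are not comparable to the results above.

```python
import sqlite3
import time
from typing import Tuple


def time_indexed_inserts(num_rows: int) -> Tuple[float, int]:
    """Insert num_rows rows into an indexed table and return (seconds, row count).

    The schema and the in-memory database are illustrative assumptions,
    not what the pts/sqlite test profile actually runs.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE samples (id INTEGER PRIMARY KEY, payload TEXT)")
    # The secondary index has to be maintained on every insert.
    conn.execute("CREATE INDEX idx_samples_payload ON samples (payload)")

    start = time.perf_counter()
    with conn:  # commit once after all inserts
        conn.executemany(
            "INSERT INTO samples (payload) VALUES (?)",
            ((f"row-{i}",) for i in range(num_rows)),
        )
    elapsed = time.perf_counter() - start

    count = conn.execute("SELECT COUNT(*) FROM samples").fetchone()[0]
    conn.close()
    return elapsed, count


if __name__ == "__main__":
    seconds, rows = time_indexed_inserts(10_000)
    print(f"Inserted {rows} rows in {seconds:.4f} s")
```

The real test profile additionally runs several concurrent copies (8 and 16 in the results above), which this sketch does not attempt to reproduce.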

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 64 - Model: ResNet-50 (images/sec, More Is Better)
  safe RET no microcode: 15.56 (SE +/- 0.02, N = 3)
  safe RET: 15.65 (SE +/- 0.03, N = 3)
  off: 17.78 (SE +/- 0.06, N = 3)

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine; it is built here using the SCons build system, targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation 4.0 - Time To Compile (Seconds, Fewer Is Better)
  safe RET no microcode: 125.66 (SE +/- 0.33, N = 3)
  safe RET: 125.06 (SE +/- 0.24, N = 3)
  off: 121.95 (SE +/- 0.19, N = 3)

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 6.1 - Build: defconfig (Seconds, Fewer Is Better)
  safe RET no microcode: 37.62 (SE +/- 0.35, N = 6)
  safe RET: 37.24 (SE +/- 0.37, N = 6)
  off: 31.19 (SE +/- 0.34, N = 5)

Timed Linux Kernel Compilation 6.1 - Build: allmodconfig (Seconds, Fewer Is Better)
  safe RET no microcode: 344.24 (SE +/- 0.90, N = 3)
  safe RET: 338.16 (SE +/- 0.79, N = 3)
  off: 289.06 (SE +/- 0.49, N = 3)

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 16.0 - Build System: Ninja (Seconds, Fewer Is Better)
  safe RET no microcode: 182.17 (SE +/- 0.11, N = 3)
  safe RET: 181.53 (SE +/- 0.15, N = 3)
  off: 176.37 (SE +/- 0.11, N = 3)

Timed MrBayes Analysis

This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis (Seconds, Fewer Is Better)
  safe RET: 138.85 (SE +/- 1.05, N = 3)
  safe RET no microcode: 137.52 (SE +/- 0.66, N = 3)
  off: 136.69 (SE +/- 0.85, N = 3)
  (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 19.8.1 - Time To Compile (Seconds, Fewer Is Better)
  safe RET: 173.06 (SE +/- 0.05, N = 3)
  safe RET no microcode: 172.75 (SE +/- 0.12, N = 3)
  off: 164.27 (SE +/- 0.14, N = 3)

104 Results Shown

7-Zip Compression:
  Compression Rating
  Decompression Rating
ACES DGEMM
Algebraic Multi-Grid Benchmark
Apache Cassandra
Apache IoTDB:
  200 - 1 - 200:
    point/sec
    Average Latency
  200 - 1 - 500:
    point/sec
    Average Latency
  500 - 1 - 200:
    point/sec
    Average Latency
  500 - 1 - 500:
    point/sec
    Average Latency
  200 - 100 - 200:
    point/sec
    Average Latency
  200 - 100 - 500:
    point/sec
    Average Latency
  500 - 100 - 200:
    point/sec
    Average Latency
  500 - 100 - 500:
    point/sec
    Average Latency
Apache Spark:
  1000000 - 100 - SHA-512 Benchmark Time
  1000000 - 100 - Calculate Pi Benchmark
  1000000 - 100 - Group By Test Time
  1000000 - 100 - Repartition Test Time
  1000000 - 100 - Inner Join Test Time
  1000000 - 100 - Broadcast Inner Join Test Time
Blender:
  BMW27 - CPU-Only
  Pabellon Barcelona - CPU-Only
ClickHouse:
  100M Rows Hits Dataset, First Run / Cold Cache
  100M Rows Hits Dataset, Second Run
  100M Rows Hits Dataset, Third Run
CockroachDB:
  KV, 50% Reads - 128
  KV, 95% Reads - 128
DaCapo Benchmark:
  Jython
  Tradebeans
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon
GROMACS
MariaDB:
  4096
  8192
NAMD
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  ResNet-50, Baseline - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
nginx:
  500
  1000
Numpy Benchmark
OpenFOAM:
  drivaerFastback, Medium Mesh Size - Mesh Time
  drivaerFastback, Medium Mesh Size - Execution Time
OpenRadioss:
  Bumper Beam
  Cell Phone Drop Test
  Bird Strike on Windshield
  Rubber O-Ring Seal Installation
  INIVOL and Fluid Structure Interaction Drop Container
OpenVINO:
  Person Detection FP16 - CPU:
    FPS
    ms
  Face Detection FP16-INT8 - CPU:
    FPS
    ms
  Weld Porosity Detection FP16 - CPU:
    FPS
    ms
OpenVKL
OSPRay:
  particle_volume/ao/real_time
  particle_volume/scivis/real_time
  particle_volume/pathtracer/real_time
  gravity_spheres_volume/dim_512/ao/real_time
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/pathtracer/real_time
PostgreSQL:
  100 - 800 - Read Only
  100 - 800 - Read Only - Average Latency
  100 - 800 - Read Write
  100 - 800 - Read Write - Average Latency
Redis 7.0.12 + memtier_benchmark:
  Redis - 50 - 1:5
  Redis - 100 - 1:5
  Redis - 50 - 1:10
  Redis - 100 - 1:10
Remhos
RocksDB:
  Update Rand
  Read Rand Write Rand
SPECFEM3D:
  Mount St. Helens
  Layered Halfspace
  Tomographic Model
  Homogeneous Halfspace
  Water-layered Halfspace
SQLite:
  8
  16
TensorFlow
Timed Godot Game Engine Compilation
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Timed LLVM Compilation
Timed MrBayes Analysis
Timed Node.js Compilation