AMD EPYC 7763 1P spec_rstack_overflow

Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.

HTML result view exported from: https://openbenchmarking.org/result/2308112-NE-EPYC7763124&grs&rdt.

System details (common to all four configurations: safe RET no microcode, off, safe RET, IBPB)

Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)
Motherboard: AMD DAYTONA_X (RYM1009B BIOS)
Chipset: AMD Starship/Matisse
Memory: 256GB
Disk: 800GB INTEL SSDPF21Q800GB
Graphics: ASPEED
Monitor: VE228
Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04
Kernel: 6.5.0-rc5-phx-tues (x86_64)
Desktop: GNOME Shell 42.5
Display Server: X Server 1.21.1.3
Vulkan: 1.3.224
Compiler: GCC 11.3.0 + LLVM 14.0.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Disk Details
- NONE / errors=remount-ro,relatime,rw / Block Size: 4096

Processor Details
- safe RET no microcode: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
- off: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
- safe RET: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1
- IBPB: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1

Java Details
- OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)

Python Details
- Python 3.10.6

Security Details
- safe RET no microcode: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- off: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- safe RET: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- IBPB: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
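The per-configuration spec_rstack_overflow strings in the security details come from the kernel's own vulnerability reporting. As a minimal sketch, assuming a Linux system that exposes the standard sysfs directory /sys/devices/system/cpu/vulnerabilities, the status can be read directly. The function name `read_vulnerability_status` is a hypothetical helper for illustration, and the exact wording of the status string can differ between kernel versions and from how this report formats it.

```python
from pathlib import Path
from typing import Optional

# Standard sysfs directory where Linux reports CPU vulnerability status.
VULN_DIR = Path("/sys/devices/system/cpu/vulnerabilities")


def read_vulnerability_status(name: str, base: Path = VULN_DIR) -> Optional[str]:
    """Return the kernel's status string for one vulnerability, or None if absent.

    `name` is the sysfs file name, e.g. "spec_rstack_overflow".
    `base` is overridable so the helper can be exercised without real sysfs.
    """
    entry = Path(base) / name
    try:
        return entry.read_text().strip()
    except OSError:
        # Kernels without this entry, or non-Linux systems, have no such file.
        return None


if __name__ == "__main__":
    print(read_vulnerability_status("spec_rstack_overflow"))
```

The mitigation mode itself is normally selected at boot with the spec_rstack_overflow= kernel parameter; which values were used for each run is not stated in this export.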

AMD EPYC 7763 1P spec_rstack_overflowmysqlslap: 4096pgbench: 100 - 800 - Read Onlypgbench: 100 - 800 - Read Only - Average Latencyrocksdb: Update Randsqlite: 16rocksdb: Read Rand Write Randsqlite: 8dacapobench: Tradebeansopenradioss: Bumper Beammysqlslap: 8192build-linux-kernel: defconfigspark: 1000000 - 100 - Inner Join Test Timeopenradioss: Rubber O-Ring Seal Installationspark: 1000000 - 100 - Broadcast Inner Join Test Timenginx: 500nginx: 1000pgbench: 100 - 800 - Read Writepgbench: 100 - 800 - Read Write - Average Latencybuild-linux-kernel: allmodconfigopenradioss: Cell Phone Drop Testbuild-nodejs: Time To Compileapache-iotdb: 500 - 1 - 200numpy: spark: 1000000 - 100 - Group By Test Timebuild-llvm: Ninjaapache-iotdb: 500 - 1 - 500compress-7zip: Compression Ratingapache-iotdb: 500 - 1 - 200tensorflow: CPU - 64 - ResNet-50cockroach: KV, 95% Reads - 128apache-iotdb: 500 - 1 - 500build-godot: Time To Compileopenradioss: Bird Strike on Windshieldclickhouse: 100M Rows Hits Dataset, Third Runclickhouse: 100M Rows Hits Dataset, First Run / Cold Cachespark: 1000000 - 100 - SHA-512 Benchmark Timecockroach: KV, 50% Reads - 128cassandra: Writesapache-iotdb: 500 - 100 - 200apache-iotdb: 200 - 100 - 200remhos: Sample Remap Exampleapache-iotdb: 200 - 1 - 500clickhouse: 100M Rows Hits Dataset, Second Runapache-iotdb: 200 - 1 - 200apache-iotdb: 500 - 100 - 200apache-iotdb: 200 - 100 - 200dacapobench: Jythonmemtier-benchmark: Redis - 50 - 1:5mrbayes: Primate Phylogeny Analysisopenradioss: INIVOL and Fluid Structure Interaction Drop Containerapache-iotdb: 200 - 1 - 500openfoam: drivaerFastback, Medium Mesh Size - Mesh Timeapache-iotdb: 200 - 100 - 500apache-iotdb: 200 - 100 - 500apache-iotdb: 200 - 1 - 200apache-iotdb: 500 - 100 - 500mt-dgemm: Sustained Floating-Point Ratememtier-benchmark: Redis - 100 - 1:5openvino: Person Detection FP16 - CPUospray: particle_volume/pathtracer/real_timespecfem3d: Water-layered Halfspaceapache-iotdb: 500 - 100 - 500openvino: Person Detection FP16 
- CPUmemtier-benchmark: Redis - 100 - 1:10specfem3d: Tomographic Modelembree: Pathtracer ISPC - Asian Dragonspark: 1000000 - 100 - Calculate Pi Benchmarkopenfoam: drivaerFastback, Medium Mesh Size - Execution Timememtier-benchmark: Redis - 50 - 1:10specfem3d: Mount St. Helensspecfem3d: Homogeneous Halfspaceblender: BMW27 - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyamg: namd: ATPase Simulation - 327,506 Atomsspecfem3d: Layered Halfspaceospray: gravity_spheres_volume/dim_512/pathtracer/real_timegromacs: MPI CPU - water_GMX50_bareospray: gravity_spheres_volume/dim_512/scivis/real_timeopenvino: Weld Porosity Detection FP16 - CPUospray: gravity_spheres_volume/dim_512/ao/real_timeopenvino: Weld Porosity Detection FP16 - CPUopenvkl: vklBenchmark ISPCcompress-7zip: Decompression Ratingdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamospray: particle_volume/scivis/real_timeopenvino: Face Detection FP16-INT8 - CPUospray: particle_volume/ao/real_timedeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamopenvino: Face Detection FP16-INT8 - CPUdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamembree: Pathtracer ISPC - Crowndeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - 
Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamspark: 1000000 - 100 - Calculate Pi Benchmark Using Dataframespark: 1000000 - 100 - Repartition Test Timesafe RET no microcodeoffsafe RETIBPB41227072800.2964281128.83428727654.850409693.6830137.6232.2285.041.39144020.03140555.985517514.499344.24236.37172.74913.61418.955.17182.16931.933348121202637.3615.56131487.01408658.83125.663152.91329.19323.423.42100851.423306938.5435.1017.78830.29337.0114.0547445770.1846538766.0141912218601.79137.518163.021342031.11145.06069123.6137720117.40947741.3479.1924.6958182167181.097.58155.16530.42753170958073516.794124.742154339.2614.40441923864.391632.02644.362232173694.7711.98216346017.64368039727.5884.699996454000.3811531.82926187013.26215.7308.327491126.648.9694128.384523830398.36123816.767665.8910485.0069679.576617.730527.7918.028846.7239468.36461142.2968.259537.698755.3747577.058957.2956840.772153.5972596.55792.3859031287190.2564622876.27329516843.755399387.7235531.1921.8877.461.30169583.15166499.896160412.988289.06333.10164.26814.05457.234.91176.37431.493843741176385.3517.78135187.21415756.33121.948144.83362.64349.433.39103635.023874136.5837.7017.37532.36361.8113.8349501499.1343665846.2841932204628.92136.686162.131271946.57140.61562117.9739463981.42960525.6678.9224.2005512197287.307.68157.82929.77253538658682618.184092.912195705.5114.13426560664.596431.84633.519022177211.8011.80123873217.41712093327.3484.5010117990000.3813031.84563042413.13555.6808.331741126.038.9605128.404533855858.30783840.630765.5911487.2450678.933017.751127.8318.022646.7020468.83061141.4368.148337.603755.3922576.972257.4229839.720153.5968596.59152.0941827684450.2894269478.80028390855.006414393.9030137.2432.1484.481.41142619.84143271.265483714.589338.15736.40173.06413.36422.585.15181.52827.733355951211172.1315.65132046.01583717.62125.060152.27337.45318.123.4799601.623624135.8237.5317.95830.14337.1214.7350578426.5444027904.8942412145436.26138.851163.971345
598.59144.02174120.0638833415.97918691.4582.2423.7028892145052.147.60156.41929.59086826057099408.154114.582157815.6914.18805896264.674231.43643.713162172804.7112.01038078117.69029865027.4684.499991021000.3809831.65994088513.25385.7068.338131126.148.9404928.394533835158.32193834.243965.5662487.3677682.207117.730527.8217.981746.5677468.21701142.5368.232537.631955.4028576.816657.3138840.423653.5729596.78222.2627417338270.4613222317.93421300064.7935305113.6827640.0852.4199.041.64137051.69135431.465046315.854352.17840.11195.49311.93389.925.74204.08031.113717991344749.2017.45119163.81441637.63135.962160.70349.50336.693.7195416.022081436.5636.4718.65831.74347.9214.7849316970.2844816394.7444462092844.22144.829171.751287324.35148.25442120.5338572529.56921701.4480.7824.2514742126493.297.46153.43930.05383480857529201.424204.342148876.0314.22582522763.466831.49645.409582137964.9811.95144857717.69591431927.7385.6310051386670.3853432.00976214213.17095.7078.265491116.898.8923928.624503854878.34113824.843165.7767485.6842681.396117.674327.7117.966346.6121467.31501144.9868.345637.681855.5026575.733557.2967840.888153.5782596.79022.602.31OpenBenchmarking.org

MariaDB

Clients: 4096

MariaDB 11.0.1, Clients: 4096
Queries Per Second, More Is Better
  safe RET no microcode: 412 (SE +/- 2.96, N = 3)
  off: 590 (SE +/- 5.48, N = 3)
  safe RET: 418 (SE +/- 3.51, N = 3)
  IBPB: 274 (SE +/- 0.71, N = 3)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl
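To compare configurations, it can help to express each result as a percentage change against the unmitigated "off" run. The following is a small sketch using the MariaDB 4096-client figures above; the helper name `percent_change_vs_baseline` is hypothetical and not part of the Phoronix Test Suite.

```python
def percent_change_vs_baseline(results, baseline="off"):
    """Return {config: percent change relative to the baseline config}.

    Negative values mean a lower number than the baseline. For a
    "More Is Better" metric that is a slowdown; for "Fewer Is Better"
    metrics the sign must be read the other way around.
    """
    base = results[baseline]
    return {
        name: (value - base) / base * 100.0
        for name, value in results.items()
        if name != baseline
    }


# Queries per second from the MariaDB, Clients: 4096 result.
mariadb_4096_qps = {
    "safe RET no microcode": 412,
    "off": 590,
    "safe RET": 418,
    "IBPB": 274,
}

if __name__ == "__main__":
    for name, change in percent_change_vs_baseline(mariadb_4096_qps).items():
        print(f"{name}: {change:+.1f}% vs off")
```

The same function applies to any of the result blocks in this report, as long as the direction of the metric is kept in mind when reading the sign.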

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

PostgreSQL 15, Scaling Factor: 100 - Clients: 800 - Mode: Read Only
TPS, More Is Better
  safe RET no microcode: 2707280 (SE +/- 29158.10, N = 3)
  off: 3128719 (SE +/- 1705.16, N = 3)
  safe RET: 2768445 (SE +/- 34286.68, N = 3)
  IBPB: 1733827 (SE +/- 2988.66, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

PostgreSQL 15, Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency
ms, Fewer Is Better
  safe RET no microcode: 0.296 (SE +/- 0.003, N = 3)
  off: 0.256 (SE +/- 0.000, N = 3)
  safe RET: 0.289 (SE +/- 0.004, N = 3)
  IBPB: 0.461 (SE +/- 0.001, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

RocksDB

Test: Update Random

RocksDB 8.0, Test: Update Random
Op/s, More Is Better
  safe RET no microcode: 428112 (SE +/- 426.73, N = 3)
  off: 462287 (SE +/- 893.82, N = 3)
  safe RET: 426947 (SE +/- 185.49, N = 3)
  IBPB: 322231 (SE +/- 110.81, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

SQLite

Threads / Copies: 16

SQLite 3.41.2, Threads / Copies: 16
Seconds, Fewer Is Better
  safe RET no microcode: 8.834 (SE +/- 0.024, N = 3)
  off: 6.273 (SE +/- 0.020, N = 3)
  safe RET: 8.800 (SE +/- 0.052, N = 3)
  IBPB: 7.934 (SE +/- 0.007, N = 3)
1. (CC) gcc options: -O2 -lz -lm

RocksDB

Test: Read Random Write Random

RocksDB 8.0, Test: Read Random Write Random
Op/s, More Is Better
  safe RET no microcode: 2872765 (SE +/- 18652.89, N = 3)
  off: 2951684 (SE +/- 35283.44, N = 4)
  safe RET: 2839085 (SE +/- 21895.20, N = 3)
  IBPB: 2130006 (SE +/- 7875.65, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

SQLite

Threads / Copies: 8

SQLite 3.41.2, Threads / Copies: 8
Seconds, Fewer Is Better
  safe RET no microcode: 4.850 (SE +/- 0.016, N = 3)
  off: 3.755 (SE +/- 0.013, N = 3)
  safe RET: 5.006 (SE +/- 0.036, N = 3)
  IBPB: 4.793 (SE +/- 0.010, N = 3)
1. (CC) gcc options: -O2 -lz -lm

DaCapo Benchmark

Java Test: Tradebeans

DaCapo Benchmark 9.12-MR1, Java Test: Tradebeans
msec, Fewer Is Better
  safe RET no microcode: 4096 (SE +/- 44.47, N = 4)
  off: 3993 (SE +/- 42.66, N = 4)
  safe RET: 4143 (SE +/- 28.11, N = 4)
  IBPB: 5305 (SE +/- 56.17, N = 4)

OpenRadioss

Model: Bumper Beam

OpenRadioss 2022.10.13, Model: Bumper Beam
Seconds, Fewer Is Better
  safe RET no microcode: 93.68 (SE +/- 0.08, N = 3)
  off: 87.72 (SE +/- 0.33, N = 3)
  safe RET: 93.90 (SE +/- 0.03, N = 3)
  IBPB: 113.68 (SE +/- 0.23, N = 3)

MariaDB

Clients: 8192

MariaDB 11.0.1, Clients: 8192
Queries Per Second, More Is Better
  safe RET no microcode: 301 (SE +/- 0.73, N = 3)
  off: 355 (SE +/- 3.35, N = 3)
  safe RET: 301 (SE +/- 1.18, N = 3)
  IBPB: 276 (SE +/- 0.62, N = 3)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.1, Build: defconfig
Seconds, Fewer Is Better
  safe RET no microcode: 37.62 (SE +/- 0.35, N = 6)
  off: 31.19 (SE +/- 0.34, N = 5)
  safe RET: 37.24 (SE +/- 0.37, N = 6)
  IBPB: 40.09 (SE +/- 0.37, N = 7)

Apache Spark

Row Count: 1000000 - Partitions: 100 - Inner Join Test Time

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Inner Join Test Time
Seconds, Fewer Is Better
  safe RET no microcode: 2.22 (SE +/- 0.05, N = 3)
  off: 1.88 (SE +/- 0.02, N = 15)
  safe RET: 2.14 (SE +/- 0.06, N = 3)
  IBPB: 2.41 (SE +/- 0.04, N = 3)

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenRadioss 2022.10.13, Model: Rubber O-Ring Seal Installation
Seconds, Fewer Is Better
  safe RET no microcode: 85.04 (SE +/- 0.17, N = 3)
  off: 77.46 (SE +/- 0.23, N = 3)
  safe RET: 84.48 (SE +/- 0.24, N = 3)
  IBPB: 99.04 (SE +/- 0.34, N = 3)

Apache Spark

Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time
Seconds, Fewer Is Better
  safe RET no microcode: 1.39 (SE +/- 0.02, N = 3)
  off: 1.30 (SE +/- 0.01, N = 15)
  safe RET: 1.41 (SE +/- 0.01, N = 3)
  IBPB: 1.64 (SE +/- 0.03, N = 3)

nginx

Connections: 500

nginx 1.23.2, Connections: 500
Requests Per Second, More Is Better
  safe RET no microcode: 144020.03 (SE +/- 284.55, N = 3)
  off: 169583.15 (SE +/- 284.72, N = 3)
  safe RET: 142619.84 (SE +/- 251.96, N = 3)
  IBPB: 137051.69 (SE +/- 262.73, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 1000

nginx 1.23.2, Connections: 1000
Requests Per Second, More Is Better
  safe RET no microcode: 140555.98 (SE +/- 352.89, N = 3)
  off: 166499.89 (SE +/- 362.13, N = 3)
  safe RET: 143271.26 (SE +/- 314.03, N = 3)
  IBPB: 135431.46 (SE +/- 242.54, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write

PostgreSQL 15, Scaling Factor: 100 - Clients: 800 - Mode: Read Write
TPS, More Is Better
  safe RET no microcode: 55175 (SE +/- 66.28, N = 3)
  off: 61604 (SE +/- 418.78, N = 3)
  safe RET: 54837 (SE +/- 207.71, N = 3)
  IBPB: 50463 (SE +/- 133.40, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency

PostgreSQL 15, Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency
ms, Fewer Is Better
  safe RET no microcode: 14.50 (SE +/- 0.02, N = 3)
  off: 12.99 (SE +/- 0.09, N = 3)
  safe RET: 14.59 (SE +/- 0.06, N = 3)
  IBPB: 15.85 (SE +/- 0.04, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.1, Build: allmodconfig
Seconds, Fewer Is Better
  safe RET no microcode: 344.24 (SE +/- 0.90, N = 3)
  off: 289.06 (SE +/- 0.49, N = 3)
  safe RET: 338.16 (SE +/- 0.79, N = 3)
  IBPB: 352.18 (SE +/- 0.72, N = 3)

OpenRadioss

Model: Cell Phone Drop Test

OpenRadioss 2022.10.13, Model: Cell Phone Drop Test
Seconds, Fewer Is Better
  safe RET no microcode: 36.37 (SE +/- 0.26, N = 3)
  off: 33.10 (SE +/- 0.11, N = 3)
  safe RET: 36.40 (SE +/- 0.03, N = 3)
  IBPB: 40.11 (SE +/- 0.14, N = 3)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 19.8.1, Time To Compile
Seconds, Fewer Is Better
  safe RET no microcode: 172.75 (SE +/- 0.12, N = 3)
  off: 164.27 (SE +/- 0.14, N = 3)
  safe RET: 173.06 (SE +/- 0.05, N = 3)
  IBPB: 195.49 (SE +/- 0.16, N = 3)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200
Average Latency, Fewer Is Better
  safe RET no microcode: 13.61 (SE +/- 0.14, N = 3; MAX: 854.4)
  off: 14.05 (SE +/- 0.07, N = 3; MAX: 858.17)
  safe RET: 13.36 (SE +/- 0.19, N = 3; MAX: 881.3)
  IBPB: 11.93 (SE +/- 0.13, N = 3; MAX: 855.56)

Numpy Benchmark

Numpy Benchmark
Score, More Is Better
  safe RET no microcode: 418.95 (SE +/- 2.01, N = 3)
  off: 457.23 (SE +/- 1.76, N = 3)
  safe RET: 422.58 (SE +/- 0.84, N = 3)
  IBPB: 389.92 (SE +/- 1.11, N = 3)

Apache Spark

Row Count: 1000000 - Partitions: 100 - Group By Test Time

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Group By Test Time
Seconds, Fewer Is Better
  safe RET no microcode: 5.17 (SE +/- 0.08, N = 3)
  off: 4.91 (SE +/- 0.04, N = 15)
  safe RET: 5.15 (SE +/- 0.07, N = 3)
  IBPB: 5.74 (SE +/- 0.06, N = 3)

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0, Build System: Ninja
Seconds, Fewer Is Better
  safe RET no microcode: 182.17 (SE +/- 0.11, N = 3)
  off: 176.37 (SE +/- 0.11, N = 3)
  safe RET: 181.53 (SE +/- 0.15, N = 3)
  IBPB: 204.08 (SE +/- 0.20, N = 3)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500
Average Latency, Fewer Is Better
  safe RET no microcode: 31.93 (SE +/- 0.49, N = 3; MAX: 930.97)
  off: 31.49 (SE +/- 0.29, N = 3; MAX: 939.96)
  safe RET: 27.73 (SE +/- 0.22, N = 3; MAX: 938.92)
  IBPB: 31.11 (SE +/- 0.35, N = 3; MAX: 908.02)

7-Zip Compression

Test: Compression Rating

7-Zip Compression 22.01, Test: Compression Rating
MIPS, More Is Better
  safe RET no microcode: 334812 (SE +/- 25.38, N = 3)
  off: 384374 (SE +/- 1018.75, N = 3)
  safe RET: 335595 (SE +/- 435.27, N = 3)
  IBPB: 371799 (SE +/- 248.93, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200
point/sec, More Is Better
  safe RET no microcode: 1202637.36 (SE +/- 6553.14, N = 3)
  off: 1176385.35 (SE +/- 1566.77, N = 3)
  safe RET: 1211172.13 (SE +/- 4253.29, N = 3)
  IBPB: 1344749.20 (SE +/- 3166.01, N = 3)

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

TensorFlow 2.12, Device: CPU - Batch Size: 64 - Model: ResNet-50
images/sec, More Is Better
  safe RET no microcode: 15.56 (SE +/- 0.02, N = 3)
  off: 17.78 (SE +/- 0.06, N = 3)
  safe RET: 15.65 (SE +/- 0.03, N = 3)
  IBPB: 17.45 (SE +/- 0.04, N = 3)

CockroachDB

Workload: KV, 95% Reads - Concurrency: 128

CockroachDB 22.2, Workload: KV, 95% Reads - Concurrency: 128
ops/s, More Is Better
  safe RET no microcode: 131487.0 (SE +/- 1043.12, N = 13)
  off: 135187.2 (SE +/- 931.05, N = 3)
  safe RET: 132046.0 (SE +/- 1387.70, N = 15)
  IBPB: 119163.8 (SE +/- 408.63, N = 3)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500
point/sec, More Is Better
  safe RET no microcode: 1408658.83 (SE +/- 13029.07, N = 3)
  off: 1415756.33 (SE +/- 4294.81, N = 3)
  safe RET: 1583717.62 (SE +/- 5073.96, N = 3)
  IBPB: 1441637.63 (SE +/- 6687.04, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0, Time To Compile
Seconds, Fewer Is Better
  safe RET no microcode: 125.66 (SE +/- 0.33, N = 3)
  off: 121.95 (SE +/- 0.19, N = 3)
  safe RET: 125.06 (SE +/- 0.24, N = 3)
  IBPB: 135.96 (SE +/- 0.08, N = 3)

OpenRadioss

Model: Bird Strike on Windshield

OpenRadioss 2022.10.13, Model: Bird Strike on Windshield
Seconds, Fewer Is Better
  safe RET no microcode: 152.91 (SE +/- 0.67, N = 3)
  off: 144.83 (SE +/- 0.07, N = 3)
  safe RET: 152.27 (SE +/- 0.73, N = 3)
  IBPB: 160.70 (SE +/- 0.89, N = 3)

ClickHouse

100M Rows Hits Dataset, Third Run

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, Third Run
Queries Per Minute, Geo Mean, More Is Better
  safe RET no microcode: 329.19 (SE +/- 3.27, N = 5; MIN: 31.32 / MAX: 2857.14)
  off: 362.64 (SE +/- 2.21, N = 3; MIN: 31.5 / MAX: 4285.71)
  safe RET: 337.45 (SE +/- 1.85, N = 3; MIN: 31.46 / MAX: 4000)
  IBPB: 349.50 (SE +/- 2.22, N = 3; MIN: 31.56 / MAX: 5000)

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, First Run / Cold Cache
Queries Per Minute, Geo Mean, More Is Better
  safe RET no microcode: 323.42 (SE +/- 3.38, N = 5; MIN: 30.82 / MAX: 5000)
  off: 349.43 (SE +/- 0.68, N = 3; MIN: 31.06 / MAX: 4285.71)
  safe RET: 318.12 (SE +/- 2.94, N = 3; MIN: 30.57 / MAX: 3333.33)
  IBPB: 336.69 (SE +/- 2.84, N = 3; MIN: 31.5 / MAX: 4000)

Apache Spark

Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time
Seconds, Fewer Is Better
  safe RET no microcode: 3.42 (SE +/- 0.04, N = 3)
  off: 3.39 (SE +/- 0.03, N = 15)
  safe RET: 3.47 (SE +/- 0.04, N = 3)
  IBPB: 3.71 (SE +/- 0.05, N = 3)

CockroachDB

Workload: KV, 50% Reads - Concurrency: 128

CockroachDB 22.2, Workload: KV, 50% Reads - Concurrency: 128
ops/s, More Is Better
  safe RET no microcode: 100851.4 (SE +/- 719.41, N = 15)
  off: 103635.0 (SE +/- 275.86, N = 3)
  safe RET: 99601.6 (SE +/- 948.29, N = 15)
  IBPB: 95416.0 (SE +/- 341.24, N = 3)

Apache Cassandra

Test: Writes

Apache Cassandra 4.1.3, Test: Writes
Op/s, More Is Better
  safe RET no microcode: 233069 (SE +/- 950.59, N = 3)
  off: 238741 (SE +/- 413.74, N = 3)
  safe RET: 236241 (SE +/- 479.91, N = 3)
  IBPB: 220814 (SE +/- 242.75, N = 3)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200
Average Latency, Fewer Is Better
  safe RET no microcode: 38.54 (SE +/- 0.11, N = 3; MAX: 3276.77)
  off: 36.58 (SE +/- 0.61, N = 3; MAX: 2252.73)
  safe RET: 35.82 (SE +/- 0.49, N = 3; MAX: 3267.55)
  IBPB: 36.56 (SE +/- 0.44, N = 3; MAX: 2253.21)

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200
Average Latency, Fewer Is Better
  safe RET no microcode: 35.10 (SE +/- 0.62, N = 3; MAX: 728.37)
  off: 37.70 (SE +/- 0.55, N = 15; MAX: 802.64)
  safe RET: 37.53 (SE +/- 0.52, N = 15; MAX: 755.16)
  IBPB: 36.47 (SE +/- 0.32, N = 3; MAX: 808.57)

Remhos

Test: Sample Remap Example

Remhos 1.0, Test: Sample Remap Example
Seconds, Fewer Is Better
  safe RET no microcode: 17.79 (SE +/- 0.17, N = 3)
  off: 17.38 (SE +/- 0.23, N = 3)
  safe RET: 17.96 (SE +/- 0.19, N = 3)
  IBPB: 18.66 (SE +/- 0.12, N = 14)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500
Average Latency, Fewer Is Better
  safe RET no microcode: 30.29 (SE +/- 0.02, N = 3; MAX: 715.01)
  off: 32.36 (SE +/- 0.24, N = 3; MAX: 646.51)
  safe RET: 30.14 (SE +/- 0.22, N = 3; MAX: 641.04)
  IBPB: 31.74 (SE +/- 0.42, N = 4; MAX: 667.18)

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, Second Run
Queries Per Minute, Geo Mean, More Is Better
  safe RET no microcode: 337.01 (SE +/- 2.16, N = 5; MIN: 30.49 / MAX: 3529.41)
  off: 361.81 (SE +/- 1.42, N = 3; MIN: 31.46 / MAX: 4000)
  safe RET: 337.12 (SE +/- 4.86, N = 3; MIN: 30.79 / MAX: 4000)
  IBPB: 347.92 (SE +/- 2.20, N = 3; MIN: 31.85 / MAX: 3750)

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200
Average Latency, Fewer Is Better
  safe RET no microcode: 14.05 (SE +/- 0.16, N = 12; MAX: 609.96)
  off: 13.83 (SE +/- 0.19, N = 3; MAX: 596.78)
  safe RET: 14.73 (SE +/- 0.21, N = 8; MAX: 645.11)
  IBPB: 14.78 (SE +/- 0.19, N = 9; MAX: 618.06)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200
point/sec, More Is Better
  safe RET no microcode: 47445770.18 (SE +/- 147114.88, N = 3)
  off: 49501499.13 (SE +/- 681823.31, N = 3)
  safe RET: 50578426.54 (SE +/- 634314.77, N = 3)
  IBPB: 49316970.28 (SE +/- 616490.96, N = 3)

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200
point/sec, More Is Better
  safe RET no microcode: 46538766.01 (SE +/- 614274.26, N = 3)
  off: 43665846.28 (SE +/- 574678.74, N = 15)
  safe RET: 44027904.89 (SE +/- 543529.82, N = 15)
  IBPB: 44816394.74 (SE +/- 146499.20, N = 3)

DaCapo Benchmark

Java Test: Jython

DaCapo Benchmark 9.12-MR1, Java Test: Jython
msec, Fewer Is Better
  safe RET no microcode: 4191 (SE +/- 47.28, N = 4)
  off: 4193 (SE +/- 18.07, N = 4)
  safe RET: 4241 (SE +/- 49.88, N = 4)
  IBPB: 4446 (SE +/- 38.12, N = 20)

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5

Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5
Ops/sec, More Is Better
  safe RET no microcode: 2218601.79 (SE +/- 31351.12, N = 3)
  off: 2204628.92 (SE +/- 11955.97, N = 3)
  safe RET: 2145436.26 (SE +/- 1778.76, N = 3)
  IBPB: 2092844.22 (SE +/- 21878.46, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7, Primate Phylogeny Analysis
Seconds, Fewer Is Better
  safe RET no microcode: 137.52 (SE +/- 0.66, N = 3)
  off: 136.69 (SE +/- 0.85, N = 3)
  safe RET: 138.85 (SE +/- 1.05, N = 3)
  IBPB: 144.83 (SE +/- 1.03, N = 3)
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenRadioss 2022.10.13, Model: INIVOL and Fluid Structure Interaction Drop Container
Seconds, Fewer Is Better
  safe RET no microcode: 163.02 (SE +/- 0.39, N = 3)
  off: 162.13 (SE +/- 0.16, N = 3)
  safe RET: 163.97 (SE +/- 0.50, N = 3)
  IBPB: 171.75 (SE +/- 0.17, N = 3)

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500
point/sec, More Is Better
  safe RET no microcode: 1342031.11 (SE +/- 1525.49, N = 3)
  off: 1271946.57 (SE +/- 7578.67, N = 3)
  safe RET: 1345598.59 (SE +/- 9180.92, N = 3)
  IBPB: 1287324.35 (SE +/- 14032.06, N = 4)

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenFOAM 10, Input: drivaerFastback, Medium Mesh Size - Mesh Time
Seconds, Fewer Is Better
  safe RET no microcode: 145.06
  off: 140.62
  safe RET: 144.02
  IBPB: 148.25
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500
Average Latency, Fewer Is Better
  safe RET no microcode: 123.61 (SE +/- 1.63, N = 5; MAX: 4533.33)
  off: 117.97 (SE +/- 0.95, N = 10; MAX: 4652.25)
  safe RET: 120.06 (SE +/- 0.86, N = 15; MAX: 4495.21)
  IBPB: 120.53 (SE +/- 1.16, N = 8; MAX: 4401.37)

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500
point/sec, More Is Better
  safe RET no microcode: 37720117.40 (SE +/- 394126.89, N = 5)
  off: 39463981.42 (SE +/- 302926.36, N = 10)
  safe RET: 38833415.97 (SE +/- 288707.73, N = 15)
  IBPB: 38572529.56 (SE +/- 327739.29, N = 8)

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200
point/sec, More Is Better
  safe RET no microcode: 947741.34 (SE +/- 6730.71, N = 12)
  off: 960525.66 (SE +/- 8467.91, N = 3)
  safe RET: 918691.45 (SE +/- 7998.38, N = 8)
  IBPB: 921701.44 (SE +/- 7583.18, N = 9)

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500
Average Latency, Fewer Is Better
  safe RET no microcode: 79.19 (SE +/- 0.81, N = 4; MAX: 5165.86)
  off: 78.92 (SE +/- 2.14, N = 3; MAX: 1729.94)
  safe RET: 82.24 (SE +/- 1.29, N = 3; MAX: 3625.32)
  IBPB: 80.78 (SE +/- 0.42, N = 3; MAX: 2592.69)

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0, Sustained Floating-Point Rate
GFLOP/s, More Is Better
  safe RET no microcode: 24.70 (SE +/- 0.34, N = 3)
  off: 24.20 (SE +/- 0.21, N = 8)
  safe RET: 23.70 (SE +/- 0.26, N = 5)
  IBPB: 24.25 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
safe RET no microcode: 2167181.09 (SE +/- 17712.54, N = 3)
off: 2197287.30 (SE +/- 14704.83, N = 3)
safe RET: 2145052.14 (SE +/- 4916.89, N = 3)
IBPB: 2126493.29 (SE +/- 942.73, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
safe RET no microcode: 7.58 (SE +/- 0.01, N = 3)
off: 7.68 (SE +/- 0.02, N = 3)
safe RET: 7.60 (SE +/- 0.01, N = 3)
IBPB: 7.46 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OSPRay

Benchmark: particle_volume/pathtracer/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
safe RET no microcode: 155.17 (SE +/- 1.83, N = 3)
off: 157.83 (SE +/- 0.21, N = 3)
safe RET: 156.42 (SE +/- 0.07, N = 3)
IBPB: 153.44 (SE +/- 0.43, N = 3)

SPECFEM3D

Model: Water-layered Halfspace

SPECFEM3D 4.0 (Seconds, Fewer Is Better)
safe RET no microcode: 30.43 (SE +/- 0.25, N = 3)
off: 29.77 (SE +/- 0.15, N = 3)
safe RET: 29.59 (SE +/- 0.35, N = 3)
IBPB: 30.05 (SE +/- 0.19, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500

Apache IoTDB 1.1.2 (point/sec, More Is Better)
safe RET no microcode: 58073516.79 (SE +/- 648692.91, N = 4)
off: 58682618.18 (SE +/- 817020.04, N = 3)
safe RET: 57099408.15 (SE +/- 721225.08, N = 3)
IBPB: 57529201.42 (SE +/- 269354.94, N = 3)

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
safe RET no microcode: 4124.74 (SE +/- 6.87, N = 3; MIN: 2129.26 / MAX: 5016.36)
off: 4092.91 (SE +/- 14.65, N = 3; MIN: 3409.52 / MAX: 4641.43)
safe RET: 4114.58 (SE +/- 10.77, N = 3; MIN: 2087 / MAX: 5053.62)
IBPB: 4204.34 (SE +/- 10.56, N = 3; MIN: 2302.89 / MAX: 4817.72)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
safe RET no microcode: 2154339.26 (SE +/- 16754.20, N = 10)
off: 2195705.51 (SE +/- 30210.22, N = 3)
safe RET: 2157815.69 (SE +/- 792.70, N = 3)
IBPB: 2148876.03 (SE +/- 12623.44, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

SPECFEM3D

Model: Tomographic Model

SPECFEM3D 4.0 (Seconds, Fewer Is Better)
safe RET no microcode: 14.40 (SE +/- 0.15, N = 3)
off: 14.13 (SE +/- 0.09, N = 3)
safe RET: 14.19 (SE +/- 0.20, N = 3)
IBPB: 14.23 (SE +/- 0.08, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 4.1 (Frames Per Second, More Is Better)
safe RET no microcode: 64.39 (SE +/- 0.09, N = 3; MIN: 63.77 / MAX: 66.16)
off: 64.60 (SE +/- 0.02, N = 3; MIN: 64.05 / MAX: 66.13)
safe RET: 64.67 (SE +/- 0.03, N = 3; MIN: 64.11 / MAX: 66.01)
IBPB: 63.47 (SE +/- 0.14, N = 3; MIN: 62.67 / MAX: 65.74)

Apache Spark

Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark

Apache Spark 3.3 (Seconds, Fewer Is Better)
safe RET no microcode: 32.02 (SE +/- 0.01, N = 3)
off: 31.84 (SE +/- 0.12, N = 15)
safe RET: 31.43 (SE +/- 0.33, N = 3)
IBPB: 31.49 (SE +/- 0.20, N = 3)

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
safe RET no microcode: 644.36
off: 633.52
safe RET: 643.71
IBPB: 645.41
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
safe RET no microcode: 2173694.77 (SE +/- 17754.58, N = 3)
off: 2177211.80 (SE +/- 14630.02, N = 3)
safe RET: 2172804.71 (SE +/- 2448.62, N = 3)
IBPB: 2137964.98 (SE +/- 13504.65, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

SPECFEM3D

Model: Mount St. Helens

SPECFEM3D 4.0 (Seconds, Fewer Is Better)
safe RET no microcode: 11.98 (SE +/- 0.07, N = 3)
off: 11.80 (SE +/- 0.05, N = 3)
safe RET: 12.01 (SE +/- 0.04, N = 3)
IBPB: 11.95 (SE +/- 0.05, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Homogeneous Halfspace

SPECFEM3D 4.0 (Seconds, Fewer Is Better)
safe RET no microcode: 17.64 (SE +/- 0.20, N = 4)
off: 17.42 (SE +/- 0.07, N = 3)
safe RET: 17.69 (SE +/- 0.21, N = 3)
IBPB: 17.70 (SE +/- 0.11, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 3.6 (Seconds, Fewer Is Better)
safe RET no microcode: 27.58 (SE +/- 0.06, N = 3)
off: 27.34 (SE +/- 0.04, N = 3)
safe RET: 27.46 (SE +/- 0.05, N = 3)
IBPB: 27.73 (SE +/- 0.02, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 3.6 (Seconds, Fewer Is Better)
safe RET no microcode: 84.69 (SE +/- 0.02, N = 3)
off: 84.50 (SE +/- 0.13, N = 3)
safe RET: 84.49 (SE +/- 0.04, N = 3)
IBPB: 85.63 (SE +/- 0.05, N = 3)

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
safe RET no microcode: 999645400 (SE +/- 575791.94, N = 3)
off: 1011799000 (SE +/- 839009.73, N = 3)
safe RET: 999102100 (SE +/- 367255.40, N = 3)
IBPB: 1005138667 (SE +/- 1724277.85, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

NAMD

ATPase Simulation - 327,506 Atoms

NAMD 2.14 (days/ns, Fewer Is Better)
safe RET no microcode: 0.38115 (SE +/- 0.00029, N = 3)
off: 0.38130 (SE +/- 0.00017, N = 3)
safe RET: 0.38098 (SE +/- 0.00028, N = 3)
IBPB: 0.38534 (SE +/- 0.00026, N = 3)

SPECFEM3D

Model: Layered Halfspace

SPECFEM3D 4.0 (Seconds, Fewer Is Better)
safe RET no microcode: 31.83 (SE +/- 0.23, N = 3)
off: 31.85 (SE +/- 0.35, N = 3)
safe RET: 31.66 (SE +/- 0.18, N = 3)
IBPB: 32.01 (SE +/- 0.21, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
safe RET no microcode: 13.26 (SE +/- 0.00, N = 3)
off: 13.14 (SE +/- 0.13, N = 3)
safe RET: 13.25 (SE +/- 0.01, N = 3)
IBPB: 13.17 (SE +/- 0.00, N = 3)

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2023 (Ns Per Day, More Is Better)
safe RET no microcode: 5.730 (SE +/- 0.006, N = 3)
off: 5.680 (SE +/- 0.012, N = 3)
safe RET: 5.706 (SE +/- 0.010, N = 3)
IBPB: 5.707 (SE +/- 0.011, N = 3)
1. (CXX) g++ options: -O3

OSPRay

Benchmark: gravity_spheres_volume/dim_512/scivis/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
safe RET no microcode: 8.32749 (SE +/- 0.01659, N = 3)
off: 8.33174 (SE +/- 0.00864, N = 3)
safe RET: 8.33813 (SE +/- 0.01059, N = 3)
IBPB: 8.26549 (SE +/- 0.02088, N = 3)

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
safe RET no microcode: 1126.64 (SE +/- 0.72, N = 3)
off: 1126.03 (SE +/- 0.17, N = 3)
safe RET: 1126.14 (SE +/- 0.13, N = 3)
IBPB: 1116.89 (SE +/- 0.40, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
safe RET no microcode: 8.96941 (SE +/- 0.02941, N = 3)
off: 8.96051 (SE +/- 0.02460, N = 3)
safe RET: 8.94049 (SE +/- 0.01456, N = 3)
IBPB: 8.89239 (SE +/- 0.01872, N = 3)

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
safe RET no microcode: 28.38 (SE +/- 0.02, N = 3; MIN: 14.89 / MAX: 51.63)
off: 28.40 (SE +/- 0.01, N = 3; MIN: 14.74 / MAX: 48.66)
safe RET: 28.39 (SE +/- 0.00, N = 3; MIN: 14.64 / MAX: 50.33)
IBPB: 28.62 (SE +/- 0.01, N = 3; MIN: 14.91 / MAX: 49.84)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVKL

Benchmark: vklBenchmark ISPC

OpenVKL 1.3.1 (Items / Sec, More Is Better)
safe RET no microcode: 452 (SE +/- 0.58, N = 3; MIN: 85 / MAX: 2535)
off: 453 (SE +/- 0.58, N = 3; MIN: 84 / MAX: 2528)
safe RET: 453 (SE +/- 0.00, N = 3; MIN: 84 / MAX: 2520)
IBPB: 450 (SE +/- 0.67, N = 3; MIN: 83 / MAX: 2495)

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 22.01 (MIPS, More Is Better)
safe RET no microcode: 383039 (SE +/- 380.69, N = 3)
off: 385585 (SE +/- 845.58, N = 3)
safe RET: 383515 (SE +/- 605.48, N = 3)
IBPB: 385487 (SE +/- 312.85, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (ms/batch, Fewer Is Better)
safe RET no microcode: 8.3612 (SE +/- 0.0167, N = 3)
off: 8.3078 (SE +/- 0.0095, N = 3)
safe RET: 8.3219 (SE +/- 0.0271, N = 3)
IBPB: 8.3411 (SE +/- 0.0189, N = 3)

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (items/sec, More Is Better)
safe RET no microcode: 3816.77 (SE +/- 7.72, N = 3)
off: 3840.63 (SE +/- 4.76, N = 3)
safe RET: 3834.24 (SE +/- 12.28, N = 3)
IBPB: 3824.84 (SE +/- 7.83, N = 3)

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (ms/batch, Fewer Is Better)
safe RET no microcode: 65.89 (SE +/- 0.14, N = 3)
off: 65.59 (SE +/- 0.15, N = 3)
safe RET: 65.57 (SE +/- 0.16, N = 3)
IBPB: 65.78 (SE +/- 0.20, N = 3)

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (items/sec, More Is Better)
safe RET no microcode: 485.01 (SE +/- 0.96, N = 3)
off: 487.25 (SE +/- 1.15, N = 3)
safe RET: 487.37 (SE +/- 1.01, N = 3)
IBPB: 485.68 (SE +/- 1.48, N = 3)

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (ms/batch, Fewer Is Better)
safe RET no microcode: 679.58 (SE +/- 1.47, N = 3)
off: 678.93 (SE +/- 1.21, N = 3)
safe RET: 682.21 (SE +/- 0.96, N = 3)
IBPB: 681.40 (SE +/- 1.14, N = 3)

OSPRay

Benchmark: particle_volume/scivis/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
safe RET no microcode: 17.73 (SE +/- 0.02, N = 3)
off: 17.75 (SE +/- 0.01, N = 3)
safe RET: 17.73 (SE +/- 0.01, N = 3)
IBPB: 17.67 (SE +/- 0.01, N = 3)

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
safe RET no microcode: 27.79 (SE +/- 0.02, N = 3)
off: 27.83 (SE +/- 0.01, N = 3)
safe RET: 27.82 (SE +/- 0.00, N = 3)
IBPB: 27.71 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OSPRay

Benchmark: particle_volume/ao/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
safe RET no microcode: 18.03 (SE +/- 0.01, N = 3)
off: 18.02 (SE +/- 0.02, N = 3)
safe RET: 17.98 (SE +/- 0.04, N = 3)
IBPB: 17.97 (SE +/- 0.01, N = 3)

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (items/sec, More Is Better)
safe RET no microcode: 46.72 (SE +/- 0.08, N = 3)
off: 46.70 (SE +/- 0.11, N = 3)
safe RET: 46.57 (SE +/- 0.03, N = 3)
IBPB: 46.61 (SE +/- 0.02, N = 3)

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (items/sec, More Is Better)
safe RET no microcode: 468.36 (SE +/- 0.42, N = 3)
off: 468.83 (SE +/- 0.24, N = 3)
safe RET: 468.22 (SE +/- 0.31, N = 3)
IBPB: 467.32 (SE +/- 0.43, N = 3)

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
safe RET no microcode: 1142.29 (SE +/- 0.27, N = 3; MIN: 985.75 / MAX: 1168.76)
off: 1141.43 (SE +/- 0.32, N = 3; MIN: 998.76 / MAX: 1165.45)
safe RET: 1142.53 (SE +/- 1.09, N = 3; MIN: 999.01 / MAX: 1177.02)
IBPB: 1144.98 (SE +/- 0.42, N = 3; MIN: 502.04 / MAX: 1175.93)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (ms/batch, Fewer Is Better)
safe RET no microcode: 68.26 (SE +/- 0.04, N = 3)
off: 68.15 (SE +/- 0.02, N = 3)
safe RET: 68.23 (SE +/- 0.04, N = 3)
IBPB: 68.35 (SE +/- 0.07, N = 3)

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (items/sec, More Is Better)
safe RET no microcode: 37.70 (SE +/- 0.07, N = 3)
off: 37.60 (SE +/- 0.02, N = 3)
safe RET: 37.63 (SE +/- 0.08, N = 3)
IBPB: 37.68 (SE +/- 0.09, N = 3)

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (ms/batch, Fewer Is Better)
safe RET no microcode: 55.37 (SE +/- 0.04, N = 3)
off: 55.39 (SE +/- 0.04, N = 3)
safe RET: 55.40 (SE +/- 0.03, N = 3)
IBPB: 55.50 (SE +/- 0.08, N = 3)

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (items/sec, More Is Better)
safe RET no microcode: 577.06 (SE +/- 0.39, N = 3)
off: 576.97 (SE +/- 0.44, N = 3)
safe RET: 576.82 (SE +/- 0.39, N = 3)
IBPB: 575.73 (SE +/- 0.94, N = 3)

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 4.1 (Frames Per Second, More Is Better)
safe RET no microcode: 57.30 (SE +/- 0.14, N = 3; MIN: 56.26 / MAX: 58.69)
off: 57.42 (SE +/- 0.08, N = 3; MIN: 56.59 / MAX: 58.54)
safe RET: 57.31 (SE +/- 0.15, N = 3; MIN: 56.2 / MAX: 58.59)
IBPB: 57.30 (SE +/- 0.10, N = 3; MIN: 56.3 / MAX: 58.61)

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (ms/batch, Fewer Is Better)
safe RET no microcode: 840.77 (SE +/- 0.36, N = 3)
off: 839.72 (SE +/- 0.50, N = 3)
safe RET: 840.42 (SE +/- 0.34, N = 3)
IBPB: 840.89 (SE +/- 0.53, N = 3)

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (items/sec, More Is Better)
safe RET no microcode: 53.60 (SE +/- 0.02, N = 3)
off: 53.60 (SE +/- 0.01, N = 3)
safe RET: 53.57 (SE +/- 0.02, N = 3)
IBPB: 53.58 (SE +/- 0.01, N = 3)

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (ms/batch, Fewer Is Better)
safe RET no microcode: 596.56 (SE +/- 0.26, N = 3)
off: 596.59 (SE +/- 0.18, N = 3)
safe RET: 596.78 (SE +/- 0.11, N = 3)
IBPB: 596.79 (SE +/- 0.06, N = 3)

Apache Spark

Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark Using Dataframe

Apache Spark 3.3 (Seconds, Fewer Is Better)
IBPB: 2.60 (SE +/- 0.08, N = 3)

Apache Spark

Row Count: 1000000 - Partitions: 100 - Repartition Test Time

Apache Spark 3.3 (Seconds, Fewer Is Better)
safe RET no microcode: 2.38 (SE +/- 0.04, N = 3)
off: 2.09 (SE +/- 0.04, N = 15)
safe RET: 2.26 (SE +/- 0.04, N = 3)
IBPB: 2.31 (SE +/- 0.12, N = 3)


Phoronix Test Suite v10.8.4