Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2308112-NE-EPYC7763124 AMD EPYC 7763 1P spec_rstack_overflow - Phoronix Test Suite AMD EPYC 7763 1P spec_rstack_overflow Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.
HTML result view exported from: https://openbenchmarking.org/result/2308112-NE-EPYC7763124&sro&grs .
AMD EPYC 7763 1P spec_rstack_overflow Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution off safe RET no microcode safe RET IBPB AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads) AMD DAYTONA_X (RYM1009B BIOS) AMD Starship/Matisse 256GB 800GB INTEL SSDPF21Q800GB ASPEED VE228 2 x Mellanox MT27710 Ubuntu 22.04 6.5.0-rc5-phx-tues (x86_64) GNOME Shell 42.5 X Server 1.21.1.3 1.3.224 GCC 11.3.0 + LLVM 14.0.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - off: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 - safe RET no microcode: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 - safe RET: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1 - IBPB: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1 Java Details - OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04) Python Details - Python 3.10.6 Security Details - off: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - safe RET no microcode: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - safe RET: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - IBPB: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
AMD EPYC 7763 1P spec_rstack_overflow mysqlslap: 4096 pgbench: 100 - 800 - Read Only pgbench: 100 - 800 - Read Only - Average Latency rocksdb: Update Rand sqlite: 16 rocksdb: Read Rand Write Rand sqlite: 8 dacapobench: Tradebeans openradioss: Bumper Beam mysqlslap: 8192 build-linux-kernel: defconfig spark: 1000000 - 100 - Inner Join Test Time openradioss: Rubber O-Ring Seal Installation spark: 1000000 - 100 - Broadcast Inner Join Test Time nginx: 500 nginx: 1000 pgbench: 100 - 800 - Read Write pgbench: 100 - 800 - Read Write - Average Latency build-linux-kernel: allmodconfig openradioss: Cell Phone Drop Test build-nodejs: Time To Compile apache-iotdb: 500 - 1 - 200 numpy: spark: 1000000 - 100 - Group By Test Time build-llvm: Ninja apache-iotdb: 500 - 1 - 500 compress-7zip: Compression Rating apache-iotdb: 500 - 1 - 200 tensorflow: CPU - 64 - ResNet-50 cockroach: KV, 95% Reads - 128 apache-iotdb: 500 - 1 - 500 build-godot: Time To Compile openradioss: Bird Strike on Windshield clickhouse: 100M Rows Hits Dataset, Third Run clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache spark: 1000000 - 100 - SHA-512 Benchmark Time cockroach: KV, 50% Reads - 128 cassandra: Writes apache-iotdb: 500 - 100 - 200 apache-iotdb: 200 - 100 - 200 remhos: Sample Remap Example apache-iotdb: 200 - 1 - 500 clickhouse: 100M Rows Hits Dataset, Second Run apache-iotdb: 200 - 1 - 200 apache-iotdb: 500 - 100 - 200 apache-iotdb: 200 - 100 - 200 dacapobench: Jython memtier-benchmark: Redis - 50 - 1:5 mrbayes: Primate Phylogeny Analysis openradioss: INIVOL and Fluid Structure Interaction Drop Container apache-iotdb: 200 - 1 - 500 openfoam: drivaerFastback, Medium Mesh Size - Mesh Time apache-iotdb: 200 - 100 - 500 apache-iotdb: 200 - 100 - 500 apache-iotdb: 200 - 1 - 200 apache-iotdb: 500 - 100 - 500 mt-dgemm: Sustained Floating-Point Rate memtier-benchmark: Redis - 100 - 1:5 openvino: Person Detection FP16 - CPU ospray: particle_volume/pathtracer/real_time specfem3d: Water-layered Halfspace apache-iotdb: 500 - 100 - 500 openvino: Person Detection FP16 - CPU memtier-benchmark: Redis - 100 - 1:10 specfem3d: Tomographic Model embree: Pathtracer ISPC - Asian Dragon spark: 1000000 - 100 - Calculate Pi Benchmark openfoam: drivaerFastback, Medium Mesh Size - Execution Time memtier-benchmark: Redis - 50 - 1:10 specfem3d: Mount St. Helens specfem3d: Homogeneous Halfspace blender: BMW27 - CPU-Only blender: Pabellon Barcelona - CPU-Only amg: namd: ATPase Simulation - 327,506 Atoms specfem3d: Layered Halfspace ospray: gravity_spheres_volume/dim_512/pathtracer/real_time gromacs: MPI CPU - water_GMX50_bare ospray: gravity_spheres_volume/dim_512/scivis/real_time openvino: Weld Porosity Detection FP16 - CPU ospray: gravity_spheres_volume/dim_512/ao/real_time openvino: Weld Porosity Detection FP16 - CPU openvkl: vklBenchmark ISPC compress-7zip: Decompression Rating deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream ospray: particle_volume/scivis/real_time openvino: Face Detection FP16-INT8 - CPU ospray: particle_volume/ao/real_time deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream openvino: Face Detection FP16-INT8 - CPU deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream embree: Pathtracer ISPC - Crown deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream spark: 1000000 - 100 - Calculate Pi Benchmark Using Dataframe spark: 1000000 - 100 - Repartition Test Time off safe RET no microcode safe RET IBPB 590 3128719 0.256 462287 6.273 2951684 3.755 3993 87.72 355 31.192 1.88 77.46 1.30 169583.15 166499.89 61604 12.988 289.063 33.10 164.268 14.05 457.23 4.91 176.374 31.49 384374 1176385.35 17.78 135187.2 1415756.33 121.948 144.83 362.64 349.43 3.39 103635.0 238741 36.58 37.70 17.375 32.36 361.81 13.83 49501499.13 43665846.28 4193 2204628.92 136.686 162.13 1271946.57 140.61562 117.97 39463981.42 960525.66 78.92 24.200551 2197287.30 7.68 157.829 29.772535386 58682618.18 4092.91 2195705.51 14.134265606 64.5964 31.84 633.51902 2177211.80 11.801238732 17.417120933 27.34 84.50 1011799000 0.38130 31.845630424 13.1355 5.680 8.33174 1126.03 8.96051 28.40 453 385585 8.3078 3840.6307 65.5911 487.2450 678.9330 17.7511 27.83 18.0226 46.7020 468.8306 1141.43 68.1483 37.6037 55.3922 576.9722 57.4229 839.7201 53.5968 596.5915 2.09 412 2707280 0.296 428112 8.834 2872765 4.850 4096 93.68 301 37.623 2.22 85.04 1.39 144020.03 140555.98 55175 14.499 344.242 36.37 172.749 13.61 418.95 5.17 182.169 31.93 334812 1202637.36 15.56 131487.0 1408658.83 125.663 152.91 329.19 323.42 3.42 100851.4 233069 38.54 35.10 17.788 30.29 337.01 14.05 47445770.18 46538766.01 4191 2218601.79 137.518 163.02 1342031.11 145.06069 123.61 37720117.40 947741.34 79.19 24.695818 2167181.09 7.58 155.165 30.427531709 58073516.79 4124.74 2154339.26 14.404419238 64.3916 32.02 644.36223 2173694.77 11.982163460 17.643680397 27.58 84.69 999645400 0.38115 31.829261870 13.2621 5.730 8.32749 1126.64 8.96941 28.38 452 383039 8.3612 3816.7676 65.8910 485.0069 679.5766 17.7305 27.79 18.0288 46.7239 468.3646 1142.29 68.2595 37.6987 55.3747 577.0589 57.2956 840.7721 53.5972 596.5579 2.38 418 2768445 0.289 426947 8.800 2839085 5.006 4143 93.90 301 37.243 2.14 84.48 1.41 142619.84 143271.26 54837 14.589 338.157 36.40 173.064 13.36 422.58 5.15 181.528 27.73 335595 1211172.13 15.65 132046.0 1583717.62 125.060 152.27 337.45 318.12 3.47 99601.6 236241 35.82 37.53 17.958 30.14 337.12 14.73 50578426.54 44027904.89 4241 2145436.26 138.851 163.97 1345598.59 144.02174 120.06 38833415.97 918691.45 82.24 23.702889 2145052.14 7.60 156.419 29.590868260 57099408.15 4114.58 2157815.69 14.188058962 64.6742 31.43 643.71316 2172804.71 12.010380781 17.690298650 27.46 84.49 999102100 0.38098 31.659940885 13.2538 5.706 8.33813 1126.14 8.94049 28.39 453 383515 8.3219 3834.2439 65.5662 487.3677 682.2071 17.7305 27.82 17.9817 46.5677 468.2170 1142.53 68.2325 37.6319 55.4028 576.8166 57.3138 840.4236 53.5729 596.7822 2.26 274 1733827 0.461 322231 7.934 2130006 4.793 5305 113.68 276 40.085 2.41 99.04 1.64 137051.69 135431.46 50463 15.854 352.178 40.11 195.493 11.93 389.92 5.74 204.080 31.11 371799 1344749.20 17.45 119163.8 1441637.63 135.962 160.70 349.50 336.69 3.71 95416.0 220814 36.56 36.47 18.658 31.74 347.92 14.78 49316970.28 44816394.74 4446 2092844.22 144.829 171.75 1287324.35 148.25442 120.53 38572529.56 921701.44 80.78 24.251474 2126493.29 7.46 153.439 30.053834808 57529201.42 4204.34 2148876.03 14.225825227 63.4668 31.49 645.40958 2137964.98 11.951448577 17.695914319 27.73 85.63 1005138667 0.38534 32.009762142 13.1709 5.707 8.26549 1116.89 8.89239 28.62 450 385487 8.3411 3824.8431 65.7767 485.6842 681.3961 17.6743 27.71 17.9663 46.6121 467.3150 1144.98 68.3456 37.6818 55.5026 575.7335 57.2967 840.8881 53.5782 596.7902 2.60 2.31 OpenBenchmarking.org
MariaDB Clients: 4096 OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 11.0.1 Clients: 4096 IBPB off safe RET safe RET no microcode 130 260 390 520 650 SE +/- 0.71, N = 3 SE +/- 5.48, N = 3 SE +/- 3.51, N = 3 SE +/- 2.96, N = 3 274 590 418 412 1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl
PostgreSQL Scaling Factor: 100 - Clients: 800 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only IBPB off safe RET safe RET no microcode 700K 1400K 2100K 2800K 3500K SE +/- 2988.66, N = 3 SE +/- 1705.16, N = 3 SE +/- 34286.68, N = 3 SE +/- 29158.10, N = 3 1733827 3128719 2768445 2707280 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency IBPB off safe RET safe RET no microcode 0.1037 0.2074 0.3111 0.4148 0.5185 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 0.461 0.256 0.289 0.296 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
RocksDB Test: Update Random OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Update Random IBPB off safe RET safe RET no microcode 100K 200K 300K 400K 500K SE +/- 110.81, N = 3 SE +/- 893.82, N = 3 SE +/- 185.49, N = 3 SE +/- 426.73, N = 3 322231 462287 426947 428112 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
SQLite Threads / Copies: 16 OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 16 IBPB off safe RET safe RET no microcode 2 4 6 8 10 SE +/- 0.007, N = 3 SE +/- 0.020, N = 3 SE +/- 0.052, N = 3 SE +/- 0.024, N = 3 7.934 6.273 8.800 8.834 1. (CC) gcc options: -O2 -lz -lm
RocksDB Test: Read Random Write Random OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read Random Write Random IBPB off safe RET safe RET no microcode 600K 1200K 1800K 2400K 3000K SE +/- 7875.65, N = 3 SE +/- 35283.44, N = 4 SE +/- 21895.20, N = 3 SE +/- 18652.89, N = 3 2130006 2951684 2839085 2872765 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
SQLite Threads / Copies: 8 OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 8 IBPB off safe RET safe RET no microcode 1.1264 2.2528 3.3792 4.5056 5.632 SE +/- 0.010, N = 3 SE +/- 0.013, N = 3 SE +/- 0.036, N = 3 SE +/- 0.016, N = 3 4.793 3.755 5.006 4.850 1. (CC) gcc options: -O2 -lz -lm
DaCapo Benchmark Java Test: Tradebeans OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans IBPB off safe RET safe RET no microcode 1100 2200 3300 4400 5500 SE +/- 56.17, N = 4 SE +/- 42.66, N = 4 SE +/- 28.11, N = 4 SE +/- 44.47, N = 4 5305 3993 4143 4096
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Bumper Beam IBPB off safe RET safe RET no microcode 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 0.33, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 113.68 87.72 93.90 93.68
MariaDB Clients: 8192 OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 11.0.1 Clients: 8192 IBPB off safe RET safe RET no microcode 80 160 240 320 400 SE +/- 0.62, N = 3 SE +/- 3.35, N = 3 SE +/- 1.18, N = 3 SE +/- 0.73, N = 3 276 355 301 301 1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig IBPB off safe RET safe RET no microcode 9 18 27 36 45 SE +/- 0.37, N = 7 SE +/- 0.34, N = 5 SE +/- 0.37, N = 6 SE +/- 0.35, N = 6 40.09 31.19 37.24 37.62
Apache Spark Row Count: 1000000 - Partitions: 100 - Inner Join Test Time OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Inner Join Test Time IBPB off safe RET safe RET no microcode 0.5423 1.0846 1.6269 2.1692 2.7115 SE +/- 0.04, N = 3 SE +/- 0.02, N = 15 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 2.41 1.88 2.14 2.22
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Rubber O-Ring Seal Installation IBPB off safe RET safe RET no microcode 20 40 60 80 100 SE +/- 0.34, N = 3 SE +/- 0.23, N = 3 SE +/- 0.24, N = 3 SE +/- 0.17, N = 3 99.04 77.46 84.48 85.04
Apache Spark Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time IBPB off safe RET safe RET no microcode 0.369 0.738 1.107 1.476 1.845 SE +/- 0.03, N = 3 SE +/- 0.01, N = 15 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 1.64 1.30 1.41 1.39
nginx Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 IBPB off safe RET safe RET no microcode 40K 80K 120K 160K 200K SE +/- 262.73, N = 3 SE +/- 284.72, N = 3 SE +/- 251.96, N = 3 SE +/- 284.55, N = 3 137051.69 169583.15 142619.84 144020.03 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 IBPB off safe RET safe RET no microcode 40K 80K 120K 160K 200K SE +/- 242.54, N = 3 SE +/- 362.13, N = 3 SE +/- 314.03, N = 3 SE +/- 352.89, N = 3 135431.46 166499.89 143271.26 140555.98 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
PostgreSQL Scaling Factor: 100 - Clients: 800 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Write IBPB off safe RET safe RET no microcode 13K 26K 39K 52K 65K SE +/- 133.40, N = 3 SE +/- 418.78, N = 3 SE +/- 207.71, N = 3 SE +/- 66.28, N = 3 50463 61604 54837 55175 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency IBPB off safe RET safe RET no microcode 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 15.85 12.99 14.59 14.50 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig IBPB off safe RET safe RET no microcode 80 160 240 320 400 SE +/- 0.72, N = 3 SE +/- 0.49, N = 3 SE +/- 0.79, N = 3 SE +/- 0.90, N = 3 352.18 289.06 338.16 344.24
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Cell Phone Drop Test IBPB off safe RET safe RET no microcode 9 18 27 36 45 SE +/- 0.14, N = 3 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 SE +/- 0.26, N = 3 40.11 33.10 36.40 36.37
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile IBPB off safe RET safe RET no microcode 40 80 120 160 200 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 SE +/- 0.05, N = 3 SE +/- 0.12, N = 3 195.49 164.27 173.06 172.75
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 IBPB off safe RET safe RET no microcode 4 8 12 16 20 SE +/- 0.13, N = 3 SE +/- 0.07, N = 3 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 11.93 14.05 13.36 13.61 MAX: 855.56 MAX: 858.17 MAX: 881.3 MAX: 854.4
Numpy Benchmark OpenBenchmarking.org Score, More Is Better Numpy Benchmark IBPB off safe RET safe RET no microcode 100 200 300 400 500 SE +/- 1.11, N = 3 SE +/- 1.76, N = 3 SE +/- 0.84, N = 3 SE +/- 2.01, N = 3 389.92 457.23 422.58 418.95
Apache Spark Row Count: 1000000 - Partitions: 100 - Group By Test Time OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Group By Test Time IBPB off safe RET safe RET no microcode 1.2915 2.583 3.8745 5.166 6.4575 SE +/- 0.06, N = 3 SE +/- 0.04, N = 15 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 5.74 4.91 5.15 5.17
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Ninja IBPB off safe RET safe RET no microcode 40 80 120 160 200 SE +/- 0.20, N = 3 SE +/- 0.11, N = 3 SE +/- 0.15, N = 3 SE +/- 0.11, N = 3 204.08 176.37 181.53 182.17
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 IBPB off safe RET safe RET no microcode 7 14 21 28 35 SE +/- 0.35, N = 3 SE +/- 0.29, N = 3 SE +/- 0.22, N = 3 SE +/- 0.49, N = 3 31.11 31.49 27.73 31.93 MAX: 908.02 MAX: 939.96 MAX: 938.92 MAX: 930.97
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating IBPB off safe RET safe RET no microcode 80K 160K 240K 320K 400K SE +/- 248.93, N = 3 SE +/- 1018.75, N = 3 SE +/- 435.27, N = 3 SE +/- 25.38, N = 3 371799 384374 335595 334812 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 IBPB off safe RET safe RET no microcode 300K 600K 900K 1200K 1500K SE +/- 3166.01, N = 3 SE +/- 1566.77, N = 3 SE +/- 4253.29, N = 3 SE +/- 6553.14, N = 3 1344749.20 1176385.35 1211172.13 1202637.36
TensorFlow Device: CPU - Batch Size: 64 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 IBPB off safe RET safe RET no microcode 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 17.45 17.78 15.65 15.56
CockroachDB Workload: KV, 95% Reads - Concurrency: 128 OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 95% Reads - Concurrency: 128 IBPB off safe RET safe RET no microcode 30K 60K 90K 120K 150K SE +/- 408.63, N = 3 SE +/- 931.05, N = 3 SE +/- 1387.70, N = 15 SE +/- 1043.12, N = 13 119163.8 135187.2 132046.0 131487.0
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 IBPB off safe RET safe RET no microcode 300K 600K 900K 1200K 1500K SE +/- 6687.04, N = 3 SE +/- 4294.81, N = 3 SE +/- 5073.96, N = 3 SE +/- 13029.07, N = 3 1441637.63 1415756.33 1583717.62 1408658.83
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 4.0 Time To Compile IBPB off safe RET safe RET no microcode 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.19, N = 3 SE +/- 0.24, N = 3 SE +/- 0.33, N = 3 135.96 121.95 125.06 125.66
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Bird Strike on Windshield IBPB off safe RET safe RET no microcode 40 80 120 160 200 SE +/- 0.89, N = 3 SE +/- 0.07, N = 3 SE +/- 0.73, N = 3 SE +/- 0.67, N = 3 160.70 144.83 152.27 152.91
ClickHouse 100M Rows Hits Dataset, Third Run OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, Third Run IBPB off safe RET safe RET no microcode 80 160 240 320 400 SE +/- 2.22, N = 3 SE +/- 2.21, N = 3 SE +/- 1.85, N = 3 SE +/- 3.27, N = 5 349.50 362.64 337.45 329.19 MIN: 31.56 / MAX: 5000 MIN: 31.5 / MAX: 4285.71 MIN: 31.46 / MAX: 4000 MIN: 31.32 / MAX: 2857.14
ClickHouse 100M Rows Hits Dataset, First Run / Cold Cache OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, First Run / Cold Cache IBPB off safe RET safe RET no microcode 80 160 240 320 400 SE +/- 2.84, N = 3 SE +/- 0.68, N = 3 SE +/- 2.94, N = 3 SE +/- 3.38, N = 5 336.69 349.43 318.12 323.42 MIN: 31.5 / MAX: 4000 MIN: 31.06 / MAX: 4285.71 MIN: 30.57 / MAX: 3333.33 MIN: 30.82 / MAX: 5000
Apache Spark Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time IBPB off safe RET safe RET no microcode 0.8348 1.6696 2.5044 3.3392 4.174 SE +/- 0.05, N = 3 SE +/- 0.03, N = 15 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 3.71 3.39 3.47 3.42
CockroachDB Workload: KV, 50% Reads - Concurrency: 128 OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 50% Reads - Concurrency: 128 IBPB off safe RET safe RET no microcode 20K 40K 60K 80K 100K SE +/- 341.24, N = 3 SE +/- 275.86, N = 3 SE +/- 948.29, N = 15 SE +/- 719.41, N = 15 95416.0 103635.0 99601.6 100851.4
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.1.3 Test: Writes IBPB off safe RET safe RET no microcode 50K 100K 150K 200K 250K SE +/- 242.75, N = 3 SE +/- 413.74, N = 3 SE +/- 479.91, N = 3 SE +/- 950.59, N = 3 220814 238741 236241 233069
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 IBPB off safe RET safe RET no microcode 9 18 27 36 45 SE +/- 0.44, N = 3 SE +/- 0.61, N = 3 SE +/- 0.49, N = 3 SE +/- 0.11, N = 3 36.56 36.58 35.82 38.54 MAX: 2253.21 MAX: 2252.73 MAX: 3267.55 MAX: 3276.77
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 IBPB off safe RET safe RET no microcode 9 18 27 36 45 SE +/- 0.32, N = 3 SE +/- 0.55, N = 15 SE +/- 0.52, N = 15 SE +/- 0.62, N = 3 36.47 37.70 37.53 35.10 MAX: 808.57 MAX: 802.64 MAX: 755.16 MAX: 728.37
Remhos Test: Sample Remap Example OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example IBPB off safe RET safe RET no microcode 5 10 15 20 25 SE +/- 0.12, N = 14 SE +/- 0.23, N = 3 SE +/- 0.19, N = 3 SE +/- 0.17, N = 3 18.66 17.38 17.96 17.79 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 IBPB off safe RET safe RET no microcode 8 16 24 32 40 SE +/- 0.42, N = 4 SE +/- 0.24, N = 3 SE +/- 0.22, N = 3 SE +/- 0.02, N = 3 31.74 32.36 30.14 30.29 MAX: 667.18 MAX: 646.51 MAX: 641.04 MAX: 715.01
ClickHouse 100M Rows Hits Dataset, Second Run OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, Second Run IBPB off safe RET safe RET no microcode 80 160 240 320 400 SE +/- 2.20, N = 3 SE +/- 1.42, N = 3 SE +/- 4.86, N = 3 SE +/- 2.16, N = 5 347.92 361.81 337.12 337.01 MIN: 31.85 / MAX: 3750 MIN: 31.46 / MAX: 4000 MIN: 30.79 / MAX: 4000 MIN: 30.49 / MAX: 3529.41
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 IBPB off safe RET safe RET no microcode 4 8 12 16 20 SE +/- 0.19, N = 9 SE +/- 0.19, N = 3 SE +/- 0.21, N = 8 SE +/- 0.16, N = 12 14.78 13.83 14.73 14.05 MAX: 618.06 MAX: 596.78 MAX: 645.11 MAX: 609.96
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 IBPB off safe RET safe RET no microcode 11M 22M 33M 44M 55M SE +/- 616490.96, N = 3 SE +/- 681823.31, N = 3 SE +/- 634314.77, N = 3 SE +/- 147114.88, N = 3 49316970.28 49501499.13 50578426.54 47445770.18
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 IBPB off safe RET safe RET no microcode 10M 20M 30M 40M 50M SE +/- 146499.20, N = 3 SE +/- 574678.74, N = 15 SE +/- 543529.82, N = 15 SE +/- 614274.26, N = 3 44816394.74 43665846.28 44027904.89 46538766.01
DaCapo Benchmark Java Test: Jython OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Jython IBPB off safe RET safe RET no microcode 1000 2000 3000 4000 5000 SE +/- 38.12, N = 20 SE +/- 18.07, N = 4 SE +/- 49.88, N = 4 SE +/- 47.28, N = 4 4446 4193 4241 4191
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 IBPB off safe RET safe RET no microcode 500K 1000K 1500K 2000K 2500K SE +/- 21878.46, N = 3 SE +/- 11955.97, N = 3 SE +/- 1778.76, N = 3 SE +/- 31351.12, N = 3 2092844.22 2204628.92 2145436.26 2218601.79 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis IBPB off safe RET safe RET no microcode 30 60 90 120 150 SE +/- 1.03, N = 3 SE +/- 0.85, N = 3 SE +/- 1.05, N = 3 SE +/- 0.66, N = 3 144.83 136.69 138.85 137.52 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: INIVOL and Fluid Structure Interaction Drop Container IBPB off safe RET safe RET no microcode 40 80 120 160 200 SE +/- 0.17, N = 3 SE +/- 0.16, N = 3 SE +/- 0.50, N = 3 SE +/- 0.39, N = 3 171.75 162.13 163.97 163.02
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 IBPB off safe RET safe RET no microcode 300K 600K 900K 1200K 1500K SE +/- 14032.06, N = 4 SE +/- 7578.67, N = 3 SE +/- 9180.92, N = 3 SE +/- 1525.49, N = 3 1287324.35 1271946.57 1345598.59 1342031.11
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time IBPB off safe RET safe RET no microcode 30 60 90 120 150 148.25 140.62 144.02 145.06 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 IBPB off safe RET safe RET no microcode 30 60 90 120 150 SE +/- 1.16, N = 8 SE +/- 0.95, N = 10 SE +/- 0.86, N = 15 SE +/- 1.63, N = 5 120.53 117.97 120.06 123.61 MAX: 4401.37 MAX: 4652.25 MAX: 4495.21 MAX: 4533.33
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 IBPB off safe RET safe RET no microcode 8M 16M 24M 32M 40M SE +/- 327739.29, N = 8 SE +/- 302926.36, N = 10 SE +/- 288707.73, N = 15 SE +/- 394126.89, N = 5 38572529.56 39463981.42 38833415.97 37720117.40
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 IBPB off safe RET safe RET no microcode 200K 400K 600K 800K 1000K SE +/- 7583.18, N = 9 SE +/- 8467.91, N = 3 SE +/- 7998.38, N = 8 SE +/- 6730.71, N = 12 921701.44 960525.66 918691.45 947741.34
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 IBPB off safe RET safe RET no microcode 20 40 60 80 100 SE +/- 0.42, N = 3 SE +/- 2.14, N = 3 SE +/- 1.29, N = 3 SE +/- 0.81, N = 4 80.78 78.92 82.24 79.19 MAX: 2592.69 MAX: 1729.94 MAX: 3625.32 MAX: 5165.86
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate IBPB off safe RET safe RET no microcode 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.21, N = 8 SE +/- 0.26, N = 5 SE +/- 0.34, N = 3 24.25 24.20 23.70 24.70 1. (CC) gcc options: -O3 -march=native -fopenmp
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 IBPB off safe RET safe RET no microcode 500K 1000K 1500K 2000K 2500K SE +/- 942.73, N = 3 SE +/- 14704.83, N = 3 SE +/- 4916.89, N = 3 SE +/- 17712.54, N = 3 2126493.29 2197287.30 2145052.14 2167181.09 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU IBPB off safe RET safe RET no microcode 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 7.46 7.68 7.60 7.58 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OSPRay Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time IBPB off safe RET safe RET no microcode 30 60 90 120 150 SE +/- 0.43, N = 3 SE +/- 0.21, N = 3 SE +/- 0.07, N = 3 SE +/- 1.83, N = 3 153.44 157.83 156.42 155.17
SPECFEM3D Model: Water-layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Water-layered Halfspace IBPB off safe RET safe RET no microcode 7 14 21 28 35 SE +/- 0.19, N = 3 SE +/- 0.15, N = 3 SE +/- 0.35, N = 3 SE +/- 0.25, N = 3 30.05 29.77 29.59 30.43 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 IBPB off safe RET safe RET no microcode 13M 26M 39M 52M 65M SE +/- 269354.94, N = 3 SE +/- 817020.04, N = 3 SE +/- 721225.08, N = 3 SE +/- 648692.91, N = 4 57529201.42 58682618.18 57099408.15 58073516.79
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU IBPB off safe RET safe RET no microcode 900 1800 2700 3600 4500 SE +/- 10.56, N = 3 SE +/- 14.65, N = 3 SE +/- 10.77, N = 3 SE +/- 6.87, N = 3 4204.34 4092.91 4114.58 4124.74 MIN: 2302.89 / MAX: 4817.72 MIN: 3409.52 / MAX: 4641.43 MIN: 2087 / MAX: 5053.62 MIN: 2129.26 / MAX: 5016.36 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 IBPB off safe RET safe RET no microcode 500K 1000K 1500K 2000K 2500K SE +/- 12623.44, N = 3 SE +/- 30210.22, N = 3 SE +/- 792.70, N = 3 SE +/- 16754.20, N = 10 2148876.03 2195705.51 2157815.69 2154339.26 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
SPECFEM3D Model: Tomographic Model OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Tomographic Model IBPB off safe RET safe RET no microcode 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.20, N = 3 SE +/- 0.15, N = 3 14.23 14.13 14.19 14.40 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon IBPB off safe RET safe RET no microcode 14 28 42 56 70 SE +/- 0.14, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 63.47 64.60 64.67 64.39 MIN: 62.67 / MAX: 65.74 MIN: 64.05 / MAX: 66.13 MIN: 64.11 / MAX: 66.01 MIN: 63.77 / MAX: 66.16
Apache Spark Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark IBPB off safe RET safe RET no microcode 7 14 21 28 35 SE +/- 0.20, N = 3 SE +/- 0.12, N = 15 SE +/- 0.33, N = 3 SE +/- 0.01, N = 3 31.49 31.84 31.43 32.02
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time IBPB off safe RET safe RET no microcode 140 280 420 560 700 645.41 633.52 643.71 644.36 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 IBPB off safe RET safe RET no microcode 500K 1000K 1500K 2000K 2500K SE +/- 13504.65, N = 3 SE +/- 14630.02, N = 3 SE +/- 2448.62, N = 3 SE +/- 17754.58, N = 3 2137964.98 2177211.80 2172804.71 2173694.77 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
SPECFEM3D Model: Mount St. Helens OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Mount St. Helens IBPB off safe RET safe RET no microcode 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 11.95 11.80 12.01 11.98 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
SPECFEM3D Model: Homogeneous Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Homogeneous Halfspace IBPB off safe RET safe RET no microcode 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 SE +/- 0.20, N = 4 17.70 17.42 17.69 17.64 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only IBPB off safe RET safe RET no microcode 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 27.73 27.34 27.46 27.58
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only IBPB off safe RET safe RET no microcode 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.13, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 85.63 84.50 84.49 84.69
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 IBPB off safe RET safe RET no microcode 200M 400M 600M 800M 1000M SE +/- 1724277.85, N = 3 SE +/- 839009.73, N = 3 SE +/- 367255.40, N = 3 SE +/- 575791.94, N = 3 1005138667 1011799000 999102100 999645400 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms IBPB off safe RET safe RET no microcode 0.0867 0.1734 0.2601 0.3468 0.4335 SE +/- 0.00026, N = 3 SE +/- 0.00017, N = 3 SE +/- 0.00028, N = 3 SE +/- 0.00029, N = 3 0.38534 0.38130 0.38098 0.38115
SPECFEM3D Model: Layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Layered Halfspace IBPB off safe RET safe RET no microcode 7 14 21 28 35 SE +/- 0.21, N = 3 SE +/- 0.35, N = 3 SE +/- 0.18, N = 3 SE +/- 0.23, N = 3 32.01 31.85 31.66 31.83 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OSPRay Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time IBPB off safe RET safe RET no microcode 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.13, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 13.17 13.14 13.25 13.26
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare IBPB off safe RET safe RET no microcode 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.011, N = 3 SE +/- 0.012, N = 3 SE +/- 0.010, N = 3 SE +/- 0.006, N = 3 5.707 5.680 5.706 5.730 1. (CXX) g++ options: -O3
OSPRay Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time IBPB off safe RET safe RET no microcode 2 4 6 8 10 SE +/- 0.02088, N = 3 SE +/- 0.00864, N = 3 SE +/- 0.01059, N = 3 SE +/- 0.01659, N = 3 8.26549 8.33174 8.33813 8.32749
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16 - Device: CPU IBPB off safe RET safe RET no microcode 200 400 600 800 1000 SE +/- 0.40, N = 3 SE +/- 0.17, N = 3 SE +/- 0.13, N = 3 SE +/- 0.72, N = 3 1116.89 1126.03 1126.14 1126.64 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OSPRay Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time IBPB off safe RET safe RET no microcode 3 6 9 12 15 SE +/- 0.01872, N = 3 SE +/- 0.02460, N = 3 SE +/- 0.01456, N = 3 SE +/- 0.02941, N = 3 8.89239 8.96051 8.94049 8.96941
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16 - Device: CPU IBPB off safe RET safe RET no microcode 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 28.62 28.40 28.39 28.38 MIN: 14.91 / MAX: 49.84 MIN: 14.74 / MAX: 48.66 MIN: 14.64 / MAX: 50.33 MIN: 14.89 / MAX: 51.63 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenVKL Benchmark: vklBenchmark ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.3.1 Benchmark: vklBenchmark ISPC IBPB off safe RET safe RET no microcode 100 200 300 400 500 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 SE +/- 0.00, N = 3 SE +/- 0.58, N = 3 450 453 453 452 MIN: 83 / MAX: 2495 MIN: 84 / MAX: 2528 MIN: 84 / MAX: 2520 MIN: 85 / MAX: 2535
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating IBPB off safe RET safe RET no microcode 80K 160K 240K 320K 400K SE +/- 312.85, N = 3 SE +/- 845.58, N = 3 SE +/- 605.48, N = 3 SE +/- 380.69, N = 3 385487 385585 383515 383039 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Neural Magic DeepSparse Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 2 4 6 8 10 SE +/- 0.0189, N = 3 SE +/- 0.0095, N = 3 SE +/- 0.0271, N = 3 SE +/- 0.0167, N = 3 8.3411 8.3078 8.3219 8.3612
Neural Magic DeepSparse Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 800 1600 2400 3200 4000 SE +/- 7.83, N = 3 SE +/- 4.76, N = 3 SE +/- 12.28, N = 3 SE +/- 7.72, N = 3 3824.84 3840.63 3834.24 3816.77
Neural Magic DeepSparse Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 15 30 45 60 75 SE +/- 0.20, N = 3 SE +/- 0.15, N = 3 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 65.78 65.59 65.57 65.89
Neural Magic DeepSparse Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 110 220 330 440 550 SE +/- 1.48, N = 3 SE +/- 1.15, N = 3 SE +/- 1.01, N = 3 SE +/- 0.96, N = 3 485.68 487.25 487.37 485.01
Neural Magic DeepSparse Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 150 300 450 600 750 SE +/- 1.14, N = 3 SE +/- 1.21, N = 3 SE +/- 0.96, N = 3 SE +/- 1.47, N = 3 681.40 678.93 682.21 679.58
OSPRay Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time IBPB off safe RET safe RET no microcode 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 17.67 17.75 17.73 17.73
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU IBPB off safe RET safe RET no microcode 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 27.71 27.83 27.82 27.79 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OSPRay Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time IBPB off safe RET safe RET no microcode 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 17.97 18.02 17.98 18.03
Neural Magic DeepSparse Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 46.61 46.70 46.57 46.72
Neural Magic DeepSparse Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 100 200 300 400 500 SE +/- 0.43, N = 3 SE +/- 0.24, N = 3 SE +/- 0.31, N = 3 SE +/- 0.42, N = 3 467.32 468.83 468.22 468.36
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU IBPB off safe RET safe RET no microcode 200 400 600 800 1000 SE +/- 0.42, N = 3 SE +/- 0.32, N = 3 SE +/- 1.09, N = 3 SE +/- 0.27, N = 3 1144.98 1141.43 1142.53 1142.29 MIN: 502.04 / MAX: 1175.93 MIN: 998.76 / MAX: 1165.45 MIN: 999.01 / MAX: 1177.02 MIN: 985.75 / MAX: 1168.76 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
Neural Magic DeepSparse Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 68.35 68.15 68.23 68.26
Neural Magic DeepSparse Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 9 18 27 36 45 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 37.68 37.60 37.63 37.70
Neural Magic DeepSparse Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 12 24 36 48 60 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 55.50 55.39 55.40 55.37
Neural Magic DeepSparse Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 120 240 360 480 600 SE +/- 0.94, N = 3 SE +/- 0.44, N = 3 SE +/- 0.39, N = 3 SE +/- 0.39, N = 3 575.73 576.97 576.82 577.06
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Crown IBPB off safe RET safe RET no microcode 13 26 39 52 65 SE +/- 0.10, N = 3 SE +/- 0.08, N = 3 SE +/- 0.15, N = 3 SE +/- 0.14, N = 3 57.30 57.42 57.31 57.30 MIN: 56.3 / MAX: 58.61 MIN: 56.59 / MAX: 58.54 MIN: 56.2 / MAX: 58.59 MIN: 56.26 / MAX: 58.69
Neural Magic DeepSparse Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 200 400 600 800 1000 SE +/- 0.53, N = 3 SE +/- 0.50, N = 3 SE +/- 0.34, N = 3 SE +/- 0.36, N = 3 840.89 839.72 840.42 840.77
Neural Magic DeepSparse Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 12 24 36 48 60 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 53.58 53.60 53.57 53.60
Neural Magic DeepSparse Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream IBPB off safe RET safe RET no microcode 130 260 390 520 650 SE +/- 0.06, N = 3 SE +/- 0.18, N = 3 SE +/- 0.11, N = 3 SE +/- 0.26, N = 3 596.79 596.59 596.78 596.56
Apache Spark Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark Using Dataframe OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark Using Dataframe IBPB 0.585 1.17 1.755 2.34 2.925 SE +/- 0.08, N = 3 2.60
Apache Spark Row Count: 1000000 - Partitions: 100 - Repartition Test Time OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Repartition Test Time IBPB off safe RET safe RET no microcode 0.5355 1.071 1.6065 2.142 2.6775 SE +/- 0.12, N = 3 SE +/- 0.04, N = 15 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 2.31 2.09 2.26 2.38
Phoronix Test Suite v10.8.4