Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2308109-NE-EPYC7763169
AMD EPYC 7763 1P spec_rstack_overflow
HTML result view exported from: https://openbenchmarking.org/result/2308109-NE-EPYC7763169&grt&sro
AMD EPYC 7763 1P spec_rstack_overflow
Runs: off, safe RET no microcode, safe RET

Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)
Motherboard: AMD DAYTONA_X (RYM1009B BIOS)
Chipset: AMD Starship/Matisse
Memory: 256GB
Disk: 800GB INTEL SSDPF21Q800GB
Graphics: ASPEED
Monitor: VE228
Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04
Kernel: 6.5.0-rc5-phx-tues (x86_64)
Desktop: GNOME Shell 42.5
Display Server: X Server 1.21.1.3
Vulkan: 1.3.224
Compiler: GCC 11.3.0 + LLVM 14.0.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details - Transparent Huge Pages: madvise

Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096

Processor Details
- off: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
- safe RET no microcode: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
- safe RET: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1

Java Details - OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)

Python Details - Python 3.10.6

Security Details
- off: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- safe RET no microcode: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- safe RET: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
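The Security Details entries mirror what the Linux kernel reports under /sys/devices/system/cpu/vulnerabilities/. A minimal Python sketch for checking the state of another system follows. It assumes a kernel recent enough to expose a spec_rstack_overflow file there; the helper name read_mitigations and its base_dir parameter are illustrative and not part of the Phoronix Test Suite.

```python
from pathlib import Path

# Directory where Linux exposes per-vulnerability mitigation status.
SYSFS_VULN_DIR = "/sys/devices/system/cpu/vulnerabilities"


def read_mitigations(base_dir=SYSFS_VULN_DIR):
    """Return {vulnerability_name: status_string} for every file in base_dir.

    Hypothetical helper: returns an empty dict when the directory is missing
    (for example on non-Linux systems or very old kernels).
    """
    base = Path(base_dir)
    if not base.is_dir():
        return {}
    status = {}
    for entry in sorted(base.iterdir()):
        if entry.is_file():
            status[entry.name] = entry.read_text().strip()
    return status


if __name__ == "__main__":
    # Print one "name: status" line per reported vulnerability.
    for name, value in read_mitigations().items():
        print(f"{name}: {value}")
```

In this result file the three runs differ in the spec_rstack_overflow entry, so that is the line to compare against.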
AMD EPYC 7763 1P spec_rstack_overflow
Test | off | safe RET no microcode | safe RET
compress-7zip: Compression Rating | 384374 | 334812 | 335595
compress-7zip: Decompression Rating | 385585 | 383039 | 383515
mt-dgemm: Sustained Floating-Point Rate | 24.200551 | 24.695818 | 23.702889
amg | 1011799000 | 999645400 | 999102100
cassandra: Writes | 238741 | 233069 | 236241
apache-iotdb: 200 - 1 - 200 (point/sec) | 960525.66 | 947741.34 | 918691.45
apache-iotdb: 200 - 1 - 200 (Average Latency) | 13.83 | 14.05 | 14.73
apache-iotdb: 200 - 1 - 500 (point/sec) | 1271946.57 | 1342031.11 | 1345598.59
apache-iotdb: 200 - 1 - 500 (Average Latency) | 32.36 | 30.29 | 30.14
apache-iotdb: 500 - 1 - 200 (point/sec) | 1176385.35 | 1202637.36 | 1211172.13
apache-iotdb: 500 - 1 - 200 (Average Latency) | 14.05 | 13.61 | 13.36
apache-iotdb: 500 - 1 - 500 (point/sec) | 1415756.33 | 1408658.83 | 1583717.62
apache-iotdb: 500 - 1 - 500 (Average Latency) | 31.49 | 31.93 | 27.73
apache-iotdb: 200 - 100 - 200 (point/sec) | 43665846.28 | 46538766.01 | 44027904.89
apache-iotdb: 200 - 100 - 200 (Average Latency) | 37.70 | 35.10 | 37.53
apache-iotdb: 200 - 100 - 500 (point/sec) | 39463981.42 | 37720117.40 | 38833415.97
apache-iotdb: 200 - 100 - 500 (Average Latency) | 117.97 | 123.61 | 120.06
apache-iotdb: 500 - 100 - 200 (point/sec) | 49501499.13 | 47445770.18 | 50578426.54
apache-iotdb: 500 - 100 - 200 (Average Latency) | 36.58 | 38.54 | 35.82
apache-iotdb: 500 - 100 - 500 (point/sec) | 58682618.18 | 58073516.79 | 57099408.15
apache-iotdb: 500 - 100 - 500 (Average Latency) | 78.92 | 79.19 | 82.24
spark: 1000000 - 100 - SHA-512 Benchmark Time | 3.39 | 3.42 | 3.47
spark: 1000000 - 100 - Calculate Pi Benchmark | 31.84 | 32.02 | 31.43
spark: 1000000 - 100 - Group By Test Time | 4.91 | 5.17 | 5.15
spark: 1000000 - 100 - Repartition Test Time | 2.09 | 2.38 | 2.26
spark: 1000000 - 100 - Inner Join Test Time | 1.88 | 2.22 | 2.14
spark: 1000000 - 100 - Broadcast Inner Join Test Time | 1.30 | 1.39 | 1.41
blender: BMW27 - CPU-Only | 27.34 | 27.58 | 27.46
blender: Pabellon Barcelona - CPU-Only | 84.50 | 84.69 | 84.49
clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache | 349.43 | 323.42 | 318.12
clickhouse: 100M Rows Hits Dataset, Second Run | 361.81 | 337.01 | 337.12
clickhouse: 100M Rows Hits Dataset, Third Run | 362.64 | 329.19 | 337.45
cockroach: KV, 50% Reads - 128 | 103635.0 | 100851.4 | 99601.6
cockroach: KV, 95% Reads - 128 | 135187.2 | 131487.0 | 132046.0
dacapobench: Jython | 4193 | 4191 | 4241
dacapobench: Tradebeans | 3993 | 4096 | 4143
embree: Pathtracer ISPC - Crown | 57.4229 | 57.2956 | 57.3138
embree: Pathtracer ISPC - Asian Dragon | 64.5964 | 64.3916 | 64.6742
gromacs: MPI CPU - water_GMX50_bare | 5.680 | 5.730 | 5.706
mysqlslap: 4096 | 590 | 412 | 418
mysqlslap: 8192 | 355 | 301 | 301
namd: ATPase Simulation - 327,506 Atoms | 0.38130 | 0.38115 | 0.38098
deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream (items/sec) | 37.6037 | 37.6987 | 37.6319
deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream (ms/batch) | 839.7201 | 840.7721 | 840.4236
deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream (items/sec) | 487.2450 | 485.0069 | 487.3677
deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream (ms/batch) | 65.5911 | 65.8910 | 65.5662
deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream (items/sec) | 468.8306 | 468.3646 | 468.2170
deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream (ms/batch) | 68.1483 | 68.2595 | 68.2325
deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream (items/sec) | 3840.6307 | 3816.7676 | 3834.2439
deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream (ms/batch) | 8.3078 | 8.3612 | 8.3219
deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream (items/sec) | 46.7020 | 46.7239 | 46.5677
deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream (ms/batch) | 678.9330 | 679.5766 | 682.2071
deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream (items/sec) | 53.5968 | 53.5972 | 53.5729
deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream (ms/batch) | 596.5915 | 596.5579 | 596.7822
deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream (items/sec) | 576.9722 | 577.0589 | 576.8166
deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream (ms/batch) | 55.3922 | 55.3747 | 55.4028
nginx: 500 | 169583.15 | 144020.03 | 142619.84
nginx: 1000 | 166499.89 | 140555.98 | 143271.26
numpy | 457.23 | 418.95 | 422.58
openfoam: drivaerFastback, Medium Mesh Size - Mesh Time | 140.61562 | 145.06069 | 144.02174
openfoam: drivaerFastback, Medium Mesh Size - Execution Time | 633.51902 | 644.36223 | 643.71316
openradioss: Bumper Beam | 87.72 | 93.68 | 93.90
openradioss: Cell Phone Drop Test | 33.10 | 36.37 | 36.40
openradioss: Bird Strike on Windshield | 144.83 | 152.91 | 152.27
openradioss: Rubber O-Ring Seal Installation | 77.46 | 85.04 | 84.48
openradioss: INIVOL and Fluid Structure Interaction Drop Container | 162.13 | 163.02 | 163.97
openvino: Person Detection FP16 - CPU | 7.68 | 7.58 | 7.60
openvino: Person Detection FP16 - CPU | 4092.91 | 4124.74 | 4114.58
openvino: Face Detection FP16-INT8 - CPU | 27.83 | 27.79 | 27.82
openvino: Face Detection FP16-INT8 - CPU | 1141.43 | 1142.29 | 1142.53
openvino: Weld Porosity Detection FP16 - CPU | 1126.03 | 1126.64 | 1126.14
openvino: Weld Porosity Detection FP16 - CPU | 28.40 | 28.38 | 28.39
openvkl: vklBenchmark ISPC | 453 | 452 | 453
ospray: particle_volume/ao/real_time | 18.0226 | 18.0288 | 17.9817
ospray: particle_volume/scivis/real_time | 17.7511 | 17.7305 | 17.7305
ospray: particle_volume/pathtracer/real_time | 157.829 | 155.165 | 156.419
ospray: gravity_spheres_volume/dim_512/ao/real_time | 8.96051 | 8.96941 | 8.94049
ospray: gravity_spheres_volume/dim_512/scivis/real_time | 8.33174 | 8.32749 | 8.33813
ospray: gravity_spheres_volume/dim_512/pathtracer/real_time | 13.1355 | 13.2621 | 13.2538
pgbench: 100 - 800 - Read Only | 3128719 | 2707280 | 2768445
pgbench: 100 - 800 - Read Only - Average Latency | 0.256 | 0.296 | 0.289
pgbench: 100 - 800 - Read Write | 61604 | 55175 | 54837
pgbench: 100 - 800 - Read Write - Average Latency | 12.988 | 14.499 | 14.589
memtier-benchmark: Redis - 50 - 1:5 | 2204628.92 | 2218601.79 | 2145436.26
memtier-benchmark: Redis - 100 - 1:5 | 2197287.30 | 2167181.09 | 2145052.14
memtier-benchmark: Redis - 50 - 1:10 | 2177211.80 | 2173694.77 | 2172804.71
memtier-benchmark: Redis - 100 - 1:10 | 2195705.51 | 2154339.26 | 2157815.69
remhos: Sample Remap Example | 17.375 | 17.788 | 17.958
rocksdb: Update Rand | 462287 | 428112 | 426947
rocksdb: Read Rand Write Rand | 2951684 | 2872765 | 2839085
specfem3d: Mount St. Helens | 11.801238732 | 11.982163460 | 12.010380781
specfem3d: Layered Halfspace | 31.845630424 | 31.829261870 | 31.659940885
specfem3d: Tomographic Model | 14.134265606 | 14.404419238 | 14.188058962
specfem3d: Homogeneous Halfspace | 17.417120933 | 17.643680397 | 17.690298650
specfem3d: Water-layered Halfspace | 29.772535386 | 30.427531709 | 29.590868260
sqlite: 8 | 3.755 | 4.850 | 5.006
sqlite: 16 | 6.273 | 8.834 | 8.800
tensorflow: CPU - 64 - ResNet-50 | 17.78 | 15.56 | 15.65
build-godot: Time To Compile | 121.948 | 125.663 | 125.060
build-linux-kernel: defconfig | 31.192 | 37.623 | 37.243
build-linux-kernel: allmodconfig | 289.063 | 344.242 | 338.157
build-llvm: Ninja | 176.374 | 182.169 | 181.528
mrbayes: Primate Phylogeny Analysis | 136.686 | 137.518 | 138.851
build-nodejs: Time To Compile | 164.268 | 172.749 | 173.064
7-Zip Compression 22.01, Test: Compression Rating (MIPS, More Is Better)
off: 384374 (SE +/- 1018.75, N = 3)
safe RET: 335595 (SE +/- 435.27, N = 3)
safe RET no microcode: 334812 (SE +/- 25.38, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 22.01, Test: Decompression Rating (MIPS, More Is Better)
off: 385585 (SE +/- 845.58, N = 3)
safe RET: 383515 (SE +/- 605.48, N = 3)
safe RET no microcode: 383039 (SE +/- 380.69, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
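To read results such as these as a relative cost, each mitigated run can be expressed as a percentage change against the "off" run. The sketch below is one way to do that in Python; the function name percent_change and its higher_is_better flag are illustrative, and the input numbers are the 7-Zip compression ratings from this file.

```python
def percent_change(baseline, value, higher_is_better=True):
    """Signed performance change of `value` versus `baseline`, in percent.

    Positive means faster than the baseline, negative means slower. For
    "Fewer Is Better" metrics the ratio is inverted so the sign keeps
    that meaning.
    """
    if baseline <= 0 or value <= 0:
        raise ValueError("results must be positive")
    ratio = value / baseline if higher_is_better else baseline / value
    return (ratio - 1.0) * 100.0


# 7-Zip compression ratings copied from this file (MIPS, More Is Better).
compression = {
    "off": 384374,
    "safe RET": 335595,
    "safe RET no microcode": 334812,
}

for config, mips in compression.items():
    change = percent_change(compression["off"], mips)
    print(f"{config}: {change:+.2f}%")
```

For timing results such as Blender or OpenRadioss, pass higher_is_better=False so that a longer run time still comes out as a negative change.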
ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, More Is Better)
off: 24.20 (SE +/- 0.21, N = 8)
safe RET: 23.70 (SE +/- 0.26, N = 5)
safe RET no microcode: 24.70 (SE +/- 0.34, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
off: 1011799000 (SE +/- 839009.73, N = 3)
safe RET: 999102100 (SE +/- 367255.40, N = 3)
safe RET no microcode: 999645400 (SE +/- 575791.94, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Apache Cassandra 4.1.3, Test: Writes (Op/s, More Is Better)
off: 238741 (SE +/- 413.74, N = 3)
safe RET: 236241 (SE +/- 479.91, N = 3)
safe RET no microcode: 233069 (SE +/- 950.59, N = 3)
Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 (point/sec, More Is Better)
off: 960525.66 (SE +/- 8467.91, N = 3)
safe RET: 918691.45 (SE +/- 7998.38, N = 8)
safe RET no microcode: 947741.34 (SE +/- 6730.71, N = 12)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 (Average Latency, Fewer Is Better)
off: 13.83 (SE +/- 0.19, N = 3, MAX: 596.78)
safe RET: 14.73 (SE +/- 0.21, N = 8, MAX: 645.11)
safe RET no microcode: 14.05 (SE +/- 0.16, N = 12, MAX: 609.96)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 (point/sec, More Is Better)
off: 1271946.57 (SE +/- 7578.67, N = 3)
safe RET: 1345598.59 (SE +/- 9180.92, N = 3)
safe RET no microcode: 1342031.11 (SE +/- 1525.49, N = 3)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 (Average Latency, Fewer Is Better)
off: 32.36 (SE +/- 0.24, N = 3, MAX: 646.51)
safe RET: 30.14 (SE +/- 0.22, N = 3, MAX: 641.04)
safe RET no microcode: 30.29 (SE +/- 0.02, N = 3, MAX: 715.01)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 (point/sec, More Is Better)
off: 1176385.35 (SE +/- 1566.77, N = 3)
safe RET: 1211172.13 (SE +/- 4253.29, N = 3)
safe RET no microcode: 1202637.36 (SE +/- 6553.14, N = 3)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 (Average Latency, Fewer Is Better)
off: 14.05 (SE +/- 0.07, N = 3, MAX: 858.17)
safe RET: 13.36 (SE +/- 0.19, N = 3, MAX: 881.3)
safe RET no microcode: 13.61 (SE +/- 0.14, N = 3, MAX: 854.4)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 (point/sec, More Is Better)
off: 1415756.33 (SE +/- 4294.81, N = 3)
safe RET: 1583717.62 (SE +/- 5073.96, N = 3)
safe RET no microcode: 1408658.83 (SE +/- 13029.07, N = 3)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 (Average Latency, Fewer Is Better)
off: 31.49 (SE +/- 0.29, N = 3, MAX: 939.96)
safe RET: 27.73 (SE +/- 0.22, N = 3, MAX: 938.92)
safe RET no microcode: 31.93 (SE +/- 0.49, N = 3, MAX: 930.97)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 (point/sec, More Is Better)
off: 43665846.28 (SE +/- 574678.74, N = 15)
safe RET: 44027904.89 (SE +/- 543529.82, N = 15)
safe RET no microcode: 46538766.01 (SE +/- 614274.26, N = 3)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 (Average Latency, Fewer Is Better)
off: 37.70 (SE +/- 0.55, N = 15, MAX: 802.64)
safe RET: 37.53 (SE +/- 0.52, N = 15, MAX: 755.16)
safe RET no microcode: 35.10 (SE +/- 0.62, N = 3, MAX: 728.37)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 (point/sec, More Is Better)
off: 39463981.42 (SE +/- 302926.36, N = 10)
safe RET: 38833415.97 (SE +/- 288707.73, N = 15)
safe RET no microcode: 37720117.40 (SE +/- 394126.89, N = 5)

Apache IoTDB 1.1.2, Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 (Average Latency, Fewer Is Better)
off: 117.97 (SE +/- 0.95, N = 10, MAX: 4652.25)
safe RET: 120.06 (SE +/- 0.86, N = 15, MAX: 4495.21)
safe RET no microcode: 123.61 (SE +/- 1.63, N = 5, MAX: 4533.33)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 (point/sec, More Is Better)
off: 49501499.13 (SE +/- 681823.31, N = 3)
safe RET: 50578426.54 (SE +/- 634314.77, N = 3)
safe RET no microcode: 47445770.18 (SE +/- 147114.88, N = 3)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 (Average Latency, Fewer Is Better)
off: 36.58 (SE +/- 0.61, N = 3, MAX: 2252.73)
safe RET: 35.82 (SE +/- 0.49, N = 3, MAX: 3267.55)
safe RET no microcode: 38.54 (SE +/- 0.11, N = 3, MAX: 3276.77)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 (point/sec, More Is Better)
off: 58682618.18 (SE +/- 817020.04, N = 3)
safe RET: 57099408.15 (SE +/- 721225.08, N = 3)
safe RET no microcode: 58073516.79 (SE +/- 648692.91, N = 4)

Apache IoTDB 1.1.2, Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 (Average Latency, Fewer Is Better)
off: 78.92 (SE +/- 2.14, N = 3, MAX: 1729.94)
safe RET: 82.24 (SE +/- 1.29, N = 3, MAX: 3625.32)
safe RET no microcode: 79.19 (SE +/- 0.81, N = 4, MAX: 5165.86)
Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time (Seconds, Fewer Is Better)
off: 3.39 (SE +/- 0.03, N = 15)
safe RET: 3.47 (SE +/- 0.04, N = 3)
safe RET no microcode: 3.42 (SE +/- 0.04, N = 3)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark (Seconds, Fewer Is Better)
off: 31.84 (SE +/- 0.12, N = 15)
safe RET: 31.43 (SE +/- 0.33, N = 3)
safe RET no microcode: 32.02 (SE +/- 0.01, N = 3)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Group By Test Time (Seconds, Fewer Is Better)
off: 4.91 (SE +/- 0.04, N = 15)
safe RET: 5.15 (SE +/- 0.07, N = 3)
safe RET no microcode: 5.17 (SE +/- 0.08, N = 3)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Repartition Test Time (Seconds, Fewer Is Better)
off: 2.09 (SE +/- 0.04, N = 15)
safe RET: 2.26 (SE +/- 0.04, N = 3)
safe RET no microcode: 2.38 (SE +/- 0.04, N = 3)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Inner Join Test Time (Seconds, Fewer Is Better)
off: 1.88 (SE +/- 0.02, N = 15)
safe RET: 2.14 (SE +/- 0.06, N = 3)
safe RET no microcode: 2.22 (SE +/- 0.05, N = 3)

Apache Spark 3.3, Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time (Seconds, Fewer Is Better)
off: 1.30 (SE +/- 0.01, N = 15)
safe RET: 1.41 (SE +/- 0.01, N = 3)
safe RET no microcode: 1.39 (SE +/- 0.02, N = 3)
Blender 3.6, Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
off: 27.34 (SE +/- 0.04, N = 3)
safe RET: 27.46 (SE +/- 0.05, N = 3)
safe RET no microcode: 27.58 (SE +/- 0.06, N = 3)

Blender 3.6, Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
off: 84.50 (SE +/- 0.13, N = 3)
safe RET: 84.49 (SE +/- 0.04, N = 3)
safe RET no microcode: 84.69 (SE +/- 0.02, N = 3)
ClickHouse 22.12.3.5, 100M Rows Hits Dataset, First Run / Cold Cache (Queries Per Minute, Geo Mean, More Is Better)
off: 349.43 (SE +/- 0.68, N = 3, MIN: 31.06 / MAX: 4285.71)
safe RET: 318.12 (SE +/- 2.94, N = 3, MIN: 30.57 / MAX: 3333.33)
safe RET no microcode: 323.42 (SE +/- 3.38, N = 5, MIN: 30.82 / MAX: 5000)

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, Second Run (Queries Per Minute, Geo Mean, More Is Better)
off: 361.81 (SE +/- 1.42, N = 3, MIN: 31.46 / MAX: 4000)
safe RET: 337.12 (SE +/- 4.86, N = 3, MIN: 30.79 / MAX: 4000)
safe RET no microcode: 337.01 (SE +/- 2.16, N = 5, MIN: 30.49 / MAX: 3529.41)

ClickHouse 22.12.3.5, 100M Rows Hits Dataset, Third Run (Queries Per Minute, Geo Mean, More Is Better)
off: 362.64 (SE +/- 2.21, N = 3, MIN: 31.5 / MAX: 4285.71)
safe RET: 337.45 (SE +/- 1.85, N = 3, MIN: 31.46 / MAX: 4000)
safe RET no microcode: 329.19 (SE +/- 3.27, N = 5, MIN: 31.32 / MAX: 2857.14)
CockroachDB 22.2, Workload: KV, 50% Reads - Concurrency: 128 (ops/s, More Is Better)
off: 103635.0 (SE +/- 275.86, N = 3)
safe RET: 99601.6 (SE +/- 948.29, N = 15)
safe RET no microcode: 100851.4 (SE +/- 719.41, N = 15)

CockroachDB 22.2, Workload: KV, 95% Reads - Concurrency: 128 (ops/s, More Is Better)
off: 135187.2 (SE +/- 931.05, N = 3)
safe RET: 132046.0 (SE +/- 1387.70, N = 15)
safe RET no microcode: 131487.0 (SE +/- 1043.12, N = 13)
DaCapo Benchmark 9.12-MR1, Java Test: Jython (msec, Fewer Is Better)
off: 4193 (SE +/- 18.07, N = 4)
safe RET: 4241 (SE +/- 49.88, N = 4)
safe RET no microcode: 4191 (SE +/- 47.28, N = 4)

DaCapo Benchmark 9.12-MR1, Java Test: Tradebeans (msec, Fewer Is Better)
off: 3993 (SE +/- 42.66, N = 4)
safe RET: 4143 (SE +/- 28.11, N = 4)
safe RET no microcode: 4096 (SE +/- 44.47, N = 4)
Embree 4.1, Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
off: 57.42 (SE +/- 0.08, N = 3, MIN: 56.59 / MAX: 58.54)
safe RET: 57.31 (SE +/- 0.15, N = 3, MIN: 56.2 / MAX: 58.59)
safe RET no microcode: 57.30 (SE +/- 0.14, N = 3, MIN: 56.26 / MAX: 58.69)

Embree 4.1, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
off: 64.60 (SE +/- 0.02, N = 3, MIN: 64.05 / MAX: 66.13)
safe RET: 64.67 (SE +/- 0.03, N = 3, MIN: 64.11 / MAX: 66.01)
safe RET no microcode: 64.39 (SE +/- 0.09, N = 3, MIN: 63.77 / MAX: 66.16)
GROMACS 2023, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
off: 5.680 (SE +/- 0.012, N = 3)
safe RET: 5.706 (SE +/- 0.010, N = 3)
safe RET no microcode: 5.730 (SE +/- 0.006, N = 3)
1. (CXX) g++ options: -O3

MariaDB 11.0.1, Clients: 4096 (Queries Per Second, More Is Better)
off: 590 (SE +/- 5.48, N = 3)
safe RET: 418 (SE +/- 3.51, N = 3)
safe RET no microcode: 412 (SE +/- 2.96, N = 3)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB 11.0.1, Clients: 8192 (Queries Per Second, More Is Better)
off: 355 (SE +/- 3.35, N = 3)
safe RET: 301 (SE +/- 1.18, N = 3)
safe RET no microcode: 301 (SE +/- 0.73, N = 3)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
off: 0.38130 (SE +/- 0.00017, N = 3)
safe RET: 0.38098 (SE +/- 0.00028, N = 3)
safe RET no microcode: 0.38115 (SE +/- 0.00029, N = 3)
Neural Magic DeepSparse 1.5, Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
off: 37.60 (SE +/- 0.02, N = 3)
safe RET: 37.63 (SE +/- 0.08, N = 3)
safe RET no microcode: 37.70 (SE +/- 0.07, N = 3)

Neural Magic DeepSparse 1.5, Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
off: 839.72 (SE +/- 0.50, N = 3)
safe RET: 840.42 (SE +/- 0.34, N = 3)
safe RET no microcode: 840.77 (SE +/- 0.36, N = 3)

Neural Magic DeepSparse 1.5, Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
off: 487.25 (SE +/- 1.15, N = 3)
safe RET: 487.37 (SE +/- 1.01, N = 3)
safe RET no microcode: 485.01 (SE +/- 0.96, N = 3)

Neural Magic DeepSparse 1.5, Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
off: 65.59 (SE +/- 0.15, N = 3)
safe RET: 65.57 (SE +/- 0.16, N = 3)
safe RET no microcode: 65.89 (SE +/- 0.14, N = 3)

Neural Magic DeepSparse 1.5, Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
off: 468.83 (SE +/- 0.24, N = 3)
safe RET: 468.22 (SE +/- 0.31, N = 3)
safe RET no microcode: 468.36 (SE +/- 0.42, N = 3)

Neural Magic DeepSparse 1.5, Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
off: 68.15 (SE +/- 0.02, N = 3)
safe RET: 68.23 (SE +/- 0.04, N = 3)
safe RET no microcode: 68.26 (SE +/- 0.04, N = 3)

Neural Magic DeepSparse 1.5, Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
off: 3840.63 (SE +/- 4.76, N = 3)
safe RET: 3834.24 (SE +/- 12.28, N = 3)
safe RET no microcode: 3816.77 (SE +/- 7.72, N = 3)

Neural Magic DeepSparse 1.5, Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
off: 8.3078 (SE +/- 0.0095, N = 3)
safe RET: 8.3219 (SE +/- 0.0271, N = 3)
safe RET no microcode: 8.3612 (SE +/- 0.0167, N = 3)

Neural Magic DeepSparse 1.5, Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
off: 46.70 (SE +/- 0.11, N = 3)
safe RET: 46.57 (SE +/- 0.03, N = 3)
safe RET no microcode: 46.72 (SE +/- 0.08, N = 3)

Neural Magic DeepSparse 1.5, Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
off: 678.93 (SE +/- 1.21, N = 3)
safe RET: 682.21 (SE +/- 0.96, N = 3)
safe RET no microcode: 679.58 (SE +/- 1.47, N = 3)

Neural Magic DeepSparse 1.5, Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
off: 53.60 (SE +/- 0.01, N = 3)
safe RET: 53.57 (SE +/- 0.02, N = 3)
safe RET no microcode: 53.60 (SE +/- 0.02, N = 3)

Neural Magic DeepSparse 1.5, Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
off: 596.59 (SE +/- 0.18, N = 3)
safe RET: 596.78 (SE +/- 0.11, N = 3)
safe RET no microcode: 596.56 (SE +/- 0.26, N = 3)

Neural Magic DeepSparse 1.5, Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
off: 576.97 (SE +/- 0.44, N = 3)
safe RET: 576.82 (SE +/- 0.39, N = 3)
safe RET no microcode: 577.06 (SE +/- 0.39, N = 3)

Neural Magic DeepSparse 1.5, Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
off: 55.39 (SE +/- 0.04, N = 3)
safe RET: 55.40 (SE +/- 0.03, N = 3)
safe RET no microcode: 55.37 (SE +/- 0.04, N = 3)
nginx 1.23.2, Connections: 500 (Requests Per Second, More Is Better)
off: 169583.15 (SE +/- 284.72, N = 3)
safe RET: 142619.84 (SE +/- 251.96, N = 3)
safe RET no microcode: 144020.03 (SE +/- 284.55, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx 1.23.2, Connections: 1000 (Requests Per Second, More Is Better)
off: 166499.89 (SE +/- 362.13, N = 3)
safe RET: 143271.26 (SE +/- 314.03, N = 3)
safe RET no microcode: 140555.98 (SE +/- 352.89, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
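The reported standard errors (SE) give a rough way to judge whether a gap between two runs exceeds run-to-run noise. The Python sketch below divides the difference of two means by their combined standard error. It assumes the runs are independent, and with N as small as 3 it is only a coarse indication, not a formal test. The function name is illustrative; the inputs are the "off" and "safe RET" figures from this file.

```python
import math


def difference_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """How many combined standard errors separate two reported means.

    Assumes the two runs are independent, so the standard error of the
    difference is sqrt(se_a**2 + se_b**2).
    """
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined == 0:
        raise ValueError("both standard errors are zero")
    return abs(mean_a - mean_b) / combined


# (mean, SE) pairs copied from this file: "off" versus "safe RET".
nginx_500 = difference_in_standard_errors(169583.15, 284.72, 142619.84, 251.96)
embree_crown = difference_in_standard_errors(57.42, 0.08, 57.31, 0.15)

print(f"nginx, 500 connections: {nginx_500:.1f} combined SEs apart")
print(f"Embree, Crown: {embree_crown:.1f} combined SEs apart")
```

A result many combined standard errors apart, as with nginx, points to a real change, while a value below about one, as with Embree, is indistinguishable from noise at this sample size.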
Numpy Benchmark OpenBenchmarking.org Score, More Is Better Numpy Benchmark off safe RET safe RET no microcode 100 200 300 400 500 SE +/- 1.76, N = 3 SE +/- 0.84, N = 3 SE +/- 2.01, N = 3 457.23 422.58 418.95
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Mesh Time - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 140.62, 144.02, 145.06
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Execution Time - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 633.52, 643.71, 644.36
(CXX) g++ options for both OpenFOAM results: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenRadioss 2022.10.13 - Model: Bumper Beam - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 87.72, 93.90, 93.68 - SE +/- 0.33, N = 3; SE +/- 0.03, N = 3; SE +/- 0.08, N = 3
OpenRadioss 2022.10.13 - Model: Cell Phone Drop Test - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 33.10, 36.40, 36.37 - SE +/- 0.11, N = 3; SE +/- 0.03, N = 3; SE +/- 0.26, N = 3
OpenRadioss 2022.10.13 - Model: Bird Strike on Windshield - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 144.83, 152.27, 152.91 - SE +/- 0.07, N = 3; SE +/- 0.73, N = 3; SE +/- 0.67, N = 3
OpenRadioss 2022.10.13 - Model: Rubber O-Ring Seal Installation - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 77.46, 84.48, 85.04 - SE +/- 0.23, N = 3; SE +/- 0.24, N = 3; SE +/- 0.17, N = 3
OpenRadioss 2022.10.13 - Model: INIVOL and Fluid Structure Interaction Drop Container - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 162.13, 163.97, 163.02 - SE +/- 0.16, N = 3; SE +/- 0.50, N = 3; SE +/- 0.39, N = 3
OpenVINO 2022.3 - Model: Person Detection FP16 - Device: CPU - FPS, More Is Better - off, safe RET, safe RET no microcode: 7.68, 7.60, 7.58 - SE +/- 0.02, N = 3; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
OpenVINO 2022.3 - Model: Person Detection FP16 - Device: CPU - ms, Fewer Is Better - off, safe RET, safe RET no microcode: 4092.91, 4114.58, 4124.74 - SE +/- 14.65, N = 3; SE +/- 10.77, N = 3; SE +/- 6.87, N = 3 - MIN: 3409.52 / MAX: 4641.43; MIN: 2087 / MAX: 5053.62; MIN: 2129.26 / MAX: 5016.36
OpenVINO 2022.3 - Model: Face Detection FP16-INT8 - Device: CPU - FPS, More Is Better - off, safe RET, safe RET no microcode: 27.83, 27.82, 27.79 - SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.02, N = 3
OpenVINO 2022.3 - Model: Face Detection FP16-INT8 - Device: CPU - ms, Fewer Is Better - off, safe RET, safe RET no microcode: 1141.43, 1142.53, 1142.29 - SE +/- 0.32, N = 3; SE +/- 1.09, N = 3; SE +/- 0.27, N = 3 - MIN: 998.76 / MAX: 1165.45; MIN: 999.01 / MAX: 1177.02; MIN: 985.75 / MAX: 1168.76
OpenVINO 2022.3 - Model: Weld Porosity Detection FP16 - Device: CPU - FPS, More Is Better - off, safe RET, safe RET no microcode: 1126.03, 1126.14, 1126.64 - SE +/- 0.17, N = 3; SE +/- 0.13, N = 3; SE +/- 0.72, N = 3
OpenVINO 2022.3 - Model: Weld Porosity Detection FP16 - Device: CPU - ms, Fewer Is Better - off, safe RET, safe RET no microcode: 28.40, 28.39, 28.38 - SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.02, N = 3 - MIN: 14.74 / MAX: 48.66; MIN: 14.64 / MAX: 50.33; MIN: 14.89 / MAX: 51.63
(CXX) g++ options for all six OpenVINO results: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenVKL 1.3.1 - Benchmark: vklBenchmark ISPC - Items / Sec, More Is Better - off, safe RET, safe RET no microcode: 453, 453, 452 - SE +/- 0.58, N = 3; SE +/- 0.00, N = 3; SE +/- 0.58, N = 3 - MIN: 84 / MAX: 2528; MIN: 84 / MAX: 2520; MIN: 85 / MAX: 2535
OSPRay 2.12 - Benchmark: particle_volume/ao/real_time - Items Per Second, More Is Better - off, safe RET, safe RET no microcode: 18.02, 17.98, 18.03 - SE +/- 0.02, N = 3; SE +/- 0.04, N = 3; SE +/- 0.01, N = 3
OSPRay 2.12 - Benchmark: particle_volume/scivis/real_time - Items Per Second, More Is Better - off, safe RET, safe RET no microcode: 17.75, 17.73, 17.73 - SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.02, N = 3
OSPRay 2.12 - Benchmark: particle_volume/pathtracer/real_time - Items Per Second, More Is Better - off, safe RET, safe RET no microcode: 157.83, 156.42, 155.17 - SE +/- 0.21, N = 3; SE +/- 0.07, N = 3; SE +/- 1.83, N = 3
OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/ao/real_time - Items Per Second, More Is Better - off, safe RET, safe RET no microcode: 8.96051, 8.94049, 8.96941 - SE +/- 0.02460, N = 3; SE +/- 0.01456, N = 3; SE +/- 0.02941, N = 3
OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/scivis/real_time - Items Per Second, More Is Better - off, safe RET, safe RET no microcode: 8.33174, 8.33813, 8.32749 - SE +/- 0.00864, N = 3; SE +/- 0.01059, N = 3; SE +/- 0.01659, N = 3
OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time - Items Per Second, More Is Better - off, safe RET, safe RET no microcode: 13.14, 13.25, 13.26 - SE +/- 0.13, N = 3; SE +/- 0.01, N = 3; SE +/- 0.00, N = 3
PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only - TPS, More Is Better - off, safe RET, safe RET no microcode: 3128719, 2768445, 2707280 - SE +/- 1705.16, N = 3; SE +/- 34286.68, N = 3; SE +/- 29158.10, N = 3
PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency - ms, Fewer Is Better - off, safe RET, safe RET no microcode: 0.256, 0.289, 0.296 - SE +/- 0.000, N = 3; SE +/- 0.004, N = 3; SE +/- 0.003, N = 3
PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write - TPS, More Is Better - off, safe RET, safe RET no microcode: 61604, 54837, 55175 - SE +/- 418.78, N = 3; SE +/- 207.71, N = 3; SE +/- 66.28, N = 3
PostgreSQL 15 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency - ms, Fewer Is Better - off, safe RET, safe RET no microcode: 12.99, 14.59, 14.50 - SE +/- 0.09, N = 3; SE +/- 0.06, N = 3; SE +/- 0.02, N = 3
(CC) gcc options for all four PostgreSQL results: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 - Ops/sec, More Is Better - off, safe RET, safe RET no microcode: 2204628.92, 2145436.26, 2218601.79 - SE +/- 11955.97, N = 3; SE +/- 1778.76, N = 3; SE +/- 31351.12, N = 3
Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 - Ops/sec, More Is Better - off, safe RET, safe RET no microcode: 2197287.30, 2145052.14, 2167181.09 - SE +/- 14704.83, N = 3; SE +/- 4916.89, N = 3; SE +/- 17712.54, N = 3
Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 - Ops/sec, More Is Better - off, safe RET, safe RET no microcode: 2177211.80, 2172804.71, 2173694.77 - SE +/- 14630.02, N = 3; SE +/- 2448.62, N = 3; SE +/- 17754.58, N = 3
Redis 7.0.12 + memtier_benchmark 2.0 - Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 - Ops/sec, More Is Better - off, safe RET, safe RET no microcode: 2195705.51, 2157815.69, 2154339.26 - SE +/- 30210.22, N = 3; SE +/- 792.70, N = 3; SE +/- 16754.20, N = 10
(CXX) g++ options for all four Redis + memtier_benchmark results: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Remhos 1.0 - Test: Sample Remap Example - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 17.38, 17.96, 17.79 - SE +/- 0.23, N = 3; SE +/- 0.19, N = 3; SE +/- 0.17, N = 3 - (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
RocksDB 8.0 - Test: Update Random - Op/s, More Is Better - off, safe RET, safe RET no microcode: 462287, 426947, 428112 - SE +/- 893.82, N = 3; SE +/- 185.49, N = 3; SE +/- 426.73, N = 3
RocksDB 8.0 - Test: Read Random Write Random - Op/s, More Is Better - off, safe RET, safe RET no microcode: 2951684, 2839085, 2872765 - SE +/- 35283.44, N = 4; SE +/- 21895.20, N = 3; SE +/- 18652.89, N = 3
(CXX) g++ options for both RocksDB results: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
SPECFEM3D 4.0 - Model: Mount St. Helens - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 11.80, 12.01, 11.98 - SE +/- 0.05, N = 3; SE +/- 0.04, N = 3; SE +/- 0.07, N = 3
SPECFEM3D 4.0 - Model: Layered Halfspace - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 31.85, 31.66, 31.83 - SE +/- 0.35, N = 3; SE +/- 0.18, N = 3; SE +/- 0.23, N = 3
SPECFEM3D 4.0 - Model: Tomographic Model - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 14.13, 14.19, 14.40 - SE +/- 0.09, N = 3; SE +/- 0.20, N = 3; SE +/- 0.15, N = 3
SPECFEM3D 4.0 - Model: Homogeneous Halfspace - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 17.42, 17.69, 17.64 - SE +/- 0.07, N = 3; SE +/- 0.21, N = 3; SE +/- 0.20, N = 4
SPECFEM3D 4.0 - Model: Water-layered Halfspace - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 29.77, 29.59, 30.43 - SE +/- 0.15, N = 3; SE +/- 0.35, N = 3; SE +/- 0.25, N = 3
(F9X) gfortran options for all five SPECFEM3D results: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
SQLite 3.41.2 - Threads / Copies: 8 - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 3.755, 5.006, 4.850 - SE +/- 0.013, N = 3; SE +/- 0.036, N = 3; SE +/- 0.016, N = 3
SQLite 3.41.2 - Threads / Copies: 16 - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 6.273, 8.800, 8.834 - SE +/- 0.020, N = 3; SE +/- 0.052, N = 3; SE +/- 0.024, N = 3
(CC) gcc options for both SQLite results: -O2 -lz -lm
TensorFlow 2.12 - Device: CPU - Batch Size: 64 - Model: ResNet-50 - images/sec, More Is Better - off, safe RET, safe RET no microcode: 17.78, 15.65, 15.56 - SE +/- 0.06, N = 3; SE +/- 0.03, N = 3; SE +/- 0.02, N = 3
Timed Godot Game Engine Compilation 4.0 - Time To Compile - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 121.95, 125.06, 125.66 - SE +/- 0.19, N = 3; SE +/- 0.24, N = 3; SE +/- 0.33, N = 3
Timed Linux Kernel Compilation 6.1 - Build: defconfig - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 31.19, 37.24, 37.62 - SE +/- 0.34, N = 5; SE +/- 0.37, N = 6; SE +/- 0.35, N = 6
Timed Linux Kernel Compilation 6.1 - Build: allmodconfig - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 289.06, 338.16, 344.24 - SE +/- 0.49, N = 3; SE +/- 0.79, N = 3; SE +/- 0.90, N = 3
Timed LLVM Compilation 16.0 - Build System: Ninja - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 176.37, 181.53, 182.17 - SE +/- 0.11, N = 3; SE +/- 0.15, N = 3; SE +/- 0.11, N = 3
Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 136.69, 138.85, 137.52 - SE +/- 0.85, N = 3; SE +/- 1.05, N = 3; SE +/- 0.66, N = 3 - (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
Timed Node.js Compilation 19.8.1 - Time To Compile - Seconds, Fewer Is Better - off, safe RET, safe RET no microcode: 164.27, 173.06, 172.75 - SE +/- 0.14, N = 3; SE +/- 0.05, N = 3; SE +/- 0.12, N = 3
Phoronix Test Suite v10.8.4