Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.
off Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
safe RET no microcode Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
safe RET Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
IBPB Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads), Motherboard: AMD DAYTONA_X (RYM1009B BIOS), Chipset: AMD Starship/Matisse, Memory: 256GB, Disk: 800GB INTEL SSDPF21Q800GB, Graphics: ASPEED, Monitor: VE228, Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04, Kernel: 6.5.0-rc5-phx-tues (x86_64), Desktop: GNOME Shell 42.5, Display Server: X Server 1.21.1.3, Vulkan: 1.3.224, Compiler: GCC 11.3.0 + LLVM 14.0.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
AMD EPYC 7763 1P spec_rstack_overflow OpenBenchmarking.org Phoronix Test Suite AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads) AMD DAYTONA_X (RYM1009B BIOS) AMD Starship/Matisse 256GB 800GB INTEL SSDPF21Q800GB ASPEED VE228 2 x Mellanox MT27710 Ubuntu 22.04 6.5.0-rc5-phx-tues (x86_64) GNOME Shell 42.5 X Server 1.21.1.3 1.3.224 GCC 11.3.0 + LLVM 14.0.0 ext4 1920x1080 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution AMD EPYC 7763 1P Spec_rstack_overflow Benchmarks System Logs - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 - off: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 - safe RET no microcode: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 - safe RET: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1 - IBPB: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1 - OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04) - Python 3.10.6 - off: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - safe RET no microcode: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - safe RET: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - IBPB: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
off safe RET no microcode safe RET IBPB Result Overview Phoronix Test Suite 100% 117% 133% 150% 166% MariaDB PostgreSQL RocksDB SQLite Timed Linux Kernel Compilation nginx Timed Node.js Compilation OpenRadioss Numpy Benchmark Timed LLVM Compilation Apache Spark DaCapo Benchmark TensorFlow Timed Godot Game Engine Compilation CockroachDB ClickHouse Apache Cassandra 7-Zip Compression Remhos Timed MrBayes Analysis ACES DGEMM OpenFOAM Redis 7.0.12 + memtier_benchmark Apache IoTDB Blender SPECFEM3D OpenVINO Algebraic Multi-Grid Benchmark NAMD Embree GROMACS OSPRay OpenVKL Neural Magic DeepSparse
AMD EPYC 7763 1P spec_rstack_overflow dacapobench: Jython dacapobench: Tradebeans openradioss: Bumper Beam openradioss: Cell Phone Drop Test openradioss: Bird Strike on Windshield specfem3d: Mount St. Helens specfem3d: Layered Halfspace specfem3d: Water-layered Halfspace specfem3d: Tomographic Model remhos: Sample Remap Example openradioss: Rubber O-Ring Seal Installation openradioss: INIVOL and Fluid Structure Interaction Drop Container mrbayes: Primate Phylogeny Analysis tensorflow: CPU - 64 - ResNet-50 numpy: deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream specfem3d: Homogeneous Halfspace deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream gromacs: MPI CPU - water_GMX50_bare namd: ATPase Simulation - 327,506 Atoms openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU mt-dgemm: Sustained Floating-Point Rate amg: openfoam: drivaerFastback, Medium Mesh Size - Mesh Time openfoam: drivaerFastback, Medium Mesh Size - Execution Time compress-7zip: Compression Rating compress-7zip: Decompression Rating build-llvm: Ninja build-linux-kernel: defconfig build-linux-kernel: allmodconfig blender: BMW27 - CPU-Only blender: Pabellon Barcelona - CPU-Only build-godot: Time To Compile embree: Pathtracer ISPC - Crown embree: Pathtracer ISPC - Asian Dragon openvkl: vklBenchmark ISPC ospray: particle_volume/ao/real_time ospray: particle_volume/scivis/real_time ospray: particle_volume/pathtracer/real_time ospray: gravity_spheres_volume/dim_512/ao/real_time ospray: gravity_spheres_volume/dim_512/scivis/real_time ospray: gravity_spheres_volume/dim_512/pathtracer/real_time build-nodejs: Time To Compile nginx: 500 nginx: 1000 apache-iotdb: 200 - 1 - 200 apache-iotdb: 200 - 1 - 200 apache-iotdb: 200 - 1 - 500 apache-iotdb: 200 - 1 - 500 apache-iotdb: 500 - 1 - 200 apache-iotdb: 500 - 1 - 200 apache-iotdb: 500 - 1 - 500 apache-iotdb: 500 - 1 - 500 apache-iotdb: 200 - 100 - 200 apache-iotdb: 200 - 100 - 200 apache-iotdb: 200 - 100 - 500 apache-iotdb: 200 - 100 - 500 apache-iotdb: 500 - 100 - 200 apache-iotdb: 500 - 100 - 200 apache-iotdb: 500 - 100 - 500 apache-iotdb: 500 - 100 - 500 spark: 1000000 - 100 - SHA-512 Benchmark Time spark: 1000000 - 100 - Calculate Pi Benchmark spark: 1000000 - 100 - Group By Test Time spark: 1000000 - 100 - Repartition Test Time spark: 1000000 - 100 - Inner Join Test Time spark: 1000000 - 100 - Broadcast Inner Join Test Time spark: 1000000 - 100 - Calculate Pi Benchmark Using Dataframe clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache clickhouse: 100M Rows Hits Dataset, Second Run clickhouse: 100M Rows Hits Dataset, Third Run cockroach: KV, 50% Reads - 128 cockroach: KV, 95% Reads - 128 memtier-benchmark: Redis - 50 - 1:5 memtier-benchmark: Redis - 100 - 1:5 memtier-benchmark: Redis - 50 - 1:10 memtier-benchmark: Redis - 100 - 1:10 sqlite: 8 sqlite: 16 rocksdb: Update Rand rocksdb: Read Rand Write Rand cassandra: Writes pgbench: 100 - 800 - Read Only pgbench: 100 - 800 - Read Only - Average Latency pgbench: 100 - 800 - Read Write pgbench: 100 - 800 - Read Write - Average Latency mysqlslap: 4096 mysqlslap: 8192 off safe RET no microcode safe RET IBPB 4193 3993 87.72 33.10 144.83 11.801238732 31.845630424 29.772535386 14.134265606 17.375 77.46 162.13 136.686 17.78 457.23 37.6037 839.7201 487.2450 65.5911 468.8306 17.417120933 68.1483 3840.6307 8.3078 46.7020 678.9330 53.5968 596.5915 576.9722 55.3922 5.680 0.38130 7.68 4092.91 27.83 1141.43 1126.03 28.40 24.200551 1011799000 140.61562 633.51902 384374 385585 176.374 31.192 289.063 27.34 84.50 121.948 57.4229 64.5964 453 18.0226 17.7511 157.829 8.96051 8.33174 13.1355 164.268 169583.15 166499.89 960525.66 13.83 1271946.57 32.36 1176385.35 14.05 1415756.33 31.49 43665846.28 37.70 39463981.42 117.97 49501499.13 36.58 58682618.18 78.92 3.39 31.84 4.91 2.09 1.88 1.30 349.43 361.81 362.64 103635.0 135187.2 2204628.92 2197287.30 2177211.80 2195705.51 3.755 6.273 462287 2951684 238741 3128719 0.256 61604 12.988 590 355 4191 4096 93.68 36.37 152.91 11.982163460 31.829261870 30.427531709 14.404419238 17.788 85.04 163.02 137.518 15.56 418.95 37.6987 840.7721 485.0069 65.8910 468.3646 17.643680397 68.2595 3816.7676 8.3612 46.7239 679.5766 53.5972 596.5579 577.0589 55.3747 5.730 0.38115 7.58 4124.74 27.79 1142.29 1126.64 28.38 24.695818 999645400 145.06069 644.36223 334812 383039 182.169 37.623 344.242 27.58 84.69 125.663 57.2956 64.3916 452 18.0288 17.7305 155.165 8.96941 8.32749 13.2621 172.749 144020.03 140555.98 947741.34 14.05 1342031.11 30.29 1202637.36 13.61 1408658.83 31.93 46538766.01 35.10 37720117.40 123.61 47445770.18 38.54 58073516.79 79.19 3.42 32.02 5.17 2.38 2.22 1.39 323.42 337.01 329.19 100851.4 131487.0 2218601.79 2167181.09 2173694.77 2154339.26 4.850 8.834 428112 2872765 233069 2707280 0.296 55175 14.499 412 301 4241 4143 93.90 36.40 152.27 12.010380781 31.659940885 29.590868260 14.188058962 17.958 84.48 163.97 138.851 15.65 422.58 37.6319 840.4236 487.3677 65.5662 468.2170 17.690298650 68.2325 3834.2439 8.3219 46.5677 682.2071 53.5729 596.7822 576.8166 55.4028 5.706 0.38098 7.60 4114.58 27.82 1142.53 1126.14 28.39 23.702889 999102100 144.02174 643.71316 335595 383515 181.528 37.243 338.157 27.46 84.49 125.060 57.3138 64.6742 453 17.9817 17.7305 156.419 8.94049 8.33813 13.2538 173.064 142619.84 143271.26 918691.45 14.73 1345598.59 30.14 1211172.13 13.36 1583717.62 27.73 44027904.89 37.53 38833415.97 120.06 50578426.54 35.82 57099408.15 82.24 3.47 31.43 5.15 2.26 2.14 1.41 318.12 337.12 337.45 99601.6 132046.0 2145436.26 2145052.14 2172804.71 2157815.69 5.006 8.800 426947 2839085 236241 2768445 0.289 54837 14.589 418 301 4446 5305 113.68 40.11 160.70 11.951448577 32.009762142 30.053834808 14.225825227 18.658 99.04 171.75 144.829 17.45 389.92 37.6818 840.8881 485.6842 65.7767 467.3150 17.695914319 68.3456 3824.8431 8.3411 46.6121 681.3961 53.5782 596.7902 575.7335 55.5026 5.707 0.38534 7.46 4204.34 27.71 1144.98 1116.89 28.62 24.251474 1005138667 148.25442 645.40958 371799 385487 204.080 40.085 352.178 27.73 85.63 135.962 57.2967 63.4668 450 17.9663 17.6743 153.439 8.89239 8.26549 13.1709 195.493 137051.69 135431.46 921701.44 14.78 1287324.35 31.74 1344749.20 11.93 1441637.63 31.11 44816394.74 36.47 38572529.56 120.53 49316970.28 36.56 57529201.42 80.78 3.71 31.49 5.74 2.31 2.41 1.64 2.60 336.69 347.92 349.50 95416.0 119163.8 2092844.22 2126493.29 2137964.98 2148876.03 4.793 7.934 322231 2130006 220814 1733827 0.461 50463 15.854 274 276 OpenBenchmarking.org
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans IBPB safe RET safe RET no microcode off 1100 2200 3300 4400 5500 SE +/- 56.17, N = 4 SE +/- 28.11, N = 4 SE +/- 44.47, N = 4 SE +/- 42.66, N = 4 5305 4143 4096 3993
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Bumper Beam IBPB safe RET safe RET no microcode off 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.33, N = 3 113.68 93.90 93.68 87.72
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Cell Phone Drop Test IBPB safe RET safe RET no microcode off 9 18 27 36 45 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 SE +/- 0.26, N = 3 SE +/- 0.11, N = 3 40.11 36.40 36.37 33.10
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Bird Strike on Windshield IBPB safe RET no microcode safe RET off 40 80 120 160 200 SE +/- 0.89, N = 3 SE +/- 0.67, N = 3 SE +/- 0.73, N = 3 SE +/- 0.07, N = 3 160.70 152.91 152.27 144.83
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Mount St. Helens safe RET safe RET no microcode IBPB off 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 12.01 11.98 11.95 11.80 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Layered Halfspace IBPB off safe RET no microcode safe RET 7 14 21 28 35 SE +/- 0.21, N = 3 SE +/- 0.35, N = 3 SE +/- 0.23, N = 3 SE +/- 0.18, N = 3 32.01 31.85 31.83 31.66 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Water-layered Halfspace safe RET no microcode IBPB off safe RET 7 14 21 28 35 SE +/- 0.25, N = 3 SE +/- 0.19, N = 3 SE +/- 0.15, N = 3 SE +/- 0.35, N = 3 30.43 30.05 29.77 29.59 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Tomographic Model safe RET no microcode IBPB safe RET off 4 8 12 16 20 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 SE +/- 0.20, N = 3 SE +/- 0.09, N = 3 14.40 14.23 14.19 14.13 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example IBPB safe RET safe RET no microcode off 5 10 15 20 25 SE +/- 0.12, N = 14 SE +/- 0.19, N = 3 SE +/- 0.17, N = 3 SE +/- 0.23, N = 3 18.66 17.96 17.79 17.38 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Rubber O-Ring Seal Installation IBPB safe RET no microcode safe RET off 20 40 60 80 100 SE +/- 0.34, N = 3 SE +/- 0.17, N = 3 SE +/- 0.24, N = 3 SE +/- 0.23, N = 3 99.04 85.04 84.48 77.46
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: INIVOL and Fluid Structure Interaction Drop Container IBPB safe RET safe RET no microcode off 40 80 120 160 200 SE +/- 0.17, N = 3 SE +/- 0.50, N = 3 SE +/- 0.39, N = 3 SE +/- 0.16, N = 3 171.75 163.97 163.02 162.13
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis IBPB safe RET safe RET no microcode off 30 60 90 120 150 SE +/- 1.03, N = 3 SE +/- 1.05, N = 3 SE +/- 0.66, N = 3 SE +/- 0.85, N = 3 144.83 138.85 137.52 136.69 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 safe RET no microcode safe RET IBPB off 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 15.56 15.65 17.45 17.78
Neural Magic DeepSparse OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream off safe RET IBPB safe RET no microcode 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 37.60 37.63 37.68 37.70
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream IBPB safe RET no microcode safe RET off 200 400 600 800 1000 SE +/- 0.53, N = 3 SE +/- 0.36, N = 3 SE +/- 0.34, N = 3 SE +/- 0.50, N = 3 840.89 840.77 840.42 839.72
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream safe RET no microcode IBPB off safe RET 110 220 330 440 550 SE +/- 0.96, N = 3 SE +/- 1.48, N = 3 SE +/- 1.15, N = 3 SE +/- 1.01, N = 3 485.01 485.68 487.25 487.37
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream safe RET no microcode IBPB off safe RET 15 30 45 60 75 SE +/- 0.14, N = 3 SE +/- 0.20, N = 3 SE +/- 0.15, N = 3 SE +/- 0.16, N = 3 65.89 65.78 65.59 65.57
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream IBPB safe RET safe RET no microcode off 100 200 300 400 500 SE +/- 0.43, N = 3 SE +/- 0.31, N = 3 SE +/- 0.42, N = 3 SE +/- 0.24, N = 3 467.32 468.22 468.36 468.83
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Homogeneous Halfspace IBPB safe RET safe RET no microcode off 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.21, N = 3 SE +/- 0.20, N = 4 SE +/- 0.07, N = 3 17.70 17.69 17.64 17.42 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Neural Magic DeepSparse OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream IBPB safe RET no microcode safe RET off 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 68.35 68.26 68.23 68.15
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream safe RET no microcode IBPB safe RET off 800 1600 2400 3200 4000 SE +/- 7.72, N = 3 SE +/- 7.83, N = 3 SE +/- 12.28, N = 3 SE +/- 4.76, N = 3 3816.77 3824.84 3834.24 3840.63
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream safe RET no microcode IBPB safe RET off 2 4 6 8 10 SE +/- 0.0167, N = 3 SE +/- 0.0189, N = 3 SE +/- 0.0271, N = 3 SE +/- 0.0095, N = 3 8.3612 8.3411 8.3219 8.3078
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream safe RET IBPB off safe RET no microcode 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 46.57 46.61 46.70 46.72
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream safe RET IBPB safe RET no microcode off 150 300 450 600 750 SE +/- 0.96, N = 3 SE +/- 1.14, N = 3 SE +/- 1.47, N = 3 SE +/- 1.21, N = 3 682.21 681.40 679.58 678.93
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream safe RET IBPB off safe RET no microcode 12 24 36 48 60 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 53.57 53.58 53.60 53.60
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream IBPB safe RET off safe RET no microcode 130 260 390 520 650 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.18, N = 3 SE +/- 0.26, N = 3 596.79 596.78 596.59 596.56
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream IBPB safe RET off safe RET no microcode 120 240 360 480 600 SE +/- 0.94, N = 3 SE +/- 0.39, N = 3 SE +/- 0.44, N = 3 SE +/- 0.39, N = 3 575.73 576.82 576.97 577.06
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream IBPB safe RET off safe RET no microcode 12 24 36 48 60 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 55.50 55.40 55.39 55.37
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare off safe RET IBPB safe RET no microcode 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.012, N = 3 SE +/- 0.010, N = 3 SE +/- 0.011, N = 3 SE +/- 0.006, N = 3 5.680 5.706 5.707 5.730 1. (CXX) g++ options: -O3
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms IBPB off safe RET no microcode safe RET 0.0867 0.1734 0.2601 0.3468 0.4335 SE +/- 0.00026, N = 3 SE +/- 0.00017, N = 3 SE +/- 0.00029, N = 3 SE +/- 0.00028, N = 3 0.38534 0.38130 0.38115 0.38098
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU IBPB safe RET no microcode safe RET off 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 7.46 7.58 7.60 7.68 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU IBPB safe RET no microcode safe RET off 900 1800 2700 3600 4500 SE +/- 10.56, N = 3 SE +/- 6.87, N = 3 SE +/- 10.77, N = 3 SE +/- 14.65, N = 3 4204.34 4124.74 4114.58 4092.91 MIN: 2302.89 / MAX: 4817.72 MIN: 2129.26 / MAX: 5016.36 MIN: 2087 / MAX: 5053.62 MIN: 3409.52 / MAX: 4641.43 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU IBPB safe RET no microcode safe RET off 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 27.71 27.79 27.82 27.83 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU IBPB safe RET safe RET no microcode off 200 400 600 800 1000 SE +/- 0.42, N = 3 SE +/- 1.09, N = 3 SE +/- 0.27, N = 3 SE +/- 0.32, N = 3 1144.98 1142.53 1142.29 1141.43 MIN: 502.04 / MAX: 1175.93 MIN: 999.01 / MAX: 1177.02 MIN: 985.75 / MAX: 1168.76 MIN: 998.76 / MAX: 1165.45 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16 - Device: CPU IBPB off safe RET safe RET no microcode 200 400 600 800 1000 SE +/- 0.40, N = 3 SE +/- 0.17, N = 3 SE +/- 0.13, N = 3 SE +/- 0.72, N = 3 1116.89 1126.03 1126.14 1126.64 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16 - Device: CPU IBPB off safe RET safe RET no microcode 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 28.62 28.40 28.39 28.38 MIN: 14.91 / MAX: 49.84 MIN: 14.74 / MAX: 48.66 MIN: 14.64 / MAX: 50.33 MIN: 14.89 / MAX: 51.63 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 safe RET safe RET no microcode IBPB off 200M 400M 600M 800M 1000M SE +/- 367255.40, N = 3 SE +/- 575791.94, N = 3 SE +/- 1724277.85, N = 3 SE +/- 839009.73, N = 3 999102100 999645400 1005138667 1011799000 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time IBPB safe RET no microcode safe RET off 30 60 90 120 150 148.25 145.06 144.02 140.62 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time IBPB safe RET no microcode safe RET off 140 280 420 560 700 645.41 644.36 643.71 633.52 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating safe RET no microcode safe RET IBPB off 80K 160K 240K 320K 400K SE +/- 380.69, N = 3 SE +/- 605.48, N = 3 SE +/- 312.85, N = 3 SE +/- 845.58, N = 3 383039 383515 385487 385585 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig IBPB safe RET no microcode safe RET off 9 18 27 36 45 SE +/- 0.37, N = 7 SE +/- 0.35, N = 6 SE +/- 0.37, N = 6 SE +/- 0.34, N = 5 40.09 37.62 37.24 31.19
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only IBPB safe RET no microcode safe RET off 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 27.73 27.58 27.46 27.34
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only IBPB safe RET no microcode off safe RET 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.13, N = 3 SE +/- 0.04, N = 3 85.63 84.69 84.50 84.49
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Crown safe RET no microcode IBPB safe RET off 13 26 39 52 65 SE +/- 0.14, N = 3 SE +/- 0.10, N = 3 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 57.30 57.30 57.31 57.42 MIN: 56.26 / MAX: 58.69 MIN: 56.3 / MAX: 58.61 MIN: 56.2 / MAX: 58.59 MIN: 56.59 / MAX: 58.54
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon IBPB safe RET no microcode off safe RET 14 28 42 56 70 SE +/- 0.14, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 63.47 64.39 64.60 64.67 MIN: 62.67 / MAX: 65.74 MIN: 63.77 / MAX: 66.16 MIN: 64.05 / MAX: 66.13 MIN: 64.11 / MAX: 66.01
OpenVKL OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.3.1 Benchmark: vklBenchmark ISPC IBPB safe RET no microcode off safe RET 100 200 300 400 500 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 SE +/- 0.58, N = 3 SE +/- 0.00, N = 3 450 452 453 453 MIN: 83 / MAX: 2495 MIN: 85 / MAX: 2535 MIN: 84 / MAX: 2528 MIN: 84 / MAX: 2520
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time IBPB safe RET off safe RET no microcode 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 17.97 17.98 18.02 18.03
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time IBPB safe RET no microcode safe RET off 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 17.67 17.73 17.73 17.75
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time IBPB safe RET no microcode safe RET off 30 60 90 120 150 SE +/- 0.43, N = 3 SE +/- 1.83, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 153.44 155.17 156.42 157.83
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time IBPB safe RET off safe RET no microcode 3 6 9 12 15 SE +/- 0.01872, N = 3 SE +/- 0.01456, N = 3 SE +/- 0.02460, N = 3 SE +/- 0.02941, N = 3 8.89239 8.94049 8.96051 8.96941
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time IBPB safe RET no microcode off safe RET 2 4 6 8 10 SE +/- 0.02088, N = 3 SE +/- 0.01659, N = 3 SE +/- 0.00864, N = 3 SE +/- 0.01059, N = 3 8.26549 8.32749 8.33174 8.33813
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time off IBPB safe RET safe RET no microcode 3 6 9 12 15 SE +/- 0.13, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 13.14 13.17 13.25 13.26
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 IBPB safe RET safe RET no microcode off 40K 80K 120K 160K 200K SE +/- 262.73, N = 3 SE +/- 251.96, N = 3 SE +/- 284.55, N = 3 SE +/- 284.72, N = 3 137051.69 142619.84 144020.03 169583.15 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 IBPB safe RET no microcode safe RET off 40K 80K 120K 160K 200K SE +/- 242.54, N = 3 SE +/- 352.89, N = 3 SE +/- 314.03, N = 3 SE +/- 362.13, N = 3 135431.46 140555.98 143271.26 166499.89 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Apache IoTDB OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 safe RET IBPB safe RET no microcode off 200K 400K 600K 800K 1000K SE +/- 7998.38, N = 8 SE +/- 7583.18, N = 9 SE +/- 6730.71, N = 12 SE +/- 8467.91, N = 3 918691.45 921701.44 947741.34 960525.66
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 IBPB safe RET safe RET no microcode off 4 8 12 16 20 SE +/- 0.19, N = 9 SE +/- 0.21, N = 8 SE +/- 0.16, N = 12 SE +/- 0.19, N = 3 14.78 14.73 14.05 13.83 MAX: 618.06 MAX: 645.11 MAX: 609.96 MAX: 596.78
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 off IBPB safe RET no microcode safe RET 300K 600K 900K 1200K 1500K SE +/- 7578.67, N = 3 SE +/- 14032.06, N = 4 SE +/- 1525.49, N = 3 SE +/- 9180.92, N = 3 1271946.57 1287324.35 1342031.11 1345598.59
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 off IBPB safe RET no microcode safe RET 8 16 24 32 40 SE +/- 0.24, N = 3 SE +/- 0.42, N = 4 SE +/- 0.02, N = 3 SE +/- 0.22, N = 3 32.36 31.74 30.29 30.14 MAX: 646.51 MAX: 667.18 MAX: 715.01 MAX: 641.04
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 off safe RET no microcode safe RET IBPB 300K 600K 900K 1200K 1500K SE +/- 1566.77, N = 3 SE +/- 6553.14, N = 3 SE +/- 4253.29, N = 3 SE +/- 3166.01, N = 3 1176385.35 1202637.36 1211172.13 1344749.20
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 off safe RET no microcode safe RET IBPB 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 SE +/- 0.13, N = 3 14.05 13.61 13.36 11.93 MAX: 858.17 MAX: 854.4 MAX: 881.3 MAX: 855.56
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 safe RET no microcode off IBPB safe RET 300K 600K 900K 1200K 1500K SE +/- 13029.07, N = 3 SE +/- 4294.81, N = 3 SE +/- 6687.04, N = 3 SE +/- 5073.96, N = 3 1408658.83 1415756.33 1441637.63 1583717.62
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 safe RET no microcode off IBPB safe RET 7 14 21 28 35 SE +/- 0.49, N = 3 SE +/- 0.29, N = 3 SE +/- 0.35, N = 3 SE +/- 0.22, N = 3 31.93 31.49 31.11 27.73 MAX: 930.97 MAX: 939.96 MAX: 908.02 MAX: 938.92
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 off safe RET IBPB safe RET no microcode 10M 20M 30M 40M 50M SE +/- 574678.74, N = 15 SE +/- 543529.82, N = 15 SE +/- 146499.20, N = 3 SE +/- 614274.26, N = 3 43665846.28 44027904.89 44816394.74 46538766.01
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 off safe RET IBPB safe RET no microcode 9 18 27 36 45 SE +/- 0.55, N = 15 SE +/- 0.52, N = 15 SE +/- 0.32, N = 3 SE +/- 0.62, N = 3 37.70 37.53 36.47 35.10 MAX: 802.64 MAX: 755.16 MAX: 808.57 MAX: 728.37
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 safe RET no microcode IBPB safe RET off 8M 16M 24M 32M 40M SE +/- 394126.89, N = 5 SE +/- 327739.29, N = 8 SE +/- 288707.73, N = 15 SE +/- 302926.36, N = 10 37720117.40 38572529.56 38833415.97 39463981.42
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 safe RET no microcode IBPB safe RET off 30 60 90 120 150 SE +/- 1.63, N = 5 SE +/- 1.16, N = 8 SE +/- 0.86, N = 15 SE +/- 0.95, N = 10 123.61 120.53 120.06 117.97 MAX: 4533.33 MAX: 4401.37 MAX: 4495.21 MAX: 4652.25
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 safe RET no microcode IBPB off safe RET 11M 22M 33M 44M 55M SE +/- 147114.88, N = 3 SE +/- 616490.96, N = 3 SE +/- 681823.31, N = 3 SE +/- 634314.77, N = 3 47445770.18 49316970.28 49501499.13 50578426.54
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 safe RET no microcode off IBPB safe RET 9 18 27 36 45 SE +/- 0.11, N = 3 SE +/- 0.61, N = 3 SE +/- 0.44, N = 3 SE +/- 0.49, N = 3 38.54 36.58 36.56 35.82 MAX: 3276.77 MAX: 2252.73 MAX: 2253.21 MAX: 3267.55
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 safe RET IBPB safe RET no microcode off 13M 26M 39M 52M 65M SE +/- 721225.08, N = 3 SE +/- 269354.94, N = 3 SE +/- 648692.91, N = 4 SE +/- 817020.04, N = 3 57099408.15 57529201.42 58073516.79 58682618.18
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 safe RET IBPB safe RET no microcode off 20 40 60 80 100 SE +/- 1.29, N = 3 SE +/- 0.42, N = 3 SE +/- 0.81, N = 4 SE +/- 2.14, N = 3 82.24 80.78 79.19 78.92 MAX: 3625.32 MAX: 2592.69 MAX: 5165.86 MAX: 1729.94
Apache Spark This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmars the Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating of test data and various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time IBPB safe RET safe RET no microcode off 0.8348 1.6696 2.5044 3.3392 4.174 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 15 3.71 3.47 3.42 3.39
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark safe RET no microcode off IBPB safe RET 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.12, N = 15 SE +/- 0.20, N = 3 SE +/- 0.33, N = 3 32.02 31.84 31.49 31.43
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Group By Test Time IBPB safe RET no microcode safe RET off 1.2915 2.583 3.8745 5.166 6.4575 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 15 5.74 5.17 5.15 4.91
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Repartition Test Time safe RET no microcode IBPB safe RET off 0.5355 1.071 1.6065 2.142 2.6775 SE +/- 0.04, N = 3 SE +/- 0.12, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 15 2.38 2.31 2.26 2.09
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Inner Join Test Time IBPB safe RET no microcode safe RET off 0.5423 1.0846 1.6269 2.1692 2.7115 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 15 2.41 2.22 2.14 1.88
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time IBPB safe RET safe RET no microcode off 0.369 0.738 1.107 1.476 1.845 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 15 1.64 1.41 1.39 1.30
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark Using Dataframe IBPB 0.585 1.17 1.755 2.34 2.925 SE +/- 0.08, N = 3 2.60
ClickHouse ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, First Run / Cold Cache safe RET safe RET no microcode IBPB off 80 160 240 320 400 SE +/- 2.94, N = 3 SE +/- 3.38, N = 5 SE +/- 2.84, N = 3 SE +/- 0.68, N = 3 318.12 323.42 336.69 349.43 MIN: 30.57 / MAX: 3333.33 MIN: 30.82 / MAX: 5000 MIN: 31.5 / MAX: 4000 MIN: 31.06 / MAX: 4285.71
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, Second Run safe RET no microcode safe RET IBPB off 80 160 240 320 400 SE +/- 2.16, N = 5 SE +/- 4.86, N = 3 SE +/- 2.20, N = 3 SE +/- 1.42, N = 3 337.01 337.12 347.92 361.81 MIN: 30.49 / MAX: 3529.41 MIN: 30.79 / MAX: 4000 MIN: 31.85 / MAX: 3750 MIN: 31.46 / MAX: 4000
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, Third Run safe RET no microcode safe RET IBPB off 80 160 240 320 400 SE +/- 3.27, N = 5 SE +/- 1.85, N = 3 SE +/- 2.22, N = 3 SE +/- 2.21, N = 3 329.19 337.45 349.50 362.64 MIN: 31.32 / MAX: 2857.14 MIN: 31.46 / MAX: 4000 MIN: 31.56 / MAX: 5000 MIN: 31.5 / MAX: 4285.71
CockroachDB CockroachDB is a cloud-native, distributed SQL database for data intensive applications. This test profile uses a server-less CockroachDB configuration to test various Coackroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 50% Reads - Concurrency: 128 IBPB safe RET safe RET no microcode off 20K 40K 60K 80K 100K SE +/- 341.24, N = 3 SE +/- 948.29, N = 15 SE +/- 719.41, N = 15 SE +/- 275.86, N = 3 95416.0 99601.6 100851.4 103635.0
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 95% Reads - Concurrency: 128 IBPB safe RET no microcode safe RET off 30K 60K 90K 120K 150K SE +/- 408.63, N = 3 SE +/- 1043.12, N = 13 SE +/- 1387.70, N = 15 SE +/- 931.05, N = 3 119163.8 131487.0 132046.0 135187.2
Redis 7.0.12 + memtier_benchmark Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 IBPB safe RET off safe RET no microcode 500K 1000K 1500K 2000K 2500K SE +/- 21878.46, N = 3 SE +/- 1778.76, N = 3 SE +/- 11955.97, N = 3 SE +/- 31351.12, N = 3 2092844.22 2145436.26 2204628.92 2218601.79 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 IBPB safe RET safe RET no microcode off 500K 1000K 1500K 2000K 2500K SE +/- 942.73, N = 3 SE +/- 4916.89, N = 3 SE +/- 17712.54, N = 3 SE +/- 14704.83, N = 3 2126493.29 2145052.14 2167181.09 2197287.30 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 IBPB safe RET safe RET no microcode off 500K 1000K 1500K 2000K 2500K SE +/- 13504.65, N = 3 SE +/- 2448.62, N = 3 SE +/- 17754.58, N = 3 SE +/- 14630.02, N = 3 2137964.98 2172804.71 2173694.77 2177211.80 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 IBPB safe RET no microcode safe RET off 500K 1000K 1500K 2000K 2500K SE +/- 12623.44, N = 3 SE +/- 16754.20, N = 10 SE +/- 792.70, N = 3 SE +/- 30210.22, N = 3 2148876.03 2154339.26 2157815.69 2195705.51 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
SQLite This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database with a variable number of concurrent repetitions -- up to the maximum number of CPU threads available. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 8 safe RET safe RET no microcode IBPB off 1.1264 2.2528 3.3792 4.5056 5.632 SE +/- 0.036, N = 3 SE +/- 0.016, N = 3 SE +/- 0.010, N = 3 SE +/- 0.013, N = 3 5.006 4.850 4.793 3.755 1. (CC) gcc options: -O2 -lz -lm
OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 16 safe RET no microcode safe RET IBPB off 2 4 6 8 10 SE +/- 0.024, N = 3 SE +/- 0.052, N = 3 SE +/- 0.007, N = 3 SE +/- 0.020, N = 3 8.834 8.800 7.934 6.273 1. (CC) gcc options: -O2 -lz -lm
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Update Random IBPB safe RET safe RET no microcode off 100K 200K 300K 400K 500K SE +/- 110.81, N = 3 SE +/- 185.49, N = 3 SE +/- 426.73, N = 3 SE +/- 893.82, N = 3 322231 426947 428112 462287 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read Random Write Random IBPB safe RET safe RET no microcode off 600K 1200K 1800K 2400K 3000K SE +/- 7875.65, N = 3 SE +/- 21895.20, N = 3 SE +/- 18652.89, N = 3 SE +/- 35283.44, N = 4 2130006 2839085 2872765 2951684 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only IBPB safe RET no microcode safe RET off 700K 1400K 2100K 2800K 3500K SE +/- 2988.66, N = 3 SE +/- 29158.10, N = 3 SE +/- 34286.68, N = 3 SE +/- 1705.16, N = 3 1733827 2707280 2768445 3128719 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency IBPB safe RET no microcode safe RET off 0.1037 0.2074 0.3111 0.4148 0.5185 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.004, N = 3 SE +/- 0.000, N = 3 0.461 0.296 0.289 0.256 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Write IBPB safe RET safe RET no microcode off 13K 26K 39K 52K 65K SE +/- 133.40, N = 3 SE +/- 207.71, N = 3 SE +/- 66.28, N = 3 SE +/- 418.78, N = 3 50463 54837 55175 61604 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency IBPB safe RET safe RET no microcode off 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 15.85 14.59 14.50 12.99 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
MariaDB This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 11.0.1 Clients: 4096 IBPB safe RET no microcode safe RET off 130 260 390 520 650 SE +/- 0.71, N = 3 SE +/- 2.96, N = 3 SE +/- 3.51, N = 3 SE +/- 5.48, N = 3 274 412 418 590 1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 11.0.1 Clients: 8192 IBPB safe RET no microcode safe RET off 80 160 240 320 400 SE +/- 0.62, N = 3 SE +/- 0.73, N = 3 SE +/- 1.18, N = 3 SE +/- 3.35, N = 3 276 301 301 355 1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl
off Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 10 August 2023 05:20 by user phoronix.
safe RET no microcode Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 9 August 2023 18:46 by user phoronix.
safe RET Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 10 August 2023 12:35 by user phoronix.
IBPB Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads), Motherboard: AMD DAYTONA_X (RYM1009B BIOS), Chipset: AMD Starship/Matisse, Memory: 256GB, Disk: 800GB INTEL SSDPF21Q800GB, Graphics: ASPEED, Monitor: VE228, Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04, Kernel: 6.5.0-rc5-phx-tues (x86_64), Desktop: GNOME Shell 42.5, Display Server: X Server 1.21.1.3, Vulkan: 1.3.224, Compiler: GCC 11.3.0 + LLVM 14.0.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 10 August 2023 20:55 by user phoronix.