Benchmarks by Michael Larabel for a future article looking at AMD Inception impact.
safe RET no microcode Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
off Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
safe RET Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
IBPB Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads), Motherboard: AMD DAYTONA_X (RYM1009B BIOS), Chipset: AMD Starship/Matisse, Memory: 256GB, Disk: 800GB INTEL SSDPF21Q800GB, Graphics: ASPEED, Monitor: VE228, Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04, Kernel: 6.5.0-rc5-phx-tues (x86_64), Desktop: GNOME Shell 42.5, Display Server: X Server 1.21.1.3, Vulkan: 1.3.224, Compiler: GCC 11.3.0 + LLVM 14.0.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
AMD EPYC 7763 1P spec_rstack_overflow OpenBenchmarking.org Phoronix Test Suite AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads) AMD DAYTONA_X (RYM1009B BIOS) AMD Starship/Matisse 256GB 800GB INTEL SSDPF21Q800GB ASPEED VE228 2 x Mellanox MT27710 Ubuntu 22.04 6.5.0-rc5-phx-tues (x86_64) GNOME Shell 42.5 X Server 1.21.1.3 1.3.224 GCC 11.3.0 + LLVM 14.0.0 ext4 1920x1080 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution AMD EPYC 7763 1P Spec_rstack_overflow Benchmarks System Logs - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 - safe RET no microcode: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 - off: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 - safe RET: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1 - IBPB: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1 - OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04) - Python 3.10.6 - safe RET no microcode: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - off: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - safe RET: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - IBPB: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
safe RET no microcode off safe RET IBPB Result Overview Phoronix Test Suite 100% 117% 133% 150% 166% MariaDB PostgreSQL RocksDB SQLite Timed Linux Kernel Compilation nginx Timed Node.js Compilation OpenRadioss Numpy Benchmark Timed LLVM Compilation Apache Spark DaCapo Benchmark TensorFlow Timed Godot Game Engine Compilation CockroachDB ClickHouse Apache Cassandra 7-Zip Compression Remhos Timed MrBayes Analysis ACES DGEMM OpenFOAM Redis 7.0.12 + memtier_benchmark Apache IoTDB Blender SPECFEM3D OpenVINO Algebraic Multi-Grid Benchmark NAMD Embree GROMACS OSPRay OpenVKL Neural Magic DeepSparse
AMD EPYC 7763 1P spec_rstack_overflow compress-7zip: Compression Rating compress-7zip: Decompression Rating mt-dgemm: Sustained Floating-Point Rate amg: cassandra: Writes apache-iotdb: 200 - 1 - 200 apache-iotdb: 200 - 1 - 200 apache-iotdb: 200 - 1 - 500 apache-iotdb: 200 - 1 - 500 apache-iotdb: 500 - 1 - 200 apache-iotdb: 500 - 1 - 200 apache-iotdb: 500 - 1 - 500 apache-iotdb: 500 - 1 - 500 apache-iotdb: 200 - 100 - 200 apache-iotdb: 200 - 100 - 200 apache-iotdb: 200 - 100 - 500 apache-iotdb: 200 - 100 - 500 apache-iotdb: 500 - 100 - 200 apache-iotdb: 500 - 100 - 200 apache-iotdb: 500 - 100 - 500 apache-iotdb: 500 - 100 - 500 spark: 1000000 - 100 - SHA-512 Benchmark Time spark: 1000000 - 100 - Calculate Pi Benchmark spark: 1000000 - 100 - Group By Test Time spark: 1000000 - 100 - Repartition Test Time spark: 1000000 - 100 - Inner Join Test Time spark: 1000000 - 100 - Broadcast Inner Join Test Time spark: 1000000 - 100 - Calculate Pi Benchmark Using Dataframe blender: BMW27 - CPU-Only blender: Pabellon Barcelona - CPU-Only clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache clickhouse: 100M Rows Hits Dataset, Second Run clickhouse: 100M Rows Hits Dataset, Third Run cockroach: KV, 50% Reads - 128 cockroach: KV, 95% Reads - 128 dacapobench: Jython dacapobench: Tradebeans embree: Pathtracer ISPC - Crown embree: Pathtracer ISPC - Asian Dragon gromacs: MPI CPU - water_GMX50_bare mysqlslap: 4096 mysqlslap: 8192 namd: ATPase Simulation - 327,506 Atoms deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream nginx: 500 nginx: 1000 numpy: openfoam: drivaerFastback, Medium Mesh Size - Mesh Time openfoam: drivaerFastback, Medium Mesh Size - Execution Time openradioss: Bumper Beam openradioss: Cell Phone Drop Test openradioss: Bird Strike on Windshield openradioss: Rubber O-Ring Seal Installation openradioss: INIVOL and Fluid Structure Interaction Drop Container openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvkl: vklBenchmark ISPC ospray: particle_volume/ao/real_time ospray: particle_volume/scivis/real_time ospray: particle_volume/pathtracer/real_time ospray: gravity_spheres_volume/dim_512/ao/real_time ospray: gravity_spheres_volume/dim_512/scivis/real_time ospray: gravity_spheres_volume/dim_512/pathtracer/real_time pgbench: 100 - 800 - Read Only pgbench: 100 - 800 - Read Only - Average Latency pgbench: 100 - 800 - Read Write pgbench: 100 - 800 - Read Write - Average Latency memtier-benchmark: Redis - 50 - 1:5 memtier-benchmark: Redis - 100 - 1:5 memtier-benchmark: Redis - 50 - 1:10 memtier-benchmark: Redis - 100 - 1:10 remhos: Sample Remap Example rocksdb: Update Rand rocksdb: Read Rand Write Rand specfem3d: Mount St. Helens specfem3d: Layered Halfspace specfem3d: Tomographic Model specfem3d: Homogeneous Halfspace specfem3d: Water-layered Halfspace sqlite: 8 sqlite: 16 tensorflow: CPU - 64 - ResNet-50 build-godot: Time To Compile build-linux-kernel: defconfig build-linux-kernel: allmodconfig build-llvm: Ninja mrbayes: Primate Phylogeny Analysis build-nodejs: Time To Compile safe RET no microcode off safe RET IBPB 334812 383039 24.695818 999645400 233069 947741.34 14.05 1342031.11 30.29 1202637.36 13.61 1408658.83 31.93 46538766.01 35.10 37720117.40 123.61 47445770.18 38.54 58073516.79 79.19 3.42 32.02 5.17 2.38 2.22 1.39 27.58 84.69 323.42 337.01 329.19 100851.4 131487.0 4191 4096 57.2956 64.3916 5.730 412 301 0.38115 37.6987 840.7721 485.0069 65.8910 468.3646 68.2595 3816.7676 8.3612 46.7239 679.5766 53.5972 596.5579 577.0589 55.3747 144020.03 140555.98 418.95 145.06069 644.36223 93.68 36.37 152.91 85.04 163.02 7.58 4124.74 27.79 1142.29 1126.64 28.38 452 18.0288 17.7305 155.165 8.96941 8.32749 13.2621 2707280 0.296 55175 14.499 2218601.79 2167181.09 2173694.77 2154339.26 17.788 428112 2872765 11.982163460 31.829261870 14.404419238 17.643680397 30.427531709 4.850 8.834 15.56 125.663 37.623 344.242 182.169 137.518 172.749 384374 385585 24.200551 1011799000 238741 960525.66 13.83 1271946.57 32.36 1176385.35 14.05 1415756.33 31.49 43665846.28 37.70 39463981.42 117.97 49501499.13 36.58 58682618.18 78.92 3.39 31.84 4.91 2.09 1.88 1.30 27.34 84.50 349.43 361.81 362.64 103635.0 135187.2 4193 3993 57.4229 64.5964 5.680 590 355 0.38130 37.6037 839.7201 487.2450 65.5911 468.8306 68.1483 3840.6307 8.3078 46.7020 678.9330 53.5968 596.5915 576.9722 55.3922 169583.15 166499.89 457.23 140.61562 633.51902 87.72 33.10 144.83 77.46 162.13 7.68 4092.91 27.83 1141.43 1126.03 28.40 453 18.0226 17.7511 157.829 8.96051 8.33174 13.1355 3128719 0.256 61604 12.988 2204628.92 2197287.30 2177211.80 2195705.51 17.375 462287 2951684 11.801238732 31.845630424 14.134265606 17.417120933 29.772535386 3.755 6.273 17.78 121.948 31.192 289.063 176.374 136.686 164.268 335595 383515 23.702889 999102100 236241 918691.45 14.73 1345598.59 30.14 1211172.13 13.36 1583717.62 27.73 44027904.89 37.53 38833415.97 120.06 50578426.54 35.82 57099408.15 82.24 3.47 31.43 5.15 2.26 2.14 1.41 27.46 84.49 318.12 337.12 337.45 99601.6 132046.0 4241 4143 57.3138 64.6742 5.706 418 301 0.38098 37.6319 840.4236 487.3677 65.5662 468.2170 68.2325 3834.2439 8.3219 46.5677 682.2071 53.5729 596.7822 576.8166 55.4028 142619.84 143271.26 422.58 144.02174 643.71316 93.90 36.40 152.27 84.48 163.97 7.60 4114.58 27.82 1142.53 1126.14 28.39 453 17.9817 17.7305 156.419 8.94049 8.33813 13.2538 2768445 0.289 54837 14.589 2145436.26 2145052.14 2172804.71 2157815.69 17.958 426947 2839085 12.010380781 31.659940885 14.188058962 17.690298650 29.590868260 5.006 8.800 15.65 125.060 37.243 338.157 181.528 138.851 173.064 371799 385487 24.251474 1005138667 220814 921701.44 14.78 1287324.35 31.74 1344749.20 11.93 1441637.63 31.11 44816394.74 36.47 38572529.56 120.53 49316970.28 36.56 57529201.42 80.78 3.71 31.49 5.74 2.31 2.41 1.64 2.60 27.73 85.63 336.69 347.92 349.50 95416.0 119163.8 4446 5305 57.2967 63.4668 5.707 274 276 0.38534 37.6818 840.8881 485.6842 65.7767 467.3150 68.3456 3824.8431 8.3411 46.6121 681.3961 53.5782 596.7902 575.7335 55.5026 137051.69 135431.46 389.92 148.25442 645.40958 113.68 40.11 160.70 99.04 171.75 7.46 4204.34 27.71 1144.98 1116.89 28.62 450 17.9663 17.6743 153.439 8.89239 8.26549 13.1709 1733827 0.461 50463 15.854 2092844.22 2126493.29 2137964.98 2148876.03 18.658 322231 2130006 11.951448577 32.009762142 14.225825227 17.695914319 30.053834808 4.793 7.934 17.45 135.962 40.085 352.178 204.080 144.829 195.493 OpenBenchmarking.org
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating safe RET no microcode off safe RET IBPB 80K 160K 240K 320K 400K SE +/- 380.69, N = 3 SE +/- 845.58, N = 3 SE +/- 605.48, N = 3 SE +/- 312.85, N = 3 383039 385585 383515 385487 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 safe RET no microcode off safe RET IBPB 200M 400M 600M 800M 1000M SE +/- 575791.94, N = 3 SE +/- 839009.73, N = 3 SE +/- 367255.40, N = 3 SE +/- 1724277.85, N = 3 999645400 1011799000 999102100 1005138667 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
Apache IoTDB OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 safe RET no microcode off safe RET IBPB 200K 400K 600K 800K 1000K SE +/- 6730.71, N = 12 SE +/- 8467.91, N = 3 SE +/- 7998.38, N = 8 SE +/- 7583.18, N = 9 947741.34 960525.66 918691.45 921701.44
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 safe RET no microcode off safe RET IBPB 4 8 12 16 20 SE +/- 0.16, N = 12 SE +/- 0.19, N = 3 SE +/- 0.21, N = 8 SE +/- 0.19, N = 9 14.05 13.83 14.73 14.78 MAX: 609.96 MAX: 596.78 MAX: 645.11 MAX: 618.06
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 safe RET no microcode off safe RET IBPB 300K 600K 900K 1200K 1500K SE +/- 1525.49, N = 3 SE +/- 7578.67, N = 3 SE +/- 9180.92, N = 3 SE +/- 14032.06, N = 4 1342031.11 1271946.57 1345598.59 1287324.35
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 safe RET no microcode off safe RET IBPB 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.24, N = 3 SE +/- 0.22, N = 3 SE +/- 0.42, N = 4 30.29 32.36 30.14 31.74 MAX: 715.01 MAX: 646.51 MAX: 641.04 MAX: 667.18
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 safe RET no microcode off safe RET IBPB 300K 600K 900K 1200K 1500K SE +/- 6553.14, N = 3 SE +/- 1566.77, N = 3 SE +/- 4253.29, N = 3 SE +/- 3166.01, N = 3 1202637.36 1176385.35 1211172.13 1344749.20
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 safe RET no microcode off safe RET IBPB 4 8 12 16 20 SE +/- 0.14, N = 3 SE +/- 0.07, N = 3 SE +/- 0.19, N = 3 SE +/- 0.13, N = 3 13.61 14.05 13.36 11.93 MAX: 854.4 MAX: 858.17 MAX: 881.3 MAX: 855.56
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 safe RET no microcode off safe RET IBPB 300K 600K 900K 1200K 1500K SE +/- 13029.07, N = 3 SE +/- 4294.81, N = 3 SE +/- 5073.96, N = 3 SE +/- 6687.04, N = 3 1408658.83 1415756.33 1583717.62 1441637.63
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 safe RET no microcode off safe RET IBPB 7 14 21 28 35 SE +/- 0.49, N = 3 SE +/- 0.29, N = 3 SE +/- 0.22, N = 3 SE +/- 0.35, N = 3 31.93 31.49 27.73 31.11 MAX: 930.97 MAX: 939.96 MAX: 938.92 MAX: 908.02
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 safe RET no microcode off safe RET IBPB 10M 20M 30M 40M 50M SE +/- 614274.26, N = 3 SE +/- 574678.74, N = 15 SE +/- 543529.82, N = 15 SE +/- 146499.20, N = 3 46538766.01 43665846.28 44027904.89 44816394.74
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 safe RET no microcode off safe RET IBPB 9 18 27 36 45 SE +/- 0.62, N = 3 SE +/- 0.55, N = 15 SE +/- 0.52, N = 15 SE +/- 0.32, N = 3 35.10 37.70 37.53 36.47 MAX: 728.37 MAX: 802.64 MAX: 755.16 MAX: 808.57
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 safe RET no microcode off safe RET IBPB 8M 16M 24M 32M 40M SE +/- 394126.89, N = 5 SE +/- 302926.36, N = 10 SE +/- 288707.73, N = 15 SE +/- 327739.29, N = 8 37720117.40 39463981.42 38833415.97 38572529.56
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 safe RET no microcode off safe RET IBPB 30 60 90 120 150 SE +/- 1.63, N = 5 SE +/- 0.95, N = 10 SE +/- 0.86, N = 15 SE +/- 1.16, N = 8 123.61 117.97 120.06 120.53 MAX: 4533.33 MAX: 4652.25 MAX: 4495.21 MAX: 4401.37
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 safe RET no microcode off safe RET IBPB 11M 22M 33M 44M 55M SE +/- 147114.88, N = 3 SE +/- 681823.31, N = 3 SE +/- 634314.77, N = 3 SE +/- 616490.96, N = 3 47445770.18 49501499.13 50578426.54 49316970.28
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 safe RET no microcode off safe RET IBPB 9 18 27 36 45 SE +/- 0.11, N = 3 SE +/- 0.61, N = 3 SE +/- 0.49, N = 3 SE +/- 0.44, N = 3 38.54 36.58 35.82 36.56 MAX: 3276.77 MAX: 2252.73 MAX: 3267.55 MAX: 2253.21
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 safe RET no microcode off safe RET IBPB 13M 26M 39M 52M 65M SE +/- 648692.91, N = 4 SE +/- 817020.04, N = 3 SE +/- 721225.08, N = 3 SE +/- 269354.94, N = 3 58073516.79 58682618.18 57099408.15 57529201.42
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 safe RET no microcode off safe RET IBPB 20 40 60 80 100 SE +/- 0.81, N = 4 SE +/- 2.14, N = 3 SE +/- 1.29, N = 3 SE +/- 0.42, N = 3 79.19 78.92 82.24 80.78 MAX: 5165.86 MAX: 1729.94 MAX: 3625.32 MAX: 2592.69
Apache Spark This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmars the Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating of test data and various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time safe RET no microcode off safe RET IBPB 0.8348 1.6696 2.5044 3.3392 4.174 SE +/- 0.04, N = 3 SE +/- 0.03, N = 15 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 3.42 3.39 3.47 3.71
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark safe RET no microcode off safe RET IBPB 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.12, N = 15 SE +/- 0.33, N = 3 SE +/- 0.20, N = 3 32.02 31.84 31.43 31.49
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Group By Test Time safe RET no microcode off safe RET IBPB 1.2915 2.583 3.8745 5.166 6.4575 SE +/- 0.08, N = 3 SE +/- 0.04, N = 15 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 5.17 4.91 5.15 5.74
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Repartition Test Time safe RET no microcode off safe RET IBPB 0.5355 1.071 1.6065 2.142 2.6775 SE +/- 0.04, N = 3 SE +/- 0.04, N = 15 SE +/- 0.04, N = 3 SE +/- 0.12, N = 3 2.38 2.09 2.26 2.31
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Inner Join Test Time safe RET no microcode off safe RET IBPB 0.5423 1.0846 1.6269 2.1692 2.7115 SE +/- 0.05, N = 3 SE +/- 0.02, N = 15 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 2.22 1.88 2.14 2.41
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time safe RET no microcode off safe RET IBPB 0.369 0.738 1.107 1.476 1.845 SE +/- 0.02, N = 3 SE +/- 0.01, N = 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 1.39 1.30 1.41 1.64
OpenBenchmarking.org Seconds, Fewer Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark Using Dataframe IBPB 0.585 1.17 1.755 2.34 2.925 SE +/- 0.08, N = 3 2.60
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only safe RET no microcode off safe RET IBPB 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 27.58 27.34 27.46 27.73
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only safe RET no microcode off safe RET IBPB 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.13, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 84.69 84.50 84.49 85.63
ClickHouse ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, First Run / Cold Cache safe RET no microcode off safe RET IBPB 80 160 240 320 400 SE +/- 3.38, N = 5 SE +/- 0.68, N = 3 SE +/- 2.94, N = 3 SE +/- 2.84, N = 3 323.42 349.43 318.12 336.69 MIN: 30.82 / MAX: 5000 MIN: 31.06 / MAX: 4285.71 MIN: 30.57 / MAX: 3333.33 MIN: 31.5 / MAX: 4000
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, Second Run safe RET no microcode off safe RET IBPB 80 160 240 320 400 SE +/- 2.16, N = 5 SE +/- 1.42, N = 3 SE +/- 4.86, N = 3 SE +/- 2.20, N = 3 337.01 361.81 337.12 347.92 MIN: 30.49 / MAX: 3529.41 MIN: 31.46 / MAX: 4000 MIN: 30.79 / MAX: 4000 MIN: 31.85 / MAX: 3750
OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, Third Run safe RET no microcode off safe RET IBPB 80 160 240 320 400 SE +/- 3.27, N = 5 SE +/- 2.21, N = 3 SE +/- 1.85, N = 3 SE +/- 2.22, N = 3 329.19 362.64 337.45 349.50 MIN: 31.32 / MAX: 2857.14 MIN: 31.5 / MAX: 4285.71 MIN: 31.46 / MAX: 4000 MIN: 31.56 / MAX: 5000
CockroachDB CockroachDB is a cloud-native, distributed SQL database for data intensive applications. This test profile uses a server-less CockroachDB configuration to test various Coackroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 50% Reads - Concurrency: 128 safe RET no microcode off safe RET IBPB 20K 40K 60K 80K 100K SE +/- 719.41, N = 15 SE +/- 275.86, N = 3 SE +/- 948.29, N = 15 SE +/- 341.24, N = 3 100851.4 103635.0 99601.6 95416.0
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 95% Reads - Concurrency: 128 safe RET no microcode off safe RET IBPB 30K 60K 90K 120K 150K SE +/- 1043.12, N = 13 SE +/- 931.05, N = 3 SE +/- 1387.70, N = 15 SE +/- 408.63, N = 3 131487.0 135187.2 132046.0 119163.8
OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans safe RET no microcode off safe RET IBPB 1100 2200 3300 4400 5500 SE +/- 44.47, N = 4 SE +/- 42.66, N = 4 SE +/- 28.11, N = 4 SE +/- 56.17, N = 4 4096 3993 4143 5305
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Crown safe RET no microcode off safe RET IBPB 13 26 39 52 65 SE +/- 0.14, N = 3 SE +/- 0.08, N = 3 SE +/- 0.15, N = 3 SE +/- 0.10, N = 3 57.30 57.42 57.31 57.30 MIN: 56.26 / MAX: 58.69 MIN: 56.59 / MAX: 58.54 MIN: 56.2 / MAX: 58.59 MIN: 56.3 / MAX: 58.61
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon safe RET no microcode off safe RET IBPB 14 28 42 56 70 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 64.39 64.60 64.67 63.47 MIN: 63.77 / MAX: 66.16 MIN: 64.05 / MAX: 66.13 MIN: 64.11 / MAX: 66.01 MIN: 62.67 / MAX: 65.74
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare safe RET no microcode off safe RET IBPB 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.006, N = 3 SE +/- 0.012, N = 3 SE +/- 0.010, N = 3 SE +/- 0.011, N = 3 5.730 5.680 5.706 5.707 1. (CXX) g++ options: -O3
MariaDB This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 11.0.1 Clients: 4096 safe RET no microcode off safe RET IBPB 130 260 390 520 650 SE +/- 2.96, N = 3 SE +/- 5.48, N = 3 SE +/- 3.51, N = 3 SE +/- 0.71, N = 3 412 590 418 274 1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl
OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 11.0.1 Clients: 8192 safe RET no microcode off safe RET IBPB 80 160 240 320 400 SE +/- 0.73, N = 3 SE +/- 3.35, N = 3 SE +/- 1.18, N = 3 SE +/- 0.62, N = 3 301 355 301 276 1. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms safe RET no microcode off safe RET IBPB 0.0867 0.1734 0.2601 0.3468 0.4335 SE +/- 0.00029, N = 3 SE +/- 0.00017, N = 3 SE +/- 0.00028, N = 3 SE +/- 0.00026, N = 3 0.38115 0.38130 0.38098 0.38534
Neural Magic DeepSparse OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 9 18 27 36 45 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 37.70 37.60 37.63 37.68
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 200 400 600 800 1000 SE +/- 0.36, N = 3 SE +/- 0.50, N = 3 SE +/- 0.34, N = 3 SE +/- 0.53, N = 3 840.77 839.72 840.42 840.89
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 110 220 330 440 550 SE +/- 0.96, N = 3 SE +/- 1.15, N = 3 SE +/- 1.01, N = 3 SE +/- 1.48, N = 3 485.01 487.25 487.37 485.68
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 15 30 45 60 75 SE +/- 0.14, N = 3 SE +/- 0.15, N = 3 SE +/- 0.16, N = 3 SE +/- 0.20, N = 3 65.89 65.59 65.57 65.78
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 100 200 300 400 500 SE +/- 0.42, N = 3 SE +/- 0.24, N = 3 SE +/- 0.31, N = 3 SE +/- 0.43, N = 3 468.36 468.83 468.22 467.32
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 15 30 45 60 75 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 68.26 68.15 68.23 68.35
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 800 1600 2400 3200 4000 SE +/- 7.72, N = 3 SE +/- 4.76, N = 3 SE +/- 12.28, N = 3 SE +/- 7.83, N = 3 3816.77 3840.63 3834.24 3824.84
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 2 4 6 8 10 SE +/- 0.0167, N = 3 SE +/- 0.0095, N = 3 SE +/- 0.0271, N = 3 SE +/- 0.0189, N = 3 8.3612 8.3078 8.3219 8.3411
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 11 22 33 44 55 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 46.72 46.70 46.57 46.61
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 150 300 450 600 750 SE +/- 1.47, N = 3 SE +/- 1.21, N = 3 SE +/- 0.96, N = 3 SE +/- 1.14, N = 3 679.58 678.93 682.21 681.40
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 12 24 36 48 60 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 53.60 53.60 53.57 53.58
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 130 260 390 520 650 SE +/- 0.26, N = 3 SE +/- 0.18, N = 3 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 596.56 596.59 596.78 596.79
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 120 240 360 480 600 SE +/- 0.39, N = 3 SE +/- 0.44, N = 3 SE +/- 0.39, N = 3 SE +/- 0.94, N = 3 577.06 576.97 576.82 575.73
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream safe RET no microcode off safe RET IBPB 12 24 36 48 60 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 55.37 55.39 55.40 55.50
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 safe RET no microcode off safe RET IBPB 40K 80K 120K 160K 200K SE +/- 284.55, N = 3 SE +/- 284.72, N = 3 SE +/- 251.96, N = 3 SE +/- 262.73, N = 3 144020.03 169583.15 142619.84 137051.69 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 safe RET no microcode off safe RET IBPB 40K 80K 120K 160K 200K SE +/- 352.89, N = 3 SE +/- 362.13, N = 3 SE +/- 314.03, N = 3 SE +/- 242.54, N = 3 140555.98 166499.89 143271.26 135431.46 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time safe RET no microcode off safe RET IBPB 30 60 90 120 150 145.06 140.62 144.02 148.25 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time safe RET no microcode off safe RET IBPB 140 280 420 560 700 644.36 633.52 643.71 645.41 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Bumper Beam safe RET no microcode off safe RET IBPB 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.33, N = 3 SE +/- 0.03, N = 3 SE +/- 0.23, N = 3 93.68 87.72 93.90 113.68
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Cell Phone Drop Test safe RET no microcode off safe RET IBPB 9 18 27 36 45 SE +/- 0.26, N = 3 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 36.37 33.10 36.40 40.11
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Bird Strike on Windshield safe RET no microcode off safe RET IBPB 40 80 120 160 200 SE +/- 0.67, N = 3 SE +/- 0.07, N = 3 SE +/- 0.73, N = 3 SE +/- 0.89, N = 3 152.91 144.83 152.27 160.70
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: Rubber O-Ring Seal Installation safe RET no microcode off safe RET IBPB 20 40 60 80 100 SE +/- 0.17, N = 3 SE +/- 0.23, N = 3 SE +/- 0.24, N = 3 SE +/- 0.34, N = 3 85.04 77.46 84.48 99.04
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2022.10.13 Model: INIVOL and Fluid Structure Interaction Drop Container safe RET no microcode off safe RET IBPB 40 80 120 160 200 SE +/- 0.39, N = 3 SE +/- 0.16, N = 3 SE +/- 0.50, N = 3 SE +/- 0.17, N = 3 163.02 162.13 163.97 171.75
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU safe RET no microcode off safe RET IBPB 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 7.58 7.68 7.60 7.46 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU safe RET no microcode off safe RET IBPB 900 1800 2700 3600 4500 SE +/- 6.87, N = 3 SE +/- 14.65, N = 3 SE +/- 10.77, N = 3 SE +/- 10.56, N = 3 4124.74 4092.91 4114.58 4204.34 MIN: 2129.26 / MAX: 5016.36 MIN: 3409.52 / MAX: 4641.43 MIN: 2087 / MAX: 5053.62 MIN: 2302.89 / MAX: 4817.72 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU safe RET no microcode off safe RET IBPB 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 27.79 27.83 27.82 27.71 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU safe RET no microcode off safe RET IBPB 200 400 600 800 1000 SE +/- 0.27, N = 3 SE +/- 0.32, N = 3 SE +/- 1.09, N = 3 SE +/- 0.42, N = 3 1142.29 1141.43 1142.53 1144.98 MIN: 985.75 / MAX: 1168.76 MIN: 998.76 / MAX: 1165.45 MIN: 999.01 / MAX: 1177.02 MIN: 502.04 / MAX: 1175.93 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16 - Device: CPU safe RET no microcode off safe RET IBPB 200 400 600 800 1000 SE +/- 0.72, N = 3 SE +/- 0.17, N = 3 SE +/- 0.13, N = 3 SE +/- 0.40, N = 3 1126.64 1126.03 1126.14 1116.89 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16 - Device: CPU safe RET no microcode off safe RET IBPB 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 28.38 28.40 28.39 28.62 MIN: 14.89 / MAX: 51.63 MIN: 14.74 / MAX: 48.66 MIN: 14.64 / MAX: 50.33 MIN: 14.91 / MAX: 49.84 1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
OpenVKL OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.3.1 Benchmark: vklBenchmark ISPC safe RET no microcode off safe RET IBPB 100 200 300 400 500 SE +/- 0.58, N = 3 SE +/- 0.58, N = 3 SE +/- 0.00, N = 3 SE +/- 0.67, N = 3 452 453 453 450 MIN: 85 / MAX: 2535 MIN: 84 / MAX: 2528 MIN: 84 / MAX: 2520 MIN: 83 / MAX: 2495
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time safe RET no microcode off safe RET IBPB 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 18.03 18.02 17.98 17.97
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time safe RET no microcode off safe RET IBPB 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 17.73 17.75 17.73 17.67
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time safe RET no microcode off safe RET IBPB 30 60 90 120 150 SE +/- 1.83, N = 3 SE +/- 0.21, N = 3 SE +/- 0.07, N = 3 SE +/- 0.43, N = 3 155.17 157.83 156.42 153.44
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time safe RET no microcode off safe RET IBPB 3 6 9 12 15 SE +/- 0.02941, N = 3 SE +/- 0.02460, N = 3 SE +/- 0.01456, N = 3 SE +/- 0.01872, N = 3 8.96941 8.96051 8.94049 8.89239
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time safe RET no microcode off safe RET IBPB 2 4 6 8 10 SE +/- 0.01659, N = 3 SE +/- 0.00864, N = 3 SE +/- 0.01059, N = 3 SE +/- 0.02088, N = 3 8.32749 8.33174 8.33813 8.26549
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time safe RET no microcode off safe RET IBPB 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.13, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 13.26 13.14 13.25 13.17
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only safe RET no microcode off safe RET IBPB 700K 1400K 2100K 2800K 3500K SE +/- 29158.10, N = 3 SE +/- 1705.16, N = 3 SE +/- 34286.68, N = 3 SE +/- 2988.66, N = 3 2707280 3128719 2768445 1733827 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency safe RET no microcode off safe RET IBPB 0.1037 0.2074 0.3111 0.4148 0.5185 SE +/- 0.003, N = 3 SE +/- 0.000, N = 3 SE +/- 0.004, N = 3 SE +/- 0.001, N = 3 0.296 0.256 0.289 0.461 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Write safe RET no microcode off safe RET IBPB 13K 26K 39K 52K 65K SE +/- 66.28, N = 3 SE +/- 418.78, N = 3 SE +/- 207.71, N = 3 SE +/- 133.40, N = 3 55175 61604 54837 50463 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency safe RET no microcode off safe RET IBPB 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 14.50 12.99 14.59 15.85 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Redis 7.0.12 + memtier_benchmark Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 safe RET no microcode off safe RET IBPB 500K 1000K 1500K 2000K 2500K SE +/- 31351.12, N = 3 SE +/- 11955.97, N = 3 SE +/- 1778.76, N = 3 SE +/- 21878.46, N = 3 2218601.79 2204628.92 2145436.26 2092844.22 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 safe RET no microcode off safe RET IBPB 500K 1000K 1500K 2000K 2500K SE +/- 17712.54, N = 3 SE +/- 14704.83, N = 3 SE +/- 4916.89, N = 3 SE +/- 942.73, N = 3 2167181.09 2197287.30 2145052.14 2126493.29 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 safe RET no microcode off safe RET IBPB 500K 1000K 1500K 2000K 2500K SE +/- 17754.58, N = 3 SE +/- 14630.02, N = 3 SE +/- 2448.62, N = 3 SE +/- 13504.65, N = 3 2173694.77 2177211.80 2172804.71 2137964.98 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 safe RET no microcode off safe RET IBPB 500K 1000K 1500K 2000K 2500K SE +/- 16754.20, N = 10 SE +/- 30210.22, N = 3 SE +/- 792.70, N = 3 SE +/- 12623.44, N = 3 2154339.26 2195705.51 2157815.69 2148876.03 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example safe RET no microcode off safe RET IBPB 5 10 15 20 25 SE +/- 0.17, N = 3 SE +/- 0.23, N = 3 SE +/- 0.19, N = 3 SE +/- 0.12, N = 14 17.79 17.38 17.96 18.66 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Update Random safe RET no microcode off safe RET IBPB 100K 200K 300K 400K 500K SE +/- 426.73, N = 3 SE +/- 893.82, N = 3 SE +/- 185.49, N = 3 SE +/- 110.81, N = 3 428112 462287 426947 322231 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read Random Write Random safe RET no microcode off safe RET IBPB 600K 1200K 1800K 2400K 3000K SE +/- 18652.89, N = 3 SE +/- 35283.44, N = 4 SE +/- 21895.20, N = 3 SE +/- 7875.65, N = 3 2872765 2951684 2839085 2130006 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Mount St. Helens safe RET no microcode off safe RET IBPB 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 11.98 11.80 12.01 11.95 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Layered Halfspace safe RET no microcode off safe RET IBPB 7 14 21 28 35 SE +/- 0.23, N = 3 SE +/- 0.35, N = 3 SE +/- 0.18, N = 3 SE +/- 0.21, N = 3 31.83 31.85 31.66 32.01 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Tomographic Model safe RET no microcode off safe RET IBPB 4 8 12 16 20 SE +/- 0.15, N = 3 SE +/- 0.09, N = 3 SE +/- 0.20, N = 3 SE +/- 0.08, N = 3 14.40 14.13 14.19 14.23 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Homogeneous Halfspace safe RET no microcode off safe RET IBPB 4 8 12 16 20 SE +/- 0.20, N = 4 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 SE +/- 0.11, N = 3 17.64 17.42 17.69 17.70 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Water-layered Halfspace safe RET no microcode off safe RET IBPB 7 14 21 28 35 SE +/- 0.25, N = 3 SE +/- 0.15, N = 3 SE +/- 0.35, N = 3 SE +/- 0.19, N = 3 30.43 29.77 29.59 30.05 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
SQLite This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database with a variable number of concurrent repetitions -- up to the maximum number of CPU threads available. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 8 safe RET no microcode off safe RET IBPB 1.1264 2.2528 3.3792 4.5056 5.632 SE +/- 0.016, N = 3 SE +/- 0.013, N = 3 SE +/- 0.036, N = 3 SE +/- 0.010, N = 3 4.850 3.755 5.006 4.793 1. (CC) gcc options: -O2 -lz -lm
OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 16 safe RET no microcode off safe RET IBPB 2 4 6 8 10 SE +/- 0.024, N = 3 SE +/- 0.020, N = 3 SE +/- 0.052, N = 3 SE +/- 0.007, N = 3 8.834 6.273 8.800 7.934 1. (CC) gcc options: -O2 -lz -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 safe RET no microcode off safe RET IBPB 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 15.56 17.78 15.65 17.45
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig safe RET no microcode off safe RET IBPB 9 18 27 36 45 SE +/- 0.35, N = 6 SE +/- 0.34, N = 5 SE +/- 0.37, N = 6 SE +/- 0.37, N = 7 37.62 31.19 37.24 40.09
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis safe RET no microcode off safe RET IBPB 30 60 90 120 150 SE +/- 0.66, N = 3 SE +/- 0.85, N = 3 SE +/- 1.05, N = 3 SE +/- 1.03, N = 3 137.52 136.69 138.85 144.83 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
safe RET no microcode Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 9 August 2023 18:46 by user phoronix.
off Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 10 August 2023 05:20 by user phoronix.
safe RET Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 10 August 2023 12:35 by user phoronix.
IBPB Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads), Motherboard: AMD DAYTONA_X (RYM1009B BIOS), Chipset: AMD Starship/Matisse, Memory: 256GB, Disk: 800GB INTEL SSDPF21Q800GB, Graphics: ASPEED, Monitor: VE228, Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04, Kernel: 6.5.0-rc5-phx-tues (x86_64), Desktop: GNOME Shell 42.5, Display Server: X Server 1.21.1.3, Vulkan: 1.3.224, Compiler: GCC 11.3.0 + LLVM 14.0.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 10 August 2023 20:55 by user phoronix.