ASUS ROG MAXIMUS Z890 HERO BIOS Flashing Method
Benchmarks by Michael Larabel comparing the performance of the Core Ultra 9 (Arrow Lake) after upgrading to BIOS 0806 using the CAP file and then after reflashing using the entire 0806 ZIP file, to look for any difference.
HTML result view exported from: https://openbenchmarking.org/result/2410306-NE-INTELARRO88&rdt&grs
Configurations: 0806 Upgrade Using CAP, 0806 Flashed ZIP
Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores)
Motherboard: ASUS ROG MAXIMUS Z890 HERO (0806 BIOS)
Chipset: Intel Device ae7f
Memory: 2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D1
Disk: 1000GB Western Digital WDS100T1X0E-00AFY0
Graphics: ASUS AMD Radeon RX 7900 XTX 24GB
Audio: Intel Device 7f50
Monitor: ASUS VP28U
Network: Realtek Device 8126 + Intel I226-V + Intel Wi-Fi 7
OS: Ubuntu 24.10
Kernel: 6.11.0-9-generic (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 24.2.3-1ubuntu1 (LLVM 19.1.0 DRM 3.58)
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 3840x2160
Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x110 - Thermald 2.5.8
Java Details - OpenJDK Runtime Environment (build 21.0.5-ea+8-Ubuntu-1)
Python Details - Python 3.12.7
Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Test | 0806 Upgrade Using CAP | 0806 Flashed ZIP
mbw: Memory Copy, Fixed Block Size - 8192 MiB | 14563.959 | 15025.692
gimp: resize | 15.665 | 15.204
litert: NASNet Mobile | 6885.78 | 7048.95
specfem3d: Tomographic Model | 19.159487411 | 18.790950919
build-wasmer: Time To Compile | 23.656 | 23.235
gimp: auto-levels | 11.263 | 11.461
litert: Mobilenet Float | 853.769 | 867.219
gimp: unsharp-mask | 12.976 | 13.177
blender: Junkshop - CPU-Only | 67.50 | 68.38
build2: Time To Compile | 80.078 | 81.026
openfoam: drivaerFastback, Small Mesh Size - Mesh Time | 19.989936 | 19.780294
litert: SqueezeNet | 1158.57 | 1170.69
build-linux-kernel: defconfig | 46.872 | 47.320
litert: Mobilenet Quant | 1097.80 | 1087.90
npb: IS.D | 1801.05 | 1785.26
darktable: Boat - CPU-only | 2.965 | 2.991
specfem3d: Mount St. Helens | 20.767615078 | 20.598886012
gimp: rotate | 11.246 | 11.338
build-llvm: Ninja | 285.889 | 288.179
openvino: Handwritten English Recognition FP16-INT8 - CPU | 41.51 | 41.84
openvino: Handwritten English Recognition FP16-INT8 - CPU | 385.19 | 382.20
stream: Scale | 54322.7 | 54724.1
lulesh: | 11880.362 | 11967.117
namd: STMV with 1,066,628 Atoms | 0.60345 | 0.60750
stream: Triad | 61344.1 | 61746.5
darktable: Masskrug - CPU-only | 1.753 | 1.742
openvino: Noise Suppression Poconet-Like FP16 - CPU | 1187.96 | 1180.55
openfoam: drivaerFastback, Medium Mesh Size - Execution Time | 1673.6319 | 1663.331
openvino: Noise Suppression Poconet-Like FP16 - CPU | 13.26 | 13.34
litert: Inception ResNet V2 | 13949.9 | 14031.5
npb: BT.C | 66147.35 | 66524.26
openvino: Face Detection FP16 - CPU | 861.55 | 866.23
openradioss: Cell Phone Drop Test | 43.18 | 42.96
john-the-ripper: Blowfish | 47519 | 47277
npb: CG.C | 14175.35 | 14243.43
cp2k: Fayalite-FIST | 84.454 | 84.858
rawtherapee: Total Benchmark Time | 37.186 | 37.359
openvino: Face Detection FP16 - CPU | 6.93 | 6.90
npb: SP.B | 24042.97 | 24146.75
memcached: 1:10 | 5758869.33 | 5782954.82
openvino: Road Segmentation ADAS FP16-INT8 - CPU | 455.99 | 454.14
openvino: Road Segmentation ADAS FP16-INT8 - CPU | 13.08 | 13.13
openvino: Person Re-Identification Retail FP16 - CPU | 6.01 | 5.99
john-the-ripper: HMAC-SHA512 | 58355333 | 58546667
cp2k: H20-256 | 472.900 | 471.367
openvino: Person Re-Identification Retail FP16 - CPU | 982.01 | 984.92
specfem3d: Layered Halfspace | 61.355849424 | 61.175863110
litert: DeepLab V3 | 1378.78 | 1382.66
incompact3d: input.i3d 193 Cells Per Direction | 50.9777705 | 50.8402367
john-the-ripper: bcrypt | 47442 | 47318
cp2k: H20-64 | 41.178 | 41.071
incompact3d: input.i3d 129 Cells Per Direction | 11.1817640 | 11.1528794
litert: Inception V4 | 15425.8 | 15462.2
npb: FT.C | 32534.49 | 32463.70
darktable: Server Room - CPU-only | 1.393 | 1.396
stream: Add | 61683.3 | 61815.7
specfem3d: Water-layered Halfspace | 58.942492291 | 59.068059616
openradioss: INIVOL and Fluid Structure Interaction Drop Container | 227.94 | 228.39
stream: Copy | 81021.9 | 81178.3
srsran: PDSCH Processor Benchmark, Throughput Total | 13645.6 | 13622.0
openradioss: Chrysler Neon 1M | 609.95 | 610.97
openfoam: drivaerFastback, Small Mesh Size - Execution Time | 139.55546 | 139.71747
gromacs: MPI CPU - water_GMX50_bare | 1.737 | 1.739
specfem3d: Homogeneous Halfspace | 23.686568766 | 23.709877360
libxsmm: 64 | 308.0 | 308.3
ramspeed: Triad - Integer | 53859.52 | 53807.97
openradioss: Bird Strike on Windshield | 125.49 | 125.60
namd: ATPase with 327,506 Atoms | 2.07071 | 2.06908
openfoam: drivaerFastback, Medium Mesh Size - Mesh Time | 157.1755 | 157.27952
amg: | 563276667 | 563637100
libxsmm: 32 | 160.0 | 160.1
ramspeed: Average - Integer | 53040.82 | 53010.17
hpcg: 104 104 104 - 60 | 10.5482 | 10.5422
ramspeed: Add - Integer | 54177.41 | 54149.50
npb: MG.C | 29083.60 | 29068.72
java-jmh: Throughput | 109018731901.27 | 109066111081.45
ramspeed: Scale - Integer | 51141.46 | 51158.28
npb: LU.C | 67464.98 | 67447.79
npb: SP.C | 17334.53 | 17338.38
openradioss: Rubber O-Ring Seal Installation | 59.32 | 59.31
ramspeed: Copy - Integer | 53287.80 | 53293.58
litert: Quantized COCO SSD MobileNet v1 | 1858.06 | 1858.03
darktable: Server Rack - CPU-only | 0.179 | 0.179
memcached: 1:100 | 6988527.14 | 7591245.10
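For a rough overall read on the summary values above, one option is to normalize each result so that a ratio above 1.0 means the ZIP-flashed run was faster, then take a geometric mean. The sketch below does this for five rows only. The more-is-better or fewer-is-better direction of each row is taken from the per-test sections, and the helper name `speedup` and the choice of rows are illustrative assumptions, not part of the exported result.

```python
import math

# (test, CAP result, ZIP result, higher_is_better); values copied from the summary,
# directions taken from the per-test "More/Fewer Is Better" labels.
rows = [
    ("mbw: Memory Copy, Fixed Block Size - 8192 MiB", 14563.959, 15025.692, True),
    ("gimp: resize", 15.665, 15.204, False),
    ("litert: NASNet Mobile", 6885.78, 7048.95, False),
    ("build-linux-kernel: defconfig", 46.872, 47.320, False),
    ("stream: Triad", 61344.1, 61746.5, True),
]


def speedup(cap, zip_, higher_is_better):
    """Ratio > 1.0 means the ZIP-flashed run was faster (hypothetical helper)."""
    return zip_ / cap if higher_is_better else cap / zip_


ratios = [speedup(cap, z, hib) for _, cap, z, hib in rows]
geomean = math.exp(sum(math.log(r) for r in ratios) / len(ratios))

for (name, *_), r in zip(rows, ratios):
    print(f"{name}: {r:.4f}")
print(f"geometric mean: {geomean:.4f}")
```

For these five rows the individual ratios fall on both sides of 1.0 and the geometric mean lands within about one percent of it.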
MBW 2018-09-08
  Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB
  MiB/s, More Is Better
  0806 Upgrade Using CAP: 14563.96 (SE +/- 167.76, N = 3)
  0806 Flashed ZIP: 15025.69 (SE +/- 144.86, N = 6)
  1. (CC) gcc options: -O3 -march=native
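One hedged way to judge whether a gap like the MBW one is more than run-to-run noise is to express it both as a percentage and as a multiple of the combined standard error. The sketch below assumes the two reported SE values are independent and can be combined in quadrature; the function name `compare` is illustrative and not part of the exported result.

```python
import math


def compare(a_mean, a_se, b_mean, b_se):
    """Return (percent change of b relative to a, difference in combined-SE units)."""
    diff = b_mean - a_mean
    pct = 100.0 * diff / a_mean
    # Assumption: independent errors, so the SEs add in quadrature.
    combined_se = math.sqrt(a_se ** 2 + b_se ** 2)
    return pct, diff / combined_se


# MBW Memory Copy, Fixed Block Size: CAP upgrade vs. ZIP reflash (MiB/s).
pct, z = compare(14563.96, 167.76, 15025.69, 144.86)
print(f"{pct:+.2f}% change, {z:.2f} combined standard errors")
```

For the MBW figures this works out to roughly a 3.2% higher result for the ZIP-flashed run, about two combined standard errors, which is suggestive at best given the small run counts (N = 3 and N = 6).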
GIMP 2.10.38
  Test: resize
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 15.67 (SE +/- 0.12, N = 3)
  0806 Flashed ZIP: 15.20 (SE +/- 0.10, N = 3)
LiteRT 2024-10-15
  Model: NASNet Mobile
  Microseconds, Fewer Is Better
  0806 Upgrade Using CAP: 6885.78 (SE +/- 14.53, N = 3)
  0806 Flashed ZIP: 7048.95 (SE +/- 30.60, N = 3)
SPECFEM3D 4.1.1
  Model: Tomographic Model
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 19.16 (SE +/- 0.11, N = 3)
  0806 Flashed ZIP: 18.79 (SE +/- 0.03, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Timed Wasmer Compilation 2.3
  Time To Compile
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 23.66 (SE +/- 0.18, N = 3)
  0806 Flashed ZIP: 23.24 (SE +/- 0.08, N = 3)
  1. (CC) gcc options: -m64 -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc -pie -nodefaultlibs
GIMP 2.10.38
  Test: auto-levels
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 11.26 (SE +/- 0.05, N = 3)
  0806 Flashed ZIP: 11.46 (SE +/- 0.06, N = 3)
LiteRT 2024-10-15
  Model: Mobilenet Float
  Microseconds, Fewer Is Better
  0806 Upgrade Using CAP: 853.77 (SE +/- 2.23, N = 3)
  0806 Flashed ZIP: 867.22 (SE +/- 3.97, N = 3)
GIMP 2.10.38
  Test: unsharp-mask
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 12.98 (SE +/- 0.03, N = 3)
  0806 Flashed ZIP: 13.18 (SE +/- 0.04, N = 3)
Blender 4.2
  Blend File: Junkshop - Compute: CPU-Only
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 67.50 (SE +/- 0.40, N = 3)
  0806 Flashed ZIP: 68.38 (SE +/- 0.59, N = 3)
Build2 0.17
  Time To Compile
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 80.08 (SE +/- 0.15, N = 3)
  0806 Flashed ZIP: 81.03 (SE +/- 0.22, N = 3)
OpenFOAM 10
  Input: drivaerFastback, Small Mesh Size - Mesh Time
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 19.99
  0806 Flashed ZIP: 19.78
  1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
LiteRT 2024-10-15
  Model: SqueezeNet
  Microseconds, Fewer Is Better
  0806 Upgrade Using CAP: 1158.57 (SE +/- 4.03, N = 3)
  0806 Flashed ZIP: 1170.69 (SE +/- 1.50, N = 3)
Timed Linux Kernel Compilation 6.8
  Build: defconfig
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 46.87 (SE +/- 0.57, N = 4)
  0806 Flashed ZIP: 47.32 (SE +/- 0.58, N = 3)
LiteRT 2024-10-15
  Model: Mobilenet Quant
  Microseconds, Fewer Is Better
  0806 Upgrade Using CAP: 1097.80 (SE +/- 3.48, N = 3)
  0806 Flashed ZIP: 1087.90 (SE +/- 6.84, N = 3)
NAS Parallel Benchmarks 3.4
  Test / Class: IS.D
  Total Mop/s, More Is Better
  0806 Upgrade Using CAP: 1801.05 (SE +/- 18.77, N = 3)
  0806 Flashed ZIP: 1785.26 (SE +/- 16.91, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.6
Darktable 4.8.1
  Test: Boat - Acceleration: CPU-only
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 2.965 (SE +/- 0.007, N = 3)
  0806 Flashed ZIP: 2.991 (SE +/- 0.006, N = 3)
SPECFEM3D 4.1.1
  Model: Mount St. Helens
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 20.77 (SE +/- 0.04, N = 3)
  0806 Flashed ZIP: 20.60 (SE +/- 0.16, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
GIMP 2.10.38
  Test: rotate
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 11.25 (SE +/- 0.03, N = 3)
  0806 Flashed ZIP: 11.34 (SE +/- 0.01, N = 3)
Timed LLVM Compilation 16.0
  Build System: Ninja
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 285.89 (SE +/- 0.17, N = 3)
  0806 Flashed ZIP: 288.18 (SE +/- 0.56, N = 3)
OpenVINO 2024.0
  Model: Handwritten English Recognition FP16-INT8 - Device: CPU
  ms, Fewer Is Better
  0806 Upgrade Using CAP: 41.51 (SE +/- 0.14, N = 3; MIN: 34.73 / MAX: 64.34)
  0806 Flashed ZIP: 41.84 (SE +/- 0.01, N = 3; MIN: 34.86 / MAX: 70.98)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO 2024.0
  Model: Handwritten English Recognition FP16-INT8 - Device: CPU
  FPS, More Is Better
  0806 Upgrade Using CAP: 385.19 (SE +/- 1.33, N = 3)
  0806 Flashed ZIP: 382.20 (SE +/- 0.14, N = 3)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Stream 2013-01-17
  Type: Scale
  MB/s, More Is Better
  0806 Upgrade Using CAP: 54322.7 (SE +/- 16.83, N = 5)
  0806 Flashed ZIP: 54724.1 (SE +/- 80.57, N = 5)
  1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
LULESH 2.0.3
  z/s, More Is Better
  0806 Upgrade Using CAP: 11880.36 (SE +/- 97.80, N = 3)
  0806 Flashed ZIP: 11967.12 (SE +/- 29.33, N = 3)
  1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
NAMD 3.0
  Input: STMV with 1,066,628 Atoms
  ns/day, More Is Better
  0806 Upgrade Using CAP: 0.60345 (SE +/- 0.00097, N = 3)
  0806 Flashed ZIP: 0.60750 (SE +/- 0.00221, N = 3)
Stream 2013-01-17
  Type: Triad
  MB/s, More Is Better
  0806 Upgrade Using CAP: 61344.1 (SE +/- 17.62, N = 5)
  0806 Flashed ZIP: 61746.5 (SE +/- 96.00, N = 5)
  1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
Darktable 4.8.1
  Test: Masskrug - Acceleration: CPU-only
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 1.753 (SE +/- 0.004, N = 3)
  0806 Flashed ZIP: 1.742 (SE +/- 0.005, N = 3)
OpenVINO 2024.0
  Model: Noise Suppression Poconet-Like FP16 - Device: CPU
  FPS, More Is Better
  0806 Upgrade Using CAP: 1187.96 (SE +/- 3.30, N = 3)
  0806 Flashed ZIP: 1180.55 (SE +/- 1.89, N = 3)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenFOAM 10
  Input: drivaerFastback, Medium Mesh Size - Execution Time
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 1673.63
  0806 Flashed ZIP: 1663.33
  1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
OpenVINO 2024.0
  Model: Noise Suppression Poconet-Like FP16 - Device: CPU
  ms, Fewer Is Better
  0806 Upgrade Using CAP: 13.26 (SE +/- 0.04, N = 3; MIN: 10.12 / MAX: 26.81)
  0806 Flashed ZIP: 13.34 (SE +/- 0.02, N = 3; MIN: 10.28 / MAX: 27.86)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
LiteRT 2024-10-15
  Model: Inception ResNet V2
  Microseconds, Fewer Is Better
  0806 Upgrade Using CAP: 13949.9 (SE +/- 55.31, N = 3)
  0806 Flashed ZIP: 14031.5 (SE +/- 63.51, N = 3)
NAS Parallel Benchmarks 3.4
  Test / Class: BT.C
  Total Mop/s, More Is Better
  0806 Upgrade Using CAP: 66147.35 (SE +/- 142.11, N = 3)
  0806 Flashed ZIP: 66524.26 (SE +/- 368.88, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.6
OpenVINO 2024.0
  Model: Face Detection FP16 - Device: CPU
  ms, Fewer Is Better
  0806 Upgrade Using CAP: 861.55 (SE +/- 1.44, N = 3; MIN: 680.33 / MAX: 1111.67)
  0806 Flashed ZIP: 866.23 (SE +/- 1.41, N = 3; MIN: 672.45 / MAX: 1180.6)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenRadioss 2023.09.15
  Model: Cell Phone Drop Test
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 43.18 (SE +/- 0.21, N = 3)
  0806 Flashed ZIP: 42.96 (SE +/- 0.34, N = 3)
John The Ripper 2023.03.14
  Test: Blowfish
  Real C/S, More Is Better
  0806 Upgrade Using CAP: 47519 (SE +/- 56.68, N = 3)
  0806 Flashed ZIP: 47277 (SE +/- 59.22, N = 3)
  1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
NAS Parallel Benchmarks 3.4
  Test / Class: CG.C
  Total Mop/s, More Is Better
  0806 Upgrade Using CAP: 14175.35 (SE +/- 49.92, N = 3)
  0806 Flashed ZIP: 14243.43 (SE +/- 41.39, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.6
CP2K Molecular Dynamics 2024.3
  Input: Fayalite-FIST
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 84.45 (SE +/- 1.01, N = 4)
  0806 Flashed ZIP: 84.86 (SE +/- 0.79, N = 3)
  1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm
RawTherapee
  Total Benchmark Time
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 37.19 (SE +/- 0.13, N = 3)
  0806 Flashed ZIP: 37.36 (SE +/- 0.16, N = 3)
  1. RawTherapee, version 5.10, command line.
OpenVINO 2024.0
  Model: Face Detection FP16 - Device: CPU
  FPS, More Is Better
  0806 Upgrade Using CAP: 6.93 (SE +/- 0.01, N = 3)
  0806 Flashed ZIP: 6.90 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
NAS Parallel Benchmarks 3.4
  Test / Class: SP.B
  Total Mop/s, More Is Better
  0806 Upgrade Using CAP: 24042.97 (SE +/- 247.19, N = 5)
  0806 Flashed ZIP: 24146.75 (SE +/- 53.23, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.6
Memcached 1.6.19
  Set To Get Ratio: 1:10
  Ops/sec, More Is Better
  0806 Upgrade Using CAP: 5758869.33 (SE +/- 26981.35, N = 3)
  0806 Flashed ZIP: 5782954.82 (SE +/- 59674.22, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenVINO 2024.0
  Model: Road Segmentation ADAS FP16-INT8 - Device: CPU
  FPS, More Is Better
  0806 Upgrade Using CAP: 455.99 (SE +/- 0.98, N = 3)
  0806 Flashed ZIP: 454.14 (SE +/- 0.37, N = 3)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO 2024.0
  Model: Road Segmentation ADAS FP16-INT8 - Device: CPU
  ms, Fewer Is Better
  0806 Upgrade Using CAP: 13.08 (SE +/- 0.03, N = 3; MIN: 7.85 / MAX: 30.49)
  0806 Flashed ZIP: 13.13 (SE +/- 0.01, N = 3; MIN: 7.82 / MAX: 28.74)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO 2024.0
  Model: Person Re-Identification Retail FP16 - Device: CPU
  ms, Fewer Is Better
  0806 Upgrade Using CAP: 6.01 (SE +/- 0.00, N = 3; MIN: 4.18 / MAX: 20.45)
  0806 Flashed ZIP: 5.99 (SE +/- 0.02, N = 3; MIN: 4.16 / MAX: 20.12)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
John The Ripper 2023.03.14
  Test: HMAC-SHA512
  Real C/S, More Is Better
  0806 Upgrade Using CAP: 58355333 (SE +/- 693099.88, N = 3)
  0806 Flashed ZIP: 58546667 (SE +/- 810047.19, N = 3)
  1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
CP2K Molecular Dynamics 2024.3
  Input: H20-256
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 472.90 (SE +/- 1.56, N = 3)
  0806 Flashed ZIP: 471.37 (SE +/- 2.08, N = 3)
  1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm
OpenVINO 2024.0
  Model: Person Re-Identification Retail FP16 - Device: CPU
  FPS, More Is Better
  0806 Upgrade Using CAP: 982.01 (SE +/- 0.50, N = 3)
  0806 Flashed ZIP: 984.92 (SE +/- 3.25, N = 3)
  1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
SPECFEM3D 4.1.1
  Model: Layered Halfspace
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 61.36 (SE +/- 0.56, N = 3)
  0806 Flashed ZIP: 61.18 (SE +/- 0.44, N = 15)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
LiteRT 2024-10-15
  Model: DeepLab V3
  Microseconds, Fewer Is Better
  0806 Upgrade Using CAP: 1378.78 (SE +/- 10.45, N = 3)
  0806 Flashed ZIP: 1382.66 (SE +/- 1.42, N = 3)
Xcompact3d Incompact3d 2021-03-11
  Input: input.i3d 193 Cells Per Direction
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 50.98 (SE +/- 0.05, N = 3)
  0806 Flashed ZIP: 50.84 (SE +/- 0.03, N = 3)
  1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
John The Ripper 2023.03.14
  Test: bcrypt
  Real C/S, More Is Better
  0806 Upgrade Using CAP: 47442 (SE +/- 11.05, N = 3)
  0806 Flashed ZIP: 47318 (SE +/- 100.13, N = 3)
  1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
CP2K Molecular Dynamics 2024.3
  Input: H20-64
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 41.18 (SE +/- 0.08, N = 3)
  0806 Flashed ZIP: 41.07 (SE +/- 0.13, N = 3)
  1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm
Xcompact3d Incompact3d 2021-03-11
  Input: input.i3d 129 Cells Per Direction
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 11.18 (SE +/- 0.00, N = 3)
  0806 Flashed ZIP: 11.15 (SE +/- 0.01, N = 3)
  1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
LiteRT 2024-10-15
  Model: Inception V4
  Microseconds, Fewer Is Better
  0806 Upgrade Using CAP: 15425.8 (SE +/- 51.27, N = 3)
  0806 Flashed ZIP: 15462.2 (SE +/- 23.72, N = 3)
NAS Parallel Benchmarks 3.4
  Test / Class: FT.C
  Total Mop/s, More Is Better
  0806 Upgrade Using CAP: 32534.49 (SE +/- 112.87, N = 3)
  0806 Flashed ZIP: 32463.70 (SE +/- 80.59, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.6
Darktable 4.8.1
  Test: Server Room - Acceleration: CPU-only
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 1.393 (SE +/- 0.003, N = 3)
  0806 Flashed ZIP: 1.396 (SE +/- 0.003, N = 3)
Stream 2013-01-17
  Type: Add
  MB/s, More Is Better
  0806 Upgrade Using CAP: 61683.3 (SE +/- 28.51, N = 5)
  0806 Flashed ZIP: 61815.7 (SE +/- 105.27, N = 5)
  1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
SPECFEM3D 4.1.1
  Model: Water-layered Halfspace
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 58.94 (SE +/- 0.18, N = 3)
  0806 Flashed ZIP: 59.07 (SE +/- 0.09, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenRadioss 2023.09.15
  Model: INIVOL and Fluid Structure Interaction Drop Container
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 227.94 (SE +/- 0.81, N = 3)
  0806 Flashed ZIP: 228.39 (SE +/- 0.23, N = 3)
Stream 2013-01-17
  Type: Copy
  MB/s, More Is Better
  0806 Upgrade Using CAP: 81021.9 (SE +/- 58.82, N = 5)
  0806 Flashed ZIP: 81178.3 (SE +/- 74.07, N = 5)
  1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
srsRAN Project 23.10.1-20240325
  Test: PDSCH Processor Benchmark, Throughput Total
  Mbps, More Is Better
  0806 Upgrade Using CAP: 13645.6 (SE +/- 152.42, N = 4)
  0806 Flashed ZIP: 13622.0 (SE +/- 125.99, N = 3)
  1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -fno-trapping-math -fno-math-errno -ldl
OpenRadioss 2023.09.15
  Model: Chrysler Neon 1M
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 609.95 (SE +/- 0.66, N = 3)
  0806 Flashed ZIP: 610.97 (SE +/- 0.42, N = 3)
OpenFOAM 10
  Input: drivaerFastback, Small Mesh Size - Execution Time
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 139.56
  0806 Flashed ZIP: 139.72
  1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
GROMACS 2024
  Implementation: MPI CPU - Input: water_GMX50_bare
  Ns Per Day, More Is Better
  0806 Upgrade Using CAP: 1.737 (SE +/- 0.001, N = 3)
  0806 Flashed ZIP: 1.739 (SE +/- 0.006, N = 3)
  1. (CXX) g++ options: -O3 -lm
SPECFEM3D 4.1.1
  Model: Homogeneous Halfspace
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 23.69 (SE +/- 0.05, N = 3)
  0806 Flashed ZIP: 23.71 (SE +/- 0.10, N = 3)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
libxsmm 2-1.17-3645
  M N K: 64
  GFLOPS/s, More Is Better
  0806 Upgrade Using CAP: 308.0 (SE +/- 0.15, N = 3)
  0806 Flashed ZIP: 308.3 (SE +/- 0.09, N = 3)
  1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
RAMspeed SMP 3.5.0
  Type: Triad - Benchmark: Integer
  MB/s, More Is Better
  0806 Upgrade Using CAP: 53859.52 (SE +/- 15.30, N = 3)
  0806 Flashed ZIP: 53807.97 (SE +/- 47.24, N = 3)
  1. (CC) gcc options: -O3 -march=native
OpenRadioss 2023.09.15
  Model: Bird Strike on Windshield
  Seconds, Fewer Is Better
  0806 Upgrade Using CAP: 125.49 (SE +/- 0.97, N = 3)
  0806 Flashed ZIP: 125.60 (SE +/- 0.32, N = 3)
NAMD 3.0 - Input: ATPase with 327,506 Atoms. ns/day, More Is Better. 0806 Upgrade Using CAP: 2.07071 (SE +/- 0.02376, N = 3); 0806 Flashed ZIP: 2.06908 (SE +/- 0.01792, N = 3).
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Mesh Time. Seconds, Fewer Is Better. 0806 Upgrade Using CAP: 157.18; 0806 Flashed ZIP: 157.28. 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
Algebraic Multi-Grid Benchmark 1.2. Figure Of Merit, More Is Better. 0806 Upgrade Using CAP: 563276667 (SE +/- 507619.30, N = 3); 0806 Flashed ZIP: 563637100 (SE +/- 1033117.77, N = 3).
libxsmm 2-1.17-3645 - M N K: 32. GFLOPS/s, More Is Better. 0806 Upgrade Using CAP: 160.0 (SE +/- 0.10, N = 3); 0806 Flashed ZIP: 160.1 (SE +/- 0.10, N = 3). 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
RAMspeed SMP 3.5.0 - Type: Average - Benchmark: Integer. MB/s, More Is Better. 0806 Upgrade Using CAP: 53040.82 (SE +/- 57.46, N = 3); 0806 Flashed ZIP: 53010.17 (SE +/- 26.51, N = 3). 1. (CC) gcc options: -O3 -march=native
High Performance Conjugate Gradient 3.1 - X Y Z: 104 104 104 - RT: 60. GFLOP/s, More Is Better. 0806 Upgrade Using CAP: 10.55 (SE +/- 0.01, N = 3); 0806 Flashed ZIP: 10.54 (SE +/- 0.00, N = 3). 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
RAMspeed SMP 3.5.0 - Type: Add - Benchmark: Integer. MB/s, More Is Better. 0806 Upgrade Using CAP: 54177.41 (SE +/- 18.91, N = 3); 0806 Flashed ZIP: 54149.50 (SE +/- 106.25, N = 3). 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks 3.4 - Test / Class: MG.C. Total Mop/s, More Is Better. 0806 Upgrade Using CAP: 29083.60 (SE +/- 98.62, N = 3); 0806 Flashed ZIP: 29068.72 (SE +/- 64.67, N = 3). 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
Java JMH Throughput. Ops/s, More Is Better. 0806 Upgrade Using CAP: 109018731901.27; 0806 Flashed ZIP: 109066111081.45.
RAMspeed SMP 3.5.0 - Type: Scale - Benchmark: Integer. MB/s, More Is Better. 0806 Upgrade Using CAP: 51141.46 (SE +/- 39.39, N = 3); 0806 Flashed ZIP: 51158.28 (SE +/- 16.43, N = 3). 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C. Total Mop/s, More Is Better. 0806 Upgrade Using CAP: 67464.98 (SE +/- 545.32, N = 3); 0806 Flashed ZIP: 67447.79 (SE +/- 437.31, N = 3). 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
NAS Parallel Benchmarks 3.4 - Test / Class: SP.C. Total Mop/s, More Is Better. 0806 Upgrade Using CAP: 17334.53 (SE +/- 76.59, N = 3); 0806 Flashed ZIP: 17338.38 (SE +/- 202.25, N = 3). 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenRadioss 2023.09.15 - Model: Rubber O-Ring Seal Installation. Seconds, Fewer Is Better. 0806 Upgrade Using CAP: 59.32 (SE +/- 0.07, N = 3); 0806 Flashed ZIP: 59.31 (SE +/- 0.08, N = 3).
RAMspeed SMP 3.5.0 - Type: Copy - Benchmark: Integer. MB/s, More Is Better. 0806 Upgrade Using CAP: 53287.80 (SE +/- 49.46, N = 3); 0806 Flashed ZIP: 53293.58 (SE +/- 73.00, N = 3). 1. (CC) gcc options: -O3 -march=native
LiteRT 2024-10-15 - Model: Quantized COCO SSD MobileNet v1. Microseconds, Fewer Is Better. 0806 Upgrade Using CAP: 1858.06 (SE +/- 0.70, N = 3); 0806 Flashed ZIP: 1858.03 (SE +/- 9.68, N = 3).
Darktable 4.8.1 - Test: Server Rack - Acceleration: CPU-only. Seconds, Fewer Is Better. 0806 Upgrade Using CAP: 0.179 (SE +/- 0.001, N = 3); 0806 Flashed ZIP: 0.179 (SE +/- 0.001, N = 3).
Memcached 1.6.19 - Set To Get Ratio: 1:100. Ops/sec, More Is Better. 0806 Upgrade Using CAP: 6988527.14 (SE +/- 137848.99, N = 15); 0806 Flashed ZIP: 7591245.10 (SE +/- 20998.77, N = 3). 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
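The Memcached result above is the one test in this group where the two flashing methods are visibly apart. As a rough illustration, the Python sketch below computes the relative difference and expresses the gap in units of the combined standard error. It assumes the first listed value belongs to the CAP upgrade and the second to the flashed ZIP (the order of the labels), and that the two runs are independent. The helper names are hypothetical, and this is a back-of-the-envelope check rather than a formal significance test.

```python
from math import sqrt

# Memcached 1.6.19, Set To Get Ratio 1:100 (Ops/sec, more is better).
# Values copied from the result line above; the mapping of values to
# configurations follows the label order and is an assumption.
cap_mean, cap_se = 6988527.14, 137848.99   # 0806 Upgrade Using CAP, N = 15
zip_mean, zip_se = 7591245.10, 20998.77    # 0806 Flashed ZIP, N = 3


def percent_difference(baseline, other):
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Gap between two means in units of their combined standard error.

    Assumption: the runs are independent, so the errors add in quadrature.
    """
    return abs(mean_b - mean_a) / sqrt(se_a ** 2 + se_b ** 2)


pct = percent_difference(cap_mean, zip_mean)
gap = gap_in_standard_errors(cap_mean, cap_se, zip_mean, zip_se)
print(f"Flashed ZIP vs. CAP upgrade: {pct:+.2f}%")
print(f"Gap in combined standard errors: {gap:.1f}")
```

The same two helpers can be applied to any other result line that reports a mean and an SE for both configurations. For lines without SE values (OpenFOAM, Java JMH) only the percent difference is available.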
Phoronix Test Suite v10.8.5