ASUS ROG MAXIMUS Z890 HERO BIOS Flashing Method Benchmarks by Michael Larabel comparing performance of Core Ultra 9 Arrow Lake using 0806 CAP upgrade and then reflashing using the entire 0806 ZIP file to look for any difference.
HTML result view exported from: https://openbenchmarking.org/result/2410306-NE-INTELARRO88&grr&sro .
ASUS ROG MAXIMUS Z890 HERO BIOS Flashing Method Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution 0806 Upgrade Using CAP 0806 Flashed ZIP Intel Core Ultra 9 285K @ 5.10GHz (24 Cores) ASUS ROG MAXIMUS Z890 HERO (0806 BIOS) Intel Device ae7f 2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D1 1000GB Western Digital WDS100T1X0E-00AFY0 ASUS AMD Radeon RX 7900 XTX 24GB Intel Device 7f50 ASUS VP28U Realtek Device 8126 + Intel I226-V + Intel Wi-Fi 7 Ubuntu 24.10 6.11.0-9-generic (x86_64) GNOME Shell 47.0 X Server + Wayland 4.6 Mesa 24.2.3-1ubuntu1 (LLVM 19.1.0 DRM 3.58) GCC 14.2.0 ext4 3840x2160 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x110 - Thermald 2.5.8 Java Details - OpenJDK Runtime Environment (build 21.0.5-ea+8-Ubuntu-1) Python Details - Python 3.12.7 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
ASUS ROG MAXIMUS Z890 HERO BIOS Flashing Method openradioss: Chrysler Neon 1M openfoam: drivaerFastback, Medium Mesh Size - Execution Time openfoam: drivaerFastback, Medium Mesh Size - Mesh Time cp2k: H20-256 build-llvm: Ninja openradioss: INIVOL and Fluid Structure Interaction Drop Container hpcg: 104 104 104 - 60 memcached: 1:100 specfem3d: Layered Halfspace java-jmh: Throughput openradioss: Bird Strike on Windshield cp2k: Fayalite-FIST mbw: Memory Copy, Fixed Block Size - 8192 MiB gromacs: MPI CPU - water_GMX50_bare npb: SP.C build2: Time To Compile namd: STMV with 1,066,628 Atoms blender: Junkshop - CPU-Only memcached: 1:10 openvino: Face Detection FP16 - CPU openvino: Face Detection FP16 - CPU specfem3d: Water-layered Halfspace openradioss: Rubber O-Ring Seal Installation openvino: Noise Suppression Poconet-Like FP16 - CPU openvino: Noise Suppression Poconet-Like FP16 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Person Re-Identification Retail FP16 - CPU openvino: Person Re-Identification Retail FP16 - CPU john-the-ripper: HMAC-SHA512 openfoam: drivaerFastback, Small Mesh Size - Execution Time openfoam: drivaerFastback, Small Mesh Size - Mesh Time build-linux-kernel: defconfig incompact3d: input.i3d 193 Cells Per Direction litert: Inception V4 litert: Inception ResNet V2 litert: NASNet Mobile litert: Mobilenet Float litert: DeepLab V3 litert: SqueezeNet litert: Quantized COCO SSD MobileNet v1 litert: Mobilenet Quant openradioss: Cell Phone Drop Test npb: BT.C cp2k: H20-64 rawtherapee: Total Benchmark Time libxsmm: 64 libxsmm: 32 ramspeed: Average - Integer ramspeed: Copy - Integer ramspeed: Add - Integer ramspeed: Scale - Integer ramspeed: Triad - Integer stream: Copy npb: LU.C john-the-ripper: bcrypt john-the-ripper: Blowfish npb: IS.D specfem3d: Homogeneous Halfspace namd: ATPase with 327,506 Atoms amg: build-wasmer: Time To Compile specfem3d: Mount St. Helens specfem3d: Tomographic Model npb: SP.B gimp: resize npb: FT.C gimp: unsharp-mask incompact3d: input.i3d 129 Cells Per Direction gimp: auto-levels gimp: rotate npb: CG.C npb: MG.C lulesh: srsran: PDSCH Processor Benchmark, Throughput Total darktable: Boat - CPU-only darktable: Masskrug - CPU-only darktable: Server Room - CPU-only darktable: Server Rack - CPU-only stream: Add stream: Triad stream: Scale 0806 Upgrade Using CAP 0806 Flashed ZIP 609.95 1673.6319 157.1755 472.900 285.889 227.94 10.5482 6988527.14 61.355849424 109018731901.27 125.49 84.454 14563.959 1.737 17334.53 80.078 0.60345 67.50 5758869.33 861.55 6.93 58.942492291 59.32 13.26 1187.96 13.08 455.99 41.51 385.19 6.01 982.01 58355333 139.55546 19.989936 46.872 50.9777705 15425.8 13949.9 6885.78 853.769 1378.78 1158.57 1858.06 1097.80 43.18 66147.35 41.178 37.186 308.0 160.0 53040.82 53287.80 54177.41 51141.46 53859.52 81021.9 67464.98 47442 47519 1801.05 23.686568766 2.07071 563276667 23.656 20.767615078 19.159487411 24042.97 15.665 32534.49 12.976 11.1817640 11.263 11.246 14175.35 29083.60 11880.362 13645.6 2.965 1.753 1.393 0.179 61683.3 61344.1 54322.7 610.97 1663.331 157.27952 471.367 288.179 228.39 10.5422 7591245.10 61.175863110 109066111081.45 125.60 84.858 15025.692 1.739 17338.38 81.026 0.60750 68.38 5782954.82 866.23 6.90 59.068059616 59.31 13.34 1180.55 13.13 454.14 41.84 382.20 5.99 984.92 58546667 139.71747 19.780294 47.320 50.8402367 15462.2 14031.5 7048.95 867.219 1382.66 1170.69 1858.03 1087.90 42.96 66524.26 41.071 37.359 308.3 160.1 53010.17 53293.58 54149.50 51158.28 53807.97 81178.3 67447.79 47318 47277 1785.26 23.709877360 2.06908 563637100 23.235 20.598886012 18.790950919 24146.75 15.204 32463.70 13.177 11.1528794 11.461 11.338 14243.43 29068.72 11967.117 13622.0 2.991 1.742 1.396 0.179 61815.7 61746.5 54724.1 OpenBenchmarking.org
OpenRadioss Model: Chrysler Neon 1M OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M 0806 Flashed ZIP 0806 Upgrade Using CAP 130 260 390 520 650 SE +/- 0.42, N = 3 SE +/- 0.66, N = 3 610.97 609.95
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time 0806 Flashed ZIP 0806 Upgrade Using CAP 400 800 1200 1600 2000 1663.33 1673.63 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time 0806 Flashed ZIP 0806 Upgrade Using CAP 30 60 90 120 150 157.28 157.18 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
CP2K Molecular Dynamics Input: H20-256 OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 2024.3 Input: H20-256 0806 Flashed ZIP 0806 Upgrade Using CAP 100 200 300 400 500 SE +/- 2.08, N = 3 SE +/- 1.56, N = 3 471.37 472.90 1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Ninja 0806 Flashed ZIP 0806 Upgrade Using CAP 60 120 180 240 300 SE +/- 0.56, N = 3 SE +/- 0.17, N = 3 288.18 285.89
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container 0806 Flashed ZIP 0806 Upgrade Using CAP 50 100 150 200 250 SE +/- 0.23, N = 3 SE +/- 0.81, N = 3 228.39 227.94
High Performance Conjugate Gradient X Y Z: 104 104 104 - RT: 60 OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 0806 Flashed ZIP 0806 Upgrade Using CAP 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 10.54 10.55 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
Memcached Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 0806 Flashed ZIP 0806 Upgrade Using CAP 1.6M 3.2M 4.8M 6.4M 8M SE +/- 20998.77, N = 3 SE +/- 137848.99, N = 15 7591245.10 6988527.14 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
SPECFEM3D Model: Layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Layered Halfspace 0806 Flashed ZIP 0806 Upgrade Using CAP 14 28 42 56 70 SE +/- 0.44, N = 15 SE +/- 0.56, N = 3 61.18 61.36 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Java JMH Throughput OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput 0806 Flashed ZIP 0806 Upgrade Using CAP 20000M 40000M 60000M 80000M 100000M 109066111081.45 109018731901.27
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield 0806 Flashed ZIP 0806 Upgrade Using CAP 30 60 90 120 150 SE +/- 0.32, N = 3 SE +/- 0.97, N = 3 125.60 125.49
CP2K Molecular Dynamics Input: Fayalite-FIST OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 2024.3 Input: Fayalite-FIST 0806 Flashed ZIP 0806 Upgrade Using CAP 20 40 60 80 100 SE +/- 0.79, N = 3 SE +/- 1.01, N = 4 84.86 84.45 1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm
MBW Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB 0806 Flashed ZIP 0806 Upgrade Using CAP 3K 6K 9K 12K 15K SE +/- 144.86, N = 6 SE +/- 167.76, N = 3 15025.69 14563.96 1. (CC) gcc options: -O3 -march=native
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare 0806 Flashed ZIP 0806 Upgrade Using CAP 0.3913 0.7826 1.1739 1.5652 1.9565 SE +/- 0.006, N = 3 SE +/- 0.001, N = 3 1.739 1.737 1. (CXX) g++ options: -O3 -lm
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C 0806 Flashed ZIP 0806 Upgrade Using CAP 4K 8K 12K 16K 20K SE +/- 202.25, N = 3 SE +/- 76.59, N = 3 17338.38 17334.53 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.17 Time To Compile 0806 Flashed ZIP 0806 Upgrade Using CAP 20 40 60 80 100 SE +/- 0.22, N = 3 SE +/- 0.15, N = 3 81.03 80.08
NAMD Input: STMV with 1,066,628 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0 Input: STMV with 1,066,628 Atoms 0806 Flashed ZIP 0806 Upgrade Using CAP 0.1367 0.2734 0.4101 0.5468 0.6835 SE +/- 0.00221, N = 3 SE +/- 0.00097, N = 3 0.60750 0.60345
Blender Blend File: Junkshop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Junkshop - Compute: CPU-Only 0806 Flashed ZIP 0806 Upgrade Using CAP 15 30 45 60 75 SE +/- 0.59, N = 3 SE +/- 0.40, N = 3 68.38 67.50
Memcached Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 0806 Flashed ZIP 0806 Upgrade Using CAP 1.2M 2.4M 3.6M 4.8M 6M SE +/- 59674.22, N = 3 SE +/- 26981.35, N = 3 5782954.82 5758869.33 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection FP16 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 200 400 600 800 1000 SE +/- 1.41, N = 3 SE +/- 1.44, N = 3 866.23 861.55 MIN: 672.45 / MAX: 1180.6 MIN: 680.33 / MAX: 1111.67 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection FP16 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.90 6.93 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
SPECFEM3D Model: Water-layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Water-layered Halfspace 0806 Flashed ZIP 0806 Upgrade Using CAP 13 26 39 52 65 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 59.07 58.94 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation 0806 Flashed ZIP 0806 Upgrade Using CAP 13 26 39 52 65 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 59.31 59.32
OpenVINO Model: Noise Suppression Poconet-Like FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 13.34 13.26 MIN: 10.28 / MAX: 27.86 MIN: 10.12 / MAX: 26.81 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Noise Suppression Poconet-Like FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 300 600 900 1200 1500 SE +/- 1.89, N = 3 SE +/- 3.30, N = 3 1180.55 1187.96 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 13.13 13.08 MIN: 7.82 / MAX: 28.74 MIN: 7.85 / MAX: 30.49 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 100 200 300 400 500 SE +/- 0.37, N = 3 SE +/- 0.98, N = 3 454.14 455.99 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.14, N = 3 41.84 41.51 MIN: 34.86 / MAX: 70.98 MIN: 34.73 / MAX: 64.34 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 80 160 240 320 400 SE +/- 0.14, N = 3 SE +/- 1.33, N = 3 382.20 385.19 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Re-Identification Retail FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Re-Identification Retail FP16 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 5.99 6.01 MIN: 4.16 / MAX: 20.12 MIN: 4.18 / MAX: 20.45 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Re-Identification Retail FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Re-Identification Retail FP16 - Device: CPU 0806 Flashed ZIP 0806 Upgrade Using CAP 200 400 600 800 1000 SE +/- 3.25, N = 3 SE +/- 0.50, N = 3 984.92 982.01 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
John The Ripper Test: HMAC-SHA512 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 0806 Flashed ZIP 0806 Upgrade Using CAP 13M 26M 39M 52M 65M SE +/- 810047.19, N = 3 SE +/- 693099.88, N = 3 58546667 58355333 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
OpenFOAM Input: drivaerFastback, Small Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time 0806 Flashed ZIP 0806 Upgrade Using CAP 30 60 90 120 150 139.72 139.56 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
OpenFOAM Input: drivaerFastback, Small Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time 0806 Flashed ZIP 0806 Upgrade Using CAP 5 10 15 20 25 19.78 19.99 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: defconfig 0806 Flashed ZIP 0806 Upgrade Using CAP 11 22 33 44 55 SE +/- 0.58, N = 3 SE +/- 0.57, N = 4 47.32 46.87
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction 0806 Flashed ZIP 0806 Upgrade Using CAP 12 24 36 48 60 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 50.84 50.98 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
LiteRT Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Inception V4 0806 Flashed ZIP 0806 Upgrade Using CAP 3K 6K 9K 12K 15K SE +/- 23.72, N = 3 SE +/- 51.27, N = 3 15462.2 15425.8
LiteRT Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Inception ResNet V2 0806 Flashed ZIP 0806 Upgrade Using CAP 3K 6K 9K 12K 15K SE +/- 63.51, N = 3 SE +/- 55.31, N = 3 14031.5 13949.9
LiteRT Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: NASNet Mobile 0806 Flashed ZIP 0806 Upgrade Using CAP 1500 3000 4500 6000 7500 SE +/- 30.60, N = 3 SE +/- 14.53, N = 3 7048.95 6885.78
LiteRT Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Mobilenet Float 0806 Flashed ZIP 0806 Upgrade Using CAP 200 400 600 800 1000 SE +/- 3.97, N = 3 SE +/- 2.23, N = 3 867.22 853.77
LiteRT Model: DeepLab V3 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: DeepLab V3 0806 Flashed ZIP 0806 Upgrade Using CAP 300 600 900 1200 1500 SE +/- 1.42, N = 3 SE +/- 10.45, N = 3 1382.66 1378.78
LiteRT Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: SqueezeNet 0806 Flashed ZIP 0806 Upgrade Using CAP 300 600 900 1200 1500 SE +/- 1.50, N = 3 SE +/- 4.03, N = 3 1170.69 1158.57
LiteRT Model: Quantized COCO SSD MobileNet v1 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Quantized COCO SSD MobileNet v1 0806 Flashed ZIP 0806 Upgrade Using CAP 400 800 1200 1600 2000 SE +/- 9.68, N = 3 SE +/- 0.70, N = 3 1858.03 1858.06
LiteRT Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Mobilenet Quant 0806 Flashed ZIP 0806 Upgrade Using CAP 200 400 600 800 1000 SE +/- 6.84, N = 3 SE +/- 3.48, N = 3 1087.90 1097.80
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test 0806 Flashed ZIP 0806 Upgrade Using CAP 10 20 30 40 50 SE +/- 0.34, N = 3 SE +/- 0.21, N = 3 42.96 43.18
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 0806 Flashed ZIP 0806 Upgrade Using CAP 14K 28K 42K 56K 70K SE +/- 368.88, N = 3 SE +/- 142.11, N = 3 66524.26 66147.35 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
CP2K Molecular Dynamics Input: H20-64 OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 2024.3 Input: H20-64 0806 Flashed ZIP 0806 Upgrade Using CAP 9 18 27 36 45 SE +/- 0.13, N = 3 SE +/- 0.08, N = 3 41.07 41.18 1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm
RawTherapee Total Benchmark Time OpenBenchmarking.org Seconds, Fewer Is Better RawTherapee Total Benchmark Time 0806 Flashed ZIP 0806 Upgrade Using CAP 9 18 27 36 45 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 37.36 37.19 1. RawTherapee, version 5.10, command line.
libxsmm M N K: 64 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 0806 Flashed ZIP 0806 Upgrade Using CAP 70 140 210 280 350 SE +/- 0.09, N = 3 SE +/- 0.15, N = 3 308.3 308.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 0806 Flashed ZIP 0806 Upgrade Using CAP 40 80 120 160 200 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 160.1 160.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
RAMspeed SMP Type: Average - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer 0806 Flashed ZIP 0806 Upgrade Using CAP 11K 22K 33K 44K 55K SE +/- 26.51, N = 3 SE +/- 57.46, N = 3 53010.17 53040.82 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Copy - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer 0806 Flashed ZIP 0806 Upgrade Using CAP 11K 22K 33K 44K 55K SE +/- 73.00, N = 3 SE +/- 49.46, N = 3 53293.58 53287.80 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Add - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Integer 0806 Flashed ZIP 0806 Upgrade Using CAP 12K 24K 36K 48K 60K SE +/- 106.25, N = 3 SE +/- 18.91, N = 3 54149.50 54177.41 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Scale - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer 0806 Flashed ZIP 0806 Upgrade Using CAP 11K 22K 33K 44K 55K SE +/- 16.43, N = 3 SE +/- 39.39, N = 3 51158.28 51141.46 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Triad - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer 0806 Flashed ZIP 0806 Upgrade Using CAP 12K 24K 36K 48K 60K SE +/- 47.24, N = 3 SE +/- 15.30, N = 3 53807.97 53859.52 1. (CC) gcc options: -O3 -march=native
Stream Type: Copy OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Copy 0806 Flashed ZIP 0806 Upgrade Using CAP 20K 40K 60K 80K 100K SE +/- 74.07, N = 5 SE +/- 58.82, N = 5 81178.3 81021.9 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 0806 Flashed ZIP 0806 Upgrade Using CAP 14K 28K 42K 56K 70K SE +/- 437.31, N = 3 SE +/- 545.32, N = 3 67447.79 67464.98 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
John The Ripper Test: bcrypt OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: bcrypt 0806 Flashed ZIP 0806 Upgrade Using CAP 10K 20K 30K 40K 50K SE +/- 100.13, N = 3 SE +/- 11.05, N = 3 47318 47442 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish 0806 Flashed ZIP 0806 Upgrade Using CAP 10K 20K 30K 40K 50K SE +/- 59.22, N = 3 SE +/- 56.68, N = 3 47277 47519 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 0806 Flashed ZIP 0806 Upgrade Using CAP 400 800 1200 1600 2000 SE +/- 16.91, N = 3 SE +/- 18.77, N = 3 1785.26 1801.05 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
SPECFEM3D Model: Homogeneous Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Homogeneous Halfspace 0806 Flashed ZIP 0806 Upgrade Using CAP 6 12 18 24 30 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 23.71 23.69 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAMD Input: ATPase with 327,506 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0 Input: ATPase with 327,506 Atoms 0806 Flashed ZIP 0806 Upgrade Using CAP 0.4659 0.9318 1.3977 1.8636 2.3295 SE +/- 0.01792, N = 3 SE +/- 0.02376, N = 3 2.06908 2.07071
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 0806 Flashed ZIP 0806 Upgrade Using CAP 120M 240M 360M 480M 600M SE +/- 1033117.77, N = 3 SE +/- 507619.30, N = 3 563637100 563276667
Timed Wasmer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 2.3 Time To Compile 0806 Flashed ZIP 0806 Upgrade Using CAP 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.18, N = 3 23.24 23.66 1. (CC) gcc options: -m64 -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc -pie -nodefaultlibs
SPECFEM3D Model: Mount St. Helens OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Mount St. Helens 0806 Flashed ZIP 0806 Upgrade Using CAP 5 10 15 20 25 SE +/- 0.16, N = 3 SE +/- 0.04, N = 3 20.60 20.77 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
SPECFEM3D Model: Tomographic Model OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Tomographic Model 0806 Flashed ZIP 0806 Upgrade Using CAP 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.11, N = 3 18.79 19.16 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 0806 Flashed ZIP 0806 Upgrade Using CAP 5K 10K 15K 20K 25K SE +/- 53.23, N = 3 SE +/- 247.19, N = 5 24146.75 24042.97 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
GIMP Test: resize OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.38 Test: resize 0806 Flashed ZIP 0806 Upgrade Using CAP 4 8 12 16 20 SE +/- 0.10, N = 3 SE +/- 0.12, N = 3 15.20 15.67
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 0806 Flashed ZIP 0806 Upgrade Using CAP 7K 14K 21K 28K 35K SE +/- 80.59, N = 3 SE +/- 112.87, N = 3 32463.70 32534.49 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
GIMP Test: unsharp-mask OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.38 Test: unsharp-mask 0806 Flashed ZIP 0806 Upgrade Using CAP 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 13.18 12.98
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 0806 Flashed ZIP 0806 Upgrade Using CAP 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 11.15 11.18 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
GIMP Test: auto-levels OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.38 Test: auto-levels 0806 Flashed ZIP 0806 Upgrade Using CAP 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 11.46 11.26
GIMP Test: rotate OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.38 Test: rotate 0806 Flashed ZIP 0806 Upgrade Using CAP 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 11.34 11.25
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 0806 Flashed ZIP 0806 Upgrade Using CAP 3K 6K 9K 12K 15K SE +/- 41.39, N = 3 SE +/- 49.92, N = 3 14243.43 14175.35 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 0806 Flashed ZIP 0806 Upgrade Using CAP 6K 12K 18K 24K 30K SE +/- 64.67, N = 3 SE +/- 98.62, N = 3 29068.72 29083.60 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 0806 Flashed ZIP 0806 Upgrade Using CAP 3K 6K 9K 12K 15K SE +/- 29.33, N = 3 SE +/- 97.80, N = 3 11967.12 11880.36 1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
srsRAN Project Test: PDSCH Processor Benchmark, Throughput Total OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.10.1-20240325 Test: PDSCH Processor Benchmark, Throughput Total 0806 Flashed ZIP 0806 Upgrade Using CAP 3K 6K 9K 12K 15K SE +/- 125.99, N = 3 SE +/- 152.42, N = 4 13622.0 13645.6 1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -fno-trapping-math -fno-math-errno -ldl
Darktable Test: Boat - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.8.1 Test: Boat - Acceleration: CPU-only 0806 Flashed ZIP 0806 Upgrade Using CAP 0.673 1.346 2.019 2.692 3.365 SE +/- 0.006, N = 3 SE +/- 0.007, N = 3 2.991 2.965
Darktable Test: Masskrug - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.8.1 Test: Masskrug - Acceleration: CPU-only 0806 Flashed ZIP 0806 Upgrade Using CAP 0.3944 0.7888 1.1832 1.5776 1.972 SE +/- 0.005, N = 3 SE +/- 0.004, N = 3 1.742 1.753
Darktable Test: Server Room - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.8.1 Test: Server Room - Acceleration: CPU-only 0806 Flashed ZIP 0806 Upgrade Using CAP 0.3141 0.6282 0.9423 1.2564 1.5705 SE +/- 0.003, N = 3 SE +/- 0.003, N = 3 1.396 1.393
Darktable Test: Server Rack - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.8.1 Test: Server Rack - Acceleration: CPU-only 0806 Flashed ZIP 0806 Upgrade Using CAP 0.0403 0.0806 0.1209 0.1612 0.2015 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.179 0.179
Stream Type: Add OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add 0806 Flashed ZIP 0806 Upgrade Using CAP 13K 26K 39K 52K 65K SE +/- 105.27, N = 5 SE +/- 28.51, N = 5 61815.7 61683.3 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
Stream Type: Triad OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad 0806 Flashed ZIP 0806 Upgrade Using CAP 13K 26K 39K 52K 65K SE +/- 96.00, N = 5 SE +/- 17.62, N = 5 61746.5 61344.1 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
Stream Type: Scale OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale 0806 Flashed ZIP 0806 Upgrade Using CAP 12K 24K 36K 48K 60K SE +/- 80.57, N = 5 SE +/- 16.83, N = 5 54724.1 54322.7 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.5