Ryzen 9 9950X Memory Performance
AMD Ryzen 9 9950X 16-Core testing with an ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS) and AMD Radeon RX 7900 GRE 16GB on Ubuntu 24.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2408144-NE-RYZEN999520&gru .
System configuration for result "2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G":
Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS)
Chipset: AMD Device 14d8
Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G
Disk: 2000GB Corsair MP700 PRO
Graphics: AMD Radeon RX 7900 GRE 16GB
Audio: AMD Navi 31 HDMI/DP
Monitor: DELL U2723QE
Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04
Kernel: 6.10.0-phx (x86_64)
Desktop: GNOME Shell 46.0
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 3840x2160
System notes:
- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance)
- CPU Microcode: 0xb40401a
- Python 3.12.3
- Security mitigations:
  gather_data_sampling: Not affected
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  mmio_stale_data: Not affected
  reg_file_data_sampling: Not affected
  retbleed: Not affected
  spec_rstack_overflow: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
  srbds: Not affected
  tsx_async_abort: Not affected
Results overview for "2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G":
pytorch: CPU - 1 - ResNet-50 = 84.89
pytorch: CPU - 256 - ResNet-50 = 57.54
minibude: OpenMP - BM1 (Billion Interactions/s) = 76.885
minibude: OpenMP - BM2 (Billion Interactions/s) = 77.385
stress-ng: Memory Copying = 10752.81
quicksilver: CORAL2 P1 = 25676667
embree: Pathtracer ISPC - Crown = 36.5532
embree: Pathtracer ISPC - Asian Dragon = 41.4914
x265: Bosphorus 4K = 38.27
x265: Bosphorus 1080p = 134.52
minibude: OpenMP - BM1 (GFInst/s) = 1922.125
minibude: OpenMP - BM2 (GFInst/s) = 1934.613
hpcg: 104 104 104 - 60 = 8.93560
tensorflow: CPU - 1 - ResNet-50 = 16.92
tensorflow: CPU - 64 - ResNet-50 = 52.27
luxcorerender: DLSC - CPU = 5.40
luxcorerender: Danish Mood - CPU = 4.62
luxcorerender: Orange Juice - CPU = 8.64
luxcorerender: LuxCore Benchmark - CPU = 5.15
luxcorerender: Rainbow Colors and Prism - CPU = 19.99
ramspeed: Add - Integer = 61079.93
ramspeed: Copy - Integer = 65797.05
ramspeed: Scale - Integer = 69141.08
ramspeed: Triad - Integer = 62428.70
ramspeed: Average - Integer = 65503.93
mbw: Memory Copy - 4096 MiB = 22774.947
mbw: Memory Copy - 8192 MiB = 22247.784
mbw: Memory Copy, Fixed Block Size - 4096 MiB = 19534.159
mbw: Memory Copy, Fixed Block Size - 8192 MiB = 19467.394
compress-7zip: Compression Rating = 195585
compress-7zip: Decompression Rating = 158133
etcpak: Multi-Threaded - ETC2 = 741.784
lczero: BLAS = 226
lczero: Eigen = 220
stockfish: Chess Benchmark = 51377778
gromacs: MPI CPU - water_GMX50_bare = 3.141
gromacs: water_GMX50_bare = 1.927
namd: ATPase with 327,506 Atoms = 3.41007
namd: STMV with 1,066,628 Atoms = 0.98084
java-jmh: Throughput = 91554717298.426
memcached: 1:10 = 5915791.34
memcached: 1:100 = 7507615.73
llama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.gguf = 8.36
llamafile: Meta-Llama-3-8B-Instruct.F16 - CPU = 4.68
npb: BT.C = 55287.37
npb: CG.C = 11144.88
npb: EP.C = 3678.46
npb: EP.D = 3823.80
npb: FT.C = 27106.18
npb: IS.D = 1505.47
npb: LU.C = 59920.96
npb: MG.C = 24236.21
npb: SP.B = 21328.28
npb: SP.C = 14528.26
brl-cad: VGR Performance Metric = 498966
lulesh: = 9945.5309
incompact3d: input.i3d 129 Cells Per Direction = 14.0905552
incompact3d: input.i3d 193 Cells Per Direction = 64.1130218
openfoam: motorBike - Mesh Time = 84.9949
openfoam: motorBike - Execution Time = 58.5695
openfoam: drivaerFastback, Small Mesh Size - Mesh Time = 22.579849
openfoam: drivaerFastback, Small Mesh Size - Execution Time = 157.14898
openfoam: drivaerFastback, Medium Mesh Size - Mesh Time = 181.56905
openfoam: drivaerFastback, Medium Mesh Size - Execution Time = 2096.5664
openradioss: Bumper Beam = 76.90
openradioss: Chrysler Neon 1M = 705.55
openradioss: Cell Phone Drop Test = 44.90
openradioss: Bird Strike on Windshield = 128.76
openradioss: Rubber O-Ring Seal Installation = 53.16
openradioss: INIVOL and Fluid Structure Interaction Drop Container = 247.06
specfem3d: Mount St. Helens = 25.690986644
specfem3d: Layered Halfspace = 73.138797084
specfem3d: Tomographic Model = 23.359092920
specfem3d: Homogeneous Halfspace = 30.677918457
specfem3d: Water-layered Halfspace = 72.197260512
build-gem5: Time To Compile = 221.139
build-linux-kernel: defconfig = 47.755
build-linux-kernel: allmodconfig = 590.495
build-llvm: Ninja = 301.629
build-nodejs: Time To Compile = 345.246
build2: Time To Compile = 76.003
y-cruncher: 1B = 16.452
y-cruncher: 500M = 7.587
povray: Trace Time = 16.101
blender: BMW27 - CPU-Only = 46.38
blender: Junkshop - CPU-Only = 62.09
blender: Barbershop - CPU-Only = 448.96
xnnpack: FP32MobileNetV2 = 1264
xnnpack: FP32MobileNetV3Large = 1516
xnnpack: FP32MobileNetV3Small = 812
xnnpack: FP16MobileNetV2 = 985
xnnpack: FP16MobileNetV3Large = 1291
xnnpack: FP16MobileNetV3Small = 770
xnnpack: QU8MobileNetV2 = 759
xnnpack: QU8MobileNetV3Large = 1118
xnnpack: QU8MobileNetV3Small = 756
PyTorch 2.2.1, Device: CPU - Batch Size: 1 - Model: ResNet-50 (batches/sec, More Is Better): 84.89, SE +/- 0.27, N = 3, MIN: 77.17 / MAX: 85.87
PyTorch 2.2.1, Device: CPU - Batch Size: 256 - Model: ResNet-50 (batches/sec, More Is Better): 57.54, SE +/- 0.34, N = 3, MIN: 53.14 / MAX: 58.34
miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM1 (Billion Interactions/s, More Is Better): 76.89, SE +/- 0.27, N = 3
miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (Billion Interactions/s, More Is Better): 77.39, SE +/- 0.03, N = 3
    1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Stress-NG 0.17.08, Test: Memory Copying (Bogo Ops/s, More Is Better): 10752.81, SE +/- 44.94, N = 3
    1. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE
Quicksilver 20230818, Input: CORAL2 P1 (Figure Of Merit, More Is Better): 25676667, SE +/- 21858.13, N = 3
    1. (CXX) g++ options: -fopenmp -O3 -march=native
Embree 4.3, Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better): 36.55, SE +/- 0.04, N = 3, MIN: 36.13 / MAX: 37.34
Embree 4.3, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better): 41.49, SE +/- 0.03, N = 3, MIN: 41.24 / MAX: 42.02
x265, Video Input: Bosphorus 4K (Frames Per Second, More Is Better): 38.27, SE +/- 0.04, N = 3
x265, Video Input: Bosphorus 1080p (Frames Per Second, More Is Better): 134.52, SE +/- 0.27, N = 3
    1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6
miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM1 (GFInst/s, More Is Better): 1922.13, SE +/- 6.84, N = 3
miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (GFInst/s, More Is Better): 1934.61, SE +/- 0.64, N = 3
    1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
High Performance Conjugate Gradient 3.1, X Y Z: 104 104 104 - RT: 60 (GFLOP/s, More Is Better): 8.93560, SE +/- 0.00142, N = 3
    1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
TensorFlow 2.16.1, Device: CPU - Batch Size: 1 - Model: ResNet-50 (images/sec, More Is Better): 16.92, SE +/- 0.03, N = 3
TensorFlow 2.16.1, Device: CPU - Batch Size: 64 - Model: ResNet-50 (images/sec, More Is Better): 52.27, SE +/- 0.02, N = 3
LuxCoreRender 2.6, Scene: DLSC - Acceleration: CPU (M samples/sec, More Is Better): 5.40, SE +/- 0.01, N = 3, MIN: 5.27 / MAX: 5.72
LuxCoreRender 2.6, Scene: Danish Mood - Acceleration: CPU (M samples/sec, More Is Better): 4.62, SE +/- 0.01, N = 3, MIN: 2.07 / MAX: 5.22
LuxCoreRender 2.6, Scene: Orange Juice - Acceleration: CPU (M samples/sec, More Is Better): 8.64, SE +/- 0.01, N = 3, MIN: 7.64 / MAX: 9.25
LuxCoreRender 2.6, Scene: LuxCore Benchmark - Acceleration: CPU (M samples/sec, More Is Better): 5.15, SE +/- 0.01, N = 3, MIN: 2.38 / MAX: 5.77
LuxCoreRender 2.6, Scene: Rainbow Colors and Prism - Acceleration: CPU (M samples/sec, More Is Better): 19.99, SE +/- 0.07, N = 3, MIN: 18.05 / MAX: 20.42
RAMspeed SMP 3.5.0, Type: Add - Benchmark: Integer (MB/s, More Is Better): 61079.93, SE +/- 426.98, N = 3
RAMspeed SMP 3.5.0, Type: Copy - Benchmark: Integer (MB/s, More Is Better): 65797.05, SE +/- 820.93, N = 3
RAMspeed SMP 3.5.0, Type: Scale - Benchmark: Integer (MB/s, More Is Better): 69141.08, SE +/- 220.54, N = 3
RAMspeed SMP 3.5.0, Type: Triad - Benchmark: Integer (MB/s, More Is Better): 62428.70, SE +/- 156.92, N = 3
RAMspeed SMP 3.5.0, Type: Average - Benchmark: Integer (MB/s, More Is Better): 65503.93, SE +/- 119.74, N = 3
    1. (CC) gcc options: -O3 -march=native
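To put the RAMspeed SMP figures in context, they can be compared against the theoretical peak bandwidth of the memory configuration. The following is a minimal sketch, not part of the original report. It assumes dual-channel operation with a 64-bit (8-byte) data path per channel at the rated 6000 MT/s, and it treats RAMspeed's MB/s as 10^6 bytes per second, which may not match the tool's own definition. The function names are hypothetical.

```python
# Assumptions (not stated in the report): dual-channel operation, an 8-byte
# data path per channel, memory running at its rated 6000 MT/s, and
# MB = 10**6 bytes for the RAMspeed figures.
TRANSFERS_PER_SEC = 6000e6
BYTES_PER_TRANSFER = 8
CHANNELS = 2


def theoretical_peak_mb_s(transfers_per_sec=TRANSFERS_PER_SEC,
                          bytes_per_transfer=BYTES_PER_TRANSFER,
                          channels=CHANNELS):
    """Peak bandwidth in MB/s: transfers/s x bytes per transfer x channels."""
    return transfers_per_sec * bytes_per_transfer * channels / 1e6


def efficiency(measured_mb_s, peak_mb_s):
    """Fraction of the theoretical peak that a measurement reaches."""
    return measured_mb_s / peak_mb_s


# RAMspeed SMP 3.5.0 integer results copied from the report (MB/s).
ramspeed_mb_s = {
    "Add": 61079.93,
    "Copy": 65797.05,
    "Scale": 69141.08,
    "Triad": 62428.70,
    "Average": 65503.93,
}

peak = theoretical_peak_mb_s()
for name, value in ramspeed_mb_s.items():
    print(f"{name:8s} {value:10.2f} MB/s  {efficiency(value, peak):.1%} of peak")
```

Under these assumptions the peak works out to 96000 MB/s, so the reported Average of 65503.93 MB/s corresponds to roughly 68% of it.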
MBW 2018-09-08, Test: Memory Copy - Array Size: 4096 MiB (MiB/s, More Is Better): 22774.95, SE +/- 72.38, N = 3
MBW 2018-09-08, Test: Memory Copy - Array Size: 8192 MiB (MiB/s, More Is Better): 22247.78, SE +/- 49.10, N = 3
MBW 2018-09-08, Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB (MiB/s, More Is Better): 19534.16, SE +/- 24.77, N = 3
MBW 2018-09-08, Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB (MiB/s, More Is Better): 19467.39, SE +/- 10.56, N = 3
    1. (CC) gcc options: -O3 -march=native
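MBW reports MiB/s while RAMspeed SMP reports MB/s, so the two sets of figures are not on the same scale as printed. A minimal sketch of the conversion follows, assuming MiB means 2^20 bytes and MB means 10^6 bytes (the report defines neither unit). The helper name is hypothetical.

```python
MIB = 2**20   # bytes per mebibyte (assumed meaning of "MiB")
MB = 10**6    # bytes per megabyte (assumed meaning of "MB")


def mib_s_to_mb_s(mib_per_sec):
    """Convert a rate in MiB/s to MB/s."""
    return mib_per_sec * MIB / MB


# MBW 2018-09-08 results copied from the report (MiB/s).
mbw_mib_s = {
    "Memory Copy - 4096 MiB": 22774.95,
    "Memory Copy - 8192 MiB": 22247.78,
    "Memory Copy, Fixed Block Size - 4096 MiB": 19534.16,
    "Memory Copy, Fixed Block Size - 8192 MiB": 19467.39,
}

for name, value in mbw_mib_s.items():
    print(f"{name}: {value:.2f} MiB/s = {mib_s_to_mb_s(value):.2f} MB/s")
```

The conversion only changes the figures by about 4.9%, so it does not account for the gap between the MBW and RAMspeed SMP results; the two tools measure different access patterns.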
7-Zip Compression, Test: Compression Rating (MIPS, More Is Better): 195585, SE +/- 322.74, N = 3
7-Zip Compression, Test: Decompression Rating (MIPS, More Is Better): 158133, SE +/- 14.15, N = 3
    1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20
Etcpak 2.0, Benchmark: Multi-Threaded - Configuration: ETC2 (Mpx/s, More Is Better): 741.78, SE +/- 1.25, N = 3
    1. (CXX) g++ options: -flto -pthread
LeelaChessZero 0.31.1, Backend: BLAS (Nodes Per Second, More Is Better): 226, SE +/- 2.65, N = 3
LeelaChessZero 0.31.1, Backend: Eigen (Nodes Per Second, More Is Better): 220, SE +/- 1.45, N = 3
    1. (CXX) g++ options: -flto -pthread
Stockfish, Chess Benchmark (Nodes Per Second, More Is Better): 51377778, SE +/- 491754.88, N = 15
    1. Stockfish 16 by the Stockfish developers (see AUTHORS file)
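The Stockfish result is the only one in this report with N = 15 rather than N = 3, presumably because its run-to-run variation was higher. One way to compare variation across tests with very different units is the relative standard error. This is a minimal sketch, assuming "SE" in the report is the standard error of the mean, so that the sample standard deviation is SE x sqrt(N). The function names are hypothetical.

```python
import math


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


def stddev_from_se(se, n):
    """Recover the sample standard deviation, assuming SE = s / sqrt(n)."""
    return se * math.sqrt(n)


# (mean, SE, N) triples copied from the report.
results = {
    "Stockfish (Nodes Per Second)": (51377778, 491754.88, 15),
    "RAMspeed SMP Copy (MB/s)": (65797.05, 820.93, 3),
    "HPCG (GFLOP/s)": (8.93560, 0.00142, 3),
}

for name, (mean, se, n) in results.items():
    print(f"{name}: SE = {relative_se_percent(mean, se):.3f}% of mean, "
          f"estimated stddev = {stddev_from_se(se, n):.5g} (N = {n})")
```

On these inputs Stockfish comes out at just under 1% relative standard error even after 15 runs, while HPCG is below 0.02% after 3.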
GROMACS 2024, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better): 3.141, SE +/- 0.001, N = 3
    1. (CXX) g++ options: -O3 -lm
GROMACS, Input: water_GMX50_bare (Ns Per Day, More Is Better): 1.927, SE +/- 0.002, N = 3
    1. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3
NAMD 3.0b6, Input: ATPase with 327,506 Atoms (ns/day, More Is Better): 3.41007, SE +/- 0.02586, N = 3
NAMD 3.0b6, Input: STMV with 1,066,628 Atoms (ns/day, More Is Better): 0.98084, SE +/- 0.00027, N = 3
Java JMH, Throughput (Ops/s, More Is Better): 91554717298.43
Memcached 1.6.19, Set To Get Ratio: 1:10 (Ops/sec, More Is Better): 5915791.34, SE +/- 15831.56, N = 3
Memcached 1.6.19, Set To Get Ratio: 1:100 (Ops/sec, More Is Better): 7507615.73, SE +/- 1150.91, N = 3
    1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Llama.cpp b3067, Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf (Tokens Per Second, More Is Better): 8.36, SE +/- 0.01, N = 3
    1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
Llamafile 0.8.6, Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU (Tokens Per Second, More Is Better): 4.68
NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better): 55287.37, SE +/- 54.79, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better): 11144.88, SE +/- 21.24, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better): 3678.46, SE +/- 25.12, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better): 3823.80, SE +/- 2.85, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better): 27106.18, SE +/- 141.73, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better): 1505.47, SE +/- 5.33, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better): 59920.96, SE +/- 150.93, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better): 24236.21, SE +/- 6.88, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better): 21328.28, SE +/- 21.50, N = 3
NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, More Is Better): 14528.26, SE +/- 18.27, N = 3
    1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
    2. Open MPI 4.1.6
BRL-CAD 7.38.2, VGR Performance Metric (VGR Performance Metric, More Is Better): 498966
    1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6
LULESH 2.0.3 (z/s, More Is Better): 9945.53, SE +/- 19.00, N = 3
    1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better): 14.09, SE +/- 0.08, N = 3
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better): 64.11, SE +/- 0.09, N = 3
    1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenFOAM 10, Input: motorBike - Mesh Time (Seconds, Fewer Is Better): 84.99
OpenFOAM 10, Input: motorBike - Execution Time (Seconds, Fewer Is Better): 58.57
OpenFOAM 10, Input: drivaerFastback, Small Mesh Size - Mesh Time (Seconds, Fewer Is Better): 22.58
OpenFOAM 10, Input: drivaerFastback, Small Mesh Size - Execution Time (Seconds, Fewer Is Better): 157.15
OpenFOAM 10, Input: drivaerFastback, Medium Mesh Size - Mesh Time (Seconds, Fewer Is Better): 181.57
OpenFOAM 10, Input: drivaerFastback, Medium Mesh Size - Execution Time (Seconds, Fewer Is Better): 2096.57
    1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenRadioss 2023.09.15, Model: Bumper Beam (Seconds, Fewer Is Better): 76.90, SE +/- 0.07, N = 3
OpenRadioss 2023.09.15, Model: Chrysler Neon 1M (Seconds, Fewer Is Better): 705.55, SE +/- 0.28, N = 3
OpenRadioss 2023.09.15, Model: Cell Phone Drop Test (Seconds, Fewer Is Better): 44.90, SE +/- 0.18, N = 3
OpenRadioss 2023.09.15, Model: Bird Strike on Windshield (Seconds, Fewer Is Better): 128.76, SE +/- 0.28, N = 3
OpenRadioss 2023.09.15, Model: Rubber O-Ring Seal Installation (Seconds, Fewer Is Better): 53.16, SE +/- 0.10, N = 3
OpenRadioss 2023.09.15, Model: INIVOL and Fluid Structure Interaction Drop Container (Seconds, Fewer Is Better): 247.06, SE +/- 0.74, N = 3
SPECFEM3D 4.1.1, Model: Mount St. Helens (Seconds, Fewer Is Better): 25.69, SE +/- 0.17, N = 3
SPECFEM3D 4.1.1, Model: Layered Halfspace (Seconds, Fewer Is Better): 73.14, SE +/- 0.31, N = 3
SPECFEM3D 4.1.1, Model: Tomographic Model (Seconds, Fewer Is Better): 23.36, SE +/- 0.16, N = 3
SPECFEM3D 4.1.1, Model: Homogeneous Halfspace (Seconds, Fewer Is Better): 30.68, SE +/- 0.28, N = 3
SPECFEM3D 4.1.1, Model: Water-layered Halfspace (Seconds, Fewer Is Better): 72.20, SE +/- 0.34, N = 3
    1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Timed Gem5 Compilation 23.0.1, Time To Compile (Seconds, Fewer Is Better): 221.14, SE +/- 0.11, N = 3
Timed Linux Kernel Compilation 6.8, Build: defconfig (Seconds, Fewer Is Better): 47.76, SE +/- 0.30, N = 3
Timed Linux Kernel Compilation 6.8, Build: allmodconfig (Seconds, Fewer Is Better): 590.50, SE +/- 0.15, N = 3
Timed LLVM Compilation 16.0, Build System: Ninja (Seconds, Fewer Is Better): 301.63, SE +/- 0.12, N = 3
Timed Node.js Compilation 21.7.2, Time To Compile (Seconds, Fewer Is Better): 345.25, SE +/- 0.34, N = 3
Build2 0.17, Time To Compile (Seconds, Fewer Is Better): 76.00, SE +/- 0.16, N = 3
Y-Cruncher 0.8.5, Pi Digits To Calculate: 1B (Seconds, Fewer Is Better): 16.45, SE +/- 0.04, N = 3
Y-Cruncher 0.8.5, Pi Digits To Calculate: 500M (Seconds, Fewer Is Better): 7.587, SE +/- 0.006, N = 3
POV-Ray, Trace Time (Seconds, Fewer Is Better): 16.10, SE +/- 0.03, N = 3
    1. POV-Ray 3.7.0.10.unofficial
Blender 4.2, Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better): 46.38, SE +/- 0.01, N = 3
Blender 4.2, Blend File: Junkshop - Compute: CPU-Only (Seconds, Fewer Is Better): 62.09, SE +/- 0.12, N = 3
Blender 4.2, Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better): 448.96, SE +/- 0.20, N = 3
XNNPACK 2cd86b, Model: FP32MobileNetV2 (us, Fewer Is Better): 1264, SE +/- 5.21, N = 3
XNNPACK 2cd86b, Model: FP32MobileNetV3Large (us, Fewer Is Better): 1516, SE +/- 19.19, N = 3
XNNPACK 2cd86b, Model: FP32MobileNetV3Small (us, Fewer Is Better): 812, SE +/- 2.31, N = 3
XNNPACK 2cd86b, Model: FP16MobileNetV2 (us, Fewer Is Better): 985, SE +/- 2.31, N = 3
XNNPACK 2cd86b, Model: FP16MobileNetV3Large (us, Fewer Is Better): 1291, SE +/- 2.89, N = 3
XNNPACK 2cd86b, Model: FP16MobileNetV3Small (us, Fewer Is Better): 770, SE +/- 1.76, N = 3
XNNPACK 2cd86b, Model: QU8MobileNetV2 (us, Fewer Is Better): 759, SE +/- 0.33, N = 3
XNNPACK 2cd86b, Model: QU8MobileNetV3Large (us, Fewer Is Better): 1118, SE +/- 1.20, N = 3
XNNPACK 2cd86b, Model: QU8MobileNetV3Small (us, Fewer Is Better): 756
    1. (CXX) g++ options: -O3 -lrt -lm
Phoronix Test Suite v10.8.5