AMD Ryzen 9 7900 DDR5-5200 ECC memory benchmarks by Michael Larabel for a future article
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2405045-NE-AMDRYZEN948 AMD Ryzen 9 7900 DDR5-5200 ECC Memory - Phoronix Test Suite AMD Ryzen 9 7900 DDR5-5200 ECC Memory AMD Ryzen 9 7900 DDR5-5200 ECC memory benchmarks by Michael Larabel for a future article
HTML result view exported from: https://openbenchmarking.org/result/2405045-NE-AMDRYZEN948&grs&rdt .
AMD Ryzen 9 7900 DDR5-5200 ECC Memory Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Compiler File-System Screen Resolution 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 AMD Ryzen 9 7900 12-Core @ 5.48GHz (12 Cores / 24 Threads) ASRockRack 1U4LW-B650/2L2T B650D4U-2L2T/BCM (2.09 BIOS) AMD Device 14d8 2 x 32GB DDR5-4800MT/s Micron MTC20C2085S1EC48BA1 1024GB SOLIDIGM SSDPFKNU010TZ ASPEED AMD Rembrandt Radeon HD Audio VA2431 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA Ubuntu 24.04 6.8.0-31-generic (x86_64) GNOME Shell 46.0 X Server GCC 13.2.0 ext4 1920x1200 2 x 32GB DDR5-5200MT/s Kingston OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203 Python Details - Python 3.12.3 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
AMD Ryzen 9 7900 DDR5-5200 ECC Memory mbw: Memory Copy, Fixed Block Size - 4096 MiB mbw: Memory Copy, Fixed Block Size - 8192 MiB llama-cpp: llama-2-7b.Q4_0.gguf openfoam: drivaerFastback, Small Mesh Size - Mesh Time mbw: Memory Copy, Fixed Block Size - 1024 MiB npb: CG.C npb: SP.B libxsmm: 32 npb: FT.C incompact3d: input.i3d 129 Cells Per Direction npb: IS.D ramspeed: Add - Integer onednn: IP Shapes 3D - CPU mbw: Memory Copy - 8192 MiB npb: BT.C npb: SP.C onednn: Convolution Batch Shapes Auto - CPU openfoam: drivaerFastback, Small Mesh Size - Execution Time openradioss: Chrysler Neon 1M incompact3d: input.i3d 193 Cells Per Direction ramspeed: Scale - Integer llama-cpp: llama-2-13b.Q4_0.gguf tensorflow: CPU - 1 - AlexNet llamafile: llava-v1.5-7b-q4 - CPU ramspeed: Copy - Integer ramspeed: Average - Integer llamafile: mistral-7b-instruct-v0.2.Q8_0 - CPU npb: MG.C llamafile: wizardcoder-python-34b-v1.0.Q6_K - CPU specfem3d: Tomographic Model encode-wavpack: WAV To WavPack ramspeed: Triad - Integer llama-cpp: llama-2-70b-chat.Q5_0.gguf avifenc: 6 mbw: Memory Copy - 1024 MiB specfem3d: Water-layered Halfspace rnnoise: 26 Minute Long Talking Sample openvino: Road Segmentation ADAS FP16 - CPU jpegxl-decode: 1 openvino: Road Segmentation ADAS FP16 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Vehicle Detection FP16 - CPU mbw: Memory Copy - 4096 MiB tensorflow: CPU - 64 - ResNet-50 jpegxl-decode: All ffmpeg: libx265 - Live ospray: gravity_spheres_volume/dim_512/ao/real_time tensorflow: CPU - 64 - GoogLeNet openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU gromacs: MPI CPU - water_GMX50_bare compress-7zip: Compression Rating pgbench: 100 - 1000 - Read Only draco: Church Facade specfem3d: Layered Halfspace pgbench: 100 - 1000 - Read Only - Average Latency jpegxl: PNG - 90 svt-av1: Preset 12 - Bosphorus 4K embree: Pathtracer ISPC - Asian Dragon openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU embree: Pathtracer ISPC - Asian Dragon Obj graphics-magick: Rotate build-linux-kernel: defconfig openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU rocksdb: Read While Writing graphics-magick: Noise-Gaussian tensorflow: CPU - 64 - AlexNet openradioss: Bumper Beam specfem3d: Homogeneous Halfspace openradioss: Bird Strike on Windshield svt-av1: Preset 13 - Bosphorus 4K build-php: Time To Compile openvino: Person Detection FP32 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Person Detection FP32 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU draco: Lion ffmpeg: libx265 - Video On Demand openradioss: INIVOL and Fluid Structure Interaction Drop Container stockfish: Chess Benchmark graphics-magick: HWB Color Space pgbench: 100 - 1000 - Read Write pgbench: 100 - 1000 - Read Write - Average Latency build-linux-kernel: allmodconfig namd: STMV with 1,066,628 Atoms brl-cad: VGR Performance Metric tensorflow: CPU - 1 - GoogLeNet ffmpeg: libx265 - Platform blender: Fishy Cat - CPU-Only ffmpeg: libx265 - Upload avifenc: 10, Lossless build-nodejs: Time To Compile graphics-magick: Swirl blender: Pabellon Barcelona - CPU-Only blender: Junkshop - CPU-Only quicksilver: CORAL2 P1 build-godot: Time To Compile openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU blender: Classroom - CPU-Only build-mesa: Time To Compile avifenc: 2 srsran: PDSCH Processor Benchmark, Throughput Total npb: LU.C onednn: Deconvolution Batch shapes_3d - CPU compress-7zip: Decompression Rating tensorflow: CPU - 1 - ResNet-50 pytorch: CPU - 1 - ResNet-50 namd: ATPase with 327,506 Atoms openradioss: Cell Phone Drop Test nginx: 1000 quicksilver: CORAL2 P2 blender: Barbershop - CPU-Only srsran: PUSCH Processor Benchmark, Throughput Total openradioss: Rubber O-Ring Seal Installation specfem3d: Mount St. Helens v-ray: CPU graphics-magick: Resizing 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 16019.16 16405.968 13.67 28.682586 16190.133 9531.48 18304.39 107.1 22528.92 16.6699677 1292.66 54648.29 3.72571 17709.526 36869.25 11331.51 6.33617 196.382 913.48 79.4299469 63562.29 7.3 14.48 11.44 62972.55 59338.22 7.21 22577.88 2.02 36.587308722 4.409 55555.51 1.22 4.457 17705.756 100.601281906 7.54 21.27 85.282 281.64 9.26 645.63 17823.972 35.44 201.87 175.30 5.49836 112.13 80.58 74.4 2.151 139035 721872 5022 102.828797179 1.385 41.179 145.78 25.2658 5.48 1090.85 21.5768 165 76.578 60.53 99.05 3174105 91 263.8 105.47 47.018135857 182.19 146.707 44.612 74.91 345.8 80.04 17.32 3903 68.27 338.34 30072838 259 33933 29.47 979.122 0.62007 280941 54.41 68.30 94.72 33.66 4.145 515.333 187 233.85 99.78 18550000 240.329 322.63 18.57 193.27 22.061 43.071 11196.8 41111.39 3.43803 126457 14.68 58.75 2.26790 65.18 114755.2 22580000 676.26 1559.3 83.39 37.199417692 31844 289 17776.419 17799.501 14.69 26.766308 17220.318 10133.13 19269.37 111.7 23481.95 15.9998989 1345.7 56856.03 3.58111 18404.785 38315.1 11771.22 6.10032 189.28942 880.51 76.5668259 65913.62 7.57 15 11.85 65210.1 61195.82 7.43 23251.34 2.08 35.537095139 4.287 57010.05 1.25 4.354 18105.09 98.4589975 7.382 20.83 87.054 287.49 9.08 658.28 18169.531 36.12 205.306 178.26 5.58648 113.89 79.41 75.48 2.182 141015 732078 4952 101.40844005 1.366 41.745 147.738 25.6014 5.41 1104.95 21.8505 167 75.672 59.82 100.21 3209986 92 266.66 104.36 46.53244121 180.32 148.201 44.181 75.64 349.13 79.28 17.16 3868 68.86 335.53 30311842 261 34193 29.246 971.785 0.62459 282841 54.77 68.71 95.28 33.85 4.122 512.546 188 235.05 100.26 18480000 239.446 323.77 18.51 193.87 21.995 42.943 11166.1 41210.06 3.43041 126189 14.71 58.87 2.27071 65.25 114660.49 22590000 676.53 1558.7 83.36 37.202661409 31845 289 OpenBenchmarking.org
MBW Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4K 8K 12K 16K 20K 16019.16 17776.42 1. (CC) gcc options: -O3 -march=native
MBW Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4K 8K 12K 16K 20K 16405.97 17799.50 1. (CC) gcc options: -O3 -march=native
Llama.cpp Model: llama-2-7b.Q4_0.gguf OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b1808 Model: llama-2-7b.Q4_0.gguf 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4 8 12 16 20 13.67 14.69 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
OpenFOAM Input: drivaerFastback, Small Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 7 14 21 28 35 28.68 26.77 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
MBW Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4K 8K 12K 16K 20K 16190.13 17220.32 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 2K 4K 6K 8K 10K 9531.48 10133.13 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4K 8K 12K 16K 20K 18304.39 19269.37 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 30 60 90 120 150 107.1 111.7 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 5K 10K 15K 20K 25K 22528.92 23481.95 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4 8 12 16 20 16.67 16.00 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 300 600 900 1200 1500 1292.66 1345.70 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
RAMspeed SMP Type: Add - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Integer 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 12K 24K 36K 48K 60K 54648.29 56856.03 1. (CC) gcc options: -O3 -march=native
oneDNN Harness: IP Shapes 3D - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: IP Shapes 3D - Engine: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.8383 1.6766 2.5149 3.3532 4.1915 3.72571 3.58111 MIN: 3.69 MIN: 3.54 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
MBW Test: Memory Copy - Array Size: 8192 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 8192 MiB 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4K 8K 12K 16K 20K 17709.53 18404.79 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 8K 16K 24K 32K 40K 36869.25 38315.10 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 3K 6K 9K 12K 15K 11331.51 11771.22 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
oneDNN Harness: Convolution Batch Shapes Auto - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Convolution Batch Shapes Auto - Engine: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 2 4 6 8 10 6.33617 6.10032 MIN: 6.24 MIN: 5.99 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenFOAM Input: drivaerFastback, Small Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 40 80 120 160 200 196.38 189.29 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenRadioss Model: Chrysler Neon 1M OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 200 400 600 800 1000 913.48 880.51
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 79.43 76.57 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
RAMspeed SMP Type: Scale - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 14K 28K 42K 56K 70K 63562.29 65913.62 1. (CC) gcc options: -O3 -march=native
Llama.cpp Model: llama-2-13b.Q4_0.gguf OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b1808 Model: llama-2-13b.Q4_0.gguf 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 2 4 6 8 10 7.30 7.57 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
TensorFlow Device: CPU - Batch Size: 1 - Model: AlexNet OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: AlexNet 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4 8 12 16 20 14.48 15.00
Llamafile Test: llava-v1.5-7b-q4 - Acceleration: CPU OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.7 Test: llava-v1.5-7b-q4 - Acceleration: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 3 6 9 12 15 11.44 11.85
RAMspeed SMP Type: Copy - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 14K 28K 42K 56K 70K 62972.55 65210.10 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Average - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 13K 26K 39K 52K 65K 59338.22 61195.82 1. (CC) gcc options: -O3 -march=native
Llamafile Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.7 Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 2 4 6 8 10 7.21 7.43
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 5K 10K 15K 20K 25K 22577.88 23251.34 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
Llamafile Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.7 Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.468 0.936 1.404 1.872 2.34 2.02 2.08
SPECFEM3D Model: Tomographic Model OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Tomographic Model 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 8 16 24 32 40 36.59 35.54 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.7 WAV To WavPack 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.992 1.984 2.976 3.968 4.96 4.409 4.287
RAMspeed SMP Type: Triad - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 12K 24K 36K 48K 60K 55555.51 57010.05 1. (CC) gcc options: -O3 -march=native
Llama.cpp Model: llama-2-70b-chat.Q5_0.gguf OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b1808 Model: llama-2-70b-chat.Q5_0.gguf 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.2813 0.5626 0.8439 1.1252 1.4065 1.22 1.25 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 1.0028 2.0056 3.0084 4.0112 5.014 4.457 4.354 1. (CXX) g++ options: -O3 -fPIC -lm
MBW Test: Memory Copy - Array Size: 1024 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 1024 MiB 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4K 8K 12K 16K 20K 17705.76 18105.09 1. (CC) gcc options: -O3 -march=native
SPECFEM3D Model: Water-layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Water-layered Halfspace 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 100.60 98.46 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
RNNoise Input: 26 Minute Long Talking Sample OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 0.2 Input: 26 Minute Long Talking Sample 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 2 4 6 8 10 7.540 7.382 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 5 10 15 20 25 21.27 20.83 MIN: 9.77 / MAX: 37.47 MIN: 12.6 / MAX: 29.81 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
JPEG-XL Decoding libjxl CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG-XL Decoding libjxl 0.10.1 CPU Threads: 1 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 85.28 87.05
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 60 120 180 240 300 281.64 287.49 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 3 6 9 12 15 9.26 9.08 MIN: 4.6 / MAX: 15.49 MIN: 4.23 / MAX: 17.05 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 140 280 420 560 700 645.63 658.28 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
MBW Test: Memory Copy - Array Size: 4096 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 4096 MiB 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4K 8K 12K 16K 20K 17823.97 18169.53 1. (CC) gcc options: -O3 -march=native
TensorFlow Device: CPU - Batch Size: 64 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 8 16 24 32 40 35.44 36.12
JPEG-XL Decoding libjxl CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG-XL Decoding libjxl 0.10.1 CPU Threads: All 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 50 100 150 200 250 201.87 205.31
FFmpeg Encoder: libx265 - Scenario: Live OpenBenchmarking.org FPS, More Is Better FFmpeg 7.0 Encoder: libx265 - Scenario: Live 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 40 80 120 160 200 175.30 178.26 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OSPRay Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/ao/real_time 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 1.257 2.514 3.771 5.028 6.285 5.49836 5.58648
TensorFlow Device: CPU - Batch Size: 64 - Model: GoogLeNet OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: GoogLeNet 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 30 60 90 120 150 112.13 113.89
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Detection FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 80.58 79.41 MIN: 60.22 / MAX: 102.9 MIN: 56.3 / MAX: 102.74 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Detection FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 74.40 75.48 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.491 0.982 1.473 1.964 2.455 2.151 2.182 1. (CXX) g++ options: -O3 -lm
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 30K 60K 90K 120K 150K 139035 141015 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 160K 320K 480K 640K 800K 721872 732078 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Google Draco Model: Church Facade OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.6 Model: Church Facade 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 1100 2200 3300 4400 5500 5022 4952 1. (CXX) g++ options: -O3
SPECFEM3D Model: Layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Layered Halfspace 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 102.83 101.41 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.3116 0.6232 0.9348 1.2464 1.558 1.385 1.366 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
JPEG-XL libjxl Input: PNG - Quality: 90 OpenBenchmarking.org MP/s, More Is Better JPEG-XL libjxl 0.10.1 Input: PNG - Quality: 90 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 10 20 30 40 50 41.18 41.75 1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 12 - Input: Bosphorus 4K 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 30 60 90 120 150 145.78 147.74 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 6 12 18 24 30 25.27 25.60 MIN: 25.12 / MAX: 25.79 MIN: 25.46 / MAX: 25.94
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Vehicle Bike Detection FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 1.233 2.466 3.699 4.932 6.165 5.48 5.41 MIN: 3.89 / MAX: 9.6 MIN: 3.92 / MAX: 8.18 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Vehicle Bike Detection FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 200 400 600 800 1000 1090.85 1104.95 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon Obj 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 5 10 15 20 25 21.58 21.85 MIN: 21.45 / MAX: 21.94 MIN: 21.73 / MAX: 22.2
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Rotate 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 40 80 120 160 200 165 167 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: defconfig 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 76.58 75.67
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 14 28 42 56 70 60.53 59.82 MIN: 33.29 / MAX: 82.21 MIN: 48.42 / MAX: 67.39 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 99.05 100.21 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Read While Writing 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 700K 1400K 2100K 2800K 3500K 3174105 3209986 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Noise-Gaussian 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 91 92 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
TensorFlow Device: CPU - Batch Size: 64 - Model: AlexNet OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: AlexNet 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 60 120 180 240 300 263.80 266.66
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 105.47 104.36
SPECFEM3D Model: Homogeneous Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Homogeneous Halfspace 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 11 22 33 44 55 47.02 46.53 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 40 80 120 160 200 182.19 180.32
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 13 - Input: Bosphorus 4K 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 30 60 90 120 150 146.71 148.20 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 8.3.4 Time To Compile 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 10 20 30 40 50 44.61 44.18
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Person Detection FP32 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 74.91 75.64 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 80 160 240 320 400 345.80 349.13 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Person Detection FP32 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 80.04 79.28 MIN: 61.76 / MAX: 107.58 MIN: 60.22 / MAX: 100.15 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4 8 12 16 20 17.32 17.16 MIN: 8.73 / MAX: 23.67 MIN: 13.76 / MAX: 23 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Google Draco Model: Lion OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.6 Model: Lion 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 800 1600 2400 3200 4000 3903 3868 1. (CXX) g++ options: -O3
FFmpeg Encoder: libx265 - Scenario: Video On Demand OpenBenchmarking.org FPS, More Is Better FFmpeg 7.0 Encoder: libx265 - Scenario: Video On Demand 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 15 30 45 60 75 68.27 68.86 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 70 140 210 280 350 338.34 335.53
Stockfish Chess Benchmark OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 16.1 Chess Benchmark 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 6M 12M 18M 24M 30M 30072838 30311842 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: HWB Color Space 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 60 120 180 240 300 259 261 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 7K 14K 21K 28K 35K 33933 34193 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 7 14 21 28 35 29.47 29.25 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: allmodconfig 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 200 400 600 800 1000 979.12 971.79
NAMD Input: STMV with 1,066,628 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: STMV with 1,066,628 Atoms 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.1405 0.281 0.4215 0.562 0.7025 0.62007 0.62459
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.38.2 VGR Performance Metric 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 60K 120K 180K 240K 300K 280941 282841 1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6
TensorFlow Device: CPU - Batch Size: 1 - Model: GoogLeNet OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: GoogLeNet 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 12 24 36 48 60 54.41 54.77
FFmpeg Encoder: libx265 - Scenario: Platform OpenBenchmarking.org FPS, More Is Better FFmpeg 7.0 Encoder: libx265 - Scenario: Platform 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 15 30 45 60 75 68.30 68.71 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Fishy Cat - Compute: CPU-Only 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 94.72 95.28
FFmpeg Encoder: libx265 - Scenario: Upload OpenBenchmarking.org FPS, More Is Better FFmpeg 7.0 Encoder: libx265 - Scenario: Upload 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 8 16 24 32 40 33.66 33.85 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 10, Lossless 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.9326 1.8652 2.7978 3.7304 4.663 4.145 4.122 1. (CXX) g++ options: -O3 -fPIC -lm
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 21.7.2 Time To Compile 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 110 220 330 440 550 515.33 512.55
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Swirl 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 40 80 120 160 200 187 188 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Pabellon Barcelona - Compute: CPU-Only 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 50 100 150 200 250 233.85 235.05
Blender Blend File: Junkshop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Junkshop - Compute: CPU-Only 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 99.78 100.26
Quicksilver Input: CORAL2 P1 OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P1 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4M 8M 12M 16M 20M 18550000 18480000 1. (CXX) g++ options: -fopenmp -O3 -march=native
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 4.0 Time To Compile 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 50 100 150 200 250 240.33 239.45
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection FP16-INT8 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 70 140 210 280 350 322.63 323.77 MIN: 296.85 / MAX: 328.51 MIN: 308.71 / MAX: 330.44 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection FP16-INT8 - Device: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 5 10 15 20 25 18.57 18.51 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Classroom - Compute: CPU-Only 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 40 80 120 160 200 193.27 193.87
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 24.0 Time To Compile 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 5 10 15 20 25 22.06 22.00
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 10 20 30 40 50 43.07 42.94 1. (CXX) g++ options: -O3 -fPIC -lm
srsRAN Project Test: PDSCH Processor Benchmark, Throughput Total OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.10.1-20240325 Test: PDSCH Processor Benchmark, Throughput Total 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 2K 4K 6K 8K 10K 11196.8 11166.1 1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 9K 18K 27K 36K 45K 41111.39 41210.06 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
oneDNN Harness: Deconvolution Batch shapes_3d - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.4 Harness: Deconvolution Batch shapes_3d - Engine: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.7736 1.5472 2.3208 3.0944 3.868 3.43803 3.43041 MIN: 3.21 MIN: 3.21 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 30K 60K 90K 120K 150K 126457 126189 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
TensorFlow Device: CPU - Batch Size: 1 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 4 8 12 16 20 14.68 14.71
PyTorch Device: CPU - Batch Size: 1 - Model: ResNet-50 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 13 26 39 52 65 58.75 58.87 MIN: 53.89 / MAX: 59.52 MIN: 54.65 / MAX: 59.36
NAMD Input: ATPase with 327,506 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: ATPase with 327,506 Atoms 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 0.5109 1.0218 1.5327 2.0436 2.5545 2.26790 2.27071
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 15 30 45 60 75 65.18 65.25
nginx Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20K 40K 60K 80K 100K 114755.20 114660.49 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Quicksilver Input: CORAL2 P2 OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P2 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 5M 10M 15M 20M 25M 22580000 22590000 1. (CXX) g++ options: -fopenmp -O3 -march=native
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.1 Blend File: Barbershop - Compute: CPU-Only 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 150 300 450 600 750 676.26 676.53
srsRAN Project Test: PUSCH Processor Benchmark, Throughput Total OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.10.1-20240325 Test: PUSCH Processor Benchmark, Throughput Total 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 300 600 900 1200 1500 1559.3 1558.7 MIN: 954.9 MIN: 954.9 1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 20 40 60 80 100 83.39 83.36
SPECFEM3D Model: Mount St. Helens OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Mount St. Helens 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 9 18 27 36 45 37.20 37.20 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 6.0 Mode: CPU 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 7K 14K 21K 28K 35K 31844 31845
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Resizing 2 x 32GB DDR5-4800 2 x 32GB DDR5-5200 60 120 180 240 300 289 289 1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp
Phoronix Test Suite v10.8.4