AMD Ryzen 9 7900 DDR5-5200 ECC Memory

AMD Ryzen 9 7900 DDR5-5200 ECC memory benchmarks by Michael Larabel for a future article

HTML result view exported from: https://openbenchmarking.org/result/2405045-NE-AMDRYZEN948&sro&grr.

System details (identical for both configurations except Memory):
Processor: AMD Ryzen 9 7900 12-Core @ 5.48GHz (12 Cores / 24 Threads)
Motherboard: ASRockRack 1U4LW-B650/2L2T B650D4U-2L2T/BCM (2.09 BIOS)
Chipset: AMD Device 14d8
Memory (2 x 32GB DDR5-4800): 2 x 32GB DDR5-4800MT/s Micron MTC20C2085S1EC48BA1
Memory (2 x 32GB DDR5-5200): 2 x 32GB DDR5-5200MT/s Kingston
Disk: 1024GB SOLIDIGM SSDPFKNU010TZ
Graphics: ASPEED
Audio: AMD Rembrandt Radeon HD Audio
Monitor: VA2431
Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 24.04
Kernel: 6.8.0-31-generic (x86_64)
Desktop: GNOME Shell 46.0
Display Server: X Server
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1200

Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203
Python Details: Python 3.12.3
Security Details: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Result overview: tests run on both configurations (2 x 32GB DDR5-4800 and 2 x 32GB DDR5-5200), in the order reported:
build-linux-kernel: allmodconfig
openradioss: Chrysler Neon 1M
brl-cad: VGR Performance Metric
blender: Barbershop - CPU-Only
build-nodejs: Time To Compile
llama-cpp: llama-2-70b-chat.Q5_0.gguf
openradioss: INIVOL and Fluid Structure Interaction Drop Container
build-godot: Time To Compile
blender: Pabellon Barcelona - CPU-Only
openfoam: drivaerFastback, Small Mesh Size - Execution Time
openfoam: drivaerFastback, Small Mesh Size - Mesh Time
quicksilver: CORAL2 P2
llamafile: wizardcoder-python-34b-v1.0.Q6_K - CPU
tensorflow: CPU - 64 - ResNet-50
blender: Classroom - CPU-Only
openradioss: Bird Strike on Windshield
pgbench: 100 - 1000 - Read Only - Average Latency
pgbench: 100 - 1000 - Read Only
pgbench: 100 - 1000 - Read Write - Average Latency
pgbench: 100 - 1000 - Read Write
npb: SP.C
ffmpeg: libx265 - Platform
ffmpeg: libx265 - Video On Demand
ffmpeg: libx265 - Upload
llamafile: mistral-7b-instruct-v0.2.Q8_0 - CPU
openradioss: Bumper Beam
specfem3d: Layered Halfspace
specfem3d: Water-layered Halfspace
blender: Junkshop - CPU-Only
stockfish: Chess Benchmark
blender: Fishy Cat - CPU-Only
namd: STMV with 1,066,628 Atoms
nginx: 1000
openradioss: Rubber O-Ring Seal Installation
gromacs: MPI CPU - water_GMX50_bare
incompact3d: input.i3d 193 Cells Per Direction
npb: BT.C
build-linux-kernel: defconfig
v-ray: CPU
openradioss: Cell Phone Drop Test
llama-cpp: llama-2-13b.Q4_0.gguf
ospray: gravity_spheres_volume/dim_512/ao/real_time
tensorflow: CPU - 64 - GoogLeNet
openvino: Face Detection FP16-INT8 - CPU (latency, ms)
openvino: Face Detection FP16-INT8 - CPU (throughput, FPS)
openvino: Machine Translation EN To DE FP16 - CPU (latency, ms)
openvino: Machine Translation EN To DE FP16 - CPU (throughput, FPS)
openvino: Person Detection FP16 - CPU (latency, ms)
openvino: Person Detection FP16 - CPU (throughput, FPS)
openvino: Person Detection FP32 - CPU (latency, ms)
openvino: Person Detection FP32 - CPU (throughput, FPS)
openvino: Road Segmentation ADAS FP16-INT8 - CPU (latency, ms)
openvino: Road Segmentation ADAS FP16-INT8 - CPU (throughput, FPS)
openvino: Person Vehicle Bike Detection FP16 - CPU (latency, ms)
openvino: Person Vehicle Bike Detection FP16 - CPU (throughput, FPS)
quicksilver: CORAL2 P1
openvino: Road Segmentation ADAS FP16 - CPU (latency, ms)
openvino: Road Segmentation ADAS FP16 - CPU (throughput, FPS)
openvino: Vehicle Detection FP16 - CPU (latency, ms)
openvino: Vehicle Detection FP16 - CPU (throughput, FPS)
graphics-magick: Noise-Gaussian
graphics-magick: Rotate
graphics-magick: Swirl
graphics-magick: HWB Color Space
graphics-magick: Resizing
rocksdb: Read While Writing
mbw: Memory Copy, Fixed Block Size - 8192 MiB
libxsmm: 32
mbw: Memory Copy - 8192 MiB
npb: LU.C
specfem3d: Homogeneous Halfspace
build-php: Time To Compile
avifenc: 2
ffmpeg: libx265 - Live
embree: Pathtracer ISPC - Asian Dragon Obj
namd: ATPase with 327,506 Atoms
specfem3d: Mount St. Helens
specfem3d: Tomographic Model
npb: IS.D
jpegxl-decode: All
ramspeed: Average - Integer
ramspeed: Copy - Integer
ramspeed: Scale - Integer
ramspeed: Triad - Integer
ramspeed: Add - Integer
jpegxl-decode: 1
embree: Pathtracer ISPC - Asian Dragon
jpegxl: PNG - 90
tensorflow: CPU - 64 - AlexNet
mbw: Memory Copy, Fixed Block Size - 4096 MiB
mbw: Memory Copy - 4096 MiB
llamafile: llava-v1.5-7b-q4 - CPU
pytorch: CPU - 1 - ResNet-50
llama-cpp: llama-2-7b.Q4_0.gguf
build-mesa: Time To Compile
compress-7zip: Decompression Rating
compress-7zip: Compression Rating
npb: SP.B
npb: FT.C
incompact3d: input.i3d 129 Cells Per Direction
npb: CG.C
srsran: PUSCH Processor Benchmark, Throughput Total
tensorflow: CPU - 1 - ResNet-50
tensorflow: CPU - 1 - AlexNet
onednn: IP Shapes 3D - CPU
npb: MG.C
rnnoise: 26 Minute Long Talking Sample
mbw: Memory Copy, Fixed Block Size - 1024 MiB
mbw: Memory Copy - 1024 MiB
draco: Church Facade
onednn: Convolution Batch Shapes Auto - CPU
srsran: PDSCH Processor Benchmark, Throughput Total
draco: Lion
svt-av1: Preset 12 - Bosphorus 4K
svt-av1: Preset 13 - Bosphorus 4K
avifenc: 6
encode-wavpack: WAV To WavPack
tensorflow: CPU - 1 - GoogLeNet
avifenc: 10, Lossless
onednn: Deconvolution Batch shapes_3d - CPU

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 979.12
2 x 32GB DDR5-5200: 971.79

OpenRadioss

Model: Chrysler Neon 1M

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 913.48
2 x 32GB DDR5-5200: 880.51

BRL-CAD

VGR Performance Metric

BRL-CAD 7.38.2 (VGR Performance Metric, More Is Better)
2 x 32GB DDR5-4800: 280941
2 x 32GB DDR5-5200: 282841
1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 4.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 676.26
2 x 32GB DDR5-5200: 676.53

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 21.7.2 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 515.33
2 x 32GB DDR5-5200: 512.55

Llama.cpp

Model: llama-2-70b-chat.Q5_0.gguf

Llama.cpp b1808 (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 1.22
2 x 32GB DDR5-5200: 1.25
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 338.34
2 x 32GB DDR5-5200: 335.53

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 240.33
2 x 32GB DDR5-5200: 239.45

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 4.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 233.85
2 x 32GB DDR5-5200: 235.05

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 196.38
2 x 32GB DDR5-5200: 189.29
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 28.68
2 x 32GB DDR5-5200: 26.77
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Quicksilver

Input: CORAL2 P2

Quicksilver 20230818 (Figure Of Merit, More Is Better)
2 x 32GB DDR5-4800: 22580000
2 x 32GB DDR5-5200: 22590000
1. (CXX) g++ options: -fopenmp -O3 -march=native

Llamafile

Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU

Llamafile 0.7 (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 2.02
2 x 32GB DDR5-5200: 2.08

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

TensorFlow 2.16.1 (images/sec, More Is Better)
2 x 32GB DDR5-4800: 35.44
2 x 32GB DDR5-5200: 36.12

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 4.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 193.27
2 x 32GB DDR5-5200: 193.87

OpenRadioss

Model: Bird Strike on Windshield

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 182.19
2 x 32GB DDR5-5200: 180.32

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 1.385
2 x 32GB DDR5-5200: 1.366
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

PostgreSQL 16 (TPS, More Is Better)
2 x 32GB DDR5-4800: 721872
2 x 32GB DDR5-5200: 732078
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 29.47
2 x 32GB DDR5-5200: 29.25
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

PostgreSQL 16 (TPS, More Is Better)
2 x 32GB DDR5-4800: 33933
2 x 32GB DDR5-5200: 34193
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
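
The four PostgreSQL results above appear to be two views of the same runs: with a fixed number of clients, average latency is roughly the client count divided by throughput. The sketch below is not part of the exported result; it only re-derives the reported latencies from the reported TPS values under that assumption. The function name `latency_ms_from_tps` and the dictionary layout are illustrative choices.

```python
# Cross-check: average latency (ms) ~= clients / TPS * 1000.
# TPS and latency figures are copied from the PostgreSQL results above;
# the helper name and data layout are illustrative, not from the export.

CLIENTS = 1000  # "Clients: 1000" in the test description

# (reported TPS, reported average latency in ms) per mode and configuration
PGBENCH = {
    ("Read Only", "2 x 32GB DDR5-4800"): (721872, 1.385),
    ("Read Only", "2 x 32GB DDR5-5200"): (732078, 1.366),
    ("Read Write", "2 x 32GB DDR5-4800"): (33933, 29.47),
    ("Read Write", "2 x 32GB DDR5-5200"): (34193, 29.25),
}


def latency_ms_from_tps(clients: int, tps: float) -> float:
    """Estimate average latency in milliseconds from throughput."""
    return clients / tps * 1000.0


for (mode, config), (tps, reported_ms) in PGBENCH.items():
    derived_ms = latency_ms_from_tps(CLIENTS, tps)
    print(f"{mode:10s} {config}: derived {derived_ms:.3f} ms, reported {reported_ms} ms")
```

If the derived and reported figures agree to within rounding, the latency and TPS charts carry the same information and only one of them needs to be compared between the two memory configurations.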

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 11331.51
2 x 32GB DDR5-5200: 11771.22
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

FFmpeg

Encoder: libx265 - Scenario: Platform

FFmpeg 7.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 68.30
2 x 32GB DDR5-5200: 68.71
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Video On Demand

FFmpeg 7.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 68.27
2 x 32GB DDR5-5200: 68.86
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Upload

FFmpeg 7.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 33.66
2 x 32GB DDR5-5200: 33.85
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Llamafile

Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU

Llamafile 0.7 (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 7.21
2 x 32GB DDR5-5200: 7.43

OpenRadioss

Model: Bumper Beam

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 105.47
2 x 32GB DDR5-5200: 104.36

SPECFEM3D

Model: Layered Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 102.83
2 x 32GB DDR5-5200: 101.41
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Water-layered Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 100.60
2 x 32GB DDR5-5200: 98.46
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

Blend File: Junkshop - Compute: CPU-Only

Blender 4.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 99.78
2 x 32GB DDR5-5200: 100.26

Stockfish

Chess Benchmark

Stockfish 16.1 (Nodes Per Second, More Is Better)
2 x 32GB DDR5-4800: 30072838
2 x 32GB DDR5-5200: 30311842
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 4.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 94.72
2 x 32GB DDR5-5200: 95.28

NAMD

Input: STMV with 1,066,628 Atoms

NAMD 3.0b6 (ns/day, More Is Better)
2 x 32GB DDR5-4800: 0.62007
2 x 32GB DDR5-5200: 0.62459

nginx

Connections: 1000

nginx 1.23.2 (Requests Per Second, More Is Better)
2 x 32GB DDR5-4800: 114755.20
2 x 32GB DDR5-5200: 114660.49
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 83.39
2 x 32GB DDR5-5200: 83.36

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024 (Ns Per Day, More Is Better)
2 x 32GB DDR5-4800: 2.151
2 x 32GB DDR5-5200: 2.182
1. (CXX) g++ options: -O3 -lm

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 79.43
2 x 32GB DDR5-5200: 76.57
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 36869.25
2 x 32GB DDR5-5200: 38315.10
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 76.58
2 x 32GB DDR5-5200: 75.67

Chaos Group V-RAY

Mode: CPU

Chaos Group V-RAY 6.0 (vsamples, More Is Better)
2 x 32GB DDR5-4800: 31844
2 x 32GB DDR5-5200: 31845

OpenRadioss

Model: Cell Phone Drop Test

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 65.18
2 x 32GB DDR5-5200: 65.25

Llama.cpp

Model: llama-2-13b.Q4_0.gguf

Llama.cpp b1808 (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 7.30
2 x 32GB DDR5-5200: 7.57
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OSPRay 3.1 (Items Per Second, More Is Better)
2 x 32GB DDR5-4800: 5.49836
2 x 32GB DDR5-5200: 5.58648

TensorFlow

Device: CPU - Batch Size: 64 - Model: GoogLeNet

TensorFlow 2.16.1 (images/sec, More Is Better)
2 x 32GB DDR5-4800: 112.13
2 x 32GB DDR5-5200: 113.89

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 322.63 (MIN: 296.85 / MAX: 328.51)
2 x 32GB DDR5-5200: 323.77 (MIN: 308.71 / MAX: 330.44)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 18.57
2 x 32GB DDR5-5200: 18.51
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 60.53 (MIN: 33.29 / MAX: 82.21)
2 x 32GB DDR5-5200: 59.82 (MIN: 48.42 / MAX: 67.39)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 99.05
2 x 32GB DDR5-5200: 100.21
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 80.58 (MIN: 60.22 / MAX: 102.9)
2 x 32GB DDR5-5200: 79.41 (MIN: 56.3 / MAX: 102.74)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 74.40
2 x 32GB DDR5-5200: 75.48
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 80.04 (MIN: 61.76 / MAX: 107.58)
2 x 32GB DDR5-5200: 79.28 (MIN: 60.22 / MAX: 100.15)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 74.91
2 x 32GB DDR5-5200: 75.64
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 17.32 (MIN: 8.73 / MAX: 23.67)
2 x 32GB DDR5-5200: 17.16 (MIN: 13.76 / MAX: 23)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 345.80
2 x 32GB DDR5-5200: 349.13
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 5.48 (MIN: 3.89 / MAX: 9.6)
2 x 32GB DDR5-5200: 5.41 (MIN: 3.92 / MAX: 8.18)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 1090.85
2 x 32GB DDR5-5200: 1104.95
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Quicksilver

Input: CORAL2 P1

Quicksilver 20230818 (Figure Of Merit, More Is Better)
2 x 32GB DDR5-4800: 18550000
2 x 32GB DDR5-5200: 18480000
1. (CXX) g++ options: -fopenmp -O3 -march=native

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 21.27 (MIN: 9.77 / MAX: 37.47)
2 x 32GB DDR5-5200: 20.83 (MIN: 12.6 / MAX: 29.81)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 281.64
2 x 32GB DDR5-5200: 287.49
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 9.26 (MIN: 4.6 / MAX: 15.49)
2 x 32GB DDR5-5200: 9.08 (MIN: 4.23 / MAX: 17.05)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 645.63
2 x 32GB DDR5-5200: 658.28
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 91
2 x 32GB DDR5-5200: 92
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 165
2 x 32GB DDR5-5200: 167
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 187
2 x 32GB DDR5-5200: 188
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 259
2 x 32GB DDR5-5200: 261
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 289
2 x 32GB DDR5-5200: 289
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

RocksDB

Test: Read While Writing

RocksDB 9.0 (Op/s, More Is Better)
2 x 32GB DDR5-4800: 3174105
2 x 32GB DDR5-5200: 3209986
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
2 x 32GB DDR5-4800: 16405.97
2 x 32GB DDR5-5200: 17799.50
1. (CC) gcc options: -O3 -march=native

libxsmm

M N K: 32

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
2 x 32GB DDR5-4800: 107.1
2 x 32GB DDR5-5200: 111.7
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

MBW

Test: Memory Copy - Array Size: 8192 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
2 x 32GB DDR5-4800: 17709.53
2 x 32GB DDR5-5200: 18404.79
1. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 41111.39
2 x 32GB DDR5-5200: 41210.06
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

SPECFEM3D

Model: Homogeneous Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 47.02
2 x 32GB DDR5-5200: 46.53
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 8.3.4 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 44.61
2 x 32GB DDR5-5200: 44.18

libavif avifenc

Encoder Speed: 2

libavif avifenc 1.0 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 43.07
2 x 32GB DDR5-5200: 42.94
1. (CXX) g++ options: -O3 -fPIC -lm

FFmpeg

Encoder: libx265 - Scenario: Live

FFmpeg 7.0 (FPS, More Is Better)
2 x 32GB DDR5-4800: 175.30
2 x 32GB DDR5-5200: 178.26
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

Embree 4.3 (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 21.58 (MIN: 21.45 / MAX: 21.94)
2 x 32GB DDR5-5200: 21.85 (MIN: 21.73 / MAX: 22.2)

NAMD

Input: ATPase with 327,506 Atoms

NAMD 3.0b6 (ns/day, More Is Better)
2 x 32GB DDR5-4800: 2.26790
2 x 32GB DDR5-5200: 2.27071

SPECFEM3D

Model: Mount St. Helens

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 37.20
2 x 32GB DDR5-5200: 37.20
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Tomographic Model

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 36.59
2 x 32GB DDR5-5200: 35.54
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 1292.66
2 x 32GB DDR5-5200: 1345.70
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

JPEG-XL Decoding libjxl

CPU Threads: All

JPEG-XL Decoding libjxl 0.10.1, CPU Threads: All (MP/s, More Is Better)
2 x 32GB DDR5-4800: 201.87
2 x 32GB DDR5-5200: 205.31

RAMspeed SMP

Type: Average - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Average - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 59338.22
2 x 32GB DDR5-5200: 61195.82
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Copy - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 62972.55
2 x 32GB DDR5-5200: 65210.10
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Scale - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 63562.29
2 x 32GB DDR5-5200: 65913.62
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Triad - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 55555.51
2 x 32GB DDR5-5200: 57010.05
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Integer

RAMspeed SMP 3.5.0, Type: Add - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 54648.29
2 x 32GB DDR5-5200: 56856.03
1. (CC) gcc options: -O3 -march=native

JPEG-XL Decoding libjxl

CPU Threads: 1

JPEG-XL Decoding libjxl 0.10.1, CPU Threads: 1 (MP/s, More Is Better)
2 x 32GB DDR5-4800: 85.28
2 x 32GB DDR5-5200: 87.05

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 4.3, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 25.27 (MIN: 25.12 / MAX: 25.79)
2 x 32GB DDR5-5200: 25.60 (MIN: 25.46 / MAX: 25.94)

JPEG-XL libjxl

Input: PNG - Quality: 90

JPEG-XL libjxl 0.10.1, Input: PNG - Quality: 90 (MP/s, More Is Better)
2 x 32GB DDR5-4800: 41.18
2 x 32GB DDR5-5200: 41.75
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm

TensorFlow

Device: CPU - Batch Size: 64 - Model: AlexNet

TensorFlow 2.16.1, Device: CPU - Batch Size: 64 - Model: AlexNet (images/sec, More Is Better)
2 x 32GB DDR5-4800: 263.80
2 x 32GB DDR5-5200: 266.66

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

MBW 2018-09-08, Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-4800: 16019.16
2 x 32GB DDR5-5200: 17776.42
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy - Array Size: 4096 MiB

MBW 2018-09-08, Test: Memory Copy - Array Size: 4096 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-4800: 17823.97
2 x 32GB DDR5-5200: 18169.53
1. (CC) gcc options: -O3 -march=native

Llamafile

Test: llava-v1.5-7b-q4 - Acceleration: CPU

Llamafile 0.7, Test: llava-v1.5-7b-q4 - Acceleration: CPU (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 11.44
2 x 32GB DDR5-5200: 11.85

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

PyTorch 2.2.1, Device: CPU - Batch Size: 1 - Model: ResNet-50 (batches/sec, More Is Better)
2 x 32GB DDR5-4800: 58.75 (MIN: 53.89 / MAX: 59.52)
2 x 32GB DDR5-5200: 58.87 (MIN: 54.65 / MAX: 59.36)

Llama.cpp

Model: llama-2-7b.Q4_0.gguf

Llama.cpp b1808, Model: llama-2-7b.Q4_0.gguf (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 13.67
2 x 32GB DDR5-5200: 14.69
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 24.0, Time To Compile (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 22.06
2 x 32GB DDR5-5200: 22.00

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 22.01, Test: Decompression Rating (MIPS, More Is Better)
2 x 32GB DDR5-4800: 126457
2 x 32GB DDR5-5200: 126189
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

7-Zip Compression 22.01, Test: Compression Rating (MIPS, More Is Better)
2 x 32GB DDR5-4800: 139035
2 x 32GB DDR5-5200: 141015
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 18304.39
2 x 32GB DDR5-5200: 19269.37
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 22528.92
2 x 32GB DDR5-5200: 23481.95
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 16.67
2 x 32GB DDR5-5200: 16.00
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 9531.48
2 x 32GB DDR5-5200: 10133.13
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325, Test: PUSCH Processor Benchmark, Throughput Total (Mbps, More Is Better)
2 x 32GB DDR5-4800: 1559.3 (MIN: 954.9)
2 x 32GB DDR5-5200: 1558.7 (MIN: 954.9)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

TensorFlow

Device: CPU - Batch Size: 1 - Model: ResNet-50

TensorFlow 2.16.1, Device: CPU - Batch Size: 1 - Model: ResNet-50 (images/sec, More Is Better)
2 x 32GB DDR5-4800: 14.68
2 x 32GB DDR5-5200: 14.71

TensorFlow

Device: CPU - Batch Size: 1 - Model: AlexNet

TensorFlow 2.16.1, Device: CPU - Batch Size: 1 - Model: AlexNet (images/sec, More Is Better)
2 x 32GB DDR5-4800: 14.48
2 x 32GB DDR5-5200: 15.00

oneDNN

Harness: IP Shapes 3D - Engine: CPU

oneDNN 3.4, Harness: IP Shapes 3D - Engine: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 3.72571 (MIN: 3.69)
2 x 32GB DDR5-5200: 3.58111 (MIN: 3.54)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 22577.88
2 x 32GB DDR5-5200: 23251.34
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

RNNoise

Input: 26 Minute Long Talking Sample

RNNoise 0.2, Input: 26 Minute Long Talking Sample (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 7.540
2 x 32GB DDR5-5200: 7.382
1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB

MBW 2018-09-08, Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-4800: 16190.13
2 x 32GB DDR5-5200: 17220.32
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy - Array Size: 1024 MiB

MBW 2018-09-08, Test: Memory Copy - Array Size: 1024 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-4800: 17705.76
2 x 32GB DDR5-5200: 18105.09
1. (CC) gcc options: -O3 -march=native

Google Draco

Model: Church Facade

Google Draco 1.5.6, Model: Church Facade (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 5022
2 x 32GB DDR5-5200: 4952
1. (CXX) g++ options: -O3

oneDNN

Harness: Convolution Batch Shapes Auto - Engine: CPU

oneDNN 3.4, Harness: Convolution Batch Shapes Auto - Engine: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 6.33617 (MIN: 6.24)
2 x 32GB DDR5-5200: 6.10032 (MIN: 5.99)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325, Test: PDSCH Processor Benchmark, Throughput Total (Mbps, More Is Better)
2 x 32GB DDR5-4800: 11196.8
2 x 32GB DDR5-5200: 11166.1
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

Google Draco

Model: Lion

Google Draco 1.5.6, Model: Lion (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 3903
2 x 32GB DDR5-5200: 3868
1. (CXX) g++ options: -O3

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

SVT-AV1 2.0, Encoder Mode: Preset 12 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 145.78
2 x 32GB DDR5-5200: 147.74
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

SVT-AV1 2.0, Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 146.71
2 x 32GB DDR5-5200: 148.20
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

libavif avifenc

Encoder Speed: 6

libavif avifenc 1.0, Encoder Speed: 6 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 4.457
2 x 32GB DDR5-5200: 4.354
1. (CXX) g++ options: -O3 -fPIC -lm

WavPack Audio Encoding

WAV To WavPack

WavPack Audio Encoding 5.7, WAV To WavPack (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 4.409
2 x 32GB DDR5-5200: 4.287

TensorFlow

Device: CPU - Batch Size: 1 - Model: GoogLeNet

TensorFlow 2.16.1, Device: CPU - Batch Size: 1 - Model: GoogLeNet (images/sec, More Is Better)
2 x 32GB DDR5-4800: 54.41
2 x 32GB DDR5-5200: 54.77

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 1.0, Encoder Speed: 10, Lossless (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 4.145
2 x 32GB DDR5-5200: 4.122
1. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Engine: CPU

oneDNN 3.4, Harness: Deconvolution Batch shapes_3d - Engine: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 3.43803 (MIN: 3.21)
2 x 32GB DDR5-5200: 3.43041 (MIN: 3.21)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
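
Because the results above mix "Fewer Is Better" and "More Is Better" metrics, the DDR5-5200 versus DDR5-4800 difference is easier to compare as a normalized percentage. The following is a minimal sketch and is not part of the exported result: the helper name `speedup_percent` is hypothetical, and the few values hard-coded here are copied from the results above purely for illustration.

```python
def speedup_percent(ddr5_4800, ddr5_5200, higher_is_better):
    """Return the DDR5-5200 gain over DDR5-4800 in percent (positive = faster)."""
    if higher_is_better:
        # Throughput-style metric: a larger DDR5-5200 value is a gain.
        ratio = ddr5_5200 / ddr5_4800
    else:
        # Time-style metric: a smaller DDR5-5200 value is a gain.
        ratio = ddr5_4800 / ddr5_5200
    return (ratio - 1.0) * 100.0


# (label, DDR5-4800 value, DDR5-5200 value, higher_is_better)
# Values are taken from the results listed above; the selection is arbitrary.
results = [
    ("RAMspeed SMP Copy Integer (MB/s)", 62972.55, 65210.10, True),
    ("MBW Memory Copy, Fixed Block Size 4096 MiB (MiB/s)", 16019.16, 17776.42, True),
    ("Llama.cpp llama-2-7b.Q4_0.gguf (tokens/s)", 13.67, 14.69, True),
    ("SPECFEM3D Tomographic Model (s)", 36.59, 35.54, False),
    ("Timed Mesa Compilation (s)", 22.06, 22.00, False),
]

for name, slow, fast, higher_is_better in results:
    print(f"{name}: {speedup_percent(slow, fast, higher_is_better):+.2f}%")
```

For time-based results the ratio is inverted so that a positive percentage always means the DDR5-5200 configuration was faster, regardless of the metric's direction.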


Phoronix Test Suite v10.8.5