AMD Ryzen 9 7900 ECC Memory

AMD Ryzen 9 7900 12-Core testing with an ASRockRack 1U4LW-B650/2L2T B650D4U-2L2T/BCM (2.09 BIOS) and ASPEED on Ubuntu 24.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2404244-NE-AMDRYZEN993

Run Management

Result Identifier: 2 x 32GB DDR5-4800
Date Run: April 23
Test Duration: 5 Hours, 42 Minutes

System Details

Processor: AMD Ryzen 9 7900 12-Core @ 5.48GHz (12 Cores / 24 Threads)
Motherboard: ASRockRack 1U4LW-B650/2L2T B650D4U-2L2T/BCM (2.09 BIOS)
Chipset: AMD Device 14d8
Memory: 2 x 32GB DDR5-4800MT/s Micron MTC20C2085S1EC48BA1
Disk: 1024GB SOLIDIGM SSDPFKNU010TZ
Graphics: ASPEED
Audio: AMD Rembrandt Radeon HD Audio
Monitor: VA2431
Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 24.04
Kernel: 6.8.0-31-generic (x86_64)
Desktop: GNOME Shell 46.0
Display Server: X Server
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1200

System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance)
- CPU Microcode: 0xa601203
- Python 3.12.3
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Result overview table: AMD Ryzen 9 7900 ECC Memory, single result column (2 x 32GB DDR5-4800). Each result is listed individually below.

Timed Linux Kernel Compilation

Timed Linux Kernel Compilation 6.8 - Build: allmodconfig (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 979.12

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2023.09.15 - Model: Chrysler Neon 1M (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 913.48

Blender

Blender 4.1 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 676.26

BRL-CAD

BRL-CAD 7.38.2 - VGR Performance Metric (VGR Performance Metric, More Is Better)
2 x 32GB DDR5-4800: 280941
1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

Timed Node.js Compilation

Timed Node.js Compilation 21.7.2 - Time To Compile (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 515.33

Llama.cpp

Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.

Llama.cpp b1808 - Model: llama-2-70b-chat.Q5_0.gguf (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 1.22
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

OpenRadioss

OpenRadioss 2023.09.15 - Model: INIVOL and Fluid Structure Interaction Drop Container (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 338.34

Quicksilver

Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.

Quicksilver 20230818 - Input: CTS2 (Figure Of Merit, More Is Better)
2 x 32GB DDR5-4800: 16160000
1. (CXX) g++ options: -fopenmp -O3 -march=native

OpenSSL

OpenSSL 3.3 - Algorithm: RSA4096 (verify/s, More Is Better)
2 x 32GB DDR5-4800: 279049.8

OpenSSL 3.3 - Algorithm: RSA4096 (sign/s, More Is Better)
2 x 32GB DDR5-4800: 9842.1
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine; it is built here using the SCons build system, targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation 4.0 - Time To Compile (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 240.33

OSPRay Studio

OSPRay Studio 1.0 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 229326

Blender

Blender 4.1 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 233.85

Tinymembench

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

Tinymembench 2018-05-28 - Standard Memset (MB/s, More Is Better)
2 x 32GB DDR5-4800: 28761.2

Tinymembench 2018-05-28 - Standard Memcpy (MB/s, More Is Better)
2 x 32GB DDR5-4800: 25769.8
1. (CC) gcc options: -O2 -lm
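
To put the Tinymembench figures in context, the sketch below compares them with a theoretical peak for DDR5-4800. The channel count, the 8-byte data width per channel, and the decimal meaning of MB are assumptions for illustration; none of them is stated in the result file.

```python
# Results copied from the Tinymembench graphs above (MB/s).
MEMSET_MBPS = 28761.2
MEMCPY_MBPS = 25769.8

# Assumptions (not stated in the result file): two memory channels,
# 64 data bits (8 bytes) per channel, and 1 MB = 10**6 bytes.
CHANNELS = 2
BYTES_PER_TRANSFER = 8
MEGA_TRANSFERS_PER_SEC = 4800  # DDR5-4800 means 4800 MT/s


def theoretical_peak_mbps(channels: int, bytes_per_transfer: int, mega_transfers: int) -> int:
    """Peak bandwidth in MB/s: channels x bytes per transfer x MT/s."""
    return channels * bytes_per_transfer * mega_transfers


peak = theoretical_peak_mbps(CHANNELS, BYTES_PER_TRANSFER, MEGA_TRANSFERS_PER_SEC)

for name, measured in (("memset", MEMSET_MBPS), ("memcpy", MEMCPY_MBPS)):
    share = measured / peak
    print(f"{name}: {measured:.1f} MB/s, {share:.1%} of assumed peak {peak} MB/s")
```

Under these assumptions the single-threaded copy reaches roughly a third of the assumed peak, which is not unusual for a one-core memory test.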

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 10 - Input: drivaerFastback, Small Mesh Size - Execution Time (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 196.38

OpenFOAM 10 - Input: drivaerFastback, Small Mesh Size - Mesh Time (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 28.68
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Llamafile

Llamafile 0.7 - Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 2.02

Quicksilver

Quicksilver 20230818 - Input: CORAL2 P2 (Figure Of Merit, More Is Better)
2 x 32GB DDR5-4800: 22580000
1. (CXX) g++ options: -fopenmp -O3 -march=native

OSPRay Studio

OSPRay Studio 1.0 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 196691

OSPRay Studio 1.0 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 194961

TensorFlow

TensorFlow 2.16.1 - Device: CPU - Batch Size: 64 - Model: ResNet-50 (images/sec, More Is Better)
2 x 32GB DDR5-4800: 35.44

Blender

Blender 4.1 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 193.27

OpenRadioss

OpenRadioss 2023.09.15 - Model: Bird Strike on Windshield (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 182.19

OpenSSL

OpenSSL 3.3 - Algorithm: SHA256 (byte/s, More Is Better)
2 x 32GB DDR5-4800: 24692383490

OpenSSL 3.3 - Algorithm: ChaCha20-Poly1305 (byte/s, More Is Better)
2 x 32GB DDR5-4800: 67937414080

OpenSSL 3.3 - Algorithm: AES-256-GCM (byte/s, More Is Better)
2 x 32GB DDR5-4800: 155271939960

OpenSSL 3.3 - Algorithm: AES-128-GCM (byte/s, More Is Better)
2 x 32GB DDR5-4800: 180693439010

OpenSSL 3.3 - Algorithm: ChaCha20 (byte/s, More Is Better)
2 x 32GB DDR5-4800: 95727912000

OpenSSL 3.3 - Algorithm: SHA512 (byte/s, More Is Better)
2 x 32GB DDR5-4800: 8063332180
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
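
The raw byte/s figures are hard to read at a glance. As a small convenience sketch (the helper name `to_gb_per_sec` and the choice of decimal gigabytes are illustrative assumptions, not part of the result file), the OpenSSL throughput results above can be converted to GB/s:

```python
# OpenSSL 3.3 throughput results copied from above, in byte/s.
OPENSSL_BYTES_PER_SEC = {
    "SHA256": 24692383490,
    "ChaCha20-Poly1305": 67937414080,
    "AES-256-GCM": 155271939960,
    "AES-128-GCM": 180693439010,
    "ChaCha20": 95727912000,
    "SHA512": 8063332180,
}


def to_gb_per_sec(bytes_per_sec: int) -> float:
    """Convert byte/s to decimal gigabytes per second (1 GB = 10**9 bytes)."""
    return bytes_per_sec / 1e9


# Print the algorithms from fastest to slowest.
for algorithm, rate in sorted(OPENSSL_BYTES_PER_SEC.items(), key=lambda kv: -kv[1]):
    print(f"{algorithm:>18}: {to_gb_per_sec(rate):7.2f} GB/s")
```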

OSPRay

OSPRay 3.1 - Benchmark: particle_volume/scivis/real_time (Items Per Second, More Is Better)
2 x 32GB DDR5-4800: 6.4428

OSPRay 3.1 - Benchmark: particle_volume/pathtracer/real_time (Items Per Second, More Is Better)
2 x 32GB DDR5-4800: 192.04

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 1.385

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only (TPS, More Is Better)
2 x 32GB DDR5-4800: 721872

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 29.47

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write (TPS, More Is Better)
2 x 32GB DDR5-4800: 33933
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
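
The latency and TPS figures above appear to be two views of the same measurement. A minimal sketch, assuming all 1000 clients are kept busy for the whole run, applies Little's law (throughput = clients / latency) to cross-check them. The function name `implied_tps` is illustrative.

```python
# Figures copied from the pgbench results above (scaling factor 100, 1000 clients).
CLIENTS = 1000
REPORTED = {
    "Read Only": {"latency_ms": 1.385, "tps": 721872},
    "Read Write": {"latency_ms": 29.47, "tps": 33933},
}


def implied_tps(clients: int, avg_latency_ms: float) -> float:
    """Throughput implied by Little's law, with latency converted to seconds."""
    return clients / (avg_latency_ms / 1000.0)


for mode, result in REPORTED.items():
    estimate = implied_tps(CLIENTS, result["latency_ms"])
    error = abs(estimate - result["tps"]) / result["tps"]
    print(f"{mode}: implied {estimate:,.0f} TPS, reported {result['tps']:,} ({error:.3%} apart)")
```

Both modes agree to well under one percent, with the small gap attributable to the rounding of the reported latencies.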

Llamafile

Llamafile 0.7 - Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 7.21

ASTC Encoder

ASTC Encoder 4.7 - Preset: Very Thorough (MT/s, More Is Better)
2 x 32GB DDR5-4800: 1.8885
1. (CXX) g++ options: -O3 -flto -pthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: SP.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 11331.51
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

ASTC Encoder

ASTC Encoder 4.7 - Preset: Exhaustive (MT/s, More Is Better)
2 x 32GB DDR5-4800: 1.1613
1. (CXX) g++ options: -O3 -flto -pthread

OSPRay Studio

OSPRay Studio 1.0 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 116796

FFmpeg

FFmpeg 7.0 - Encoder: libx265 - Scenario: Platform (FPS, More Is Better)
2 x 32GB DDR5-4800: 68.30

FFmpeg 7.0 - Encoder: libx265 - Scenario: Video On Demand (FPS, More Is Better)
2 x 32GB DDR5-4800: 68.27

FFmpeg 7.0 - Encoder: libx265 - Scenario: Upload (FPS, More Is Better)
2 x 32GB DDR5-4800: 33.66
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OSPRay

OSPRay 3.1 - Benchmark: particle_volume/ao/real_time (Items Per Second, More Is Better)
2 x 32GB DDR5-4800: 6.45817

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 3.0b6 - Input: STMV with 1,066,628 Atoms (ns/day, More Is Better)
2 x 32GB DDR5-4800: 0.62007

OSPRay Studio

OSPRay Studio 1.0 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 100245

OSPRay Studio 1.0 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 99448

OpenRadioss

OpenRadioss 2023.09.15 - Model: Bumper Beam (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 105.47

SPECFEM3D

SPECFEM3D 4.1.1 - Model: Layered Halfspace (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 102.83

SPECFEM3D 4.1.1 - Model: Water-layered Halfspace (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 100.60
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

Blender 4.1 - Blend File: Junkshop - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 99.78

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 2.2 - Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only (Images / Sec, More Is Better)
2 x 32GB DDR5-4800: 0.31

Stockfish

Stockfish 16.1 - Chess Benchmark (Nodes Per Second, More Is Better)
2 x 32GB DDR5-4800: 30072838
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver

Blender

Blender 4.1 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 94.72

VVenC

VVenC 1.11 - Video Input: Bosphorus 4K - Video Preset: Fast (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 6.529
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

nginx 1.23.2 - Connections: 1000 (Requests Per Second, More Is Better)
2 x 32GB DDR5-4800: 114755.21

nginx 1.23.2 - Connections: 500 (Requests Per Second, More Is Better)
2 x 32GB DDR5-4800: 120941.12
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OSPRay Studio

OSPRay Studio 1.0 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 7065
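
With the 1, 16, and 32 samples-per-pixel results for Camera 3 at 4K all reported above, a short sketch can check how closely render time tracks the sample count. The variable and function names are illustrative, and the near-linear scaling is an observation from these three data points only.

```python
# OSPRay Studio 1.0, Camera 3, 4K, Path Tracer, CPU: samples per pixel -> ms.
CAMERA3_4K_MS = {1: 7065, 16: 116796, 32: 229326}


def ms_per_sample(results: dict[int, float]) -> dict[int, float]:
    """Divide each render time by its samples-per-pixel count."""
    return {spp: ms / spp for spp, ms in results.items()}


per_sample = ms_per_sample(CAMERA3_4K_MS)
spread = max(per_sample.values()) / min(per_sample.values())

for spp, cost in per_sample.items():
    print(f"{spp:>2} spp: {cost:8.2f} ms per sample")
print(f"max/min ratio: {spread:.3f}")
```

The per-sample cost stays within a few percent across the three settings, so render time here grows roughly in proportion to samples per pixel.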

libavif avifenc

libavif avifenc 1.0 - Encoder Speed: 0 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 88.49
1. (CXX) g++ options: -O3 -fPIC -lm

OpenRadioss

OpenRadioss 2023.09.15 - Model: Rubber O-Ring Seal Installation (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 83.39

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package, tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2024 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
2 x 32GB DDR5-4800: 2.151
1. (CXX) g++ options: -O3 -lm

OSPRay Studio

OSPRay Studio 1.0 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 24232

OSPRay Studio 1.0 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 6027

OSPRay Studio 1.0 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 23994

OSPRay Studio 1.0 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 5957

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 79.43
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: BT.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 36869.25
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

oneDNN

oneDNN 3.4 - Harness: Recurrent Neural Network Training - Engine: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 1865.66 (MIN: 1857.6)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

Timed Linux Kernel Compilation

Timed Linux Kernel Compilation 6.8 - Build: defconfig (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 76.58

Blender

Blender 4.1 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 75.43

Llama.cpp

Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.

Llama.cpp b1808 - Model: llama-2-13b.Q4_0.gguf (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 7.3
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

oneDNN

oneDNN 3.4 - Harness: Recurrent Neural Network Inference - Engine: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 963.18 (MIN: 955.07)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

Chaos Group V-RAY

Chaos Group V-RAY 6.0 - Mode: CPU (vsamples, More Is Better)
2 x 32GB DDR5-4800: 31844

OpenRadioss

OpenRadioss 2023.09.15 - Model: Cell Phone Drop Test (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 65.18

OSPRay Studio

OSPRay Studio 1.0 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 1770

OSPRay Studio 1.0 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 1503

OSPRay Studio 1.0 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 60379

OSPRay Studio 1.0 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 1514

OSPRay

OSPRay 3.1 - Benchmark: gravity_spheres_volume/dim_512/scivis/real_time (Items Per Second, More Is Better)
2 x 32GB DDR5-4800: 5.42347

OSPRay 3.1 - Benchmark: gravity_spheres_volume/dim_512/ao/real_time (Items Per Second, More Is Better)
2 x 32GB DDR5-4800: 5.49836

TensorFlow

TensorFlow 2.16.1 - Device: CPU - Batch Size: 64 - Model: GoogLeNet (images/sec, More Is Better)
2 x 32GB DDR5-4800: 112.13

OSPRay

OSPRay 3.1 - Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time (Items Per Second, More Is Better)
2 x 32GB DDR5-4800: 6.54819

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 2178.72
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6 - Scene: Orange Juice - Acceleration: CPU (M samples/sec, More Is Better)
2 x 32GB DDR5-4800: 5.73 (MIN: 5.32 / MAX: 5.89)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 9.5

LuxCoreRender

LuxCoreRender 2.6 - Scene: Danish Mood - Acceleration: CPU (M samples/sec, More Is Better)
2 x 32GB DDR5-4800: 2.92 (MIN: 1.1 / MAX: 3.36)

LuxCoreRender 2.6 - Scene: LuxCore Benchmark - Acceleration: CPU (M samples/sec, More Is Better)
2 x 32GB DDR5-4800: 3.26 (MIN: 1.28 / MAX: 3.74)

OpenVINO

OpenVINO 2024.0 - Model: Face Detection FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 625.5 (MIN: 596.94 / MAX: 641.38)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Face Detection FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 9.57
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 322.63 (MIN: 296.85 / MAX: 328.51)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Face Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 18.57
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
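
Each OpenVINO model is reported twice, once as latency (ms) and once as throughput (FPS). Multiplying the two gives a rough idea of how many inference requests were in flight at once (Little's law: concurrency = throughput × latency). The Python sketch below applies this to the two Face Detection results above. Reading the product as the number of parallel inference requests is an assumption, since the result file does not state how many streams were used, and `in_flight` is a hypothetical helper name.

```python
# Rough Little's-law check: concurrency = throughput (1/s) * latency (s).
# The values are copied from the Face Detection results above; reading the
# product as "parallel inference requests" is an assumption.

def in_flight(fps: float, latency_ms: float) -> float:
    """Return the implied number of requests in flight."""
    return fps * (latency_ms / 1000.0)


results = {
    "Face Detection FP16": (9.57, 625.5),
    "Face Detection FP16-INT8": (18.57, 322.63),
}

for model, (fps, latency_ms) in results.items():
    print(f"{model}: ~{in_flight(fps, latency_ms):.2f} requests in flight")
```

Both products come out close to 6, which would be consistent with the benchmark keeping about six requests in flight, though the result file does not confirm this.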

GraphicsMagick

GraphicsMagick 1.3.43 - Operation: Sharpen (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 43
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

LuxCoreRender

LuxCoreRender 2.6 - Scene: DLSC - Acceleration: CPU (M samples/sec, More Is Better)
2 x 32GB DDR5-4800: 3.62 (MIN: 3.53 / MAX: 3.78)

OpenVINO

OpenVINO 2024.0 - Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 60.53 (MIN: 33.29 / MAX: 82.21)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Machine Translation EN To DE FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 99.05
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Person Detection FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 80.58 (MIN: 60.22 / MAX: 102.9)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Person Detection FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 74.4
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Person Detection FP32 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 80.04 (MIN: 61.76 / MAX: 107.58)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Person Detection FP32 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 74.91
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

GraphicsMagick

GraphicsMagick 1.3.43 - Operation: Enhanced (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 70
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenVINO

OpenVINO 2024.0 - Model: Noise Suppression Poconet-Like FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 9.47 (MIN: 6.62 / MAX: 12.05)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Noise Suppression Poconet-Like FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 1246.91
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 17.32 (MIN: 8.73 / MAX: 23.67)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 345.8
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 5.48 (MIN: 3.89 / MAX: 9.6)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 1090.85
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Road Segmentation ADAS FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 21.27 (MIN: 9.77 / MAX: 37.47)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Road Segmentation ADAS FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 281.64
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Person Re-Identification Retail FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 4.54 (MIN: 2.61 / MAX: 10.15)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Person Re-Identification Retail FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 1316.48
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Quicksilver

Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.

Quicksilver 20230818 - Input: CORAL2 P1 (Figure Of Merit, More Is Better)
2 x 32GB DDR5-4800: 18550000
1. (CXX) g++ options: -fopenmp -O3 -march=native

OpenVINO

OpenVINO 2024.0 - Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 5.18 (MIN: 2.88 / MAX: 13.18)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Vehicle Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 1152.68
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 21.66 (MIN: 17.58 / MAX: 38.77)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 553.36
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Face Detection Retail FP16-INT8 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 3.6 (MIN: 1.94 / MAX: 11.06)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Face Detection Retail FP16-INT8 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 3252.98
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Handwritten English Recognition FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 23.34 (MIN: 14.47 / MAX: 31.02)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Handwritten English Recognition FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 513.48
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

GraphicsMagick

GraphicsMagick 1.3.43 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 91
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenVINO

OpenVINO 2024.0 - Model: Weld Porosity Detection FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 12.46 (MIN: 6.31 / MAX: 17.78)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Weld Porosity Detection FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 961.04
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 0.31 (MIN: 0.16 / MAX: 6.53)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 34985.34
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Vehicle Detection FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 9.26 (MIN: 4.6 / MAX: 15.49)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Vehicle Detection FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 645.63
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

GraphicsMagick

GraphicsMagick 1.3.43 - Operation: Swirl (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 187
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenVINO

OpenVINO 2024.0 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 6.33 (MIN: 3.26 / MAX: 19.39)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 1885.03
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 0.45 (MIN: 0.22 / MAX: 8.77)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 24761.85
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Face Detection Retail FP16 - Device: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 2.66 (MIN: 1.38 / MAX: 17.22)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO 2024.0 - Model: Face Detection Retail FP16 - Device: CPU (FPS, More Is Better)
2 x 32GB DDR5-4800: 2212.08
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

GraphicsMagick

GraphicsMagick 1.3.43 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 259
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick 1.3.43 - Operation: Resizing (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 289
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick 1.3.43 - Operation: Rotate (Iterations Per Minute, More Is Better)
2 x 32GB DDR5-4800: 165
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 3.0b6 - Input: ATPase with 327,506 Atoms (ns/day, More Is Better)
2 x 32GB DDR5-4800: 2.26790

RocksDB

RocksDB 9.0 - Test: Update Random (Op/s, More Is Better)
2 x 32GB DDR5-4800: 703415
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB 9.0 - Test: Read Random Write Random (Op/s, More Is Better)
2 x 32GB DDR5-4800: 2542969
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB 9.0 - Test: Read While Writing (Op/s, More Is Better)
2 x 32GB DDR5-4800: 3174105
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB 9.0 - Test: Random Read (Op/s, More Is Better)
2 x 32GB DDR5-4800: 94310421
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OSPRay Studio

OSPRay Studio 1.0 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 52245

OSPRay Studio 1.0 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 51162

uvg266

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 10.75

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

MBW 2018-09-08 - Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-4800: 16405.97
1. (CC) gcc options: -O3 -march=native
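
As an illustration of what a memory-copy bandwidth figure measures, the sketch below times repeated copies of a buffer and reports the best rate in MiB/s. It is a minimal Python sketch, not MBW's implementation: the function name `copy_bandwidth_mib_s`, the 64 MiB default buffer, and the repeat count are all assumptions chosen for illustration, and interpreter overhead means the output is not comparable to the MBW result above.

```python
import time


def copy_bandwidth_mib_s(size_mib: int = 64, repeats: int = 5) -> float:
    """Time repeated buffer copies and return the best rate in MiB/s."""
    src = bytearray(size_mib * 1024 * 1024)
    dst = bytearray(len(src))
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        dst[:] = src  # bulk copy of the whole buffer into dst
        elapsed = time.perf_counter() - start
        best = min(best, elapsed)
    return size_mib / best


if __name__ == "__main__":
    print(f"{copy_bandwidth_mib_s():.0f} MiB/s")
```

Taking the fastest of several repeats is one common way to reduce noise from other activity on the system; averaging is an equally reasonable choice.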

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

libxsmm 2-1.17-3645 - M N K: 32 (GFLOPS/s, More Is Better)
2 x 32GB DDR5-4800: 107.1
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

MBW

MBW 2018-09-08 - Test: Memory Copy - Array Size: 8192 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-4800: 17709.53
1. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 41111.39
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

SPECFEM3D

SPECFEM3D 4.1.1 - Model: Homogeneous Halfspace (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 47.02
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

PyTorch

PyTorch 2.2.1 - Device: CPU - Batch Size: 32 - Model: ResNet-50 (batches/sec, More Is Better)
2 x 32GB DDR5-4800: 40.29 (MIN: 39.09 / MAX: 40.73)

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 2.2 - Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only (Images / Sec, More Is Better)
2 x 32GB DDR5-4800: 0.66

Intel Open Image Denoise 2.2 - Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only (Images / Sec, More Is Better)
2 x 32GB DDR5-4800: 0.66

Timed PHP Compilation

Timed PHP Compilation 8.3.4 - Time To Compile (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 44.61

JPEG-XL libjxl

JPEG-XL libjxl 0.10.1 - Input: JPEG - Quality: 80 (MP/s, More Is Better)
2 x 32GB DDR5-4800: 44.42
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm

JPEG-XL libjxl 0.10.1 - Input: PNG - Quality: 80 (MP/s, More Is Better)
2 x 32GB DDR5-4800: 44.90
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm

libavif avifenc

libavif avifenc 1.0 - Encoder Speed: 2 (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 43.07
1. (CXX) g++ options: -O3 -fPIC -lm

FFmpeg

FFmpeg 7.0 - Encoder: libx265 - Scenario: Live (FPS, More Is Better)
2 x 32GB DDR5-4800: 175.30
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Embree 4.3 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 21.58 (MIN: 21.45 / MAX: 21.94)

VVenC

VVenC 1.11 - Video Input: Bosphorus 4K - Video Preset: Faster (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 14.93
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OSPRay Studio

OSPRay Studio 1.0 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
2 x 32GB DDR5-4800: 32273

Llama.cpp

Llama.cpp b1808 - Model: llama-2-7b.Q4_0.gguf (Tokens Per Second, More Is Better)
2 x 32GB DDR5-4800: 13.67
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

SPECFEM3D

SPECFEM3D 4.1.1 - Model: Mount St. Helens (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 37.20
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

ASTC Encoder

ASTC Encoder 4.7 - Preset: Thorough (MT/s, More Is Better)
2 x 32GB DDR5-4800: 13.74
1. (CXX) g++ options: -O3 -flto -pthread

SPECFEM3D

SPECFEM3D 4.1.1 - Model: Tomographic Model (Seconds, Fewer Is Better)
2 x 32GB DDR5-4800: 36.59
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: IS.D (Total Mop/s, More Is Better)
2 x 32GB DDR5-4800: 1292.66
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

JPEG-XL Decoding libjxl

JPEG-XL Decoding libjxl 0.10.1 - CPU Threads: All (MP/s, More Is Better)
2 x 32GB DDR5-4800: 201.87

Embree

Embree 4.3 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
2 x 32GB DDR5-4800: 22.61 (MIN: 22.4 / MAX: 22.99)

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

RAMspeed SMP 3.5.0 - Type: Copy - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 62972.55
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP 3.5.0 - Type: Average - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 59338.22
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP 3.5.0 - Type: Scale - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 63562.29
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP 3.5.0 - Type: Triad - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 55555.51
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP 3.5.0 - Type: Add - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-4800: 54648.29
1. (CC) gcc options: -O3 -march=native
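
To put the RAMspeed figures in context, they can be compared against the theoretical peak bandwidth of the memory configuration. The Python sketch below assumes a dual-channel setup with a 64-bit (8-byte) data path per channel, and that RAMspeed's MB/s means 10^6 bytes per second. None of these are stated in the result file, so the percentages should be treated as approximate; `peak_mb_s` is a hypothetical helper name.

```python
# Theoretical peak = transfers per second * bytes per transfer * channels.
# Assumptions (not stated in the result file): 2 channels, 8 bytes per
# transfer per channel, and MB/s meaning 10**6 bytes per second.
MT_PER_S = 4800          # DDR5-4800, mega-transfers per second
BYTES_PER_TRANSFER = 8   # assumed 64-bit data path per channel
CHANNELS = 2             # assumed dual-channel configuration


def peak_mb_s(mt_per_s: int = MT_PER_S, channels: int = CHANNELS) -> int:
    """Return the theoretical peak bandwidth in MB/s."""
    return mt_per_s * BYTES_PER_TRANSFER * channels


# Integer results copied from the RAMspeed SMP lines above.
measured = {
    "Copy": 62972.55,
    "Scale": 63562.29,
    "Add": 54648.29,
    "Triad": 55555.51,
    "Average": 59338.22,
}

peak = peak_mb_s()
for name, mb_s in measured.items():
    print(f"{name}: {mb_s:.2f} MB/s = {100 * mb_s / peak:.1f}% of {peak} MB/s")
```

Under these assumptions the peak is 76,800 MB/s, so the Copy result corresponds to roughly 82% of it.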

JPEG-XL Decoding libjxl

JPEG-XL Decoding libjxl 0.10.1 - CPU Threads: 1 (MP/s, More Is Better)
2 x 32GB DDR5-4800: 85.28

Embree

Embree 4.3 - Binary: Pathtracer ISPC - Model: Asian Dragon
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 25.27 (MIN: 25.12 / MAX: 25.79)

VVenC

VVenC 1.11 - Video Input: Bosphorus 1080p - Video Preset: Fast
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 20.41
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

JPEG-XL libjxl

JPEG-XL libjxl 0.10.1 - Input: PNG - Quality: 90
MP/s, More Is Better
2 x 32GB DDR5-4800: 41.18
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm

JPEG-XL libjxl 0.10.1 - Input: JPEG - Quality: 90
MP/s, More Is Better
2 x 32GB DDR5-4800: 41.75
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

MBW 2018-09-08 - Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB
MiB/s, More Is Better
2 x 32GB DDR5-4800: 16019.16
1. (CC) gcc options: -O3 -march=native
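As a rough illustration of what a memory-copy bandwidth test measures, the sketch below times a buffer copy and reports MiB/s, either as one whole-array copy or in fixed-size blocks. It is a hedged sketch, not MBW's code: `copy_bandwidth_mib_s`, its parameters, the default sizes, and the choice to report the best of several repeats are all assumptions made for illustration.

```python
import time

def copy_bandwidth_mib_s(size_mib=64, block_kib=None, repeats=3):
    """Time copying size_mib MiB and return the best observed rate in MiB/s.

    block_kib=None copies the whole array at once; otherwise the copy is
    done in chunks of block_kib KiB (a hypothetical fixed block size).
    """
    size = size_mib * 1024 * 1024
    src = bytearray(size)
    dst = bytearray(size)
    src_view = memoryview(src)
    dst_view = memoryview(dst)
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        if block_kib is None:
            dst_view[:] = src_view              # single whole-array copy
        else:
            block = block_kib * 1024
            for off in range(0, size, block):   # fixed-block-size copy
                dst_view[off:off + block] = src_view[off:off + block]
        elapsed = time.perf_counter() - start
        best = min(best, elapsed)
    # Guard against a zero reading from a coarse timer.
    return size_mib / max(best, 1e-9)

if __name__ == "__main__":
    print(f"whole copy: {copy_bandwidth_mib_s():.0f} MiB/s")
    print(f"256 KiB blocks: {copy_bandwidth_mib_s(block_kib=256):.0f} MiB/s")
```

Interpreter overhead and the much smaller default array mean the numbers from this sketch are not comparable with the MBW results on this page.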

TensorFlow

TensorFlow 2.16.1 - Device: CPU - Batch Size: 64 - Model: AlexNet
images/sec, More Is Better
2 x 32GB DDR5-4800: 263.8

MBW

MBW 2018-09-08 - Test: Memory Copy - Array Size: 4096 MiB
MiB/s, More Is Better
2 x 32GB DDR5-4800: 17823.97
1. (CC) gcc options: -O3 -march=native

SVT-AV1

SVT-AV1 2.0 - Encoder Mode: Preset 4 - Input: Bosphorus 4K
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 6.679
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Llamafile

Llamafile 0.7 - Test: llava-v1.5-7b-q4 - Acceleration: CPU
Tokens Per Second, More Is Better
2 x 32GB DDR5-4800: 11.44

JPEG-XL libjxl

JPEG-XL libjxl 0.10.1 - Input: PNG - Quality: 100
MP/s, More Is Better
2 x 32GB DDR5-4800: 21.27
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm

JPEG-XL libjxl 0.10.1 - Input: JPEG - Quality: 100
MP/s, More Is Better
2 x 32GB DDR5-4800: 21.11
1. (CXX) g++ options: -fno-rtti -O3 -fPIE -pie -lm

PyTorch

PyTorch 2.2.1 - Device: CPU - Batch Size: 1 - Model: ResNet-50
batches/sec, More Is Better
2 x 32GB DDR5-4800: 58.75 (MIN: 53.89 / MAX: 59.52)

Timed Mesa Compilation

Timed Mesa Compilation 24.0 - Time To Compile
Seconds, Fewer Is Better
2 x 32GB DDR5-4800: 22.06

oneDNN

oneDNN 3.4 - Harness: Deconvolution Batch shapes_1d - Engine: CPU
ms, Fewer Is Better
2 x 32GB DDR5-4800: 3.27921 (MIN: 2.81)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 22.01 - Test: Decompression Rating
MIPS, More Is Better
2 x 32GB DDR5-4800: 126457
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 22.01 - Test: Compression Rating
MIPS, More Is Better
2 x 32GB DDR5-4800: 139035
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: SP.B
Total Mop/s, More Is Better
2 x 32GB DDR5-4800: 18304.39
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar, developed by the Ultra Video Group at Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Very Fast
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 30.84

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: FT.C
Total Mop/s, More Is Better
2 x 32GB DDR5-4800: 22528.92
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

uvg266

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Super Fast
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 32.69

ASTC Encoder

ASTC Encoder 4.7 - Preset: Fast
MT/s, More Is Better
2 x 32GB DDR5-4800: 267.75
1. (CXX) g++ options: -O3 -flto -pthread

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference high-performance code for solving the incompressible Navier-Stokes equations together with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction
Seconds, Fewer Is Better
2 x 32GB DDR5-4800: 16.67
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C
Total Mop/s, More Is Better
2 x 32GB DDR5-4800: 9531.48
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

uvg266

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 38.37

srsRAN Project

srsRAN Project 23.10.1-20240325 - Test: PUSCH Processor Benchmark, Throughput Thread
Mbps, More Is Better
2 x 32GB DDR5-4800: 183.4 (MIN: 105.9)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

oneDNN

oneDNN 3.4 - Harness: IP Shapes 1D - Engine: CPU
ms, Fewer Is Better
2 x 32GB DDR5-4800: 1.86964 (MIN: 1.82)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6 - Scene: Rainbow Colors and Prism - Acceleration: CPU
M samples/sec, More Is Better
2 x 32GB DDR5-4800: 12.96 (MIN: 12.16 / MAX: 13.15)

VVenC

VVenC 1.11 - Video Input: Bosphorus 1080p - Video Preset: Faster
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 46.64
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

uvg266

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Slow
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 45.21

ASTC Encoder

ASTC Encoder 4.7 - Preset: Medium
MT/s, More Is Better
2 x 32GB DDR5-4800: 105.12
1. (CXX) g++ options: -O3 -flto -pthread

srsRAN Project

srsRAN Project 23.10.1-20240325 - Test: PUSCH Processor Benchmark, Throughput Total
Mbps, More Is Better
2 x 32GB DDR5-4800: 1559.3 (MIN: 954.9)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

uvg266

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Medium
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 50.73

SVT-AV1

SVT-AV1 2.0 - Encoder Mode: Preset 8 - Input: Bosphorus 4K
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 57.44
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

TensorFlow

TensorFlow 2.16.1 - Device: CPU - Batch Size: 1 - Model: ResNet-50
images/sec, More Is Better
2 x 32GB DDR5-4800: 14.68

TensorFlow 2.16.1 - Device: CPU - Batch Size: 1 - Model: AlexNet
images/sec, More Is Better
2 x 32GB DDR5-4800: 14.48

oneDNN

oneDNN 3.4 - Harness: IP Shapes 3D - Engine: CPU
ms, Fewer Is Better
2 x 32GB DDR5-4800: 3.72571 (MIN: 3.69)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: MG.C
Total Mop/s, More Is Better
2 x 32GB DDR5-4800: 22577.88
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

SVT-AV1

SVT-AV1 2.0 - Encoder Mode: Preset 4 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 22.34
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

RNNoise

RNNoise 0.2 - Input: 26 Minute Long Talking Sample
Seconds, Fewer Is Better
2 x 32GB DDR5-4800: 7.54
1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

MBW

MBW 2018-09-08 - Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB
MiB/s, More Is Better
2 x 32GB DDR5-4800: 16190.13
1. (CC) gcc options: -O3 -march=native

libavif avifenc

libavif avifenc 1.0 - Encoder Speed: 6, Lossless
Seconds, Fewer Is Better
2 x 32GB DDR5-4800: 6.962
1. (CXX) g++ options: -O3 -fPIC -lm

MBW

MBW 2018-09-08 - Test: Memory Copy - Array Size: 1024 MiB
MiB/s, More Is Better
2 x 32GB DDR5-4800: 17705.76
1. (CC) gcc options: -O3 -march=native

Google Draco

Google Draco 1.5.6 - Model: Church Facade
ms, Fewer Is Better
2 x 32GB DDR5-4800: 5022
1. (CXX) g++ options: -O3

oneDNN

oneDNN 3.4 - Harness: Convolution Batch Shapes Auto - Engine: CPU
ms, Fewer Is Better
2 x 32GB DDR5-4800: 6.33617 (MIN: 6.24)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

srsRAN Project

srsRAN Project 23.10.1-20240325 - Test: PDSCH Processor Benchmark, Throughput Total
Mbps, More Is Better
2 x 32GB DDR5-4800: 11196.8
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

Google Draco

Google Draco 1.5.6 - Model: Lion
ms, Fewer Is Better
2 x 32GB DDR5-4800: 3903
1. (CXX) g++ options: -O3

SVT-AV1

SVT-AV1 2.0 - Encoder Mode: Preset 12 - Input: Bosphorus 4K
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 145.78
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1 2.0 - Encoder Mode: Preset 13 - Input: Bosphorus 4K
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 146.71
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

uvg266

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Very Fast
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 118.8

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Super Fast
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 129.72

libavif avifenc

libavif avifenc 1.0 - Encoder Speed: 6
Seconds, Fewer Is Better
2 x 32GB DDR5-4800: 4.457
1. (CXX) g++ options: -O3 -fPIC -lm

WavPack Audio Encoding

WavPack Audio Encoding 5.7 - WAV To WavPack
Seconds, Fewer Is Better
2 x 32GB DDR5-4800: 4.409

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: EP.C
Total Mop/s, More Is Better
2 x 32GB DDR5-4800: 2189.28
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Parallel BZIP2 Compression

Parallel BZIP2 Compression 1.1.13 - FreeBSD-13.0-RELEASE-amd64-memstick.img Compression
Seconds, Fewer Is Better
2 x 32GB DDR5-4800: 4.26919
1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

SVT-AV1

SVT-AV1 2.0 - Encoder Mode: Preset 8 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 163.52
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

TensorFlow

TensorFlow 2.16.1 - Device: CPU - Batch Size: 1 - Model: GoogLeNet
images/sec, More Is Better
2 x 32GB DDR5-4800: 54.41

libavif avifenc

libavif avifenc 1.0 - Encoder Speed: 10, Lossless
Seconds, Fewer Is Better
2 x 32GB DDR5-4800: 4.145
1. (CXX) g++ options: -O3 -fPIC -lm

uvg266

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Ultra Fast
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 152.67

oneDNN

oneDNN 3.4 - Harness: Deconvolution Batch shapes_3d - Engine: CPU
ms, Fewer Is Better
2 x 32GB DDR5-4800: 3.43803 (MIN: 3.21)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

srsRAN Project

srsRAN Project 23.10.1-20240325 - Test: PDSCH Processor Benchmark, Throughput Thread
Mbps, More Is Better
2 x 32GB DDR5-4800: 882.5
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

SVT-AV1

SVT-AV1 2.0 - Encoder Mode: Preset 12 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 565.03
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1 2.0 - Encoder Mode: Preset 13 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
2 x 32GB DDR5-4800: 648.57
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

236 Results Shown

Timed Linux Kernel Compilation
OpenRadioss
Blender
BRL-CAD
Timed Node.js Compilation
Llama.cpp
OpenRadioss
Quicksilver
OpenSSL:
  RSA4096:
    verify/s
    sign/s
Timed Godot Game Engine Compilation
OSPRay Studio
Blender
Tinymembench:
  Standard Memset
  Standard Memcpy
OpenFOAM:
  drivaerFastback, Small Mesh Size - Execution Time
  drivaerFastback, Small Mesh Size - Mesh Time
Llamafile
Quicksilver
OSPRay Studio:
  2 - 4K - 32 - Path Tracer - CPU
  1 - 4K - 32 - Path Tracer - CPU
TensorFlow
Blender
OpenRadioss
OpenSSL:
  SHA256
  ChaCha20-Poly1305
  AES-256-GCM
  AES-128-GCM
  ChaCha20
  SHA512
OSPRay:
  particle_volume/scivis/real_time
  particle_volume/pathtracer/real_time
PostgreSQL:
  100 - 1000 - Read Only - Average Latency
  100 - 1000 - Read Only
  100 - 1000 - Read Write - Average Latency
  100 - 1000 - Read Write
Llamafile
ASTC Encoder
NAS Parallel Benchmarks
ASTC Encoder
OSPRay Studio
FFmpeg:
  libx265 - Platform
  libx265 - Video On Demand
  libx265 - Upload
OSPRay
NAMD
OSPRay Studio:
  2 - 4K - 16 - Path Tracer - CPU
  1 - 4K - 16 - Path Tracer - CPU
OpenRadioss
SPECFEM3D:
  Layered Halfspace
  Water-layered Halfspace
Blender
Intel Open Image Denoise
Stockfish
Blender
VVenC
nginx:
  1000
  500
OSPRay Studio
libavif avifenc
OpenRadioss
GROMACS
OSPRay Studio:
  2 - 1080p - 16 - Path Tracer - CPU
  2 - 4K - 1 - Path Tracer - CPU
  1 - 1080p - 16 - Path Tracer - CPU
  1 - 4K - 1 - Path Tracer - CPU
Xcompact3d Incompact3d
NAS Parallel Benchmarks
oneDNN
Timed Linux Kernel Compilation
Blender
Llama.cpp
oneDNN
Chaos Group V-RAY
OpenRadioss
OSPRay Studio:
  3 - 1080p - 1 - Path Tracer - CPU
  1 - 1080p - 1 - Path Tracer - CPU
  3 - 1080p - 32 - Path Tracer - CPU
  2 - 1080p - 1 - Path Tracer - CPU
OSPRay:
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/ao/real_time
TensorFlow
OSPRay
NAS Parallel Benchmarks
LuxCoreRender
uvg266
LuxCoreRender:
  Danish Mood - CPU
  LuxCore Benchmark - CPU
OpenVINO:
  Face Detection FP16 - CPU:
    ms
    FPS
  Face Detection FP16-INT8 - CPU:
    ms
    FPS
GraphicsMagick
LuxCoreRender
OpenVINO:
  Machine Translation EN To DE FP16 - CPU:
    ms
    FPS
  Person Detection FP16 - CPU:
    ms
    FPS
  Person Detection FP32 - CPU:
    ms
    FPS
GraphicsMagick
OpenVINO:
  Noise Suppression Poconet-Like FP16 - CPU:
    ms
    FPS
  Road Segmentation ADAS FP16-INT8 - CPU:
    ms
    FPS
  Person Vehicle Bike Detection FP16 - CPU:
    ms
    FPS
  Road Segmentation ADAS FP16 - CPU:
    ms
    FPS
  Person Re-Identification Retail FP16 - CPU:
    ms
    FPS
Quicksilver
OpenVINO:
  Vehicle Detection FP16-INT8 - CPU:
    ms
    FPS
  Handwritten English Recognition FP16-INT8 - CPU:
    ms
    FPS
  Face Detection Retail FP16-INT8 - CPU:
    ms
    FPS
  Handwritten English Recognition FP16 - CPU:
    ms
    FPS
GraphicsMagick
OpenVINO:
  Weld Porosity Detection FP16 - CPU:
    ms
    FPS
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
    ms
    FPS
  Vehicle Detection FP16 - CPU:
    ms
    FPS
GraphicsMagick
OpenVINO:
  Weld Porosity Detection FP16-INT8 - CPU:
    ms
    FPS
  Age Gender Recognition Retail 0013 FP16 - CPU:
    ms
    FPS
  Face Detection Retail FP16 - CPU:
    ms
    FPS
GraphicsMagick:
  HWB Color Space
  Resizing
  Rotate
NAMD
RocksDB:
  Update Rand
  Read Rand Write Rand
  Read While Writing
  Rand Read
OSPRay Studio:
  2 - 1080p - 32 - Path Tracer - CPU
  1 - 1080p - 32 - Path Tracer - CPU
uvg266
MBW
libxsmm
MBW
NAS Parallel Benchmarks
SPECFEM3D
PyTorch
Intel Open Image Denoise:
  RT.hdr_alb_nrm.3840x2160 - CPU-Only
  RT.ldr_alb_nrm.3840x2160 - CPU-Only
Timed PHP Compilation
JPEG-XL libjxl:
  JPEG - 80
  PNG - 80
libavif avifenc
FFmpeg
Embree
VVenC
OSPRay Studio
Llama.cpp
SPECFEM3D
ASTC Encoder
SPECFEM3D
NAS Parallel Benchmarks
JPEG-XL Decoding libjxl
Embree
RAMspeed SMP:
  Copy - Integer
  Average - Integer
  Scale - Integer
  Triad - Integer
  Add - Integer
JPEG-XL Decoding libjxl
Embree
VVenC
JPEG-XL libjxl:
  PNG - 90
  JPEG - 90
MBW
TensorFlow
MBW
SVT-AV1
Llamafile
JPEG-XL libjxl:
  PNG - 100
  JPEG - 100
PyTorch
Timed Mesa Compilation
oneDNN
7-Zip Compression:
  Decompression Rating
  Compression Rating
NAS Parallel Benchmarks
uvg266
NAS Parallel Benchmarks
uvg266
ASTC Encoder
Xcompact3d Incompact3d
NAS Parallel Benchmarks
uvg266
srsRAN Project
oneDNN
LuxCoreRender
VVenC
uvg266
ASTC Encoder
srsRAN Project
uvg266
SVT-AV1
TensorFlow:
  CPU - 1 - ResNet-50
  CPU - 1 - AlexNet
oneDNN
NAS Parallel Benchmarks
SVT-AV1
RNNoise
MBW
libavif avifenc
MBW
Google Draco
oneDNN
srsRAN Project
Google Draco
SVT-AV1:
  Preset 12 - Bosphorus 4K
  Preset 13 - Bosphorus 4K
uvg266:
  Bosphorus 1080p - Very Fast
  Bosphorus 1080p - Super Fast
libavif avifenc
WavPack Audio Encoding
NAS Parallel Benchmarks
Parallel BZIP2 Compression
SVT-AV1
TensorFlow
libavif avifenc
uvg266
oneDNN
srsRAN Project
SVT-AV1:
  Preset 12 - Bosphorus 1080p
  Preset 13 - Bosphorus 1080p