AMD Ryzen 9 9950X DDR5 Memory Performance

AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2408225-NE-RYZEN999510
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
C++ Boost Tests 2 Tests
Timed Code Compilation 5 Tests
C/C++ Compiler Tests 2 Tests
CPU Massive 13 Tests
Creator Workloads 5 Tests
Fortran Tests 4 Tests
Game Development 2 Tests
HPC - High Performance Computing 16 Tests
Common Kernel Benchmarks 2 Tests
Large Language Models 2 Tests
Machine Learning 7 Tests
Memory Test Suite 2 Tests
Molecular Dynamics 5 Tests
MPI Benchmarks 4 Tests
Multi-Core 12 Tests
NVIDIA GPU Compute 4 Tests
OpenMPI Tests 9 Tests
Programmer / Developer System Benchmarks 6 Tests
Python Tests 6 Tests
Renderers 2 Tests
Scientific Computing 5 Tests
Server 2 Tests
Server CPU Tests 8 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G
August 14
  13 Hours, 26 Minutes
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32
August 22
  10 Hours, 21 Minutes
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36
August 20
  10 Hours, 59 Minutes
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38
August 21
  7 Hours, 49 Minutes
Invert Hiding All Results Option
  10 Hours, 39 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD Ryzen 9 9950X DDR5 Memory PerformanceOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS)AMD Device 14d82 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C322 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C362 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C382000GB Corsair MP700 PROAMD Radeon RX 7900 GRE 16GBAMD Navi 31 HDMI/DPDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.10.0-phx (x86_64)GNOME Shell 46.0X Server + Wayland4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)GCC 13.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionAMD Ryzen 9 9950X DDR5 Memory Performance BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a- Python 3.12.3- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32, 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36, 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)

2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38Result OverviewPhoronix Test Suite100%123%145%168%191%OpenFOAMOpenRadiossLeelaChessZeroXcompact3d Incompact3dlibxsmmHigh Performance Conjugate GradientLlama.cppLULESHNAS Parallel BenchmarksRAMspeed SMPBuild2PyTorchNAMDStockfishSPECFEM3DTensorFlowMBWMemcachedEmbreeJava JMHXNNPACKsimdjsonx265BlenderBRL-CADXmrigNumpy BenchmarkLuxCoreRenderQuicksilverPOV-RayminiBUDEEtcpakStress-NG

AMD Ryzen 9 9950X DDR5 Memory Performancembw: Memory Copy - 4096 MiBmbw: Memory Copy - 8192 MiBmbw: Memory Copy, Fixed Block Size - 4096 MiBmbw: Memory Copy, Fixed Block Size - 8192 MiBstress-ng: Memory Copyingramspeed: Add - Integerramspeed: Copy - Integerramspeed: Scale - Integerramspeed: Triad - Integerramspeed: Average - Integerbrl-cad: VGR Performance Metricetcpak: Multi-Threaded - ETC2xmrig: GhostRider - 1Mminibude: OpenMP - BM1minibude: OpenMP - BM1minibude: OpenMP - BM2minibude: OpenMP - BM2openradioss: Bumper Beamopenradioss: Chrysler Neon 1Mquicksilver: CORAL2 P1openradioss: Cell Phone Drop Testopenradioss: Bird Strike on Windshieldopenradioss: Rubber O-Ring Seal Installationopenradioss: INIVOL and Fluid Structure Interaction Drop Containertensorflow: CPU - 1 - ResNet-50tensorflow: CPU - 64 - ResNet-50lczero: BLASlczero: Eigennumpy: pytorch: CPU - 1 - ResNet-50pytorch: CPU - 256 - ResNet-50xnnpack: FP32MobileNetV2xnnpack: FP32MobileNetV3Largexnnpack: FP32MobileNetV3Smallxnnpack: FP16MobileNetV2xnnpack: FP16MobileNetV3Largexnnpack: FP16MobileNetV3Smallxnnpack: QU8MobileNetV2xnnpack: QU8MobileNetV3Largexnnpack: QU8MobileNetV3Smallspecfem3d: Homogeneous Halfspacellama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.ggufllamafile: Meta-Llama-3-8B-Instruct.F16 - CPUgromacs: MPI CPU - water_GMX50_barespecfem3d: Water-layered Halfspacespecfem3d: Tomographic Modelspecfem3d: Layered Halfspacespecfem3d: Mount St. Helenshpcg: 104 104 104 - 60npb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cstockfish: Chess Benchmarkx265: Bosphorus 1080pcompress-7zip: Decompression Ratingx265: Bosphorus 4Knpb: IS.Dnpb: MG.Cnpb: SP.Bnpb: SP.Cnamd: ATPase with 327,506 Atomscompress-7zip: Compression Ratingnpb: LU.Cnamd: STMV with 1,066,628 Atomslulesh: openfoam: motorBike - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timepovray: Trace Timey-cruncher: 500My-cruncher: 1Bopenfoam: motorBike - Mesh Timegromacs: water_GMX50_bareopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionjava-jmh: Throughputbuild-llvm: Ninjabuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigblender: BMW27 - CPU-Onlyblender: Junkshop - CPU-Onlyblender: Barbershop - CPU-Onlyembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonluxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUluxcorerender: Orange Juice - CPUluxcorerender: LuxCore Benchmark - CPUlibxsmm: 64libxsmm: 128libxsmm: 32luxcorerender: Rainbow Colors and Prism - CPUbuild2: Time To Compilebuild-gem5: Time To Compilebuild-nodejs: Time To Compilememcached: 1:10memcached: 1:100simdjson: Kostyasimdjson: TopTweetsimdjson: LargeRand2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3822774.94722247.78419534.15919467.39410752.8161079.9365797.0569141.0862428.7065503.93498966741.7843616.61922.12576.8851934.61377.38576.90705.552567666744.90128.7653.16247.0616.9252.272262201068.3684.8957.54126415168129851291770759111875630.6779184578.364.683.14172.19726051223.35909292073.13879708425.6909866448.9356055287.3711144.883678.463823.8027106.1851377778134.5215813338.271505.4724236.2121328.2814528.263.4100719558559920.960.980849945.530958.569522.579849157.14898181.5690516.1017.58716.45284.99491.9272096.566414.090555264.113021891554717298.426301.62947.755590.49546.3862.09448.9636.553241.49145.404.628.645.15237.1472.0121.019.9976.003221.139345.2465915791.347507615.737.7113.592.1222510.46422294.06220817.56920850.88310720.3163967.7070747.0071754.5865845.2168588.84494437739.7363638.61926.89177.0761931.14177.24677.18668.162566000043.42128.0153.23239.2217.3454.302442361070.6185.5258.89127315278159961303776765111576129.8184312688.323.22869.56164246723.00050134169.43938729924.9565499288.7973758570.7711374.513731.973714.6628891.8649194155134.4815751238.301584.9325622.9322908.5815610.303.6046420306562084.721.0160210235.72257.245222.821389148.75845177.789416.1337.24715.42684.98021.9961994.74713.402805360.799243992045826799.54298.04947.783589.33146.5662.21450.7136.109940.94475.424.698.645.14253.3505.9128.020.0877.006219.268341.6795898565.347647187.487.6313.382.1122539.72022817.90020923.98920349.41410608.9665113.5270345.5373724.8866963.0369491.42486954734.5333699.61903.04076.1221904.95376.19877.45573.332566666743.25128.3253.66242.0017.3355.192642471046.9186.7958.49127215538189971302773765112476430.5429844879.003.29868.92093565123.28751738869.81742231125.1946536309.5160160124.0811908.473741.863739.1629713.5549974210134.1915700538.071635.3426596.4623254.4016097.003.5775520081064031.311.0242610578.51655.573823.143631145.88323174.1257216.3957.23615.60688.08072.0091909.665512.974287058.198459689012581592.653304.62148.25847.7263.41459.1236.254641.72385.364.658.555.04258.6508.9132.219.8477.9235874747.737498825.197.5713.242.1022093.98521444.00219747.58420124.94310608.8262771.0066545.2470445.7464832.8666822.49489784731.2293676.51899.80475.9921898.25875.93077.8125136667124.3754.2716.8153.542271064.1781.8956.191285159083610181334797783115478631.9817156778.6672.07947665823.86945318172.438348925.4533190609.1659058570.0311662.443722.563712.4528743.4950101396131.2837.091586.5925715.3422436.8115531.763.5095862375.430.998529832.638516.31985.474513.259573360.480583290340365706.79247.5164.08459.1935.182840.06965.344.608.465.05249.7491.1127.019.3380.1045694123.527350932.887.5012.882.11OpenBenchmarking.org

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 4096 MiB2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5K10K15K20K25KSE +/- 227.86, N = 15SE +/- 265.89, N = 3SE +/- 31.03, N = 3SE +/- 72.38, N = 322093.9922510.4622539.7222774.951. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiB2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C365K10K15K20K25KSE +/- 56.11, N = 3SE +/- 49.10, N = 3SE +/- 188.50, N = 12SE +/- 204.03, N = 321444.0022247.7822294.0622817.901. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C364K8K12K16K20KSE +/- 24.77, N = 3SE +/- 36.42, N = 3SE +/- 32.81, N = 3SE +/- 211.85, N = 319534.1619747.5820817.5720923.991. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C324K8K12K16K20KSE +/- 10.56, N = 3SE +/- 239.88, N = 3SE +/- 144.90, N = 15SE +/- 58.10, N = 319467.3920124.9420349.4120850.881. (CC) gcc options: -O3 -march=native

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Memory Copying2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2K4K6K8K10KSE +/- 25.18, N = 3SE +/- 39.56, N = 3SE +/- 72.54, N = 3SE +/- 44.94, N = 310608.8210608.9610720.3110752.811. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3614K28K42K56K70KSE +/- 426.98, N = 3SE +/- 64.38, N = 3SE +/- 382.92, N = 3SE +/- 108.61, N = 361079.9362771.0063967.7065113.521. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3215K30K45K60K75KSE +/- 820.93, N = 3SE +/- 299.96, N = 3SE +/- 762.95, N = 5SE +/- 453.50, N = 365797.0566545.2470345.5370747.001. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3616K32K48K64K80KSE +/- 220.54, N = 3SE +/- 513.22, N = 3SE +/- 266.14, N = 3SE +/- 364.59, N = 369141.0870445.7471754.5873724.881. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3614K28K42K56K70KSE +/- 156.92, N = 3SE +/- 130.09, N = 3SE +/- 33.42, N = 3SE +/- 439.59, N = 362428.7064832.8665845.2166963.031. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3615K30K45K60K75KSE +/- 119.74, N = 3SE +/- 805.54, N = 4SE +/- 204.48, N = 3SE +/- 315.00, N = 365503.9366822.4968588.8469491.421. (CC) gcc options: -O3 -march=native

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.38.2VGR Performance Metric2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G110K220K330K440K550K4869544897844944374989661. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

Etcpak

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G160320480640800SE +/- 0.82, N = 3SE +/- 1.36, N = 3SE +/- 0.38, N = 3SE +/- 1.25, N = 3731.23734.53739.74741.781. (CXX) g++ options: -flto -pthread

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C368001600240032004000SE +/- 6.72, N = 3SE +/- 37.04, N = 3SE +/- 36.12, N = 6SE +/- 30.71, N = 93616.63638.63676.53699.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32400800120016002000SE +/- 0.87, N = 3SE +/- 0.31, N = 3SE +/- 6.84, N = 3SE +/- 0.54, N = 31899.801903.041922.131926.891. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220406080100SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.27, N = 3SE +/- 0.02, N = 375.9976.1276.8977.081. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G400800120016002000SE +/- 2.32, N = 3SE +/- 1.03, N = 3SE +/- 0.32, N = 3SE +/- 0.64, N = 31898.261904.951931.141934.611. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 375.9376.2077.2577.391. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beam2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.02, N = 3SE +/- 0.12, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 377.8177.4577.1876.90

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36150300450600750SE +/- 0.28, N = 3SE +/- 1.02, N = 3SE +/- 49.36, N = 9705.55668.16573.33

Quicksilver

Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P12 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5M10M15M20M25MSE +/- 16666.67, N = 3SE +/- 41633.32, N = 3SE +/- 92074.85, N = 3SE +/- 21858.13, N = 3251366672566000025666667256766671. (CXX) g++ options: -fopenmp -O3 -march=native

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Test2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361020304050SE +/- 0.18, N = 3SE +/- 0.29, N = 3SE +/- 0.15, N = 344.9043.4243.25

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshield2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38306090120150SE +/- 0.28, N = 3SE +/- 0.28, N = 3SE +/- 0.35, N = 3SE +/- 5.81, N = 15128.76128.32128.01124.37

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installation2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1224364860SE +/- 0.06, N = 3SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.10, N = 354.2753.6653.2353.16

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop Container2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3250100150200250SE +/- 0.74, N = 3SE +/- 0.76, N = 3SE +/- 0.99, N = 3247.06242.00239.22

Model: INIVOL and Fluid Structure Interaction Drop Container

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test run did not produce a result. E: ** ERROR: FILE fsi_drop_container_0000_0001.rst NOT FOUND

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3248121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.8116.9217.3317.34

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 64 - Model: ResNet-502 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361224364860SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 352.2753.5454.3055.19

LeelaChessZero

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: BLAS2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3660120180240300SE +/- 2.65, N = 3SE +/- 2.91, N = 3SE +/- 3.79, N = 32262442641. (CXX) g++ options: -flto -pthread

Backend: BLAS

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: Eigen2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3650100150200250SE +/- 1.45, N = 3SE +/- 2.85, N = 3SE +/- 1.15, N = 32202272362471. (CXX) g++ options: -flto -pthread

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 1.37, N = 3SE +/- 4.91, N = 3SE +/- 8.61, N = 3SE +/- 2.01, N = 31046.911064.171068.361070.61

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3620406080100SE +/- 0.82, N = 5SE +/- 0.27, N = 3SE +/- 0.96, N = 4SE +/- 0.63, N = 381.8984.8985.5286.79MIN: 67.48 / MAX: 84.5MIN: 77.17 / MAX: 85.87MIN: 68.54 / MAX: 88.53MIN: 78.82 / MAX: 88.23

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 256 - Model: ResNet-502 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321326395265SE +/- 0.25, N = 3SE +/- 0.34, N = 3SE +/- 0.36, N = 3SE +/- 0.40, N = 1556.1957.5458.4958.89MIN: 39.35 / MAX: 57.3MIN: 53.14 / MAX: 58.34MIN: 34.6 / MAX: 60.01MIN: 34.79 / MAX: 62.62

XNNPACK

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 9.00, N = 3SE +/- 8.84, N = 3SE +/- 4.98, N = 3SE +/- 5.21, N = 312851273127212641. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Large2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 3.18, N = 3SE +/- 2.08, N = 3SE +/- 9.64, N = 3SE +/- 19.19, N = 315901553152715161. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Small2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 3.18, N = 3SE +/- 5.00, N = 3SE +/- 2.31, N = 3SE +/- 2.31, N = 38368188158121. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 2.31, N = 3SE +/- 1.20, N = 3SE +/- 3.38, N = 3SE +/- 2.31, N = 310189979969851. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Large2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 8.11, N = 3SE +/- 3.53, N = 3SE +/- 4.58, N = 3SE +/- 2.89, N = 313341303130212911. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Small2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 0.58, N = 3SE +/- 1.86, N = 3SE +/- 1.20, N = 3SE +/- 1.76, N = 37977767737701. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 3.18, N = 3SE +/- 2.96, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 37837657657591. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Large2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 3.21, N = 3SE +/- 2.73, N = 3SE +/- 1.20, N = 3SE +/- 4.91, N = 311541124111811151. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Small2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 1.33, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 37867647617561. (CXX) g++ options: -O3 -lrt -lm

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous Halfspace2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32714212835SE +/- 0.28, N = 3SE +/- 0.38, N = 3SE +/- 0.39, N = 331.9830.6830.5429.821. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Llama.cpp

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b3067Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C363691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.328.368.669.001. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

Llamafile

OpenBenchmarking.orgTokens Per Second, More Is BetterLlamafile 0.8.6Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.0532.1063.1594.2125.2654.68

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bare2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C360.74211.48422.22632.96843.7105SE +/- 0.001, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 33.1413.2283.2981. (CXX) g++ options: -O3 -lm

Implementation: MPI CPU - Input: water_GMX50_bare

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-System-Product-Name exited on signal 11 (Segmentation fault).

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered Halfspace2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361632486480SE +/- 0.34, N = 3SE +/- 0.16, N = 2SE +/- 0.34, N = 3SE +/- 0.24, N = 372.2072.0869.5668.921. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic Model2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32612182430SE +/- 0.16, N = 15SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.20, N = 323.8723.3623.2923.001. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered Halfspace2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321632486480SE +/- 0.31, N = 3SE +/- 0.59, N = 3SE +/- 0.59, N = 373.1472.4469.8269.441. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. Helens2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32612182430SE +/- 0.17, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 325.6925.4525.1924.961. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 602 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C363691215SE +/- 0.00125, N = 3SE +/- 0.00142, N = 3SE +/- 0.00584, N = 3SE +/- 0.00741, N = 38.797378.935609.165909.516011. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3613K26K39K52K65KSE +/- 54.79, N = 3SE +/- 74.71, N = 3SE +/- 8.10, N = 3SE +/- 85.26, N = 355287.3758570.0358570.7760124.081. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C363K6K9K12K15KSE +/- 21.24, N = 3SE +/- 36.86, N = 3SE +/- 41.76, N = 3SE +/- 10.26, N = 311144.8811374.5111662.4411908.471. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C368001600240032004000SE +/- 25.12, N = 3SE +/- 21.03, N = 3SE +/- 53.14, N = 3SE +/- 17.86, N = 33678.463722.563731.973741.861. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G8001600240032004000SE +/- 51.16, N = 3SE +/- 36.74, N = 3SE +/- 40.74, N = 3SE +/- 2.85, N = 33712.453714.663739.163823.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C366K12K18K24K30KSE +/- 141.73, N = 3SE +/- 113.15, N = 3SE +/- 30.67, N = 3SE +/- 95.45, N = 327106.1828743.4928891.8629713.551. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Stockfish

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfishChess Benchmark2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G11M22M33M44M55MSE +/- 410305.51, N = 3SE +/- 515794.89, N = 15SE +/- 166041.67, N = 3SE +/- 491754.88, N = 15491941554997421050101396513777781. Stockfish 16 by the Stockfish developers (see AUTHORS file)

x265

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 1080p2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150SE +/- 0.23, N = 3SE +/- 0.13, N = 3SE +/- 0.45, N = 3SE +/- 0.27, N = 3131.28134.19134.48134.521. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

7-Zip Compression

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Decompression Rating2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30K60K90K120K150KSE +/- 33.91, N = 3SE +/- 69.90, N = 3SE +/- 14.15, N = 31570051575121581331. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Test: Decompression Rating

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

x265

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 4K2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32918273645SE +/- 0.15, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.15, N = 337.0938.0738.2738.301. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36400800120016002000SE +/- 5.33, N = 3SE +/- 2.77, N = 3SE +/- 5.23, N = 3SE +/- 4.49, N = 31505.471584.931586.591635.341. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C366K12K18K24K30KSE +/- 6.88, N = 3SE +/- 20.77, N = 3SE +/- 10.24, N = 3SE +/- 34.50, N = 324236.2125622.9325715.3426596.461. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C365K10K15K20K25KSE +/- 21.50, N = 3SE +/- 40.85, N = 3SE +/- 26.91, N = 3SE +/- 10.38, N = 321328.2822436.8122908.5823254.401. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C363K6K9K12K15KSE +/- 18.27, N = 3SE +/- 42.19, N = 3SE +/- 16.93, N = 3SE +/- 15.11, N = 314528.2615531.7615610.3016097.001. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAMD

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: ATPase with 327,506 Atoms2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C320.8111.6222.4333.2444.055SE +/- 0.02586, N = 3SE +/- 0.02740, N = 9SE +/- 0.02618, N = 15SE +/- 0.03856, N = 33.410073.509583.577553.60464

7-Zip Compression

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Compression Rating2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3240K80K120K160K200KSE +/- 322.74, N = 3SE +/- 94.03, N = 3SE +/- 467.61, N = 31955852008102030651. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Test: Compression Rating

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3614K28K42K56K70KSE +/- 150.93, N = 3SE +/- 130.21, N = 3SE +/- 210.02, N = 3SE +/- 72.30, N = 359920.9662084.7262375.4364031.311. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAMD

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: STMV with 1,066,628 Atoms2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C360.23050.4610.69150.9221.1525SE +/- 0.00027, N = 3SE +/- 0.00137, N = 2SE +/- 0.00200, N = 3SE +/- 0.00057, N = 30.980840.998521.016021.02426

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.32 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362K4K6K8K10KSE +/- 38.48, N = 3SE +/- 19.00, N = 3SE +/- 67.88, N = 3SE +/- 13.71, N = 39832.649945.5310235.7210578.521. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36132639526558.5757.2555.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G61218243023.1422.8222.581. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Small Mesh Size - Mesh Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] #0 Foam::error::printStack(Foam::Ostream&) at ??:?

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36306090120150157.15148.76145.881. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Small Mesh Size - Execution Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C364080120160200181.57177.79174.131. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Medium Mesh Size - Mesh Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.

POV-Ray

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-RayTrace Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.4016.3216.1316.101. POV-Ray 3.7.0.10.unofficial

Y-Cruncher

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 500M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36246810SE +/- 0.006, N = 3SE +/- 0.016, N = 3SE +/- 0.023, N = 37.5877.2477.236

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 1B2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3248121620SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 316.4515.6115.43

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Mesh Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322040608010088.0885.4784.9984.981. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

GROMACS

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACSInput: water_GMX50_bare2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C360.4520.9041.3561.8082.26SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 31.9271.9962.0091. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

Input: water_GMX50_bare

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Fatal error:

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3650010001500200025002096.571994.751909.671. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Medium Mesh Size - Execution Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] --> FOAM FATAL ERROR:

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3648121620SE +/- 0.08, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 314.0913.4013.2612.971. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361428425670SE +/- 0.09, N = 3SE +/- 0.20, N = 3SE +/- 0.01, N = 2SE +/- 0.14, N = 364.1160.8060.4858.201. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Java JMH

This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughput2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220000M40000M60000M80000M100000M89012581592.6590340365706.7991554717298.4392045826799.54

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninja2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3270140210280350SE +/- 0.38, N = 2SE +/- 0.12, N = 3SE +/- 0.22, N = 3304.62301.63298.05

Build System: Ninja

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: /usr/include/c++/13/bits/vector.tcc:445:7: internal compiler error: Segmentation fault

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1122334455SE +/- 0.40, N = 3SE +/- 0.30, N = 3SE +/- 0.30, N = 348.2647.7847.76

Build: defconfig

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: : internal compiler error: Segmentation fault

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfig2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32130260390520650SE +/- 0.15, N = 3SE +/- 0.52, N = 3590.50589.33

Build: allmodconfig

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: gcc: internal compiler error: Segmentation fault signal terminated program as

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: CPU-Only2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1122334455SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 347.7247.5146.5646.38

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: CPU-Only2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1428425670SE +/- 0.17, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 364.0863.4162.2162.09

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: CPU-Only2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G100200300400500SE +/- 0.20, N = 3SE +/- 0.27, N = 3SE +/- 0.26, N = 3SE +/- 0.20, N = 3459.19459.12450.71448.96

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Crown2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G816243240SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 335.1836.1136.2536.55MIN: 34.83 / MAX: 36.01MIN: 35.69 / MAX: 36.98MIN: 35.89 / MAX: 36.94MIN: 36.13 / MAX: 37.34

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361020304050SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 340.0740.9441.4941.72MIN: 39.86 / MAX: 40.55MIN: 40.73 / MAX: 41.6MIN: 41.24 / MAX: 42.02MIN: 41.48 / MAX: 42.26

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPU2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.21952.4393.65854.8786.0975SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.345.365.405.42MIN: 5.23 / MAX: 5.69MIN: 5.25 / MAX: 5.7MIN: 5.27 / MAX: 5.72MIN: 5.31 / MAX: 5.77

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPU2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.05532.11063.16594.22125.2765SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 34.604.624.654.69MIN: 2.13 / MAX: 5.16MIN: 2.07 / MAX: 5.22MIN: 2.02 / MAX: 5.24MIN: 2.15 / MAX: 5.26

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPU2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.468.558.648.64MIN: 7.49 / MAX: 9.04MIN: 7.48 / MAX: 9.15MIN: 7.64 / MAX: 9.25MIN: 7.67 / MAX: 9.25

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPU2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.15882.31763.47644.63525.794SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 35.045.055.145.15MIN: 2.28 / MAX: 5.66MIN: 2.28 / MAX: 5.66MIN: 2.39 / MAX: 5.75MIN: 2.38 / MAX: 5.77

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 642 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3660120180240300SE +/- 0.20, N = 3SE +/- 0.53, N = 3SE +/- 0.03, N = 3SE +/- 0.37, N = 3237.1249.7253.3258.61. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 1282 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36110220330440550SE +/- 0.38, N = 3SE +/- 0.33, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3472.0491.1505.9508.91. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36306090120150SE +/- 0.07, N = 3SE +/- 0.26, N = 3SE +/- 0.07, N = 3SE +/- 0.19, N = 3121.0127.0128.0132.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPU2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32510152025SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.15, N = 319.3319.8419.9920.08MIN: 17.37 / MAX: 19.62MIN: 17.86 / MAX: 20.21MIN: 18.05 / MAX: 20.42MIN: 17.96 / MAX: 20.55

Build2

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.17Time To Compile2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.99, N = 2SE +/- 0.40, N = 3SE +/- 0.52, N = 3SE +/- 0.16, N = 380.1077.9277.0176.00

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To Compile2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3250100150200250SE +/- 0.11, N = 3SE +/- 0.19, N = 3221.14219.27

Time To Compile

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Warning: Protocol buffer compiler (protoc) not found.

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To Compile2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3280160240320400SE +/- 0.34, N = 3SE +/- 0.24, N = 3345.25341.68

Time To Compile

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:102 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.3M2.6M3.9M5.2M6.5MSE +/- 4372.40, N = 3SE +/- 11157.27, N = 3SE +/- 11599.22, N = 3SE +/- 15831.56, N = 35694123.525874747.735898565.345915791.341. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:1002 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.6M3.2M4.8M6.4M8MSE +/- 71150.32, N = 3SE +/- 43634.73, N = 3SE +/- 1150.91, N = 3SE +/- 87376.76, N = 37350932.887498825.197507615.737647187.481. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

simdjson

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: Kostya2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G246810SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 37.507.577.637.711. (CXX) g++ options: -O3 -lrt

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweet2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3691215SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 312.8813.2413.3813.591. (CXX) g++ options: -O3 -lrt

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandom2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.4770.9541.4311.9082.385SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 32.102.112.112.121. (CXX) g++ options: -O3 -lrt

104 Results Shown

MBW:
  Memory Copy - 4096 MiB
  Memory Copy - 8192 MiB
  Memory Copy, Fixed Block Size - 4096 MiB
  Memory Copy, Fixed Block Size - 8192 MiB
Stress-NG
RAMspeed SMP:
  Add - Integer
  Copy - Integer
  Scale - Integer
  Triad - Integer
  Average - Integer
BRL-CAD
Etcpak
Xmrig
miniBUDE:
  OpenMP - BM1:
    GFInst/s
    Billion Interactions/s
  OpenMP - BM2:
    GFInst/s
    Billion Interactions/s
OpenRadioss:
  Bumper Beam
  Chrysler Neon 1M
Quicksilver
OpenRadioss:
  Cell Phone Drop Test
  Bird Strike on Windshield
  Rubber O-Ring Seal Installation
  INIVOL and Fluid Structure Interaction Drop Container
TensorFlow:
  CPU - 1 - ResNet-50
  CPU - 64 - ResNet-50
LeelaChessZero:
  BLAS
  Eigen
Numpy Benchmark
PyTorch:
  CPU - 1 - ResNet-50
  CPU - 256 - ResNet-50
XNNPACK:
  FP32MobileNetV2
  FP32MobileNetV3Large
  FP32MobileNetV3Small
  FP16MobileNetV2
  FP16MobileNetV3Large
  FP16MobileNetV3Small
  QU8MobileNetV2
  QU8MobileNetV3Large
  QU8MobileNetV3Small
SPECFEM3D
Llama.cpp
Llamafile
GROMACS
SPECFEM3D:
  Water-layered Halfspace
  Tomographic Model
  Layered Halfspace
  Mount St. Helens
High Performance Conjugate Gradient
NAS Parallel Benchmarks:
  BT.C
  CG.C
  EP.C
  EP.D
  FT.C
Stockfish
x265
7-Zip Compression
x265
NAS Parallel Benchmarks:
  IS.D
  MG.C
  SP.B
  SP.C
NAMD
7-Zip Compression
NAS Parallel Benchmarks
NAMD
LULESH
OpenFOAM:
  motorBike - Execution Time
  drivaerFastback, Small Mesh Size - Mesh Time
  drivaerFastback, Small Mesh Size - Execution Time
  drivaerFastback, Medium Mesh Size - Mesh Time
POV-Ray
Y-Cruncher:
  500M
  1B
OpenFOAM
GROMACS
OpenFOAM
Xcompact3d Incompact3d:
  input.i3d 129 Cells Per Direction
  input.i3d 193 Cells Per Direction
Java JMH
Timed LLVM Compilation
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Blender:
  BMW27 - CPU-Only
  Junkshop - CPU-Only
  Barbershop - CPU-Only
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon
LuxCoreRender:
  DLSC - CPU
  Danish Mood - CPU
  Orange Juice - CPU
  LuxCore Benchmark - CPU
libxsmm:
  64
  128
  32
LuxCoreRender
Build2
Timed Gem5 Compilation
Timed Node.js Compilation
Memcached:
  1:10
  1:100
simdjson:
  Kostya
  TopTweet
  LargeRand