AMD Ryzen 9 9950X DDR5 Memory Performance

AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2408225-NE-RYZEN999510
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
C++ Boost Tests 2 Tests
Timed Code Compilation 5 Tests
C/C++ Compiler Tests 2 Tests
CPU Massive 13 Tests
Creator Workloads 5 Tests
Fortran Tests 4 Tests
Game Development 2 Tests
HPC - High Performance Computing 16 Tests
Common Kernel Benchmarks 2 Tests
Large Language Models 2 Tests
Machine Learning 7 Tests
Memory Test Suite 2 Tests
Molecular Dynamics 5 Tests
MPI Benchmarks 4 Tests
Multi-Core 12 Tests
NVIDIA GPU Compute 4 Tests
OpenMPI Tests 9 Tests
Programmer / Developer System Benchmarks 6 Tests
Python Tests 6 Tests
Renderers 2 Tests
Scientific Computing 5 Tests
Server 2 Tests
Server CPU Tests 8 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G
August 14
  13 Hours, 26 Minutes
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32
August 22
  10 Hours, 21 Minutes
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36
August 20
  10 Hours, 59 Minutes
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38
August 21
  7 Hours, 49 Minutes
Invert Hiding All Results Option
  10 Hours, 39 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD Ryzen 9 9950X DDR5 Memory Performance - Phoronix Test Suite

AMD Ryzen 9 9950X DDR5 Memory Performance

AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2408225-NE-RYZEN999510&grw&sor&rro.

AMD Ryzen 9 9950X DDR5 Memory PerformanceProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen Resolution2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS)AMD Device 14d82 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G2000GB Corsair MP700 PROAMD Radeon RX 7900 GRE 16GBAMD Navi 31 HDMI/DPDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.10.0-phx (x86_64)GNOME Shell 46.0X Server + Wayland4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)GCC 13.2.0ext43840x21602 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C322 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C362 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Details- Python 3.12.3Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected Java Details- 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32, 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36, 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)

AMD Ryzen 9 9950X DDR5 Memory Performancembw: Memory Copy - 4096 MiBmbw: Memory Copy - 8192 MiBmbw: Memory Copy, Fixed Block Size - 4096 MiBmbw: Memory Copy, Fixed Block Size - 8192 MiBstress-ng: Memory Copyingramspeed: Add - Integerramspeed: Copy - Integerramspeed: Scale - Integerramspeed: Triad - Integerramspeed: Average - Integerbrl-cad: VGR Performance Metricetcpak: Multi-Threaded - ETC2xmrig: GhostRider - 1Mminibude: OpenMP - BM1minibude: OpenMP - BM1minibude: OpenMP - BM2minibude: OpenMP - BM2openradioss: Bumper Beamopenradioss: Chrysler Neon 1Mquicksilver: CORAL2 P1openradioss: Cell Phone Drop Testopenradioss: Bird Strike on Windshieldopenradioss: Rubber O-Ring Seal Installationopenradioss: INIVOL and Fluid Structure Interaction Drop Containertensorflow: CPU - 1 - ResNet-50tensorflow: CPU - 64 - ResNet-50lczero: BLASlczero: Eigennumpy: pytorch: CPU - 1 - ResNet-50pytorch: CPU - 256 - ResNet-50xnnpack: FP32MobileNetV2xnnpack: FP32MobileNetV3Largexnnpack: FP32MobileNetV3Smallxnnpack: FP16MobileNetV2xnnpack: FP16MobileNetV3Largexnnpack: FP16MobileNetV3Smallxnnpack: QU8MobileNetV2xnnpack: QU8MobileNetV3Largexnnpack: QU8MobileNetV3Smallspecfem3d: Homogeneous Halfspacellama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.ggufllamafile: Meta-Llama-3-8B-Instruct.F16 - CPUgromacs: MPI CPU - water_GMX50_barespecfem3d: Water-layered Halfspacespecfem3d: Tomographic Modelspecfem3d: Layered Halfspacespecfem3d: Mount St. Helenshpcg: 104 104 104 - 60npb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cstockfish: Chess Benchmarkx265: Bosphorus 1080pcompress-7zip: Decompression Ratingx265: Bosphorus 4Knpb: IS.Dnpb: MG.Cnpb: SP.Bnpb: SP.Cnamd: ATPase with 327,506 Atomscompress-7zip: Compression Ratingnpb: LU.Cnamd: STMV with 1,066,628 Atomslulesh: openfoam: motorBike - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timepovray: Trace Timey-cruncher: 500My-cruncher: 1Bopenfoam: motorBike - Mesh Timegromacs: water_GMX50_bareopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionjava-jmh: Throughputbuild-llvm: Ninjabuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigblender: BMW27 - CPU-Onlyblender: Junkshop - CPU-Onlyblender: Barbershop - CPU-Onlyembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonluxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUluxcorerender: Orange Juice - CPUluxcorerender: LuxCore Benchmark - CPUlibxsmm: 64libxsmm: 128libxsmm: 32luxcorerender: Rainbow Colors and Prism - CPUbuild2: Time To Compilebuild-gem5: Time To Compilebuild-nodejs: Time To Compilememcached: 1:10memcached: 1:100simdjson: Kostyasimdjson: TopTweetsimdjson: LargeRand2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3822774.94722247.78419534.15919467.39410752.8161079.9365797.0569141.0862428.7065503.93498966741.7843616.61922.12576.8851934.61377.38576.90705.552567666744.90128.7653.16247.0616.9252.272262201068.3684.8957.54126415168129851291770759111875630.6779184578.364.683.14172.19726051223.35909292073.13879708425.6909866448.9356055287.3711144.883678.463823.8027106.1851377778134.5215813338.271505.4724236.2121328.2814528.263.4100719558559920.960.980849945.530958.569522.579849157.14898181.5690516.1017.58716.45284.99491.9272096.566414.090555264.113021891554717298.426301.62947.755590.49546.3862.09448.9636.553241.49145.404.628.645.15237.1472.0121.019.9976.003221.139345.2465915791.347507615.737.7113.592.1222510.46422294.06220817.56920850.88310720.3163967.7070747.0071754.5865845.2168588.84494437739.7363638.61926.89177.0761931.14177.24677.18668.162566000043.42128.0153.23239.2217.3454.302442361070.6185.5258.89127315278159961303776765111576129.8184312688.323.22869.56164246723.00050134169.43938729924.9565499288.7973758570.7711374.513731.973714.6628891.8649194155134.4815751238.301584.9325622.9322908.5815610.303.6046420306562084.721.0160210235.72257.245222.821389148.75845177.789416.1337.24715.42684.98021.9961994.74713.402805360.799243992045826799.54298.04947.783589.33146.5662.21450.7136.109940.94475.424.698.645.14253.3505.9128.020.0877.006219.268341.6795898565.347647187.487.6313.382.1122539.72022817.90020923.98920349.41410608.9665113.5270345.5373724.8866963.0369491.42486954734.5333699.61903.04076.1221904.95376.19877.45573.332566666743.25128.3253.66242.0017.3355.192642471046.9186.7958.49127215538189971302773765112476430.5429844879.003.29868.92093565123.28751738869.81742231125.1946536309.5160160124.0811908.473741.863739.1629713.5549974210134.1915700538.071635.3426596.4623254.4016097.003.5775520081064031.311.0242610578.51655.573823.143631145.88323174.1257216.3957.23615.60688.08072.0091909.665512.974287058.198459689012581592.653304.62148.25847.7263.41459.1236.254641.72385.364.658.555.04258.6508.9132.219.8477.9235874747.737498825.197.5713.242.1022093.98521444.00219747.58420124.94310608.8262771.0066545.2470445.7464832.8666822.49489784731.2293676.51899.80475.9921898.25875.93077.8125136667124.3754.2716.8153.542271064.1781.8956.191285159083610181334797783115478631.9817156778.6672.07947665823.86945318172.438348925.4533190609.1659058570.0311662.443722.563712.4528743.4950101396131.2837.091586.5925715.3422436.8115531.763.5095862375.430.998529832.638516.31985.474513.259573360.480583290340365706.79247.5164.08459.1935.182840.06965.344.608.465.05249.7491.1127.019.3380.1045694123.527350932.887.5012.882.11OpenBenchmarking.org

MBW

Test: Memory Copy - Array Size: 4096 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 4096 MiB2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5K10K15K20K25KSE +/- 227.86, N = 15SE +/- 265.89, N = 3SE +/- 31.03, N = 3SE +/- 72.38, N = 322093.9922510.4622539.7222774.951. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiB2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C365K10K15K20K25KSE +/- 56.11, N = 3SE +/- 49.10, N = 3SE +/- 188.50, N = 12SE +/- 204.03, N = 321444.0022247.7822294.0622817.901. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C364K8K12K16K20KSE +/- 24.77, N = 3SE +/- 36.42, N = 3SE +/- 32.81, N = 3SE +/- 211.85, N = 319534.1619747.5820817.5720923.991. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C324K8K12K16K20KSE +/- 10.56, N = 3SE +/- 239.88, N = 3SE +/- 144.90, N = 15SE +/- 58.10, N = 319467.3920124.9420349.4120850.881. (CC) gcc options: -O3 -march=native

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Memory Copying2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2K4K6K8K10KSE +/- 25.18, N = 3SE +/- 39.56, N = 3SE +/- 72.54, N = 3SE +/- 44.94, N = 310608.8210608.9610720.3110752.811. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3614K28K42K56K70KSE +/- 426.98, N = 3SE +/- 64.38, N = 3SE +/- 382.92, N = 3SE +/- 108.61, N = 361079.9362771.0063967.7065113.521. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3215K30K45K60K75KSE +/- 820.93, N = 3SE +/- 299.96, N = 3SE +/- 762.95, N = 5SE +/- 453.50, N = 365797.0566545.2470345.5370747.001. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3616K32K48K64K80KSE +/- 220.54, N = 3SE +/- 513.22, N = 3SE +/- 266.14, N = 3SE +/- 364.59, N = 369141.0870445.7471754.5873724.881. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3614K28K42K56K70KSE +/- 156.92, N = 3SE +/- 130.09, N = 3SE +/- 33.42, N = 3SE +/- 439.59, N = 362428.7064832.8665845.2166963.031. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3615K30K45K60K75KSE +/- 119.74, N = 3SE +/- 805.54, N = 4SE +/- 204.48, N = 3SE +/- 315.00, N = 365503.9366822.4968588.8469491.421. (CC) gcc options: -O3 -march=native

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.38.2VGR Performance Metric2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G110K220K330K440K550K4869544897844944374989661. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G160320480640800SE +/- 0.82, N = 3SE +/- 1.36, N = 3SE +/- 0.38, N = 3SE +/- 1.25, N = 3731.23734.53739.74741.781. (CXX) g++ options: -flto -pthread

Xmrig

Variant: GhostRider - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C368001600240032004000SE +/- 6.72, N = 3SE +/- 37.04, N = 3SE +/- 36.12, N = 6SE +/- 30.71, N = 93616.63638.63676.53699.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32400800120016002000SE +/- 0.87, N = 3SE +/- 0.31, N = 3SE +/- 6.84, N = 3SE +/- 0.54, N = 31899.801903.041922.131926.891. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220406080100SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.27, N = 3SE +/- 0.02, N = 375.9976.1276.8977.081. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G400800120016002000SE +/- 2.32, N = 3SE +/- 1.03, N = 3SE +/- 0.32, N = 3SE +/- 0.64, N = 31898.261904.951931.141934.611. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 375.9376.2077.2577.391. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beam2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.02, N = 3SE +/- 0.12, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 377.8177.4577.1876.90

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36150300450600750SE +/- 0.28, N = 3SE +/- 1.02, N = 3SE +/- 49.36, N = 9705.55668.16573.33

Quicksilver

Input: CORAL2 P1

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P12 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5M10M15M20M25MSE +/- 16666.67, N = 3SE +/- 41633.32, N = 3SE +/- 92074.85, N = 3SE +/- 21858.13, N = 3251366672566000025666667256766671. (CXX) g++ options: -fopenmp -O3 -march=native

OpenRadioss

Model: Cell Phone Drop Test

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Test2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361020304050SE +/- 0.18, N = 3SE +/- 0.29, N = 3SE +/- 0.15, N = 344.9043.4243.25

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshield2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38306090120150SE +/- 0.28, N = 3SE +/- 0.28, N = 3SE +/- 0.35, N = 3SE +/- 5.81, N = 15128.76128.32128.01124.37

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installation2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1224364860SE +/- 0.06, N = 3SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.10, N = 354.2753.6653.2353.16

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop Container2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3250100150200250SE +/- 0.74, N = 3SE +/- 0.76, N = 3SE +/- 0.99, N = 3247.06242.00239.22

TensorFlow

Device: CPU - Batch Size: 1 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3248121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.8116.9217.3317.34

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 64 - Model: ResNet-502 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361224364860SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 352.2753.5454.3055.19

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: BLAS2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3660120180240300SE +/- 2.65, N = 3SE +/- 2.91, N = 3SE +/- 3.79, N = 32262442641. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: Eigen2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3650100150200250SE +/- 1.45, N = 3SE +/- 2.85, N = 3SE +/- 1.15, N = 32202272362471. (CXX) g++ options: -flto -pthread

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 1.37, N = 3SE +/- 4.91, N = 3SE +/- 8.61, N = 3SE +/- 2.01, N = 31046.911064.171068.361070.61

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3620406080100SE +/- 0.82, N = 5SE +/- 0.27, N = 3SE +/- 0.96, N = 4SE +/- 0.63, N = 381.8984.8985.5286.79MIN: 67.48 / MAX: 84.5MIN: 77.17 / MAX: 85.87MIN: 68.54 / MAX: 88.53MIN: 78.82 / MAX: 88.23

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 256 - Model: ResNet-502 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321326395265SE +/- 0.25, N = 3SE +/- 0.34, N = 3SE +/- 0.36, N = 3SE +/- 0.40, N = 1556.1957.5458.4958.89MIN: 39.35 / MAX: 57.3MIN: 53.14 / MAX: 58.34MIN: 34.6 / MAX: 60.01MIN: 34.79 / MAX: 62.62

XNNPACK

Model: FP32MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 9.00, N = 3SE +/- 8.84, N = 3SE +/- 4.98, N = 3SE +/- 5.21, N = 312851273127212641. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Large2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 3.18, N = 3SE +/- 2.08, N = 3SE +/- 9.64, N = 3SE +/- 19.19, N = 315901553152715161. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Small2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 3.18, N = 3SE +/- 5.00, N = 3SE +/- 2.31, N = 3SE +/- 2.31, N = 38368188158121. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 2.31, N = 3SE +/- 1.20, N = 3SE +/- 3.38, N = 3SE +/- 2.31, N = 310189979969851. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Large2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 8.11, N = 3SE +/- 3.53, N = 3SE +/- 4.58, N = 3SE +/- 2.89, N = 313341303130212911. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Small2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 0.58, N = 3SE +/- 1.86, N = 3SE +/- 1.20, N = 3SE +/- 1.76, N = 37977767737701. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV22 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 3.18, N = 3SE +/- 2.96, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 37837657657591. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Large2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 3.21, N = 3SE +/- 2.73, N = 3SE +/- 1.20, N = 3SE +/- 4.91, N = 311541124111811151. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Small2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 1.33, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 37867647617561. (CXX) g++ options: -O3 -lrt -lm

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous Halfspace2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32714212835SE +/- 0.28, N = 3SE +/- 0.38, N = 3SE +/- 0.39, N = 331.9830.6830.5429.821. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Llama.cpp

Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b3067Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C363691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.328.368.669.001. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

Llamafile

Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU

OpenBenchmarking.orgTokens Per Second, More Is BetterLlamafile 0.8.6Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.0532.1063.1594.2125.2654.68

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bare2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C360.74211.48422.22632.96843.7105SE +/- 0.001, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 33.1413.2283.2981. (CXX) g++ options: -O3 -lm

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered Halfspace2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361632486480SE +/- 0.34, N = 3SE +/- 0.16, N = 2SE +/- 0.34, N = 3SE +/- 0.24, N = 372.2072.0869.5668.921. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic Model2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32612182430SE +/- 0.16, N = 15SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.20, N = 323.8723.3623.2923.001. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered Halfspace2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321632486480SE +/- 0.31, N = 3SE +/- 0.59, N = 3SE +/- 0.59, N = 373.1472.4469.8269.441. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. Helens2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32612182430SE +/- 0.17, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 325.6925.4525.1924.961. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 602 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C363691215SE +/- 0.00125, N = 3SE +/- 0.00142, N = 3SE +/- 0.00584, N = 3SE +/- 0.00741, N = 38.797378.935609.165909.516011. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3613K26K39K52K65KSE +/- 54.79, N = 3SE +/- 74.71, N = 3SE +/- 8.10, N = 3SE +/- 85.26, N = 355287.3758570.0358570.7760124.081. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C363K6K9K12K15KSE +/- 21.24, N = 3SE +/- 36.86, N = 3SE +/- 41.76, N = 3SE +/- 10.26, N = 311144.8811374.5111662.4411908.471. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C368001600240032004000SE +/- 25.12, N = 3SE +/- 21.03, N = 3SE +/- 53.14, N = 3SE +/- 17.86, N = 33678.463722.563731.973741.861. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G8001600240032004000SE +/- 51.16, N = 3SE +/- 36.74, N = 3SE +/- 40.74, N = 3SE +/- 2.85, N = 33712.453714.663739.163823.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C366K12K18K24K30KSE +/- 141.73, N = 3SE +/- 113.15, N = 3SE +/- 30.67, N = 3SE +/- 95.45, N = 327106.1828743.4928891.8629713.551. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Stockfish

Chess Benchmark

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfishChess Benchmark2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G11M22M33M44M55MSE +/- 410305.51, N = 3SE +/- 515794.89, N = 15SE +/- 166041.67, N = 3SE +/- 491754.88, N = 15491941554997421050101396513777781. Stockfish 16 by the Stockfish developers (see AUTHORS file)

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 1080p2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150SE +/- 0.23, N = 3SE +/- 0.13, N = 3SE +/- 0.45, N = 3SE +/- 0.27, N = 3131.28134.19134.48134.521. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Decompression Rating2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30K60K90K120K150KSE +/- 33.91, N = 3SE +/- 69.90, N = 3SE +/- 14.15, N = 31570051575121581331. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 4K2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32918273645SE +/- 0.15, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.15, N = 337.0938.0738.2738.301. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36400800120016002000SE +/- 5.33, N = 3SE +/- 2.77, N = 3SE +/- 5.23, N = 3SE +/- 4.49, N = 31505.471584.931586.591635.341. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C366K12K18K24K30KSE +/- 6.88, N = 3SE +/- 20.77, N = 3SE +/- 10.24, N = 3SE +/- 34.50, N = 324236.2125622.9325715.3426596.461. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C365K10K15K20K25KSE +/- 21.50, N = 3SE +/- 40.85, N = 3SE +/- 26.91, N = 3SE +/- 10.38, N = 321328.2822436.8122908.5823254.401. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C363K6K9K12K15KSE +/- 18.27, N = 3SE +/- 42.19, N = 3SE +/- 16.93, N = 3SE +/- 15.11, N = 314528.2615531.7615610.3016097.001. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAMD

Input: ATPase with 327,506 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: ATPase with 327,506 Atoms2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C320.8111.6222.4333.2444.055SE +/- 0.02586, N = 3SE +/- 0.02740, N = 9SE +/- 0.02618, N = 15SE +/- 0.03856, N = 33.410073.509583.577553.60464

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Compression Rating2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3240K80K120K160K200KSE +/- 322.74, N = 3SE +/- 94.03, N = 3SE +/- 467.61, N = 31955852008102030651. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3614K28K42K56K70KSE +/- 150.93, N = 3SE +/- 130.21, N = 3SE +/- 210.02, N = 3SE +/- 72.30, N = 359920.9662084.7262375.4364031.311. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAMD

Input: STMV with 1,066,628 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: STMV with 1,066,628 Atoms2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C360.23050.4610.69150.9221.1525SE +/- 0.00027, N = 3SE +/- 0.00137, N = 2SE +/- 0.00200, N = 3SE +/- 0.00057, N = 30.980840.998521.016021.02426

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.32 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362K4K6K8K10KSE +/- 38.48, N = 3SE +/- 19.00, N = 3SE +/- 67.88, N = 3SE +/- 13.71, N = 39832.649945.5310235.7210578.521. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

OpenFOAM

Input: motorBike - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36132639526558.5757.2555.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G61218243023.1422.8222.581. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36306090120150157.15148.76145.881. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C364080120160200181.57177.79174.131. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-RayTrace Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.4016.3216.1316.101. POV-Ray 3.7.0.10.unofficial

Y-Cruncher

Pi Digits To Calculate: 500M

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 500M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36246810SE +/- 0.006, N = 3SE +/- 0.016, N = 3SE +/- 0.023, N = 37.5877.2477.236

Y-Cruncher

Pi Digits To Calculate: 1B

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 1B2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3248121620SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 316.4515.6115.43

OpenFOAM

Input: motorBike - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Mesh Time2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322040608010088.0885.4784.9984.981. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACSInput: water_GMX50_bare2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C360.4520.9041.3561.8082.26SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 31.9271.9962.0091. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3650010001500200025002096.571994.751909.671. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3648121620SE +/- 0.08, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 314.0913.4013.2612.971. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361428425670SE +/- 0.09, N = 3SE +/- 0.20, N = 3SE +/- 0.01, N = 2SE +/- 0.14, N = 364.1160.8060.4858.201. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Java JMH

Throughput

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughput2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220000M40000M60000M80000M100000M89012581592.6590340365706.7991554717298.4392045826799.54

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninja2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3270140210280350SE +/- 0.38, N = 2SE +/- 0.12, N = 3SE +/- 0.22, N = 3304.62301.63298.05

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1122334455SE +/- 0.40, N = 3SE +/- 0.30, N = 3SE +/- 0.30, N = 348.2647.7847.76

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfig2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32130260390520650SE +/- 0.15, N = 3SE +/- 0.52, N = 3590.50589.33

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: CPU-Only2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1122334455SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 347.7247.5146.5646.38

Blender

Blend File: Junkshop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: CPU-Only2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1428425670SE +/- 0.17, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 364.0863.4162.2162.09

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: CPU-Only2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G100200300400500SE +/- 0.20, N = 3SE +/- 0.27, N = 3SE +/- 0.26, N = 3SE +/- 0.20, N = 3459.19459.12450.71448.96

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Crown2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G816243240SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 335.1836.1136.2536.55MIN: 34.83 / MAX: 36.01MIN: 35.69 / MAX: 36.98MIN: 35.89 / MAX: 36.94MIN: 36.13 / MAX: 37.34

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C361020304050SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 340.0740.9441.4941.72MIN: 39.86 / MAX: 40.55MIN: 40.73 / MAX: 41.6MIN: 41.24 / MAX: 42.02MIN: 41.48 / MAX: 42.26

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPU2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.21952.4393.65854.8786.0975SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.345.365.405.42MIN: 5.23 / MAX: 5.69MIN: 5.25 / MAX: 5.7MIN: 5.27 / MAX: 5.72MIN: 5.31 / MAX: 5.77

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPU2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.05532.11063.16594.22125.2765SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 34.604.624.654.69MIN: 2.13 / MAX: 5.16MIN: 2.07 / MAX: 5.22MIN: 2.02 / MAX: 5.24MIN: 2.15 / MAX: 5.26

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPU2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.468.558.648.64MIN: 7.49 / MAX: 9.04MIN: 7.48 / MAX: 9.15MIN: 7.64 / MAX: 9.25MIN: 7.67 / MAX: 9.25

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPU2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.15882.31763.47644.63525.794SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 35.045.055.145.15MIN: 2.28 / MAX: 5.66MIN: 2.28 / MAX: 5.66MIN: 2.39 / MAX: 5.75MIN: 2.38 / MAX: 5.77

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 642 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C3660120180240300SE +/- 0.20, N = 3SE +/- 0.53, N = 3SE +/- 0.03, N = 3SE +/- 0.37, N = 3237.1249.7253.3258.61. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

libxsmm

M N K: 128

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 1282 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36110220330440550SE +/- 0.38, N = 3SE +/- 0.33, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3472.0491.1505.9508.91. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36306090120150SE +/- 0.07, N = 3SE +/- 0.26, N = 3SE +/- 0.07, N = 3SE +/- 0.19, N = 3121.0127.0128.0132.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPU2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32510152025SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.15, N = 319.3319.8419.9920.08MIN: 17.37 / MAX: 19.62MIN: 17.86 / MAX: 20.21MIN: 18.05 / MAX: 20.42MIN: 17.96 / MAX: 20.55

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.17Time To Compile2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.99, N = 2SE +/- 0.40, N = 3SE +/- 0.52, N = 3SE +/- 0.16, N = 380.1077.9277.0176.00

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To Compile2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3250100150200250SE +/- 0.11, N = 3SE +/- 0.19, N = 3221.14219.27

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To Compile2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3280160240320400SE +/- 0.34, N = 3SE +/- 0.24, N = 3345.25341.68

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:102 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.3M2.6M3.9M5.2M6.5MSE +/- 4372.40, N = 3SE +/- 11157.27, N = 3SE +/- 11599.22, N = 3SE +/- 15831.56, N = 35694123.525874747.735898565.345915791.341. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:1002 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.6M3.2M4.8M6.4M8MSE +/- 71150.32, N = 3SE +/- 43634.73, N = 3SE +/- 1150.91, N = 3SE +/- 87376.76, N = 37350932.887498825.197507615.737647187.481. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: Kostya2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G246810SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 37.507.577.637.711. (CXX) g++ options: -O3 -lrt

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweet2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3691215SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 312.8813.2413.3813.591. (CXX) g++ options: -O3 -lrt

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandom2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.4770.9541.4311.9082.385SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 32.102.112.112.121. (CXX) g++ options: -O3 -lrt


Phoronix Test Suite v10.8.5