AMD Ryzen 9 9950X DDR5 Memory Performance

AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2408225-NE-RYZEN999510&sro&grs.

AMD Ryzen 9 9950X DDR5 Memory PerformanceProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen Resolution2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS)AMD Device 14d82 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G2000GB Corsair MP700 PROAMD Radeon RX 7900 GRE 16GBAMD Navi 31 HDMI/DPDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.10.0-phx (x86_64)GNOME Shell 46.0X Server + Wayland4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)GCC 13.2.0ext43840x21602 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C322 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C362 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Details- Python 3.12.3Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected Java Details- 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32, 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36, 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)

AMD Ryzen 9 9950X DDR5 Memory Performancelczero: BLASlczero: Eigennpb: SP.Cincompact3d: input.i3d 193 Cells Per Directionopenfoam: drivaerFastback, Medium Mesh Size - Execution Timenpb: MG.Cnpb: FT.Clibxsmm: 32libxsmm: 64npb: SP.Bnpb: BT.Cnpb: IS.Dincompact3d: input.i3d 129 Cells Per Directionllama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.ggufhpcg: 104 104 104 - 60libxsmm: 128openfoam: drivaerFastback, Small Mesh Size - Execution Timelulesh: ramspeed: Copy - Integerramspeed: Triad - Integerspecfem3d: Homogeneous Halfspacembw: Memory Copy, Fixed Block Size - 4096 MiBmbw: Memory Copy, Fixed Block Size - 8192 MiBnpb: LU.Cnpb: CG.Cy-cruncher: 1Bramspeed: Scale - Integerramspeed: Add - Integermbw: Memory Copy - 8192 MiBramspeed: Average - Integerpytorch: CPU - 1 - ResNet-50namd: ATPase with 327,506 Atomstensorflow: CPU - 64 - ResNet-50simdjson: TopTweetbuild2: Time To Compileopenfoam: motorBike - Execution Timespecfem3d: Layered Halfspacegromacs: MPI CPU - water_GMX50_barexnnpack: FP32MobileNetV3Largey-cruncher: 500Mpytorch: CPU - 256 - ResNet-50specfem3d: Water-layered Halfspacestockfish: Chess Benchmarknamd: STMV with 1,066,628 Atomsopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timegromacs: water_GMX50_bareembree: Pathtracer ISPC - Asian Dragonmemcached: 1:100xnnpack: QU8MobileNetV3Smallembree: Pathtracer ISPC - Crownmemcached: 1:10luxcorerender: Rainbow Colors and Prism - CPUcompress-7zip: Compression Ratingopenradioss: Cell Phone Drop Testspecfem3d: Tomographic Modelopenfoam: motorBike - Mesh Timexnnpack: FP16MobileNetV3Smallxnnpack: QU8MobileNetV3Largejava-jmh: Throughputxnnpack: FP16MobileNetV2xnnpack: FP16MobileNetV3Largeopenradioss: INIVOL and Fluid Structure Interaction Drop Containerx265: Bosphorus 4Kblender: Junkshop - CPU-Onlyxnnpack: QU8MobileNetV2tensorflow: CPU - 1 - ResNet-50mbw: Memory Copy - 4096 MiBnpb: EP.Dxnnpack: FP32MobileNetV3Smallspecfem3d: Mount St. Helensblender: BMW27 - CPU-Onlysimdjson: Kostyaopenfoam: drivaerFastback, Small Mesh Size - Mesh Timex265: Bosphorus 1080pbrl-cad: VGR Performance Metricxmrig: GhostRider - 1Mblender: Barbershop - CPU-Onlynumpy: build-llvm: Ninjaluxcorerender: LuxCore Benchmark - CPUquicksilver: CORAL2 P1luxcorerender: Orange Juice - CPUopenradioss: Rubber O-Ring Seal Installationluxcorerender: Danish Mood - CPUminibude: OpenMP - BM2minibude: OpenMP - BM2povray: Trace Timenpb: EP.Cxnnpack: FP32MobileNetV2luxcorerender: DLSC - CPUetcpak: Multi-Threaded - ETC2minibude: OpenMP - BM1minibude: OpenMP - BM1stress-ng: Memory Copyingopenradioss: Bumper Beambuild-linux-kernel: defconfigbuild-nodejs: Time To Compilesimdjson: LargeRandbuild-gem5: Time To Compilecompress-7zip: Decompression Ratingbuild-linux-kernel: allmodconfigllamafile: Meta-Llama-3-8B-Instruct.F16 - CPUopenradioss: Bird Strike on Windshieldopenradioss: Chrysler Neon 1M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C3822622014528.2664.11302182096.566424236.2127106.18121.0237.121328.2855287.371505.4714.09055528.368.93560472.0157.148989945.530965797.0562428.7030.67791845719534.15919467.39459920.9611144.8816.45269141.0861079.9322247.78465503.9384.893.4100752.2713.5976.00358.569573.1387970843.14115167.58757.5472.197260512513777780.98084181.569051.92741.49147507615.7375636.55325915791.3419.9919558544.9023.35909292084.9949770111891554717298.4269851291247.0638.2762.0975916.9222774.9473823.8081225.69098664446.387.7122.579849134.524989663616.6448.961068.36301.6295.15256766678.6453.164.6277.3851934.61316.1013678.4612645.40741.78476.8851922.12510752.8176.9047.755345.2462.12221.139158133590.4954.68128.76705.5524423615610.3060.79924391994.74725622.9328891.86128.0253.322908.5858570.771584.9313.40280538.328.79737505.9148.7584510235.72270747.0065845.2129.81843126820817.56920850.88362084.7211374.5115.42671754.5863967.7022294.06268588.8485.523.6046454.3013.3877.00657.245269.4393872993.22815277.24758.8969.561642467491941551.01602177.78941.99640.94477647187.4876136.10995898565.3420.0820306543.4223.00050134184.9802776111592045826799.549961303239.2238.3062.2176517.3422510.4643714.6681524.95654992846.567.6322.821389134.484944373638.6450.711070.61298.0495.14256600008.6453.234.6977.2461931.14116.1333731.9712735.42739.73677.0761926.89110720.3177.1847.783341.6792.11219.268157512589.331128.01668.1626424716097.0058.19845961909.665526596.4629713.55132.2258.623254.4060124.081635.3412.97428709.009.51601508.9145.8832310578.51670345.5366963.0330.54298448720923.98920349.41464031.3111908.4715.60673724.8865113.5222817.90069491.4286.793.5775555.1913.2477.92355.573869.8174223113.29815537.23658.4968.920935651499742101.02426174.125722.00941.72387498825.1976436.25465874747.7319.8420081043.2523.28751738888.0807773112489012581592.6539971302242.0038.0763.4176517.3322539.7203739.1681825.19465363047.727.5723.143631134.194869543699.6459.121046.91304.6215.04256666678.5553.664.6576.1981904.95316.3953741.8612725.36734.53376.1221903.04010608.9677.4548.2582.10157005128.32573.3322715531.7660.480583225715.3428743.49127.0249.722436.8158570.031586.5913.25957338.669.16590491.19832.638566545.2464832.8631.98171567719747.58420124.94362375.4311662.4470445.7462771.0021444.00266822.4981.893.5095853.5412.8880.10472.4383489159056.1972.079476658501013960.9985240.06967350932.8878635.18285694123.5219.3323.86945318185.4745797115490340365706.7921018133437.0964.0878316.8122093.9853712.4583625.45331906047.517.50131.284897843676.5459.191064.175.05251366678.4654.274.6075.9301898.25816.3193722.5612855.34731.22975.9921899.80410608.8277.812.11124.37OpenBenchmarking.org

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: BLAS2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3260120180240300SE +/- 2.65, N = 3SE +/- 3.79, N = 3SE +/- 2.91, N = 32262642441. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: Eigen2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3250100150200250SE +/- 1.45, N = 3SE +/- 1.15, N = 3SE +/- 2.85, N = 32202472272361. (CXX) g++ options: -flto -pthread

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C323K6K9K12K15KSE +/- 18.27, N = 3SE +/- 15.11, N = 3SE +/- 42.19, N = 3SE +/- 16.93, N = 314528.2616097.0015531.7615610.301. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321428425670SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.01, N = 2SE +/- 0.20, N = 364.1158.2060.4860.801. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3250010001500200025002096.571909.671994.751. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C326K12K18K24K30KSE +/- 6.88, N = 3SE +/- 34.50, N = 3SE +/- 10.24, N = 3SE +/- 20.77, N = 324236.2126596.4625715.3425622.931. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C326K12K18K24K30KSE +/- 141.73, N = 3SE +/- 95.45, N = 3SE +/- 113.15, N = 3SE +/- 30.67, N = 327106.1829713.5528743.4928891.861. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32306090120150SE +/- 0.07, N = 3SE +/- 0.19, N = 3SE +/- 0.26, N = 3SE +/- 0.07, N = 3121.0132.2127.0128.01. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 642 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3260120180240300SE +/- 0.20, N = 3SE +/- 0.37, N = 3SE +/- 0.53, N = 3SE +/- 0.03, N = 3237.1258.6249.7253.31. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C325K10K15K20K25KSE +/- 21.50, N = 3SE +/- 10.38, N = 3SE +/- 40.85, N = 3SE +/- 26.91, N = 321328.2823254.4022436.8122908.581. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3213K26K39K52K65KSE +/- 54.79, N = 3SE +/- 85.26, N = 3SE +/- 74.71, N = 3SE +/- 8.10, N = 355287.3760124.0858570.0358570.771. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32400800120016002000SE +/- 5.33, N = 3SE +/- 4.49, N = 3SE +/- 5.23, N = 3SE +/- 2.77, N = 31505.471635.341586.591584.931. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3248121620SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.13, N = 314.0912.9713.2613.401. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Llama.cpp

Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b3067Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C323691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.369.008.668.321. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 602 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C323691215SE +/- 0.00142, N = 3SE +/- 0.00741, N = 3SE +/- 0.00584, N = 3SE +/- 0.00125, N = 38.935609.516019.165908.797371. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

libxsmm

M N K: 128

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 1282 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32110220330440550SE +/- 0.38, N = 3SE +/- 0.09, N = 3SE +/- 0.33, N = 3SE +/- 0.10, N = 3472.0508.9491.1505.91. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32306090120150157.15145.88148.761. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.32 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322K4K6K8K10KSE +/- 19.00, N = 3SE +/- 13.71, N = 3SE +/- 38.48, N = 3SE +/- 67.88, N = 39945.5310578.529832.6410235.721. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3215K30K45K60K75KSE +/- 820.93, N = 3SE +/- 762.95, N = 5SE +/- 299.96, N = 3SE +/- 453.50, N = 365797.0570345.5366545.2470747.001. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3214K28K42K56K70KSE +/- 156.92, N = 3SE +/- 439.59, N = 3SE +/- 130.09, N = 3SE +/- 33.42, N = 362428.7066963.0364832.8665845.211. (CC) gcc options: -O3 -march=native

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous Halfspace2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32714212835SE +/- 0.28, N = 3SE +/- 0.38, N = 3SE +/- 0.39, N = 330.6830.5431.9829.821. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C324K8K12K16K20KSE +/- 24.77, N = 3SE +/- 211.85, N = 3SE +/- 36.42, N = 3SE +/- 32.81, N = 319534.1620923.9919747.5820817.571. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C324K8K12K16K20KSE +/- 10.56, N = 3SE +/- 144.90, N = 15SE +/- 239.88, N = 3SE +/- 58.10, N = 319467.3920349.4120124.9420850.881. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3214K28K42K56K70KSE +/- 150.93, N = 3SE +/- 72.30, N = 3SE +/- 210.02, N = 3SE +/- 130.21, N = 359920.9664031.3162375.4362084.721. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C323K6K9K12K15KSE +/- 21.24, N = 3SE +/- 10.26, N = 3SE +/- 41.76, N = 3SE +/- 36.86, N = 311144.8811908.4711662.4411374.511. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Y-Cruncher

Pi Digits To Calculate: 1B

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 1B2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3248121620SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 316.4515.6115.43

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3216K32K48K64K80KSE +/- 220.54, N = 3SE +/- 364.59, N = 3SE +/- 513.22, N = 3SE +/- 266.14, N = 369141.0873724.8870445.7471754.581. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3214K28K42K56K70KSE +/- 426.98, N = 3SE +/- 108.61, N = 3SE +/- 64.38, N = 3SE +/- 382.92, N = 361079.9365113.5262771.0063967.701. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C325K10K15K20K25KSE +/- 49.10, N = 3SE +/- 204.03, N = 3SE +/- 56.11, N = 3SE +/- 188.50, N = 1222247.7822817.9021444.0022294.061. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integer2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3215K30K45K60K75KSE +/- 119.74, N = 3SE +/- 315.00, N = 3SE +/- 805.54, N = 4SE +/- 204.48, N = 365503.9369491.4266822.4968588.841. (CC) gcc options: -O3 -march=native

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220406080100SE +/- 0.27, N = 3SE +/- 0.63, N = 3SE +/- 0.82, N = 5SE +/- 0.96, N = 484.8986.7981.8985.52MIN: 77.17 / MAX: 85.87MIN: 78.82 / MAX: 88.23MIN: 67.48 / MAX: 84.5MIN: 68.54 / MAX: 88.53

NAMD

Input: ATPase with 327,506 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: ATPase with 327,506 Atoms2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C320.8111.6222.4333.2444.055SE +/- 0.02586, N = 3SE +/- 0.02618, N = 15SE +/- 0.02740, N = 9SE +/- 0.03856, N = 33.410073.577553.509583.60464

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 64 - Model: ResNet-502 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321224364860SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 352.2755.1953.5454.30

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweet2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C323691215SE +/- 0.02, N = 3SE +/- 0.15, N = 3SE +/- 0.14, N = 3SE +/- 0.17, N = 313.5913.2412.8813.381. (CXX) g++ options: -O3 -lrt

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.17Time To Compile2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220406080100SE +/- 0.16, N = 3SE +/- 0.40, N = 3SE +/- 0.99, N = 2SE +/- 0.52, N = 376.0077.9280.1077.01

OpenFOAM

Input: motorBike - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Execution Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32132639526558.5755.5757.251. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered Halfspace2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321632486480SE +/- 0.31, N = 3SE +/- 0.59, N = 3SE +/- 0.59, N = 373.1469.8272.4469.441. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bare2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C320.74211.48422.22632.96843.7105SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 33.1413.2983.2281. (CXX) g++ options: -O3 -lm

XNNPACK

Model: FP32MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Large2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3230060090012001500SE +/- 19.19, N = 3SE +/- 2.08, N = 3SE +/- 3.18, N = 3SE +/- 9.64, N = 315161553159015271. (CXX) g++ options: -O3 -lrt -lm

Y-Cruncher

Pi Digits To Calculate: 500M

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 500M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32246810SE +/- 0.006, N = 3SE +/- 0.023, N = 3SE +/- 0.016, N = 37.5877.2367.247

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 256 - Model: ResNet-502 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321326395265SE +/- 0.34, N = 3SE +/- 0.36, N = 3SE +/- 0.25, N = 3SE +/- 0.40, N = 1557.5458.4956.1958.89MIN: 53.14 / MAX: 58.34MIN: 34.6 / MAX: 60.01MIN: 39.35 / MAX: 57.3MIN: 34.79 / MAX: 62.62

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered Halfspace2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321632486480SE +/- 0.34, N = 3SE +/- 0.24, N = 3SE +/- 0.16, N = 2SE +/- 0.34, N = 372.2068.9272.0869.561. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Stockfish

Chess Benchmark

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfishChess Benchmark2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3211M22M33M44M55MSE +/- 491754.88, N = 15SE +/- 515794.89, N = 15SE +/- 166041.67, N = 3SE +/- 410305.51, N = 3513777784997421050101396491941551. Stockfish 16 by the Stockfish developers (see AUTHORS file)

NAMD

Input: STMV with 1,066,628 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: STMV with 1,066,628 Atoms2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C320.23050.4610.69150.9221.1525SE +/- 0.00027, N = 3SE +/- 0.00057, N = 3SE +/- 0.00137, N = 2SE +/- 0.00200, N = 30.980841.024260.998521.01602

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C324080120160200181.57174.13177.791. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACSInput: water_GMX50_bare2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C320.4520.9041.3561.8082.26SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 31.9272.0091.9961. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321020304050SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 341.4941.7240.0740.94MIN: 41.24 / MAX: 42.02MIN: 41.48 / MAX: 42.26MIN: 39.86 / MAX: 40.55MIN: 40.73 / MAX: 41.6

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:1002 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.6M3.2M4.8M6.4M8MSE +/- 1150.91, N = 3SE +/- 43634.73, N = 3SE +/- 71150.32, N = 3SE +/- 87376.76, N = 37507615.737498825.197350932.887647187.481. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

XNNPACK

Model: QU8MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Small2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 0.88, N = 3SE +/- 1.33, N = 3SE +/- 0.67, N = 37567647867611. (CXX) g++ options: -O3 -lrt -lm

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Crown2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32816243240SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 336.5536.2535.1836.11MIN: 36.13 / MAX: 37.34MIN: 35.89 / MAX: 36.94MIN: 34.83 / MAX: 36.01MIN: 35.69 / MAX: 36.98

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:102 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.3M2.6M3.9M5.2M6.5MSE +/- 15831.56, N = 3SE +/- 11157.27, N = 3SE +/- 4372.40, N = 3SE +/- 11599.22, N = 35915791.345874747.735694123.525898565.341. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32510152025SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 319.9919.8419.3320.08MIN: 18.05 / MAX: 20.42MIN: 17.86 / MAX: 20.21MIN: 17.37 / MAX: 19.62MIN: 17.96 / MAX: 20.55

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Compression Rating2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3240K80K120K160K200KSE +/- 322.74, N = 3SE +/- 94.03, N = 3SE +/- 467.61, N = 31955852008102030651. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

OpenRadioss

Model: Cell Phone Drop Test

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Test2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321020304050SE +/- 0.18, N = 3SE +/- 0.15, N = 3SE +/- 0.29, N = 344.9043.2543.42

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic Model2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32612182430SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.16, N = 15SE +/- 0.20, N = 323.3623.2923.8723.001. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

Input: motorBike - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Mesh Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322040608010084.9988.0885.4784.981. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

XNNPACK

Model: FP16MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Small2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 1.76, N = 3SE +/- 1.20, N = 3SE +/- 0.58, N = 3SE +/- 1.86, N = 37707737977761. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Large2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 1.20, N = 3SE +/- 2.73, N = 3SE +/- 3.21, N = 3SE +/- 4.91, N = 311181124115411151. (CXX) g++ options: -O3 -lrt -lm

Java JMH

Throughput

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughput2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220000M40000M60000M80000M100000M91554717298.4389012581592.6590340365706.7992045826799.54

XNNPACK

Model: FP16MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 2.31, N = 3SE +/- 1.20, N = 3SE +/- 2.31, N = 3SE +/- 3.38, N = 398599710189961. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Large2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3230060090012001500SE +/- 2.89, N = 3SE +/- 4.58, N = 3SE +/- 8.11, N = 3SE +/- 3.53, N = 312911302133413031. (CXX) g++ options: -O3 -lrt -lm

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop Container2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3250100150200250SE +/- 0.74, N = 3SE +/- 0.76, N = 3SE +/- 0.99, N = 3247.06242.00239.22

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 4K2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32918273645SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.15, N = 3SE +/- 0.15, N = 338.2738.0737.0938.301. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

Blender

Blend File: Junkshop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: CPU-Only2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321428425670SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.17, N = 3SE +/- 0.03, N = 362.0963.4164.0862.21

XNNPACK

Model: QU8MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 0.33, N = 3SE +/- 2.96, N = 3SE +/- 3.18, N = 3SE +/- 0.88, N = 37597657837651. (CXX) g++ options: -O3 -lrt -lm

TensorFlow

Device: CPU - Batch Size: 1 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3248121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.9217.3316.8117.34

MBW

Test: Memory Copy - Array Size: 4096 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 4096 MiB2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C325K10K15K20K25KSE +/- 72.38, N = 3SE +/- 31.03, N = 3SE +/- 227.86, N = 15SE +/- 265.89, N = 322774.9522539.7222093.9922510.461. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C328001600240032004000SE +/- 2.85, N = 3SE +/- 40.74, N = 3SE +/- 51.16, N = 3SE +/- 36.74, N = 33823.803739.163712.453714.661. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

XNNPACK

Model: FP32MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Small2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 2.31, N = 3SE +/- 5.00, N = 3SE +/- 3.18, N = 3SE +/- 2.31, N = 38128188368151. (CXX) g++ options: -O3 -lrt -lm

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. Helens2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32612182430SE +/- 0.17, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 325.6925.1925.4524.961. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: CPU-Only2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321122334455SE +/- 0.01, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 346.3847.7247.5146.56

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: Kostya2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32246810SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 37.717.577.507.631. (CXX) g++ options: -O3 -lrt

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3261218243022.5823.1422.821. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 1080p2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32306090120150SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.23, N = 3SE +/- 0.45, N = 3134.52134.19131.28134.481. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.38.2VGR Performance Metric2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32110K220K330K440K550K4989664869544897844944371. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

Xmrig

Variant: GhostRider - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C328001600240032004000SE +/- 6.72, N = 3SE +/- 30.71, N = 9SE +/- 36.12, N = 6SE +/- 37.04, N = 33616.63699.63676.53638.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: CPU-Only2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32100200300400500SE +/- 0.20, N = 3SE +/- 0.27, N = 3SE +/- 0.20, N = 3SE +/- 0.26, N = 3448.96459.12459.19450.71

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322004006008001000SE +/- 8.61, N = 3SE +/- 1.37, N = 3SE +/- 4.91, N = 3SE +/- 2.01, N = 31068.361046.911064.171070.61

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninja2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3270140210280350SE +/- 0.12, N = 3SE +/- 0.38, N = 2SE +/- 0.22, N = 3301.63304.62298.05

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.15882.31763.47644.63525.794SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 35.155.045.055.14MIN: 2.38 / MAX: 5.77MIN: 2.28 / MAX: 5.66MIN: 2.28 / MAX: 5.66MIN: 2.39 / MAX: 5.75

Quicksilver

Input: CORAL2 P1

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P12 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C325M10M15M20M25MSE +/- 21858.13, N = 3SE +/- 92074.85, N = 3SE +/- 16666.67, N = 3SE +/- 41633.32, N = 3256766672566666725136667256600001. (CXX) g++ options: -fopenmp -O3 -march=native

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.648.558.468.64MIN: 7.64 / MAX: 9.25MIN: 7.48 / MAX: 9.15MIN: 7.49 / MAX: 9.04MIN: 7.67 / MAX: 9.25

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installation2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321224364860SE +/- 0.10, N = 3SE +/- 0.16, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 353.1653.6654.2753.23

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.05532.11063.16594.22125.2765SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 34.624.654.604.69MIN: 2.07 / MAX: 5.22MIN: 2.02 / MAX: 5.24MIN: 2.13 / MAX: 5.16MIN: 2.15 / MAX: 5.26

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220406080100SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 377.3976.2075.9377.251. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32400800120016002000SE +/- 0.64, N = 3SE +/- 1.03, N = 3SE +/- 2.32, N = 3SE +/- 0.32, N = 31934.611904.951898.261931.141. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-RayTrace Time2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3248121620SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 316.1016.4016.3216.131. POV-Ray 3.7.0.10.unofficial

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C328001600240032004000SE +/- 25.12, N = 3SE +/- 17.86, N = 3SE +/- 21.03, N = 3SE +/- 53.14, N = 33678.463741.863722.563731.971. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

XNNPACK

Model: FP32MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3230060090012001500SE +/- 5.21, N = 3SE +/- 4.98, N = 3SE +/- 9.00, N = 3SE +/- 8.84, N = 312641272128512731. (CXX) g++ options: -O3 -lrt -lm

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321.21952.4393.65854.8786.0975SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 35.405.365.345.42MIN: 5.27 / MAX: 5.72MIN: 5.25 / MAX: 5.7MIN: 5.23 / MAX: 5.69MIN: 5.31 / MAX: 5.77

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC22 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32160320480640800SE +/- 1.25, N = 3SE +/- 1.36, N = 3SE +/- 0.82, N = 3SE +/- 0.38, N = 3741.78734.53731.23739.741. (CXX) g++ options: -flto -pthread

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220406080100SE +/- 0.27, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 376.8976.1275.9977.081. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32400800120016002000SE +/- 6.84, N = 3SE +/- 0.31, N = 3SE +/- 0.87, N = 3SE +/- 0.54, N = 31922.131903.041899.801926.891. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Memory Copying2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322K4K6K8K10KSE +/- 44.94, N = 3SE +/- 39.56, N = 3SE +/- 25.18, N = 3SE +/- 72.54, N = 310752.8110608.9610608.8210720.311. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beam2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3220406080100SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 376.9077.4577.8177.18

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C321122334455SE +/- 0.30, N = 3SE +/- 0.40, N = 3SE +/- 0.30, N = 347.7648.2647.78

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To Compile2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3280160240320400SE +/- 0.34, N = 3SE +/- 0.24, N = 3345.25341.68

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandom2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C320.4770.9541.4311.9082.385SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 32.122.102.112.111. (CXX) g++ options: -O3 -lrt

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To Compile2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3250100150200250SE +/- 0.11, N = 3SE +/- 0.19, N = 3221.14219.27

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Decompression Rating2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3230K60K90K120K150KSE +/- 14.15, N = 3SE +/- 33.91, N = 3SE +/- 69.90, N = 31581331570051575121. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfig2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32130260390520650SE +/- 0.15, N = 3SE +/- 0.52, N = 3590.50589.33

Llamafile

Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU

OpenBenchmarking.orgTokens Per Second, More Is BetterLlamafile 0.8.6Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.0532.1063.1594.2125.2654.68

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshield2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32306090120150SE +/- 0.28, N = 3SE +/- 0.28, N = 3SE +/- 5.81, N = 15SE +/- 0.35, N = 3128.76128.32124.37128.01

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1M2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32150300450600750SE +/- 0.28, N = 3SE +/- 49.36, N = 9SE +/- 1.02, N = 3705.55573.33668.16


Phoronix Test Suite v10.8.5