AMD Ryzen 9 9950X DDR5 Memory Performance

AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2408225-NE-RYZEN999510
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G
August 14
  13 Hours, 26 Minutes
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32
August 22
  10 Hours, 21 Minutes
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36
August 20
  10 Hours, 59 Minutes
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38
August 21
  7 Hours, 49 Minutes
Invert Behavior (Only Show Selected Data)
  10 Hours, 39 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD Ryzen 9 9950X DDR5 Memory PerformanceOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS)AMD Device 14d82 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C362 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C382 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C322000GB Corsair MP700 PROAMD Radeon RX 7900 GRE 16GBAMD Navi 31 HDMI/DPDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.10.0-phx (x86_64)GNOME Shell 46.0X Server + Wayland4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57)GCC 13.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionAMD Ryzen 9 9950X DDR5 Memory Performance BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a- Python 3.12.3- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36, 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38, 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)

2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32Result OverviewPhoronix Test Suite100%123%145%168%191%OpenFOAMOpenRadiossLeelaChessZeroXcompact3d Incompact3dlibxsmmHigh Performance Conjugate GradientLlama.cppLULESHNAS Parallel BenchmarksRAMspeed SMPBuild2PyTorchNAMDStockfishSPECFEM3DTensorFlowMBWMemcachedEmbreeJava JMHXNNPACKsimdjsonx265BlenderBRL-CADXmrigNumpy BenchmarkLuxCoreRenderQuicksilverPOV-RayminiBUDEEtcpakStress-NG

AMD Ryzen 9 9950X DDR5 Memory Performancepytorch: CPU - 1 - ResNet-50pytorch: CPU - 256 - ResNet-50minibude: OpenMP - BM1minibude: OpenMP - BM2stress-ng: Memory Copyingquicksilver: CORAL2 P1embree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonx265: Bosphorus 4Kx265: Bosphorus 1080psimdjson: Kostyasimdjson: TopTweetsimdjson: LargeRandminibude: OpenMP - BM1minibude: OpenMP - BM2hpcg: 104 104 104 - 60libxsmm: 128libxsmm: 32libxsmm: 64xmrig: GhostRider - 1Mtensorflow: CPU - 1 - ResNet-50tensorflow: CPU - 64 - ResNet-50luxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUluxcorerender: Orange Juice - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Rainbow Colors and Prism - CPUramspeed: Add - Integerramspeed: Copy - Integerramspeed: Scale - Integerramspeed: Triad - Integerramspeed: Average - Integermbw: Memory Copy - 4096 MiBmbw: Memory Copy - 8192 MiBmbw: Memory Copy, Fixed Block Size - 4096 MiBmbw: Memory Copy, Fixed Block Size - 8192 MiBcompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingetcpak: Multi-Threaded - ETC2lczero: BLASlczero: Eigenstockfish: Chess Benchmarkgromacs: MPI CPU - water_GMX50_baregromacs: water_GMX50_barenamd: ATPase with 327,506 Atomsnamd: STMV with 1,066,628 Atomsjava-jmh: Throughputmemcached: 1:10memcached: 1:100numpy: llama-cpp: Meta-Llama-3-8B-Instruct-Q8_0.ggufllamafile: Meta-Llama-3-8B-Instruct.F16 - CPUnpb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Bnpb: SP.Cbrl-cad: VGR Performance Metriclulesh: incompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionopenfoam: motorBike - Mesh Timeopenfoam: motorBike - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenradioss: Bumper Beamopenradioss: Chrysler Neon 1Mopenradioss: Cell Phone Drop Testopenradioss: Bird Strike on Windshieldopenradioss: Rubber O-Ring Seal Installationopenradioss: INIVOL and Fluid Structure Interaction Drop Containerspecfem3d: Mount St. Helensspecfem3d: Layered Halfspacespecfem3d: Tomographic Modelspecfem3d: Homogeneous Halfspacespecfem3d: Water-layered Halfspacebuild-gem5: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigbuild-llvm: Ninjabuild-nodejs: Time To Compilebuild2: Time To Compiley-cruncher: 1By-cruncher: 500Mpovray: Trace Timeblender: BMW27 - CPU-Onlyblender: Junkshop - CPU-Onlyblender: Barbershop - CPU-Onlyxnnpack: FP32MobileNetV2xnnpack: FP32MobileNetV3Largexnnpack: FP32MobileNetV3Smallxnnpack: FP16MobileNetV2xnnpack: FP16MobileNetV3Largexnnpack: FP16MobileNetV3Smallxnnpack: QU8MobileNetV2xnnpack: QU8MobileNetV3Largexnnpack: QU8MobileNetV3Small2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C3284.8957.5476.88577.38510752.812567666736.553241.491438.27134.527.7113.592.121922.1251934.6138.93560472.0121.0237.13616.616.9252.275.404.628.645.1519.9961079.9365797.0569141.0862428.7065503.9322774.94722247.78419534.15919467.394195585158133741.784226220513777783.1411.9273.410070.9808491554717298.4265915791.347507615.731068.368.364.6855287.3711144.883678.463823.8027106.181505.4759920.9624236.2121328.2814528.264989669945.530914.090555264.113021884.994958.569522.579849157.14898181.569052096.566476.90705.5544.90128.7653.16247.0625.69098664473.13879708423.35909292030.67791845772.197260512221.13947.755590.495301.629345.24676.00316.4527.58716.10146.3862.09448.96126415168129851291770759111875686.7958.4976.12276.19810608.962566666736.254641.723838.07134.197.5713.242.101903.0401904.9539.51601508.9132.2258.63699.617.3355.195.364.658.555.0419.8465113.5270345.5373724.8866963.0369491.4222539.72022817.90020923.98920349.414200810157005734.533264247499742103.2982.0093.577551.0242689012581592.6535874747.737498825.191046.919.0060124.0811908.473741.863739.1629713.551635.3464031.3126596.4623254.4016097.0048695410578.51612.974287058.198459688.080755.573823.143631145.88323174.125721909.665577.45573.3343.25128.3253.66242.0025.19465363069.81742231123.28751738830.54298448768.92093565148.258304.62177.92315.6067.23616.39547.7263.41459.12127215538189971302773765112476481.8956.1975.99275.93010608.822513666735.182840.069637.09131.287.5012.882.111899.8041898.2589.16590491.1127.0249.73676.516.8153.545.344.608.465.0519.3362771.0066545.2470445.7464832.8666822.4922093.98521444.00219747.58420124.943731.229227501013963.509580.9985290340365706.7925694123.527350932.881064.178.6658570.0311662.443722.563712.4528743.491586.5962375.4325715.3422436.8115531.764897849832.638513.259573360.480583285.474577.81124.3754.2725.45331906072.438348923.86945318131.98171567772.07947665880.10416.31947.5164.08459.191285159083610181334797783115478685.5258.8977.07677.24610720.312566000036.109940.944738.30134.487.6313.382.111926.8911931.1418.79737505.9128.0253.33638.617.3454.305.424.698.645.1420.0863967.7070747.0071754.5865845.2168588.8422510.46422294.06220817.56920850.883203065157512739.736244236491941553.2281.9963.604641.0160292045826799.545898565.347647187.481070.618.3258570.7711374.513731.973714.6628891.861584.9362084.7225622.9322908.5815610.3049443710235.72213.402805360.799243984.980257.245222.821389148.75845177.78941994.74777.18668.1643.42128.0153.23239.2224.95654992869.43938729923.00050134129.81843126869.561642467219.26847.783589.331298.049341.67977.00615.4267.24716.13346.5662.21450.711273152781599613037767651115761OpenBenchmarking.org

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.96, N = 4SE +/- 0.82, N = 5SE +/- 0.63, N = 3SE +/- 0.27, N = 385.5281.8986.7984.89MIN: 68.54 / MAX: 88.53MIN: 67.48 / MAX: 84.5MIN: 78.82 / MAX: 88.23MIN: 77.17 / MAX: 85.87

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 256 - Model: ResNet-502 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1326395265SE +/- 0.40, N = 15SE +/- 0.25, N = 3SE +/- 0.36, N = 3SE +/- 0.34, N = 358.8956.1958.4957.54MIN: 34.79 / MAX: 62.62MIN: 39.35 / MAX: 57.3MIN: 34.6 / MAX: 60.01MIN: 53.14 / MAX: 58.34

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.27, N = 377.0875.9976.1276.891. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 377.2575.9376.2077.391. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Memory Copying2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2K4K6K8K10KSE +/- 72.54, N = 3SE +/- 25.18, N = 3SE +/- 39.56, N = 3SE +/- 44.94, N = 310720.3110608.8210608.9610752.811. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE

Quicksilver

Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P12 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5M10M15M20M25MSE +/- 41633.32, N = 3SE +/- 16666.67, N = 3SE +/- 92074.85, N = 3SE +/- 21858.13, N = 3256600002513666725666667256766671. (CXX) g++ options: -fopenmp -O3 -march=native

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Crown2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G816243240SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 336.1135.1836.2536.55MIN: 35.69 / MAX: 36.98MIN: 34.83 / MAX: 36.01MIN: 35.89 / MAX: 36.94MIN: 36.13 / MAX: 37.34

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1020304050SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 340.9440.0741.7241.49MIN: 40.73 / MAX: 41.6MIN: 39.86 / MAX: 40.55MIN: 41.48 / MAX: 42.26MIN: 41.24 / MAX: 42.02

x265

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 4K2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G918273645SE +/- 0.15, N = 3SE +/- 0.15, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 338.3037.0938.0738.271. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 1080p2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150SE +/- 0.45, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 3SE +/- 0.27, N = 3134.48131.28134.19134.521. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

simdjson

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: Kostya2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G246810SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 37.637.507.577.711. (CXX) g++ options: -O3 -lrt

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweet2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3691215SE +/- 0.17, N = 3SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.02, N = 313.3812.8813.2413.591. (CXX) g++ options: -O3 -lrt

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandom2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.4770.9541.4311.9082.385SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 32.112.112.102.121. (CXX) g++ options: -O3 -lrt

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM12 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G400800120016002000SE +/- 0.54, N = 3SE +/- 0.87, N = 3SE +/- 0.31, N = 3SE +/- 6.84, N = 31926.891899.801903.041922.131. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM22 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G400800120016002000SE +/- 0.32, N = 3SE +/- 2.32, N = 3SE +/- 1.03, N = 3SE +/- 0.64, N = 31931.141898.261904.951934.611. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 602 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3691215SE +/- 0.00125, N = 3SE +/- 0.00584, N = 3SE +/- 0.00741, N = 3SE +/- 0.00142, N = 38.797379.165909.516018.935601. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 1282 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G110220330440550SE +/- 0.10, N = 3SE +/- 0.33, N = 3SE +/- 0.09, N = 3SE +/- 0.38, N = 3505.9491.1508.9472.01. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 322 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150SE +/- 0.07, N = 3SE +/- 0.26, N = 3SE +/- 0.19, N = 3SE +/- 0.07, N = 3128.0127.0132.2121.01. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 642 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G60120180240300SE +/- 0.03, N = 3SE +/- 0.53, N = 3SE +/- 0.37, N = 3SE +/- 0.20, N = 3253.3249.7258.6237.11. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1M2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G8001600240032004000SE +/- 37.04, N = 3SE +/- 36.12, N = 6SE +/- 30.71, N = 9SE +/- 6.72, N = 33638.63676.53699.63616.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 1 - Model: ResNet-502 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G48121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 317.3416.8117.3316.92

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 64 - Model: ResNet-502 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1224364860SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 354.3053.5455.1952.27

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.21952.4393.65854.8786.0975SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.425.345.365.40MIN: 5.31 / MAX: 5.77MIN: 5.23 / MAX: 5.69MIN: 5.25 / MAX: 5.7MIN: 5.27 / MAX: 5.72

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.05532.11063.16594.22125.2765SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 34.694.604.654.62MIN: 2.15 / MAX: 5.26MIN: 2.13 / MAX: 5.16MIN: 2.02 / MAX: 5.24MIN: 2.07 / MAX: 5.22

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.648.468.558.64MIN: 7.67 / MAX: 9.25MIN: 7.49 / MAX: 9.04MIN: 7.48 / MAX: 9.15MIN: 7.64 / MAX: 9.25

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.15882.31763.47644.63525.794SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.145.055.045.15MIN: 2.39 / MAX: 5.75MIN: 2.28 / MAX: 5.66MIN: 2.28 / MAX: 5.66MIN: 2.38 / MAX: 5.77

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPU2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G510152025SE +/- 0.15, N = 3SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 320.0819.3319.8419.99MIN: 17.96 / MAX: 20.55MIN: 17.37 / MAX: 19.62MIN: 17.86 / MAX: 20.21MIN: 18.05 / MAX: 20.42

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integer2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G14K28K42K56K70KSE +/- 382.92, N = 3SE +/- 64.38, N = 3SE +/- 108.61, N = 3SE +/- 426.98, N = 363967.7062771.0065113.5261079.931. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integer2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G15K30K45K60K75KSE +/- 453.50, N = 3SE +/- 299.96, N = 3SE +/- 762.95, N = 5SE +/- 820.93, N = 370747.0066545.2470345.5365797.051. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integer2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G16K32K48K64K80KSE +/- 266.14, N = 3SE +/- 513.22, N = 3SE +/- 364.59, N = 3SE +/- 220.54, N = 371754.5870445.7473724.8869141.081. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integer2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G14K28K42K56K70KSE +/- 33.42, N = 3SE +/- 130.09, N = 3SE +/- 439.59, N = 3SE +/- 156.92, N = 365845.2164832.8666963.0362428.701. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integer2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G15K30K45K60K75KSE +/- 204.48, N = 3SE +/- 805.54, N = 4SE +/- 315.00, N = 3SE +/- 119.74, N = 368588.8466822.4969491.4265503.931. (CC) gcc options: -O3 -march=native

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 4096 MiB2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5K10K15K20K25KSE +/- 265.89, N = 3SE +/- 227.86, N = 15SE +/- 31.03, N = 3SE +/- 72.38, N = 322510.4622093.9922539.7222774.951. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiB2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5K10K15K20K25KSE +/- 188.50, N = 12SE +/- 56.11, N = 3SE +/- 204.03, N = 3SE +/- 49.10, N = 322294.0621444.0022817.9022247.781. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G4K8K12K16K20KSE +/- 32.81, N = 3SE +/- 36.42, N = 3SE +/- 211.85, N = 3SE +/- 24.77, N = 320817.5719747.5820923.9919534.161. (CC) gcc options: -O3 -march=native

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G4K8K12K16K20KSE +/- 58.10, N = 3SE +/- 239.88, N = 3SE +/- 144.90, N = 15SE +/- 10.56, N = 320850.8820124.9420349.4119467.391. (CC) gcc options: -O3 -march=native

7-Zip Compression

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Compression Rating2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G40K80K120K160K200KSE +/- 467.61, N = 3SE +/- 94.03, N = 3SE +/- 322.74, N = 32030652008101955851. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Test: Compression Rating

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

OpenBenchmarking.orgMIPS, More Is Better7-Zip CompressionTest: Decompression Rating2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30K60K90K120K150KSE +/- 69.90, N = 3SE +/- 33.91, N = 3SE +/- 14.15, N = 31575121570051581331. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Test: Decompression Rating

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

Etcpak

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC22 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G160320480640800SE +/- 0.38, N = 3SE +/- 0.82, N = 3SE +/- 1.36, N = 3SE +/- 1.25, N = 3739.74731.23734.53741.781. (CXX) g++ options: -flto -pthread

LeelaChessZero

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: BLAS2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G60120180240300SE +/- 2.91, N = 3SE +/- 3.79, N = 3SE +/- 2.65, N = 32442642261. (CXX) g++ options: -flto -pthread

Backend: BLAS

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: Eigen2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G50100150200250SE +/- 2.85, N = 3SE +/- 1.15, N = 3SE +/- 1.45, N = 32362272472201. (CXX) g++ options: -flto -pthread

Stockfish

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfishChess Benchmark2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G11M22M33M44M55MSE +/- 410305.51, N = 3SE +/- 166041.67, N = 3SE +/- 515794.89, N = 15SE +/- 491754.88, N = 15491941555010139649974210513777781. Stockfish 16 by the Stockfish developers (see AUTHORS file)

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bare2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.74211.48422.22632.96843.7105SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 33.2283.2983.1411. (CXX) g++ options: -O3 -lm

Implementation: MPI CPU - Input: water_GMX50_bare

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-System-Product-Name exited on signal 11 (Segmentation fault).

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACSInput: water_GMX50_bare2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.4520.9041.3561.8082.26SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 31.9962.0091.9271. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

Input: water_GMX50_bare

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Fatal error:

NAMD

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: ATPase with 327,506 Atoms2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.8111.6222.4333.2444.055SE +/- 0.03856, N = 3SE +/- 0.02740, N = 9SE +/- 0.02618, N = 15SE +/- 0.02586, N = 33.604643.509583.577553.41007

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0b6Input: STMV with 1,066,628 Atoms2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G0.23050.4610.69150.9221.1525SE +/- 0.00200, N = 3SE +/- 0.00137, N = 2SE +/- 0.00057, N = 3SE +/- 0.00027, N = 31.016020.998521.024260.98084

Java JMH

This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughput2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20000M40000M60000M80000M100000M92045826799.5490340365706.7989012581592.6591554717298.43

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:102 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.3M2.6M3.9M5.2M6.5MSE +/- 11599.22, N = 3SE +/- 4372.40, N = 3SE +/- 11157.27, N = 3SE +/- 15831.56, N = 35898565.345694123.525874747.735915791.341. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:1002 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.6M3.2M4.8M6.4M8MSE +/- 87376.76, N = 3SE +/- 71150.32, N = 3SE +/- 43634.73, N = 3SE +/- 1150.91, N = 37647187.487350932.887498825.197507615.731. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 2.01, N = 3SE +/- 4.91, N = 3SE +/- 1.37, N = 3SE +/- 8.61, N = 31070.611064.171046.911068.36

Llama.cpp

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b3067Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.328.669.008.361. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas

Llamafile

OpenBenchmarking.orgTokens Per Second, More Is BetterLlamafile 0.8.6Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1.0532.1063.1594.2125.2654.68

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G13K26K39K52K65KSE +/- 8.10, N = 3SE +/- 74.71, N = 3SE +/- 85.26, N = 3SE +/- 54.79, N = 358570.7758570.0360124.0855287.371. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3K6K9K12K15KSE +/- 36.86, N = 3SE +/- 41.76, N = 3SE +/- 10.26, N = 3SE +/- 21.24, N = 311374.5111662.4411908.4711144.881. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G8001600240032004000SE +/- 53.14, N = 3SE +/- 21.03, N = 3SE +/- 17.86, N = 3SE +/- 25.12, N = 33731.973722.563741.863678.461. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G8001600240032004000SE +/- 36.74, N = 3SE +/- 51.16, N = 3SE +/- 40.74, N = 3SE +/- 2.85, N = 33714.663712.453739.163823.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G6K12K18K24K30KSE +/- 30.67, N = 3SE +/- 113.15, N = 3SE +/- 95.45, N = 3SE +/- 141.73, N = 328891.8628743.4929713.5527106.181. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G400800120016002000SE +/- 2.77, N = 3SE +/- 5.23, N = 3SE +/- 4.49, N = 3SE +/- 5.33, N = 31584.931586.591635.341505.471. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G14K28K42K56K70KSE +/- 130.21, N = 3SE +/- 210.02, N = 3SE +/- 72.30, N = 3SE +/- 150.93, N = 362084.7262375.4364031.3159920.961. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G6K12K18K24K30KSE +/- 20.77, N = 3SE +/- 10.24, N = 3SE +/- 34.50, N = 3SE +/- 6.88, N = 325622.9325715.3426596.4624236.211. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G5K10K15K20K25KSE +/- 26.91, N = 3SE +/- 40.85, N = 3SE +/- 10.38, N = 3SE +/- 21.50, N = 322908.5822436.8123254.4021328.281. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G3K6K9K12K15KSE +/- 16.93, N = 3SE +/- 42.19, N = 3SE +/- 15.11, N = 3SE +/- 18.27, N = 315610.3015531.7616097.0014528.261. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.38.2VGR Performance Metric2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G110K220K330K440K550K4944374897844869544989661. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.32 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2K4K6K8K10KSE +/- 67.88, N = 3SE +/- 38.48, N = 3SE +/- 13.71, N = 3SE +/- 19.00, N = 310235.729832.6410578.529945.531. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G48121620SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 313.4013.2612.9714.091. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1428425670SE +/- 0.20, N = 3SE +/- 0.01, N = 2SE +/- 0.14, N = 3SE +/- 0.09, N = 360.8060.4858.2064.111. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Mesh Time2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2040608010084.9885.4788.0884.991. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Execution Time2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G132639526557.2555.5758.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G61218243022.8223.1422.581. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Small Mesh Size - Mesh Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] #0 Foam::error::printStack(Foam::Ostream&) at ??:?

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150148.76145.88157.151. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Small Mesh Size - Execution Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G4080120160200177.79174.13181.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Medium Mesh Size - Mesh Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G50010001500200025001994.751909.672096.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Input: drivaerFastback, Medium Mesh Size - Execution Time

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] --> FOAM FATAL ERROR:

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beam2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.14, N = 3SE +/- 0.02, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 377.1877.8177.4576.90

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1M2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G150300450600750SE +/- 1.02, N = 3SE +/- 49.36, N = 9SE +/- 0.28, N = 3668.16573.33705.55

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Test2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1020304050SE +/- 0.29, N = 3SE +/- 0.15, N = 3SE +/- 0.18, N = 343.4243.2544.90

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshield2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G306090120150SE +/- 0.35, N = 3SE +/- 5.81, N = 15SE +/- 0.28, N = 3SE +/- 0.28, N = 3128.01124.37128.32128.76

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installation2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1224364860SE +/- 0.15, N = 3SE +/- 0.06, N = 3SE +/- 0.16, N = 3SE +/- 0.10, N = 353.2354.2753.6653.16

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop Container2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G50100150200250SE +/- 0.99, N = 3SE +/- 0.76, N = 3SE +/- 0.74, N = 3239.22242.00247.06

Model: INIVOL and Fluid Structure Interaction Drop Container

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test run did not produce a result. E: ** ERROR: FILE fsi_drop_container_0000_0001.rst NOT FOUND

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. Helens2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G612182430SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.17, N = 324.9625.4525.1925.691. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered Halfspace2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1632486480SE +/- 0.59, N = 3SE +/- 0.59, N = 3SE +/- 0.31, N = 369.4472.4469.8273.141. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic Model2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G612182430SE +/- 0.20, N = 3SE +/- 0.16, N = 15SE +/- 0.14, N = 3SE +/- 0.16, N = 323.0023.8723.2923.361. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous Halfspace2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G714212835SE +/- 0.39, N = 3SE +/- 0.38, N = 3SE +/- 0.28, N = 329.8231.9830.5430.681. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered Halfspace2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1632486480SE +/- 0.34, N = 3SE +/- 0.16, N = 2SE +/- 0.24, N = 3SE +/- 0.34, N = 369.5672.0868.9272.201. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To Compile2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G50100150200250SE +/- 0.19, N = 3SE +/- 0.11, N = 3219.27221.14

Time To Compile

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Warning: Protocol buffer compiler (protoc) not found.

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1122334455SE +/- 0.30, N = 3SE +/- 0.40, N = 3SE +/- 0.30, N = 347.7848.2647.76

Build: defconfig

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: : internal compiler error: Segmentation fault

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfig2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G130260390520650SE +/- 0.52, N = 3SE +/- 0.15, N = 3589.33590.50

Build: allmodconfig

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: gcc: internal compiler error: Segmentation fault signal terminated program as

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninja2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G70140210280350SE +/- 0.22, N = 3SE +/- 0.38, N = 2SE +/- 0.12, N = 3298.05304.62301.63

Build System: Ninja

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: /usr/include/c++/13/bits/vector.tcc:445:7: internal compiler error: Segmentation fault

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To Compile2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-6000 CL30 F5-6000J3038F16G80160240320400SE +/- 0.24, N = 3SE +/- 0.34, N = 3341.68345.25

Time To Compile

2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.

Build2

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.17Time To Compile2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G20406080100SE +/- 0.52, N = 3SE +/- 0.99, N = 2SE +/- 0.40, N = 3SE +/- 0.16, N = 377.0180.1077.9276.00

Y-Cruncher

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 1B2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G48121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 315.4315.6116.45

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 500M2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G246810SE +/- 0.016, N = 3SE +/- 0.023, N = 3SE +/- 0.006, N = 37.2477.2367.587

POV-Ray

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-RayTrace Time2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G48121620SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 316.1316.3216.4016.101. POV-Ray 3.7.0.10.unofficial

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: CPU-Only2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1122334455SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.01, N = 346.5647.5147.7246.38

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: CPU-Only2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G1428425670SE +/- 0.03, N = 3SE +/- 0.17, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 362.2164.0863.4162.09

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: CPU-Only2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G100200300400500SE +/- 0.26, N = 3SE +/- 0.20, N = 3SE +/- 0.27, N = 3SE +/- 0.20, N = 3450.71459.19459.12448.96

XNNPACK

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV22 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 8.84, N = 3SE +/- 9.00, N = 3SE +/- 4.98, N = 3SE +/- 5.21, N = 312731285127212641. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Large2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 9.64, N = 3SE +/- 3.18, N = 3SE +/- 2.08, N = 3SE +/- 19.19, N = 315271590155315161. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Small2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 2.31, N = 3SE +/- 3.18, N = 3SE +/- 5.00, N = 3SE +/- 2.31, N = 38158368188121. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV22 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 3.38, N = 3SE +/- 2.31, N = 3SE +/- 1.20, N = 3SE +/- 2.31, N = 399610189979851. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Large2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G30060090012001500SE +/- 3.53, N = 3SE +/- 8.11, N = 3SE +/- 4.58, N = 3SE +/- 2.89, N = 313031334130212911. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Small2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 1.86, N = 3SE +/- 0.58, N = 3SE +/- 1.20, N = 3SE +/- 1.76, N = 37767977737701. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV22 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 0.88, N = 3SE +/- 3.18, N = 3SE +/- 2.96, N = 3SE +/- 0.33, N = 37657837657591. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Large2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 4.91, N = 3SE +/- 3.21, N = 3SE +/- 2.73, N = 3SE +/- 1.20, N = 311151154112411181. (CXX) g++ options: -O3 -lrt -lm

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Small2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C322 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C382 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C362 x 16GB DDR5-6000 CL30 F5-6000J3038F16G2004006008001000SE +/- 0.67, N = 3SE +/- 1.33, N = 3SE +/- 0.88, N = 37617867647561. (CXX) g++ options: -O3 -lrt -lm

104 Results Shown

PyTorch:
  CPU - 1 - ResNet-50
  CPU - 256 - ResNet-50
miniBUDE:
  OpenMP - BM1
  OpenMP - BM2
Stress-NG
Quicksilver
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon
x265:
  Bosphorus 4K
  Bosphorus 1080p
simdjson:
  Kostya
  TopTweet
  LargeRand
miniBUDE:
  OpenMP - BM1
  OpenMP - BM2
High Performance Conjugate Gradient
libxsmm:
  128
  32
  64
Xmrig
TensorFlow:
  CPU - 1 - ResNet-50
  CPU - 64 - ResNet-50
LuxCoreRender:
  DLSC - CPU
  Danish Mood - CPU
  Orange Juice - CPU
  LuxCore Benchmark - CPU
  Rainbow Colors and Prism - CPU
RAMspeed SMP:
  Add - Integer
  Copy - Integer
  Scale - Integer
  Triad - Integer
  Average - Integer
MBW:
  Memory Copy - 4096 MiB
  Memory Copy - 8192 MiB
  Memory Copy, Fixed Block Size - 4096 MiB
  Memory Copy, Fixed Block Size - 8192 MiB
7-Zip Compression:
  Compression Rating
  Decompression Rating
Etcpak
LeelaChessZero:
  BLAS
  Eigen
Stockfish
GROMACS
GROMACS
NAMD:
  ATPase with 327,506 Atoms
  STMV with 1,066,628 Atoms
Java JMH
Memcached:
  1:10
  1:100
Numpy Benchmark
Llama.cpp
Llamafile
NAS Parallel Benchmarks:
  BT.C
  CG.C
  EP.C
  EP.D
  FT.C
  IS.D
  LU.C
  MG.C
  SP.B
  SP.C
BRL-CAD
LULESH
Xcompact3d Incompact3d:
  input.i3d 129 Cells Per Direction
  input.i3d 193 Cells Per Direction
OpenFOAM:
  motorBike - Mesh Time
  motorBike - Execution Time
  drivaerFastback, Small Mesh Size - Mesh Time
  drivaerFastback, Small Mesh Size - Execution Time
  drivaerFastback, Medium Mesh Size - Mesh Time
  drivaerFastback, Medium Mesh Size - Execution Time
OpenRadioss:
  Bumper Beam
  Chrysler Neon 1M
  Cell Phone Drop Test
  Bird Strike on Windshield
  Rubber O-Ring Seal Installation
  INIVOL and Fluid Structure Interaction Drop Container
SPECFEM3D:
  Mount St. Helens
  Layered Halfspace
  Tomographic Model
  Homogeneous Halfspace
  Water-layered Halfspace
Timed Gem5 Compilation
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Timed LLVM Compilation
Timed Node.js Compilation
Build2
Y-Cruncher:
  1B
  500M
POV-Ray
Blender:
  BMW27 - CPU-Only
  Junkshop - CPU-Only
  Barbershop - CPU-Only
XNNPACK:
  FP32MobileNetV2
  FP32MobileNetV3Large
  FP32MobileNetV3Small
  FP16MobileNetV2
  FP16MobileNetV3Large
  FP16MobileNetV3Small
  QU8MobileNetV2
  QU8MobileNetV3Large
  QU8MobileNetV3Small