AMD EPYC Turin 8c vs. 12c Memory Channel DDR5 Comparison

AMD EPYC 9655P memory benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2411164-NE-AMDEPYCTU56&grr&sro.

AMD EPYC Turin 8c vs. 12c Memory Channel DDR5 ComparisonProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen Resolution8c DDR5-600012c DDR5-6000AMD EPYC 9655P 96-Core @ 2.60GHz (96 Cores / 192 Threads)Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)AMD 1Ah8 x 64GB DDR5-6000MT/s3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 24.106.12.0-rc7-phx (x86_64)GNOME Shell 47.0X ServerGCC 14.2.0ext41024x76812 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFFOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116Java Details- 8c DDR5-6000: OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)Python Details- Python 3.12.7Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC Turin 8c vs. 12c Memory Channel DDR5 Comparisonopenvkl: vklBenchmarkCPU ISPCsvt-av1: Preset 3 - Beauty 4K 10-bitpgbench: 1000 - 800 - Read Write - Average Latencypgbench: 1000 - 800 - Read Writehpcg: 144 144 144 - 60openradioss: Chrysler Neon 1Mluxcorerender: Danish Mood - CPUonnx: fcn-resnet101-11 - CPU - Standardonnx: fcn-resnet101-11 - CPU - Standardtensorflow: CPU - 512 - ResNet-50litert: Mobilenet Quantincompact3d: X3D-benchmarking input.i3depoch: Conewhisperfile: Mediumxnnpack: QS8MobileNetV2xnnpack: FP16MobileNetV3Smallxnnpack: FP16MobileNetV3Largexnnpack: FP16MobileNetV2xnnpack: FP16MobileNetV1xnnpack: FP32MobileNetV3Smallxnnpack: FP32MobileNetV3Largexnnpack: FP32MobileNetV2xnnpack: FP32MobileNetV1build-linux-kernel: allmodconfigopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUluxcorerender: DLSC - CPUhpcg: 104 104 104 - 60luxcorerender: LuxCore Benchmark - CPUonnx: CaffeNet 12-int8 - CPU - Standardonnx: CaffeNet 12-int8 - CPU - Standardpgbench: 1000 - 800 - Read Only - Average Latencypgbench: 1000 - 800 - Read Onlysvt-av1: Preset 5 - Beauty 4K 10-bitsvt-av1: Preset 3 - Bosphorus 4Klitert: NASNet Mobilelitert: DeepLab V3litert: Quantized COCO SSD MobileNet v1graph500: 26graph500: 26graph500: 26graph500: 26clickhouse: 100M Rows Hits Dataset, Third Runclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, First Run / Cold Cachecassandra: Writesblender: Barbershop - CPU-Onlyopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timewhisperfile: Smallnginx: 500apache: 500svt-av1: Preset 8 - Beauty 4K 10-bitwarpx: Uniform Plasmaopenradioss: INIVOL and Fluid Structure Interaction Drop Containerbuild-godot: Time To Compilelitert: Inception V4onednn: Recurrent Neural Network Training - CPUonednn: Recurrent Neural Network Inference - CPUv-ray: CPUsvt-av1: Preset 13 - Beauty 4K 10-bitmemcached: 1:10memcached: 1:5memcached: 1:100vvenc: Bosphorus 4K - Fastluxcorerender: Orange Juice - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUonnx: GPT-2 - CPU - Standardonnx: GPT-2 - CPU - Standardonnx: ZFNet-512 - CPU - Standardonnx: ZFNet-512 - CPU - Standardopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUonnx: bertsquad-12 - CPU - Standardonnx: bertsquad-12 - CPU - Standardonnx: yolov4 - CPU - Standardonnx: yolov4 - CPU - Standardonnx: T5 Encoder - CPU - Standardonnx: T5 Encoder - CPU - Standardopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUjohn-the-ripper: MD5john-the-ripper: HMAC-SHA512build2: Time To Compilelitert: Inception ResNet V2litert: Mobilenet Floatlitert: SqueezeNetstargate: 192000 - 512stargate: 192000 - 1024svt-av1: Preset 5 - Bosphorus 4Kmbw: Memory Copy, Fixed Block Size - 8192 MiBsrsran: PDSCH Processor Benchmark, Throughput Totalblender: Pabellon Barcelona - CPU-Onlyrawtherapee: Total Benchmark Timembw: Memory Copy - 8192 MiBblender: Classroom - CPU-Onlycompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingstargate: 96000 - 512liquid-dsp: 192 - 256 - 512liquid-dsp: 128 - 256 - 512john-the-ripper: WPA PSKliquid-dsp: 64 - 256 - 512john-the-ripper: bcryptjohn-the-ripper: Blowfishliquid-dsp: 192 - 256 - 57liquid-dsp: 128 - 256 - 57liquid-dsp: 64 - 256 - 57incompact3d: input.i3d 193 Cells Per Directionbuild-linux-kernel: defconfigstargate: 96000 - 1024gpaw: Carbon Nanotubenpb: EP.Dvvenc: Bosphorus 4K - Fasterwarpx: Plasma Accelerationamg: onednn: Deconvolution Batch shapes_1d - CPUastcenc: Very Thoroughastcenc: Exhaustivesrsran: PUSCH Processor Benchmark, Throughput Totalmbw: Memory Copy, Fixed Block Size - 4096 MiBuvg266: Bosphorus 4K - Slowblender: Fishy Cat - CPU-Onlyastcenc: Fastgromacs: MPI CPU - water_GMX50_bareuvg266: Bosphorus 4K - Mediummbw: Memory Copy - 4096 MiBspecfem3d: Water-layered Halfspaceminibude: OpenMP - BM2minibude: OpenMP - BM2npb: CG.Cblender: Junkshop - CPU-Onlyspecfem3d: Layered Halfspacenamd: STMV with 1,066,628 Atomsopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeramspeed: Add - Integerramspeed: Copy - Integerramspeed: Average - Integeronednn: IP Shapes 1D - CPUramspeed: Triad - Integerramspeed: Scale - Integersvt-av1: Preset 8 - Bosphorus 4Kblender: BMW27 - CPU-Onlykvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumspecfem3d: Mount St. Helensnpb: SP.Cspecfem3d: Homogeneous Halfspacenpb: BT.Cspecfem3d: Tomographic Modelnpb: SP.Bonednn: IP Shapes 3D - CPUastcenc: Thoroughuvg266: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Very Fastuvg266: Bosphorus 4K - Ultra Fastluxcorerender: Rainbow Colors and Prism - CPUnpb: LU.Cnpb: IS.Dnpb: MG.Cm-queens: Time To Solvestream: Copykvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Super Fastsvt-av1: Preset 13 - Bosphorus 4Kkvazaar: Bosphorus 4K - Ultra Fastonednn: Convolution Batch Shapes Auto - CPUnamd: ATPase with 327,506 Atomsnpb: FT.Castcenc: Mediumonednn: Deconvolution Batch shapes_3d - CPUnpb: EP.Cstream: Addstream: Triadstream: Scale8c DDR5-600012c DDR5-600027621.8467.13611217541.0157129.1712.06109.0059.18611196.476260.21292.578786196.03215.522899968108101325790944570105341418992364596196.240106.96449.2317.0141.318612.921.047505958.5760.25431492777.68715.603100519.411817.97110.1267670900049779800013944500001365800000651.48652.53637.67422312121.15254.29552109.1954396.68799523766.78159864.9913.51216.4537358079.9980.87443959.9429.684277.99619814517.1106752055.123711825.7512031156.039.59824.15330.13145.1112.427314.9264.75740.137.536329.604.5610448.9220.852291.090.33166363.913.9723443.865.34359187.1256.50761153.6375.808240.9827.063543.7946.202621.647882.943412.06054.24372235.5966.7014026.362140833341332633353.55159977.84441.277048.393.8963114.38897456.08818188.11126209.941.1836.22823357.88935.516265537270455.91859521367666671956333333101933314176333332370552374585955433333505320000032141333339.0141642922.5556.51734928.56014658.2522.63525.7628227220852180006.8440211.63037.13396238.118332.40930.3018.551072.781916.65432.9223448.63314.355377904288.3517208.77960572.2016.8013.8612520993.5595322.64837822.29707299666.50115580.47110436.650.529052104337.42117864.04182.60813.0944.7645.385.855350933108558.218.198005837284718.536.626369356169436.810.26555882.189665.5266.1266.8627.85275011.415653.35105206.587.229221234.489.3393.40430.86896.720.34137411.62457120202.20568.33630.71536311375.83233276.2229604.0204677.828251.8527.03711374561.772290.7712.03106.21339.44076241.555800.65203.937052186.85202.1590910095105591307489824611104181364392424629193.34669.48689.5016.4962.544112.390.9595311041.950.25431550367.69915.68499408.311450.97211.9974713800055493600015057100001478180000765.07771.05740.15447176120.83192.22721106.3117791.05176505484.86170467.3713.55416.2745715279.4280.55244088.3425.784277.64319976917.1456670118.943701189.2311923788.439.64324.19330.01145.1011.577934.4056.71844.937.486380.584.5610462.5619.612437.930.33166983.333.9623427.505.11928195.2766.09718163.9765.808240.4027.013551.3545.406722.023881.881612.21494.10880243.3286.7114006.152168100041347433353.43958054.34399.487051.593.9321794.42957056.43822449.82625941.941.2535.85425126.49235.436272157792425.94397921492000001952100000102333314129666672375422375155958600000496343333331959000007.0849269222.2246.53932225.66013685.9722.96025.8510977230762863336.7673611.62777.13346237.722520.67430.4018.461109.950117.78733.0425160.63214.275555136286.8587171.45165380.5816.7513.7000752294.2171722.40856822.944605120526.51139450.46130318.770.531984122052.34139548.85186.68713.1444.7945.305.778714613154965.848.194862498361153.066.777958300199255.890.26679182.279567.2667.2467.6528.23311883.806970.89147097.417.214343229.390.4691.54435.38296.070.34359312.28998164903.45568.82620.71637311230.81355858.4348347.3314946.5OpenBenchmarking.org

OpenVKL

Benchmark: vklBenchmarkCPU ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 2.0.0Benchmark: vklBenchmarkCPU ISPC12c DDR5-60008c DDR5-60006001200180024003000SE +/- 1.76, N = 3SE +/- 1.53, N = 328252762MIN: 217 / MAX: 36373MIN: 216 / MAX: 33882

SVT-AV1

Encoder Mode: Preset 3 - Input: Beauty 4K 10-bit

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 3 - Input: Beauty 4K 10-bit12c DDR5-60008c DDR5-60000.41670.83341.25011.66682.0835SE +/- 0.007, N = 3SE +/- 0.002, N = 31.8521.8461. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 1000 - Clients: 800 - Mode: Read Write - Average Latency12c DDR5-60008c DDR5-6000246810SE +/- 0.079, N = 5SE +/- 0.053, N = 117.0377.1361. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 1000 - Clients: 800 - Mode: Read Write12c DDR5-60008c DDR5-600020K40K60K80K100KSE +/- 1257.14, N = 5SE +/- 836.14, N = 111137451121751. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 6012c DDR5-60008c DDR5-60001428425670SE +/- 0.01, N = 3SE +/- 0.03, N = 361.7741.021. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1M12c DDR5-60008c DDR5-6000306090120150SE +/- 0.26, N = 3SE +/- 0.94, N = 1190.77129.17

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPU12c DDR5-60008c DDR5-60003691215SE +/- 0.09, N = 15SE +/- 0.11, N = 1512.0312.06MIN: 5.72 / MAX: 14.2MIN: 5.76 / MAX: 14.38

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInference Time Cost (ms), Fewer Is BetterONNX Runtime 1.19Model: fcn-resnet101-11 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-600020406080100SE +/- 1.49, N = 15SE +/- 1.08, N = 15106.21109.011. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Second, More Is BetterONNX Runtime 1.19Model: fcn-resnet101-11 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60003691215SE +/- 0.13237, N = 15SE +/- 0.08964, N = 159.440769.186111. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

TensorFlow

Device: CPU - Batch Size: 512 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 512 - Model: ResNet-5012c DDR5-60008c DDR5-600050100150200250SE +/- 0.11, N = 3SE +/- 0.27, N = 3241.55196.47

LiteRT

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Mobilenet Quant12c DDR5-60008c DDR5-600013002600390052006500SE +/- 109.66, N = 15SE +/- 155.36, N = 155800.656260.21

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3d12c DDR5-60008c DDR5-600060120180240300SE +/- 2.76, N = 3SE +/- 3.79, N = 3203.94292.581. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Epoch

Epoch3D Deck: Cone

OpenBenchmarking.orgSeconds, Fewer Is BetterEpoch 4.19.4Epoch3D Deck: Cone12c DDR5-60008c DDR5-60004080120160200SE +/- 0.55, N = 3SE +/- 0.52, N = 3186.85196.031. (F9X) gfortran options: -O3 -std=f2003 -Jobj -lsdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Whisperfile

Model Size: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterWhisperfile 20Aug24Model Size: Medium12c DDR5-60008c DDR5-600050100150200250SE +/- 0.37, N = 3SE +/- 0.69, N = 3202.16215.52

XNNPACK

Model: QS8MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: QS8MobileNetV212c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 60.71, N = 3SE +/- 103.51, N = 31009599681. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV3Small12c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 181.63, N = 3SE +/- 162.45, N = 310559108101. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV3Large12c DDR5-60008c DDR5-60003K6K9K12K15KSE +/- 58.09, N = 3SE +/- 75.41, N = 313074132571. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV212c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 96.09, N = 3SE +/- 49.08, N = 3898290941. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV1

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV112c DDR5-60008c DDR5-600010002000300040005000SE +/- 43.59, N = 3SE +/- 11.46, N = 3461145701. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV3Small12c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 54.11, N = 3SE +/- 8.95, N = 310418105341. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV3Large12c DDR5-60008c DDR5-60003K6K9K12K15KSE +/- 63.76, N = 3SE +/- 140.55, N = 313643141891. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV212c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 73.61, N = 3SE +/- 46.71, N = 3924292361. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV1

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV112c DDR5-60008c DDR5-600010002000300040005000SE +/- 57.47, N = 3SE +/- 41.29, N = 3462945961. (CXX) g++ options: -O3 -lrt -lm

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfig12c DDR5-60008c DDR5-60004080120160200SE +/- 0.43, N = 3SE +/- 0.13, N = 3193.35196.24

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Person Detection FP16 - Device: CPU12c DDR5-60008c DDR5-600020406080100SE +/- 0.19, N = 3SE +/- 1.46, N = 1569.48106.96MIN: 34.01 / MAX: 134.55MIN: 37.41 / MAX: 380.851. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Person Detection FP16 - Device: CPU12c DDR5-60008c DDR5-6000150300450600750SE +/- 1.81, N = 3SE +/- 6.83, N = 15689.50449.231. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPU12c DDR5-60008c DDR5-600048121620SE +/- 0.19, N = 3SE +/- 0.21, N = 1516.4917.01MIN: 15.79 / MAX: 20.03MIN: 15.67 / MAX: 20.15

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 6012c DDR5-60008c DDR5-60001428425670SE +/- 0.03, N = 3SE +/- 0.02, N = 362.5441.321. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPU12c DDR5-60008c DDR5-60003691215SE +/- 0.12, N = 15SE +/- 0.17, N = 312.3912.92MIN: 5.62 / MAX: 14.88MIN: 6.3 / MAX: 15.04

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInference Time Cost (ms), Fewer Is BetterONNX Runtime 1.19Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60000.23570.47140.70710.94281.1785SE +/- 0.010778, N = 3SE +/- 0.018573, N = 150.9595311.0475051. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Second, More Is BetterONNX Runtime 1.19Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60002004006008001000SE +/- 11.56, N = 3SE +/- 17.42, N = 151041.95958.581. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average Latency12c DDR5-60008c DDR5-60000.05720.11440.17160.22880.286SE +/- 0.000, N = 3SE +/- 0.001, N = 30.2540.2541. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 1000 - Clients: 800 - Mode: Read Only12c DDR5-60008c DDR5-6000700K1400K2100K2800K3500KSE +/- 1685.88, N = 3SE +/- 8342.05, N = 3315503631492771. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

SVT-AV1

Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit12c DDR5-60008c DDR5-6000246810SE +/- 0.039, N = 3SE +/- 0.026, N = 37.6997.6871. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 3 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 3 - Input: Bosphorus 4K12c DDR5-60008c DDR5-600048121620SE +/- 0.03, N = 3SE +/- 0.00, N = 315.6815.601. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

LiteRT

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: NASNet Mobile12c DDR5-60008c DDR5-600020K40K60K80K100KSE +/- 1271.15, N = 3SE +/- 1096.54, N = 1599408.3100519.4

LiteRT

Model: DeepLab V3

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: DeepLab V312c DDR5-60008c DDR5-60003K6K9K12K15KSE +/- 138.74, N = 3SE +/- 140.63, N = 1511450.911817.9

LiteRT

Model: Quantized COCO SSD MobileNet v1

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Quantized COCO SSD MobileNet v112c DDR5-60008c DDR5-600015003000450060007500SE +/- 28.16, N = 3SE +/- 68.84, N = 157211.997110.12

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 2612c DDR5-60008c DDR5-6000160M320M480M640M800M7471380006767090001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 2612c DDR5-60008c DDR5-6000120M240M360M480M600M5549360004977980001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 2612c DDR5-60008c DDR5-6000300M600M900M1200M1500M150571000013944500001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 2612c DDR5-60008c DDR5-6000300M600M900M1200M1500M147818000013658000001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

ClickHouse

100M Rows Hits Dataset, Third Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third Run12c DDR5-60008c DDR5-6000160320480640800SE +/- 4.74, N = 3SE +/- 1.81, N = 3765.07651.48MIN: 69.77 / MAX: 8571.43MIN: 70.92 / MAX: 6666.67

ClickHouse

100M Rows Hits Dataset, Second Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second Run12c DDR5-60008c DDR5-6000170340510680850SE +/- 2.95, N = 3SE +/- 0.96, N = 3771.05652.53MIN: 69.77 / MAX: 8571.43MIN: 69.28 / MAX: 7500

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold Cache12c DDR5-60008c DDR5-6000160320480640800SE +/- 3.26, N = 3SE +/- 1.98, N = 3740.15637.67MIN: 69.2 / MAX: 7500MIN: 67.95 / MAX: 7500

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 5.0Test: Writes12c DDR5-60008c DDR5-6000100K200K300K400K500KSE +/- 2454.69, N = 3SE +/- 1303.09, N = 3447176422312

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: CPU-Only12c DDR5-60008c DDR5-6000306090120150SE +/- 0.11, N = 3SE +/- 0.12, N = 3120.83121.15

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time12c DDR5-60008c DDR5-600060120180240300192.23254.301. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time12c DDR5-60008c DDR5-600020406080100106.31109.201. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

Whisperfile

Model Size: Small

OpenBenchmarking.orgSeconds, Fewer Is BetterWhisperfile 20Aug24Model Size: Small12c DDR5-60008c DDR5-600020406080100SE +/- 0.71, N = 3SE +/- 0.65, N = 391.0596.69

nginx

Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 50012c DDR5-60008c DDR5-6000110K220K330K440K550KSE +/- 328.79, N = 3SE +/- 3145.55, N = 3505484.86523766.781. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.56Concurrent Requests: 50012c DDR5-60008c DDR5-600040K80K120K160K200KSE +/- 413.18, N = 3SE +/- 177.80, N = 3170467.37159864.991. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

SVT-AV1

Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit12c DDR5-60008c DDR5-60003691215SE +/- 0.00, N = 3SE +/- 0.02, N = 313.5513.511. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

WarpX

Input: Uniform Plasma

OpenBenchmarking.orgSeconds, Fewer Is BetterWarpX 24.10Input: Uniform Plasma12c DDR5-60008c DDR5-600048121620SE +/- 0.23, N = 15SE +/- 0.24, N = 1216.2716.451. (CXX) g++ options: -O3 -lm

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop Container12c DDR5-60008c DDR5-600020406080100SE +/- 0.06, N = 3SE +/- 0.30, N = 379.4279.99

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To Compile12c DDR5-60008c DDR5-600020406080100SE +/- 0.19, N = 3SE +/- 0.04, N = 380.5580.87

LiteRT

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Inception V412c DDR5-60008c DDR5-60009K18K27K36K45KSE +/- 38.95, N = 3SE +/- 438.48, N = 644088.343959.9

oneDNN

Harness: Recurrent Neural Network Training - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Recurrent Neural Network Training - Engine: CPU12c DDR5-60008c DDR5-600090180270360450SE +/- 0.20, N = 3SE +/- 0.28, N = 3425.78429.68MIN: 420.18MIN: 423.831. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Recurrent Neural Network Inference - Engine: CPU12c DDR5-60008c DDR5-600060120180240300SE +/- 0.81, N = 3SE +/- 0.67, N = 3277.64278.00MIN: 269.94MIN: 271.731. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 6.0Mode: CPU12c DDR5-60008c DDR5-600040K80K120K160K200KSE +/- 252.42, N = 3SE +/- 357.23, N = 3199769198145

SVT-AV1

Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit12c DDR5-60008c DDR5-600048121620SE +/- 0.01, N = 3SE +/- 0.00, N = 317.1517.111. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:1012c DDR5-60008c DDR5-60001.4M2.8M4.2M5.6M7MSE +/- 23004.60, N = 3SE +/- 5728.08, N = 36670118.946752055.121. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:512c DDR5-60008c DDR5-6000800K1600K2400K3200K4000KSE +/- 16589.87, N = 3SE +/- 3725.22, N = 33701189.233711825.751. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10012c DDR5-60008c DDR5-60003M6M9M12M15MSE +/- 20796.91, N = 3SE +/- 48609.95, N = 311923788.4312031156.031. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.11Video Input: Bosphorus 4K - Video Preset: Fast12c DDR5-60008c DDR5-60003691215SE +/- 0.082, N = 3SE +/- 0.064, N = 39.6439.5981. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPU12c DDR5-60008c DDR5-6000612182430SE +/- 0.09, N = 3SE +/- 0.09, N = 324.1924.15MIN: 21.48 / MAX: 33.13MIN: 21.4 / MAX: 32.92

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Face Detection FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-600070140210280350SE +/- 0.62, N = 3SE +/- 0.26, N = 3330.01330.13MIN: 242.42 / MAX: 353.46MIN: 257.44 / MAX: 357.541. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Face Detection FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-6000306090120150SE +/- 0.26, N = 3SE +/- 0.12, N = 3145.10145.111. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Noise Suppression Poconet-Like FP16 - Device: CPU12c DDR5-60008c DDR5-60003691215SE +/- 0.03, N = 3SE +/- 0.02, N = 311.5712.42MIN: 6.95 / MAX: 30.79MIN: 6.11 / MAX: 28.551. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Noise Suppression Poconet-Like FP16 - Device: CPU12c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 20.16, N = 3SE +/- 10.57, N = 37934.407314.921. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Machine Translation EN To DE FP16 - Device: CPU12c DDR5-60008c DDR5-60001428425670SE +/- 0.09, N = 3SE +/- 0.40, N = 356.7164.75MIN: 34.97 / MAX: 96.61MIN: 28.81 / MAX: 99.111. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Machine Translation EN To DE FP16 - Device: CPU12c DDR5-60008c DDR5-60002004006008001000SE +/- 1.30, N = 3SE +/- 4.52, N = 3844.93740.131. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Person Vehicle Bike Detection FP16 - Device: CPU12c DDR5-60008c DDR5-6000246810SE +/- 0.01, N = 3SE +/- 0.02, N = 37.487.53MIN: 4.42 / MAX: 23.35MIN: 4.35 / MAX: 22.921. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Person Vehicle Bike Detection FP16 - Device: CPU12c DDR5-60008c DDR5-600014002800420056007000SE +/- 12.38, N = 3SE +/- 17.72, N = 36380.586329.601. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Person Re-Identification Retail FP16 - Device: CPU12c DDR5-60008c DDR5-60001.0262.0523.0784.1045.13SE +/- 0.00, N = 3SE +/- 0.01, N = 34.564.56MIN: 2.45 / MAX: 18.95MIN: 2.23 / MAX: 20.981. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Person Re-Identification Retail FP16 - Device: CPU12c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 5.49, N = 3SE +/- 7.25, N = 310462.5610448.921. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Road Segmentation ADAS FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-6000510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 319.6120.85MIN: 10.37 / MAX: 41.25MIN: 10.51 / MAX: 40.511. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Road Segmentation ADAS FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-60005001000150020002500SE +/- 1.15, N = 3SE +/- 2.70, N = 32437.932291.091. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-60000.07430.14860.22290.29720.3715SE +/- 0.00, N = 3SE +/- 0.00, N = 30.330.33MIN: 0.12 / MAX: 23.43MIN: 0.13 / MAX: 24.631. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-600040K80K120K160K200KSE +/- 251.97, N = 3SE +/- 234.32, N = 3166983.33166363.911. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Face Detection Retail FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-60000.89331.78662.67993.57324.4665SE +/- 0.01, N = 3SE +/- 0.00, N = 33.963.97MIN: 1.68 / MAX: 20.25MIN: 1.61 / MAX: 20.641. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Face Detection Retail FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-60005K10K15K20K25KSE +/- 29.34, N = 3SE +/- 11.10, N = 323427.5023443.861. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInference Time Cost (ms), Fewer Is BetterONNX Runtime 1.19Model: GPT-2 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60001.20232.40463.60694.80926.0115SE +/- 0.03369, N = 3SE +/- 0.06457, N = 35.119285.343591. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Second, More Is BetterONNX Runtime 1.19Model: GPT-2 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60004080120160200SE +/- 1.29, N = 3SE +/- 2.26, N = 3195.28187.131. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ZFNet-512 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInference Time Cost (ms), Fewer Is BetterONNX Runtime 1.19Model: ZFNet-512 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-6000246810SE +/- 0.00692, N = 3SE +/- 0.02791, N = 36.097186.507611. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ZFNet-512 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Second, More Is BetterONNX Runtime 1.19Model: ZFNet-512 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60004080120160200SE +/- 0.19, N = 3SE +/- 0.66, N = 3163.98153.641. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Vehicle Detection FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-60001.3052.613.9155.226.525SE +/- 0.01, N = 3SE +/- 0.00, N = 35.805.80MIN: 2.41 / MAX: 27.54MIN: 2.26 / MAX: 26.371. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Vehicle Detection FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 10.65, N = 3SE +/- 5.48, N = 38240.408240.981. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Handwritten English Recognition FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-6000612182430SE +/- 0.04, N = 3SE +/- 0.01, N = 327.0127.06MIN: 16.35 / MAX: 47.96MIN: 16.36 / MAX: 45.561. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Handwritten English Recognition FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-60008001600240032004000SE +/- 4.91, N = 3SE +/- 1.42, N = 33551.353543.791. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInference Time Cost (ms), Fewer Is BetterONNX Runtime 1.19Model: bertsquad-12 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60001020304050SE +/- 0.24, N = 3SE +/- 0.48, N = 345.4146.201. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Second, More Is BetterONNX Runtime 1.19Model: bertsquad-12 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-6000510152025SE +/- 0.12, N = 3SE +/- 0.23, N = 322.0221.651. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInference Time Cost (ms), Fewer Is BetterONNX Runtime 1.19Model: yolov4 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-600020406080100SE +/- 0.85, N = 3SE +/- 1.14, N = 381.8882.941. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Second, More Is BetterONNX Runtime 1.19Model: yolov4 - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60003691215SE +/- 0.13, N = 3SE +/- 0.16, N = 312.2112.061. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

OpenBenchmarking.orgInference Time Cost (ms), Fewer Is BetterONNX Runtime 1.19Model: T5 Encoder - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-60000.95481.90962.86443.81924.774SE +/- 0.00593, N = 3SE +/- 0.01572, N = 34.108804.243721. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Second, More Is BetterONNX Runtime 1.19Model: T5 Encoder - Device: CPU - Executor: Standard12c DDR5-60008c DDR5-600050100150200250SE +/- 0.36, N = 3SE +/- 0.88, N = 3243.33235.601. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Weld Porosity Detection FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-6000246810SE +/- 0.01, N = 3SE +/- 0.00, N = 36.716.70MIN: 2.58 / MAX: 23.11MIN: 2.45 / MAX: 241. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Weld Porosity Detection FP16-INT8 - Device: CPU12c DDR5-60008c DDR5-60003K6K9K12K15KSE +/- 11.52, N = 3SE +/- 1.42, N = 314006.1514026.361. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD512c DDR5-60008c DDR5-60005M10M15M20M25MSE +/- 20808.65, N = 3SE +/- 32518.37, N = 321681000214083331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA51212c DDR5-60008c DDR5-600090M180M270M360M450MSE +/- 2811379.81, N = 3SE +/- 211622.57, N = 34134743334133263331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.17Time To Compile12c DDR5-60008c DDR5-60001224364860SE +/- 0.01, N = 3SE +/- 0.16, N = 353.4453.55

LiteRT

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Inception ResNet V212c DDR5-60008c DDR5-600013K26K39K52K65KSE +/- 598.16, N = 3SE +/- 811.71, N = 358054.359977.8

LiteRT

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Mobilenet Float12c DDR5-60008c DDR5-600010002000300040005000SE +/- 16.88, N = 3SE +/- 18.31, N = 34399.484441.27

LiteRT

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: SqueezeNet12c DDR5-60008c DDR5-600015003000450060007500SE +/- 17.70, N = 3SE +/- 27.86, N = 37051.597048.39

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 51212c DDR5-60008c DDR5-60000.88471.76942.65413.53884.4235SE +/- 0.003414, N = 3SE +/- 0.040462, N = 33.9321793.8963111. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 102412c DDR5-60008c DDR5-60000.99671.99342.99013.98684.9835SE +/- 0.018753, N = 3SE +/- 0.014254, N = 34.4295704.3889741. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

SVT-AV1

Encoder Mode: Preset 5 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 5 - Input: Bosphorus 4K12c DDR5-60008c DDR5-60001326395265SE +/- 0.28, N = 3SE +/- 0.10, N = 356.4456.091. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB12c DDR5-60008c DDR5-60005K10K15K20K25KSE +/- 64.32, N = 3SE +/- 60.40, N = 322449.8318188.111. (CC) gcc options: -O3 -march=native

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PDSCH Processor Benchmark, Throughput Total12c DDR5-60008c DDR5-60006K12K18K24K30KSE +/- 311.92, N = 3SE +/- 185.11, N = 1225941.926209.91. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: CPU-Only12c DDR5-60008c DDR5-6000918273645SE +/- 0.14, N = 3SE +/- 0.15, N = 341.2541.18

RawTherapee

Total Benchmark Time

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time12c DDR5-60008c DDR5-6000816243240SE +/- 0.05, N = 3SE +/- 0.05, N = 335.8536.231. RawTherapee, version 5.10, command line.

MBW

Test: Memory Copy - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiB12c DDR5-60008c DDR5-60005K10K15K20K25KSE +/- 31.99, N = 3SE +/- 19.02, N = 325126.4923357.891. (CC) gcc options: -O3 -march=native

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: CPU-Only12c DDR5-60008c DDR5-6000816243240SE +/- 0.04, N = 3SE +/- 0.08, N = 335.4335.51

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression Rating12c DDR5-60008c DDR5-6000130K260K390K520K650KSE +/- 169.10, N = 3SE +/- 340.29, N = 36272156265531. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression Rating12c DDR5-60008c DDR5-6000200K400K600K800K1000KSE +/- 879.82, N = 3SE +/- 1850.34, N = 37792427270451. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 96000 - Buffer Size: 51212c DDR5-60008c DDR5-60001.33742.67484.01225.34966.687SE +/- 0.018441, N = 3SE +/- 0.033577, N = 35.9439795.9185951. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 192 - Buffer Length: 256 - Filter Length: 51212c DDR5-60008c DDR5-6000500M1000M1500M2000M2500MSE +/- 1582192.57, N = 3SE +/- 821245.67, N = 3214920000021367666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 51212c DDR5-60008c DDR5-6000400M800M1200M1600M2000MSE +/- 6251666.44, N = 3SE +/- 1072898.46, N = 3195210000019563333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSK12c DDR5-60008c DDR5-6000200K400K600K800K1000KSE +/- 2185.81, N = 3SE +/- 1666.67, N = 3102333310193331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 51212c DDR5-60008c DDR5-6000300M600M900M1200M1500MSE +/- 1419311.26, N = 3SE +/- 2356079.61, N = 3141296666714176333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcrypt12c DDR5-60008c DDR5-600050K100K150K200K250KSE +/- 33.49, N = 3SE +/- 505.86, N = 32375422370551. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: Blowfish12c DDR5-60008c DDR5-600050K100K150K200K250KSE +/- 16.90, N = 3SE +/- 16.64, N = 32375152374581. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 192 - Buffer Length: 256 - Filter Length: 5712c DDR5-60008c DDR5-60001300M2600M3900M5200M6500MSE +/- 9761830.43, N = 3SE +/- 8738103.02, N = 3595860000059554333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 5712c DDR5-60008c DDR5-60001100M2200M3300M4400M5500MSE +/- 8326730.72, N = 3SE +/- 8868107.65, N = 3496343333350532000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 5712c DDR5-60008c DDR5-6000700M1400M2100M2800M3500MSE +/- 1814754.35, N = 3SE +/- 3636084.59, N = 3319590000032141333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction12c DDR5-60008c DDR5-60003691215SE +/- 0.06792326, N = 3SE +/- 0.05824182, N = 157.084926929.014164291. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig12c DDR5-60008c DDR5-6000510152025SE +/- 0.25, N = 4SE +/- 0.25, N = 422.2222.56

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 96000 - Buffer Size: 102412c DDR5-60008c DDR5-6000246810SE +/- 0.024623, N = 3SE +/- 0.002866, N = 36.5393226.5173491. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon Nanotube12c DDR5-60008c DDR5-6000714212835SE +/- 0.23, N = 3SE +/- 0.24, N = 325.6628.561. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D12c DDR5-60008c DDR5-60003K6K9K12K15KSE +/- 614.47, N = 12SE +/- 133.00, N = 313685.9714658.251. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.11Video Input: Bosphorus 4K - Video Preset: Faster12c DDR5-60008c DDR5-6000612182430SE +/- 0.06, N = 3SE +/- 0.02, N = 322.9622.641. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

WarpX

Input: Plasma Acceleration

OpenBenchmarking.orgSeconds, Fewer Is BetterWarpX 24.10Input: Plasma Acceleration12c DDR5-60008c DDR5-6000612182430SE +/- 0.22, N = 3SE +/- 0.17, N = 325.8525.761. (CXX) g++ options: -O3 -lm

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.212c DDR5-60008c DDR5-6000700M1400M2100M2800M3500MSE +/- 7144725.85, N = 3SE +/- 1073443.21, N = 330762863332085218000

oneDNN

Harness: Deconvolution Batch shapes_1d - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Deconvolution Batch shapes_1d - Engine: CPU12c DDR5-60008c DDR5-6000246810SE +/- 0.02120, N = 3SE +/- 0.01476, N = 36.767366.84402MIN: 4.07MIN: 5.141. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

ASTC Encoder

Preset: Very Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 5.0Preset: Very Thorough12c DDR5-60008c DDR5-60003691215SE +/- 0.00, N = 3SE +/- 0.00, N = 311.6311.631. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 5.0Preset: Exhaustive12c DDR5-60008c DDR5-6000246810SE +/- 0.0018, N = 3SE +/- 0.0004, N = 37.13347.13391. (CXX) g++ options: -O3 -flto -pthread

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PUSCH Processor Benchmark, Throughput Total12c DDR5-60008c DDR5-600013002600390052006500SE +/- 0.38, N = 3SE +/- 0.12, N = 36237.76238.1MIN: 3818.9 / MAX: 6238.4MIN: 3819.5 / MAX: 6238.31. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB12c DDR5-60008c DDR5-60005K10K15K20K25KSE +/- 55.98, N = 3SE +/- 54.43, N = 322520.6718332.411. (CC) gcc options: -O3 -march=native

uvg266

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Slow12c DDR5-60008c DDR5-6000714212835SE +/- 0.01, N = 3SE +/- 0.04, N = 330.4030.30

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: CPU-Only12c DDR5-60008c DDR5-6000510152025SE +/- 0.05, N = 3SE +/- 0.03, N = 318.4618.55

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 5.0Preset: Fast12c DDR5-60008c DDR5-60002004006008001000SE +/- 14.18, N = 3SE +/- 7.37, N = 131109.951072.781. (CXX) g++ options: -O3 -flto -pthread

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bare12c DDR5-60008c DDR5-600048121620SE +/- 0.02, N = 3SE +/- 0.03, N = 317.7916.651. (CXX) g++ options: -O3 -lm

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Medium12c DDR5-60008c DDR5-6000816243240SE +/- 0.06, N = 3SE +/- 0.02, N = 333.0432.92

MBW

Test: Memory Copy - Array Size: 4096 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 4096 MiB12c DDR5-60008c DDR5-60005K10K15K20K25KSE +/- 22.33, N = 3SE +/- 26.98, N = 325160.6323448.631. (CC) gcc options: -O3 -march=native

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered Halfspace12c DDR5-60008c DDR5-600048121620SE +/- 0.08, N = 3SE +/- 0.07, N = 314.2814.361. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM212c DDR5-60008c DDR5-600060120180240300SE +/- 2.24, N = 3SE +/- 0.50, N = 3286.86288.351. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM212c DDR5-60008c DDR5-600015003000450060007500SE +/- 56.00, N = 3SE +/- 12.59, N = 37171.457208.781. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C12c DDR5-60008c DDR5-600014K28K42K56K70KSE +/- 928.22, N = 15SE +/- 631.87, N = 1565380.5860572.201. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Blender

Blend File: Junkshop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: CPU-Only12c DDR5-60008c DDR5-600048121620SE +/- 0.01, N = 3SE +/- 0.02, N = 316.7516.80

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered Halfspace12c DDR5-60008c DDR5-600048121620SE +/- 0.07, N = 3SE +/- 0.11, N = 313.7013.861. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAMD

Input: STMV with 1,066,628 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: STMV with 1,066,628 Atoms12c DDR5-60008c DDR5-60000.94891.89782.84673.79564.7445SE +/- 0.00356, N = 3SE +/- 0.00613, N = 34.217173.55953

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time12c DDR5-60008c DDR5-600051015202522.4122.651. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time12c DDR5-60008c DDR5-600051015202522.9422.301. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integer12c DDR5-60008c DDR5-600030K60K90K120K150KSE +/- 235.46, N = 3SE +/- 1117.41, N = 3120526.5199666.501. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integer12c DDR5-60008c DDR5-600030K60K90K120K150KSE +/- 266.61, N = 3SE +/- 779.65, N = 3139450.46115580.471. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integer12c DDR5-60008c DDR5-600030K60K90K120K150KSE +/- 252.96, N = 3SE +/- 807.29, N = 3130318.77110436.651. (CC) gcc options: -O3 -march=native

oneDNN

Harness: IP Shapes 1D - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: IP Shapes 1D - Engine: CPU12c DDR5-60008c DDR5-60000.11970.23940.35910.47880.5985SE +/- 0.003116, N = 3SE +/- 0.004111, N = 30.5319840.529052MIN: 0.48MIN: 0.471. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

RAMspeed SMP

Type: Triad - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integer12c DDR5-60008c DDR5-600030K60K90K120K150KSE +/- 253.90, N = 3SE +/- 795.56, N = 3122052.34104337.421. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integer12c DDR5-60008c DDR5-600030K60K90K120K150KSE +/- 283.77, N = 3SE +/- 291.38, N = 3139548.85117864.041. (CC) gcc options: -O3 -march=native

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 8 - Input: Bosphorus 4K12c DDR5-60008c DDR5-60004080120160200SE +/- 1.50, N = 3SE +/- 0.19, N = 3186.69182.611. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: CPU-Only12c DDR5-60008c DDR5-60003691215SE +/- 0.05, N = 3SE +/- 0.05, N = 313.1413.09

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Slow12c DDR5-60008c DDR5-60001020304050SE +/- 0.08, N = 3SE +/- 0.11, N = 344.7944.761. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Medium12c DDR5-60008c DDR5-60001020304050SE +/- 0.06, N = 3SE +/- 0.09, N = 345.3045.381. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. Helens12c DDR5-60008c DDR5-60001.31752.6353.95255.276.5875SE +/- 0.055041694, N = 6SE +/- 0.034598254, N = 35.7787146135.8553509331. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C12c DDR5-60008c DDR5-600030K60K90K120K150KSE +/- 622.44, N = 3SE +/- 459.55, N = 3154965.84108558.211. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous Halfspace12c DDR5-60008c DDR5-6000246810SE +/- 0.106640607, N = 3SE +/- 0.043840287, N = 38.1948624988.1980058371. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C12c DDR5-60008c DDR5-600080K160K240K320K400KSE +/- 2078.86, N = 3SE +/- 377.10, N = 3361153.06284718.531. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic Model12c DDR5-60008c DDR5-6000246810SE +/- 0.086144824, N = 3SE +/- 0.080343793, N = 36.7779583006.6263693561. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B12c DDR5-60008c DDR5-600040K80K120K160K200KSE +/- 2749.03, N = 3SE +/- 1524.43, N = 15199255.89169436.811. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

oneDNN

Harness: IP Shapes 3D - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: IP Shapes 3D - Engine: CPU12c DDR5-60008c DDR5-60000.060.120.180.240.3SE +/- 0.000721, N = 3SE +/- 0.000649, N = 30.2667910.265558MIN: 0.25MIN: 0.251. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 5.0Preset: Thorough12c DDR5-60008c DDR5-600020406080100SE +/- 0.02, N = 3SE +/- 0.04, N = 382.2882.191. (CXX) g++ options: -O3 -flto -pthread

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Super Fast12c DDR5-60008c DDR5-60001530456075SE +/- 0.11, N = 3SE +/- 0.32, N = 367.2665.52

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Very Fast12c DDR5-60008c DDR5-60001530456075SE +/- 0.21, N = 3SE +/- 0.33, N = 367.2466.12

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast12c DDR5-60008c DDR5-60001530456075SE +/- 0.18, N = 3SE +/- 0.79, N = 367.6566.86

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPU12c DDR5-60008c DDR5-6000714212835SE +/- 0.19, N = 3SE +/- 0.02, N = 328.2327.85MIN: 25.05 / MAX: 28.69MIN: 24.8 / MAX: 28.48

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C12c DDR5-60008c DDR5-600070K140K210K280K350KSE +/- 2348.10, N = 3SE +/- 1080.76, N = 3311883.80275011.411. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D12c DDR5-60008c DDR5-600015003000450060007500SE +/- 26.78, N = 3SE +/- 31.49, N = 36970.895653.351. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C12c DDR5-60008c DDR5-600030K60K90K120K150KSE +/- 2056.93, N = 3SE +/- 1082.06, N = 15147097.41105206.581. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solve12c DDR5-60008c DDR5-6000246810SE +/- 0.047, N = 3SE +/- 0.010, N = 37.2147.2291. (CXX) g++ options: -fopenmp -O2 -march=native

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Copy12c DDR5-60008c DDR5-600070K140K210K280K350KSE +/- 570.76, N = 5SE +/- 357.86, N = 5343229.3221234.41. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very Fast12c DDR5-60008c DDR5-600020406080100SE +/- 0.61, N = 3SE +/- 0.20, N = 390.4689.331. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super Fast12c DDR5-60008c DDR5-600020406080100SE +/- 0.52, N = 3SE +/- 0.53, N = 391.5493.401. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 13 - Input: Bosphorus 4K12c DDR5-60008c DDR5-600090180270360450SE +/- 1.17, N = 3SE +/- 0.57, N = 3435.38430.871. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra Fast12c DDR5-60008c DDR5-600020406080100SE +/- 0.29, N = 3SE +/- 0.36, N = 396.0796.721. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Convolution Batch Shapes Auto - Engine: CPU12c DDR5-60008c DDR5-60000.07730.15460.23190.30920.3865SE +/- 0.000927, N = 3SE +/- 0.000677, N = 30.3435930.341374MIN: 0.32MIN: 0.331. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

NAMD

Input: ATPase with 327,506 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: ATPase with 327,506 Atoms12c DDR5-60008c DDR5-60003691215SE +/- 0.03, N = 3SE +/- 0.03, N = 312.2911.62

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C12c DDR5-60008c DDR5-600040K80K120K160K200KSE +/- 1877.63, N = 4SE +/- 1124.88, N = 3164903.45120202.201. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 5.0Preset: Medium12c DDR5-60008c DDR5-6000120240360480600SE +/- 0.76, N = 3SE +/- 0.83, N = 3568.83568.341. (CXX) g++ options: -O3 -flto -pthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Deconvolution Batch shapes_3d - Engine: CPU12c DDR5-60008c DDR5-60000.16120.32240.48360.64480.806SE +/- 0.002304, N = 3SE +/- 0.001800, N = 30.7163730.715363MIN: 0.62MIN: 0.621. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C12c DDR5-60008c DDR5-60002K4K6K8K10KSE +/- 85.90, N = 3SE +/- 91.88, N = 311230.8111375.831. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Add12c DDR5-60008c DDR5-600080K160K240K320K400KSE +/- 1639.97, N = 5SE +/- 1797.63, N = 5355858.4233276.21. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Triad12c DDR5-60008c DDR5-600070K140K210K280K350KSE +/- 1895.03, N = 5SE +/- 936.98, N = 5348347.3229604.01. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Scale12c DDR5-60008c DDR5-600070K140K210K280K350KSE +/- 478.12, N = 5SE +/- 630.12, N = 5314946.5204677.81. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp


Phoronix Test Suite v10.8.5