AMD EPYC Turin 8c vs. 12c Memory Channel DDR5 Comparison

AMD EPYC 9655P memory benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2411164-NE-AMDEPYCTU56.

System details (identical for 8c DDR5-6000 and 12c DDR5-6000 except Memory):

Processor: AMD EPYC 9655P 96-Core @ 2.60GHz (96 Cores / 192 Threads)
Motherboard: Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)
Chipset: AMD 1Ah
Memory (8c DDR5-6000): 8 x 64GB DDR5-6000MT/s
Memory (12c DDR5-6000): 12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF
Disk: 3201GB Micron_7450_MTFDKCB3T2TFS
Graphics: ASPEED
Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 24.10
Kernel: 6.12.0-rc7-phx (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 1024x768

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xb002116

Java Details
- 8c DDR5-6000: OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)

Python Details
- Python 3.12.7

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

AMD EPYC Turin 8c vs. 12c Memory Channel DDR5 Comparison: result overview table (one row per test; columns: 8c DDR5-6000, 12c DDR5-6000). The individual results are listed per test.

RAMspeed SMP

Type: Add - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
8c DDR5-6000: 99666.50 (SE +/- 1117.41, N = 3)
12c DDR5-6000: 120526.51 (SE +/- 235.46, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
8c DDR5-6000: 115580.47 (SE +/- 779.65, N = 3)
12c DDR5-6000: 139450.46 (SE +/- 266.61, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
8c DDR5-6000: 117864.04 (SE +/- 291.38, N = 3)
12c DDR5-6000: 139548.85 (SE +/- 283.77, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
8c DDR5-6000: 104337.42 (SE +/- 795.56, N = 3)
12c DDR5-6000: 122052.34 (SE +/- 253.90, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
8c DDR5-6000: 110436.65 (SE +/- 807.29, N = 3)
12c DDR5-6000: 130318.77 (SE +/- 252.96, N = 3)
1. (CC) gcc options: -O3 -march=native

Stream

Type: Copy

Stream 2013-01-17 (MB/s, More Is Better)
8c DDR5-6000: 221234.4 (SE +/- 357.86, N = 5)
12c DDR5-6000: 343229.3 (SE +/- 570.76, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Scale

Stream 2013-01-17 (MB/s, More Is Better)
8c DDR5-6000: 204677.8 (SE +/- 630.12, N = 5)
12c DDR5-6000: 314946.5 (SE +/- 478.12, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Triad

Stream 2013-01-17 (MB/s, More Is Better)
8c DDR5-6000: 229604.0 (SE +/- 936.98, N = 5)
12c DDR5-6000: 348347.3 (SE +/- 1895.03, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Add

Stream 2013-01-17 (MB/s, More Is Better)
8c DDR5-6000: 233276.2 (SE +/- 1797.63, N = 5)
12c DDR5-6000: 355858.4 (SE +/- 1639.97, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
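
For context, the four Stream results above can be set against the channel-count ratio (12 / 8 = 1.5) and against a theoretical peak. The sketch below is illustrative only. It assumes an 8-byte (64-bit) data path per DDR5 channel and decimal MB/s, neither of which is stated in this result file, and the function names are hypothetical.

```python
# Stream results in MB/s, copied from the four Stream listings above
# as (8c DDR5-6000, 12c DDR5-6000).
STREAM_MBPS = {
    "Copy": (221234.4, 343229.3),
    "Scale": (204677.8, 314946.5),
    "Triad": (229604.0, 348347.3),
    "Add": (233276.2, 355858.4),
}


def theoretical_peak_mbps(channels, mt_per_s=6000, bytes_per_transfer=8):
    """Peak bandwidth in MB/s (10**6 bytes/s).

    Assumption (not from the result file): each channel moves
    8 bytes per transfer at 6000 MT/s.
    """
    return channels * mt_per_s * bytes_per_transfer


def scaling_report(results=STREAM_MBPS):
    """Return one dict per Stream kernel with ratio and peak fractions."""
    peak_8c = theoretical_peak_mbps(8)
    peak_12c = theoretical_peak_mbps(12)
    rows = []
    for name, (bw_8c, bw_12c) in results.items():
        rows.append({
            "kernel": name,
            "ratio": bw_12c / bw_8c,            # measured 12c / 8c
            "frac_peak_8c": bw_8c / peak_8c,    # share of assumed peak
            "frac_peak_12c": bw_12c / peak_12c,
        })
    return rows


if __name__ == "__main__":
    for row in scaling_report():
        print(f"{row['kernel']:6s} ratio={row['ratio']:.3f} "
              f"8c={row['frac_peak_8c']:.1%} 12c={row['frac_peak_12c']:.1%}")
```

A ratio close to 1.5 suggests that a kernel scales roughly with the number of populated channels. The peak fractions are only as good as the assumed bytes-per-transfer figure.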

MBW

Test: Memory Copy - Array Size: 4096 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
8c DDR5-6000: 23448.63 (SE +/- 26.98, N = 3)
12c DDR5-6000: 25160.63 (SE +/- 22.33, N = 3)
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy - Array Size: 8192 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
8c DDR5-6000: 23357.89 (SE +/- 19.02, N = 3)
12c DDR5-6000: 25126.49 (SE +/- 31.99, N = 3)
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
8c DDR5-6000: 18332.41 (SE +/- 54.43, N = 3)
12c DDR5-6000: 22520.67 (SE +/- 55.98, N = 3)
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

MBW 2018-09-08 (MiB/s, More Is Better)
8c DDR5-6000: 18188.11 (SE +/- 60.40, N = 3)
12c DDR5-6000: 22449.83 (SE +/- 64.32, N = 3)
1. (CC) gcc options: -O3 -march=native

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
8c DDR5-6000: 41.32 (SE +/- 0.02, N = 3)
12c DDR5-6000: 62.54 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
8c DDR5-6000: 41.02 (SE +/- 0.03, N = 3)
12c DDR5-6000: 61.77 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 284718.53 (SE +/- 377.10, N = 3)
12c DDR5-6000: 361153.06 (SE +/- 2078.86, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 60572.20 (SE +/- 631.87, N = 15)
12c DDR5-6000: 65380.58 (SE +/- 928.22, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 11375.83 (SE +/- 91.88, N = 3)
12c DDR5-6000: 11230.81 (SE +/- 85.90, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 14658.25 (SE +/- 133.00, N = 3)
12c DDR5-6000: 13685.97 (SE +/- 614.47, N = 12)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 120202.20 (SE +/- 1124.88, N = 3)
12c DDR5-6000: 164903.45 (SE +/- 1877.63, N = 4)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 5653.35 (SE +/- 31.49, N = 3)
12c DDR5-6000: 6970.89 (SE +/- 26.78, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 275011.41 (SE +/- 1080.76, N = 3)
12c DDR5-6000: 311883.80 (SE +/- 2348.10, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 105206.58 (SE +/- 1082.06, N = 15)
12c DDR5-6000: 147097.41 (SE +/- 2056.93, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 169436.81 (SE +/- 1524.43, N = 15)
12c DDR5-6000: 199255.89 (SE +/- 2749.03, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
8c DDR5-6000: 108558.21 (SE +/- 459.55, N = 3)
12c DDR5-6000: 154965.84 (SE +/- 622.44, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 (GFInst/s, More Is Better)
8c DDR5-6000: 7208.78 (SE +/- 12.59, N = 3)
12c DDR5-6000: 7171.45 (SE +/- 56.00, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 (Billion Interactions/s, More Is Better)
8c DDR5-6000: 288.35 (SE +/- 0.50, N = 3)
12c DDR5-6000: 286.86 (SE +/- 2.24, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
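
Some results in this file differ by less than their reported standard errors. One rough way to judge that is to divide the gap between the two means by the combined standard error, sqrt(SE_a^2 + SE_b^2). The sketch below applies this to the miniBUDE GFInst/s figures and, for contrast, to the HPCG 104 104 104 figures listed earlier. The helper names and the threshold of 2 are illustrative assumptions, not something the result file defines.

```python
import math


def gap_in_combined_se(mean_a, se_a, mean_b, se_b):
    """Return |mean_a - mean_b| in units of the combined standard error."""
    combined = math.hypot(se_a, se_b)  # sqrt(se_a**2 + se_b**2)
    if combined == 0:
        raise ValueError("both standard errors are zero")
    return abs(mean_a - mean_b) / combined


def looks_like_noise(mean_a, se_a, mean_b, se_b, threshold=2.0):
    """Heuristic: treat gaps under `threshold` combined SEs as noise.

    The threshold is an arbitrary illustrative choice.
    """
    return gap_in_combined_se(mean_a, se_a, mean_b, se_b) < threshold


if __name__ == "__main__":
    # miniBUDE BM2, GFInst/s: 8c vs. 12c
    print(looks_like_noise(7208.78, 12.59, 7171.45, 56.00))
    # HPCG 104 104 104, GFLOP/s: 8c vs. 12c
    print(looks_like_noise(41.32, 0.02, 62.54, 0.03))
```

With only N = 3 runs per configuration this is a coarse screen rather than a formal significance test.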

NAMD

Input: ATPase with 327,506 Atoms

NAMD 3.0 (ns/day, More Is Better)
8c DDR5-6000: 11.62 (SE +/- 0.03, N = 3)
12c DDR5-6000: 12.29 (SE +/- 0.03, N = 3)

NAMD

Input: STMV with 1,066,628 Atoms

NAMD 3.0 (ns/day, More Is Better)
8c DDR5-6000: 3.55953 (SE +/- 0.00613, N = 3)
12c DDR5-6000: 4.21717 (SE +/- 0.00356, N = 3)

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
8c DDR5-6000: 2085218000 (SE +/- 1073443.21, N = 3)
12c DDR5-6000: 3076286333 (SE +/- 7144725.85, N = 3)

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
8c DDR5-6000: 292.58 (SE +/- 3.79, N = 3)
12c DDR5-6000: 203.94 (SE +/- 2.76, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
8c DDR5-6000: 9.01416429 (SE +/- 0.05824182, N = 15)
12c DDR5-6000: 7.08492692 (SE +/- 0.06792326, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
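
For "Fewer Is Better" results the comparison is inverted relative to the throughput tests: the gain from 12 channels is the 8c time divided by the 12c time. A minimal sketch using the two Xcompact3d results above (the function names are hypothetical):

```python
def speedup(time_8c, time_12c):
    """Speedup of 12c over 8c for time-based (fewer is better) results."""
    if time_8c <= 0 or time_12c <= 0:
        raise ValueError("times must be positive")
    return time_8c / time_12c


def percent_time_saved(time_8c, time_12c):
    """Percentage of the 8c run time saved by the 12c configuration."""
    if time_8c <= 0:
        raise ValueError("time_8c must be positive")
    return 100.0 * (time_8c - time_12c) / time_8c


if __name__ == "__main__":
    # Xcompact3d Incompact3d, seconds, as (8c, 12c)
    cases = {
        "X3D-benchmarking input.i3d": (292.58, 203.94),
        "input.i3d 193 Cells Per Direction": (9.01416429, 7.08492692),
    }
    for name, (t8, t12) in cases.items():
        print(f"{name}: {speedup(t8, t12):.2f}x, "
              f"{percent_time_saved(t8, t12):.1f}% less time")
```

Dividing in the other direction would understate the gain, so it helps to keep the two metric directions in separate helpers.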

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
8c DDR5-6000: 22.30
12c DDR5-6000: 22.94
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
8c DDR5-6000: 22.65
12c DDR5-6000: 22.41
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
8c DDR5-6000: 109.20
12c DDR5-6000: 106.31
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
8c DDR5-6000: 254.30
12c DDR5-6000: 192.23
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenRadioss

Model: Chrysler Neon 1M

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
8c DDR5-6000: 129.17 (SE +/- 0.94, N = 11)
12c DDR5-6000: 90.77 (SE +/- 0.26, N = 3)

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
8c DDR5-6000: 79.99 (SE +/- 0.30, N = 3)
12c DDR5-6000: 79.42 (SE +/- 0.06, N = 3)

SPECFEM3D

Model: Mount St. Helens

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
8c DDR5-6000: 5.855350933 (SE +/- 0.034598254, N = 3)
12c DDR5-6000: 5.778714613 (SE +/- 0.055041694, N = 6)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Layered Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
8c DDR5-6000: 13.86 (SE +/- 0.11, N = 3)
12c DDR5-6000: 13.70 (SE +/- 0.07, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Tomographic Model

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
8c DDR5-6000: 6.626369356 (SE +/- 0.080343793, N = 3)
12c DDR5-6000: 6.777958300 (SE +/- 0.086144824, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Homogeneous Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
8c DDR5-6000: 8.198005837 (SE +/- 0.043840287, N = 3)
12c DDR5-6000: 8.194862498 (SE +/- 0.106640607, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Water-layered Halfspace

SPECFEM3D 4.1.1 (Seconds, Fewer Is Better)
8c DDR5-6000: 14.36 (SE +/- 0.07, N = 3)
12c DDR5-6000: 14.28 (SE +/- 0.08, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
8c DDR5-6000: 26209.9 (SE +/- 185.11, N = 12)
12c DDR5-6000: 25941.9 (SE +/- 311.92, N = 3)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
8c DDR5-6000: 6238.1 (SE +/- 0.12, N = 3; MIN: 3819.5 / MAX: 6238.3)
12c DDR5-6000: 6237.7 (SE +/- 0.38, N = 3; MIN: 3818.9 / MAX: 6238.4)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

John The Ripper

Test: bcrypt

John The Ripper 2023.03.14 (Real C/S, More Is Better)
8c DDR5-6000: 237055 (SE +/- 505.86, N = 3)
12c DDR5-6000: 237542 (SE +/- 33.49, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: WPA PSK

John The Ripper 2023.03.14 (Real C/S, More Is Better)
8c DDR5-6000: 1019333 (SE +/- 1666.67, N = 3)
12c DDR5-6000: 1023333 (SE +/- 2185.81, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: Blowfish

John The Ripper 2023.03.14 (Real C/S, More Is Better)
8c DDR5-6000: 237458 (SE +/- 16.64, N = 3)
12c DDR5-6000: 237515 (SE +/- 16.90, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: HMAC-SHA512

John The Ripper 2023.03.14 (Real C/S, More Is Better)
8c DDR5-6000: 413326333 (SE +/- 211622.57, N = 3)
12c DDR5-6000: 413474333 (SE +/- 2811379.81, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: MD5

John The Ripper 2023.03.14 (Real C/S, More Is Better)
8c DDR5-6000: 21408333 (SE +/- 32518.37, N = 3)
12c DDR5-6000: 21681000 (SE +/- 20808.65, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

Kvazaar 2.2 (Frames Per Second, More Is Better)
8c DDR5-6000: 44.76 (SE +/- 0.11, N = 3)
12c DDR5-6000: 44.79 (SE +/- 0.08, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

Kvazaar 2.2 (Frames Per Second, More Is Better)
8c DDR5-6000: 45.38 (SE +/- 0.09, N = 3)
12c DDR5-6000: 45.30 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.2 (Frames Per Second, More Is Better)
8c DDR5-6000: 89.33 (SE +/- 0.20, N = 3)
12c DDR5-6000: 90.46 (SE +/- 0.61, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Super Fast

Kvazaar 2.2 (Frames Per Second, More Is Better)
8c DDR5-6000: 93.40 (SE +/- 0.53, N = 3)
12c DDR5-6000: 91.54 (SE +/- 0.52, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Kvazaar 2.2 (Frames Per Second, More Is Better)
8c DDR5-6000: 96.72 (SE +/- 0.36, N = 3)
12c DDR5-6000: 96.07 (SE +/- 0.29, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

SVT-AV1

Encoder Mode: Preset 3 - Input: Bosphorus 4K

SVT-AV1 2.3 (Frames Per Second, More Is Better)
8c DDR5-6000: 15.60 (SE +/- 0.00, N = 3)
12c DDR5-6000: 15.68 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 5 - Input: Bosphorus 4K

SVT-AV1 2.3 (Frames Per Second, More Is Better)
8c DDR5-6000: 56.09 (SE +/- 0.10, N = 3)
12c DDR5-6000: 56.44 (SE +/- 0.28, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

SVT-AV1 2.3 (Frames Per Second, More Is Better)
8c DDR5-6000: 182.61 (SE +/- 0.19, N = 3)
12c DDR5-6000: 186.69 (SE +/- 1.50, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

SVT-AV1 2.3 (Frames Per Second, More Is Better)
8c DDR5-6000: 430.87 (SE +/- 0.57, N = 3)
12c DDR5-6000: 435.38 (SE +/- 1.17, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 3 - Input: Beauty 4K 10-bit

SVT-AV1 2.3 (Frames Per Second, More Is Better)
8c DDR5-6000: 1.846 (SE +/- 0.002, N = 3)
12c DDR5-6000: 1.852 (SE +/- 0.007, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit

SVT-AV1 2.3 (Frames Per Second, More Is Better)
8c DDR5-6000: 7.687 (SE +/- 0.026, N = 3)
12c DDR5-6000: 7.699 (SE +/- 0.039, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit

SVT-AV1 2.3 (Frames Per Second, More Is Better)
8c DDR5-6000: 13.51 (SE +/- 0.02, N = 3)
12c DDR5-6000: 13.55 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit

SVT-AV1 2.3 (Frames Per Second, More Is Better)
8c DDR5-6000: 17.11 (SE +/- 0.00, N = 3)
12c DDR5-6000: 17.15 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

uvg266

Video Input: Bosphorus 4K - Video Preset: Slow

uvg266 0.8.0 (Frames Per Second, More Is Better)
8c DDR5-6000: 30.30 (SE +/- 0.04, N = 3)
12c DDR5-6000: 30.40 (SE +/- 0.01, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

uvg266 0.8.0 (Frames Per Second, More Is Better)
8c DDR5-6000: 32.92 (SE +/- 0.02, N = 3)
12c DDR5-6000: 33.04 (SE +/- 0.06, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

uvg266 0.8.0 (Frames Per Second, More Is Better)
8c DDR5-6000: 66.12 (SE +/- 0.33, N = 3)
12c DDR5-6000: 67.24 (SE +/- 0.21, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

uvg266 0.8.0 (Frames Per Second, More Is Better)
8c DDR5-6000: 65.52 (SE +/- 0.32, N = 3)
12c DDR5-6000: 67.26 (SE +/- 0.11, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

uvg266 0.8.0 (Frames Per Second, More Is Better)
8c DDR5-6000: 66.86 (SE +/- 0.79, N = 3)
12c DDR5-6000: 67.65 (SE +/- 0.18, N = 3)

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

VVenC 1.11 (Frames Per Second, More Is Better)
8c DDR5-6000: 9.598 (SE +/- 0.064, N = 3)
12c DDR5-6000: 9.643 (SE +/- 0.082, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

VVenC 1.11 (Frames Per Second, More Is Better)
8c DDR5-6000: 22.64 (SE +/- 0.02, N = 3)
12c DDR5-6000: 22.96 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 512

Stargate Digital Audio Workstation 22.11.5 (Render Ratio, More Is Better)
8c DDR5-6000: 5.918595 (SE +/- 0.033577, N = 3)
12c DDR5-6000: 5.943979 (SE +/- 0.018441, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 512

Stargate Digital Audio Workstation 22.11.5 (Render Ratio, More Is Better)
8c DDR5-6000: 3.896311 (SE +/- 0.040462, N = 3)
12c DDR5-6000: 3.932179 (SE +/- 0.003414, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5 (Render Ratio, More Is Better)
8c DDR5-6000: 6.517349 (SE +/- 0.002866, N = 3)
12c DDR5-6000: 6.539322 (SE +/- 0.024623, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5 (Render Ratio, More Is Better)
8c DDR5-6000: 4.388974 (SE +/- 0.014254, N = 3)
12c DDR5-6000: 4.429570 (SE +/- 0.018753, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0 (Seconds, Fewer Is Better)
8c DDR5-6000: 80.87 (SE +/- 0.04, N = 3)
12c DDR5-6000: 80.55 (SE +/- 0.19, N = 3)

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
8c DDR5-6000: 22.56 (SE +/- 0.25, N = 4)
12c DDR5-6000: 22.22 (SE +/- 0.25, N = 4)

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
8c DDR5-6000: 196.24 (SE +/- 0.13, N = 3)
12c DDR5-6000: 193.35 (SE +/- 0.43, N = 3)

Build2

Time To Compile

Build2 0.17 (Seconds, Fewer Is Better)
8c DDR5-6000: 53.55 (SE +/- 0.16, N = 3)
12c DDR5-6000: 53.44 (SE +/- 0.01, N = 3)

oneDNN

Harness: IP Shapes 1D - Engine: CPU

oneDNN 3.6 (ms, Fewer Is Better)
8c DDR5-6000: 0.529052 (SE +/- 0.004111, N = 3; MIN: 0.47)
12c DDR5-6000: 0.531984 (SE +/- 0.003116, N = 3; MIN: 0.48)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: IP Shapes 3D - Engine: CPU

oneDNN 3.6 (ms, Fewer Is Better)
8c DDR5-6000: 0.265558 (SE +/- 0.000649, N = 3; MIN: 0.25)
12c DDR5-6000: 0.266791 (SE +/- 0.000721, N = 3; MIN: 0.25)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Engine: CPU

oneDNN 3.6 (ms, Fewer Is Better)
8c DDR5-6000: 0.341374 (SE +/- 0.000677, N = 3; MIN: 0.33)
12c DDR5-6000: 0.343593 (SE +/- 0.000927, N = 3; MIN: 0.32)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Engine: CPU

oneDNN 3.6 (ms, Fewer Is Better)
8c DDR5-6000: 6.84402 (SE +/- 0.01476, N = 3; MIN: 5.14)
12c DDR5-6000: 6.76736 (SE +/- 0.02120, N = 3; MIN: 4.07)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Engine: CPU

oneDNN 3.6 (ms, Fewer Is Better)
8c DDR5-6000: 0.715363 (SE +/- 0.001800, N = 3; MIN: 0.62)
12c DDR5-6000: 0.716373 (SE +/- 0.002304, N = 3; MIN: 0.62)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Recurrent Neural Network Training - Engine: CPU

oneDNN 3.6 (ms, Fewer Is Better)
8c DDR5-6000: 429.68 (SE +/- 0.28, N = 3; MIN: 423.83)
12c DDR5-6000: 425.78 (SE +/- 0.20, N = 3; MIN: 420.18)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Engine: CPU

oneDNN 3.6 (ms, Fewer Is Better)
8c DDR5-6000: 278.00 (SE +/- 0.67, N = 3; MIN: 271.73)
12c DDR5-6000: 277.64 (SE +/- 0.81, N = 3; MIN: 269.94)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 3214133333 (SE +/- 3636084.59, N = 3)
12c DDR5-6000: 3195900000 (SE +/- 1814754.35, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 5053200000 (SE +/- 8868107.65, N = 3)
12c DDR5-6000: 4963433333 (SE +/- 8326730.72, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 5955433333 (SE +/- 8738103.02, N = 3)
12c DDR5-6000: 5958600000 (SE +/- 9761830.43, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 1417633333 (SE +/- 2356079.61, N = 3)
12c DDR5-6000: 1412966667 (SE +/- 1419311.26, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 1956333333 (SE +/- 1072898.46, N = 3)
12c DDR5-6000: 1952100000 (SE +/- 6251666.44, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 2136766667 (SE +/- 821245.67, N = 3)
12c DDR5-6000: 2149200000 (SE +/- 1582192.57, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Graph500

Scale: 26

Graph500 3.0 (bfs median_TEPS, More Is Better)
8c DDR5-6000: 1365800000
12c DDR5-6000: 1478180000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0 (bfs max_TEPS, More Is Better)
8c DDR5-6000: 1394450000
12c DDR5-6000: 1505710000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0 (sssp median_TEPS, More Is Better)
8c DDR5-6000: 497798000
12c DDR5-6000: 554936000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0 (sssp max_TEPS, More Is Better)
8c DDR5-6000: 676709000
12c DDR5-6000: 747138000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024 (Ns Per Day, More Is Better)
8c DDR5-6000: 16.65 (SE +/- 0.03, N = 3)
12c DDR5-6000: 17.79 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Only

PostgreSQL 17 (TPS, More Is Better)
8c DDR5-6000: 3149277 (SE +/- 8342.05, N = 3)
12c DDR5-6000: 3155036 (SE +/- 1685.88, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average Latency

PostgreSQL 17 (ms, Fewer Is Better)
8c DDR5-6000: 0.254 (SE +/- 0.001, N = 3)
12c DDR5-6000: 0.254 (SE +/- 0.000, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Write

PostgreSQL 17 (TPS, More Is Better)
8c DDR5-6000: 112175 (SE +/- 836.14, N = 11)
12c DDR5-6000: 113745 (SE +/- 1257.14, N = 5)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Write - Average Latency

PostgreSQL 17 (ms, Fewer Is Better)
8c DDR5-6000: 7.136 (SE +/- 0.053, N = 11)
12c DDR5-6000: 7.037 (SE +/- 0.079, N = 5)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

TensorFlow

Device: CPU - Batch Size: 512 - Model: ResNet-50

TensorFlow 2.16.1 (images/sec, More Is Better)
8c DDR5-6000: 196.47 (SE +/- 0.27, N = 3)
12c DDR5-6000: 241.55 (SE +/- 0.11, N = 3)

RawTherapee

Total Benchmark Time

RawTherapee (Seconds, Fewer Is Better)
8c DDR5-6000: 36.23 (SE +/- 0.05, N = 3)
12c DDR5-6000: 35.85 (SE +/- 0.05, N = 3)
1. RawTherapee, version 5.10, command line.

GPAW

Input: Carbon Nanotube

GPAW 23.6 (Seconds, Fewer Is Better)
8c DDR5-6000: 28.56 (SE +/- 0.24, N = 3)
12c DDR5-6000: 25.66 (SE +/- 0.23, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
8c DDR5-6000: 13.09 (SE +/- 0.05, N = 3)
12c DDR5-6000: 13.14 (SE +/- 0.05, N = 3)

Blender

Blend File: Junkshop - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
8c DDR5-6000: 16.80 (SE +/- 0.02, N = 3)
12c DDR5-6000: 16.75 (SE +/- 0.01, N = 3)

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
8c DDR5-6000: 35.51 (SE +/- 0.08, N = 3)
12c DDR5-6000: 35.43 (SE +/- 0.04, N = 3)

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
8c DDR5-6000: 18.55 (SE +/- 0.03, N = 3)
12c DDR5-6000: 18.46 (SE +/- 0.05, N = 3)

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
8c DDR5-6000: 121.15 (SE +/- 0.12, N = 3)
12c DDR5-6000: 120.83 (SE +/- 0.11, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 4.2 (Seconds, Fewer Is Better)
8c DDR5-6000: 41.18 (SE +/- 0.15, N = 3)
12c DDR5-6000: 41.25 (SE +/- 0.14, N = 3)

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 449.23 (SE +/- 6.83, N = 15)
12c DDR5-6000: 689.50 (SE +/- 1.81, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 106.96 (SE +/- 1.46, N = 15; MIN: 37.41 / MAX: 380.85)
12c DDR5-6000: 69.48 (SE +/- 0.19, N = 3; MIN: 34.01 / MAX: 134.55)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 145.11 (SE +/- 0.12, N = 3)
12c DDR5-6000: 145.10 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 330.13 (SE +/- 0.26, N = 3; MIN: 257.44 / MAX: 357.54)
12c DDR5-6000: 330.01 (SE +/- 0.62, N = 3; MIN: 242.42 / MAX: 353.46)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 8240.98 (SE +/- 5.48, N = 3)
12c DDR5-6000: 8240.40 (SE +/- 10.65, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 5.80 (SE +/- 0.00, N = 3; MIN: 2.26 / MAX: 26.37)
12c DDR5-6000: 5.80 (SE +/- 0.01, N = 3; MIN: 2.41 / MAX: 27.54)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 23443.86 (SE +/- 11.10, N = 3)
12c DDR5-6000: 23427.50 (SE +/- 29.34, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 3.97 (SE +/- 0.00, N = 3; MIN: 1.61 / MAX: 20.64)
12c DDR5-6000: 3.96 (SE +/- 0.01, N = 3; MIN: 1.68 / MAX: 20.25)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 2291.09 (SE +/- 2.70, N = 3)
12c DDR5-6000: 2437.93 (SE +/- 1.15, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 20.85 (SE +/- 0.02, N = 3; MIN: 10.51 / MAX: 40.51)
12c DDR5-6000: 19.61 (SE +/- 0.01, N = 3; MIN: 10.37 / MAX: 41.25)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 740.13 (SE +/- 4.52, N = 3)
12c DDR5-6000: 844.93 (SE +/- 1.30, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 64.75 (SE +/- 0.40, N = 3; MIN: 28.81 / MAX: 99.11)
12c DDR5-6000: 56.71 (SE +/- 0.09, N = 3; MIN: 34.97 / MAX: 96.61)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 14026.36 (SE +/- 1.42, N = 3)
12c DDR5-6000: 14006.15 (SE +/- 11.52, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 6.70 (SE +/- 0.00, N = 3; MIN: 2.45 / MAX: 24)
12c DDR5-6000: 6.71 (SE +/- 0.01, N = 3; MIN: 2.58 / MAX: 23.11)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 6329.60 (SE +/- 17.72, N = 3)
12c DDR5-6000: 6380.58 (SE +/- 12.38, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 7.53 (SE +/- 0.02, N = 3; MIN: 4.35 / MAX: 22.92)
12c DDR5-6000: 7.48 (SE +/- 0.01, N = 3; MIN: 4.42 / MAX: 23.35)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 7314.92 (SE +/- 10.57, N = 3)
12c DDR5-6000: 7934.40 (SE +/- 20.16, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 12.42 (SE +/- 0.02, N = 3; MIN: 6.11 / MAX: 28.55)
12c DDR5-6000: 11.57 (SE +/- 0.03, N = 3; MIN: 6.95 / MAX: 30.79)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 10448.92 (SE +/- 7.25, N = 3)
12c DDR5-6000: 10462.56 (SE +/- 5.49, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 4.56 (SE +/- 0.01, N = 3; MIN: 2.23 / MAX: 20.98)
12c DDR5-6000: 4.56 (SE +/- 0.00, N = 3; MIN: 2.45 / MAX: 18.95)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 3543.79 (SE +/- 1.42, N = 3)
12c DDR5-6000: 3551.35 (SE +/- 4.91, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 27.06 (SE +/- 0.01, N = 3; MIN: 16.36 / MAX: 45.56)
12c DDR5-6000: 27.01 (SE +/- 0.04, N = 3; MIN: 16.35 / MAX: 47.96)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
8c DDR5-6000: 166363.91 (SE +/- 234.32, N = 3)
12c DDR5-6000: 166983.33 (SE +/- 251.97, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
8c DDR5-6000: 0.33 (SE +/- 0.00, N = 3; MIN: 0.13 / MAX: 24.63)
12c DDR5-6000: 0.33 (SE +/- 0.00, N = 3; MIN: 0.12 / MAX: 23.43)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

nginx

Connections: 500

nginx 1.23.2 (Requests Per Second, More Is Better)
8c DDR5-6000: 523766.78 (SE +/- 3145.55, N = 3)
12c DDR5-6000: 505484.86 (SE +/- 328.79, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 500

Apache HTTP Server 2.4.56 (Requests Per Second, More Is Better)
8c DDR5-6000: 159864.99 (SE +/- 177.80, N = 3)
12c DDR5-6000: 170467.37 (SE +/- 413.18, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Whisperfile

Model Size: Small

Whisperfile 20Aug24 (Seconds, Fewer Is Better)
8c DDR5-6000: 96.69 (SE +/- 0.65, N = 3)
12c DDR5-6000: 91.05 (SE +/- 0.71, N = 3)

Whisperfile

Model Size: Medium

Whisperfile 20Aug24 (Seconds, Fewer Is Better)
8c DDR5-6000: 215.52 (SE +/- 0.69, N = 3)
12c DDR5-6000: 202.16 (SE +/- 0.37, N = 3)

Epoch

Epoch3D Deck: Cone

Epoch 4.19.4 (Seconds, Fewer Is Better)
8c DDR5-6000: 196.03 (SE +/- 0.52, N = 3)
12c DDR5-6000: 186.85 (SE +/- 0.55, N = 3)
1. (F9X) gfortran options: -O3 -std=f2003 -Jobj -lsdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

WarpX

Input: Uniform Plasma

WarpX 24.10 (Seconds, Fewer Is Better)
8c DDR5-6000: 16.45 (SE +/- 0.24, N = 12)
12c DDR5-6000: 16.27 (SE +/- 0.23, N = 15)
1. (CXX) g++ options: -O3 -lm

WarpX

Input: Plasma Acceleration

WarpX 24.10 (Seconds, Fewer Is Better)
8c DDR5-6000: 25.76 (SE +/- 0.17, N = 3)
12c DDR5-6000: 25.85 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -O3 -lm

LuxCoreRender

Scene: DLSC - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
8c DDR5-6000: 17.01 (SE +/- 0.21, N = 15; MIN: 15.67 / MAX: 20.15)
12c DDR5-6000: 16.49 (SE +/- 0.19, N = 3; MIN: 15.79 / MAX: 20.03)

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
8c DDR5-6000: 12.06 (SE +/- 0.11, N = 15; MIN: 5.76 / MAX: 14.38)
12c DDR5-6000: 12.03 (SE +/- 0.09, N = 15; MIN: 5.72 / MAX: 14.2)

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
8c DDR5-6000: 24.15 (SE +/- 0.09, N = 3; MIN: 21.4 / MAX: 32.92)
12c DDR5-6000: 24.19 (SE +/- 0.09, N = 3; MIN: 21.48 / MAX: 33.13)

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
8c DDR5-6000: 12.92 (SE +/- 0.17, N = 3; MIN: 6.3 / MAX: 15.04)
12c DDR5-6000: 12.39 (SE +/- 0.12, N = 15; MIN: 5.62 / MAX: 14.88)

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
8c DDR5-6000: 27.85 (SE +/- 0.02, N = 3; MIN: 24.8 / MAX: 28.48)
12c DDR5-6000: 28.23 (SE +/- 0.19, N = 3; MIN: 25.05 / MAX: 28.69)

OpenVKL

Benchmark: vklBenchmarkCPU ISPC

OpenVKL 2.0.0 (Items / Sec, More Is Better)
8c DDR5-6000: 2762 (SE +/- 1.53, N = 3; MIN: 216 / MAX: 33882)
12c DDR5-6000: 2825 (SE +/- 1.76, N = 3; MIN: 217 / MAX: 36373)

7-Zip Compression

Test: Compression Rating

7-Zip Compression 24.05 (MIPS, More Is Better)
8c DDR5-6000: 727045 (SE +/- 1850.34, N = 3)
12c DDR5-6000: 779242 (SE +/- 879.82, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 24.05 (MIPS, More Is Better)
8c DDR5-6000: 626553 (SE +/- 340.29, N = 3)
12c DDR5-6000: 627215 (SE +/- 169.10, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

m-queens

Time To Solve

m-queens 1.2 (Seconds, Fewer Is Better)
8c DDR5-6000: 7.229 (SE +/- 0.010, N = 3)
12c DDR5-6000: 7.214 (SE +/- 0.047, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
8c DDR5-6000: 637.67 (SE +/- 1.98, N = 3; MIN: 67.95 / MAX: 7500)
12c DDR5-6000: 740.15 (SE +/- 3.26, N = 3; MIN: 69.2 / MAX: 7500)

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
8c DDR5-6000: 652.53 (SE +/- 0.96, N = 3; MIN: 69.28 / MAX: 7500)
12c DDR5-6000: 771.05 (SE +/- 2.95, N = 3; MIN: 69.77 / MAX: 8571.43)

ClickHouse

100M Rows Hits Dataset, Third Run

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
8c DDR5-6000: 651.48 (SE +/- 1.81, N = 3; MIN: 70.92 / MAX: 6666.67)
12c DDR5-6000: 765.07 (SE +/- 4.74, N = 3; MIN: 69.77 / MAX: 8571.43)

Memcached

Set To Get Ratio: 1:5

Memcached 1.6.19 (Ops/sec, More Is Better)
8c DDR5-6000: 3711825.75 (SE +/- 3725.22, N = 3)
12c DDR5-6000: 3701189.23 (SE +/- 16589.87, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:10

Memcached 1.6.19 (Ops/sec, More Is Better)
8c DDR5-6000: 6752055.12 (SE +/- 5728.08, N = 3)
12c DDR5-6000: 6670118.94 (SE +/- 23004.60, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.19 (Ops/sec, More Is Better)
8c DDR5-6000: 12031156.03 (SE +/- 48609.95, N = 3)
12c DDR5-6000: 11923788.43 (SE +/- 20796.91, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

ASTC Encoder

Preset: Fast

ASTC Encoder 5.0 (MT/s, More Is Better)
8c DDR5-6000: 1072.78 (SE +/- 7.37, N = 13)
12c DDR5-6000: 1109.95 (SE +/- 14.18, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Medium

ASTC Encoder 5.0 (MT/s, More Is Better)
8c DDR5-6000: 568.34 (SE +/- 0.83, N = 3)
12c DDR5-6000: 568.83 (SE +/- 0.76, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 5.0 (MT/s, More Is Better)
8c DDR5-6000: 82.19 (SE +/- 0.04, N = 3)
12c DDR5-6000: 82.28 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 5.0, Preset: Exhaustive
MT/s, More Is Better
8c DDR5-6000: 7.1339 (SE +/- 0.0004, N = 3)
12c DDR5-6000: 7.1334 (SE +/- 0.0018, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Very Thorough

ASTC Encoder 5.0, Preset: Very Thorough
MT/s, More Is Better
8c DDR5-6000: 11.63 (SE +/- 0.00, N = 3)
12c DDR5-6000: 11.63 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

LiteRT

Model: DeepLab V3

LiteRT 2024-10-15, Model: DeepLab V3
Microseconds, Fewer Is Better
8c DDR5-6000: 11817.9 (SE +/- 140.63, N = 15)
12c DDR5-6000: 11450.9 (SE +/- 138.74, N = 3)

LiteRT

Model: SqueezeNet

LiteRT 2024-10-15, Model: SqueezeNet
Microseconds, Fewer Is Better
8c DDR5-6000: 7048.39 (SE +/- 27.86, N = 3)
12c DDR5-6000: 7051.59 (SE +/- 17.70, N = 3)

LiteRT

Model: Inception V4

LiteRT 2024-10-15, Model: Inception V4
Microseconds, Fewer Is Better
8c DDR5-6000: 43959.9 (SE +/- 438.48, N = 6)
12c DDR5-6000: 44088.3 (SE +/- 38.95, N = 3)

LiteRT

Model: NASNet Mobile

LiteRT 2024-10-15, Model: NASNet Mobile
Microseconds, Fewer Is Better
8c DDR5-6000: 100519.4 (SE +/- 1096.54, N = 15)
12c DDR5-6000: 99408.3 (SE +/- 1271.15, N = 3)

LiteRT

Model: Mobilenet Float

LiteRT 2024-10-15, Model: Mobilenet Float
Microseconds, Fewer Is Better
8c DDR5-6000: 4441.27 (SE +/- 18.31, N = 3)
12c DDR5-6000: 4399.48 (SE +/- 16.88, N = 3)

LiteRT

Model: Mobilenet Quant

LiteRT 2024-10-15, Model: Mobilenet Quant
Microseconds, Fewer Is Better
8c DDR5-6000: 6260.21 (SE +/- 155.36, N = 15)
12c DDR5-6000: 5800.65 (SE +/- 109.66, N = 15)

LiteRT

Model: Inception ResNet V2

LiteRT 2024-10-15, Model: Inception ResNet V2
Microseconds, Fewer Is Better
8c DDR5-6000: 59977.8 (SE +/- 811.71, N = 3)
12c DDR5-6000: 58054.3 (SE +/- 598.16, N = 3)

LiteRT

Model: Quantized COCO SSD MobileNet v1

LiteRT 2024-10-15, Model: Quantized COCO SSD MobileNet v1
Microseconds, Fewer Is Better
8c DDR5-6000: 7110.12 (SE +/- 68.84, N = 15)
12c DDR5-6000: 7211.99 (SE +/- 28.16, N = 3)

XNNPACK

Model: FP32MobileNetV1

XNNPACK b7b048, Model: FP32MobileNetV1
us, Fewer Is Better
8c DDR5-6000: 4596 (SE +/- 41.29, N = 3)
12c DDR5-6000: 4629 (SE +/- 57.47, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

XNNPACK b7b048, Model: FP32MobileNetV2
us, Fewer Is Better
8c DDR5-6000: 9236 (SE +/- 46.71, N = 3)
12c DDR5-6000: 9242 (SE +/- 73.61, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

XNNPACK b7b048, Model: FP32MobileNetV3Large
us, Fewer Is Better
8c DDR5-6000: 14189 (SE +/- 140.55, N = 3)
12c DDR5-6000: 13643 (SE +/- 63.76, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

XNNPACK b7b048, Model: FP32MobileNetV3Small
us, Fewer Is Better
8c DDR5-6000: 10534 (SE +/- 8.95, N = 3)
12c DDR5-6000: 10418 (SE +/- 54.11, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV1

XNNPACK b7b048, Model: FP16MobileNetV1
us, Fewer Is Better
8c DDR5-6000: 4570 (SE +/- 11.46, N = 3)
12c DDR5-6000: 4611 (SE +/- 43.59, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

XNNPACK b7b048, Model: FP16MobileNetV2
us, Fewer Is Better
8c DDR5-6000: 9094 (SE +/- 49.08, N = 3)
12c DDR5-6000: 8982 (SE +/- 96.09, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

XNNPACK b7b048, Model: FP16MobileNetV3Large
us, Fewer Is Better
8c DDR5-6000: 13257 (SE +/- 75.41, N = 3)
12c DDR5-6000: 13074 (SE +/- 58.09, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

XNNPACK b7b048, Model: FP16MobileNetV3Small
us, Fewer Is Better
8c DDR5-6000: 10810 (SE +/- 162.45, N = 3)
12c DDR5-6000: 10559 (SE +/- 181.63, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QS8MobileNetV2

XNNPACK b7b048, Model: QS8MobileNetV2
us, Fewer Is Better
8c DDR5-6000: 9968 (SE +/- 103.51, N = 3)
12c DDR5-6000: 10095 (SE +/- 60.71, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

Apache Cassandra

Test: Writes

Apache Cassandra 5.0, Test: Writes
Op/s, More Is Better
8c DDR5-6000: 422312 (SE +/- 1303.09, N = 3)
12c DDR5-6000: 447176 (SE +/- 2454.69, N = 3)

Chaos Group V-RAY

Mode: CPU

Chaos Group V-RAY 6.0, Mode: CPU
vsamples, More Is Better
8c DDR5-6000: 198145 (SE +/- 357.23, N = 3)
12c DDR5-6000: 199769 (SE +/- 252.42, N = 3)

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: GPT-2 - Device: CPU - Executor: Standard
Inferences Per Second, More Is Better
8c DDR5-6000: 187.13 (SE +/- 2.26, N = 3)
12c DDR5-6000: 195.28 (SE +/- 1.29, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: GPT-2 - Device: CPU - Executor: Standard
Inference Time Cost (ms), Fewer Is Better
8c DDR5-6000: 5.34359 (SE +/- 0.06457, N = 3)
12c DDR5-6000: 5.11928 (SE +/- 0.03369, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: yolov4 - Device: CPU - Executor: Standard
Inferences Per Second, More Is Better
8c DDR5-6000: 12.06 (SE +/- 0.16, N = 3)
12c DDR5-6000: 12.21 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: yolov4 - Device: CPU - Executor: Standard
Inference Time Cost (ms), Fewer Is Better
8c DDR5-6000: 82.94 (SE +/- 1.14, N = 3)
12c DDR5-6000: 81.88 (SE +/- 0.85, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ZFNet-512 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: ZFNet-512 - Device: CPU - Executor: Standard
Inferences Per Second, More Is Better
8c DDR5-6000: 153.64 (SE +/- 0.66, N = 3)
12c DDR5-6000: 163.98 (SE +/- 0.19, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ZFNet-512 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: ZFNet-512 - Device: CPU - Executor: Standard
Inference Time Cost (ms), Fewer Is Better
8c DDR5-6000: 6.50761 (SE +/- 0.02791, N = 3)
12c DDR5-6000: 6.09718 (SE +/- 0.00692, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: T5 Encoder - Device: CPU - Executor: Standard
Inferences Per Second, More Is Better
8c DDR5-6000: 235.60 (SE +/- 0.88, N = 3)
12c DDR5-6000: 243.33 (SE +/- 0.36, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: T5 Encoder - Device: CPU - Executor: Standard
Inference Time Cost (ms), Fewer Is Better
8c DDR5-6000: 4.24372 (SE +/- 0.01572, N = 3)
12c DDR5-6000: 4.10880 (SE +/- 0.00593, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: bertsquad-12 - Device: CPU - Executor: Standard
Inferences Per Second, More Is Better
8c DDR5-6000: 21.65 (SE +/- 0.23, N = 3)
12c DDR5-6000: 22.02 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: bertsquad-12 - Device: CPU - Executor: Standard
Inference Time Cost (ms), Fewer Is Better
8c DDR5-6000: 46.20 (SE +/- 0.48, N = 3)
12c DDR5-6000: 45.41 (SE +/- 0.24, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard
Inferences Per Second, More Is Better
8c DDR5-6000: 958.58 (SE +/- 17.42, N = 15)
12c DDR5-6000: 1041.95 (SE +/- 11.56, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard
Inference Time Cost (ms), Fewer Is Better
8c DDR5-6000: 1.047505 (SE +/- 0.018573, N = 15)
12c DDR5-6000: 0.959531 (SE +/- 0.010778, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: fcn-resnet101-11 - Device: CPU - Executor: Standard
Inferences Per Second, More Is Better
8c DDR5-6000: 9.18611 (SE +/- 0.08964, N = 15)
12c DDR5-6000: 9.44076 (SE +/- 0.13237, N = 15)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Model: fcn-resnet101-11 - Device: CPU - Executor: Standard
Inference Time Cost (ms), Fewer Is Better
8c DDR5-6000: 109.01 (SE +/- 1.08, N = 15)
12c DDR5-6000: 106.21 (SE +/- 1.49, N = 15)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
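
The results above mix "More Is Better" and "Fewer Is Better" metrics, which makes the two memory configurations awkward to compare at a glance. The following is a minimal Python sketch, not part of the Phoronix Test Suite, that turns a pair of results into a signed percentage where positive always means the 12-channel run was better. The function name `relative_gain`, the `results` list layout, and the choice to express "Fewer Is Better" metrics as a throughput-style ratio (baseline divided by candidate) are all assumptions made for illustration. The numbers are copied from four of the results in this section.

```python
def relative_gain(baseline: float, candidate: float, higher_is_better: bool = True) -> float:
    """Percent improvement of candidate over baseline; positive means candidate is better.

    For "Fewer Is Better" metrics (times), the ratio is inverted so the sign
    keeps the same meaning. This convention is an assumption of this sketch.
    """
    if higher_is_better:
        return (candidate / baseline - 1.0) * 100.0
    return (baseline / candidate - 1.0) * 100.0


# (label, 8c DDR5-6000 value, 12c DDR5-6000 value, higher_is_better)
results = [
    ("ClickHouse, Third Run (queries/min)", 651.48, 765.07, True),
    ("Apache Cassandra Writes (op/s)", 422312, 447176, True),
    ("ONNX Runtime ZFNet-512 (inferences/s)", 153.64, 163.98, True),
    ("LiteRT Mobilenet Quant (microseconds)", 6260.21, 5800.65, False),
]

for name, eight_ch, twelve_ch, higher_is_better in results:
    gain = relative_gain(eight_ch, twelve_ch, higher_is_better)
    print(f"{name}: {gain:+.1f}% for 12c vs. 8c")
```

A percentage like this ignores run-to-run noise, so differences that are small relative to the reported standard errors (for example the Memcached and ASTC Encoder Medium results) are best read as no measurable change.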


Phoronix Test Suite v10.8.5