AMD EPYC Turin 8c vs. 12c Memory Channel DDR5 Comparison

AMD EPYC 9655P memory benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2411164-NE-AMDEPYCTU56&grw&sor.

Processor: AMD EPYC 9655P 96-Core @ 2.60GHz (96 Cores / 192 Threads)
Motherboard: Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)
Chipset: AMD 1Ah
Memory (8c DDR5-6000): 8 x 64GB DDR5-6000MT/s
Memory (12c DDR5-6000): 12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF
Disk: 3201GB Micron_7450_MTFDKCB3T2TFS
Graphics: ASPEED
Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 24.10
Kernel: 6.12.0-rc7-phx (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 1024x768

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116

Java Details: 8c DDR5-6000: OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)

Python Details: Python 3.12.7

Security Details: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
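For context on the two memory configurations, the theoretical peak DRAM bandwidth can be estimated from the transfer rate and the channel count. The sketch below assumes a 64-bit (8-byte) data path per DDR5 channel, which is not stated in the result file; the function name `peak_bandwidth_gbs` is illustrative only.

```python
def peak_bandwidth_gbs(mt_per_s: int, channels: int, bytes_per_transfer: int = 8) -> float:
    """Theoretical peak bandwidth in decimal GB/s.

    Assumption: each channel moves 8 bytes per transfer (64-bit data path).
    """
    return mt_per_s * 1e6 * bytes_per_transfer * channels / 1e9


# DDR5-6000 with 8 and 12 populated channels, as in this comparison.
for channels in (8, 12):
    print(f"{channels} channels of DDR5-6000: {peak_bandwidth_gbs(6000, channels):.0f} GB/s")
```

Measured throughput in the memory tests sits below these ceilings, as expected for sustained bandwidth.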

Result overview table: 8c DDR5-6000 vs. 12c DDR5-6000 for every test in this comparison. The individual results are listed per test below.

MBW

Test: Memory Copy - Array Size: 4096 MiB

MBW 2018-09-08, MiB/s (more is better)
12c DDR5-6000: 25160.63 (SE +/- 22.33, N = 3)
8c DDR5-6000: 23448.63 (SE +/- 26.98, N = 3)
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy - Array Size: 8192 MiB

MBW 2018-09-08, MiB/s (more is better)
12c DDR5-6000: 25126.49 (SE +/- 31.99, N = 3)
8c DDR5-6000: 23357.89 (SE +/- 19.02, N = 3)
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB

MBW 2018-09-08, MiB/s (more is better)
12c DDR5-6000: 22520.67 (SE +/- 55.98, N = 3)
8c DDR5-6000: 18332.41 (SE +/- 54.43, N = 3)
1. (CC) gcc options: -O3 -march=native

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

MBW 2018-09-08, MiB/s (more is better)
12c DDR5-6000: 22449.83 (SE +/- 64.32, N = 3)
8c DDR5-6000: 18188.11 (SE +/- 60.40, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Integer

RAMspeed SMP 3.5.0, MB/s (more is better)
12c DDR5-6000: 120526.51 (SE +/- 235.46, N = 3)
8c DDR5-6000: 99666.50 (SE +/- 1117.41, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

RAMspeed SMP 3.5.0, MB/s (more is better)
12c DDR5-6000: 139450.46 (SE +/- 266.61, N = 3)
8c DDR5-6000: 115580.47 (SE +/- 779.65, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

RAMspeed SMP 3.5.0, MB/s (more is better)
12c DDR5-6000: 139548.85 (SE +/- 283.77, N = 3)
8c DDR5-6000: 117864.04 (SE +/- 291.38, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

RAMspeed SMP 3.5.0, MB/s (more is better)
12c DDR5-6000: 122052.34 (SE +/- 253.90, N = 3)
8c DDR5-6000: 104337.42 (SE +/- 795.56, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

RAMspeed SMP 3.5.0, MB/s (more is better)
12c DDR5-6000: 130318.77 (SE +/- 252.96, N = 3)
8c DDR5-6000: 110436.65 (SE +/- 807.29, N = 3)
1. (CC) gcc options: -O3 -march=native

Stream

Type: Copy

Stream 2013-01-17, MB/s (more is better)
12c DDR5-6000: 343229.3 (SE +/- 570.76, N = 5)
8c DDR5-6000: 221234.4 (SE +/- 357.86, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Scale

Stream 2013-01-17, MB/s (more is better)
12c DDR5-6000: 314946.5 (SE +/- 478.12, N = 5)
8c DDR5-6000: 204677.8 (SE +/- 630.12, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Triad

Stream 2013-01-17, MB/s (more is better)
12c DDR5-6000: 348347.3 (SE +/- 1895.03, N = 5)
8c DDR5-6000: 229604.0 (SE +/- 936.98, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Add

Stream 2013-01-17, MB/s (more is better)
12c DDR5-6000: 355858.4 (SE +/- 1639.97, N = 5)
8c DDR5-6000: 233276.2 (SE +/- 1797.63, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
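To put the four Stream results side by side, the relative gain of the 12-channel configuration over the 8-channel one can be computed directly. The following is a minimal sketch using the Stream values reported above; the helper name `uplift_percent` and the dictionary layout are illustrative choices, not part of the Phoronix Test Suite.

```python
# Stream results in MB/s, copied from the results above: (8c DDR5-6000, 12c DDR5-6000)
stream = {
    "Copy": (221234.4, 343229.3),
    "Scale": (204677.8, 314946.5),
    "Triad": (229604.0, 348347.3),
    "Add": (233276.2, 355858.4),
}


def uplift_percent(base: float, new: float) -> float:
    """Percentage gain of `new` over `base` for a more-is-better metric."""
    return (new / base - 1.0) * 100.0


for name, (c8, c12) in stream.items():
    print(f"Stream {name}: {uplift_percent(c8, c12):+.1f}% with 12 channels")
```

For reference, going from 8 to 12 channels adds 50% more channels.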

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 512

Stargate Digital Audio Workstation 22.11.5, Render Ratio (more is better)
12c DDR5-6000: 5.943979 (SE +/- 0.018441, N = 3)
8c DDR5-6000: 5.918595 (SE +/- 0.033577, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 512

Stargate Digital Audio Workstation 22.11.5, Render Ratio (more is better)
12c DDR5-6000: 3.932179 (SE +/- 0.003414, N = 3)
8c DDR5-6000: 3.896311 (SE +/- 0.040462, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5, Render Ratio (more is better)
12c DDR5-6000: 6.539322 (SE +/- 0.024623, N = 3)
8c DDR5-6000: 6.517349 (SE +/- 0.002866, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5, Render Ratio (more is better)
12c DDR5-6000: 4.429570 (SE +/- 0.018753, N = 3)
8c DDR5-6000: 4.388974 (SE +/- 0.014254, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

ASTC Encoder

Preset: Fast

ASTC Encoder 5.0, MT/s (more is better)
12c DDR5-6000: 1109.95 (SE +/- 14.18, N = 3)
8c DDR5-6000: 1072.78 (SE +/- 7.37, N = 13)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Medium

ASTC Encoder 5.0, MT/s (more is better)
12c DDR5-6000: 568.83 (SE +/- 0.76, N = 3)
8c DDR5-6000: 568.34 (SE +/- 0.83, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 5.0, MT/s (more is better)
12c DDR5-6000: 82.28 (SE +/- 0.02, N = 3)
8c DDR5-6000: 82.19 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 5.0, MT/s (more is better)
8c DDR5-6000: 7.1339 (SE +/- 0.0004, N = 3)
12c DDR5-6000: 7.1334 (SE +/- 0.0018, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Very Thorough

ASTC Encoder 5.0, MT/s (more is better)
8c DDR5-6000: 11.63 (SE +/- 0.00, N = 3)
12c DDR5-6000: 11.63 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread
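Several ASTC Encoder results differ by less than their reported standard errors. A rough way to flag such cases is sketched below. The two-sigma threshold and the helper name `within_noise` are assumptions made for illustration; they are not part of the Phoronix Test Suite output, and a proper significance test would also account for the small run counts.

```python
import math


def within_noise(a: float, se_a: float, b: float, se_b: float, k: float = 2.0) -> bool:
    """Return True when |a - b| is no larger than k combined standard errors.

    Assumption: the two results are independent, so their standard errors
    combine in quadrature.
    """
    combined_se = math.hypot(se_a, se_b)
    return abs(a - b) <= k * combined_se


# ASTC Encoder, Preset: Exhaustive (8c vs. 12c), values from the results above.
print(within_noise(7.1339, 0.0004, 7.1334, 0.0018))
# ASTC Encoder, Preset: Fast (12c vs. 8c), values from the results above.
print(within_noise(1109.95, 14.18, 1072.78, 7.37))
```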

RawTherapee

Total Benchmark Time

RawTherapee, Seconds (fewer is better)
12c DDR5-6000: 35.85 (SE +/- 0.05, N = 3)
8c DDR5-6000: 36.23 (SE +/- 0.05, N = 3)
1. RawTherapee, version 5.10, command line.

Graph500

Scale: 26

Graph500 3.0, bfs median_TEPS (more is better)
12c DDR5-6000: 1478180000
8c DDR5-6000: 1365800000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0, bfs max_TEPS (more is better)
12c DDR5-6000: 1505710000
8c DDR5-6000: 1394450000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0, sssp median_TEPS (more is better)
12c DDR5-6000: 554936000
8c DDR5-6000: 497798000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0, sssp max_TEPS (more is better)
12c DDR5-6000: 747138000
8c DDR5-6000: 676709000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901, GFInst/s (more is better)
8c DDR5-6000: 7208.78 (SE +/- 12.59, N = 3)
12c DDR5-6000: 7171.45 (SE +/- 56.00, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901, Billion Interactions/s (more is better)
8c DDR5-6000: 288.35 (SE +/- 0.50, N = 3)
12c DDR5-6000: 286.86 (SE +/- 2.24, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenRadioss

Model: Chrysler Neon 1M

OpenRadioss 2023.09.15, Seconds (fewer is better)
12c DDR5-6000: 90.77 (SE +/- 0.26, N = 3)
8c DDR5-6000: 129.17 (SE +/- 0.94, N = 11)

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenRadioss 2023.09.15, Seconds (fewer is better)
12c DDR5-6000: 79.42 (SE +/- 0.06, N = 3)
8c DDR5-6000: 79.99 (SE +/- 0.30, N = 3)
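For time-based results such as OpenRadioss, where fewer seconds is better, the speedup is the baseline time divided by the new time rather than the other way around. A small sketch using the two OpenRadioss results above follows; `speedup_from_times` is a hypothetical helper name.

```python
def speedup_from_times(base_seconds: float, new_seconds: float) -> float:
    """Speedup factor for a fewer-is-better metric (values above 1.0 mean faster)."""
    return base_seconds / new_seconds


# OpenRadioss run times in seconds, from the results above: (8c, 12c)
runs = {
    "Chrysler Neon 1M": (129.17, 90.77),
    "INIVOL and Fluid Structure Interaction Drop Container": (79.99, 79.42),
}

for model, (t8, t12) in runs.items():
    print(f"{model}: {speedup_from_times(t8, t12):.2f}x with 12 channels")
```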

Whisperfile

Model Size: Small

Whisperfile 20Aug24, Seconds (fewer is better)
12c DDR5-6000: 91.05 (SE +/- 0.71, N = 3)
8c DDR5-6000: 96.69 (SE +/- 0.65, N = 3)

Whisperfile

Model Size: Medium

Whisperfile 20Aug24, Seconds (fewer is better)
12c DDR5-6000: 202.16 (SE +/- 0.37, N = 3)
8c DDR5-6000: 215.52 (SE +/- 0.69, N = 3)

TensorFlow

Device: CPU - Batch Size: 512 - Model: ResNet-50

TensorFlow 2.16.1, images/sec (more is better)
12c DDR5-6000: 241.55 (SE +/- 0.11, N = 3)
8c DDR5-6000: 196.47 (SE +/- 0.27, N = 3)

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Inferences Per Second (more is better)
12c DDR5-6000: 195.28 (SE +/- 1.29, N = 3)
8c DDR5-6000: 187.13 (SE +/- 2.26, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

SPECFEM3D

Model: Tomographic Model

SPECFEM3D 4.1.1, Seconds (fewer is better)
8c DDR5-6000: 6.626369356 (SE +/- 0.080343793, N = 3)
12c DDR5-6000: 6.777958300 (SE +/- 0.086144824, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Homogeneous Halfspace

SPECFEM3D 4.1.1, Seconds (fewer is better)
12c DDR5-6000: 8.194862498 (SE +/- 0.106640607, N = 3)
8c DDR5-6000: 8.198005837 (SE +/- 0.043840287, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Mount St. Helens

SPECFEM3D 4.1.1, Seconds (fewer is better)
12c DDR5-6000: 5.778714613 (SE +/- 0.055041694, N = 6)
8c DDR5-6000: 5.855350933 (SE +/- 0.034598254, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Layered Halfspace

SPECFEM3D 4.1.1, Seconds (fewer is better)
12c DDR5-6000: 13.70 (SE +/- 0.07, N = 3)
8c DDR5-6000: 13.86 (SE +/- 0.11, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Inferences Per Second (more is better)
12c DDR5-6000: 12.21 (SE +/- 0.13, N = 3)
8c DDR5-6000: 12.06 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ZFNet-512 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Inferences Per Second (more is better)
12c DDR5-6000: 163.98 (SE +/- 0.19, N = 3)
8c DDR5-6000: 153.64 (SE +/- 0.66, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Inferences Per Second (more is better)
12c DDR5-6000: 243.33 (SE +/- 0.36, N = 3)
8c DDR5-6000: 235.60 (SE +/- 0.88, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Inferences Per Second (more is better)
12c DDR5-6000: 22.02 (SE +/- 0.12, N = 3)
8c DDR5-6000: 21.65 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

SPECFEM3D

Model: Water-layered Halfspace

SPECFEM3D 4.1.1, Seconds (fewer is better)
12c DDR5-6000: 14.28 (SE +/- 0.08, N = 3)
8c DDR5-6000: 14.36 (SE +/- 0.07, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Inferences Per Second (more is better)
12c DDR5-6000: 1041.95 (SE +/- 11.56, N = 3)
8c DDR5-6000: 958.58 (SE +/- 17.42, N = 15)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime 1.19, Inferences Per Second (more is better)
12c DDR5-6000: 9.44076 (SE +/- 0.13237, N = 15)
8c DDR5-6000: 9.18611 (SE +/- 0.08964, N = 15)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

XNNPACK

Model: FP32MobileNetV1

XNNPACK b7b048, us (fewer is better)
8c DDR5-6000: 4596 (SE +/- 41.29, N = 3)
12c DDR5-6000: 4629 (SE +/- 57.47, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

XNNPACK b7b048, us (fewer is better)
8c DDR5-6000: 9236 (SE +/- 46.71, N = 3)
12c DDR5-6000: 9242 (SE +/- 73.61, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

XNNPACK b7b048, us (fewer is better)
12c DDR5-6000: 13643 (SE +/- 63.76, N = 3)
8c DDR5-6000: 14189 (SE +/- 140.55, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

XNNPACK b7b048, us (fewer is better)
12c DDR5-6000: 10418 (SE +/- 54.11, N = 3)
8c DDR5-6000: 10534 (SE +/- 8.95, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV1

XNNPACK b7b048, us (fewer is better)
8c DDR5-6000: 4570 (SE +/- 11.46, N = 3)
12c DDR5-6000: 4611 (SE +/- 43.59, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

XNNPACK b7b048, us (fewer is better)
12c DDR5-6000: 8982 (SE +/- 96.09, N = 3)
8c DDR5-6000: 9094 (SE +/- 49.08, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

XNNPACK b7b048, us (fewer is better)
12c DDR5-6000: 13074 (SE +/- 58.09, N = 3)
8c DDR5-6000: 13257 (SE +/- 75.41, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

XNNPACK b7b048, us (fewer is better)
12c DDR5-6000: 10559 (SE +/- 181.63, N = 3)
8c DDR5-6000: 10810 (SE +/- 162.45, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QS8MobileNetV2

XNNPACK b7b048, us (fewer is better)
8c DDR5-6000: 9968 (SE +/- 103.51, N = 3)
12c DDR5-6000: 10095 (SE +/- 60.71, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024, Ns Per Day (more is better)
12c DDR5-6000: 17.79 (SE +/- 0.02, N = 3)
8c DDR5-6000: 16.65 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -lm

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

High Performance Conjugate Gradient 3.1, GFLOP/s (more is better)
12c DDR5-6000: 62.54 (SE +/- 0.03, N = 3)
8c DDR5-6000: 41.32 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

High Performance Conjugate Gradient 3.1, GFLOP/s (more is better)
12c DDR5-6000: 61.77 (SE +/- 0.01, N = 3)
8c DDR5-6000: 41.02 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4, Total Mop/s (more is better)
12c DDR5-6000: 361153.06 (SE +/- 2078.86, N = 3)
8c DDR5-6000: 284718.53 (SE +/- 377.10, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4, Total Mop/s (more is better)
12c DDR5-6000: 65380.58 (SE +/- 928.22, N = 15)
8c DDR5-6000: 60572.20 (SE +/- 631.87, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4, Total Mop/s (more is better)
8c DDR5-6000: 11375.83 (SE +/- 91.88, N = 3)
12c DDR5-6000: 11230.81 (SE +/- 85.90, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4, Total Mop/s (more is better)
8c DDR5-6000: 14658.25 (SE +/- 133.00, N = 3)
12c DDR5-6000: 13685.97 (SE +/- 614.47, N = 12)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4, Total Mop/s (more is better)
12c DDR5-6000: 164903.45 (SE +/- 1877.63, N = 4)
8c DDR5-6000: 120202.20 (SE +/- 1124.88, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4, Total Mop/s (more is better)
12c DDR5-6000: 6970.89 (SE +/- 26.78, N = 3)
8c DDR5-6000: 5653.35 (SE +/- 31.49, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4, Total Mop/s (more is better)
12c DDR5-6000: 311883.80 (SE +/- 2348.10, N = 3)
8c DDR5-6000: 275011.41 (SE +/- 1080.76, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: MG.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
12c DDR5-6000: 147097.41 (SE +/- 2056.93, N = 3)
8c DDR5-6000: 105206.58 (SE +/- 1082.06, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: SP.B

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
12c DDR5-6000: 199255.89 (SE +/- 2749.03, N = 3)
8c DDR5-6000: 169436.81 (SE +/- 1524.43, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: SP.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
12c DDR5-6000: 154965.84 (SE +/- 622.44, N = 3)
8c DDR5-6000: 108558.21 (SE +/- 459.55, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
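
To make the NAS Parallel Benchmarks figures above easier to compare, the following minimal Python sketch computes the percent change of the 12-channel result relative to the 8-channel result for each test. The numbers are copied from the results above; the names `NPB_MOPS` and `percent_gain` are illustrative and are not part of the Phoronix Test Suite or the exported result file.

```python
# Total Mop/s values copied from the NAS Parallel Benchmarks 3.4 results above.
# Mapping: test/class -> (12c DDR5-6000, 8c DDR5-6000)
NPB_MOPS = {
    "EP.C": (11230.81, 11375.83),
    "EP.D": (13685.97, 14658.25),
    "FT.C": (164903.45, 120202.20),
    "IS.D": (6970.89, 5653.35),
    "LU.C": (311883.80, 275011.41),
    "MG.C": (147097.41, 105206.58),
    "SP.B": (199255.89, 169436.81),
    "SP.C": (154965.84, 108558.21),
}


def percent_gain(twelve_channel, eight_channel):
    """Percent change of the 12-channel result relative to the 8-channel one.

    Assumes a "more is better" metric such as Total Mop/s.
    """
    return (twelve_channel / eight_channel - 1.0) * 100.0


gains = {test: percent_gain(*values) for test, values in NPB_MOPS.items()}

# Print the tests from largest to smallest gain.
for test, gain in sorted(gains.items(), key=lambda item: item[1], reverse=True):
    print(f"{test}: {gain:+.1f}%")
```

A positive value means the 12-channel configuration was faster. For "fewer is better" metrics elsewhere in this result file, the ratio would need to be inverted before it can be read the same way.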

NAMD

Input: ATPase with 327,506 Atoms

ns/day, More Is Better (NAMD 3.0)
12c DDR5-6000: 12.29 (SE +/- 0.03, N = 3)
8c DDR5-6000: 11.62 (SE +/- 0.03, N = 3)

NAMD

Input: STMV with 1,066,628 Atoms

ns/day, More Is Better (NAMD 3.0)
12c DDR5-6000: 4.21717 (SE +/- 0.00356, N = 3)
8c DDR5-6000: 3.55953 (SE +/- 0.00613, N = 3)

oneDNN

Harness: IP Shapes 1D - Engine: CPU

ms, Fewer Is Better (oneDNN 3.6)
8c DDR5-6000: 0.529052 (SE +/- 0.004111, N = 3, MIN: 0.47)
12c DDR5-6000: 0.531984 (SE +/- 0.003116, N = 3, MIN: 0.48)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: IP Shapes 3D - Engine: CPU

ms, Fewer Is Better (oneDNN 3.6)
8c DDR5-6000: 0.265558 (SE +/- 0.000649, N = 3, MIN: 0.25)
12c DDR5-6000: 0.266791 (SE +/- 0.000721, N = 3, MIN: 0.25)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Engine: CPU

ms, Fewer Is Better (oneDNN 3.6)
8c DDR5-6000: 0.341374 (SE +/- 0.000677, N = 3, MIN: 0.33)
12c DDR5-6000: 0.343593 (SE +/- 0.000927, N = 3, MIN: 0.32)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Engine: CPU

ms, Fewer Is Better (oneDNN 3.6)
12c DDR5-6000: 6.76736 (SE +/- 0.02120, N = 3, MIN: 4.07)
8c DDR5-6000: 6.84402 (SE +/- 0.01476, N = 3, MIN: 5.14)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Engine: CPU

ms, Fewer Is Better (oneDNN 3.6)
8c DDR5-6000: 0.715363 (SE +/- 0.001800, N = 3, MIN: 0.62)
12c DDR5-6000: 0.716373 (SE +/- 0.002304, N = 3, MIN: 0.62)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Recurrent Neural Network Training - Engine: CPU

ms, Fewer Is Better (oneDNN 3.6)
12c DDR5-6000: 425.78 (SE +/- 0.20, N = 3, MIN: 420.18)
8c DDR5-6000: 429.68 (SE +/- 0.28, N = 3, MIN: 423.83)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Engine: CPU

ms, Fewer Is Better (oneDNN 3.6)
12c DDR5-6000: 277.64 (SE +/- 0.81, N = 3, MIN: 269.94)
8c DDR5-6000: 278.00 (SE +/- 0.67, N = 3, MIN: 271.73)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
12c DDR5-6000: 689.50 (SE +/- 1.81, N = 3)
8c DDR5-6000: 449.23 (SE +/- 6.83, N = 15)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
12c DDR5-6000: 69.48 (SE +/- 0.19, N = 3, MIN: 34.01 / MAX: 134.55)
8c DDR5-6000: 106.96 (SE +/- 1.46, N = 15, MIN: 37.41 / MAX: 380.85)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
8c DDR5-6000: 145.11 (SE +/- 0.12, N = 3)
12c DDR5-6000: 145.10 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
12c DDR5-6000: 330.01 (SE +/- 0.62, N = 3, MIN: 242.42 / MAX: 353.46)
8c DDR5-6000: 330.13 (SE +/- 0.26, N = 3, MIN: 257.44 / MAX: 357.54)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
8c DDR5-6000: 8240.98 (SE +/- 5.48, N = 3)
12c DDR5-6000: 8240.40 (SE +/- 10.65, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
8c DDR5-6000: 5.80 (SE +/- 0.00, N = 3, MIN: 2.26 / MAX: 26.37)
12c DDR5-6000: 5.80 (SE +/- 0.01, N = 3, MIN: 2.41 / MAX: 27.54)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
8c DDR5-6000: 23443.86 (SE +/- 11.10, N = 3)
12c DDR5-6000: 23427.50 (SE +/- 29.34, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
12c DDR5-6000: 3.96 (SE +/- 0.01, N = 3, MIN: 1.68 / MAX: 20.25)
8c DDR5-6000: 3.97 (SE +/- 0.00, N = 3, MIN: 1.61 / MAX: 20.64)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
12c DDR5-6000: 2437.93 (SE +/- 1.15, N = 3)
8c DDR5-6000: 2291.09 (SE +/- 2.70, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
12c DDR5-6000: 19.61 (SE +/- 0.01, N = 3, MIN: 10.37 / MAX: 41.25)
8c DDR5-6000: 20.85 (SE +/- 0.02, N = 3, MIN: 10.51 / MAX: 40.51)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
12c DDR5-6000: 844.93 (SE +/- 1.30, N = 3)
8c DDR5-6000: 740.13 (SE +/- 4.52, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
12c DDR5-6000: 56.71 (SE +/- 0.09, N = 3, MIN: 34.97 / MAX: 96.61)
8c DDR5-6000: 64.75 (SE +/- 0.40, N = 3, MIN: 28.81 / MAX: 99.11)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
8c DDR5-6000: 14026.36 (SE +/- 1.42, N = 3)
12c DDR5-6000: 14006.15 (SE +/- 11.52, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
8c DDR5-6000: 6.70 (SE +/- 0.00, N = 3, MIN: 2.45 / MAX: 24)
12c DDR5-6000: 6.71 (SE +/- 0.01, N = 3, MIN: 2.58 / MAX: 23.11)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
12c DDR5-6000: 6380.58 (SE +/- 12.38, N = 3)
8c DDR5-6000: 6329.60 (SE +/- 17.72, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
12c DDR5-6000: 7.48 (SE +/- 0.01, N = 3, MIN: 4.42 / MAX: 23.35)
8c DDR5-6000: 7.53 (SE +/- 0.02, N = 3, MIN: 4.35 / MAX: 22.92)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
12c DDR5-6000: 7934.40 (SE +/- 20.16, N = 3)
8c DDR5-6000: 7314.92 (SE +/- 10.57, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
12c DDR5-6000: 11.57 (SE +/- 0.03, N = 3, MIN: 6.95 / MAX: 30.79)
8c DDR5-6000: 12.42 (SE +/- 0.02, N = 3, MIN: 6.11 / MAX: 28.55)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
12c DDR5-6000: 10462.56 (SE +/- 5.49, N = 3)
8c DDR5-6000: 10448.92 (SE +/- 7.25, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
8c DDR5-6000: 4.56 (SE +/- 0.01, N = 3, MIN: 2.23 / MAX: 20.98)
12c DDR5-6000: 4.56 (SE +/- 0.00, N = 3, MIN: 2.45 / MAX: 18.95)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
12c DDR5-6000: 3551.35 (SE +/- 4.91, N = 3)
8c DDR5-6000: 3543.79 (SE +/- 1.42, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
12c DDR5-6000: 27.01 (SE +/- 0.04, N = 3, MIN: 16.35 / MAX: 47.96)
8c DDR5-6000: 27.06 (SE +/- 0.01, N = 3, MIN: 16.36 / MAX: 45.56)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

FPS, More Is Better (OpenVINO 2024.0)
12c DDR5-6000: 166983.33 (SE +/- 251.97, N = 3)
8c DDR5-6000: 166363.91 (SE +/- 234.32, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

ms, Fewer Is Better (OpenVINO 2024.0)
8c DDR5-6000: 0.33 (SE +/- 0.00, N = 3, MIN: 0.13 / MAX: 24.63)
12c DDR5-6000: 0.33 (SE +/- 0.00, N = 3, MIN: 0.12 / MAX: 23.43)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Algebraic Multi-Grid Benchmark

Figure Of Merit, More Is Better (Algebraic Multi-Grid Benchmark 1.2)
12c DDR5-6000: 3076286333 (SE +/- 7144725.85, N = 3)
8c DDR5-6000: 2085218000 (SE +/- 1073443.21, N = 3)

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

Seconds, Fewer Is Better (OpenFOAM 10)
8c DDR5-6000: 22.30
12c DDR5-6000: 22.94
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

Seconds, Fewer Is Better (OpenFOAM 10)
12c DDR5-6000: 22.41
8c DDR5-6000: 22.65
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

Seconds, Fewer Is Better (OpenFOAM 10)
12c DDR5-6000: 106.31
8c DDR5-6000: 109.20
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

Seconds, Fewer Is Better (OpenFOAM 10)
12c DDR5-6000: 192.23
8c DDR5-6000: 254.30
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

Seconds, Fewer Is Better (Xcompact3d Incompact3d 2021-03-11)
12c DDR5-6000: 203.94 (SE +/- 2.76, N = 3)
8c DDR5-6000: 292.58 (SE +/- 3.79, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Seconds, Fewer Is Better (Xcompact3d Incompact3d 2021-03-11)
12c DDR5-6000: 7.08492692 (SE +/- 0.06792326, N = 3)
8c DDR5-6000: 9.01416429 (SE +/- 0.05824182, N = 15)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

GPAW

Input: Carbon Nanotube

Seconds, Fewer Is Better (GPAW 23.6)
12c DDR5-6000: 25.66 (SE +/- 0.23, N = 3)
8c DDR5-6000: 28.56 (SE +/- 0.24, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

7-Zip Compression

Test: Compression Rating

MIPS, More Is Better (7-Zip Compression 24.05)
12c DDR5-6000: 779242 (SE +/- 879.82, N = 3)
8c DDR5-6000: 727045 (SE +/- 1850.34, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

MIPS, More Is Better (7-Zip Compression 24.05)
12c DDR5-6000: 627215 (SE +/- 169.10, N = 3)
8c DDR5-6000: 626553 (SE +/- 340.29, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

John The Ripper

Test: bcrypt

Real C/S, More Is Better (John The Ripper 2023.03.14)
12c DDR5-6000: 237542 (SE +/- 33.49, N = 3)
8c DDR5-6000: 237055 (SE +/- 505.86, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: WPA PSK

Real C/S, More Is Better (John The Ripper 2023.03.14)
12c DDR5-6000: 1023333 (SE +/- 2185.81, N = 3)
8c DDR5-6000: 1019333 (SE +/- 1666.67, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: Blowfish

Real C/S, More Is Better (John The Ripper 2023.03.14)
12c DDR5-6000: 237515 (SE +/- 16.90, N = 3)
8c DDR5-6000: 237458 (SE +/- 16.64, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: HMAC-SHA512

Real C/S, More Is Better (John The Ripper 2023.03.14)
12c DDR5-6000: 413474333 (SE +/- 2811379.81, N = 3)
8c DDR5-6000: 413326333 (SE +/- 211622.57, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: MD5

Real C/S, More Is Better (John The Ripper 2023.03.14)
12c DDR5-6000: 21681000 (SE +/- 20808.65, N = 3)
8c DDR5-6000: 21408333 (SE +/- 32518.37, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

m-queens

Time To Solve

Seconds, Fewer Is Better (m-queens 1.2)
12c DDR5-6000: 7.214 (SE +/- 0.047, N = 3)
8c DDR5-6000: 7.229 (SE +/- 0.010, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native

Timed Linux Kernel Compilation

Build: defconfig

Seconds, Fewer Is Better (Timed Linux Kernel Compilation 6.8)
12c DDR5-6000: 22.22 (SE +/- 0.25, N = 4)
8c DDR5-6000: 22.56 (SE +/- 0.25, N = 4)

Timed Linux Kernel Compilation

Build: allmodconfig

Seconds, Fewer Is Better (Timed Linux Kernel Compilation 6.8)
12c DDR5-6000: 193.35 (SE +/- 0.43, N = 3)
8c DDR5-6000: 196.24 (SE +/- 0.13, N = 3)

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

Frames Per Second, More Is Better (Kvazaar 2.2)
12c DDR5-6000: 44.79 (SE +/- 0.08, N = 3)
8c DDR5-6000: 44.76 (SE +/- 0.11, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

Frames Per Second, More Is Better (Kvazaar 2.2)
8c DDR5-6000: 45.38 (SE +/- 0.09, N = 3)
12c DDR5-6000: 45.30 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Frames Per Second, More Is Better (Kvazaar 2.2)
12c DDR5-6000: 90.46 (SE +/- 0.61, N = 3)
8c DDR5-6000: 89.33 (SE +/- 0.20, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Super Fast

Frames Per Second, More Is Better (Kvazaar 2.2)
8c DDR5-6000: 93.40 (SE +/- 0.53, N = 3)
12c DDR5-6000: 91.54 (SE +/- 0.52, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LiteRT

Model: SqueezeNet

Microseconds, Fewer Is Better (LiteRT 2024-10-15)
8c DDR5-6000: 7048.39 (SE +/- 27.86, N = 3)
12c DDR5-6000: 7051.59 (SE +/- 17.70, N = 3)

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Frames Per Second, More Is Better (Kvazaar 2.2)
8c DDR5-6000: 96.72 (SE +/- 0.36, N = 3)
12c DDR5-6000: 96.07 (SE +/- 0.29, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

WarpX

Input: Plasma Acceleration

Seconds, Fewer Is Better (WarpX 24.10)
8c DDR5-6000: 25.76 (SE +/- 0.17, N = 3)
12c DDR5-6000: 25.85 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -O3 -lm

Epoch

Epoch3D Deck: Cone

Seconds, Fewer Is Better (Epoch 4.19.4)
12c DDR5-6000: 186.85 (SE +/- 0.55, N = 3)
8c DDR5-6000: 196.03 (SE +/- 0.52, N = 3)
1. (F9X) gfortran options: -O3 -std=f2003 -Jobj -lsdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

WarpX

Input: Uniform Plasma

Seconds, Fewer Is Better (WarpX 24.10)
12c DDR5-6000: 16.27 (SE +/- 0.23, N = 15)
8c DDR5-6000: 16.45 (SE +/- 0.24, N = 12)
1. (CXX) g++ options: -O3 -lm

SVT-AV1

Encoder Mode: Preset 3 - Input: Bosphorus 4K

Frames Per Second, More Is Better (SVT-AV1 2.3)
12c DDR5-6000: 15.68 (SE +/- 0.03, N = 3)
8c DDR5-6000: 15.60 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 5 - Input: Bosphorus 4K

Frames Per Second, More Is Better (SVT-AV1 2.3)
12c DDR5-6000: 56.44 (SE +/- 0.28, N = 3)
8c DDR5-6000: 56.09 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

Frames Per Second, More Is Better (SVT-AV1 2.3)
12c DDR5-6000: 186.69 (SE +/- 1.50, N = 3)
8c DDR5-6000: 182.61 (SE +/- 0.19, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

Frames Per Second, More Is Better (SVT-AV1 2.3)
12c DDR5-6000: 435.38 (SE +/- 1.17, N = 3)
8c DDR5-6000: 430.87 (SE +/- 0.57, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 3 - Input: Beauty 4K 10-bit

Frames Per Second, More Is Better (SVT-AV1 2.3)
12c DDR5-6000: 1.852 (SE +/- 0.007, N = 3)
8c DDR5-6000: 1.846 (SE +/- 0.002, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit

Frames Per Second, More Is Better (SVT-AV1 2.3)
12c DDR5-6000: 7.699 (SE +/- 0.039, N = 3)
8c DDR5-6000: 7.687 (SE +/- 0.026, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit

Frames Per Second, More Is Better (SVT-AV1 2.3)
12c DDR5-6000: 13.55 (SE +/- 0.00, N = 3)
8c DDR5-6000: 13.51 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit

Frames Per Second, More Is Better (SVT-AV1 2.3)
12c DDR5-6000: 17.15 (SE +/- 0.01, N = 3)
8c DDR5-6000: 17.11 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

LiteRT

Model: Inception V4

Microseconds, Fewer Is Better (LiteRT 2024-10-15)
8c DDR5-6000: 43959.9 (SE +/- 438.48, N = 6)
12c DDR5-6000: 44088.3 (SE +/- 38.95, N = 3)

Chaos Group V-RAY

Mode: CPU

vsamples, More Is Better (Chaos Group V-RAY 6.0)
12c DDR5-6000: 199769 (SE +/- 252.42, N = 3)
8c DDR5-6000: 198145 (SE +/- 357.23, N = 3)

Blender

Blend File: BMW27 - Compute: CPU-Only

Seconds, Fewer Is Better (Blender 4.2)
8c DDR5-6000: 13.09 (SE +/- 0.05, N = 3)
12c DDR5-6000: 13.14 (SE +/- 0.05, N = 3)

Blender

Blend File: Junkshop - Compute: CPU-Only

Seconds, Fewer Is Better (Blender 4.2)
12c DDR5-6000: 16.75 (SE +/- 0.01, N = 3)
8c DDR5-6000: 16.80 (SE +/- 0.02, N = 3)

Blender

Blend File: Classroom - Compute: CPU-Only

Seconds, Fewer Is Better (Blender 4.2)
12c DDR5-6000: 35.43 (SE +/- 0.04, N = 3)
8c DDR5-6000: 35.51 (SE +/- 0.08, N = 3)

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Seconds, Fewer Is Better (Blender 4.2)
12c DDR5-6000: 18.46 (SE +/- 0.05, N = 3)
8c DDR5-6000: 18.55 (SE +/- 0.03, N = 3)

Blender

Blend File: Barbershop - Compute: CPU-Only

Seconds, Fewer Is Better (Blender 4.2)
12c DDR5-6000: 120.83 (SE +/- 0.11, N = 3)
8c DDR5-6000: 121.15 (SE +/- 0.12, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Seconds, Fewer Is Better (Blender 4.2)
8c DDR5-6000: 41.18 (SE +/- 0.15, N = 3)
12c DDR5-6000: 41.25 (SE +/- 0.14, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Slow

Frames Per Second, More Is Better (uvg266 0.8.0)
12c DDR5-6000: 30.40 (SE +/- 0.01, N = 3)
8c DDR5-6000: 30.30 (SE +/- 0.04, N = 3)

LiteRT

Model: Inception ResNet V2

Microseconds, Fewer Is Better (LiteRT 2024-10-15)
12c DDR5-6000: 58054.3 (SE +/- 598.16, N = 3)
8c DDR5-6000: 59977.8 (SE +/- 811.71, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

Frames Per Second, More Is Better (uvg266 0.8.0)
12c DDR5-6000: 33.04 (SE +/- 0.06, N = 3)
8c DDR5-6000: 32.92 (SE +/- 0.02, N = 3)

LiteRT

Model: Mobilenet Quant

Microseconds, Fewer Is Better (LiteRT 2024-10-15)
12c DDR5-6000: 5800.65 (SE +/- 109.66, N = 15)
8c DDR5-6000: 6260.21 (SE +/- 155.36, N = 15)

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

Frames Per Second, More Is Better (uvg266 0.8.0)
12c DDR5-6000: 67.24 (SE +/- 0.21, N = 3)
8c DDR5-6000: 66.12 (SE +/- 0.33, N = 3)

LiteRT

Model: Mobilenet Float

Microseconds, Fewer Is Better (LiteRT 2024-10-15)
12c DDR5-6000: 4399.48 (SE +/- 16.88, N = 3)
8c DDR5-6000: 4441.27 (SE +/- 18.31, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Frames Per Second, More Is Better (uvg266 0.8.0)
12c DDR5-6000: 67.65 (SE +/- 0.18, N = 3)
8c DDR5-6000: 66.86 (SE +/- 0.79, N = 3)

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

Frames Per Second, More Is Better (VVenC 1.11)
12c DDR5-6000: 9.643 (SE +/- 0.082, N = 3)
8c DDR5-6000: 9.598 (SE +/- 0.064, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

Frames Per Second, More Is Better (VVenC 1.11)
12c DDR5-6000: 22.96 (SE +/- 0.06, N = 3)
8c DDR5-6000: 22.64 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

LiteRT

Model: DeepLab V3

Microseconds, Fewer Is Better (LiteRT 2024-10-15)
12c DDR5-6000: 11450.9 (SE +/- 138.74, N = 3)
8c DDR5-6000: 11817.9 (SE +/- 140.63, N = 15)

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

Frames Per Second, More Is Better (uvg266 0.8.0)
12c DDR5-6000: 67.26 (SE +/- 0.11, N = 3)
8c DDR5-6000: 65.52 (SE +/- 0.32, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Seconds, Fewer Is Better (Timed Godot Game Engine Compilation 4.0)
12c DDR5-6000: 80.55 (SE +/- 0.19, N = 3)
8c DDR5-6000: 80.87 (SE +/- 0.04, N = 3)

OpenVKL

Benchmark: vklBenchmarkCPU ISPC

Items / Sec, More Is Better (OpenVKL 2.0.0)
12c DDR5-6000: 2825 (SE +/- 1.76, N = 3, MIN: 217 / MAX: 36373)
8c DDR5-6000: 2762 (SE +/- 1.53, N = 3, MIN: 216 / MAX: 33882)

LuxCoreRender

Scene: DLSC - Acceleration: CPU

M samples/sec, More Is Better (LuxCoreRender 2.6)
8c DDR5-6000: 17.01 (SE +/- 0.21, N = 15, MIN: 15.67 / MAX: 20.15)
12c DDR5-6000: 16.49 (SE +/- 0.19, N = 3, MIN: 15.79 / MAX: 20.03)

LiteRT

Model: Quantized COCO SSD MobileNet v1

Microseconds, Fewer Is Better (LiteRT 2024-10-15)
8c DDR5-6000: 7110.12 (SE +/- 68.84, N = 15)
12c DDR5-6000: 7211.99 (SE +/- 28.16, N = 3)

LiteRT

Model: NASNet Mobile

Microseconds, Fewer Is Better (LiteRT 2024-10-15)
12c DDR5-6000: 99408.3 (SE +/- 1271.15, N = 3)
8c DDR5-6000: 100519.4 (SE +/- 1096.54, N = 15)

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

M samples/sec, More Is Better (LuxCoreRender 2.6)
8c DDR5-6000: 12.06 (SE +/- 0.11, N = 15, MIN: 5.76 / MAX: 14.38)
12c DDR5-6000: 12.03 (SE +/- 0.09, N = 15, MIN: 5.72 / MAX: 14.2)

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

M samples/sec, More Is Better (LuxCoreRender 2.6)
12c DDR5-6000: 24.19 (SE +/- 0.09, N = 3, MIN: 21.48 / MAX: 33.13)
8c DDR5-6000: 24.15 (SE +/- 0.09, N = 3, MIN: 21.4 / MAX: 32.92)

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

M samples/sec, More Is Better (LuxCoreRender 2.6)
8c DDR5-6000: 12.92 (SE +/- 0.17, N = 3, MIN: 6.3 / MAX: 15.04)
12c DDR5-6000: 12.39 (SE +/- 0.12, N = 15, MIN: 5.62 / MAX: 14.88)

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
12c DDR5-6000: 28.23 (SE +/- 0.19, N = 3; MIN: 25.05 / MAX: 28.69)
8c DDR5-6000: 27.85 (SE +/- 0.02, N = 3; MIN: 24.8 / MAX: 28.48)

Build2

Time To Compile

Build2 0.17 (Seconds, Fewer Is Better)
12c DDR5-6000: 53.44 (SE +/- 0.01, N = 3)
8c DDR5-6000: 53.55 (SE +/- 0.16, N = 3)

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 3214133333 (SE +/- 3636084.59, N = 3)
12c DDR5-6000: 3195900000 (SE +/- 1814754.35, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 5053200000 (SE +/- 8868107.65, N = 3)
12c DDR5-6000: 4963433333 (SE +/- 8326730.72, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
12c DDR5-6000: 5958600000 (SE +/- 9761830.43, N = 3)
8c DDR5-6000: 5955433333 (SE +/- 8738103.02, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 1417633333 (SE +/- 2356079.61, N = 3)
12c DDR5-6000: 1412966667 (SE +/- 1419311.26, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
8c DDR5-6000: 1956333333 (SE +/- 1072898.46, N = 3)
12c DDR5-6000: 1952100000 (SE +/- 6251666.44, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 192 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
12c DDR5-6000: 2149200000 (SE +/- 1582192.57, N = 3)
8c DDR5-6000: 2136766667 (SE +/- 821245.67, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
8c DDR5-6000: 26209.9 (SE +/- 185.11, N = 12)
12c DDR5-6000: 25941.9 (SE +/- 311.92, N = 3)
(CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
8c DDR5-6000: 6238.1 (SE +/- 0.12, N = 3; MIN: 3819.5 / MAX: 6238.3)
12c DDR5-6000: 6237.7 (SE +/- 0.38, N = 3; MIN: 3818.9 / MAX: 6238.4)
(CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

nginx

Connections: 500

nginx 1.23.2 (Requests Per Second, More Is Better)
8c DDR5-6000: 523766.78 (SE +/- 3145.55, N = 3)
12c DDR5-6000: 505484.86 (SE +/- 328.79, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 500

Apache HTTP Server 2.4.56 (Requests Per Second, More Is Better)
12c DDR5-6000: 170467.37 (SE +/- 413.18, N = 3)
8c DDR5-6000: 159864.99 (SE +/- 177.80, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
12c DDR5-6000: 740.15 (SE +/- 3.26, N = 3; MIN: 69.2 / MAX: 7500)
8c DDR5-6000: 637.67 (SE +/- 1.98, N = 3; MIN: 67.95 / MAX: 7500)

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
12c DDR5-6000: 771.05 (SE +/- 2.95, N = 3; MIN: 69.77 / MAX: 8571.43)
8c DDR5-6000: 652.53 (SE +/- 0.96, N = 3; MIN: 69.28 / MAX: 7500)

ClickHouse

100M Rows Hits Dataset, Third Run

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
12c DDR5-6000: 765.07 (SE +/- 4.74, N = 3; MIN: 69.77 / MAX: 8571.43)
8c DDR5-6000: 651.48 (SE +/- 1.81, N = 3; MIN: 70.92 / MAX: 6666.67)
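
The three ClickHouse runs above show the largest gap between the two memory configurations in this part of the results. As a rough illustration of how that gap can be expressed as a percentage, here is a minimal Python sketch. The helper name `percent_gain` and its `higher_is_better` flag are hypothetical and are not part of the Phoronix Test Suite; the input numbers are the mean values reported above, and the sketch ignores the reported standard errors.

```python
def percent_gain(candidate: float, baseline: float, higher_is_better: bool = True) -> float:
    """Return how much better `candidate` is than `baseline`, in percent.

    Hypothetical helper for illustration only; not a Phoronix Test Suite API.
    """
    if higher_is_better:
        return (candidate / baseline - 1.0) * 100.0
    # For time-like metrics a smaller value wins, so the ratio is inverted.
    return (baseline / candidate - 1.0) * 100.0


# Mean Queries Per Minute from the ClickHouse results: (12c DDR5-6000, 8c DDR5-6000).
clickhouse_runs = {
    "First Run / Cold Cache": (740.15, 637.67),
    "Second Run": (771.05, 652.53),
    "Third Run": (765.07, 651.48),
}

for run, (twelve_channel, eight_channel) in clickhouse_runs.items():
    gain = percent_gain(twelve_channel, eight_channel)
    print(f"{run}: 12c is {gain:.2f}% ahead of 8c")
```

For results marked "Fewer Is Better", such as the compile-time tests, the same helper would be called with `higher_is_better=False`.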

Memcached

Set To Get Ratio: 1:5

Memcached 1.6.19 (Ops/sec, More Is Better)
8c DDR5-6000: 3711825.75 (SE +/- 3725.22, N = 3)
12c DDR5-6000: 3701189.23 (SE +/- 16589.87, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:10

Memcached 1.6.19 (Ops/sec, More Is Better)
8c DDR5-6000: 6752055.12 (SE +/- 5728.08, N = 3)
12c DDR5-6000: 6670118.94 (SE +/- 23004.60, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.19 (Ops/sec, More Is Better)
8c DDR5-6000: 12031156.03 (SE +/- 48609.95, N = 3)
12c DDR5-6000: 11923788.43 (SE +/- 20796.91, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Apache Cassandra

Test: Writes

Apache Cassandra 5.0 (Op/s, More Is Better)
12c DDR5-6000: 447176 (SE +/- 2454.69, N = 3)
8c DDR5-6000: 422312 (SE +/- 1303.09, N = 3)

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Only

PostgreSQL 17 (TPS, More Is Better)
12c DDR5-6000: 3155036 (SE +/- 1685.88, N = 3)
8c DDR5-6000: 3149277 (SE +/- 8342.05, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average Latency

PostgreSQL 17 (ms, Fewer Is Better)
8c DDR5-6000: 0.254 (SE +/- 0.001, N = 3)
12c DDR5-6000: 0.254 (SE +/- 0.000, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Write

PostgreSQL 17 (TPS, More Is Better)
12c DDR5-6000: 113745 (SE +/- 1257.14, N = 5)
8c DDR5-6000: 112175 (SE +/- 836.14, N = 11)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Write - Average Latency

PostgreSQL 17 (ms, Fewer Is Better)
12c DDR5-6000: 7.037 (SE +/- 0.079, N = 5)
8c DDR5-6000: 7.136 (SE +/- 0.053, N = 11)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime 1.19 (Inference Time Cost (ms), Fewer Is Better)
12c DDR5-6000: 5.11928 (SE +/- 0.03369, N = 3)
8c DDR5-6000: 5.34359 (SE +/- 0.06457, N = 3)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime 1.19 (Inference Time Cost (ms), Fewer Is Better)
12c DDR5-6000: 81.88 (SE +/- 0.85, N = 3)
8c DDR5-6000: 82.94 (SE +/- 1.14, N = 3)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ZFNet-512 - Device: CPU - Executor: Standard

ONNX Runtime 1.19 (Inference Time Cost (ms), Fewer Is Better)
12c DDR5-6000: 6.09718 (SE +/- 0.00692, N = 3)
8c DDR5-6000: 6.50761 (SE +/- 0.02791, N = 3)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

ONNX Runtime 1.19 (Inference Time Cost (ms), Fewer Is Better)
12c DDR5-6000: 4.10880 (SE +/- 0.00593, N = 3)
8c DDR5-6000: 4.24372 (SE +/- 0.01572, N = 3)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

ONNX Runtime 1.19 (Inference Time Cost (ms), Fewer Is Better)
12c DDR5-6000: 45.41 (SE +/- 0.24, N = 3)
8c DDR5-6000: 46.20 (SE +/- 0.48, N = 3)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

ONNX Runtime 1.19 (Inference Time Cost (ms), Fewer Is Better)
12c DDR5-6000: 0.959531 (SE +/- 0.010778, N = 3)
8c DDR5-6000: 1.047505 (SE +/- 0.018573, N = 15)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime 1.19 (Inference Time Cost (ms), Fewer Is Better)
12c DDR5-6000: 106.21 (SE +/- 1.49, N = 15)
8c DDR5-6000: 109.01 (SE +/- 1.08, N = 15)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt


Phoronix Test Suite v10.8.5