GCE c3d-standard-60

KVM testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310039-NE-2310031NE80&grr&rdt&rro.

GCE c3d-standard-60ProcessorMotherboardChipsetMemoryDiskNetworkOSKernelVulkanCompilerFile-SystemSystem Layerc3d-standard-60 AMD Genoat2d-standard-60 AMD MilanAMD EPYC 9B14 (30 Cores / 60 Threads)Google Compute Engine c3d-standard-60Intel 440FX 82441FX PMC240GB215GB nvme_card-pdGoogle Compute Engine VirtualUbuntu 22.046.2.0-1014-gcp (x86_64)1.3.238GCC 11.4.0ext4KVMAMD EPYC 7B13 (60 Cores)Google Compute Engine t2d-standard-60215GB PersistentDiskRed Hat Virtio deviceOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- CPU Microcode: 0xffffffffJava Details- OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Details- Python 3.10.12Security Details- c3d-standard-60 AMD Genoa: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - t2d-standard-60 AMD Milan: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

GCE c3d-standard-60pgbench: 100 - 800 - Read Write - Average Latencyapache-iotdb: 800 - 100 - 800 - 400apache-iotdb: 800 - 100 - 800 - 400nekrs: Kershawnekrs: TurboPipe Periodicpgbench: 100 - 1000 - Read Write - Average Latencyrodinia: OpenMP HotSpot3Dlammps: 20k Atomsopenradioss: Chrysler Neon 1Mapache-iotdb: 800 - 100 - 500 - 400apache-iotdb: 800 - 100 - 500 - 400blender: Barbershop - CPU-Onlyapache-iotdb: 500 - 100 - 800 - 400apache-iotdb: 500 - 100 - 800 - 400pgbench: 100 - 800 - Read Writebrl-cad: VGR Performance Metricopenradioss: Bird Strike on Windshieldopenradioss: Bumper Beamapache-iotdb: 500 - 100 - 500 - 400apache-iotdb: 500 - 100 - 500 - 400tensorflow: CPU - 64 - ResNet-50openradioss: Rubber O-Ring Seal Installationopenradioss: Cell Phone Drop Testpgbench: 100 - 1000 - Read Writebuild-nodejs: Time To Compileopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUopenssl: AES-256-GCMopenssl: AES-128-GCMopenssl: ChaCha20openssl: ChaCha20-Poly1305openssl: SHA512openssl: SHA256build-gem5: Time To Compilestockfish: Total Timebuild-linux-kernel: allmodconfigpgbench: 100 - 800 - Read Only - Average Latencypgbench: 100 - 1000 - Read Only - Average Latencycassandra: Writestensorflow: CPU - 32 - ResNet-50blender: Pabellon Barcelona - CPU-Onlynpb: IS.Dlaghos: Sedov Blast Wave, ube_922_hex.meshnginx: 1000nginx: 500blender: Classroom - CPU-Onlypgbench: 100 - 800 - Read Onlyavifenc: 0pgbench: 100 - 1000 - Read Onlynpb: LU.Ctensorflow: CPU - 16 - ResNet-50incompact3d: input.i3d 193 Cells Per Directionopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenssl: RSA4096openssl: RSA4096rodinia: OpenMP LavaMDlaghos: Triple Point Problemblender: Fishy Cat - CPU-Onlygromacs: MPI CPU - water_GMX50_barerodinia: OpenMP Leukocytenpb: EP.Davifenc: 2npb: SP.Cblender: BMW27 - CPU-Onlynpb: FT.Cnpb: CG.Cbuild-linux-kernel: defconfigamg: npb: BT.Ccompress-7zip: Decompression Ratingcompress-7zip: Compression Ratinglibxsmm: 32remhos: Sample Remap Examplelibxsmm: 64rodinia: OpenMP Streamclustercoremark: CoreMark Size 666 - Iterations Per Secondrodinia: OpenMP CFD Solveravifenc: 6, Losslessincompact3d: input.i3d 129 Cells Per Directionnpb: MG.Clammps: Rhodopsin Proteinavifenc: 6heffte: c2c - FFTW - double - 128heffte: r2c - FFTW - double - 128heffte: r2c - FFTW - float - 128heffte: c2c - FFTW - float - 128c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan682.12353598844289858333472394000084.16619.776337.70447.6834332237623.8934762565510819147.3192.87418.593326815869.6889.6538.82198.39483.99142.7583.91142.902933280484973430952844401739809498931239093047731470227057346211821313176.76710589445722864062.742422.40259.55180537.84187350.4478.06873563.1350.9928.0196877648.8118.39340.2935.1964.70185.296.791764.2118.56645.7420.77576.9439.38761.1431.08964.464.536605.125.862043.050.454971.268.621389.690.5243607.042.874166.398.203650.5715.981875.28493077.620079.564.862209.004.39145.4983783.6041.53839919.7139647.4719597.8696288983396257.48226211271795255.433.362489.76.4481445843.52155210.0256.8895.8715788542701.8317.4233.25057.311693.6005148.57588.6301140.916709.463506855736819358332730620000172.71788.53526.734327.88433.9634123810351.58633.58349258995682629363123.6175.68415.083346680420.9072.0630.125793191.706208.4773.74193.4578.962160259676402346040826101802491457701196477203372224480418350884997103170.930112958788333.3510.3990.49818716920.36112.641752.62364.64155609.04162957.7589.35200378478.350200818694247.7718.2924.57211811393.5610.73568.7726.28155.1496.5823.64633.7626.50565.3466.46225.4876.72390.7281.00370.033.524239.529.901512.570.6144049.1540.67368.390.9929668.1211.651285.4711.322646.5814.761014.70860844.612973.050.974222.3045.225.28942.0104935.6841.98943228.1134.2754846.1816649.3733.399920427767122720.61247255278973289.216.326554.26.4231730658.4494407.3687.6395.6305732747291.9627.8283.20560.0343106.029196.948109.676OpenBenchmarking.org

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latencyt2d-standard-60 AMD Milan306090120150SE +/- 1.23, N = 12140.921. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa150300450600750SE +/- 25.87, N = 3SE +/- 7.53, N = 3709.46682.12MAX: 113264.06MAX: 98294.84

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa8M16M24M32M40MSE +/- 354923.23, N = 3SE +/- 267152.12, N = 33506855735359884

nekRS

Input: Kershaw

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: Kershawt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa900M1800M2700M3600M4500MSE +/- 84802173.02, N = 12SE +/- 57202190.49, N = 12368193583342898583331. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

nekRS

Input: TurboPipe Periodic

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe Periodict2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1000M2000M3000M4000M5000MSE +/- 481352.26, N = 3SE +/- 201939132.13, N = 12273062000047239400001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latencyt2d-standard-60 AMD Milan4080120160200SE +/- 1.44, N = 8172.721. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3Dt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 1.83, N = 12SE +/- 0.86, N = 1588.5484.171. (CXX) g++ options: -O2 -lOpenCL

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k Atomst2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa612182430SE +/- 0.05, N = 3SE +/- 0.04, N = 326.7319.781. (CXX) g++ options: -O3 -lm -ldl

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1Mt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa70140210280350SE +/- 1.48, N = 3SE +/- 2.03, N = 3327.88337.70

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa100200300400500SE +/- 35.57, N = 3SE +/- 27.08, N = 3433.96447.68MAX: 103381.73MAX: 95136.87

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa7M14M21M28M35MSE +/- 130588.20, N = 3SE +/- 66565.66, N = 33412381034332237

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Onlyt2d-standard-60 AMD Milan80160240320400SE +/- 0.60, N = 3351.58

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa140280420560700SE +/- 12.58, N = 3SE +/- 3.18, N = 3633.58623.89MAX: 54749.7MAX: 41831.2

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa7M14M21M28M35MSE +/- 221111.93, N = 3SE +/- 188621.29, N = 33492589934762565

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Writet2d-standard-60 AMD Milan12002400360048006000SE +/- 49.28, N = 1256821. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.36VGR Performance Metrict2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa130K260K390K520K650K6293635108191. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshieldt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa306090120150SE +/- 0.13, N = 3SE +/- 3.26, N = 9123.61147.31

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beamt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.10, N = 3SE +/- 1.56, N = 1575.6892.87

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa90180270360450SE +/- 4.08, N = 3SE +/- 2.14, N = 3415.08418.59MAX: 28810.4MAX: 31920.67

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa7M14M21M28M35MSE +/- 303573.79, N = 3SE +/- 154587.14, N = 33346680433268158

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1632486480SE +/- 0.03, N = 3SE +/- 0.07, N = 320.9069.68

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installationt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.22, N = 3SE +/- 4.01, N = 1272.0689.65

OpenRadioss

Model: Cell Phone Drop Test

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Testt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa918273645SE +/- 0.39, N = 15SE +/- 1.27, N = 1530.1238.82

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Writet2d-standard-60 AMD Milan12002400360048006000SE +/- 49.93, N = 857931. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To Compilet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa4080120160200SE +/- 0.05, N = 3SE +/- 0.29, N = 3191.71198.39

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa50100150200250SE +/- 9.17, N = 15SE +/- 0.13, N = 3208.4783.99MIN: 119.65 / MAX: 316.01MIN: 64.11 / MAX: 117.811. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa306090120150SE +/- 3.12, N = 15SE +/- 0.23, N = 373.74142.751. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Detection FP32 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa4080120160200SE +/- 7.79, N = 15SE +/- 0.29, N = 3193.4583.91MIN: 113.44 / MAX: 315.54MIN: 61.61 / MAX: 119.941. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Detection FP32 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa306090120150SE +/- 2.74, N = 15SE +/- 0.48, N = 378.96142.901. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa60000M120000M180000M240000M300000MSE +/- 178290221.71, N = 3SE +/- 71241287.97, N = 32160259676402933280484971. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa70000M140000M210000M280000M350000MSE +/- 376720190.05, N = 3SE +/- 342949201.09, N = 32346040826103430952844401. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa40000M80000M120000M160000M200000MSE +/- 47698640.87, N = 3SE +/- 12326205.97, N = 31802491457701739809498931. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa30000M60000M90000M120000M150000MSE +/- 198663058.81, N = 3SE +/- 3664727.47, N = 31196477203371239093047731. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa5000M10000M15000M20000M25000MSE +/- 108274834.92, N = 3SE +/- 4399663.13, N = 322244804183147022705731. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa11000M22000M33000M44000M55000MSE +/- 20491615.60, N = 3SE +/- 9562123.40, N = 350884997103462118213131. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compilet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa4080120160200SE +/- 0.27, N = 3SE +/- 0.07, N = 3170.93176.77

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total Timet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20M40M60M80M100MSE +/- 1618403.29, N = 14SE +/- 1450871.09, N = 3112958788105894457-mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigt2d-standard-60 AMD Milan70140210280350SE +/- 1.20, N = 3333.35

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latencyt2d-standard-60 AMD Milan0.08980.17960.26940.35920.449SE +/- 0.004, N = 30.3991. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latencyt2d-standard-60 AMD Milan0.11210.22420.33630.44840.5605SE +/- 0.006, N = 30.4981. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Writest2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa50K100K150K200K250KSE +/- 681.76, N = 3SE +/- 1129.40, N = 3187169228640

TensorFlow

Device: CPU - Batch Size: 32 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1428425670SE +/- 0.06, N = 3SE +/- 0.10, N = 320.3662.74

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-Onlyt2d-standard-60 AMD Milan306090120150SE +/- 0.03, N = 3112.64

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.Dt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa5001000150020002500SE +/- 142.62, N = 12SE +/- 36.45, N = 151752.622422.401. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.mesht2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa80160240320400SE +/- 0.62, N = 3SE +/- 0.31, N = 3364.64259.551. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

nginx

Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa40K80K120K160K200KSE +/- 156.32, N = 3SE +/- 688.79, N = 3155609.04180537.841. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa40K80K120K160K200KSE +/- 394.72, N = 3SE +/- 126.15, N = 3162957.75187350.441. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Onlyt2d-standard-60 AMD Milan20406080100SE +/- 0.07, N = 389.35

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Onlyt2d-standard-60 AMD Milan400K800K1200K1600K2000KSE +/- 20558.90, N = 320037841. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 0t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.08, N = 3SE +/- 0.07, N = 378.3578.071. (CXX) g++ options: -O3 -fPIC -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Onlyt2d-standard-60 AMD Milan400K800K1200K1600K2000KSE +/- 22497.42, N = 320081861. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20K40K60K80K100KSE +/- 1463.52, N = 15SE +/- 293.57, N = 394247.7773563.131. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

TensorFlow

Device: CPU - Batch Size: 16 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1224364860SE +/- 0.05, N = 3SE +/- 0.04, N = 318.2950.99

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directiont2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa714212835SE +/- 0.31, N = 12SE +/- 0.25, N = 324.5728.021. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa30060090012001500SE +/- 1.32, N = 3SE +/- 0.17, N = 31393.56648.81MIN: 1290.45 / MAX: 1469.82MIN: 632.52 / MAX: 677.151. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 310.7318.391. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa120240360480600SE +/- 0.35, N = 3SE +/- 0.21, N = 3568.77340.29MIN: 481.12 / MAX: 606.31MIN: 328.52 / MAX: 353.661. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa816243240SE +/- 0.02, N = 3SE +/- 0.02, N = 326.2835.191. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Machine Translation EN To DE FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa306090120150SE +/- 0.58, N = 3SE +/- 0.09, N = 3155.1464.70MIN: 114.75 / MAX: 224.64MIN: 50.24 / MAX: 93.151. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Machine Translation EN To DE FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa4080120160200SE +/- 0.37, N = 3SE +/- 0.25, N = 396.58185.291. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Vehicle Bike Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa612182430SE +/- 0.14, N = 3SE +/- 0.02, N = 323.646.79MIN: 9.57 / MAX: 42.22MIN: 4.45 / MAX: 24.381. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Vehicle Bike Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa400800120016002000SE +/- 3.66, N = 3SE +/- 4.38, N = 3633.761764.211. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 326.5018.56MIN: 19.6 / MAX: 56.62MIN: 15.5 / MAX: 36.361. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa140280420560700SE +/- 0.37, N = 3SE +/- 0.79, N = 3565.34645.741. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1530456075SE +/- 0.27, N = 3SE +/- 0.04, N = 366.4620.77MIN: 25.85 / MAX: 122.2MIN: 11.01 / MAX: 36.691. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa120240360480600SE +/- 0.93, N = 3SE +/- 1.04, N = 3225.48576.941. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.27, N = 3SE +/- 0.09, N = 376.7239.38MIN: 58.8 / MAX: 121.69MIN: 34.32 / MAX: 53.981. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa160320480640800SE +/- 1.37, N = 3SE +/- 1.75, N = 3390.72761.141. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.21, N = 3SE +/- 0.02, N = 381.0031.08MIN: 64.65 / MAX: 134.64MIN: 19.98 / MAX: 50.811. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa2004006008001000SE +/- 0.96, N = 3SE +/- 0.60, N = 3370.03964.461. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1.01932.03863.05794.07725.0965SE +/- 0.00, N = 3SE +/- 0.01, N = 33.524.53MIN: 2.77 / MAX: 18.76MIN: 2.72 / MAX: 12.381. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa14002800420056007000SE +/- 2.08, N = 3SE +/- 6.52, N = 34239.526605.121. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 39.905.86MIN: 8.13 / MAX: 26.87MIN: 3.22 / MAX: 20.191. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa400800120016002000SE +/- 1.12, N = 3SE +/- 3.16, N = 31512.572043.051. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa0.13730.27460.41190.54920.6865SE +/- 0.00, N = 3SE +/- 0.00, N = 30.610.40MIN: 0.41 / MAX: 264.31MIN: 0.23 / MAX: 7.491. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa12K24K36K48K60KSE +/- 332.93, N = 3SE +/- 45.46, N = 344049.1554971.261. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa918273645SE +/- 0.17, N = 3SE +/- 0.04, N = 340.678.62MIN: 12.07 / MAX: 59.44MIN: 5.74 / MAX: 36.031. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa30060090012001500SE +/- 1.55, N = 3SE +/- 6.33, N = 3368.391389.691. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa0.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 30.990.52MIN: 0.8 / MAX: 13.76MIN: 0.31 / MAX: 8.861. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa9K18K27K36K45KSE +/- 14.46, N = 3SE +/- 18.42, N = 329668.1243607.041. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa3691215SE +/- 0.14, N = 3SE +/- 0.01, N = 311.652.87MIN: 3.76 / MAX: 29.1MIN: 1.77 / MAX: 11.421. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa9001800270036004500SE +/- 15.96, N = 3SE +/- 8.74, N = 31285.474166.391. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 311.328.20MIN: 10.13 / MAX: 30.81MIN: 4.92 / MAX: 14.51. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa8001600240032004000SE +/- 3.26, N = 3SE +/- 3.28, N = 32646.583650.571. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa48121620SE +/- 0.01, N = 3SE +/- 0.00, N = 314.7615.98MIN: 12.71 / MAX: 35.86MIN: 9.37 / MAX: 24.291. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa400800120016002000SE +/- 0.56, N = 3SE +/- 0.47, N = 31014.701875.281. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa200K400K600K800K1000KSE +/- 644.45, N = 3SE +/- 50.91, N = 3860844.6493077.61. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa4K8K12K16K20KSE +/- 11.58, N = 3SE +/- 14.62, N = 312973.020079.51. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1428425670SE +/- 0.13, N = 3SE +/- 0.18, N = 350.9764.861. (CXX) g++ options: -O2 -lOpenCL

Laghos

Test: Triple Point Problem

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point Problemt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa50100150200250SE +/- 1.77, N = 3SE +/- 0.22, N = 3222.30209.001. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Onlyt2d-standard-60 AMD Milan1020304050SE +/- 0.13, N = 345.22

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_baret2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1.192.383.574.765.95SE +/- 0.005, N = 3SE +/- 0.011, N = 35.2894.3911. (CXX) g++ options: -O3

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Leukocytet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1020304050SE +/- 0.02, N = 3SE +/- 0.17, N = 342.0145.501. (CXX) g++ options: -O2 -lOpenCL

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa11002200330044005500SE +/- 51.21, N = 5SE +/- 30.67, N = 34935.683783.601. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 2t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1020304050SE +/- 0.03, N = 3SE +/- 0.11, N = 341.9941.541. (CXX) g++ options: -O3 -fPIC -lm

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa9K18K27K36K45KSE +/- 555.92, N = 3SE +/- 45.01, N = 343228.1139919.711. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlyt2d-standard-60 AMD Milan816243240SE +/- 0.06, N = 334.27

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa12K24K36K48K60KSE +/- 137.77, N = 3SE +/- 600.19, N = 1554846.1839647.471. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa4K8K12K16K20KSE +/- 1215.82, N = 15SE +/- 77.92, N = 316649.3719597.861. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigt2d-standard-60 AMD Milan816243240SE +/- 0.37, N = 533.40

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa200M400M600M800M1000MSE +/- 1088162.98, N = 3SE +/- 2060519.90, N = 39204277679628898331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa30K60K90K120K150KSE +/- 42.77, N = 3SE +/- 122.23, N = 3122720.6196257.481. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Ratingt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa50K100K150K200K250KSE +/- 347.74, N = 3SE +/- 519.21, N = 32472552262111. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Ratingt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa60K120K180K240K300KSE +/- 388.74, N = 3SE +/- 346.51, N = 32789732717951. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa60120180240300SE +/- 3.60, N = 4SE +/- 0.19, N = 3289.2255.41. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Remhos

Test: Sample Remap Example

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Examplet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa816243240SE +/- 0.05, N = 3SE +/- 0.17, N = 316.3333.361. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa120240360480600SE +/- 0.25, N = 3SE +/- 0.12, N = 3554.2489.71. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamclustert2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa246810SE +/- 0.009, N = 3SE +/- 0.104, N = 156.4236.4481. (CXX) g++ options: -O2 -lOpenCL

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa400K800K1200K1600K2000KSE +/- 9191.61, N = 3SE +/- 1295.68, N = 31730658.451445843.521. (CC) gcc options: -O2 -lrt" -lrt

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solvert2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa3691215SE +/- 0.034, N = 3SE +/- 0.013, N = 37.36810.0251. (CXX) g++ options: -O2 -lOpenCL

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6, Losslesst2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa246810SE +/- 0.031, N = 3SE +/- 0.099, N = 37.6396.8891. (CXX) g++ options: -O3 -fPIC -lm

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directiont2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1.32112.64223.96335.28446.6055SE +/- 0.02210564, N = 3SE +/- 0.04970425, N = 35.630573275.871578851. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa10K20K30K40K50KSE +/- 145.06, N = 3SE +/- 29.69, N = 347291.9642701.831. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin Proteint2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa714212835SE +/- 0.17, N = 3SE +/- 0.54, N = 1227.8317.421. (CXX) g++ options: -O3 -lm -ldl

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa0.73131.46262.19392.92523.6565SE +/- 0.013, N = 3SE +/- 0.007, N = 33.2053.2501. (CXX) g++ options: -O3 -fPIC -lm

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1326395265SE +/- 0.66, N = 3SE +/- 1.29, N = 1560.0357.311. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.91, N = 3SE +/- 1.78, N = 12106.0393.601. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa4080120160200SE +/- 0.82, N = 3SE +/- 2.19, N = 12196.95148.581. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.78, N = 3SE +/- 0.52, N = 3109.6888.631. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5