GCE c3d-standard-60

KVM testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310148-NE-2310093NE34&grs&sor.

GCE c3d-standard-60ProcessorMotherboardChipsetMemoryDiskNetworkOSKernelVulkanCompilerFile-SystemSystem Layerc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlargem7a.16xlargec7g.16xlargec2d-standard-56AMD EPYC 9B14 (30 Cores / 60 Threads)Google Compute Engine c3d-standard-60Intel 440FX 82441FX PMC240GB215GB nvme_card-pdGoogle Compute Engine VirtualUbuntu 22.046.2.0-1014-gcp (x86_64)1.3.238GCC 11.4.0ext4KVMAMD EPYC 7B13 (60 Cores)Google Compute Engine t2d-standard-60215GB PersistentDiskRed Hat Virtio deviceARMv8 Neoverse-N1 (64 Cores)Amazon EC2 c6g.16xlarge (1.0 BIOS)Amazon Device 0200128GB215GB Amazon Elastic Block StoreAmazon Elastic5.19.0-1025-aws (aarch64)amazonAMD EPYC 9R14 (64 Cores)Amazon EC2 m7a.16xlarge (1.0 BIOS)Intel 440FX 82441FX PMC256GB5.19.0-1025-aws (x86_64)ARMv8 Neoverse-V1 (64 Cores)Amazon EC2 c7g.16xlarge (1.0 BIOS)Amazon Device 0200128GB5.19.0-1025-aws (aarch64)AMD EPYC 7B13 (28 Cores / 56 Threads)Google Compute Engine c2d-standard-56Intel 440FX 82441FX PMC224GB215GB PersistentDiskRed Hat Virtio device6.2.0-1014-gcp (x86_64)KVMOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- c3d-standard-60 AMD Genoa: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - t2d-standard-60 AMD Milan: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - c6g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - m7a.16xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - c7g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - c2d-standard-56: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- c3d-standard-60 AMD Genoa: CPU Microcode: 0xffffffff- t2d-standard-60 AMD Milan: CPU Microcode: 0xffffffff- m7a.16xlarge: CPU Microcode: 0xa10113e- c2d-standard-56: CPU Microcode: 0xffffffffJava Details- OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Details- Python 3.10.12Security Details- c3d-standard-60 AMD Genoa: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - t2d-standard-60 AMD Milan: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - c6g.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - m7a.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - c7g.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - c2d-standard-56: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

GCE c3d-standard-60openvino: Face Detection FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUnpb: BT.Copenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUnpb: FT.Ctensorflow: CPU - 64 - ResNet-50npb: MG.Cavifenc: 2nekrs: Kershawopenssl: ChaCha20-Poly1305openssl: RSA4096openssl: ChaCha20openssl: AES-128-GCMnpb: IS.Dopenssl: AES-256-GCMtensorflow: CPU - 32 - ResNet-50avifenc: 0openvino: Weld Porosity Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Face Detection FP16 - CPUtensorflow: CPU - 16 - ResNet-50build-linux-kernel: defconfignpb: EP.Dnpb: CG.Copenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16 - CPUpgbench: 100 - 1000 - Read Only - Average Latencypgbench: 100 - 1000 - Read Onlypgbench: 100 - 800 - Read Onlypgbench: 100 - 800 - Read Only - Average Latencygromacs: MPI CPU - water_GMX50_barelibxsmm: 32libxsmm: 64openssl: SHA512nekrs: TurboPipe Periodicincompact3d: input.i3d 193 Cells Per Directionremhos: Sample Remap Examplerodinia: OpenMP Streamclusterrodinia: OpenMP CFD Solverheffte: c2c - FFTW - double - 128lammps: Rhodopsin Proteinincompact3d: input.i3d 129 Cells Per Directionamg: openvino: Face Detection FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenssl: RSA4096openvino: Weld Porosity Detection FP16 - CPUnpb: LU.Cnpb: SP.Clammps: 20k Atomsheffte: c2c - FFTW - float - 128heffte: r2c - FFTW - float - 128brl-cad: VGR Performance Metricopenradioss: Chrysler Neon 1Mbuild-nodejs: Time To Compilecoremark: CoreMark Size 666 - Iterations Per Secondcassandra: Writesblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlylaghos: Sedov Blast Wave, ube_922_hex.meshblender: Barbershop - CPU-Onlybuild-linux-kernel: allmodconfigavifenc: 6stockfish: Total Timenginx: 1000blender: Pabellon Barcelona - CPU-Onlyrodinia: OpenMP LavaMDheffte: r2c - FFTW - double - 128blender: Fishy Cat - CPU-Onlynginx: 500avifenc: 6, Losslessopenssl: SHA256build-gem5: Time To Compilecompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingrodinia: OpenMP Leukocytelaghos: Triple Point Problemapache-iotdb: 500 - 100 - 800 - 400apache-iotdb: 800 - 100 - 800 - 400apache-iotdb: 800 - 100 - 500 - 400apache-iotdb: 800 - 100 - 500 - 400apache-iotdb: 500 - 100 - 500 - 400apache-iotdb: 500 - 100 - 500 - 400pgbench: 100 - 1000 - Read Writeapache-iotdb: 500 - 100 - 800 - 400rodinia: OpenMP HotSpot3Dpgbench: 100 - 800 - Read Writeapache-iotdb: 800 - 100 - 800 - 400pgbench: 100 - 1000 - Read Write - Average Latencypgbench: 100 - 800 - Read Write - Average Latencyopenradioss: Rubber O-Ring Seal Installationopenradioss: Bird Strike on Windshieldopenradioss: Cell Phone Drop Testopenradioss: Bumper Beamc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlargem7a.16xlargec7g.16xlargec2d-standard-56340.29645.744.5396257.4854971.26964.4618.56761.1443607.041764.21576.9439647.4769.6842701.8341.5384289858333123909304773493077.61739809498933430952844402422.4029332804849762.7478.0681875.281389.698.204166.3918.3950.993783.6019597.860.4142.75142.902043.056.7920.77185.298.626605.122.874.391255.4489.714702270573472394000028.019687733.3626.44810.02557.311617.4235.87157885962889833648.8131.085.8683.9983.9135.1939.380.5264.703650.5720079.515.9873563.1339919.7119.77688.6301148.575510819337.70198.3941445843.521552228640259.553.250105894457180537.8464.86293.6005187350.446.88946211821313176.76727179522621145.498209.003476256535359884447.6834332237418.5933268158623.8984.166682.1289.65147.3138.8292.87568.77565.343.52122720.6144049.15370.0326.50390.7229668.12633.76225.4854846.1820.9047291.9641.9893681935833119647720337860844.61802491457702346040826101752.6221602596764020.3678.3501014.70368.3911.321285.4710.7318.2933.3994935.6816649.370.6173.7478.961512.5723.6466.4696.5840.674239.5211.650.498200818620037840.3995.289289.2554.222244804183273062000024.572118116.3266.4237.36860.034327.8285.630573279204277671393.5681.009.90208.47193.4526.2876.720.99155.142646.5812973.014.7694247.7743228.1126.734109.676196.948629363327.88191.7061730658.44944018716934.2789.35364.64351.58333.3513.205112958788155609.04112.6450.974106.02945.22162957.757.63950884997103170.93027897324725542.010222.303492589935068557433.9634123810415.08334668045793633.5888.5355682709.46172.717140.91672.06123.6130.1275.6822391.860.152186.8124229.14136.162.536773.312.36178.827.362.6121386.3725661.04167.946175886000046715126487215683.267324778360158788510970915.80129198197600270.0688.396.53181.9420.790.1102.2162213.7613343.357.331.061.060.14135.87382.471.36153.120.4648.081.02697503110432670.7672.766312.7589.514384917863222171000025.874832820.81614.2125.98332.357526.0415.6181168610328936679996.56394.946990.10947.59947.860.04423.955.58735.585.502640.0119.1718807.759716.9925.059129.172202.445286.2011259870.716902217355321.29409.0974.46781807706158700.3662.30179.0156162553.858.87942288513973224.414239735234046179.5247764784210.608168.191261.111222.993.07193219.1292100.801419.1113.071158.4381996.813146.041049.66103413.02100.15121293.8035.8057667846667216773475533996017.53083080450835925453627404085.2052211308052787.1965.4473132.482417.345.167222.1031.7269.5527.7097501.7642007.570.27284.42283.403666.895.0715.22315.146.6010382.222.210.347288094029230090.2747.655643.41201.826481506820477479666711.591308613.8675.9306.48071.109532.7852.896026611843444333503.3822.534.3556.2156.4161.1927.600.3850.726177.9431583.810.19210544.87102392.4031.471121.505190.602788704190.79154.4412158639.27488327858527.7471.51409.73276.23267.9652.649135419169224859.0991.8843.286124.36337.12233014.725.67862253861197153.80033063328259334.661218.864450233344699315355.1342643903340.11408992105300521.9874.7785312598.49188.676150.60159.18115.9626.1466.2712852.080.261255.0238934.15180.792.533871.332.44258.5910.154.5139830.1849799.57102.081325216666774306515763713624.41032346086833323451119831797.76283187826230170.58714.3111.19109.9933.230.1880.3473648.4322031.275.521.811.810.2598.57221.502.3289.360.830.080.770129852713143170.6094.194494.2785.032045616820399173666713.791938814.25811.4574.35352.477937.4623.1587883717673490005639.78395.913957.74554.28553.520.08410.003.86431.909.0910181.169.8928356.7017223.9536.725175.729291.966236.4821608877.078630316573409.66316.0053.178119502473255540.0343.905127.985254864.935.91454438555170179.999310517285523231.8553425348187.188149.597868.35466.414.3694143.8535877.14280.2629.99301.6021951.031022.81322.2747216.0521.8254301.2546.541819529333382221279643468650.61208253059531295487666872190.3911840259574021.4888.810689.56701.0017.692215.747.2520.0041.1682579.9223089.420.682.4782.471089.8313.6643.3993.2319.943195.586.300.669149444515099870.5303.951306.3581.113104599007543108333325.415339124.6436.6589.34361.184618.0286.144091929818772671914.2799.8112.83169.57169.6216.0392.771.25149.971581.237181.420.2878848.0848423.5218.369104.534164.720410738363.60231.3841212209.81928017822749.01125.73239.48468.20452.7483.80284017019161127.09150.0670.15799.648059.22160196.248.08039825794840192.60123043020819246.741199.803546071335377376414.4034625899408.13333959685820629.0090.7365702679.86171.947140.41069.43146.5636.8498.45OpenBenchmarking.org

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge5K10K15K20K25KSE +/- 0.03, N = 3SE +/- 0.21, N = 3SE +/- 0.35, N = 3SE +/- 0.38, N = 3SE +/- 5.72, N = 3SE +/- 15.60, N = 3261.11340.29568.77868.3512852.0822391.86-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 12821.62 / MAX: 12882.02-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 22364.17 / MAX: 22423.421. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge30060090012001500SE +/- 0.51, N = 3SE +/- 0.79, N = 3SE +/- 0.37, N = 3SE +/- 1.07, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31222.99645.74565.34466.410.260.15-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16-INT8 - Device: CPUm7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoac7g.16xlargec6g.16xlarge5001000150020002500SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.23, N = 3SE +/- 27.29, N = 33.073.524.364.531255.022186.81-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 1251.83 / MAX: 1261.9-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 2135.06 / MAX: 2233.341. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cm7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56c7g.16xlargec6g.16xlarge40K80K120K160K200KSE +/- 560.75, N = 3SE +/- 42.77, N = 3SE +/- 122.23, N = 3SE +/- 80.45, N = 3SE +/- 19.48, N = 3SE +/- 7.69, N = 3193219.12122720.6196257.4894143.8538934.1524229.141. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge20K40K60K80K100KSE +/- 453.62, N = 3SE +/- 45.46, N = 3SE +/- 332.93, N = 3SE +/- 26.90, N = 3SE +/- 0.71, N = 3SE +/- 0.48, N = 392100.8054971.2644049.1535877.14180.79136.16-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge30060090012001500SE +/- 0.70, N = 3SE +/- 0.60, N = 3SE +/- 0.96, N = 3SE +/- 0.37, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 31419.11964.46370.03280.262.532.53-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge15003000450060007500SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.51, N = 3SE +/- 76.95, N = 313.0718.5626.5029.993871.336773.31-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 3868.49 / MAX: 3876.89-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6616.83 / MAX: 6859.991. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge2004006008001000SE +/- 0.18, N = 3SE +/- 1.75, N = 3SE +/- 1.37, N = 3SE +/- 2.14, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 31158.43761.14390.72301.602.442.36-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge20K40K60K80K100KSE +/- 31.79, N = 3SE +/- 18.42, N = 3SE +/- 14.46, N = 3SE +/- 3.74, N = 3SE +/- 0.88, N = 3SE +/- 0.39, N = 381996.8143607.0429668.1221951.03258.59178.82-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Vehicle Bike Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge7001400210028003500SE +/- 1.10, N = 3SE +/- 4.38, N = 3SE +/- 1.21, N = 3SE +/- 3.66, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 33146.041764.211022.81633.7610.157.36-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge2004006008001000SE +/- 0.11, N = 3SE +/- 1.04, N = 3SE +/- 2.90, N = 3SE +/- 0.93, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31049.66576.94322.27225.484.512.61-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cm7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec3d-standard-60 AMD Genoac6g.16xlarge20K40K60K80K100KSE +/- 446.85, N = 3SE +/- 137.77, N = 3SE +/- 57.93, N = 3SE +/- 13.54, N = 3SE +/- 600.19, N = 15SE +/- 2.85, N = 3103413.0254846.1847216.0539830.1839647.4721386.371. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50m7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milan20406080100SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3100.1569.6821.8220.90

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cm7a.16xlargec2d-standard-56c7g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge30K60K90K120K150KSE +/- 526.14, N = 3SE +/- 99.78, N = 3SE +/- 10.47, N = 3SE +/- 145.06, N = 3SE +/- 29.69, N = 3SE +/- 10.99, N = 3121293.8054301.2549799.5747291.9642701.8325661.041. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 2m7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge4080120160200SE +/- 0.22, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.13, N = 3SE +/- 0.22, N = 335.8141.5441.9946.54102.08167.951. (CXX) g++ options: -O3 -fPIC -lm

nekRS

Input: Kershaw

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: Kershawc2d-standard-56m7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge2000M4000M6000M8000M10000MSE +/- 88231529.95, N = 3SE +/- 49077561.86, N = 3SE +/- 57202190.49, N = 12SE +/- 84802173.02, N = 12SE +/- 2525353.66, N = 3SE +/- 2970005.61, N = 38195293333766784666742898583333681935833325216666717588600001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305m7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge50000M100000M150000M200000M250000MSE +/- 85099636.22, N = 3SE +/- 3664727.47, N = 3SE +/- 198663058.81, N = 3SE +/- 25534569.58, N = 3SE +/- 814473.51, N = 3SE +/- 2259404.37, N = 3216773475533123909304773119647720337822212796437430651576346715126487-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64-lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096m7a.16xlarget2d-standard-60 AMD Milanc7g.16xlargec3d-standard-60 AMD Genoac2d-standard-56c6g.16xlarge200K400K600K800K1000KSE +/- 367.82, N = 3SE +/- 644.45, N = 3SE +/- 189.99, N = 3SE +/- 50.91, N = 3SE +/- 418.00, N = 3SE +/- 6.55, N = 3996017.5860844.6713624.4493077.6468650.6215683.2-m64 -lssl -lcrypto-m64 -lssl -lcrypto-lssl -lcrypto-m64 -lssl -lcrypto-m64-lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20m7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56c7g.16xlargec6g.16xlarge70000M140000M210000M280000M350000MSE +/- 280681661.27, N = 3SE +/- 47698640.87, N = 3SE +/- 12326205.97, N = 3SE +/- 13175767.44, N = 3SE +/- 926708.87, N = 3SE +/- 372419.81, N = 330830804508318024914577017398094989312082530595310323460868367324778360-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64-lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMm7a.16xlargec3d-standard-60 AMD Genoac7g.16xlarget2d-standard-60 AMD Milanc6g.16xlargec2d-standard-56130000M260000M390000M520000M650000MSE +/- 1818538727.73, N = 3SE +/- 342949201.09, N = 3SE +/- 38386470.30, N = 3SE +/- 376720190.05, N = 3SE +/- 5537993.15, N = 3SE +/- 2205754.73, N = 3592545362740343095284440332345111983234604082610158788510970129548766687-m64 -lssl -lcrypto-m64 -lssl -lcrypto-lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.Dm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56c7g.16xlarget2d-standard-60 AMD Milanc6g.16xlarge9001800270036004500SE +/- 3.14, N = 3SE +/- 36.45, N = 15SE +/- 3.14, N = 3SE +/- 1.25, N = 3SE +/- 142.62, N = 12SE +/- 0.58, N = 34085.202422.402190.391797.761752.62915.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMm7a.16xlargec3d-standard-60 AMD Genoac7g.16xlarget2d-standard-60 AMD Milanc6g.16xlargec2d-standard-56110000M220000M330000M440000M550000MSE +/- 1691817104.57, N = 3SE +/- 71241287.97, N = 3SE +/- 10793448.32, N = 3SE +/- 178290221.71, N = 3SE +/- 2100313.05, N = 3SE +/- 18032723.86, N = 3522113080527293328048497283187826230216025967640129198197600118402595740-m64 -lssl -lcrypto-m64 -lssl -lcrypto-lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

TensorFlow

Device: CPU - Batch Size: 32 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50m7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milan20406080100SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 387.1962.7421.4820.36

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 0m7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge60120180240300SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.31, N = 3SE +/- 0.42, N = 3SE +/- 0.34, N = 365.4578.0778.3588.81170.59270.071. (CXX) g++ options: -O3 -fPIC -lm

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge7001400210028003500SE +/- 0.31, N = 3SE +/- 0.47, N = 3SE +/- 0.56, N = 3SE +/- 0.77, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33132.481875.281014.70689.5614.318.39-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge5001000150020002500SE +/- 1.65, N = 3SE +/- 6.33, N = 3SE +/- 0.69, N = 3SE +/- 1.55, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 32417.341389.69701.00368.3911.196.53-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge4080120160200SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.24, N = 35.168.2011.3217.69109.99181.94-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 108.37 / MAX: 112.83-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 180.71 / MAX: 184.111. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge15003000450060007500SE +/- 4.07, N = 3SE +/- 8.74, N = 3SE +/- 3.06, N = 3SE +/- 15.96, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 37222.104166.392215.741285.4733.2320.79-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge714212835SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 331.7218.3910.737.250.180.10-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

TensorFlow

Device: CPU - Batch Size: 16 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50m7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milan1530456075SE +/- 0.20, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 369.5550.9920.0018.29

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigm7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge20406080100SE +/- 0.29, N = 5SE +/- 0.37, N = 5SE +/- 0.54, N = 3SE +/- 0.65, N = 3SE +/- 0.82, N = 327.7133.4041.1780.35102.22

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dm7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac7g.16xlargec2d-standard-56c6g.16xlarge16003200480064008000SE +/- 8.54, N = 3SE +/- 51.21, N = 5SE +/- 30.67, N = 3SE +/- 25.41, N = 13SE +/- 36.32, N = 3SE +/- 7.28, N = 37501.764935.683783.603648.432579.922213.761. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cm7a.16xlargec2d-standard-56c7g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge9K18K27K36K45KSE +/- 178.46, N = 3SE +/- 206.16, N = 7SE +/- 25.33, N = 3SE +/- 77.92, N = 3SE +/- 1215.82, N = 15SE +/- 23.52, N = 342007.5723089.4222031.2719597.8616649.3713343.351. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 30.270.400.600.615.527.33-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 4.5 / MAX: 9.53-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6.54 / MAX: 9.951. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge60120180240300SE +/- 0.53, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 3SE +/- 3.12, N = 15SE +/- 0.00, N = 3SE +/- 0.00, N = 3284.42142.7582.4773.741.811.06-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Detection FP32 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge60120180240300SE +/- 0.29, N = 3SE +/- 0.48, N = 3SE +/- 0.14, N = 3SE +/- 2.74, N = 15SE +/- 0.00, N = 3SE +/- 0.00, N = 3283.40142.9082.4778.961.811.06-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge8001600240032004000SE +/- 1.69, N = 3SE +/- 3.16, N = 3SE +/- 1.12, N = 3SE +/- 1.05, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 153666.892043.051512.571089.830.250.14-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Vehicle Bike Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge306090120150SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 3SE +/- 0.21, N = 3SE +/- 0.21, N = 35.076.7913.6623.6498.57135.87-pie - MIN: 9.57 / MAX: 42.22-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 94.4 / MAX: 118.88-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 132.35 / MAX: 148.641. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge80160240320400SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.39, N = 3SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.20, N = 315.2220.7743.3966.46221.50382.47-pie - MIN: 25.85 / MAX: 122.2-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 220.78 / MAX: 223.35-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 381.67 / MAX: 383.431. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Machine Translation EN To DE FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge70140210280350SE +/- 0.11, N = 3SE +/- 0.25, N = 3SE +/- 0.37, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3315.14185.2996.5893.232.321.36-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge306090120150SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.17, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 36.608.6219.9440.6789.36153.12-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 10.34 / MAX: 37.94-pie - MIN: 12.07 / MAX: 59.44-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 88.98 / MAX: 90.65-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 152.69 / MAX: 153.751. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge2K4K6K8K10KSE +/- 3.60, N = 3SE +/- 6.52, N = 3SE +/- 2.08, N = 3SE +/- 2.82, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310382.226605.124239.523195.580.800.46-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge1122334455SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 32.212.876.3011.6530.0848.08-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 3.57 / MAX: 21.5-pie - MIN: 3.76 / MAX: 29.1-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 29.37 / MAX: 32.17-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 47.55 / MAX: 50.491. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latencym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge0.23090.46180.69270.92361.1545SE +/- 0.000, N = 3SE +/- 0.006, N = 3SE +/- 0.004, N = 3SE +/- 0.007, N = 3SE +/- 0.013, N = 30.3470.4980.6690.7701.0261. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Onlym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge600K1200K1800K2400K3000KSE +/- 2874.04, N = 3SE +/- 22497.42, N = 3SE +/- 9220.12, N = 3SE +/- 11830.85, N = 3SE +/- 12606.16, N = 328809402008186149444512985279750311. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Onlym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge600K1200K1800K2400K3000KSE +/- 13309.30, N = 3SE +/- 20558.90, N = 3SE +/- 20718.28, N = 3SE +/- 14241.12, N = 3SE +/- 12058.27, N = 3292300920037841509987131431710432671. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latencym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge0.17260.34520.51780.69040.863SE +/- 0.001, N = 3SE +/- 0.004, N = 3SE +/- 0.007, N = 3SE +/- 0.007, N = 3SE +/- 0.009, N = 30.2740.3990.5300.6090.7671. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_barem7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac7g.16xlargec2d-standard-56c6g.16xlarge246810SE +/- 0.035, N = 3SE +/- 0.005, N = 3SE +/- 0.011, N = 3SE +/- 0.003, N = 3SE +/- 0.005, N = 3SE +/- 0.001, N = 37.6555.2894.3914.1943.9512.7661. (CXX) g++ options: -O3

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32m7a.16xlargec7g.16xlargec6g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa140280420560700SE +/- 0.40, N = 3SE +/- 0.07, N = 3SE +/- 0.47, N = 3SE +/- 0.38, N = 3SE +/- 3.60, N = 4SE +/- 0.19, N = 3643.4494.2312.7306.3289.2255.4-lquadmath -msse4.2-march=armv8.1-a-march=armv8.1-a-lquadmath -msse4.2-lquadmath -msse4.2-lquadmath -msse4.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64m7a.16xlargec7g.16xlargec6g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa30060090012001500SE +/- 0.52, N = 3SE +/- 0.03, N = 3SE +/- 0.96, N = 3SE +/- 1.99, N = 3SE +/- 0.25, N = 3SE +/- 0.12, N = 31201.8785.0589.5581.1554.2489.7-lquadmath -msse4.2-march=armv8.1-a-march=armv8.1-a-lquadmath -msse4.2-lquadmath -msse4.2-lquadmath -msse4.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512c7g.16xlargem7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlargec2d-standard-567000M14000M21000M28000M35000MSE +/- 9797429.91, N = 3SE +/- 29763937.73, N = 3SE +/- 108274834.92, N = 3SE +/- 4399663.13, N = 3SE +/- 6214593.12, N = 3SE +/- 631939.37, N = 3320456168202648150682022244804183147022705731438491786313104599007-lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-lssl -lcrypto-m641. (CC) gcc options: -pthread -O3 -ldl

nekRS

Input: TurboPipe Periodic

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe Periodicc2d-standard-56m7a.16xlargec3d-standard-60 AMD Genoac7g.16xlarget2d-standard-60 AMD Milanc6g.16xlarge1200M2400M3600M4800M6000MSE +/- 4494968.79, N = 3SE +/- 6657808.28, N = 3SE +/- 201939132.13, N = 12SE +/- 1792766.33, N = 3SE +/- 481352.26, N = 3SE +/- 1790009.31, N = 35431083333477479666747239400003991736667273062000022217100001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionm7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c6g.16xlargec3d-standard-60 AMD Genoa714212835SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.31, N = 12SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.25, N = 311.5913.7924.5725.4225.8728.021. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Remhos

Test: Sample Remap Example

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Examplem7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc6g.16xlargec2d-standard-56c3d-standard-60 AMD Genoa816243240SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.17, N = 313.8714.2616.3320.8224.6433.361. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamclusterm7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56c7g.16xlargec6g.16xlarge48121620SE +/- 0.049, N = 3SE +/- 0.009, N = 3SE +/- 0.104, N = 15SE +/- 0.009, N = 3SE +/- 0.124, N = 3SE +/- 0.017, N = 35.9306.4236.4486.65811.45714.2121. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solverc7g.16xlargec6g.16xlargem7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoa3691215SE +/- 0.011, N = 3SE +/- 0.001, N = 3SE +/- 0.007, N = 3SE +/- 0.034, N = 3SE +/- 0.013, N = 3SE +/- 0.013, N = 34.3535.9836.4807.3689.34310.0251. (CXX) g++ options: -O2 -lOpenCL

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128m7a.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac7g.16xlargec6g.16xlarge1632486480SE +/- 0.63, N = 15SE +/- 0.78, N = 15SE +/- 0.66, N = 3SE +/- 1.29, N = 15SE +/- 0.00, N = 3SE +/- 0.10, N = 371.1161.1860.0357.3152.4832.361. (CXX) g++ options: -O3

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin Proteinc7g.16xlargem7a.16xlarget2d-standard-60 AMD Milanc6g.16xlargec2d-standard-56c3d-standard-60 AMD Genoa918273645SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.17, N = 3SE +/- 0.04, N = 3SE +/- 0.13, N = 3SE +/- 0.54, N = 1237.4632.7927.8326.0418.0317.42-lm-lm-lm-lm1. (CXX) g++ options: -O3 -ldl

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionm7a.16xlargec7g.16xlargec6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56246810SE +/- 0.03993251, N = 3SE +/- 0.01236463, N = 3SE +/- 0.01888616, N = 3SE +/- 0.02210564, N = 3SE +/- 0.04970425, N = 3SE +/- 0.02575719, N = 32.896026613.158788375.618116865.630573275.871578856.144091921. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2m7a.16xlargec7g.16xlargec6g.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan400M800M1200M1600M2000MSE +/- 1428129.35, N = 3SE +/- 536620.29, N = 3SE +/- 176147.98, N = 3SE +/- 999273.29, N = 3SE +/- 2060519.90, N = 3SE +/- 1088162.98, N = 31843444333176734900010328936679818772679628898339204277671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge2K4K6K8K10KSE +/- 0.12, N = 3SE +/- 0.17, N = 3SE +/- 1.32, N = 3SE +/- 3.85, N = 3SE +/- 1.45, N = 3SE +/- 1.02, N = 3503.38648.811393.561914.275639.789996.56-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 1788.62 / MAX: 2033.22-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 5635.22 / MAX: 5646.39-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 9993.58 / MAX: 10001.761. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c6g.16xlargec7g.16xlarge90180270360450SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.21, N = 3SE +/- 0.13, N = 3SE +/- 2.18, N = 3SE +/- 1.65, N = 322.5331.0881.0099.81394.94395.91-pie - MIN: 64.65 / MAX: 134.64-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 55.12 / MAX: 150.04-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 380.82 / MAX: 408.15-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 380.35 / MAX: 415.381. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge15003000450060007500SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 1.04, N = 3SE +/- 27.09, N = 154.355.869.9012.833957.746990.10-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 3952.99 / MAX: 3967.18-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6842.82 / MAX: 7088.531. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Detection FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge2004006008001000SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.29, N = 3SE +/- 9.17, N = 15SE +/- 0.71, N = 3SE +/- 0.37, N = 356.2183.99169.57208.47554.28947.59-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 94.4 / MAX: 218.7-pie - MIN: 119.65 / MAX: 316.01-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 550.22 / MAX: 563.55-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 943.57 / MAX: 951.681. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Detection FP32 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge2004006008001000SE +/- 0.06, N = 3SE +/- 0.29, N = 3SE +/- 0.29, N = 3SE +/- 7.79, N = 15SE +/- 0.11, N = 3SE +/- 0.55, N = 356.4183.91169.62193.45553.52947.86-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 104.86 / MAX: 209.35-pie - MIN: 113.44 / MAX: 315.54-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 549.89 / MAX: 562.06-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 944.57 / MAX: 958.261. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge1428425670SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 361.1935.1926.2816.030.080.04-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge90180270360450SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.27, N = 3SE +/- 0.65, N = 3SE +/- 3.34, N = 3SE +/- 0.26, N = 327.6039.3876.7292.77410.00423.95-pie - MIN: 58.8 / MAX: 121.69-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 51.49 / MAX: 118.72-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 391.1 / MAX: 424.93-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 411.17 / MAX: 440.571. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge1.25552.5113.76655.0226.2775SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 30.380.520.991.253.865.58-pie - MIN: 0.8 / MAX: 13.76-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 0.71 / MAX: 15.89-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 3.63 / MAX: 5.36-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 5.45 / MAX: 6.481. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Machine Translation EN To DE FP16 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargec6g.16xlarge160320480640800SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.58, N = 3SE +/- 0.24, N = 3SE +/- 0.26, N = 350.7264.70149.97155.14431.90735.58-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 81.05 / MAX: 193.4-pie - MIN: 114.75 / MAX: 224.64-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 430.12 / MAX: 435.25-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 734.34 / MAX: 738.231. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16-INT8 - Device: CPUm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c7g.16xlargec6g.16xlarge13002600390052006500SE +/- 4.43, N = 3SE +/- 3.28, N = 3SE +/- 3.26, N = 3SE +/- 0.68, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 36177.943650.572646.581581.239.095.50-pie-pie-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096m7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc7g.16xlargec2d-standard-56c6g.16xlarge7K14K21K28K35KSE +/- 24.57, N = 3SE +/- 14.62, N = 3SE +/- 11.58, N = 3SE +/- 1.56, N = 3SE +/- 0.44, N = 3SE +/- 0.09, N = 331583.820079.512973.010181.17181.42640.0-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-lssl -lcrypto-m641. (CC) gcc options: -pthread -O3 -ldl

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16 - Device: CPUm7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56c7g.16xlargec6g.16xlarge306090120150SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 310.1914.7615.9820.2869.89119.17-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 14.46 / MAX: 30.6-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 69.44 / MAX: 71.41-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 118.73 / MAX: 120.211. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Cm7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoac7g.16xlargec6g.16xlarge50K100K150K200K250KSE +/- 661.61, N = 3SE +/- 1463.52, N = 15SE +/- 22.13, N = 3SE +/- 293.57, N = 3SE +/- 30.15, N = 3SE +/- 7.52, N = 3210544.8794247.7778848.0873563.1328356.7018807.751. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Cm7a.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac7g.16xlargec6g.16xlarge20K40K60K80K100KSE +/- 91.17, N = 3SE +/- 113.84, N = 3SE +/- 555.92, N = 3SE +/- 45.01, N = 3SE +/- 40.05, N = 3SE +/- 0.76, N = 3102392.4048423.5243228.1139919.7117223.959716.991. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k Atomsc7g.16xlargem7a.16xlarget2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoac2d-standard-56816243240SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 336.7331.4726.7325.0619.7818.37-lm-lm-lm-lm1. (CXX) g++ options: -O3 -ldl

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128c7g.16xlargec6g.16xlargem7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoa4080120160200SE +/- 0.44, N = 3SE +/- 0.11, N = 3SE +/- 1.44, N = 15SE +/- 0.78, N = 3SE +/- 1.06, N = 3SE +/- 0.52, N = 3175.73129.17121.51109.68104.5388.631. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128c7g.16xlargec6g.16xlarget2d-standard-60 AMD Milanm7a.16xlargec2d-standard-56c3d-standard-60 AMD Genoa60120180240300SE +/- 0.59, N = 3SE +/- 0.57, N = 3SE +/- 0.82, N = 3SE +/- 2.16, N = 15SE +/- 1.51, N = 6SE +/- 2.19, N = 12291.97202.45196.95190.60164.72148.581. (CXX) g++ options: -O3

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.36VGR Performance Metricm7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56200K400K600K800K1000K7887046293635108194107381. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1Mm7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-5680160240320400SE +/- 0.61, N = 3SE +/- 1.48, N = 3SE +/- 2.03, N = 3SE +/- 0.14, N = 3190.79327.88337.70363.60

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To Compilem7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56c7g.16xlargec6g.16xlarge60120180240300SE +/- 0.11, N = 3SE +/- 0.05, N = 3SE +/- 0.29, N = 3SE +/- 0.26, N = 3SE +/- 0.31, N = 3SE +/- 0.11, N = 3154.44191.71198.39231.38236.48286.20

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondm7a.16xlarget2d-standard-60 AMD Milanc7g.16xlargec3d-standard-60 AMD Genoac6g.16xlargec2d-standard-56500K1000K1500K2000K2500KSE +/- 1437.63, N = 3SE +/- 9191.61, N = 3SE +/- 11587.63, N = 15SE +/- 1295.68, N = 3SE +/- 635.29, N = 3SE +/- 833.48, N = 32158639.271730658.451608877.081445843.521259870.721212209.821. (CC) gcc options: -O2 -lrt" -lrt

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Writesc7g.16xlargem7a.16xlargec3d-standard-60 AMD Genoac6g.16xlarget2d-standard-60 AMD Milanc2d-standard-5670K140K210K280K350KSE +/- 2261.70, N = 3SE +/- 352.65, N = 3SE +/- 1129.40, N = 3SE +/- 3249.94, N = 12SE +/- 681.76, N = 3SE +/- 1718.54, N = 3316573278585228640217355187169178227

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-561122334455SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 327.7434.2749.01

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Onlym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56306090120150SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 371.5189.35125.73

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshm7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoac2d-standard-5690180270360450SE +/- 1.29, N = 3SE +/- 0.57, N = 3SE +/- 0.62, N = 3SE +/- 0.79, N = 3SE +/- 0.31, N = 3SE +/- 0.02, N = 3409.73409.66364.64321.29259.55239.481. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Onlym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56100200300400500SE +/- 0.49, N = 3SE +/- 0.60, N = 3SE +/- 0.71, N = 3276.23351.58468.20

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigm7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc6g.16xlargec2d-standard-56100200300400500SE +/- 0.37, N = 3SE +/- 0.81, N = 3SE +/- 1.20, N = 3SE +/- 2.41, N = 3SE +/- 1.50, N = 3267.97316.01333.35409.10452.75

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6m7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56c6g.16xlarge1.00512.01023.01534.02045.0255SE +/- 0.015, N = 3SE +/- 0.004, N = 3SE +/- 0.013, N = 3SE +/- 0.007, N = 3SE +/- 0.010, N = 3SE +/- 0.014, N = 32.6493.1783.2053.2503.8024.4671. (CXX) g++ options: -O3 -fPIC -lm

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total Timem7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56c6g.16xlarge30M60M90M120M150MSE +/- 1001447.21, N = 3SE +/- 1921303.63, N = 15SE +/- 1618403.29, N = 14SE +/- 1450871.09, N = 3SE +/- 705199.36, N = 15SE +/- 1645401.81, N = 151354191691195024731129587881058944578401701981807706-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

nginx

Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000c7g.16xlargem7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56c6g.16xlarget2d-standard-60 AMD Milan50K100K150K200K250KSE +/- 141.46, N = 3SE +/- 260.28, N = 3SE +/- 688.79, N = 3SE +/- 157.70, N = 3SE +/- 132.24, N = 3SE +/- 156.32, N = 3255540.03224859.09180537.84161127.09158700.36155609.041. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-Onlym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56306090120150SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 391.88112.64150.06

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDm7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoac2d-standard-561632486480SE +/- 0.24, N = 3SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.03, N = 3SE +/- 0.18, N = 3SE +/- 0.13, N = 343.2943.9150.9762.3064.8670.161. (CXX) g++ options: -O2 -lOpenCL

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128c7g.16xlargem7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoac6g.16xlarge306090120150SE +/- 0.10, N = 3SE +/- 1.18, N = 15SE +/- 0.91, N = 3SE +/- 1.39, N = 3SE +/- 1.78, N = 12SE +/- 0.71, N = 3127.99124.36106.0399.6593.6079.021. (CXX) g++ options: -O3

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Onlym7a.16xlarget2d-standard-60 AMD Milanc2d-standard-561326395265SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 337.1245.2259.22

nginx

Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500c7g.16xlargem7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlargec2d-standard-5650K100K150K200K250KSE +/- 757.47, N = 3SE +/- 451.60, N = 3SE +/- 126.15, N = 3SE +/- 394.72, N = 3SE +/- 249.05, N = 3SE +/- 513.44, N = 3254864.93233014.72187350.44162957.75162553.85160196.241. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6, Losslessm7a.16xlargec7g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c6g.16xlarge246810SE +/- 0.008, N = 3SE +/- 0.009, N = 3SE +/- 0.099, N = 3SE +/- 0.031, N = 3SE +/- 0.015, N = 3SE +/- 0.032, N = 35.6785.9146.8897.6398.0808.8791. (CXX) g++ options: -O3 -fPIC -lm

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256m7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlargec2d-standard-5613000M26000M39000M52000M65000MSE +/- 161718701.08, N = 3SE +/- 1709212.75, N = 3SE +/- 20491615.60, N = 3SE +/- 9562123.40, N = 3SE +/- 192235444.29, N = 3SE +/- 7310870.38, N = 3622538611975443855517050884997103462118213134228851397339825794840-m64 -lssl -lcrypto-lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-lssl -lcrypto-m641. (CC) gcc options: -pthread -O3 -ldl

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compilem7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac7g.16xlargec2d-standard-56c6g.16xlarge50100150200250SE +/- 0.39, N = 3SE +/- 0.27, N = 3SE +/- 0.07, N = 3SE +/- 0.25, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3153.80170.93176.77180.00192.60224.41

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Ratingm7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlargec2d-standard-5670K140K210K280K350KSE +/- 400.99, N = 3SE +/- 291.75, N = 3SE +/- 388.74, N = 3SE +/- 346.51, N = 3SE +/- 359.95, N = 3SE +/- 214.91, N = 33306333105172789732717952397352304301. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Ratingc7g.16xlargem7a.16xlarget2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoac2d-standard-5660K120K180K240K300KSE +/- 141.62, N = 3SE +/- 342.37, N = 3SE +/- 347.74, N = 3SE +/- 57.33, N = 3SE +/- 519.21, N = 3SE +/- 140.45, N = 32855232825932472552340462262112081921. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Leukocytem7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-561122334455SE +/- 0.32, N = 3SE +/- 0.02, N = 3SE +/- 0.17, N = 3SE +/- 0.24, N = 334.6642.0145.5046.741. (CXX) g++ options: -O2 -lOpenCL

Laghos

Test: Triple Point Problem

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point Problemc7g.16xlarget2d-standard-60 AMD Milanm7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56c6g.16xlarge50100150200250SE +/- 0.90, N = 3SE +/- 1.77, N = 3SE +/- 1.67, N = 3SE +/- 0.22, N = 3SE +/- 0.61, N = 3SE +/- 0.50, N = 3231.85222.30218.86209.00199.80179.521. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400m7a.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa10M20M30M40M50MSE +/- 224779.59, N = 3SE +/- 80227.43, N = 3SE +/- 221111.93, N = 3SE +/- 188621.29, N = 344502333354607133492589934762565

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400m7a.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan10M20M30M40M50MSE +/- 99649.31, N = 3SE +/- 248722.01, N = 3SE +/- 267152.12, N = 3SE +/- 354923.23, N = 344699315353773763535988435068557

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400m7a.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa100200300400500SE +/- 7.15, N = 3SE +/- 4.37, N = 3SE +/- 35.57, N = 3SE +/- 27.08, N = 3355.13414.40433.96447.68MAX: 67149.77MAX: 91109.59MAX: 103381.73MAX: 95136.87

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400m7a.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan9M18M27M36M45MSE +/- 111073.27, N = 3SE +/- 177230.11, N = 3SE +/- 66565.66, N = 3SE +/- 130588.20, N = 342643903346258993433223734123810

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400m7a.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa90180270360450SE +/- 4.92, N = 3SE +/- 2.53, N = 3SE +/- 4.08, N = 3SE +/- 2.14, N = 3340.11408.13415.08418.59MAX: 37899.66MAX: 33039.23MAX: 28810.4MAX: 31920.67

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400m7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoa9M18M27M36M45MSE +/- 457186.43, N = 3SE +/- 303573.79, N = 3SE +/- 247504.71, N = 3SE +/- 154587.14, N = 340899210334668043339596833268158

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Writec2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargem7a.16xlargec6g.16xlarge12002400360048006000SE +/- 50.46, N = 12SE +/- 49.93, N = 8SE +/- 23.65, N = 3SE +/- 16.57, N = 3SE +/- 124.50, N = 9582057935342530047761. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400m7a.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milan140280420560700SE +/- 0.65, N = 3SE +/- 3.18, N = 3SE +/- 13.51, N = 3SE +/- 12.58, N = 3521.98623.89629.00633.58MAX: 34235.44MAX: 41831.2MAX: 48846.34MAX: 54749.7

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3Dm7a.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-5620406080100SE +/- 0.74, N = 15SE +/- 0.86, N = 15SE +/- 1.83, N = 12SE +/- 1.43, N = 1574.7884.1788.5490.741. (CXX) g++ options: -O2 -lOpenCL

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Writec2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargem7a.16xlargec6g.16xlarge12002400360048006000SE +/- 47.23, N = 12SE +/- 49.28, N = 12SE +/- 29.72, N = 3SE +/- 13.63, N = 3SE +/- 109.40, N = 12570256825348531247841. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400m7a.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan150300450600750SE +/- 4.86, N = 3SE +/- 1.38, N = 3SE +/- 7.53, N = 3SE +/- 25.87, N = 3598.49679.86682.12709.46MAX: 62432.53MAX: 79453.07MAX: 98294.84MAX: 113264.06

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latencyc2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargem7a.16xlargec6g.16xlarge50100150200250SE +/- 1.46, N = 12SE +/- 1.44, N = 8SE +/- 0.83, N = 3SE +/- 0.59, N = 3SE +/- 6.00, N = 9171.95172.72187.19188.68210.611. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latencyc2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargem7a.16xlargec6g.16xlarge4080120160200SE +/- 1.15, N = 12SE +/- 1.23, N = 12SE +/- 0.83, N = 3SE +/- 0.39, N = 3SE +/- 3.85, N = 12140.41140.92149.60150.60168.191. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installationm7a.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.73, N = 3SE +/- 0.24, N = 3SE +/- 0.22, N = 3SE +/- 4.01, N = 1259.1869.4372.0689.65

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshieldm7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoa306090120150SE +/- 0.18, N = 3SE +/- 0.13, N = 3SE +/- 0.41, N = 3SE +/- 3.26, N = 9115.96123.61146.56147.31

OpenRadioss

Model: Cell Phone Drop Test

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Testm7a.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoa918273645SE +/- 0.08, N = 3SE +/- 0.39, N = 15SE +/- 0.12, N = 3SE +/- 1.27, N = 1526.1430.1236.8438.82

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beamm7a.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-5620406080100SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 1.56, N = 15SE +/- 0.85, N = 366.2775.6892.8798.45


Phoronix Test Suite v10.8.5