GCE c3d-standard-60

amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2310055-NE-2310039NE76
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Timed Code Compilation 3 Tests
C/C++ Compiler Tests 7 Tests
CPU Massive 12 Tests
Creator Workloads 4 Tests
Database Test Suite 3 Tests
Fortran Tests 4 Tests
HPC - High Performance Computing 10 Tests
Java Tests 2 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 2 Tests
Molecular Dynamics 3 Tests
MPI Benchmarks 4 Tests
Multi-Core 15 Tests
NVIDIA GPU Compute 3 Tests
OpenMPI Tests 11 Tests
Programmer / Developer System Benchmarks 4 Tests
Python Tests 4 Tests
Scientific Computing 4 Tests
Server 5 Tests
Server CPU Tests 7 Tests
Common Workstation Benchmarks 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
c3d-standard-60 AMD Genoa
October 03 2023
  10 Hours, 30 Minutes
t2d-standard-60 AMD Milan
October 03 2023
  13 Hours, 23 Minutes
c6g.16xlarge
October 05 2023
  9 Hours, 27 Minutes
Invert Hiding All Results Option
  11 Hours, 7 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


GCE c3d-standard-60 - Phoronix Test Suite

GCE c3d-standard-60

amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310055-NE-2310039NE76&gru&sor.

GCE c3d-standard-60ProcessorMotherboardChipsetMemoryDiskNetworkOSKernelVulkanCompilerFile-SystemSystem Layerc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlargeAMD EPYC 9B14 (30 Cores / 60 Threads)Google Compute Engine c3d-standard-60Intel 440FX 82441FX PMC240GB215GB nvme_card-pdGoogle Compute Engine VirtualUbuntu 22.046.2.0-1014-gcp (x86_64)1.3.238GCC 11.4.0ext4KVMAMD EPYC 7B13 (60 Cores)Google Compute Engine t2d-standard-60215GB PersistentDiskRed Hat Virtio deviceARMv8 Neoverse-N1 (64 Cores)Amazon EC2 c6g.16xlarge (1.0 BIOS)Amazon Device 0200128GB215GB Amazon Elastic Block StoreAmazon Elastic5.19.0-1025-aws (aarch64)amazonOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- c3d-standard-60 AMD Genoa: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - t2d-standard-60 AMD Milan: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - c6g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details- c3d-standard-60 AMD Genoa, t2d-standard-60 AMD Milan: CPU Microcode: 0xffffffffJava Details- OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Details- Python 3.10.12Security Details- c3d-standard-60 AMD Genoa: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - t2d-standard-60 AMD Milan: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - c6g.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

GCE c3d-standard-60openssl: SHA256openssl: SHA512openssl: ChaCha20openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20-Poly1305amg: nekrs: Kershawnekrs: TurboPipe Periodicopenvino: Face Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUheffte: c2c - FFTW - float - 128heffte: r2c - FFTW - float - 128heffte: c2c - FFTW - double - 128heffte: r2c - FFTW - double - 128libxsmm: 32libxsmm: 64tensorflow: CPU - 16 - ResNet-50tensorflow: CPU - 32 - ResNet-50tensorflow: CPU - 64 - ResNet-50coremark: CoreMark Size 666 - Iterations Per Secondlaghos: Triple Point Problemlaghos: Sedov Blast Wave, ube_922_hex.meshcompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingstockfish: Total Timegromacs: MPI CPU - water_GMX50_barelammps: 20k Atomslammps: Rhodopsin Proteincassandra: Writesapache-iotdb: 500 - 100 - 500 - 400apache-iotdb: 500 - 100 - 800 - 400apache-iotdb: 800 - 100 - 500 - 400apache-iotdb: 800 - 100 - 800 - 400nginx: 500nginx: 1000openssl: RSA4096npb: BT.Cnpb: CG.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Cpgbench: 100 - 800 - Read Onlypgbench: 100 - 1000 - Read Onlypgbench: 100 - 800 - Read Writepgbench: 100 - 1000 - Read Writeopenssl: RSA4096brl-cad: VGR Performance Metricapache-iotdb: 500 - 100 - 500 - 400apache-iotdb: 500 - 100 - 800 - 400apache-iotdb: 800 - 100 - 500 - 400apache-iotdb: 800 - 100 - 800 - 400openvino: Face Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUpgbench: 100 - 800 - Read Only - Average Latencypgbench: 100 - 1000 - Read Only - Average Latencypgbench: 100 - 800 - Read Write - Average Latencypgbench: 100 - 1000 - Read Write - Average Latencyrodinia: OpenMP LavaMDrodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocyterodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionopenradioss: Bumper Beamopenradioss: Chrysler Neon 1Mopenradioss: Cell Phone Drop Testopenradioss: Bird Strike on Windshieldopenradioss: Rubber O-Ring Seal Installationremhos: Sample Remap Exampleavifenc: 0avifenc: 2avifenc: 6avifenc: 6, Losslessbuild-gem5: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigbuild-nodejs: Time To Compileblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge46211821313147022705731739809498933430952844402933280484971239093047739628898334289858333472394000018.39142.75142.901389.6935.194166.39576.942043.051875.286605.12645.74185.293650.571764.21964.4643607.04761.1454971.2688.6301148.57557.311693.6005255.4489.750.9962.7469.681445843.521552209.00259.552717952262111058944574.39119.77617.42322864033268158347625653433223735359884187350.44180537.8420079.596257.4819597.863783.6039647.472422.4073563.1342701.8339919.71493077.6510819418.59623.89447.68682.12648.8183.9983.918.62340.292.8720.775.8615.984.5318.5664.708.206.7931.080.5239.380.464.86284.16645.49810.0256.4485.8715788528.019687792.87337.7038.82147.3189.6533.36278.06841.5383.2506.889176.767198.39450884997103222448041831802491457702346040826102160259676401196477203379204277673681935833273062000010.7373.7478.96368.3926.281285.47225.481512.571014.704239.52565.3496.582646.58633.76370.0329668.12390.7244049.15109.676196.94860.0343106.029289.2554.218.2920.3620.901730658.449440222.30364.642789732472551129587885.28926.73427.82818716933466804349258993412381035068557162957.75155609.0412973.0122720.6116649.374935.6854846.181752.6294247.7747291.9643228.112003784200818656825793860844.6629363415.08633.58433.96709.461393.56208.47193.4540.67568.7711.6566.469.9014.763.5226.50155.1411.3223.6481.000.9976.720.610.3990.498140.916172.71750.97488.53542.0107.3686.4235.6305732724.572118175.68327.8830.12123.6172.0616.32678.35041.9893.2057.639170.93033.399333.351191.70634.2789.3545.22351.58112.64422885139731438491786367324778360158788510970129198197600467151264871032893667175886000022217100000.11.061.066.530.0420.792.610.148.390.460.151.365.507.362.53178.822.36136.16129.172202.44532.357579.0156312.7589.51259870.716902179.52321.29239735234046818077062.76625.05926.041217355162553.85158700.362640.024229.1413343.352213.7621386.37915.8018807.7525661.049716.99104326797503147844776215683.29996.56947.59947.86153.1222391.8648.08382.476990.10119.172186.816773.31735.58181.94135.87394.945.58423.957.330.7671.026168.191210.60862.3015.98314.2125.6181168625.874832820.816270.068167.9464.4678.879224.414102.216409.097286.201OpenBenchmarking.org

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge11000M22000M33000M44000M55000MSE +/- 20491615.60, N = 3SE +/- 9562123.40, N = 3SE +/- 192235444.29, N = 3508849971034621182131342288513973-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge5000M10000M15000M20000M25000MSE +/- 108274834.92, N = 3SE +/- 4399663.13, N = 3SE +/- 6214593.12, N = 3222448041831470227057314384917863-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge40000M80000M120000M160000M200000MSE +/- 47698640.87, N = 3SE +/- 12326205.97, N = 3SE +/- 372419.81, N = 318024914577017398094989367324778360-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge70000M140000M210000M280000M350000MSE +/- 342949201.09, N = 3SE +/- 376720190.05, N = 3SE +/- 5537993.15, N = 3343095284440234604082610158788510970-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge60000M120000M180000M240000M300000MSE +/- 71241287.97, N = 3SE +/- 178290221.71, N = 3SE +/- 2100313.05, N = 3293328048497216025967640129198197600-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge30000M60000M90000M120000M150000MSE +/- 3664727.47, N = 3SE +/- 198663058.81, N = 3SE +/- 2259404.37, N = 312390930477311964772033746715126487-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2c6g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milan200M400M600M800M1000MSE +/- 176147.98, N = 3SE +/- 2060519.90, N = 3SE +/- 1088162.98, N = 310328936679628898339204277671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

nekRS

Input: Kershaw

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: Kershawc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge900M1800M2700M3600M4500MSE +/- 57202190.49, N = 12SE +/- 84802173.02, N = 12SE +/- 2970005.61, N = 34289858333368193583317588600001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

nekRS

Input: TurboPipe Periodic

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe Periodicc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge1000M2000M3000M4000M5000MSE +/- 201939132.13, N = 12SE +/- 481352.26, N = 3SE +/- 1790009.31, N = 34723940000273062000022217100001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 318.3910.730.10-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge306090120150SE +/- 0.23, N = 3SE +/- 3.12, N = 15SE +/- 0.00, N = 3142.7573.741.06-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Detection FP32 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge306090120150SE +/- 0.48, N = 3SE +/- 2.74, N = 15SE +/- 0.00, N = 3142.9078.961.06-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge30060090012001500SE +/- 6.33, N = 3SE +/- 1.55, N = 3SE +/- 0.01, N = 31389.69368.396.53-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge816243240SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 335.1926.280.04-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge9001800270036004500SE +/- 8.74, N = 3SE +/- 15.96, N = 3SE +/- 0.01, N = 34166.391285.4720.79-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge120240360480600SE +/- 1.04, N = 3SE +/- 0.93, N = 3SE +/- 0.00, N = 3576.94225.482.61-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge400800120016002000SE +/- 3.16, N = 3SE +/- 1.12, N = 3SE +/- 0.00, N = 152043.051512.570.14-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge400800120016002000SE +/- 0.47, N = 3SE +/- 0.56, N = 3SE +/- 0.01, N = 31875.281014.708.39-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge14002800420056007000SE +/- 6.52, N = 3SE +/- 2.08, N = 3SE +/- 0.01, N = 36605.124239.520.46-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge140280420560700SE +/- 0.79, N = 3SE +/- 0.37, N = 3SE +/- 0.00, N = 3645.74565.340.15-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Machine Translation EN To DE FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge4080120160200SE +/- 0.25, N = 3SE +/- 0.37, N = 3SE +/- 0.00, N = 3185.2996.581.36-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge8001600240032004000SE +/- 3.28, N = 3SE +/- 3.26, N = 3SE +/- 0.01, N = 33650.572646.585.50-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Vehicle Bike Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge400800120016002000SE +/- 4.38, N = 3SE +/- 3.66, N = 3SE +/- 0.01, N = 31764.21633.767.36-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge2004006008001000SE +/- 0.60, N = 3SE +/- 0.96, N = 3SE +/- 0.02, N = 3964.46370.032.53-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge9K18K27K36K45KSE +/- 18.42, N = 3SE +/- 14.46, N = 3SE +/- 0.39, N = 343607.0429668.12178.82-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge160320480640800SE +/- 1.75, N = 3SE +/- 1.37, N = 3SE +/- 0.00, N = 3761.14390.722.36-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge12K24K36K48K60KSE +/- 45.46, N = 3SE +/- 332.93, N = 3SE +/- 0.48, N = 354971.2644049.15136.16-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128c6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa306090120150SE +/- 0.11, N = 3SE +/- 0.78, N = 3SE +/- 0.52, N = 3129.17109.6888.631. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128c6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa4080120160200SE +/- 0.57, N = 3SE +/- 0.82, N = 3SE +/- 2.19, N = 12202.45196.95148.581. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge1326395265SE +/- 0.66, N = 3SE +/- 1.29, N = 15SE +/- 0.10, N = 360.0357.3132.361. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge20406080100SE +/- 0.91, N = 3SE +/- 1.78, N = 12SE +/- 0.71, N = 3106.0393.6079.021. (CXX) g++ options: -O3

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32c6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa70140210280350SE +/- 0.47, N = 3SE +/- 3.60, N = 4SE +/- 0.19, N = 3312.7289.2255.4-march=armv8.1-a-lquadmath -msse4.2-lquadmath -msse4.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64c6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa130260390520650SE +/- 0.96, N = 3SE +/- 0.25, N = 3SE +/- 0.12, N = 3589.5554.2489.7-march=armv8.1-a-lquadmath -msse4.2-lquadmath -msse4.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden

TensorFlow

Device: CPU - Batch Size: 16 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan1224364860SE +/- 0.04, N = 3SE +/- 0.05, N = 350.9918.29

TensorFlow

Device: CPU - Batch Size: 32 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan1428425670SE +/- 0.10, N = 3SE +/- 0.06, N = 362.7420.36

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan1632486480SE +/- 0.07, N = 3SE +/- 0.03, N = 369.6820.90

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge400K800K1200K1600K2000KSE +/- 9191.61, N = 3SE +/- 1295.68, N = 3SE +/- 635.29, N = 31730658.451445843.521259870.721. (CC) gcc options: -O2 -lrt" -lrt

Laghos

Test: Triple Point Problem

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point Problemt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge50100150200250SE +/- 1.77, N = 3SE +/- 0.22, N = 3SE +/- 0.50, N = 3222.30209.00179.521. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.mesht2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoa80160240320400SE +/- 0.62, N = 3SE +/- 0.79, N = 3SE +/- 0.31, N = 3364.64321.29259.551. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Ratingt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge60K120K180K240K300KSE +/- 388.74, N = 3SE +/- 346.51, N = 3SE +/- 359.95, N = 32789732717952397351. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Ratingt2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoa50K100K150K200K250KSE +/- 347.74, N = 3SE +/- 57.33, N = 3SE +/- 519.21, N = 32472552340462262111. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total Timet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge20M40M60M80M100MSE +/- 1618403.29, N = 14SE +/- 1450871.09, N = 3SE +/- 1645401.81, N = 1511295878810589445781807706-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_baret2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge1.192.383.574.765.95SE +/- 0.005, N = 3SE +/- 0.011, N = 3SE +/- 0.001, N = 35.2894.3912.7661. (CXX) g++ options: -O3

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k Atomst2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoa612182430SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 326.7325.0619.78-lm-lm1. (CXX) g++ options: -O3 -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin Proteint2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoa714212835SE +/- 0.17, N = 3SE +/- 0.04, N = 3SE +/- 0.54, N = 1227.8326.0417.42-lm-lm1. (CXX) g++ options: -O3 -ldl

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Writesc3d-standard-60 AMD Genoac6g.16xlarget2d-standard-60 AMD Milan50K100K150K200K250KSE +/- 1129.40, N = 3SE +/- 3249.94, N = 12SE +/- 681.76, N = 3228640217355187169

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa7M14M21M28M35MSE +/- 303573.79, N = 3SE +/- 154587.14, N = 33346680433268158

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa7M14M21M28M35MSE +/- 221111.93, N = 3SE +/- 188621.29, N = 33492589934762565

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan7M14M21M28M35MSE +/- 66565.66, N = 3SE +/- 130588.20, N = 33433223734123810

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan8M16M24M32M40MSE +/- 267152.12, N = 3SE +/- 354923.23, N = 33535988435068557

nginx

Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge40K80K120K160K200KSE +/- 126.15, N = 3SE +/- 394.72, N = 3SE +/- 249.05, N = 3187350.44162957.75162553.851. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000c3d-standard-60 AMD Genoac6g.16xlarget2d-standard-60 AMD Milan40K80K120K160K200KSE +/- 688.79, N = 3SE +/- 132.24, N = 3SE +/- 156.32, N = 3180537.84158700.36155609.041. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge4K8K12K16K20KSE +/- 14.62, N = 3SE +/- 11.58, N = 3SE +/- 0.09, N = 320079.512973.02640.0-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge30K60K90K120K150KSE +/- 42.77, N = 3SE +/- 122.23, N = 3SE +/- 7.69, N = 3122720.6196257.4824229.141. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge4K8K12K16K20KSE +/- 77.92, N = 3SE +/- 1215.82, N = 15SE +/- 23.52, N = 319597.8616649.3713343.351. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge11002200330044005500SE +/- 51.21, N = 5SE +/- 30.67, N = 3SE +/- 7.28, N = 34935.683783.602213.761. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge12K24K36K48K60KSE +/- 137.77, N = 3SE +/- 600.19, N = 15SE +/- 2.85, N = 354846.1839647.4721386.371. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.Dc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge5001000150020002500SE +/- 36.45, N = 15SE +/- 142.62, N = 12SE +/- 0.58, N = 32422.401752.62915.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge20K40K60K80K100KSE +/- 1463.52, N = 15SE +/- 293.57, N = 3SE +/- 7.52, N = 394247.7773563.1318807.751. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge10K20K30K40K50KSE +/- 145.06, N = 3SE +/- 29.69, N = 3SE +/- 10.99, N = 347291.9642701.8325661.041. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Ct2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge9K18K27K36K45KSE +/- 555.92, N = 3SE +/- 45.01, N = 3SE +/- 0.76, N = 343228.1139919.719716.991. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Onlyt2d-standard-60 AMD Milanc6g.16xlarge400K800K1200K1600K2000KSE +/- 20558.90, N = 3SE +/- 12058.27, N = 3200378410432671. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Onlyt2d-standard-60 AMD Milanc6g.16xlarge400K800K1200K1600K2000KSE +/- 22497.42, N = 3SE +/- 12606.16, N = 320081869750311. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Writet2d-standard-60 AMD Milanc6g.16xlarge12002400360048006000SE +/- 49.28, N = 12SE +/- 109.40, N = 12568247841. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Writet2d-standard-60 AMD Milanc6g.16xlarge12002400360048006000SE +/- 49.93, N = 8SE +/- 124.50, N = 9579347761. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge200K400K600K800K1000KSE +/- 644.45, N = 3SE +/- 50.91, N = 3SE +/- 6.55, N = 3860844.6493077.6215683.2-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.36VGR Performance Metrict2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa130K260K390K520K650K6293635108191. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa90180270360450SE +/- 4.08, N = 3SE +/- 2.14, N = 3415.08418.59MAX: 28810.4MAX: 31920.67

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan140280420560700SE +/- 3.18, N = 3SE +/- 12.58, N = 3623.89633.58MAX: 41831.2MAX: 54749.7

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa100200300400500SE +/- 35.57, N = 3SE +/- 27.08, N = 3433.96447.68MAX: 103381.73MAX: 95136.87

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400c3d-standard-60 AMD Genoat2d-standard-60 AMD Milan150300450600750SE +/- 7.53, N = 3SE +/- 25.87, N = 3682.12709.46MAX: 98294.84MAX: 113264.06

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge2K4K6K8K10KSE +/- 0.17, N = 3SE +/- 1.32, N = 3SE +/- 1.02, N = 3648.811393.569996.56-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 9993.58 / MAX: 10001.761. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge2004006008001000SE +/- 0.13, N = 3SE +/- 9.17, N = 15SE +/- 0.37, N = 383.99208.47947.59-pie - MIN: 119.65 / MAX: 316.01-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 943.57 / MAX: 951.681. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Detection FP32 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge2004006008001000SE +/- 0.29, N = 3SE +/- 7.79, N = 15SE +/- 0.55, N = 383.91193.45947.86-pie - MIN: 113.44 / MAX: 315.54-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 944.57 / MAX: 958.261. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge306090120150SE +/- 0.04, N = 3SE +/- 0.17, N = 3SE +/- 0.09, N = 38.6240.67153.12-pie - MIN: 12.07 / MAX: 59.44-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 152.69 / MAX: 153.751. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge5K10K15K20K25KSE +/- 0.21, N = 3SE +/- 0.35, N = 3SE +/- 15.60, N = 3340.29568.7722391.86-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 22364.17 / MAX: 22423.421. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge1122334455SE +/- 0.01, N = 3SE +/- 0.14, N = 3SE +/- 0.02, N = 32.8711.6548.08-pie - MIN: 3.76 / MAX: 29.1-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 47.55 / MAX: 50.491. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge80160240320400SE +/- 0.04, N = 3SE +/- 0.27, N = 3SE +/- 0.20, N = 320.7766.46382.47-pie - MIN: 25.85 / MAX: 122.2-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 381.67 / MAX: 383.431. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge15003000450060007500SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 27.09, N = 155.869.906990.10-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6842.82 / MAX: 7088.531. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge306090120150SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 314.7615.98119.17-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 118.73 / MAX: 120.211. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16-INT8 - Device: CPUt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge5001000150020002500SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 27.29, N = 33.524.532186.81-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 2135.06 / MAX: 2233.341. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge15003000450060007500SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 76.95, N = 318.5626.506773.31-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6616.83 / MAX: 6859.991. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Machine Translation EN To DE FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge160320480640800SE +/- 0.09, N = 3SE +/- 0.58, N = 3SE +/- 0.26, N = 364.70155.14735.58-pie - MIN: 114.75 / MAX: 224.64-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 734.34 / MAX: 738.231. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge4080120160200SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.24, N = 38.2011.32181.94-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 180.71 / MAX: 184.111. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Vehicle Bike Detection FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge306090120150SE +/- 0.02, N = 3SE +/- 0.14, N = 3SE +/- 0.21, N = 36.7923.64135.87-pie - MIN: 9.57 / MAX: 42.22-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 132.35 / MAX: 148.641. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge90180270360450SE +/- 0.02, N = 3SE +/- 0.21, N = 3SE +/- 2.18, N = 331.0881.00394.94-pie - MIN: 64.65 / MAX: 134.64-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 380.82 / MAX: 408.151. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge1.25552.5113.76655.0226.2775SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 30.520.995.58-pie - MIN: 0.8 / MAX: 13.76-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 5.45 / MAX: 6.481. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge90180270360450SE +/- 0.09, N = 3SE +/- 0.27, N = 3SE +/- 0.26, N = 339.3876.72423.95-pie - MIN: 58.8 / MAX: 121.69-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 411.17 / MAX: 440.571. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 30.400.617.33-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6.54 / MAX: 9.951. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latencyt2d-standard-60 AMD Milanc6g.16xlarge0.17260.34520.51780.69040.863SE +/- 0.004, N = 3SE +/- 0.009, N = 30.3990.7671. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latencyt2d-standard-60 AMD Milanc6g.16xlarge0.23090.46180.69270.92361.1545SE +/- 0.006, N = 3SE +/- 0.013, N = 30.4981.0261. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latencyt2d-standard-60 AMD Milanc6g.16xlarge4080120160200SE +/- 1.23, N = 12SE +/- 3.85, N = 12140.92168.191. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latencyt2d-standard-60 AMD Milanc6g.16xlarge50100150200250SE +/- 1.44, N = 8SE +/- 6.00, N = 9172.72210.611. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDt2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoa1428425670SE +/- 0.13, N = 3SE +/- 0.03, N = 3SE +/- 0.18, N = 350.9762.3064.861. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3Dc3d-standard-60 AMD Genoat2d-standard-60 AMD Milan20406080100SE +/- 0.86, N = 15SE +/- 1.83, N = 1284.1788.541. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Leukocytet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1020304050SE +/- 0.02, N = 3SE +/- 0.17, N = 342.0145.501. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solverc6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa3691215SE +/- 0.001, N = 3SE +/- 0.034, N = 3SE +/- 0.013, N = 35.9837.36810.0251. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamclustert2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge48121620SE +/- 0.009, N = 3SE +/- 0.104, N = 15SE +/- 0.017, N = 36.4236.44814.2121. (CXX) g++ options: -O2 -lOpenCL

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionc6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa1.32112.64223.96335.28446.6055SE +/- 0.01888616, N = 3SE +/- 0.02210564, N = 3SE +/- 0.04970425, N = 35.618116865.630573275.871578851. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directiont2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoa714212835SE +/- 0.31, N = 12SE +/- 0.01, N = 3SE +/- 0.25, N = 324.5725.8728.021. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beamt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.10, N = 3SE +/- 1.56, N = 1575.6892.87

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1Mt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa70140210280350SE +/- 1.48, N = 3SE +/- 2.03, N = 3327.88337.70

OpenRadioss

Model: Cell Phone Drop Test

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Testt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa918273645SE +/- 0.39, N = 15SE +/- 1.27, N = 1530.1238.82

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshieldt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa306090120150SE +/- 0.13, N = 3SE +/- 3.26, N = 9123.61147.31

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installationt2d-standard-60 AMD Milanc3d-standard-60 AMD Genoa20406080100SE +/- 0.22, N = 3SE +/- 4.01, N = 1272.0689.65

Remhos

Test: Sample Remap Example

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Examplet2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoa816243240SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.17, N = 316.3320.8233.361. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 0c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge60120180240300SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.34, N = 378.0778.35270.071. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 2c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge4080120160200SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.22, N = 341.5441.99167.951. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge1.00512.01023.01534.02045.0255SE +/- 0.013, N = 3SE +/- 0.007, N = 3SE +/- 0.014, N = 33.2053.2504.4671. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6, Losslessc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlarge246810SE +/- 0.099, N = 3SE +/- 0.031, N = 3SE +/- 0.032, N = 36.8897.6398.8791. (CXX) g++ options: -O3 -fPIC -lm

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compilet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge50100150200250SE +/- 0.27, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3170.93176.77224.41

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigt2d-standard-60 AMD Milanc6g.16xlarge20406080100SE +/- 0.37, N = 5SE +/- 0.82, N = 333.40102.22

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigt2d-standard-60 AMD Milanc6g.16xlarge90180270360450SE +/- 1.20, N = 3SE +/- 2.41, N = 3333.35409.10

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To Compilet2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac6g.16xlarge60120180240300SE +/- 0.05, N = 3SE +/- 0.29, N = 3SE +/- 0.11, N = 3191.71198.39286.20

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlyt2d-standard-60 AMD Milan816243240SE +/- 0.06, N = 334.27

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Onlyt2d-standard-60 AMD Milan20406080100SE +/- 0.07, N = 389.35

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Onlyt2d-standard-60 AMD Milan1020304050SE +/- 0.13, N = 345.22

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Onlyt2d-standard-60 AMD Milan80160240320400SE +/- 0.60, N = 3351.58

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-Onlyt2d-standard-60 AMD Milan306090120150SE +/- 0.03, N = 3112.64


Phoronix Test Suite v10.8.4