amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2310055-NE-2310039NE76 GCE c3d-standard-60 - Phoronix Test Suite GCE c3d-standard-60 amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2310055-NE-2310039NE76&grs&sro .
GCE c3d-standard-60 Processor Motherboard Chipset Memory Disk Network OS Kernel Vulkan Compiler File-System System Layer c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge AMD EPYC 9B14 (30 Cores / 60 Threads) Google Compute Engine c3d-standard-60 Intel 440FX 82441FX PMC 240GB 215GB nvme_card-pd Google Compute Engine Virtual Ubuntu 22.04 6.2.0-1014-gcp (x86_64) 1.3.238 GCC 11.4.0 ext4 KVM AMD EPYC 7B13 (60 Cores) Google Compute Engine t2d-standard-60 215GB PersistentDisk Red Hat Virtio device ARMv8 Neoverse-N1 (64 Cores) Amazon EC2 c6g.16xlarge (1.0 BIOS) Amazon Device 0200 128GB 215GB Amazon Elastic Block Store Amazon Elastic 5.19.0-1025-aws (aarch64) amazon OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - c3d-standard-60 AMD Genoa: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - t2d-standard-60 AMD Milan: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - c6g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details - c3d-standard-60 AMD Genoa, t2d-standard-60 AMD Milan: CPU Microcode: 0xffffffff Java Details - OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04) Python Details - Python 3.10.12 Security Details - c3d-standard-60 AMD Genoa: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - t2d-standard-60 AMD Milan: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - c6g.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
GCE c3d-standard-60 openvino: Face Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openssl: RSA4096 openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU npb: BT.C openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU npb: SP.C avifenc: 2 openvino: Handwritten English Recognition FP16 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openssl: RSA4096 avifenc: 0 tensorflow: CPU - 64 - ResNet-50 tensorflow: CPU - 32 - ResNet-50 build-linux-kernel: defconfig openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU tensorflow: CPU - 16 - ResNet-50 openssl: ChaCha20 openssl: ChaCha20-Poly1305 npb: FT.C openssl: AES-256-GCM npb: EP.D openssl: AES-128-GCM pgbench: 100 - 1000 - Read Only - Average Latency pgbench: 100 - 1000 - Read Only remhos: Sample Remap Example openvino: Face Detection FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Face Detection FP16 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU pgbench: 100 - 800 - Read Only - Average Latency pgbench: 100 - 800 - Read Only gromacs: MPI CPU - water_GMX50_bare npb: MG.C rodinia: OpenMP CFD Solver openssl: SHA512 build-nodejs: Time To Compile heffte: c2c - FFTW - float - 128 laghos: Sedov Blast Wave, ube_922_hex.mesh avifenc: 6 coremark: CoreMark Size 666 - Iterations Per Second heffte: r2c - FFTW - float - 128 lammps: 20k Atoms build-gem5: Time To Compile avifenc: 6, Lossless rodinia: OpenMP LavaMD laghos: Triple Point Problem brl-cad: VGR Performance Metric build-linux-kernel: allmodconfig libxsmm: 32 cassandra: Writes libxsmm: 64 openssl: SHA256 compress-7zip: Compression Rating nginx: 1000 nginx: 500 incompact3d: input.i3d 193 Cells Per Direction amg: compress-7zip: Decompression Rating rodinia: OpenMP Leukocyte incompact3d: input.i3d 129 Cells Per Direction openradioss: Chrysler Neon 1M apache-iotdb: 500 - 100 - 800 - 400 apache-iotdb: 500 - 100 - 500 - 400 apache-iotdb: 800 - 100 - 800 - 400 apache-iotdb: 800 - 100 - 500 - 400 apache-iotdb: 500 - 100 - 500 - 400 apache-iotdb: 500 - 100 - 800 - 400 blender: Pabellon Barcelona - CPU-Only blender: Barbershop - CPU-Only blender: Fishy Cat - CPU-Only blender: Classroom - CPU-Only blender: BMW27 - CPU-Only pgbench: 100 - 1000 - Read Write - Average Latency pgbench: 100 - 800 - Read Write - Average Latency openvino: Person Detection FP32 - CPU openvino: Person Detection FP32 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU pgbench: 100 - 1000 - Read Write pgbench: 100 - 800 - Read Write apache-iotdb: 800 - 100 - 800 - 400 apache-iotdb: 800 - 100 - 500 - 400 stockfish: Total Time lammps: Rhodopsin Protein nekrs: TurboPipe Periodic nekrs: Kershaw openradioss: Rubber O-Ring Seal Installation openradioss: Bird Strike on Windshield openradioss: Cell Phone Drop Test openradioss: Bumper Beam heffte: r2c - FFTW - double - 128 heffte: c2c - FFTW - double - 128 rodinia: OpenMP Streamcluster rodinia: OpenMP HotSpot3D npb: LU.C npb: IS.D npb: CG.C c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan c6g.16xlarge 35.19 15.98 20079.5 3650.57 340.29 4.53 96257.48 645.74 54971.26 39919.71 41.538 964.46 18.56 761.14 493077.6 78.068 69.68 62.74 43607.04 1764.21 1875.28 576.94 8.20 1389.69 4166.39 6.79 50.99 173980949893 123909304773 39647.47 293328048497 3783.60 343095284440 33.362 18.39 20.77 0.4 8.62 2.87 648.81 2043.05 6605.12 185.29 31.08 5.86 64.70 39.38 0.52 4.391 42701.83 10.025 14702270573 198.394 88.6301 259.55 3.250 1445843.521552 148.575 19.776 176.767 6.889 64.862 209.00 510819 255.4 228640 489.7 46211821313 271795 180537.84 187350.44 28.0196877 962889833 226211 45.498 5.87157885 337.70 623.89 418.59 35359884 34332237 33268158 34762565 83.91 142.90 83.99 142.75 682.12 447.68 105894457 17.423 4723940000 4289858333 89.65 147.31 38.82 92.87 93.6005 57.3116 6.448 84.166 73563.13 2422.40 19597.86 26.28 14.76 12973.0 2646.58 568.77 3.52 122720.61 565.34 44049.15 43228.11 41.989 370.03 26.50 390.72 860844.6 78.350 20.90 20.36 33.399 29668.12 633.76 1014.70 225.48 11.32 368.39 1285.47 23.64 18.29 180249145770 119647720337 54846.18 216025967640 4935.68 234604082610 0.498 2008186 16.326 10.73 66.46 0.61 40.67 11.65 1393.56 1512.57 4239.52 96.58 81.00 9.90 155.14 76.72 0.99 0.399 2003784 5.289 47291.96 7.368 22244804183 191.706 109.676 364.64 3.205 1730658.449440 196.948 26.734 170.930 7.639 50.974 222.30 629363 333.351 289.2 187169 554.2 50884997103 278973 155609.04 162957.75 24.5721181 920427767 247255 42.010 5.63057327 327.88 633.58 415.08 35068557 34123810 33466804 34925899 112.64 351.58 45.22 89.35 34.27 172.717 140.916 193.45 78.96 208.47 73.74 5793 5682 709.46 433.96 112958788 27.828 2730620000 3681935833 72.06 123.61 30.12 75.68 106.029 60.0343 6.423 88.535 94247.77 1752.62 16649.37 0.04 119.17 2640.0 5.50 22391.86 2186.81 24229.14 0.15 136.16 9716.99 167.946 2.53 6773.31 2.36 215683.2 270.068 102.216 178.82 7.36 8.39 2.61 181.94 6.53 20.79 135.87 67324778360 46715126487 21386.37 129198197600 2213.76 158788510970 1.026 975031 20.816 0.1 382.47 7.33 153.12 48.08 9996.56 0.14 0.46 1.36 394.94 6990.10 735.58 423.95 5.58 0.767 1043267 2.766 25661.04 5.983 14384917863 286.201 129.172 321.29 4.467 1259870.716902 202.445 25.059 224.414 8.879 62.301 179.52 409.097 312.7 217355 589.5 42288513973 239735 158700.36 162553.85 25.8748328 1032893667 234046 5.61811686 210.608 168.191 947.86 1.06 947.59 1.06 4776 4784 81807706 26.041 2221710000 1758860000 79.0156 32.3575 14.212 18807.75 915.80 13343.35 OpenBenchmarking.org
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 35.19 0.04 26.28 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 15.98 119.17 14.76 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 118.73 / MAX: 120.21 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 4K 8K 12K 16K 20K SE +/- 14.62, N = 3 SE +/- 0.09, N = 3 SE +/- 11.58, N = 3 20079.5 2640.0 12973.0 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 800 1600 2400 3200 4000 SE +/- 3.28, N = 3 SE +/- 0.01, N = 3 SE +/- 3.26, N = 3 3650.57 5.50 2646.58 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 5K 10K 15K 20K 25K SE +/- 0.21, N = 3 SE +/- 15.60, N = 3 SE +/- 0.35, N = 3 340.29 22391.86 568.77 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 22364.17 / MAX: 22423.42 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 500 1000 1500 2000 2500 SE +/- 0.01, N = 3 SE +/- 27.29, N = 3 SE +/- 0.00, N = 3 4.53 2186.81 3.52 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 2135.06 / MAX: 2233.34 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 30K 60K 90K 120K 150K SE +/- 122.23, N = 3 SE +/- 7.69, N = 3 SE +/- 42.77, N = 3 96257.48 24229.14 122720.61 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 140 280 420 560 700 SE +/- 0.79, N = 3 SE +/- 0.00, N = 3 SE +/- 0.37, N = 3 645.74 0.15 565.34 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 12K 24K 36K 48K 60K SE +/- 45.46, N = 3 SE +/- 0.48, N = 3 SE +/- 332.93, N = 3 54971.26 136.16 44049.15 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 9K 18K 27K 36K 45K SE +/- 45.01, N = 3 SE +/- 0.76, N = 3 SE +/- 555.92, N = 3 39919.71 9716.99 43228.11 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 40 80 120 160 200 SE +/- 0.11, N = 3 SE +/- 0.22, N = 3 SE +/- 0.03, N = 3 41.54 167.95 41.99 1. (CXX) g++ options: -O3 -fPIC -lm
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 200 400 600 800 1000 SE +/- 0.60, N = 3 SE +/- 0.02, N = 3 SE +/- 0.96, N = 3 964.46 2.53 370.03 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1500 3000 4500 6000 7500 SE +/- 0.02, N = 3 SE +/- 76.95, N = 3 SE +/- 0.02, N = 3 18.56 6773.31 26.50 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6616.83 / MAX: 6859.99 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 160 320 480 640 800 SE +/- 1.75, N = 3 SE +/- 0.00, N = 3 SE +/- 1.37, N = 3 761.14 2.36 390.72 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 200K 400K 600K 800K 1000K SE +/- 50.91, N = 3 SE +/- 6.55, N = 3 SE +/- 644.45, N = 3 493077.6 215683.2 860844.6 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 0 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 60 120 180 240 300 SE +/- 0.07, N = 3 SE +/- 0.34, N = 3 SE +/- 0.08, N = 3 78.07 270.07 78.35 1. (CXX) g++ options: -O3 -fPIC -lm
TensorFlow Device: CPU - Batch Size: 64 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 16 32 48 64 80 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 69.68 20.90
TensorFlow Device: CPU - Batch Size: 32 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 32 - Model: ResNet-50 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 14 28 42 56 70 SE +/- 0.10, N = 3 SE +/- 0.06, N = 3 62.74 20.36
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig c6g.16xlarge t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 0.82, N = 3 SE +/- 0.37, N = 5 102.22 33.40
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 9K 18K 27K 36K 45K SE +/- 18.42, N = 3 SE +/- 0.39, N = 3 SE +/- 14.46, N = 3 43607.04 178.82 29668.12 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 400 800 1200 1600 2000 SE +/- 4.38, N = 3 SE +/- 0.01, N = 3 SE +/- 3.66, N = 3 1764.21 7.36 633.76 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 400 800 1200 1600 2000 SE +/- 0.47, N = 3 SE +/- 0.01, N = 3 SE +/- 0.56, N = 3 1875.28 8.39 1014.70 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 120 240 360 480 600 SE +/- 1.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.93, N = 3 576.94 2.61 225.48 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.24, N = 3 SE +/- 0.01, N = 3 8.20 181.94 11.32 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 180.71 / MAX: 184.11 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 300 600 900 1200 1500 SE +/- 6.33, N = 3 SE +/- 0.01, N = 3 SE +/- 1.55, N = 3 1389.69 6.53 368.39 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 900 1800 2700 3600 4500 SE +/- 8.74, N = 3 SE +/- 0.01, N = 3 SE +/- 15.96, N = 3 4166.39 20.79 1285.47 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.21, N = 3 SE +/- 0.14, N = 3 6.79 135.87 23.64 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 132.35 / MAX: 148.64 -pie - MIN: 9.57 / MAX: 42.22 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
TensorFlow Device: CPU - Batch Size: 16 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 16 - Model: ResNet-50 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 12 24 36 48 60 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 50.99 18.29
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 40000M 80000M 120000M 160000M 200000M SE +/- 12326205.97, N = 3 SE +/- 372419.81, N = 3 SE +/- 47698640.87, N = 3 173980949893 67324778360 180249145770 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 30000M 60000M 90000M 120000M 150000M SE +/- 3664727.47, N = 3 SE +/- 2259404.37, N = 3 SE +/- 198663058.81, N = 3 123909304773 46715126487 119647720337 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 12K 24K 36K 48K 60K SE +/- 600.19, N = 15 SE +/- 2.85, N = 3 SE +/- 137.77, N = 3 39647.47 21386.37 54846.18 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 60000M 120000M 180000M 240000M 300000M SE +/- 71241287.97, N = 3 SE +/- 2100313.05, N = 3 SE +/- 178290221.71, N = 3 293328048497 129198197600 216025967640 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1100 2200 3300 4400 5500 SE +/- 30.67, N = 3 SE +/- 7.28, N = 3 SE +/- 51.21, N = 5 3783.60 2213.76 4935.68 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 70000M 140000M 210000M 280000M 350000M SE +/- 342949201.09, N = 3 SE +/- 5537993.15, N = 3 SE +/- 376720190.05, N = 3 343095284440 158788510970 234604082610 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency c6g.16xlarge t2d-standard-60 AMD Milan 0.2309 0.4618 0.6927 0.9236 1.1545 SE +/- 0.013, N = 3 SE +/- 0.006, N = 3 1.026 0.498 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only c6g.16xlarge t2d-standard-60 AMD Milan 400K 800K 1200K 1600K 2000K SE +/- 12606.16, N = 3 SE +/- 22497.42, N = 3 975031 2008186 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Remhos Test: Sample Remap Example OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 8 16 24 32 40 SE +/- 0.17, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 33.36 20.82 16.33 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 18.39 0.10 10.73 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 80 160 240 320 400 SE +/- 0.04, N = 3 SE +/- 0.20, N = 3 SE +/- 0.27, N = 3 20.77 382.47 66.46 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 381.67 / MAX: 383.43 -pie - MIN: 25.85 / MAX: 122.2 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 0.40 7.33 0.61 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6.54 / MAX: 9.95 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 SE +/- 0.17, N = 3 8.62 153.12 40.67 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 152.69 / MAX: 153.75 -pie - MIN: 12.07 / MAX: 59.44 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 11 22 33 44 55 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.14, N = 3 2.87 48.08 11.65 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 47.55 / MAX: 50.49 -pie - MIN: 3.76 / MAX: 29.1 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 2K 4K 6K 8K 10K SE +/- 0.17, N = 3 SE +/- 1.02, N = 3 SE +/- 1.32, N = 3 648.81 9996.56 1393.56 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 9993.58 / MAX: 10001.76 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 400 800 1200 1600 2000 SE +/- 3.16, N = 3 SE +/- 0.00, N = 15 SE +/- 1.12, N = 3 2043.05 0.14 1512.57 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1400 2800 4200 5600 7000 SE +/- 6.52, N = 3 SE +/- 0.01, N = 3 SE +/- 2.08, N = 3 6605.12 0.46 4239.52 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.00, N = 3 SE +/- 0.37, N = 3 185.29 1.36 96.58 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 90 180 270 360 450 SE +/- 0.02, N = 3 SE +/- 2.18, N = 3 SE +/- 0.21, N = 3 31.08 394.94 81.00 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 380.82 / MAX: 408.15 -pie - MIN: 64.65 / MAX: 134.64 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1500 3000 4500 6000 7500 SE +/- 0.01, N = 3 SE +/- 27.09, N = 15 SE +/- 0.01, N = 3 5.86 6990.10 9.90 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6842.82 / MAX: 7088.53 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 160 320 480 640 800 SE +/- 0.09, N = 3 SE +/- 0.26, N = 3 SE +/- 0.58, N = 3 64.70 735.58 155.14 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 734.34 / MAX: 738.23 -pie - MIN: 114.75 / MAX: 224.64 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 90 180 270 360 450 SE +/- 0.09, N = 3 SE +/- 0.26, N = 3 SE +/- 0.27, N = 3 39.38 423.95 76.72 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 411.17 / MAX: 440.57 -pie - MIN: 58.8 / MAX: 121.69 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1.2555 2.511 3.7665 5.022 6.2775 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 0.52 5.58 0.99 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 5.45 / MAX: 6.48 -pie - MIN: 0.8 / MAX: 13.76 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
PostgreSQL Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency c6g.16xlarge t2d-standard-60 AMD Milan 0.1726 0.3452 0.5178 0.6904 0.863 SE +/- 0.009, N = 3 SE +/- 0.004, N = 3 0.767 0.399 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 800 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only c6g.16xlarge t2d-standard-60 AMD Milan 400K 800K 1200K 1600K 2000K SE +/- 12058.27, N = 3 SE +/- 20558.90, N = 3 1043267 2003784 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1.19 2.38 3.57 4.76 5.95 SE +/- 0.011, N = 3 SE +/- 0.001, N = 3 SE +/- 0.005, N = 3 4.391 2.766 5.289 1. (CXX) g++ options: -O3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 10K 20K 30K 40K 50K SE +/- 29.69, N = 3 SE +/- 10.99, N = 3 SE +/- 145.06, N = 3 42701.83 25661.04 47291.96 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 3 6 9 12 15 SE +/- 0.013, N = 3 SE +/- 0.001, N = 3 SE +/- 0.034, N = 3 10.025 5.983 7.368 1. (CXX) g++ options: -O2 -lOpenCL
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 5000M 10000M 15000M 20000M 25000M SE +/- 4399663.13, N = 3 SE +/- 6214593.12, N = 3 SE +/- 108274834.92, N = 3 14702270573 14384917863 22244804183 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 60 120 180 240 300 SE +/- 0.29, N = 3 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 198.39 286.20 191.71
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.52, N = 3 SE +/- 0.11, N = 3 SE +/- 0.78, N = 3 88.63 129.17 109.68 1. (CXX) g++ options: -O3
Laghos Test: Sedov Blast Wave, ube_922_hex.mesh OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Sedov Blast Wave, ube_922_hex.mesh c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 80 160 240 320 400 SE +/- 0.31, N = 3 SE +/- 0.79, N = 3 SE +/- 0.62, N = 3 259.55 321.29 364.64 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1.0051 2.0102 3.0153 4.0204 5.0255 SE +/- 0.007, N = 3 SE +/- 0.014, N = 3 SE +/- 0.013, N = 3 3.250 4.467 3.205 1. (CXX) g++ options: -O3 -fPIC -lm
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 400K 800K 1200K 1600K 2000K SE +/- 1295.68, N = 3 SE +/- 635.29, N = 3 SE +/- 9191.61, N = 3 1445843.52 1259870.72 1730658.45 1. (CC) gcc options: -O2 -lrt" -lrt
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 40 80 120 160 200 SE +/- 2.19, N = 12 SE +/- 0.57, N = 3 SE +/- 0.82, N = 3 148.58 202.45 196.95 1. (CXX) g++ options: -O3
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 23Jun2022 Model: 20k Atoms c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 19.78 25.06 26.73 -lm -lm 1. (CXX) g++ options: -O3 -ldl
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 21.2 Time To Compile c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 50 100 150 200 250 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.27, N = 3 176.77 224.41 170.93
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 2 4 6 8 10 SE +/- 0.099, N = 3 SE +/- 0.032, N = 3 SE +/- 0.031, N = 3 6.889 8.879 7.639 1. (CXX) g++ options: -O3 -fPIC -lm
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 14 28 42 56 70 SE +/- 0.18, N = 3 SE +/- 0.03, N = 3 SE +/- 0.13, N = 3 64.86 62.30 50.97 1. (CXX) g++ options: -O2 -lOpenCL
Laghos Test: Triple Point Problem OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Triple Point Problem c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 50 100 150 200 250 SE +/- 0.22, N = 3 SE +/- 0.50, N = 3 SE +/- 1.77, N = 3 209.00 179.52 222.30 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.36 VGR Performance Metric c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 130K 260K 390K 520K 650K 510819 629363 1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig c6g.16xlarge t2d-standard-60 AMD Milan 90 180 270 360 450 SE +/- 2.41, N = 3 SE +/- 1.20, N = 3 409.10 333.35
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 70 140 210 280 350 SE +/- 0.19, N = 3 SE +/- 0.47, N = 3 SE +/- 3.60, N = 4 255.4 312.7 289.2 -lquadmath -msse4.2 -march=armv8.1-a -lquadmath -msse4.2 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.1.3 Test: Writes c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 50K 100K 150K 200K 250K SE +/- 1129.40, N = 3 SE +/- 3249.94, N = 12 SE +/- 681.76, N = 3 228640 217355 187169
libxsmm M N K: 64 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 130 260 390 520 650 SE +/- 0.12, N = 3 SE +/- 0.96, N = 3 SE +/- 0.25, N = 3 489.7 589.5 554.2 -lquadmath -msse4.2 -march=armv8.1-a -lquadmath -msse4.2 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 11000M 22000M 33000M 44000M 55000M SE +/- 9562123.40, N = 3 SE +/- 192235444.29, N = 3 SE +/- 20491615.60, N = 3 46211821313 42288513973 50884997103 -m64 -m64 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 60K 120K 180K 240K 300K SE +/- 346.51, N = 3 SE +/- 359.95, N = 3 SE +/- 388.74, N = 3 271795 239735 278973 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
nginx Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 40K 80K 120K 160K 200K SE +/- 688.79, N = 3 SE +/- 132.24, N = 3 SE +/- 156.32, N = 3 180537.84 158700.36 155609.04 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 40K 80K 120K 160K 200K SE +/- 126.15, N = 3 SE +/- 249.05, N = 3 SE +/- 394.72, N = 3 187350.44 162553.85 162957.75 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 7 14 21 28 35 SE +/- 0.25, N = 3 SE +/- 0.01, N = 3 SE +/- 0.31, N = 12 28.02 25.87 24.57 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 200M 400M 600M 800M 1000M SE +/- 2060519.90, N = 3 SE +/- 176147.98, N = 3 SE +/- 1088162.98, N = 3 962889833 1032893667 920427767 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 50K 100K 150K 200K 250K SE +/- 519.21, N = 3 SE +/- 57.33, N = 3 SE +/- 347.74, N = 3 226211 234046 247255 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 10 20 30 40 50 SE +/- 0.17, N = 3 SE +/- 0.02, N = 3 45.50 42.01 1. (CXX) g++ options: -O2 -lOpenCL
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1.3211 2.6422 3.9633 5.2844 6.6055 SE +/- 0.04970425, N = 3 SE +/- 0.01888616, N = 3 SE +/- 0.02210564, N = 3 5.87157885 5.61811686 5.63057327 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenRadioss Model: Chrysler Neon 1M OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 70 140 210 280 350 SE +/- 2.03, N = 3 SE +/- 1.48, N = 3 337.70 327.88
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 140 280 420 560 700 SE +/- 3.18, N = 3 SE +/- 12.58, N = 3 623.89 633.58 MAX: 41831.2 MAX: 54749.7
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 90 180 270 360 450 SE +/- 2.14, N = 3 SE +/- 4.08, N = 3 418.59 415.08 MAX: 31920.67 MAX: 28810.4
Apache IoTDB Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 8M 16M 24M 32M 40M SE +/- 267152.12, N = 3 SE +/- 354923.23, N = 3 35359884 35068557
Apache IoTDB Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 7M 14M 21M 28M 35M SE +/- 66565.66, N = 3 SE +/- 130588.20, N = 3 34332237 34123810
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 7M 14M 21M 28M 35M SE +/- 154587.14, N = 3 SE +/- 303573.79, N = 3 33268158 33466804
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 7M 14M 21M 28M 35M SE +/- 188621.29, N = 3 SE +/- 221111.93, N = 3 34762565 34925899
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.03, N = 3 112.64
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only t2d-standard-60 AMD Milan 80 160 240 320 400 SE +/- 0.60, N = 3 351.58
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only t2d-standard-60 AMD Milan 10 20 30 40 50 SE +/- 0.13, N = 3 45.22
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 0.07, N = 3 89.35
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only t2d-standard-60 AMD Milan 8 16 24 32 40 SE +/- 0.06, N = 3 34.27
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency c6g.16xlarge t2d-standard-60 AMD Milan 50 100 150 200 250 SE +/- 6.00, N = 9 SE +/- 1.44, N = 8 210.61 172.72 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency c6g.16xlarge t2d-standard-60 AMD Milan 40 80 120 160 200 SE +/- 3.85, N = 12 SE +/- 1.23, N = 12 168.19 140.92 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP32 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 200 400 600 800 1000 SE +/- 0.29, N = 3 SE +/- 0.55, N = 3 SE +/- 7.79, N = 15 83.91 947.86 193.45 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 944.57 / MAX: 958.26 -pie - MIN: 113.44 / MAX: 315.54 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP32 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.48, N = 3 SE +/- 0.00, N = 3 SE +/- 2.74, N = 15 142.90 1.06 78.96 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 200 400 600 800 1000 SE +/- 0.13, N = 3 SE +/- 0.37, N = 3 SE +/- 9.17, N = 15 83.99 947.59 208.47 -isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 943.57 / MAX: 951.68 -pie - MIN: 119.65 / MAX: 316.01 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 0.00, N = 3 SE +/- 3.12, N = 15 142.75 1.06 73.74 -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write c6g.16xlarge t2d-standard-60 AMD Milan 1200 2400 3600 4800 6000 SE +/- 124.50, N = 9 SE +/- 49.93, N = 8 4776 5793 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 800 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write c6g.16xlarge t2d-standard-60 AMD Milan 1200 2400 3600 4800 6000 SE +/- 109.40, N = 12 SE +/- 49.28, N = 12 4784 5682 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Apache IoTDB Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 150 300 450 600 750 SE +/- 7.53, N = 3 SE +/- 25.87, N = 3 682.12 709.46 MAX: 98294.84 MAX: 113264.06
Apache IoTDB Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 100 200 300 400 500 SE +/- 27.08, N = 3 SE +/- 35.57, N = 3 447.68 433.96 MAX: 95136.87 MAX: 103381.73
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 15 Total Time c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 20M 40M 60M 80M 100M SE +/- 1450871.09, N = 3 SE +/- 1645401.81, N = 15 SE +/- 1618403.29, N = 14 105894457 81807706 112958788 -m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 23Jun2022 Model: Rhodopsin Protein c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 7 14 21 28 35 SE +/- 0.54, N = 12 SE +/- 0.04, N = 3 SE +/- 0.17, N = 3 17.42 26.04 27.83 -lm -lm 1. (CXX) g++ options: -O3 -ldl
nekRS Input: TurboPipe Periodic OpenBenchmarking.org flops/rank, More Is Better nekRS 23.0 Input: TurboPipe Periodic c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 1000M 2000M 3000M 4000M 5000M SE +/- 201939132.13, N = 12 SE +/- 1790009.31, N = 3 SE +/- 481352.26, N = 3 4723940000 2221710000 2730620000 1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
nekRS Input: Kershaw OpenBenchmarking.org flops/rank, More Is Better nekRS 23.0 Input: Kershaw c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 900M 1800M 2700M 3600M 4500M SE +/- 57202190.49, N = 12 SE +/- 2970005.61, N = 3 SE +/- 84802173.02, N = 12 4289858333 1758860000 3681935833 1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 4.01, N = 12 SE +/- 0.22, N = 3 89.65 72.06
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 30 60 90 120 150 SE +/- 3.26, N = 9 SE +/- 0.13, N = 3 147.31 123.61
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 9 18 27 36 45 SE +/- 1.27, N = 15 SE +/- 0.39, N = 15 38.82 30.12
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 1.56, N = 15 SE +/- 0.10, N = 3 92.87 75.68
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 1.78, N = 12 SE +/- 0.71, N = 3 SE +/- 0.91, N = 3 93.60 79.02 106.03 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.3 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 13 26 39 52 65 SE +/- 1.29, N = 15 SE +/- 0.10, N = 3 SE +/- 0.66, N = 3 57.31 32.36 60.03 1. (CXX) g++ options: -O3
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 4 8 12 16 20 SE +/- 0.104, N = 15 SE +/- 0.017, N = 3 SE +/- 0.009, N = 3 6.448 14.212 6.423 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D c3d-standard-60 AMD Genoa t2d-standard-60 AMD Milan 20 40 60 80 100 SE +/- 0.86, N = 15 SE +/- 1.83, N = 12 84.17 88.54 1. (CXX) g++ options: -O2 -lOpenCL
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 20K 40K 60K 80K 100K SE +/- 293.57, N = 3 SE +/- 7.52, N = 3 SE +/- 1463.52, N = 15 73563.13 18807.75 94247.77 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 500 1000 1500 2000 2500 SE +/- 36.45, N = 15 SE +/- 0.58, N = 3 SE +/- 142.62, N = 12 2422.40 915.80 1752.62 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C c3d-standard-60 AMD Genoa c6g.16xlarge t2d-standard-60 AMD Milan 4K 8K 12K 16K 20K SE +/- 77.92, N = 3 SE +/- 23.52, N = 3 SE +/- 1215.82, N = 15 19597.86 13343.35 16649.37 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Phoronix Test Suite v10.8.4