Graviton4 vs. Graviton3 vs. Graviton2 + AMD EPYC, Intel Xeon AWS

Benchmarks by Michael Larabel for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/2407115-NE-4GRAVITON83&grs.

System Details

For systems after the first, only the fields the export restates are listed.

Graviton4 r8g.16xlarge
Processor: ARMv8 Neoverse-V2 (64 Cores)
Motherboard: Amazon EC2 r8g.16xlarge (1.0 BIOS)
Chipset: Amazon Device 0200
Memory: 496GB
Disk: 429GB Amazon Elastic Block Store
Network: Amazon Elastic
OS: Ubuntu 24.04
Kernel: 6.8.0-1009-aws (aarch64)
Compiler: GCC 13.2.0
File-System: ext4
System Layer: amazon

Graviton3 r7g.16xlarge
Processor: ARMv8 Neoverse-V1 (64 Cores)
Motherboard: Amazon EC2 r7g.16xlarge (1.0 BIOS)

EPYC 9R14 r7a.16xlarge
Processor: AMD EPYC 9R14 (64 Cores)
Motherboard: Amazon EC2 r7a.16xlarge (1.0 BIOS)
Chipset: Intel 440FX 82441FX PMC
Memory: 1 x 512GB DDR5-4800MT/s
Graphics: simpledrmdrmfb
Kernel: 6.8.0-1009-aws (x86_64)
Screen Resolution: 800x600

Graviton2 r6g.16xlarge
Processor: ARMv8 Neoverse-N1 (64 Cores)
Motherboard: Amazon EC2 r6g.16xlarge (1.0 BIOS)
Chipset: Amazon Device 0200
Memory: 512GB
Kernel: 6.8.0-1009-aws (aarch64)

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- Graviton4 r8g.16xlarge, Graviton3 r7g.16xlarge, Graviton2 r6g.16xlarge (identical configuration): --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v
- EPYC 9R14 r7a.16xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Python Details
- Python 3.12.3

Security Details
- Graviton4 r8g.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- Graviton3 r7g.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
- EPYC 9R14 r7a.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- Graviton2 r6g.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

Processor Details
- EPYC 9R14 r7a.16xlarge: CPU Microcode: 0xa101148

Results Overview

A "-" marks a test for which the export lists no result for that system.

Test | Graviton4 r8g.16xlarge | Graviton3 r7g.16xlarge | EPYC 9R14 r7a.16xlarge | Graviton2 r6g.16xlarge
john-the-ripper: WPA PSK | 57444 | 52631 | 371747 | 49854
john-the-ripper: MD5 | 1579667 | 1451333 | 8647000 | 1332667
liquid-dsp: 64 - 256 - 512 | 185000000 | 166313333 | 759700000 | 150236667
openssl: ChaCha20-Poly1305 | 75190548707 | 69430704540 | 217491096737 | 46479276533
openssl: ChaCha20 | 101898720323 | 94147502077 | 308611390393 | 67045933947
john-the-ripper: HMAC-SHA512 | 108066333 | 86689000 | 256384000 | 61258333
rocksdb: Update Rand | 1274616 | 817470 | 742898 | 305604
blender: Pabellon Barcelona - CPU-Only | 202.26 | 247.60 | 84.52 | 321.20
blender: Fishy Cat - CPU-Only | 95.01 | 115.47 | 39.16 | 147.77
srsran: PDSCH Processor Benchmark, Throughput Total | 14402.5 | 12612.6 | 28256.8 | 8104.1
srsran: PUSCH Processor Benchmark, Throughput Total | 1332.5 | 1053.8 | 2400.2 | 705.9
xmrig: GhostRider - 1M | 5958.7 | 2108.5 | 6899.8 | 2067.5
incompact3d: input.i3d 193 Cells Per Direction | 7.87502066 | 13.9558484 | 10.9157661 | 25.5203005
gromacs: MPI CPU - water_GMX50_bare | 4.831 | 4.163 | 8.519 | 2.709
blender: Classroom - CPU-Only | 105.39 | 129.48 | 57.11 | 169.86
blender: Barbershop - CPU-Only | 499.63 | 644.98 | 284.63 | 832.27
minife: Small | 65410.4 | 36361.7 | 41274.2 | 22674.0
blender: BMW27 - CPU-Only | 50.64 | 62.77 | 30.19 | 83.01
build-nodejs: Time To Compile | 365.024 | 571.011 | 250.429 | 678.013
openfoam: drivaerFastback, Medium Mesh Size - Execution Time | 344.1676 | 552.5657 | 374.69664 | 872.07907
openssl: SHA512 | 35412809363 | 30659465710 | - | 14108556757
openfoam: drivaerFastback, Small Mesh Size - Execution Time | 41.413075 | 68.294615 | 43.169709 | 102.64835
pgbench: 100 - 1000 - Read Only | 1947525 | 1087488 | 2092710 | 916786
pgbench: 100 - 1000 - Read Only - Average Latency | 0.514 | 0.920 | 0.478 | 1.091
c-ray: 5K - 16 | 50.247 | 68.699 | 88.891 | 113.146
c-ray: 4K - 16 | 28.323 | 38.668 | 49.691 | 63.698
liquid-dsp: 64 - 256 - 57 | 1929666667 | 1654733333 | 2529533333 | 1173933333
build-godot: Time To Compile | 147.870 | 195.497 | 127.448 | 274.041
openfoam: drivaerFastback, Medium Mesh Size - Mesh Time | 91.974527 | 121.27861 | 123.99717 | 194.78986
openssl: AES-256-GCM | 266125150527 | 249095500597 | 236482948213 | 126510935517
rocksdb: Rand Read | 346532586 | 243166771 | 324703669 | 165816939
openfoam: drivaerFastback, Small Mesh Size - Mesh Time | 18.384768 | 24.784465 | 24.788236 | 37.291514
john-the-ripper: bcrypt | 57038 | 50709 | 91409 | 45216
john-the-ripper: Blowfish | 57032 | 50679 | 91221 | 45204
clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache | 449.44 | 348.41 | 488.88 | 252.63
rocksdb: Read Rand Write Rand | 5584014 | 4254497 | 3756585 | 2912791
openssl: AES-128-GCM | 299797102137 | 287591524763 | 256101795407 | 156957263187
build-llvm: Ninja | 182.063 | 257.853 | 198.500 | 344.432
clickhouse: 100M Rows Hits Dataset, Second Run | 479.94 | 375.29 | 508.87 | 269.15
clickhouse: 100M Rows Hits Dataset, Third Run | 495.03 | 382.14 | 514.29 | 274.07
xmrig: KawPow - 1M | 21906.0 | 14540.4 | - | 11873.5
xmrig: CryptoNight-Heavy - 1M | 21867.2 | 14553.9 | - | 11885.2
xmrig: Monero - 1M | 21872.1 | 14564.0 | - | 11892.7
xmrig: CryptoNight-Femto UPX2 - 1M | 21851.9 | 14547.5 | - | 11889.4
build-gem5: Time To Compile | 186.768 | 251.402 | 213.288 | 334.808
liquid-dsp: 64 - 256 - 32 | 3258733333 | 2573800000 | 2261533333 | 1851200000
xmrig: Wownero - 1M | 28304.0 | 20409.6 | - | 16874.2
compress-7zip: Compression Rating | 383460 | 296731 | 320933 | 229844
openssl: SHA256 | 56971174537 | 47395512363 | - | 39206680823
rocksdb: Read While Writing | 8522597 | 6872389 | 6304192 | 5955845
compress-7zip: Decompression Rating | 331505 | 284576 | 288408 | 232070
pgbench: 100 - 1000 - Read Write | 4420 | 4541 | 4458 | 4483
pgbench: 100 - 1000 - Read Write - Average Latency | 226.244 | 220.273 | 224.323 | 223.171
stockfish: Chess Benchmark | 81440801 | 55548662 | 85180777 | 43501453

John The Ripper

Test: WPA PSK

John The Ripper 2023.03.14, Real C/S (more is better):
Graviton4 r8g.16xlarge: 57444 (SE +/- 10.97, N = 3)
Graviton3 r7g.16xlarge: 52631 (SE +/- 0.00, N = 3)
EPYC 9R14 r7a.16xlarge: 371747 (SE +/- 164.01, N = 3)
Graviton2 r6g.16xlarge: 49854 (SE +/- 0.00, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt; EPYC 9R14 r7a.16xlarge additionally: -m64
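For "more is better" results such as this one, a relative comparison is a plain ratio against a chosen baseline. The following is a minimal sketch using the WPA PSK values above; the helper name `relative_to` and the choice of Graviton2 as the baseline are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# Real C/S values for John The Ripper (WPA PSK), copied from the result above.
wpa_psk = {
    "Graviton4 r8g.16xlarge": 57444,
    "Graviton3 r7g.16xlarge": 52631,
    "EPYC 9R14 r7a.16xlarge": 371747,
    "Graviton2 r6g.16xlarge": 49854,
}


def relative_to(results, baseline):
    """Divide every result by the baseline's result (higher is better).

    Hypothetical helper for illustration only.
    """
    base = results[baseline]
    return {name: value / base for name, value in results.items()}


ratios = relative_to(wpa_psk, "Graviton2 r6g.16xlarge")
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x of Graviton2")
```

The same helper can be reused for any other "more is better" section by swapping in that section's four values.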

John The Ripper

Test: MD5

John The Ripper 2023.03.14, Real C/S (more is better):
Graviton4 r8g.16xlarge: 1579667 (SE +/- 1201.85, N = 3)
Graviton3 r7g.16xlarge: 1451333 (SE +/- 333.33, N = 3)
EPYC 9R14 r7a.16xlarge: 8647000 (SE +/- 7571.88, N = 3)
Graviton2 r6g.16xlarge: 1332667 (SE +/- 666.67, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt; EPYC 9R14 r7a.16xlarge additionally: -m64

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6, samples/s (more is better):
Graviton4 r8g.16xlarge: 185000000 (SE +/- 0.00, N = 3)
Graviton3 r7g.16xlarge: 166313333 (SE +/- 3333.33, N = 3)
EPYC 9R14 r7a.16xlarge: 759700000 (SE +/- 1140935.29, N = 3)
Graviton2 r6g.16xlarge: 150236667 (SE +/- 1503344.42, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL, byte/s (more is better):
Graviton4 r8g.16xlarge: 75190548707 (SE +/- 558939.35, N = 3)
Graviton3 r7g.16xlarge: 69430704540 (SE +/- 6321669.59, N = 3)
EPYC 9R14 r7a.16xlarge: 217491096737 (SE +/- 197460025.45, N = 3)
Graviton2 r6g.16xlarge: 46479276533 (SE +/- 36473958.31, N = 3)
All systems: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024). EPYC 9R14 r7a.16xlarge additional parameters: -engine qatengine -async_jobs 8

OpenSSL

Algorithm: ChaCha20

OpenSSL, byte/s (more is better):
Graviton4 r8g.16xlarge: 101898720323 (SE +/- 66408.09, N = 3)
Graviton3 r7g.16xlarge: 94147502077 (SE +/- 4627810.23, N = 3)
EPYC 9R14 r7a.16xlarge: 308611390393 (SE +/- 350599170.76, N = 3)
Graviton2 r6g.16xlarge: 67045933947 (SE +/- 10580063.28, N = 3)
All systems: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024). EPYC 9R14 r7a.16xlarge additional parameters: -engine qatengine -async_jobs 8

John The Ripper

Test: HMAC-SHA512

John The Ripper 2023.03.14, Real C/S (more is better):
Graviton4 r8g.16xlarge: 108066333 (SE +/- 640028.21, N = 3)
Graviton3 r7g.16xlarge: 86689000 (SE +/- 76622.45, N = 3)
EPYC 9R14 r7a.16xlarge: 256384000 (SE +/- 730794.32, N = 3)
Graviton2 r6g.16xlarge: 61258333 (SE +/- 126483.64, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt; EPYC 9R14 r7a.16xlarge additionally: -m64

RocksDB

Test: Update Random

RocksDB 9.0, Op/s (more is better):
Graviton4 r8g.16xlarge: 1274616 (SE +/- 11457.12, N = 3)
Graviton3 r7g.16xlarge: 817470 (SE +/- 7555.40, N = 15)
EPYC 9R14 r7a.16xlarge: 742898 (SE +/- 4861.01, N = 3)
Graviton2 r6g.16xlarge: 305604 (SE +/- 1081.97, N = 3)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 4.0.2, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 202.26 (SE +/- 0.34, N = 3)
Graviton3 r7g.16xlarge: 247.60 (SE +/- 0.13, N = 3)
EPYC 9R14 r7a.16xlarge: 84.52 (SE +/- 0.11, N = 3)
Graviton2 r6g.16xlarge: 321.20 (SE +/- 0.45, N = 3)
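For "fewer is better" results such as render times, the ratio has to be inverted so that a larger number still means a faster system. A minimal sketch with the Pabellon Barcelona times above; the function name `speedup_from_seconds` and the Graviton2 baseline are illustrative assumptions.

```python
# Render times in seconds for Blender (Pabellon Barcelona, CPU-Only),
# copied from the result above. Lower is better.
render_seconds = {
    "Graviton4 r8g.16xlarge": 202.26,
    "Graviton3 r7g.16xlarge": 247.60,
    "EPYC 9R14 r7a.16xlarge": 84.52,
    "Graviton2 r6g.16xlarge": 321.20,
}


def speedup_from_seconds(times, baseline):
    """Speedup over the baseline for a time-based metric.

    Hypothetical helper: baseline_time / time, so values above 1.0 mean
    the system finished faster than the baseline.
    """
    base = times[baseline]
    return {name: base / seconds for name, seconds in times.items()}


speedups = speedup_from_seconds(render_seconds, "Graviton2 r6g.16xlarge")
for name, s in speedups.items():
    print(f"{name}: {s:.2f}x faster than Graviton2")
```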

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 4.0.2, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 95.01 (SE +/- 0.26, N = 3)
Graviton3 r7g.16xlarge: 115.47 (SE +/- 0.23, N = 3)
EPYC 9R14 r7a.16xlarge: 39.16 (SE +/- 0.06, N = 3)
Graviton2 r6g.16xlarge: 147.77 (SE +/- 0.44, N = 3)

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325, Mbps (more is better):
Graviton4 r8g.16xlarge: 14402.5 (SE +/- 34.15, N = 3)
Graviton3 r7g.16xlarge: 12612.6 (SE +/- 20.13, N = 3)
EPYC 9R14 r7a.16xlarge: 28256.8 (SE +/- 148.22, N = 3)
Graviton2 r6g.16xlarge: 8104.1 (SE +/- 19.11, N = 3)
(CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl; EPYC 9R14 r7a.16xlarge additionally: -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325, Mbps (more is better):
Graviton4 r8g.16xlarge: 1332.5 (SE +/- 0.03, N = 3; MIN: 784 / MAX: 1332.6)
Graviton3 r7g.16xlarge: 1053.8 (SE +/- 0.03, N = 3; MIN: 599.4 / MAX: 1053.9)
EPYC 9R14 r7a.16xlarge: 2400.2 (SE +/- 0.07, N = 3; MIN: 1696.6 / MAX: 2400.3)
Graviton2 r6g.16xlarge: 705.9 (SE +/- 0.00, N = 3; MIN: 415.8)
(CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl; EPYC 9R14 r7a.16xlarge additionally: -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq

Xmrig

Variant: GhostRider - Hash Count: 1M

Xmrig 6.21, H/s (more is better):
Graviton4 r8g.16xlarge: 5958.7 (SE +/- 47.30, N = 12)
Graviton3 r7g.16xlarge: 2108.5 (SE +/- 4.85, N = 3)
EPYC 9R14 r7a.16xlarge: 6899.8 (SE +/- 2.15, N = 3)
Graviton2 r6g.16xlarge: 2067.5 (SE +/- 1.60, N = 3)
(CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc; EPYC 9R14 r7a.16xlarge additionally: -maes
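The SE figures are easier to compare across systems when expressed relative to the mean. The sketch below does that for the GhostRider numbers above; the function name `relative_se_percent` is a hypothetical helper, and the calculation is only SE divided by the reported result, not a statistic produced by the test suite itself.

```python
# (result in H/s, standard error, number of runs) for Xmrig GhostRider,
# copied from the result above.
ghostrider = {
    "Graviton4 r8g.16xlarge": (5958.7, 47.30, 12),
    "Graviton3 r7g.16xlarge": (2108.5, 4.85, 3),
    "EPYC 9R14 r7a.16xlarge": (6899.8, 2.15, 3),
    "Graviton2 r6g.16xlarge": (2067.5, 1.60, 3),
}


def relative_se_percent(mean, se):
    """Standard error as a percentage of the reported mean (hypothetical helper)."""
    return 100.0 * se / mean


rel = {
    name: relative_se_percent(mean, se)
    for name, (mean, se, _runs) in ghostrider.items()
}
for name, pct in rel.items():
    runs = ghostrider[name][2]
    print(f"{name}: SE is {pct:.2f}% of the mean over {runs} runs")
```

On these numbers the Graviton4 run has the largest relative SE of the four, and it is also the only one reported with N = 12.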

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 7.87502066 (SE +/- 0.01884341, N = 3)
Graviton3 r7g.16xlarge: 13.95584840 (SE +/- 0.01649632, N = 3)
EPYC 9R14 r7a.16xlarge: 10.91576610 (SE +/- 0.01638008, N = 3)
Graviton2 r6g.16xlarge: 25.52030050 (SE +/- 0.02637428, N = 3)
(F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024, Ns Per Day (more is better):
Graviton4 r8g.16xlarge: 4.831 (SE +/- 0.004, N = 3)
Graviton3 r7g.16xlarge: 4.163 (SE +/- 0.001, N = 3)
EPYC 9R14 r7a.16xlarge: 8.519 (SE +/- 0.041, N = 3)
Graviton2 r6g.16xlarge: 2.709 (SE +/- 0.001, N = 3)
(CXX) g++ options: -O3 -lm

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 4.0.2, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 105.39 (SE +/- 0.11, N = 3)
Graviton3 r7g.16xlarge: 129.48 (SE +/- 0.06, N = 3)
EPYC 9R14 r7a.16xlarge: 57.11 (SE +/- 0.14, N = 3)
Graviton2 r6g.16xlarge: 169.86 (SE +/- 0.42, N = 3)

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 4.0.2, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 499.63 (SE +/- 0.76, N = 3)
Graviton3 r7g.16xlarge: 644.98 (SE +/- 0.24, N = 3)
EPYC 9R14 r7a.16xlarge: 284.63 (SE +/- 0.03, N = 3)
Graviton2 r6g.16xlarge: 832.27 (SE +/- 2.45, N = 3)

miniFE

Problem Size: Small

miniFE 2.2, CG Mflops (more is better):
Graviton4 r8g.16xlarge: 65410.4 (SE +/- 28.12, N = 3)
Graviton3 r7g.16xlarge: 36361.7 (SE +/- 21.14, N = 3)
EPYC 9R14 r7a.16xlarge: 41274.2 (SE +/- 136.71, N = 3)
Graviton2 r6g.16xlarge: 22674.0 (SE +/- 53.93, N = 3)
(CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 4.0.2, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 50.64 (SE +/- 0.10, N = 3)
Graviton3 r7g.16xlarge: 62.77 (SE +/- 0.02, N = 3)
EPYC 9R14 r7a.16xlarge: 30.19 (SE +/- 0.08, N = 3)
Graviton2 r6g.16xlarge: 83.01 (SE +/- 0.30, N = 3)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 21.7.2, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 365.02 (SE +/- 0.19, N = 3)
Graviton3 r7g.16xlarge: 571.01 (SE +/- 0.62, N = 3)
EPYC 9R14 r7a.16xlarge: 250.43 (SE +/- 0.19, N = 3)
Graviton2 r6g.16xlarge: 678.01 (SE +/- 0.88, N = 3)

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 344.17 (-mcpu=native)
Graviton3 r7g.16xlarge: 552.57 (-mcpu=native)
EPYC 9R14 r7a.16xlarge: 374.70 (-m64)
Graviton2 r6g.16xlarge: 872.08 (-mcpu=native)
(CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenSSL

Algorithm: SHA512

OpenSSL, byte/s (more is better):
Graviton4 r8g.16xlarge: 35412809363 (SE +/- 8695334.33, N = 3)
Graviton3 r7g.16xlarge: 30659465710 (SE +/- 3137583.25, N = 3)
Graviton2 r6g.16xlarge: 14108556757 (SE +/- 333393.11, N = 3)
OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM 10, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 41.41 (-mcpu=native)
Graviton3 r7g.16xlarge: 68.29 (-mcpu=native)
EPYC 9R14 r7a.16xlarge: 43.17 (-m64)
Graviton2 r6g.16xlarge: 102.65 (-mcpu=native)
(CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

PostgreSQL 16, TPS (more is better):
Graviton4 r8g.16xlarge: 1947525 (SE +/- 6233.51, N = 3)
Graviton3 r7g.16xlarge: 1087488 (SE +/- 5453.81, N = 3)
EPYC 9R14 r7a.16xlarge: 2092710 (SE +/- 7113.73, N = 3)
Graviton2 r6g.16xlarge: 916786 (SE +/- 5859.87, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 16, ms (fewer is better):
Graviton4 r8g.16xlarge: 0.514 (SE +/- 0.002, N = 3)
Graviton3 r7g.16xlarge: 0.920 (SE +/- 0.005, N = 3)
EPYC 9R14 r7a.16xlarge: 0.478 (SE +/- 0.002, N = 3)
Graviton2 r6g.16xlarge: 1.091 (SE +/- 0.007, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
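The read-only latency figures appear consistent with the read-only TPS figures for the same 1000-client run: dividing the client count by the throughput gives roughly the reported average latency. The sketch below checks that relationship on the numbers from this document; treating latency as clients / TPS is an assumption made here for illustration, not a statement of how the benchmark computes it.

```python
CLIENTS = 1000  # "Clients: 1000" from the test configuration above

# (TPS, reported average latency in ms) for the Read Only mode,
# copied from the two PostgreSQL results above.
read_only = {
    "Graviton4 r8g.16xlarge": (1947525, 0.514),
    "Graviton3 r7g.16xlarge": (1087488, 0.920),
    "EPYC 9R14 r7a.16xlarge": (2092710, 0.478),
    "Graviton2 r6g.16xlarge": (916786, 1.091),
}


def estimated_latency_ms(clients, tps):
    """Assumed relationship: each client waits clients / TPS seconds per transaction."""
    return clients / tps * 1000.0


differences = {}
for name, (tps, reported_ms) in read_only.items():
    estimate = estimated_latency_ms(CLIENTS, tps)
    differences[name] = abs(estimate - reported_ms)
    print(f"{name}: estimated {estimate:.3f} ms, reported {reported_ms:.3f} ms")
```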

C-Ray

Resolution: 5K - Rays Per Pixel: 16

C-Ray 2.0, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 50.25 (SE +/- 0.07, N = 3)
Graviton3 r7g.16xlarge: 68.70 (SE +/- 0.01, N = 3)
EPYC 9R14 r7a.16xlarge: 88.89 (SE +/- 0.10, N = 3)
Graviton2 r6g.16xlarge: 113.15 (SE +/- 0.03, N = 3)
(CC) gcc options: -lpthread -lm

C-Ray

Resolution: 4K - Rays Per Pixel: 16

C-Ray 2.0, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 28.32 (SE +/- 0.03, N = 3)
Graviton3 r7g.16xlarge: 38.67 (SE +/- 0.04, N = 3)
EPYC 9R14 r7a.16xlarge: 49.69 (SE +/- 0.03, N = 3)
Graviton2 r6g.16xlarge: 63.70 (SE +/- 0.02, N = 3)
(CC) gcc options: -lpthread -lm

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6, samples/s (more is better):
Graviton4 r8g.16xlarge: 1929666667 (SE +/- 33333.33, N = 3)
Graviton3 r7g.16xlarge: 1654733333 (SE +/- 133333.33, N = 3)
EPYC 9R14 r7a.16xlarge: 2529533333 (SE +/- 6691619.97, N = 3)
Graviton2 r6g.16xlarge: 1173933333 (SE +/- 88191.71, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 147.87 (SE +/- 0.35, N = 3)
Graviton3 r7g.16xlarge: 195.50 (SE +/- 0.85, N = 3)
EPYC 9R14 r7a.16xlarge: 127.45 (SE +/- 0.19, N = 3)
Graviton2 r6g.16xlarge: 274.04 (SE +/- 0.76, N = 3)

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenFOAM 10, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 91.97 (-mcpu=native)
Graviton3 r7g.16xlarge: 121.28 (-mcpu=native)
EPYC 9R14 r7a.16xlarge: 124.00 (-m64)
Graviton2 r6g.16xlarge: 194.79 (-mcpu=native)
(CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenSSL

Algorithm: AES-256-GCM

OpenSSL, byte/s (more is better):
Graviton4 r8g.16xlarge: 266125150527 (SE +/- 6417811.59, N = 3)
Graviton3 r7g.16xlarge: 249095500597 (SE +/- 4988409.71, N = 3)
EPYC 9R14 r7a.16xlarge: 236482948213 (SE +/- 338102973.44, N = 3)
Graviton2 r6g.16xlarge: 126510935517 (SE +/- 14487293.04, N = 3)
All systems: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024). EPYC 9R14 r7a.16xlarge additional parameters: -engine qatengine -async_jobs 8

RocksDB

Test: Random Read

RocksDB 9.0, Op/s (more is better):
Graviton4 r8g.16xlarge: 346532586 (SE +/- 1371022.17, N = 3)
Graviton3 r7g.16xlarge: 243166771 (SE +/- 36358.69, N = 3)
EPYC 9R14 r7a.16xlarge: 324703669 (SE +/- 827628.35, N = 3)
Graviton2 r6g.16xlarge: 165816939 (SE +/- 2004027.04, N = 3)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM 10, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 18.38 (-mcpu=native)
Graviton3 r7g.16xlarge: 24.78 (-mcpu=native)
EPYC 9R14 r7a.16xlarge: 24.79 (-m64)
Graviton2 r6g.16xlarge: 37.29 (-mcpu=native)
(CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

John The Ripper

Test: bcrypt

John The Ripper 2023.03.14, Real C/S (more is better):
Graviton4 r8g.16xlarge: 57038 (SE +/- 2.33, N = 3)
Graviton3 r7g.16xlarge: 50709 (SE +/- 8.67, N = 3)
EPYC 9R14 r7a.16xlarge: 91409 (SE +/- 75.52, N = 3)
Graviton2 r6g.16xlarge: 45216 (SE +/- 4.70, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt; EPYC 9R14 r7a.16xlarge additionally: -m64

John The Ripper

Test: Blowfish

John The Ripper 2023.03.14, Real C/S (more is better):
Graviton4 r8g.16xlarge: 57032 (SE +/- 9.29, N = 3)
Graviton3 r7g.16xlarge: 50679 (SE +/- 15.39, N = 3)
EPYC 9R14 r7a.16xlarge: 91221 (SE +/- 117.95, N = 3)
Graviton2 r6g.16xlarge: 45204 (SE +/- 30.67, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt; EPYC 9R14 r7a.16xlarge additionally: -m64

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse 22.12.3.5, Queries Per Minute, Geo Mean (more is better):
Graviton4 r8g.16xlarge: 449.44 (SE +/- 4.96, N = 9; MIN: 42.7 / MAX: 6666.67)
Graviton3 r7g.16xlarge: 348.41 (SE +/- 5.68, N = 9; MIN: 33.19 / MAX: 5000)
EPYC 9R14 r7a.16xlarge: 488.88 (SE +/- 6.79, N = 3; MIN: 40.21 / MAX: 5000)
Graviton2 r6g.16xlarge: 252.63 (SE +/- 3.33, N = 3; MIN: 20.56 / MAX: 3157.89)

RocksDB

Test: Read Random Write Random

RocksDB 9.0, Op/s (more is better):
Graviton4 r8g.16xlarge: 5584014 (SE +/- 11170.68, N = 3)
Graviton3 r7g.16xlarge: 4254497 (SE +/- 1357.98, N = 3)
EPYC 9R14 r7a.16xlarge: 3756585 (SE +/- 17358.30, N = 3)
Graviton2 r6g.16xlarge: 2912791 (SE +/- 47713.48, N = 12)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenSSL

Algorithm: AES-128-GCM

OpenSSL, byte/s (more is better):
Graviton4 r8g.16xlarge: 299797102137 (SE +/- 7802917.99, N = 3)
Graviton3 r7g.16xlarge: 287591524763 (SE +/- 3618954.21, N = 3)
EPYC 9R14 r7a.16xlarge: 256101795407 (SE +/- 358257182.84, N = 3)
Graviton2 r6g.16xlarge: 156957263187 (SE +/- 11278788.96, N = 3)
All systems: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024). EPYC 9R14 r7a.16xlarge additional parameters: -engine qatengine -async_jobs 8

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 182.06 (SE +/- 0.20, N = 3)
Graviton3 r7g.16xlarge: 257.85 (SE +/- 0.38, N = 3)
EPYC 9R14 r7a.16xlarge: 198.50 (SE +/- 0.91, N = 3)
Graviton2 r6g.16xlarge: 344.43 (SE +/- 0.93, N = 3)

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse 22.12.3.5, Queries Per Minute, Geo Mean (more is better):
Graviton4 r8g.16xlarge: 479.94 (SE +/- 5.05, N = 9; MIN: 43.1 / MAX: 6000)
Graviton3 r7g.16xlarge: 375.29 (SE +/- 3.03, N = 9; MIN: 33.15 / MAX: 5000)
EPYC 9R14 r7a.16xlarge: 508.87 (SE +/- 2.24, N = 3; MIN: 40.57 / MAX: 6000)
Graviton2 r6g.16xlarge: 269.15 (SE +/- 5.29, N = 3; MIN: 20.53 / MAX: 4000)

ClickHouse

100M Rows Hits Dataset, Third Run

ClickHouse 22.12.3.5, Queries Per Minute, Geo Mean (more is better):
Graviton4 r8g.16xlarge: 495.03 (SE +/- 6.71, N = 9; MIN: 43.07 / MAX: 6666.67)
Graviton3 r7g.16xlarge: 382.14 (SE +/- 2.76, N = 9; MIN: 33.22 / MAX: 4615.38)
EPYC 9R14 r7a.16xlarge: 514.29 (SE +/- 9.86, N = 3; MIN: 40.51 / MAX: 6000)
Graviton2 r6g.16xlarge: 274.07 (SE +/- 6.12, N = 3; MIN: 20.67 / MAX: 3750)

Xmrig

Variant: KawPow - Hash Count: 1M

Xmrig 6.21, H/s (more is better):
Graviton4 r8g.16xlarge: 21906.0 (SE +/- 59.25, N = 3)
Graviton3 r7g.16xlarge: 14540.4 (SE +/- 6.29, N = 3)
Graviton2 r6g.16xlarge: 11873.5 (SE +/- 9.89, N = 3)
(CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: CryptoNight-Heavy - Hash Count: 1M

Xmrig 6.21, H/s (more is better):
Graviton4 r8g.16xlarge: 21867.2 (SE +/- 16.81, N = 3)
Graviton3 r7g.16xlarge: 14553.9 (SE +/- 14.63, N = 3)
Graviton2 r6g.16xlarge: 11885.2 (SE +/- 8.20, N = 3)
(CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Monero - Hash Count: 1M

Xmrig 6.21, H/s (more is better):
Graviton4 r8g.16xlarge: 21872.1 (SE +/- 40.15, N = 3)
Graviton3 r7g.16xlarge: 14564.0 (SE +/- 12.04, N = 3)
Graviton2 r6g.16xlarge: 11892.7 (SE +/- 10.98, N = 3)
(CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: CryptoNight-Femto UPX2 - Hash Count: 1M

Xmrig 6.21, H/s (more is better):
Graviton4 r8g.16xlarge: 21851.9 (SE +/- 9.16, N = 3)
Graviton3 r7g.16xlarge: 14547.5 (SE +/- 4.95, N = 3)
Graviton2 r6g.16xlarge: 11889.4 (SE +/- 1.52, N = 3)
(CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 23.0.1, Seconds (fewer is better):
Graviton4 r8g.16xlarge: 186.77 (SE +/- 1.43, N = 12)
Graviton3 r7g.16xlarge: 251.40 (SE +/- 2.33, N = 3)
EPYC 9R14 r7a.16xlarge: 213.29 (SE +/- 2.25, N = 3)
Graviton2 r6g.16xlarge: 334.81 (SE +/- 4.90, N = 9)

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6, samples/s (more is better):
Graviton4 r8g.16xlarge: 3258733333 (SE +/- 533333.33, N = 3)
Graviton3 r7g.16xlarge: 2573800000 (SE +/- 404145.19, N = 3)
EPYC 9R14 r7a.16xlarge: 2261533333 (SE +/- 3199131.83, N = 3)
Graviton2 r6g.16xlarge: 1851200000 (SE +/- 300000.00, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Xmrig

Variant: Wownero - Hash Count: 1M

Xmrig 6.21, H/s (more is better):
Graviton4 r8g.16xlarge: 28304.0 (SE +/- 16.31, N = 3)
Graviton3 r7g.16xlarge: 20409.6 (SE +/- 8.34, N = 3)
Graviton2 r6g.16xlarge: 16874.2 (SE +/- 14.38, N = 3)
(CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

7-Zip Compression

Test: Compression Rating

7-Zip Compression 24.05, MIPS (more is better):
Graviton4 r8g.16xlarge: 383460 (SE +/- 397.84, N = 3)
Graviton3 r7g.16xlarge: 296731 (SE +/- 546.41, N = 3)
EPYC 9R14 r7a.16xlarge: 320933 (SE +/- 1508.30, N = 3)
Graviton2 r6g.16xlarge: 229844 (SE +/- 252.67, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenSSL

Algorithm: SHA256

OpenSSL, byte/s (more is better):
Graviton4 r8g.16xlarge: 56971174537 (SE +/- 10002237.60, N = 3)
Graviton3 r7g.16xlarge: 47395512363 (SE +/- 29159419.58, N = 3)
Graviton2 r6g.16xlarge: 39206680823 (SE +/- 13778007.51, N = 3)
OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

RocksDB

Test: Read While Writing

RocksDB 9.0, Op/s (more is better):
Graviton4 r8g.16xlarge: 8522597 (SE +/- 26974.32, N = 3)
Graviton3 r7g.16xlarge: 6872389 (SE +/- 6592.32, N = 3)
EPYC 9R14 r7a.16xlarge: 6304192 (SE +/- 43017.34, N = 13)
Graviton2 r6g.16xlarge: 5955845 (SE +/- 15757.89, N = 3)
(CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 24.05, MIPS (more is better):
Graviton4 r8g.16xlarge: 331505 (SE +/- 107.45, N = 3)
Graviton3 r7g.16xlarge: 284576 (SE +/- 70.02, N = 3)
EPYC 9R14 r7a.16xlarge: 288408 (SE +/- 148.35, N = 3)
Graviton2 r6g.16xlarge: 232070 (SE +/- 14.19, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

PostgreSQL 16, TPS (more is better):
Graviton4 r8g.16xlarge: 4420 (SE +/- 14.53, N = 3)
Graviton3 r7g.16xlarge: 4541 (SE +/- 45.23, N = 3)
EPYC 9R14 r7a.16xlarge: 4458 (SE +/- 34.82, N = 3)
Graviton2 r6g.16xlarge: 4483 (SE +/- 33.26, N = 11)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

PostgreSQL 16, ms (fewer is better):
Graviton4 r8g.16xlarge: 226.24 (SE +/- 0.74, N = 3)
Graviton3 r7g.16xlarge: 220.27 (SE +/- 2.18, N = 3)
EPYC 9R14 r7a.16xlarge: 224.32 (SE +/- 1.74, N = 3)
Graviton2 r6g.16xlarge: 223.17 (SE +/- 1.59, N = 11)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Stockfish

Chess Benchmark

Stockfish 16.1, Nodes Per Second (more is better):
Graviton4 r8g.16xlarge: 81440801 (SE +/- 2235406.92, N = 15)
Graviton3 r7g.16xlarge: 55548662 (SE +/- 1152869.30, N = 12)
EPYC 9R14 r7a.16xlarge: 85180777 (SE +/- 927253.56, N = 12)
Graviton2 r6g.16xlarge: 43501453 (SE +/- 1062798.48, N = 12)
(CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver; EPYC 9R14 r7a.16xlarge additionally: -m64 -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2


Phoronix Test Suite v10.8.5