Graviton4 r8g.16xlarge vs. AMD EPYC 4th Gen

Initial benchmarks by Michael Larabel

HTML result view exported from: https://openbenchmarking.org/result/2407108-NE-2407106NE99.

Graviton4 r8g.16xlarge
  Processor: ARMv8 Neoverse-V2 (64 Cores)
  Motherboard: Amazon EC2 r8g.16xlarge (1.0 BIOS)
  Chipset: Amazon Device 0200
  Memory: 496GB
  Disk: 429GB Amazon Elastic Block Store
  Network: Amazon Elastic
  OS: Ubuntu 24.04
  Kernel: 6.8.0-1009-aws (aarch64)
  Compiler: GCC 13.2.0
  File-System: ext4
  System Layer: amazon

EPYC 9R14 r7a.16xlarge
  Processor: AMD EPYC 9R14 (64 Cores)
  Motherboard: Amazon EC2 r7a.16xlarge (1.0 BIOS)
  Chipset: Intel 440FX 82441FX PMC
  Memory: 1 x 512GB DDR5-4800MT/s
  Graphics: simpledrmdrmfb
  Kernel: 6.8.0-1009-aws (x86_64)
  Screen Resolution: 800x600

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- Graviton4 r8g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v
- EPYC 9R14 r7a.16xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Python Details
- Python 3.12.3

Security Details
- Graviton4 r8g.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- EPYC 9R14 r7a.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Processor Details
- EPYC 9R14 r7a.16xlarge: CPU Microcode: 0xa101148

Test                                                          Graviton4 r8g.16xlarge  EPYC 9R14 r7a.16xlarge
minife: Small                                                 65410.4                 41274.2
incompact3d: input.i3d 193 Cells Per Direction                7.87502066              10.9157661
openfoam: drivaerFastback, Small Mesh Size - Mesh Time        18.384768               24.788236
openfoam: drivaerFastback, Small Mesh Size - Execution Time   41.413075               43.169709
openfoam: drivaerFastback, Medium Mesh Size - Mesh Time       91.974527               123.99717
openfoam: drivaerFastback, Medium Mesh Size - Execution Time  344.1676                374.69664
xmrig: KawPow - 1M                                            21906.0
xmrig: Monero - 1M                                            21872.1
xmrig: Wownero - 1M                                           28304.0
xmrig: GhostRider - 1M                                        5958.7                  6899.8
xmrig: CryptoNight-Heavy - 1M                                 21867.2
xmrig: CryptoNight-Femto UPX2 - 1M                            21851.9
srsran: PDSCH Processor Benchmark, Throughput Total           14402.5                 28256.8
srsran: PUSCH Processor Benchmark, Throughput Total           1332.5                  2400.2
john-the-ripper: bcrypt                                       57038                   91409
john-the-ripper: WPA PSK                                      57444                   371747
john-the-ripper: Blowfish                                     57032                   91221
john-the-ripper: HMAC-SHA512                                  108066333               256384000
john-the-ripper: MD5                                          1579667                 8647000
compress-7zip: Compression Rating                             383460                  320933
compress-7zip: Decompression Rating                           331505                  288408
stockfish: Chess Benchmark                                    81440801                85180777
build-gem5: Time To Compile                                   186.768                 213.288
build-godot: Time To Compile                                  147.870                 127.448
build-llvm: Ninja                                             182.063                 198.500
build-nodejs: Time To Compile                                 365.024                 250.429
c-ray: 4K - 16                                                28.323                  49.691
c-ray: 5K - 16                                                50.247                  88.891
liquid-dsp: 64 - 256 - 32                                     3258733333              2261533333
liquid-dsp: 64 - 256 - 57                                     1929666667              2529533333
liquid-dsp: 64 - 256 - 512                                    185000000               759700000
openssl: SHA256                                               56971174537
openssl: SHA512                                               35412809363
openssl: ChaCha20                                             101898720323            308611390393
openssl: AES-128-GCM                                          299797102137            256101795407
openssl: AES-256-GCM                                          266125150527            236482948213
openssl: ChaCha20-Poly1305                                    75190548707             217491096737
clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache    449.44                  488.88
clickhouse: 100M Rows Hits Dataset, Second Run                479.94                  508.87
clickhouse: 100M Rows Hits Dataset, Third Run                 495.03                  514.29
gromacs: MPI CPU - water_GMX50_bare                           4.831                   8.519
pgbench: 100 - 1000 - Read Only                               1947525                 2092710
pgbench: 100 - 1000 - Read Only - Average Latency             0.514                   0.478
pgbench: 100 - 1000 - Read Write                              4420                    4458
pgbench: 100 - 1000 - Read Write - Average Latency            226.244                 224.323
rocksdb: Rand Read                                            346532586               324703669
rocksdb: Update Rand                                          1274616                 742898
rocksdb: Read While Writing                                   8522597                 6304192
rocksdb: Read Rand Write Rand                                 5584014                 3756585
blender: BMW27 - CPU-Only                                     50.64                   30.19
blender: Classroom - CPU-Only                                 105.39                  57.11
blender: Fishy Cat - CPU-Only                                 95.01                   39.16
blender: Barbershop - CPU-Only                                499.63                  284.63
blender: Pabellon Barcelona - CPU-Only                        202.26                  84.52
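The overview results above mix "More Is Better" and "Fewer Is Better" metrics, so raw values cannot be compared directly across tests. The following is a minimal Python sketch, not part of the original export, that normalizes a few of the results into a single "Graviton4 advantage" ratio and takes their geometric mean. The four tests chosen, the function names, and the `RESULTS` layout are illustrative assumptions; the numbers themselves are copied from this result file.

```python
from math import prod

# (test, Graviton4 result, EPYC 9R14 result, higher_is_better)
# Values are copied from the overview results; the selection of tests is arbitrary.
RESULTS = [
    ("minife: Small", 65410.4, 41274.2, True),                     # CG Mflops
    ("build-llvm: Ninja", 182.063, 198.500, False),                # seconds
    ("compress-7zip: Compression Rating", 383460, 320933, True),   # MIPS
    ("blender: BMW27 - CPU-Only", 50.64, 30.19, False),            # seconds
]


def graviton_advantage(graviton, epyc, higher_is_better):
    """Return > 1.0 when Graviton4 is ahead and < 1.0 when the EPYC is ahead."""
    return graviton / epyc if higher_is_better else epyc / graviton


def geometric_mean(values):
    """Geometric mean, the usual way to average performance ratios."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


ratios = {name: graviton_advantage(g, e, hib) for name, g, e, hib in RESULTS}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.3f}")
print(f"geometric mean over these tests: {geometric_mean(ratios.values()):.3f}")
```

A geometric mean over a hand-picked subset says nothing about the full result set; extending `RESULTS` to every test that has values for both systems would be needed for an overall figure.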

miniFE

Problem Size: Small

miniFE 2.2 (CG Mflops, More Is Better)
Graviton4 r8g.16xlarge: 65410.4 (SE +/- 28.12, N = 3)
EPYC 9R14 r7a.16xlarge: 41274.2 (SE +/- 136.71, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 7.87502066 (SE +/- 0.01884341, N = 3)
EPYC 9R14 r7a.16xlarge: 10.91576610 (SE +/- 0.01638008, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 18.38 (flags: -mcpu=native)
EPYC 9R14 r7a.16xlarge: 24.79 (flags: -m64)
1. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 41.41 (flags: -mcpu=native)
EPYC 9R14 r7a.16xlarge: 43.17 (flags: -m64)
1. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenFOAM 10 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 91.97 (flags: -mcpu=native)
EPYC 9R14 r7a.16xlarge: 124.00 (flags: -m64)
1. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 344.17 (flags: -mcpu=native)
EPYC 9R14 r7a.16xlarge: 374.70 (flags: -m64)
1. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Xmrig

Variant: KawPow - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
Graviton4 r8g.16xlarge: 21906.0 (SE +/- 59.25, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Monero - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
Graviton4 r8g.16xlarge: 21872.1 (SE +/- 40.15, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Wownero - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
Graviton4 r8g.16xlarge: 28304.0 (SE +/- 16.31, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: GhostRider - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
Graviton4 r8g.16xlarge: 5958.7 (SE +/- 47.30, N = 12)
EPYC 9R14 r7a.16xlarge: 6899.8 (SE +/- 2.15, N = 3; flags: -maes)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: CryptoNight-Heavy - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
Graviton4 r8g.16xlarge: 21867.2 (SE +/- 16.81, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: CryptoNight-Femto UPX2 - Hash Count: 1M

Xmrig 6.21 (H/s, More Is Better)
Graviton4 r8g.16xlarge: 21851.9 (SE +/- 9.16, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
Graviton4 r8g.16xlarge: 14402.5 (SE +/- 34.15, N = 3)
EPYC 9R14 r7a.16xlarge: 28256.8 (SE +/- 148.22, N = 3; flags: -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq)
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
Graviton4 r8g.16xlarge: 1332.5 (SE +/- 0.03, N = 3; MIN: 784 / MAX: 1332.6)
EPYC 9R14 r7a.16xlarge: 2400.2 (SE +/- 0.07, N = 3; MIN: 1696.6 / MAX: 2400.3; flags: -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq)
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

John The Ripper

Test: bcrypt

John The Ripper 2023.03.14 (Real C/S, More Is Better)
Graviton4 r8g.16xlarge: 57038 (SE +/- 2.33, N = 3)
EPYC 9R14 r7a.16xlarge: 91409 (SE +/- 75.52, N = 3; flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: WPA PSK

John The Ripper 2023.03.14 (Real C/S, More Is Better)
Graviton4 r8g.16xlarge: 57444 (SE +/- 10.97, N = 3)
EPYC 9R14 r7a.16xlarge: 371747 (SE +/- 164.01, N = 3; flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

John The Ripper 2023.03.14 (Real C/S, More Is Better)
Graviton4 r8g.16xlarge: 57032 (SE +/- 9.29, N = 3)
EPYC 9R14 r7a.16xlarge: 91221 (SE +/- 117.95, N = 3; flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: HMAC-SHA512

John The Ripper 2023.03.14 (Real C/S, More Is Better)
Graviton4 r8g.16xlarge: 108066333 (SE +/- 640028.21, N = 3)
EPYC 9R14 r7a.16xlarge: 256384000 (SE +/- 730794.32, N = 3; flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: MD5

John The Ripper 2023.03.14 (Real C/S, More Is Better)
Graviton4 r8g.16xlarge: 1579667 (SE +/- 1201.85, N = 3)
EPYC 9R14 r7a.16xlarge: 8647000 (SE +/- 7571.88, N = 3; flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

7-Zip Compression

Test: Compression Rating

7-Zip Compression 24.05 (MIPS, More Is Better)
Graviton4 r8g.16xlarge: 383460 (SE +/- 397.84, N = 3)
EPYC 9R14 r7a.16xlarge: 320933 (SE +/- 1508.30, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 24.05 (MIPS, More Is Better)
Graviton4 r8g.16xlarge: 331505 (SE +/- 107.45, N = 3)
EPYC 9R14 r7a.16xlarge: 288408 (SE +/- 148.35, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Stockfish

Chess Benchmark

Stockfish 16.1 (Nodes Per Second, More Is Better)
Graviton4 r8g.16xlarge: 81440801 (SE +/- 2235406.92, N = 15)
EPYC 9R14 r7a.16xlarge: 85180777 (SE +/- 927253.56, N = 12; flags: -m64 -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2)
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 23.0.1 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 186.77 (SE +/- 1.43, N = 12)
EPYC 9R14 r7a.16xlarge: 213.29 (SE +/- 2.25, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 147.87 (SE +/- 0.35, N = 3)
EPYC 9R14 r7a.16xlarge: 127.45 (SE +/- 0.19, N = 3)

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 182.06 (SE +/- 0.20, N = 3)
EPYC 9R14 r7a.16xlarge: 198.50 (SE +/- 0.91, N = 3)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 21.7.2 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 365.02 (SE +/- 0.19, N = 3)
EPYC 9R14 r7a.16xlarge: 250.43 (SE +/- 0.19, N = 3)

C-Ray

Resolution: 4K - Rays Per Pixel: 16

C-Ray 2.0 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 28.32 (SE +/- 0.03, N = 3)
EPYC 9R14 r7a.16xlarge: 49.69 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -lpthread -lm

C-Ray

Resolution: 5K - Rays Per Pixel: 16

C-Ray 2.0 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 50.25 (SE +/- 0.07, N = 3)
EPYC 9R14 r7a.16xlarge: 88.89 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -lpthread -lm

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 (samples/s, More Is Better)
Graviton4 r8g.16xlarge: 3258733333 (SE +/- 533333.33, N = 3)
EPYC 9R14 r7a.16xlarge: 2261533333 (SE +/- 3199131.83, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
Graviton4 r8g.16xlarge: 1929666667 (SE +/- 33333.33, N = 3)
EPYC 9R14 r7a.16xlarge: 2529533333 (SE +/- 6691619.97, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
Graviton4 r8g.16xlarge: 185000000 (SE +/- 0.00, N = 3)
EPYC 9R14 r7a.16xlarge: 759700000 (SE +/- 1140935.29, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenSSL

Algorithm: SHA256

OpenSSL (byte/s, More Is Better)
Graviton4 r8g.16xlarge: 56971174537 (SE +/- 10002237.60, N = 3)
1. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: SHA512

OpenSSL (byte/s, More Is Better)
Graviton4 r8g.16xlarge: 35412809363 (SE +/- 8695334.33, N = 3)
1. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: ChaCha20

OpenSSL (byte/s, More Is Better)
Graviton4 r8g.16xlarge: 101898720323 (SE +/- 66408.09, N = 3)
EPYC 9R14 r7a.16xlarge: 308611390393 (SE +/- 350599170.76, N = 3)
1. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL

Algorithm: AES-128-GCM

OpenSSL (byte/s, More Is Better)
Graviton4 r8g.16xlarge: 299797102137 (SE +/- 7802917.99, N = 3)
EPYC 9R14 r7a.16xlarge: 256101795407 (SE +/- 358257182.84, N = 3)
1. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL

Algorithm: AES-256-GCM

OpenSSL (byte/s, More Is Better)
Graviton4 r8g.16xlarge: 266125150527 (SE +/- 6417811.59, N = 3)
EPYC 9R14 r7a.16xlarge: 236482948213 (SE +/- 338102973.44, N = 3)
1. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL (byte/s, More Is Better)
Graviton4 r8g.16xlarge: 75190548707 (SE +/- 558939.35, N = 3)
EPYC 9R14 r7a.16xlarge: 217491096737 (SE +/- 197460025.45, N = 3)
1. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
Graviton4 r8g.16xlarge: 449.44 (SE +/- 4.96, N = 9; MIN: 42.7 / MAX: 6666.67)
EPYC 9R14 r7a.16xlarge: 488.88 (SE +/- 6.79, N = 3; MIN: 40.21 / MAX: 5000)

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
Graviton4 r8g.16xlarge: 479.94 (SE +/- 5.05, N = 9; MIN: 43.1 / MAX: 6000)
EPYC 9R14 r7a.16xlarge: 508.87 (SE +/- 2.24, N = 3; MIN: 40.57 / MAX: 6000)

ClickHouse

100M Rows Hits Dataset, Third Run

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
Graviton4 r8g.16xlarge: 495.03 (SE +/- 6.71, N = 9; MIN: 43.07 / MAX: 6666.67)
EPYC 9R14 r7a.16xlarge: 514.29 (SE +/- 9.86, N = 3; MIN: 40.51 / MAX: 6000)

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024 (Ns Per Day, More Is Better)
Graviton4 r8g.16xlarge: 4.831 (SE +/- 0.004, N = 3)
EPYC 9R14 r7a.16xlarge: 8.519 (SE +/- 0.041, N = 3)
1. (CXX) g++ options: -O3 -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

PostgreSQL 16 (TPS, More Is Better)
Graviton4 r8g.16xlarge: 1947525 (SE +/- 6233.51, N = 3)
EPYC 9R14 r7a.16xlarge: 2092710 (SE +/- 7113.73, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
Graviton4 r8g.16xlarge: 0.514 (SE +/- 0.002, N = 3)
EPYC 9R14 r7a.16xlarge: 0.478 (SE +/- 0.002, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

PostgreSQL 16 (TPS, More Is Better)
Graviton4 r8g.16xlarge: 4420 (SE +/- 14.53, N = 3)
EPYC 9R14 r7a.16xlarge: 4458 (SE +/- 34.82, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
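The Read Write TPS figures are close (4420 vs. 4458), so the reported standard errors matter when reading them. The sketch below is a rough illustration, not part of the original export: it expresses the gap between two results in units of their combined standard error. It assumes the "SE" values are standard errors of the mean from independent runs, and with N = 3 it is only a coarse guide, not a significance test. The function name is hypothetical.

```python
from math import sqrt


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined standard error."""
    return abs(mean_a - mean_b) / sqrt(se_a ** 2 + se_b ** 2)


# PostgreSQL, Scaling Factor 100, 1000 clients (values from this result file).
read_write = gap_in_standard_errors(4420, 14.53, 4458, 34.82)
read_only = gap_in_standard_errors(1947525, 6233.51, 2092710, 7113.73)

print(f"Read Write gap: {read_write:.2f} combined SE")
print(f"Read Only gap:  {read_only:.2f} combined SE")
```

Under these assumptions the Read Write gap is about one combined standard error, so the two systems are effectively tied there, while the Read Only gap is many times larger than the run-to-run variation.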

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
Graviton4 r8g.16xlarge: 226.24 (SE +/- 0.74, N = 3)
EPYC 9R14 r7a.16xlarge: 224.32 (SE +/- 1.74, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
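The PostgreSQL latency and TPS results describe the same runs from two angles. As a hedged sanity check, not part of the original export, the sketch below applies Little's law: with all 1000 clients assumed to be continuously busy, throughput should be roughly clients divided by average latency. The function name and the `REPORTED` layout are illustrative; the numbers are copied from the four PostgreSQL results.

```python
CLIENTS = 1000  # from "Clients: 1000" in the test description


def implied_tps(clients, avg_latency_ms):
    """Throughput implied by Little's law, assuming every client is always busy."""
    return clients / (avg_latency_ms / 1000.0)


# (mode, system) -> (reported TPS, reported average latency in ms)
REPORTED = {
    ("Read Only", "Graviton4 r8g.16xlarge"): (1947525, 0.514),
    ("Read Only", "EPYC 9R14 r7a.16xlarge"): (2092710, 0.478),
    ("Read Write", "Graviton4 r8g.16xlarge"): (4420, 226.24),
    ("Read Write", "EPYC 9R14 r7a.16xlarge"): (4458, 224.32),
}

relative_errors = {}
for key, (tps, latency_ms) in REPORTED.items():
    estimate = implied_tps(CLIENTS, latency_ms)
    relative_errors[key] = abs(estimate - tps) / tps
    print(f"{key[0]:<10} {key[1]}: reported {tps}, implied {estimate:.0f}")
```

The implied and reported throughput agree closely for all four results, which suggests the latency charts carry little information beyond the TPS charts for this workload.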

RocksDB

Test: Random Read

RocksDB 9.0 (Op/s, More Is Better)
Graviton4 r8g.16xlarge: 346532586 (SE +/- 1371022.17, N = 3)
EPYC 9R14 r7a.16xlarge: 324703669 (SE +/- 827628.35, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Update Random

RocksDB 9.0 (Op/s, More Is Better)
Graviton4 r8g.16xlarge: 1274616 (SE +/- 11457.12, N = 3)
EPYC 9R14 r7a.16xlarge: 742898 (SE +/- 4861.01, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Read While Writing

RocksDB 9.0 (Op/s, More Is Better)
Graviton4 r8g.16xlarge: 8522597 (SE +/- 26974.32, N = 3)
EPYC 9R14 r7a.16xlarge: 6304192 (SE +/- 43017.34, N = 13)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Read Random Write Random

RocksDB 9.0 (Op/s, More Is Better)
Graviton4 r8g.16xlarge: 5584014 (SE +/- 11170.68, N = 3)
EPYC 9R14 r7a.16xlarge: 3756585 (SE +/- 17358.30, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 4.0.2 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 50.64 (SE +/- 0.10, N = 3)
EPYC 9R14 r7a.16xlarge: 30.19 (SE +/- 0.08, N = 3)

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 4.0.2 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 105.39 (SE +/- 0.11, N = 3)
EPYC 9R14 r7a.16xlarge: 57.11 (SE +/- 0.14, N = 3)

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 4.0.2 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 95.01 (SE +/- 0.26, N = 3)
EPYC 9R14 r7a.16xlarge: 39.16 (SE +/- 0.06, N = 3)

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 4.0.2 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 499.63 (SE +/- 0.76, N = 3)
EPYC 9R14 r7a.16xlarge: 284.63 (SE +/- 0.03, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 4.0.2 (Seconds, Fewer Is Better)
Graviton4 r8g.16xlarge: 202.26 (SE +/- 0.34, N = 3)
EPYC 9R14 r7a.16xlarge: 84.52 (SE +/- 0.11, N = 3)


Phoronix Test Suite v10.8.5