Google Axion C4A 48 vCPU vs. C4 Xeon vs. T2A Ampere Altra

48 vCPU Google GCE comparison of Axion C4A instance compared to 48 vCPU Xeon C4 (Emerald Rapids) and T2A Ampere Altra instances. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2410309-NE-AXIONC4A532.

Google Axion C4A 48 vCPU vs. C4 Xeon vs. T2A Ampere AltraProcessorMotherboardMemoryDiskNetworkChipsetOSKernelCompilerFile-SystemSystem LayerC4A AxionC4 Xeon Platinum EMRT2A Ampere AltraARMv8 Neoverse-V2 (48 Cores)KVM Google Compute Engine12 x 16GB RAM215GB nvme_card-pdGoogle Compute Engine VirtualUbuntu 24.046.8.0-1015-gcp (aarch64)GCC 13.2.0ext4googleINTEL XEON PLATINUM 8581C (24 Cores / 48 Threads)Google Compute Engine c4-standard-48Intel 440FX 82441FX PMC16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 4 RAM6.8.0-1015-gcp (x86_64)ARMv8 Neoverse-N1 (48 Cores)KVM Google Compute Engine12 x 16GB RAM6.8.0-1015-gcp (aarch64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- C4A Axion: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v - C4 Xeon Platinum EMR: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - T2A Ampere Altra: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v Java Details- C4A Axion, C4 Xeon Platinum EMR: OpenJDK Runtime Environment (build 11.0.24+8-post-Ubuntu-1ubuntu324.04.1)Python Details- Python 3.12.3Security Details- C4A Axion: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected - C4 Xeon Platinum EMR: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Not affected - T2A Ampere Altra: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected Processor Details- C4 Xeon Platinum EMR: CPU Microcode: 0xffffffff

Google Axion C4A 48 vCPU vs. C4 Xeon vs. T2A Ampere Altrahpcg: 104 104 104 - 60hpcg: 144 144 144 - 60minife: Smallcloverleaf: clover_bm64_shortrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solveramg: pennant: sedovbigpennant: leblancbigincompact3d: input.i3d 193 Cells Per Directionlammps: 20k Atomslulesh: xmrig: GhostRider - 1Mjohn-the-ripper: bcryptjohn-the-ripper: Blowfishjohn-the-ripper: HMAC-SHA512coremark: CoreMark Size 666 - Iterations Per Secondcompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingbuild-godot: Time To Compilebuild-llvm: Ninjabuild-nodejs: Time To Compileopenssl: ChaCha20openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20-Poly1305clickhouse: 100M Rows Hits Dataset, First Run / Cold Cacheclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, Third Runmemcached: 1:10memcached: 1:100askap: tConvolve MT - Griddingaskap: tConvolve MT - Degriddingaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddingaskap: tConvolve OpenMP - Griddingaskap: tConvolve OpenMP - Degriddingaskap: Hogbom Clean OpenMPgromacs: MPI CPU - water_GMX50_barerocksdb: Rand Readrocksdb: Update Randrocksdb: Read While Writingrocksdb: Read Rand Write RandC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra42.752243.138547199.626.1541.1773.10621883530007.3101305.0415129.6765801140.83120794.0524298.24574945763898646671441191.863234306082266745153.601204.046260.46110083722818026614102880323185766600370429152870438.85465.02480.414672917.724416447.7311899.715472.617695.617890.625980.143530.72254.713.9842885116919994936282429455005017.755117.444320576.771.4177.0616.62191672306728.325949.51221730.504697819.76710358.8933421.046521463281656040001027241.721365208023145509176.754309.393393.115146568391977466561387338344315542484101608292900326.71357.96357.965999388.745970099.594496.426342.5813938.018321.217158.720822.61456.384.3871361035867945575175867356433324741.654.2368.6075.552108857700016.9726012.6722726.944511623.3109812.80711618.84039940398547350001052843.691802193705206064317.423882.0346015170678014058590444711347098109741733619623237.24256.07257.304143638.413966685.264732.276569.725355.184161.399510.5412198.61080.452.39714921290153128438302722179030OpenBenchmarking.org

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 60C4A AxionC4 Xeon Platinum EMR1020304050SE +/- 0.16, N = 3SE +/- 0.00, N = 342.7517.761. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60C4A AxionC4 Xeon Platinum EMR1020304050SE +/- 0.18, N = 3SE +/- 0.00, N = 343.1417.441. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra10K20K30K40K50KSE +/- 12.79, N = 3SE +/- 4.08, N = 3SE +/- 53.66, N = 347199.620576.724741.61. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

CloverLeaf

Input: clover_bm64_short

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm64_shortC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra1632486480SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 326.1571.4154.231. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra20406080100SE +/- 0.12, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 341.1877.0668.611. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra246810SE +/- 0.024, N = 3SE +/- 0.020, N = 3SE +/- 0.014, N = 33.1066.6215.5521. (CXX) g++ options: -O2 -lOpenCL

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2C4A AxionC4 Xeon Platinum EMRT2A Ampere Altra500M1000M1500M2000M2500MSE +/- 2463059.28, N = 3SE +/- 341457.48, N = 3SE +/- 10220970.27, N = 3218835300091672306710885770001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra714212835SE +/- 0.029628, N = 3SE +/- 0.080167, N = 3SE +/- 0.094168, N = 37.31013028.32594016.9726001. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra3691215SE +/- 0.017045, N = 3SE +/- 0.012783, N = 3SE +/- 0.010185, N = 35.0415129.51221712.6722701. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra714212835SE +/- 0.00471123, N = 3SE +/- 0.28274140, N = 6SE +/- 0.18163136, N = 139.6765801130.5046978026.944511601. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra918273645SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 340.8319.7723.311. (CXX) g++ options: -O3 -lm -ldl

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3C4A AxionC4 Xeon Platinum EMRT2A Ampere Altra4K8K12K16K20KSE +/- 282.43, N = 3SE +/- 45.91, N = 3SE +/- 34.22, N = 320794.0510358.899812.811. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

Xmrig

Variant: GhostRider - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1MC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra9001800270036004500SE +/- 12.97, N = 3SE +/- 7.01, N = 3SE +/- 18.74, N = 34298.23421.01618.8-maes1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra10K20K30K40K50KSE +/- 25.03, N = 3SE +/- 54.04, N = 3SE +/- 13.96, N = 3457494652140399-m641. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra10K20K30K40K50KSE +/- 14.62, N = 3SE +/- 34.85, N = 3SE +/- 27.71, N = 3457634632840398-m641. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512C4A AxionC4 Xeon Platinum EMRT2A Ampere Altra40M80M120M160M200MSE +/- 28852.11, N = 3SE +/- 65317.17, N = 3SE +/- 1525107.00, N = 128986466716560400054735000-m641. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra300K600K900K1200K1500KSE +/- 20293.48, N = 3SE +/- 1293.04, N = 3SE +/- 1902.13, N = 31441191.861027241.721052843.691. (CC) gcc options: -O2 -lrt" -lrt

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra70K140K210K280K350KSE +/- 748.65, N = 3SE +/- 191.64, N = 3SE +/- 1305.07, N = 33060822080231937051. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra60K120K180K240K300KSE +/- 67.68, N = 3SE +/- 206.03, N = 3SE +/- 42.51, N = 32667451455092060641. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To CompileC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra70140210280350SE +/- 0.21, N = 3SE +/- 0.35, N = 3SE +/- 0.18, N = 3153.60176.75317.42

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaC4A AxionC4 Xeon Platinum EMR70140210280350SE +/- 0.23, N = 3SE +/- 0.29, N = 3204.05309.39

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To CompileC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra2004006008001000SE +/- 0.26, N = 3SE +/- 2.47, N = 3SE +/- 1.32, N = 3260.46393.12882.03

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: ChaCha20C4A AxionC4 Xeon Platinum EMRT2A Ampere Altra30000M60000M90000M120000M150000MSE +/- 48601049.47, N = 3SE +/- 34658260.13, N = 3SE +/- 7423321.42, N = 3100837228180146568391977601517067801. C4A Axion: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)2. C4 Xeon Platinum EMR: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8 3. T2A Ampere Altra: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: AES-128-GCMC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra100000M200000M300000M400000M500000MSE +/- 7311961.78, N = 3SE +/- 6674463163.95, N = 12SE +/- 13254818.21, N = 32661410288034665613873381405859044471. C4A Axion: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)2. C4 Xeon Platinum EMR: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8 3. T2A Ampere Altra: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: AES-256-GCMC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra70000M140000M210000M280000M350000MSE +/- 119817516.04, N = 3SE +/- 5252124128.24, N = 12SE +/- 18912120.18, N = 32318576660033443155424841134709810971. C4A Axion: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)2. C4 Xeon Platinum EMR: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8 3. T2A Ampere Altra: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: ChaCha20-Poly1305C4A AxionC4 Xeon Platinum EMRT2A Ampere Altra20000M40000M60000M80000M100000MSE +/- 36045318.84, N = 3SE +/- 317228455.16, N = 3SE +/- 1266565.62, N = 370429152870101608292900417336196231. C4A Axion: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)2. C4 Xeon Platinum EMR: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8 3. T2A Ampere Altra: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold CacheC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra100200300400500SE +/- 5.15, N = 9SE +/- 1.91, N = 3SE +/- 3.01, N = 3438.85326.71237.24MIN: 35.57 / MAX: 6666.67MIN: 20.51 / MAX: 6666.67MIN: 18.69 / MAX: 4000

ClickHouse

100M Rows Hits Dataset, Second Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second RunC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra100200300400500SE +/- 4.32, N = 9SE +/- 1.96, N = 3SE +/- 4.17, N = 3465.02357.96256.07MIN: 35.65 / MAX: 7500MIN: 21.12 / MAX: 7500MIN: 18.78 / MAX: 4285.71

ClickHouse

100M Rows Hits Dataset, Third Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third RunC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra100200300400500SE +/- 3.70, N = 9SE +/- 2.07, N = 3SE +/- 2.67, N = 3480.41357.96257.30MIN: 35.46 / MAX: 6666.67MIN: 20.98 / MAX: 6666.67MIN: 18.79 / MAX: 3750

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10C4A AxionC4 Xeon Platinum EMRT2A Ampere Altra1.3M2.6M3.9M5.2M6.5MSE +/- 200017.70, N = 12SE +/- 66686.31, N = 5SE +/- 57647.92, N = 34672917.725999388.744143638.411. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100C4A AxionC4 Xeon Platinum EMRT2A Ampere Altra1.3M2.6M3.9M5.2M6.5MSE +/- 203945.72, N = 12SE +/- 55898.73, N = 15SE +/- 26632.42, N = 34416447.735970099.593966685.261. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - GriddingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra3K6K9K12K15KSE +/- 11.07, N = 3SE +/- 6.07, N = 3SE +/- 0.58, N = 311899.704496.424732.271. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - DegriddingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra3K6K9K12K15KSE +/- 32.45, N = 3SE +/- 3.63, N = 3SE +/- 5.63, N = 315472.606342.586569.721. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra4K8K12K16K20KSE +/- 229.66, N = 3SE +/- 159.30, N = 4SE +/- 21.03, N = 317695.6013938.005355.181. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra4K8K12K16K20KSE +/- 0.00, N = 3SE +/- 294.55, N = 4SE +/- 9.69, N = 317890.6018321.204161.391. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - GriddingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra6K12K18K24K30KSE +/- 286.07, N = 15SE +/- 147.92, N = 15SE +/- 156.68, N = 1225980.1017158.709510.541. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - DegriddingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra9K18K27K36K45KSE +/- 575.94, N = 15SE +/- 182.46, N = 15SE +/- 64.77, N = 1243530.720822.612198.61. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra5001000150020002500SE +/- 37.89, N = 15SE +/- 7.10, N = 3SE +/- 11.98, N = 52254.711456.381080.451. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra0.98711.97422.96133.94844.9355SE +/- 0.001, N = 3SE +/- 0.012, N = 3SE +/- 0.006, N = 33.9844.3872.3971. (CXX) g++ options: -O3 -lm

RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Random ReadC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra60M120M180M240M300MSE +/- 88148.93, N = 3SE +/- 599927.22, N = 3SE +/- 344165.60, N = 32885116911361035861492129011. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Update RandomC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra200K400K600K800K1000KSE +/- 9437.64, N = 3SE +/- 5276.32, N = 3SE +/- 5458.72, N = 49994937945575312841. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read While WritingC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra1.3M2.6M3.9M5.2M6.5MSE +/- 42437.10, N = 3SE +/- 8881.77, N = 3SE +/- 21345.21, N = 36282429517586738302721. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Read Random Write Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read Random Write RandomC4A AxionC4 Xeon Platinum EMRT2A Ampere Altra1000K2000K3000K4000K5000KSE +/- 4377.96, N = 3SE +/- 17653.14, N = 3SE +/- 27299.12, N = 34550050356433321790301. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti


Phoronix Test Suite v10.8.5