48 vCPU Google GCE comparison of the Axion C4A instance against 48 vCPU Xeon C4 (Emerald Rapids) and T2A Ampere Altra instances. Benchmarks by Michael Larabel for a future article.
C4A Axion Processor: ARMv8 Neoverse-V2 (48 Cores), Motherboard: KVM Google Compute Engine, Memory: 12 x 16GB RAM, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 24.04, Kernel: 6.8.0-1015-gcp (aarch64), Compiler: GCC 13.2.0, File-System: ext4, System Layer: google
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v
Java Notes: OpenJDK Runtime Environment (build 11.0.24+8-post-Ubuntu-1ubuntu324.04.1)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
C4 Xeon Platinum EMR Processor: INTEL XEON PLATINUM 8581C (24 Cores / 48 Threads), Motherboard: Google Compute Engine c4-standard-48, Chipset: Intel 440FX 82441FX PMC, Memory: 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 4 RAM, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 24.04, Kernel: 6.8.0-1015-gcp (x86_64), Compiler: GCC 13.2.0, File-System: ext4, System Layer: google
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xffffffff
Java Notes: OpenJDK Runtime Environment (build 11.0.24+8-post-Ubuntu-1ubuntu324.04.1)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Not affected
T2A Ampere Altra Processor: ARMv8 Neoverse-N1 (48 Cores), Motherboard: KVM Google Compute Engine, Memory: 12 x 16GB RAM, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 24.04, Kernel: 6.8.0-1015-gcp (aarch64), Compiler: GCC 13.2.0, File-System: ext4, System Layer: google
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
[Figure: Result Overview (Phoronix Test Suite). Relative performance of C4A Axion, C4 Xeon Platinum EMR and T2A Ampere Altra on a 100% to 340% scale for: Timed Node.js Compilation, Xcompact3d Incompact3d, ASKAP, OpenSSL, CloverLeaf, Pennant, Xmrig, Algebraic Multi-Grid Benchmark, miniFE, LULESH, Timed Godot Game Engine Compilation, LAMMPS Molecular Dynamics Simulator, Rodinia, RocksDB, ClickHouse, GROMACS, 7-Zip Compression, John The Ripper, Memcached, Coremark.]
Google Axion C4A 48 vCPU vs. C4 Xeon vs. T2A Ampere Altra
Test | C4A Axion | C4 Xeon Platinum EMR | T2A Ampere Altra
clickhouse: 100M Rows Hits Dataset, Third Run | 480.41 | 357.96 | 257.30
clickhouse: 100M Rows Hits Dataset, Second Run | 465.02 | 357.96 | 256.07
clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache | 438.85 | 326.71 | 237.24
build-nodejs: Time To Compile | 260.461 | 393.115 | 882.034
xmrig: GhostRider - 1M | 4298.2 | 3421.0 | 1618.8
openssl: AES-128-GCM | 266141028803 | 466561387338 | 140585904447
openssl: AES-256-GCM | 231857666003 | 344315542484 | 113470981097
lammps: 20k Atoms | 40.831 | 19.767 | 23.310
build-llvm: Ninja | 204.046 | 309.393 | -
hpcg: 144 144 144 - 60 | 43.1385 | 17.4443 | -
memcached: 1:100 | 4416447.73 | 5970099.59 | 3966685.26
build-godot: Time To Compile | 153.601 | 176.754 | 317.423
openssl: ChaCha20 | 100837228180 | 146568391977 | 60151706780
openssl: ChaCha20-Poly1305 | 70429152870 | 101608292900 | 41733619623
memcached: 1:10 | 4672917.72 | 5999388.74 | 4143638.41
hpcg: 104 104 104 - 60 | 42.7522 | 17.7551 | -
john-the-ripper: HMAC-SHA512 | 89864667 | 165604000 | 54735000
askap: tConvolve MT - Degridding | 15472.6 | 6342.58 | 6569.72
askap: tConvolve MT - Gridding | 11899.7 | 4496.42 | 4732.27
rocksdb: Update Rand | 999493 | 794557 | 531284
incompact3d: input.i3d 193 Cells Per Direction | 9.67658011 | 30.5046978 | 26.9445116
rodinia: OpenMP LavaMD | 41.177 | 77.061 | 68.607
rocksdb: Read Rand Write Rand | 4550050 | 3564333 | 2179030
rocksdb: Rand Read | 288511691 | 136103586 | 149212901
rocksdb: Read While Writing | 6282429 | 5175867 | 3830272
gromacs: MPI CPU - water_GMX50_bare | 3.984 | 4.387 | 2.397
cloverleaf: clover_bm64_short | 26.15 | 71.41 | 54.23
askap: tConvolve MPI - Gridding | 17890.6 | 18321.2 | 4161.39
askap: tConvolve MPI - Degridding | 17695.6 | 13938.0 | 5355.18
askap: tConvolve OpenMP - Degridding | 43530.7 | 20822.6 | 12198.6
askap: tConvolve OpenMP - Gridding | 25980.1 | 17158.7 | 9510.54
askap: Hogbom Clean OpenMP | 2254.71 | 1456.38 | 1080.45
john-the-ripper: bcrypt | 45749 | 46521 | 40399
john-the-ripper: Blowfish | 45763 | 46328 | 40398
compress-7zip: Decompression Rating | 266745 | 145509 | 206064
compress-7zip: Compression Rating | 306082 | 208023 | 193705
coremark: CoreMark Size 666 - Iterations Per Second | 1441191.863234 | 1027241.721365 | 1052843.691802
amg: | 2188353000 | 916723067 | 1088577000
pennant: sedovbig | 7.310130 | 28.32594 | 16.97260
minife: Small | 47199.6 | 20576.7 | 24741.6
lulesh: | 20794.052 | 10358.893 | 9812.8071
pennant: leblancbig | 5.041512 | 9.512217 | 12.67227
rodinia: OpenMP CFD Solver | 3.106 | 6.621 | 5.552
OpenBenchmarking.org
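The raw figures above mix "more is better" and "fewer is better" metrics, so a direct comparison needs the ratio flipped for time-based tests. The following is a minimal sketch of that normalization, using three values copied from this result file; the function names and the `RESULTS` layout are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# Values copied from this result file (C4A Axion vs. C4 Xeon Platinum EMR).
# Each entry: (c4a_value, xeon_value, higher_is_better)
RESULTS = {
    "clickhouse: 100M Rows Hits Dataset, Third Run": (480.41, 357.96, True),
    "cloverleaf: clover_bm64_short": (26.15, 71.41, False),  # seconds
    "gromacs: MPI CPU - water_GMX50_bare": (3.984, 4.387, True),
}


def relative_performance(candidate, baseline, higher_is_better):
    """Return the candidate's performance as a multiple of the baseline.

    For "fewer is better" metrics (e.g. seconds) the ratio is inverted so
    that a value above 1.0 always means the candidate is faster.
    """
    if higher_is_better:
        return candidate / baseline
    return baseline / candidate


def summarize(results):
    """Map each test name to the C4A-over-Xeon performance ratio."""
    return {
        name: relative_performance(c4a, xeon, hib)
        for name, (c4a, xeon, hib) in results.items()
    }


if __name__ == "__main__":
    for name, ratio in summarize(RESULTS).items():
        print(f"{name}: C4A Axion at {ratio:.2f}x of C4 Xeon Platinum EMR")
```

A ratio above 1.0 means the C4A Axion instance came out ahead on that test; a ratio below 1.0 means the Xeon instance did.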
ClickHouse
ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.
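As a rough illustration of how a geometric-mean score of this kind could be derived, the sketch below converts per-query times into queries per minute and aggregates them. The conversion `60 / seconds` and the sample timings are assumptions made for illustration; the test profile's actual implementation is not shown in this result file.

```python
import math


def queries_per_minute(seconds):
    """Convert one query's wall time (seconds) into a queries-per-minute rate."""
    if seconds <= 0:
        raise ValueError("query time must be positive")
    return 60.0 / seconds


def geometric_mean(values):
    """Geometric mean computed in log space to avoid overflow on long lists."""
    if not values:
        raise ValueError("need at least one value")
    if any(v <= 0 for v in values):
        raise ValueError("geometric mean requires positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    # Hypothetical per-query times in seconds (not taken from the benchmark).
    query_times = [0.008, 0.25, 1.5, 3.2]
    rates = [queries_per_minute(t) for t in query_times]
    print(f"Geo mean: {geometric_mean(rates):.2f} queries per minute")
```

The geometric mean keeps a handful of very fast queries from dominating the score the way an arithmetic mean of rates would. Under the assumed conversion, a 0.008-second query maps to 7500 queries per minute.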
ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Third Run (Queries Per Minute, Geo Mean, More Is Better)
T2A Ampere Altra: 257.30 (SE +/- 2.67, N = 3; MIN: 18.79 / MAX: 3750)
C4A Axion: 480.41 (SE +/- 3.70, N = 9; MIN: 35.46 / MAX: 6666.67)
C4 Xeon Platinum EMR: 357.96 (SE +/- 2.07, N = 3; MIN: 20.98 / MAX: 6666.67)

ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Second Run (Queries Per Minute, Geo Mean, More Is Better)
T2A Ampere Altra: 256.07 (SE +/- 4.17, N = 3; MIN: 18.78 / MAX: 4285.71)
C4A Axion: 465.02 (SE +/- 4.32, N = 9; MIN: 35.65 / MAX: 7500)
C4 Xeon Platinum EMR: 357.96 (SE +/- 1.96, N = 3; MIN: 21.12 / MAX: 7500)

ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, First Run / Cold Cache (Queries Per Minute, Geo Mean, More Is Better)
T2A Ampere Altra: 237.24 (SE +/- 3.01, N = 3; MIN: 18.69 / MAX: 4000)
C4A Axion: 438.85 (SE +/- 5.15, N = 9; MIN: 35.57 / MAX: 6666.67)
C4 Xeon Platinum EMR: 326.71 (SE +/- 1.91, N = 3; MIN: 20.51 / MAX: 6666.67)
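Every result in this file is reported with an "SE +/- x, N = n" annotation. The sketch below shows one way such a figure could be produced, assuming SE is the standard error of the mean (sample standard deviation divided by the square root of the run count). That definition is an assumption, since the result file does not spell it out, and the run values are hypothetical.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample stddev / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate the standard error")
    return statistics.stdev(samples) / math.sqrt(len(samples))


def format_result(samples):
    """Render a run set in the 'mean (SE +/- x, N = n)' style used in this file."""
    mean = statistics.fmean(samples)
    return f"{mean:.2f} (SE +/- {standard_error(samples):.2f}, N = {len(samples)})"


if __name__ == "__main__":
    # Hypothetical queries-per-minute scores from three runs of one test.
    runs = [255.0, 257.0, 260.0]
    print(format_result(runs))
```

A larger N next to a result, such as N = 9 or N = 15, typically indicates that extra runs were performed, which is worth keeping in mind when comparing narrow margins.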
Xmrig
Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.21 - Variant: GhostRider - Hash Count: 1M (H/s, More Is Better)
T2A Ampere Altra: 1618.8 (SE +/- 18.74, N = 3)
C4A Axion: 4298.2 (SE +/- 12.97, N = 3)
C4 Xeon Platinum EMR: 3421.0 (SE +/- 7.01, N = 3)
-maes
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenSSL

OpenSSL - Algorithm: AES-128-GCM (byte/s, More Is Better)
T2A Ampere Altra: 140585904447 (SE +/- 13254818.21, N = 3)
C4A Axion: 266141028803 (SE +/- 7311961.78, N = 3)
C4 Xeon Platinum EMR: 466561387338 (SE +/- 6674463163.95, N = 12)
1. T2A Ampere Altra: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. C4A Axion: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
3. C4 Xeon Platinum EMR: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL - Algorithm: AES-256-GCM (byte/s, More Is Better)
T2A Ampere Altra: 113470981097 (SE +/- 18912120.18, N = 3)
C4A Axion: 231857666003 (SE +/- 119817516.04, N = 3)
C4 Xeon Platinum EMR: 344315542484 (SE +/- 5252124128.24, N = 12)
1. T2A Ampere Altra: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. C4A Axion: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
3. C4 Xeon Platinum EMR: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8
Memcached
Memcached is a high performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

Memcached 1.6.19 - Set To Get Ratio: 1:100 (Ops/sec, More Is Better)
T2A Ampere Altra: 3966685.26 (SE +/- 26632.42, N = 3)
C4A Axion: 4416447.73 (SE +/- 203945.72, N = 12)
C4 Xeon Platinum EMR: 5970099.59 (SE +/- 55898.73, N = 15)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenSSL

OpenSSL - Algorithm: ChaCha20 (byte/s, More Is Better)
T2A Ampere Altra: 60151706780 (SE +/- 7423321.42, N = 3)
C4A Axion: 100837228180 (SE +/- 48601049.47, N = 3)
C4 Xeon Platinum EMR: 146568391977 (SE +/- 34658260.13, N = 3)
1. T2A Ampere Altra: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. C4A Axion: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
3. C4 Xeon Platinum EMR: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL - Algorithm: ChaCha20-Poly1305 (byte/s, More Is Better)
T2A Ampere Altra: 41733619623 (SE +/- 1266565.62, N = 3)
C4A Axion: 70429152870 (SE +/- 36045318.84, N = 3)
C4 Xeon Platinum EMR: 101608292900 (SE +/- 317228455.16, N = 3)
1. T2A Ampere Altra: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. C4A Axion: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
3. C4 Xeon Platinum EMR: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8
Memcached
Memcached is a high performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

Memcached 1.6.19 - Set To Get Ratio: 1:10 (Ops/sec, More Is Better)
T2A Ampere Altra: 4143638.41 (SE +/- 57647.92, N = 3)
C4A Axion: 4672917.72 (SE +/- 200017.70, N = 12)
C4 Xeon Platinum EMR: 5999388.74 (SE +/- 66686.31, N = 5)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
ASKAP
ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve), along with some previous ASKAP benchmarks included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
T2A Ampere Altra: 6569.72 (SE +/- 5.63, N = 3)
C4A Axion: 15472.60 (SE +/- 32.45, N = 3)
C4 Xeon Platinum EMR: 6342.58 (SE +/- 3.63, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
T2A Ampere Altra: 4732.27 (SE +/- 0.58, N = 3)
C4A Axion: 11899.70 (SE +/- 11.07, N = 3)
C4 Xeon Platinum EMR: 4496.42 (SE +/- 6.07, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
RocksDB
This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

RocksDB 9.0 - Test: Update Random (Op/s, More Is Better)
T2A Ampere Altra: 531284 (SE +/- 5458.72, N = 4)
C4A Axion: 999493 (SE +/- 9437.64, N = 3)
C4 Xeon Platinum EMR: 794557 (SE +/- 5276.32, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
Xcompact3d Incompact3d
Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation together with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
T2A Ampere Altra: 26.94451160 (SE +/- 0.18163136, N = 13)
C4A Axion: 9.67658011 (SE +/- 0.00471123, N = 3)
C4 Xeon Platinum EMR: 30.50469780 (SE +/- 0.28274140, N = 6)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Rodinia
Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP LavaMD (Seconds, Fewer Is Better)
T2A Ampere Altra: 68.61 (SE +/- 0.02, N = 3)
C4A Axion: 41.18 (SE +/- 0.12, N = 3)
C4 Xeon Platinum EMR: 77.06 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
RocksDB
This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

RocksDB 9.0 - Test: Read Random Write Random (Op/s, More Is Better)
T2A Ampere Altra: 2179030 (SE +/- 27299.12, N = 3)
C4A Axion: 4550050 (SE +/- 4377.96, N = 3)
C4 Xeon Platinum EMR: 3564333 (SE +/- 17653.14, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB 9.0 - Test: Random Read (Op/s, More Is Better)
T2A Ampere Altra: 149212901 (SE +/- 344165.60, N = 3)
C4A Axion: 288511691 (SE +/- 88148.93, N = 3)
C4 Xeon Platinum EMR: 136103586 (SE +/- 599927.22, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB 9.0 - Test: Read While Writing (Op/s, More Is Better)
T2A Ampere Altra: 3830272 (SE +/- 21345.21, N = 3)
C4A Axion: 6282429 (SE +/- 42437.10, N = 3)
C4 Xeon Platinum EMR: 5175867 (SE +/- 8881.77, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
GROMACS
The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package, tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2024 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
T2A Ampere Altra: 2.397 (SE +/- 0.006, N = 3)
C4A Axion: 3.984 (SE +/- 0.001, N = 3)
C4 Xeon Platinum EMR: 4.387 (SE +/- 0.012, N = 3)
1. (CXX) g++ options: -O3 -lm
CloverLeaf
CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version. Learn more via the OpenBenchmarking.org test page.

CloverLeaf 1.3 - Input: clover_bm64_short (Seconds, Fewer Is Better)
T2A Ampere Altra: 54.23 (SE +/- 0.16, N = 3)
C4A Axion: 26.15 (SE +/- 0.03, N = 3)
C4 Xeon Platinum EMR: 71.41 (SE +/- 0.01, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
ASKAP
ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve), along with some previous ASKAP benchmarks included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
T2A Ampere Altra: 4161.39 (SE +/- 9.69, N = 3)
C4A Axion: 17890.60 (SE +/- 0.00, N = 3)
C4 Xeon Platinum EMR: 18321.20 (SE +/- 294.55, N = 4)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
T2A Ampere Altra: 5355.18 (SE +/- 21.03, N = 3)
C4A Axion: 17695.60 (SE +/- 229.66, N = 3)
C4 Xeon Platinum EMR: 13938.00 (SE +/- 159.30, N = 4)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
T2A Ampere Altra: 12198.6 (SE +/- 64.77, N = 12)
C4A Axion: 43530.7 (SE +/- 575.94, N = 15)
C4 Xeon Platinum EMR: 20822.6 (SE +/- 182.46, N = 15)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
T2A Ampere Altra: 9510.54 (SE +/- 156.68, N = 12)
C4A Axion: 25980.10 (SE +/- 286.07, N = 15)
C4 Xeon Platinum EMR: 17158.70 (SE +/- 147.92, N = 15)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
T2A Ampere Altra: 1080.45 (SE +/- 11.98, N = 5)
C4A Axion: 2254.71 (SE +/- 37.89, N = 15)
C4 Xeon Platinum EMR: 1456.38 (SE +/- 7.10, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
John The Ripper 2023.03.14 - Test: Blowfish (Real C/S, More Is Better)
T2A Ampere Altra: 40398 (SE +/- 27.71, N = 3)
C4A Axion: 45763 (SE +/- 14.62, N = 3)
C4 Xeon Platinum EMR: 46328 (SE +/- 34.85, N = 3)
-m64
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt
7-Zip Compression

7-Zip Compression 24.05 - Test: Decompression Rating (MIPS, More Is Better)
T2A Ampere Altra: 206064 (SE +/- 42.51, N = 3)
C4A Axion: 266745 (SE +/- 67.68, N = 3)
C4 Xeon Platinum EMR: 145509 (SE +/- 206.03, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 24.05 - Test: Compression Rating (MIPS, More Is Better)
T2A Ampere Altra: 193705 (SE +/- 1305.07, N = 3)
C4A Axion: 306082 (SE +/- 748.65, N = 3)
C4 Xeon Platinum EMR: 208023 (SE +/- 191.64, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Coremark
This is a test of the EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
T2A Ampere Altra: 1052843.69 (SE +/- 1902.13, N = 3)
C4A Axion: 1441191.86 (SE +/- 20293.48, N = 3)
C4 Xeon Platinum EMR: 1027241.72 (SE +/- 1293.04, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt
Algebraic Multi-Grid Benchmark
AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
T2A Ampere Altra: 1088577000 (SE +/- 10220970.27, N = 3)
C4A Axion: 2188353000 (SE +/- 2463059.28, N = 3)
C4 Xeon Platinum EMR: 916723067 (SE +/- 341457.48, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
Pennant
Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1 - Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
T2A Ampere Altra: 16.972600 (SE +/- 0.094168, N = 3)
C4A Axion: 7.310130 (SE +/- 0.029628, N = 3)
C4 Xeon Platinum EMR: 28.325940 (SE +/- 0.080167, N = 3)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
miniFE
MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

miniFE 2.2 - Problem Size: Small (CG Mflops, More Is Better)
T2A Ampere Altra: 24741.6 (SE +/- 53.66, N = 3)
C4A Axion: 47199.6 (SE +/- 12.79, N = 3)
C4 Xeon Platinum EMR: 20576.7 (SE +/- 4.08, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi
Pennant
Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1 - Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
T2A Ampere Altra: 12.672270 (SE +/- 0.010185, N = 3)
C4A Axion: 5.041512 (SE +/- 0.017045, N = 3)
C4 Xeon Platinum EMR: 9.512217 (SE +/- 0.012783, N = 3)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
Rodinia
Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
T2A Ampere Altra: 5.552 (SE +/- 0.014, N = 3)
C4A Axion: 3.106 (SE +/- 0.024, N = 3)
C4 Xeon Platinum EMR: 6.621 (SE +/- 0.020, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
Testing initiated at 2 October 2024 14:36 by user michael_larabel.
C4 Xeon Platinum EMR Processor: INTEL XEON PLATINUM 8581C (24 Cores / 48 Threads), Motherboard: Google Compute Engine c4-standard-48, Chipset: Intel 440FX 82441FX PMC, Memory: 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 16 + 4 RAM, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 24.04, Kernel: 6.8.0-1015-gcp (x86_64), Compiler: GCC 13.2.0, File-System: ext4, System Layer: google
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xffffffff
Java Notes: OpenJDK Runtime Environment (build 11.0.24+8-post-Ubuntu-1ubuntu324.04.1)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 3 October 2024 14:56 by user michael_larabel.
T2A Ampere Altra Processor: ARMv8 Neoverse-N1 (48 Cores), Motherboard: KVM Google Compute Engine, Memory: 12 x 16GB RAM, Disk: 215GB nvme_card-pd, Network: Google Compute Engine Virtual
OS: Ubuntu 24.04, Kernel: 6.8.0-1015-gcp (aarch64), Compiler: GCC 13.2.0, File-System: ext4, System Layer: google
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 7 October 2024 14:49 by user michael_larabel.