Ampere Altra September 2021

Ampere Altra ARMv8 Neoverse-N1 testing with a WIWYNN Mt.Jade (1.1.20201019 BIOS) and ASPEED on Ubuntu 21.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2109022-TJ-AMPEREALT86&gru.

Ampere Altra September 2021ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelVulkanCompilerFile-SystemScreen ResolutionAmpere Altra 160 CoresAmpere Altra ARMv8 Neoverse-N1 @ 3.30GHz (160 Cores)WIWYNN Mt.Jade (1.1.20201019 BIOS)Ampere Computing LLC Altra PCI Root Complex A502GB3841GB Micron_9300_MTFDHAL3T8TDP + 960GB SAMSUNG MZ1LB960HAJQ-00007ASPEEDVE228Mellanox MT28908 + Intel I210Ubuntu 21.045.11.0-25-generic (aarch64)1.0.2GCC 10.3.0ext41920x1080OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - Scaling Governor: cppc_cpufreq performance (Boost: Enabled)- Python 3.9.5- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Ampere Altra September 2021stress-ng: MMAPstress-ng: NUMAstress-ng: MEMFDstress-ng: Atomicstress-ng: Cryptostress-ng: Mallocstress-ng: Forkingstress-ng: SENDFILEstress-ng: CPU Cachestress-ng: CPU Stressstress-ng: Semaphoresstress-ng: Matrix Mathstress-ng: Vector Mathstress-ng: Memory Copyingstress-ng: Socket Activitystress-ng: Context Switchingstress-ng: Glibc C String Functionsstress-ng: Glibc Qsort Data Sortingstress-ng: System V Message Passingsysbench: CPUamg: dav1d: Summer Nature 4Kdav1d: Chimera 1080p 10-bitaom-av1: Speed 4 Two-Pass - Bosphorus 4Kaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 4Kvpxenc: Speed 0 - Bosphorus 4Kvpxenc: Speed 5 - Bosphorus 4Khpcg: mt-dgemm: Sustained Floating-Point Ratecoremark: CoreMark Size 666 - Iterations Per Secondaircrack-ng: securemark: SecureMark-TLSsysbench: RAM / Memorylczero: BLASlczero: Eigenstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthlammps: 20k Atomslammps: Rhodopsin Proteinrocksdb: Rand Fillrocksdb: Update Randrocksdb: Seq Fillmysqlslap: 1024hint: FLOATjohn-the-ripper: Blowfishjohn-the-ripper: MD5nginx: 500nginx: 1000apache: 500apache: 1000liquid-dsp: 128 - 256 - 57liquid-dsp: 160 - 256 - 57pgbench: 100 - 250 - Read Onlypgbench: 100 - 250 - Read Writelulesh: pgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Write - Average Latencytnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1nwchem: C240 Buckyballincompact3d: X3D-benchmarking input.i3dincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionmocassin: Dust 2D tau100.0openfoam: Motorbike 30Mopenfoam: Motorbike 60Mqe: AUSURF112relion: Basic - CPUavifenc: 0avifenc: 2avifenc: 6avifenc: 6, Losslessbuild-apache: Time To Compilebuild-ffmpeg: Time To Compilebuild-gdb: Time To Compilebuild-godot: Time To Compilebuild-imagemagick: Time To Compilebuild-linux-kernel: Time To Compilebuild-llvm: Ninjabuild-llvm: Unix Makefilesbuild-nodejs: Time To Compilebuild-php: Time To Compilebuild2: Time To Compilepovray: Trace Timeprimesieve: 1e12 Prime Number Generationm-queens: Time To Solven-queens: Elapsed Timewebp2: Quality 75, Compression Effort 7webp2: Quality 95, Compression Effort 7webp2: Quality 100, Compression Effort 5webp2: Quality 100, Lossless Compressionwrf: conus 2.5kmblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-OnlyAmpere Altra 160 Cores627.73218.61897.256694.1937959.551849557417.2010469.642283406.202363.2325351.5411392704.92835662.631157056.2610969.1917019.9730369754.1912530726.361723.283612912.57586141.721265829000231.55298.511.255.303.1627.4033.212.006.2413.92344.1265903537109.247619191295.4481606811316.172018203610636950711711674636.47632.71291002895967583598353049502.18036114458244783351805.5852597.6540860.2739919.8627761000003459866667203612286527716.91351.2328.7333033.843314.92482.089256.3202400.71009.525296.1515696833.886483517827.07301.49585.53427.832130.45174.87921.92135.55036.96319.70780.94889.15423.54053.701164.551296.837147.45065.92883.51712.5355.4566.2891.341155.250278.8377.831440.58228840.00242.1264.9893.85218.71148.70OpenBenchmarking.org

Stress-NG

Test: MMAP

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: MMAPAmpere Altra 160 Cores140280420560700SE +/- 5.69, N = 3627.731. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: NUMA

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: NUMAAmpere Altra 160 Cores50100150200250SE +/- 0.31, N = 3218.611. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: MEMFD

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: MEMFDAmpere Altra 160 Cores2004006008001000SE +/- 4.56, N = 3897.251. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Atomic

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: AtomicAmpere Altra 160 Cores14002800420056007000SE +/- 12.66, N = 36694.191. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Crypto

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CryptoAmpere Altra 160 Cores8K16K24K32K40KSE +/- 54.67, N = 337959.551. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Malloc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: MallocAmpere Altra 160 Cores400M800M1200M1600M2000MSE +/- 2215162.36, N = 31849557417.201. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Forking

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: ForkingAmpere Altra 160 Cores2K4K6K8K10KSE +/- 84.72, N = 310469.641. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: SENDFILE

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: SENDFILEAmpere Altra 160 Cores500K1000K1500K2000K2500KSE +/- 2567.56, N = 32283406.201. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CPU CacheAmpere Altra 160 Cores5001000150020002500SE +/- 42.56, N = 152363.231. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: CPU Stress

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CPU StressAmpere Altra 160 Cores5K10K15K20K25KSE +/- 9.04, N = 325351.541. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Semaphores

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: SemaphoresAmpere Altra 160 Cores2M4M6M8M10MSE +/- 25276.70, N = 311392704.921. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Matrix MathAmpere Altra 160 Cores200K400K600K800K1000KSE +/- 744.15, N = 3835662.631. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Vector MathAmpere Altra 160 Cores200K400K600K800K1000KSE +/- 67.65, N = 31157056.261. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Memory CopyingAmpere Altra 160 Cores2K4K6K8K10KSE +/- 40.00, N = 310969.191. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Socket Activity

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Socket ActivityAmpere Altra 160 Cores4K8K12K16K20KSE +/- 455.46, N = 1517019.971. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Context Switching

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Context SwitchingAmpere Altra 160 Cores7M14M21M28M35MSE +/- 421638.23, N = 330369754.191. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Glibc C String Functions

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Glibc C String FunctionsAmpere Altra 160 Cores3M6M9M12M15MSE +/- 3063.36, N = 312530726.361. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: Glibc Qsort Data Sorting

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Glibc Qsort Data SortingAmpere Altra 160 Cores400800120016002000SE +/- 0.74, N = 31723.281. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Stress-NG

Test: System V Message Passing

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: System V Message PassingAmpere Altra 160 Cores800K1600K2400K3200K4000KSE +/- 37484.14, N = 53612912.571. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPUAmpere Altra 160 Cores130K260K390K520K650KSE +/- 3578.00, N = 3586141.721. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2Ampere Altra 160 Cores300M600M900M1200M1500MSE +/- 5883042.95, N = 312658290001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.1Video Input: Summer Nature 4KAmpere Altra 160 Cores50100150200250SE +/- 1.88, N = 3231.55MIN: 48.17 / MAX: 284.151. (CC) gcc options: -pthread -lm

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.1Video Input: Chimera 1080p 10-bitAmpere Altra 160 Cores70140210280350SE +/- 3.63, N = 3298.51MIN: 172.1 / MAX: 368.431. (CC) gcc options: -pthread -lm

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4KAmpere Altra 160 Cores0.28130.56260.84391.12521.4065SE +/- 0.03, N = 91.251. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KAmpere Altra 160 Cores1.19252.3853.57754.775.9625SE +/- 0.06, N = 35.301. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4KAmpere Altra 160 Cores0.7111.4222.1332.8443.555SE +/- 0.09, N = 93.161. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KAmpere Altra 160 Cores612182430SE +/- 0.59, N = 1527.401. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KAmpere Altra 160 Cores816243240SE +/- 0.76, N = 1233.211. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

VP9 libvpx Encoding

Speed: Speed 0 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 0 - Input: Bosphorus 4KAmpere Altra 160 Cores0.450.91.351.82.25SE +/- 0.01, N = 32.001. (CXX) g++ options: -lm -lpthread -march=armv8-a -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 4KAmpere Altra 160 Cores246810SE +/- 0.01, N = 36.241. (CXX) g++ options: -lm -lpthread -march=armv8-a -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1Ampere Altra 160 Cores48121620SE +/- 0.06, N = 313.921. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateAmpere Altra 160 Cores0.92851.8572.78553.7144.6425SE +/- 0.075831, N = 154.1265901. (CC) gcc options: -O3 -march=native -fopenmp

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondAmpere Altra 160 Cores800K1600K2400K3200K4000KSE +/- 47203.92, N = 153537109.251. (CC) gcc options: -O2 -lrt" -lrt

Aircrack-ng

OpenBenchmarking.orgk/s, More Is BetterAircrack-ng 1.5.2Ampere Altra 160 Cores40K80K120K160K200KSE +/- 98.54, N = 3191295.451. (CXX) g++ options: -O3 -fvisibility=hidden -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSAmpere Altra 160 Cores30K60K90K120K150KSE +/- 68.94, N = 31606811. (CC) gcc options: -pedantic -O3

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / MemoryAmpere Altra 160 Cores30060090012001500SE +/- 8.28, N = 31316.171. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASAmpere Altra 160 Cores400800120016002000SE +/- 17.79, N = 320181. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenAmpere Altra 160 Cores400800120016002000SE +/- 22.05, N = 520361. (CXX) g++ options: -flto -pthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total TimeAmpere Altra 160 Cores20M40M60M80M100MSE +/- 2387695.00, N = 151063695071. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthAmpere Altra 160 Cores30M60M90M120M150MSE +/- 540538.10, N = 3117116746

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsAmpere Altra 160 Cores816243240SE +/- 0.30, N = 336.481. (CXX) g++ options: -O3 -pthread -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinAmpere Altra 160 Cores816243240SE +/- 0.26, N = 332.711. (CXX) g++ options: -O3 -pthread -lm

Facebook RocksDB

Test: Random Fill

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Random FillAmpere Altra 160 Cores20K40K60K80K100KSE +/- 904.31, N = 15910021. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Facebook RocksDB

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Update RandomAmpere Altra 160 Cores20K40K60K80K100KSE +/- 1056.58, N = 3895961. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Facebook RocksDB

Test: Sequential Fill

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Sequential FillAmpere Altra 160 Cores16K32K48K64K80KSE +/- 195.00, N = 2758351. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

MariaDB

Clients: 1024

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 1024Ampere Altra 160 Cores20406080100SE +/- 1.49, N = 8981. (CXX) g++ options: -fPIC -pie -fstack-protector -moutline-atomics -O3 -shared -pthread -lrt -lpthread -lz -ldl -lm -lstdc++

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATAmpere Altra 160 Cores80M160M240M320M400MSE +/- 565607.90, N = 3353049502.181. (CC) gcc options: -O3 -march=native -lm

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishAmpere Altra 160 Cores20K40K60K80K100KSE +/- 2941.93, N = 141144581. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5Ampere Altra 160 Cores500K1000K1500K2000K2500KSE +/- 67574.86, N = 1224478331. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500Ampere Altra 160 Cores11K22K33K44K55KSE +/- 142.63, N = 351805.581. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000Ampere Altra 160 Cores11K22K33K44K55KSE +/- 673.66, N = 352597.651. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500Ampere Altra 160 Cores9K18K27K36K45KSE +/- 490.38, N = 1340860.271. (CC) gcc options: -shared -fPIC -O2 -pthread

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000Ampere Altra 160 Cores9K18K27K36K45KSE +/- 307.24, N = 339919.861. (CC) gcc options: -shared -fPIC -O2 -pthread

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57Ampere Altra 160 Cores600M1200M1800M2400M3000MSE +/- 458257.57, N = 327761000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 160 - Buffer Length: 256 - Filter Length: 57Ampere Altra 160 Cores700M1400M2100M2800M3500MSE +/- 2366666.67, N = 334598666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyAmpere Altra 160 Cores40K80K120K160K200KSE +/- 3096.32, N = 122036121. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteAmpere Altra 160 Cores6K12K18K24K30KSE +/- 27.47, N = 3286521. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3Ampere Altra 160 Cores17003400510068008500SE +/- 76.57, N = 57716.911. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyAmpere Altra 160 Cores0.27720.55440.83161.10881.386SE +/- 0.018, N = 121.2321. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyAmpere Altra 160 Cores246810SE +/- 0.007, N = 38.7331. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetAmpere Altra 160 Cores7001400210028003500SE +/- 41.20, N = 123033.84MIN: 2648.28 / MAX: 3543.331. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2Ampere Altra 160 Cores70140210280350SE +/- 3.16, N = 3314.92MIN: 295.62 / MAX: 406.561. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2Ampere Altra 160 Cores20406080100SE +/- 0.03, N = 382.09MIN: 81.93 / MAX: 82.431. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1Ampere Altra 160 Cores60120180240300SE +/- 0.18, N = 3256.32MIN: 255.63 / MAX: 256.961. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 BuckyballAmpere Altra 160 Cores50010001500200025002400.71. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dAmpere Altra 160 Cores2004006008001000SE +/- 6.06, N = 31009.531. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per DirectionAmpere Altra 160 Cores246810SE +/- 0.03160382, N = 36.151569681. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionAmpere Altra 160 Cores816243240SE +/- 0.32, N = 333.891. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2019-03-24Input: Dust 2D tau100.0Ampere Altra 160 Cores40801201602001781. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lrt -lz

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30MAmpere Altra 160 Cores612182430SE +/- 0.31, N = 327.071. (CXX) g++ options: -std=c++11 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

OpenFOAM

Input: Motorbike 60M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60MAmpere Altra 160 Cores70140210280350SE +/- 0.76, N = 3301.491. (CXX) g++ options: -std=c++11 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.8Input: AUSURF112Ampere Altra 160 Cores130260390520650SE +/- 0.49, N = 3585.531. (F9X) gfortran options: -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz

RELION

Test: Basic - Device: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPUAmpere Altra 160 Cores90180270360450SE +/- 1.33, N = 3427.831. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0Ampere Altra 160 Cores306090120150SE +/- 0.16, N = 3130.451. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2Ampere Altra 160 Cores20406080100SE +/- 0.39, N = 374.881. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6Ampere Altra 160 Cores510152025SE +/- 0.20, N = 1521.921. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessAmpere Altra 160 Cores816243240SE +/- 0.27, N = 335.551. (CXX) g++ options: -O3 -fPIC -lm

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To CompileAmpere Altra 160 Cores816243240SE +/- 0.29, N = 336.96

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.4Time To CompileAmpere Altra 160 Cores510152025SE +/- 0.27, N = 1519.71

Timed GDB GNU Debugger Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 10.2Time To CompileAmpere Altra 160 Cores20406080100SE +/- 1.31, N = 1580.95

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileAmpere Altra 160 Cores20406080100SE +/- 1.64, N = 1589.15

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To CompileAmpere Altra 160 Cores612182430SE +/- 0.17, N = 323.54

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.14Time To CompileAmpere Altra 160 Cores1224364860SE +/- 0.46, N = 853.70

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 12.0Build System: NinjaAmpere Altra 160 Cores4080120160200SE +/- 2.01, N = 4164.55

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 12.0Build System: Unix MakefilesAmpere Altra 160 Cores60120180240300SE +/- 3.35, N = 3296.84

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To CompileAmpere Altra 160 Cores306090120150SE +/- 1.73, N = 3147.45

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To CompileAmpere Altra 160 Cores1530456075SE +/- 0.86, N = 1565.93

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To CompileAmpere Altra 160 Cores20406080100SE +/- 1.72, N = 1583.52

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeAmpere Altra 160 Cores3691215SE +/- 0.20, N = 1512.541. (CXX) g++ options: -pipe -O3 -ffast-math -pthread -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number GenerationAmpere Altra 160 Cores1.22762.45523.68284.91046.138SE +/- 0.141, N = 155.4561. (CXX) g++ options: -O3 -lpthread

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveAmpere Altra 160 Cores246810SE +/- 0.063, N = 36.2891. (CXX) g++ options: -fopenmp -O2 -march=native

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeAmpere Altra 160 Cores0.30170.60340.90511.20681.5085SE +/- 0.029, N = 151.3411. (CC) gcc options: -static -fopenmp -O3 -march=native

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7Ampere Altra 160 Cores306090120150SE +/- 0.04, N = 3155.251. (CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7Ampere Altra 160 Cores60120180240300SE +/- 1.62, N = 3278.841. (CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5Ampere Altra 160 Cores246810SE +/- 0.033, N = 37.8311. (CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless CompressionAmpere Altra 160 Cores100200300400500SE +/- 0.41, N = 3440.581. (CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WRF

Input: conus 2.5km

OpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5kmAmpere Altra 160 Cores6K12K18K24K30K28840.001. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.83.5Blend File: BMW27 - Compute: CPU-OnlyAmpere Altra 160 Cores1020304050SE +/- 0.66, N = 1542.12

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.83.5Blend File: Classroom - Compute: CPU-OnlyAmpere Altra 160 Cores1428425670SE +/- 0.03, N = 364.98

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.83.5Blend File: Fishy Cat - Compute: CPU-OnlyAmpere Altra 160 Cores20406080100SE +/- 0.49, N = 393.85

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.83.5Blend File: Barbershop - Compute: CPU-OnlyAmpere Altra 160 Cores50100150200250SE +/- 3.38, N = 9218.71

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.83.5Blend File: Pabellon Barcelona - Compute: CPU-OnlyAmpere Altra 160 Cores306090120150SE +/- 1.14, N = 3148.70


Phoronix Test Suite v10.8.4