Ampere Altra September 2021 Ampere Altra ARMv8 Neoverse-N1 testing with a WIWYNN Mt.Jade (1.1.20201019 BIOS) and ASPEED on Ubuntu 21.04 via the Phoronix Test Suite. Ampere Altra 160 Cores: Processor: Ampere Altra ARMv8 Neoverse-N1 @ 3.30GHz (160 Cores), Motherboard: WIWYNN Mt.Jade (1.1.20201019 BIOS), Chipset: Ampere Computing LLC Altra PCI Root Complex A, Memory: 502GB, Disk: 3841GB Micron_9300_MTFDHAL3T8TDP + 960GB SAMSUNG MZ1LB960HAJQ-00007, Graphics: ASPEED, Monitor: VE228, Network: Mellanox MT28908 + Intel I210 OS: Ubuntu 21.04, Kernel: 5.11.0-25-generic (aarch64), Vulkan: 1.0.2, Compiler: GCC 10.3.0, File-System: ext4, Screen Resolution: 1920x1080 High Performance Conjugate Gradient 3.1 GFLOP/s > Higher Is Better Ampere Altra 160 Cores . 13.92 |=============================================== LeelaChessZero 0.28 Backend: BLAS Nodes Per Second > Higher Is Better Ampere Altra 160 Cores . 2018 |================================================ LeelaChessZero 0.28 Backend: Eigen Nodes Per Second > Higher Is Better Ampere Altra 160 Cores . 2036 |================================================ Algebraic Multi-Grid Benchmark 1.2 Figure Of Merit > Higher Is Better Ampere Altra 160 Cores . 1265829000 |========================================== NWChem 7.0.2 Input: C240 Buckyball Seconds < Lower Is Better Ampere Altra 160 Cores . 2400.7 |============================================== Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d Seconds < Lower Is Better Ampere Altra 160 Cores . 1009.53 |============================================= Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction Seconds < Lower Is Better Ampere Altra 160 Cores . 6.15156968 |========================================== Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Seconds < Lower Is Better Ampere Altra 160 Cores . 33.89 |=============================================== Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 Seconds < Lower Is Better Ampere Altra 160 Cores . 178 |================================================= OpenFOAM 8 Input: Motorbike 30M Seconds < Lower Is Better Ampere Altra 160 Cores . 27.07 |=============================================== OpenFOAM 8 Input: Motorbike 60M Seconds < Lower Is Better Ampere Altra 160 Cores . 301.49 |============================================== Quantum ESPRESSO 6.8 Input: AUSURF112 Seconds < Lower Is Better Ampere Altra 160 Cores . 585.53 |============================================== RELION 3.1.1 Test: Basic - Device: CPU Seconds < Lower Is Better Ampere Altra 160 Cores . 427.83 |============================================== LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms ns/day > Higher Is Better Ampere Altra 160 Cores . 36.48 |=============================================== LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein ns/day > Higher Is Better Ampere Altra 160 Cores . 32.71 |=============================================== LULESH 2.0.3 z/s > Higher Is Better Ampere Altra 160 Cores . 7716.91 |============================================= John The Ripper 1.9.0-jumbo-1 Test: Blowfish Real C/S > Higher Is Better Ampere Altra 160 Cores . 114458 |============================================== John The Ripper 1.9.0-jumbo-1 Test: MD5 Real C/S > Higher Is Better Ampere Altra 160 Cores . 2447833 |============================================= dav1d 0.9.1 Video Input: Summer Nature 4K FPS > Higher Is Better Ampere Altra 160 Cores . 231.55 |============================================== dav1d 0.9.1 Video Input: Chimera 1080p 10-bit FPS > Higher Is Better Ampere Altra 160 Cores . 298.51 |============================================== AOM AV1 3.1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K Frames Per Second > Higher Is Better Ampere Altra 160 Cores . 1.25 |================================================ AOM AV1 3.1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K Frames Per Second > Higher Is Better Ampere Altra 160 Cores . 5.30 |================================================ AOM AV1 3.1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K Frames Per Second > Higher Is Better Ampere Altra 160 Cores . 3.16 |================================================ AOM AV1 3.1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K Frames Per Second > Higher Is Better Ampere Altra 160 Cores . 27.40 |=============================================== AOM AV1 3.1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K Frames Per Second > Higher Is Better Ampere Altra 160 Cores . 33.21 |=============================================== VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 4K Frames Per Second > Higher Is Better Ampere Altra 160 Cores . 2.00 |================================================ VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K Frames Per Second > Higher Is Better Ampere Altra 160 Cores . 6.24 |================================================ ACES DGEMM 1.0 Sustained Floating-Point Rate GFLOP/s > Higher Is Better Ampere Altra 160 Cores . 4.126590 |============================================ Coremark 1.0 CoreMark Size 666 - Iterations Per Second Iterations/Sec > Higher Is Better Ampere Altra 160 Cores . 3537109.25 |========================================== Stockfish 13 Total Time Nodes Per Second > Higher Is Better Ampere Altra 160 Cores . 106369507 |=========================================== asmFish 2018-07-23 1024 Hash Memory, 26 Depth Nodes/second > Higher Is Better Ampere Altra 160 Cores . 117116746 |=========================================== libavif avifenc 0.9.0 Encoder Speed: 0 Seconds < Lower Is Better Ampere Altra 160 Cores . 130.45 |============================================== libavif avifenc 0.9.0 Encoder Speed: 2 Seconds < Lower Is Better Ampere Altra 160 Cores . 74.88 |=============================================== libavif avifenc 0.9.0 Encoder Speed: 6 Seconds < Lower Is Better Ampere Altra 160 Cores . 21.92 |=============================================== libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Seconds < Lower Is Better Ampere Altra 160 Cores . 35.55 |=============================================== Timed Apache Compilation 2.4.41 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 36.96 |=============================================== Timed FFmpeg Compilation 4.4 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 19.71 |=============================================== Timed GDB GNU Debugger Compilation 10.2 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 80.95 |=============================================== Timed Godot Game Engine Compilation 3.2.3 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 89.15 |=============================================== Timed ImageMagick Compilation 6.9.0 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 23.54 |=============================================== Timed Linux Kernel Compilation 5.14 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 53.70 |=============================================== Timed LLVM Compilation 12.0 Build System: Ninja Seconds < Lower Is Better Ampere Altra 160 Cores . 164.55 |============================================== Timed LLVM Compilation 12.0 Build System: Unix Makefiles Seconds < Lower Is Better Ampere Altra 160 Cores . 296.84 |============================================== Timed Node.js Compilation 15.11 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 147.45 |============================================== Timed PHP Compilation 7.4.2 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 65.93 |=============================================== Build2 0.13 Time To Compile Seconds < Lower Is Better Ampere Altra 160 Cores . 83.52 |=============================================== POV-Ray 3.7.0.7 Trace Time Seconds < Lower Is Better Ampere Altra 160 Cores . 12.54 |=============================================== Primesieve 7.4 1e12 Prime Number Generation Seconds < Lower Is Better Ampere Altra 160 Cores . 5.456 |=============================================== m-queens 1.2 Time To Solve Seconds < Lower Is Better Ampere Altra 160 Cores . 6.289 |=============================================== N-Queens 1.0 Elapsed Time Seconds < Lower Is Better Ampere Altra 160 Cores . 1.341 |=============================================== WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 Seconds < Lower Is Better Ampere Altra 160 Cores . 155.25 |============================================== WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 Seconds < Lower Is Better Ampere Altra 160 Cores . 278.84 |============================================== WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 Seconds < Lower Is Better Ampere Altra 160 Cores . 7.831 |=============================================== WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression Seconds < Lower Is Better Ampere Altra 160 Cores . 440.58 |============================================== Aircrack-ng 1.5.2 k/s > Higher Is Better Ampere Altra 160 Cores . 191295.45 |=========================================== SecureMark 1.0.4 Benchmark: SecureMark-TLS marks > Higher Is Better Ampere Altra 160 Cores . 160681 |============================================== Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better Ampere Altra 160 Cores . 2776100000 |========================================== Liquid-DSP 2021.01.31 Threads: 160 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better Ampere Altra 160 Cores . 3459866667 |========================================== MariaDB 10.6.4 Clients: 1024 Queries Per Second > Higher Is Better Ampere Altra 160 Cores . 98 |================================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only TPS > Higher Is Better Ampere Altra 160 Cores . 203612 |============================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency ms < Lower Is Better Ampere Altra 160 Cores . 1.232 |=============================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write TPS > Higher Is Better Ampere Altra 160 Cores . 28652 |=============================================== PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency ms < Lower Is Better Ampere Altra 160 Cores . 8.733 |=============================================== Stress-NG 0.11.07 Test: MMAP Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 627.73 |============================================== Stress-NG 0.11.07 Test: NUMA Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 218.61 |============================================== Stress-NG 0.11.07 Test: MEMFD Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 897.25 |============================================== Stress-NG 0.11.07 Test: Atomic Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 6694.19 |============================================= Stress-NG 0.11.07 Test: Crypto Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 37959.55 |============================================ Stress-NG 0.11.07 Test: Malloc Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 1849557417.20 |======================================= Stress-NG 0.11.07 Test: Forking Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 10469.64 |============================================ Stress-NG 0.11.07 Test: SENDFILE Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 2283406.20 |========================================== Stress-NG 0.11.07 Test: CPU Cache Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 2363.23 |============================================= Stress-NG 0.11.07 Test: CPU Stress Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 25351.54 |============================================ Stress-NG 0.11.07 Test: Semaphores Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 11392704.92 |========================================= Stress-NG 0.11.07 Test: Matrix Math Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 835662.63 |=========================================== Stress-NG 0.11.07 Test: Vector Math Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 1157056.26 |========================================== Stress-NG 0.11.07 Test: Memory Copying Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 10969.19 |============================================ Stress-NG 0.11.07 Test: Socket Activity Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 17019.97 |============================================ Stress-NG 0.11.07 Test: Context Switching Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 30369754.19 |========================================= Stress-NG 0.11.07 Test: Glibc C String Functions Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 12530726.36 |========================================= Stress-NG 0.11.07 Test: Glibc Qsort Data Sorting Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 1723.28 |============================================= Stress-NG 0.11.07 Test: System V Message Passing Bogo Ops/s > Higher Is Better Ampere Altra 160 Cores . 3612912.57 |========================================== WRF 4.2.2 Input: conus 2.5km Seconds < Lower Is Better Ampere Altra 160 Cores . 28840.00 |============================================ TNN 0.3 Target: CPU - Model: DenseNet ms < Lower Is Better Ampere Altra 160 Cores . 3033.84 |============================================= TNN 0.3 Target: CPU - Model: MobileNet v2 ms < Lower Is Better Ampere Altra 160 Cores . 314.92 |============================================== TNN 0.3 Target: CPU - Model: SqueezeNet v2 ms < Lower Is Better Ampere Altra 160 Cores . 82.09 |=============================================== TNN 0.3 Target: CPU - Model: SqueezeNet v1.1 ms < Lower Is Better Ampere Altra 160 Cores . 256.32 |============================================== Sysbench 1.0.20 Test: RAM / Memory MiB/sec > Higher Is Better Ampere Altra 160 Cores . 1316.17 |============================================= Sysbench 1.0.20 Test: CPU Events Per Second > Higher Is Better Ampere Altra 160 Cores . 586141.72 |=========================================== Facebook RocksDB 6.22.1 Test: Random Fill Op/s > Higher Is Better Ampere Altra 160 Cores . 91002 |=============================================== Facebook RocksDB 6.22.1 Test: Update Random Op/s > Higher Is Better Ampere Altra 160 Cores . 89596 |=============================================== Facebook RocksDB 6.22.1 Test: Sequential Fill Op/s > Higher Is Better Ampere Altra 160 Cores . 75835 |=============================================== Blender 2.83.5 Blend File: BMW27 - Compute: CPU-Only Seconds < Lower Is Better Ampere Altra 160 Cores . 42.12 |=============================================== Blender 2.83.5 Blend File: Classroom - Compute: CPU-Only Seconds < Lower Is Better Ampere Altra 160 Cores . 64.98 |=============================================== Blender 2.83.5 Blend File: Fishy Cat - Compute: CPU-Only Seconds < Lower Is Better Ampere Altra 160 Cores . 93.85 |=============================================== Blender 2.83.5 Blend File: Barbershop - Compute: CPU-Only Seconds < Lower Is Better Ampere Altra 160 Cores . 218.71 |============================================== Blender 2.83.5 Blend File: Pabellon Barcelona - Compute: CPU-Only Seconds < Lower Is Better Ampere Altra 160 Cores . 148.70 |============================================== Hierarchical INTegration 1.0 Test: FLOAT QUIPs > Higher Is Better Ampere Altra 160 Cores . 353049502.18 |======================================== nginx 1.21.1 Concurrent Requests: 500 Requests Per Second > Higher Is Better Ampere Altra 160 Cores . 51805.58 |============================================ nginx 1.21.1 Concurrent Requests: 1000 Requests Per Second > Higher Is Better Ampere Altra 160 Cores . 52597.65 |============================================ Apache HTTP Server 2.4.48 Concurrent Requests: 500 Requests Per Second > Higher Is Better Ampere Altra 160 Cores . 40860.27 |============================================ Apache HTTP Server 2.4.48 Concurrent Requests: 1000 Requests Per Second > Higher Is Better Ampere Altra 160 Cores . 39919.86 |============================================