Graviton3 Neoverse-V1 Compiler Tests

Benchmarks by Michael Larabel for a future article. amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2206081-NE-ARMSVE03321&sor.

Graviton3 Neoverse-V1 Compiler TestsProcessorMotherboardChipsetMemoryDiskNetworkOSKernelCompilerFile-SystemSystem Layer-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1ARMv8 Neoverse-V1 (32 Cores)Amazon EC2 c7g.8xlarge (1.0 BIOS)Amazon Device 020062GB301GB Amazon Elastic Block StoreAmazon ElasticUbuntu 22.045.15.0-1004-aws (aarch64)GCC 12.0.0 20220117ext4amazonOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- -march=armv8.4-a: CXXFLAGS="-O3 -march=armv8.4-a" CFLAGS="-O3 -march=armv8.4-a"- -march=armv8.4-a+sve: CXXFLAGS="-O3 -march=armv8.4-a+sve" CFLAGS="-O3 -march=armv8.4-a+sve"- -march=armv8.4-a+sve -mcpu=neoverse-v1: CXXFLAGS="-O3 -march=armv8.4-a+sve -mcpu=neoverse-v1" CFLAGS="-O3 -march=armv8.4-a+sve -mcpu=neoverse-v1"Compiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-nls --disable-werror --enable-checking=yes,extra,rtl --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-objc-gc=auto --enable-plugin --enable-shared --host=aarch64-linux-gnu --program-prefix= --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Python Details- Python 3.10.4Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

Graviton3 Neoverse-V1 Compiler Testscryptopp: Unkeyed Algorithmslczero: BLASlczero: Eigenmrbayes: Primate Phylogeny Analysislammps: Rhodopsin Proteinwebp: Quality 100, Losslesswebp: Quality 100, Highest Compressiongmpbench: Total Timexmrig: Monero - 1Mxmrig: Wownero - 1Mcompress-zstd: 3 - Compression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedjpegxl: PNG - 7jpegxl: PNG - 8jpegxl: JPEG - 7jpegxl: JPEG - 8nettle: aes256nettle: chachanettle: sha512nettle: poly1305-aesluajit: Compositeluajit: Monte Carloluajit: Fast Fourier Transformluajit: Sparse Matrix Multiplyluajit: Dense LU Matrix Factorizationluajit: Jacobi Successive Over-Relaxationbotan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spaceaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 10 Realtime - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 1080paom-av1: Speed 9 Realtime - Bosphorus 1080paom-av1: Speed 10 Realtime - Bosphorus 1080px264: Bosphorus 4Kx264: Bosphorus 1080pmt-dgemm: Sustained Floating-Point Ratecoremark: CoreMark Size 666 - Iterations Per Secondhimeno: Poisson Pressure Solverstockfish: Total Timestargate: 44100 - 512stargate: 96000 - 512stargate: 44100 - 1024stargate: 480000 - 512stargate: 96000 - 1024stargate: 480000 - 1024c-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timeprimesieve: 1e12 Prime Number Generationsmallpt: Global Illumination Renderer; 128 Samplesaobench: 2048 x 2048 - Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3encode-opus: WAV To Opus Encodeespeak: Text-To-Speech Synthesisngspice: C2670ngspice: C7552rnnoise: openjpeg: NASA Curiosity Panorama M34openssl: SHA256openssl: RSA4096openssl: RSA4096liquid-dsp: 8 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 32 - 256 - 57gromacs: MPI CPU - water_GMX50_bareastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivedraco: Liondraco: Church Facaderedis: GETredis: SETcaffe: AlexNet - CPU - 200caffe: GoogleNet - CPU - 200tnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1sysbench: CPUonnx: GPT-2 - CPU - Standardonnx: bertsquad-12 - CPU - Standardonnx: fcn-resnet101-11 - CPU - Standardonnx: ArcFace ResNet-100 - CPU - Standardonnx: super-resolution-10 - CPU - Standardencode-wavpack: WAV To WavPackkripke: -march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1459.87016312811311237.54221.33023.8528.6304152.38645.411811.26937.872.93083.41241.33820.840.03250.98.320.6773.2126.304435.91740.25498.83871.901282.59343.27661.551151.573355.53901.0262.01762.2775494.3135477.571239.703246.155278.874288.505108.786108.599389.375382.5141225577732233949497862.1361.88120.13152.46190.2748.43168.9212.813927789646.9248005561.559287574856806.0729164.4140556.3700356.0053864.7298486.32268419.29619.8478.4383.89533.48138.3118.05418.32036.587102.558103.93317.62257205276039435705090.5356359.61763633333527000007052333332.2774.88339.143535.3647535479352523377.921865840.13436341238072730.40260.77871.130257.70496726.401236477373938541320.488204143167449.02420812971333234.25521.14923.4038.6404155.68669.811877.87027.274.03094.81242.73824.840.33263.78.360.6779.5827.344447.04733.59504.33820.511309.03343.85615.711162.333521.13902.1662.0062.2615442.6495474.321248.887258.148280.570289.032108.754108.623390.311383.94512726117182414515106762.1965.62123.95156.71193.6848.51169.5813.442233762066.5011635508.282435558233406.2135004.4748486.5381636.1244554.8077976.45018219.29920.2638.5333.89633.49438.5157.44014.40229.982106.910111.64417.38555196274281768805088.1356407.81677333333354233336686366672.2754.80929.013135.1926530978432513289.21861924.13439311251252346.322280.24376.301205.79996666.761231777273935541120.515192709233270.93160912801337241.53921.01823.8458.6024152.78681.411842.06938.373.03167.51243.83882.339.03339.78.070.6778.8027.094438.27731.07481.90859.931303.89343.90668.401164.283547.90902.1465.29267.5115415.1945409.815244.209246.611280.989283.410109.290109.108385.377378.97612575917412402509101661.5663.25124.22156.63193.5548.56169.3013.348870798137.7084065538.739912599667856.0686504.4124876.3601385.9816624.7297566.28800619.55519.6118.4353.89233.55338.6017.44614.38029.985104.095106.76717.29055415276812906005114.1355966.71506733333013300006024433332.2794.72908.942734.6761529777972546595.581879962422621204282390.259273.39576.214205.28096702.811246077473938541620.763194776367OpenBenchmarking.org

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed Algorithms-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1100200300400500SE +/- 0.25, N = 3SE +/- 0.25, N = 3SE +/- 1.67, N = 3459.87449.02270.93-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CXX) g++ options: -O3 -fPIC -pthread -pipe

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLAS-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v130060090012001500SE +/- 6.96, N = 3SE +/- 13.99, N = 5SE +/- 16.50, N = 3129712811280-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11. (CXX) g++ options: -flto -O3 -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: Eigen-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a30060090012001500SE +/- 3.18, N = 3SE +/- 14.64, N = 5SE +/- 13.65, N = 3133713331311-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -flto -O3 -pthread

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v150100150200250SE +/- 0.18, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3234.26237.54241.54-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -O3 -std=c99 -pedantic -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1510152025SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 321.3321.1521.02-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CXX) g++ options: -O3 -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a612182430SE +/- 0.18, N = 3SE +/- 0.23, N = 3SE +/- 0.01, N = 323.4023.8523.85-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -fvisibility=hidden -O3 -lm -ljpeg -lpng16 -ltiff

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve246810SE +/- 0.006, N = 3SE +/- 0.006, N = 3SE +/- 0.017, N = 38.6028.6308.640-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CC) gcc options: -fvisibility=hidden -O3 -lm -ljpeg -lpng16 -ltiff

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a90018002700360045004155.64152.74152.3-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -O3 -lm

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1M-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a2K4K6K8K10KSE +/- 3.15, N = 3SE +/- 6.18, N = 3SE +/- 9.56, N = 38681.48669.88645.4-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3 -fexceptions -fno-rtti -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1M-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a3K6K9K12K15KSE +/- 26.88, N = 3SE +/- 0.35, N = 3SE +/- 27.59, N = 311877.811842.011811.2-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CXX) g++ options: -O3 -fexceptions -fno-rtti -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speed-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a15003000450060007500SE +/- 15.42, N = 3SE +/- 9.61, N = 3SE +/- 14.45, N = 37027.26938.36937.8-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression Speed-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1632486480SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.23, N = 374.073.072.9-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression Speed-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a7001400210028003500SE +/- 8.27, N = 3SE +/- 6.60, N = 3SE +/- 8.46, N = 33167.53094.83083.4-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Compression Speed-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a30060090012001500SE +/- 4.22, N = 3SE +/- 4.88, N = 3SE +/- 5.37, N = 31243.81242.71241.3-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Decompression Speed-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a8001600240032004000SE +/- 40.64, N = 3SE +/- 1.28, N = 3SE +/- 3.95, N = 33882.33824.83820.8-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speed-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1918273645SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 340.340.039.0-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression Speed-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a7001400210028003500SE +/- 5.09, N = 3SE +/- 0.59, N = 3SE +/- 7.62, N = 33339.73263.73250.9-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CC) gcc options: -O3 -pthread -lz -llzma

JPEG XL libjxl

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: PNG - Encode Speed: 7-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.368.328.07-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11. (CXX) g++ options: -O3 -funwind-tables -O2 -fPIE -pie

JPEG XL libjxl

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: PNG - Encode Speed: 8-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a0.15080.30160.45240.60320.754SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.670.670.67-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3 -funwind-tables -O2 -fPIE -pie

JPEG XL libjxl

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: JPEG - Encode Speed: 7-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a20406080100SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.13, N = 379.5878.8073.21-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CXX) g++ options: -O3 -funwind-tables -O2 -fPIE -pie

JPEG XL libjxl

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: JPEG - Encode Speed: 8-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a612182430SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 327.3427.0926.30-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CXX) g++ options: -O3 -funwind-tables -O2 -fPIE -pie

Nettle

Test: aes256

OpenBenchmarking.orgMbyte/s, More Is BetterNettle 3.8Test: aes256-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a10002000300040005000SE +/- 0.39, N = 3SE +/- 0.72, N = 3SE +/- 3.11, N = 34447.044438.274435.91-march=armv8.4-a+sve -lgmp - MIN: 3925.11 / MAX: 5627.84-march=armv8.4-a+sve -mcpu=neoverse-v1 - MIN: 3923.68 / MAX: 5627.25-march=armv8.4-a -lgmp - MIN: 3927.32 / MAX: 5628.861. (CC) gcc options: -O3 -ggdb3 -lnettle -lm -lcrypto

Nettle

Test: chacha

OpenBenchmarking.orgMbyte/s, More Is BetterNettle 3.8Test: chacha-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1160320480640800SE +/- 0.61, N = 3SE +/- 0.55, N = 3SE +/- 0.15, N = 3740.25733.59731.07-march=armv8.4-a -lgmp - MIN: 454.21 / MAX: 956.53-march=armv8.4-a+sve -lgmp - MIN: 442.26 / MAX: 956.22-march=armv8.4-a+sve -mcpu=neoverse-v1 - MIN: 446.51 / MAX: 951.281. (CC) gcc options: -O3 -ggdb3 -lnettle -lm -lcrypto

Nettle

Test: sha512

OpenBenchmarking.orgMbyte/s, More Is BetterNettle 3.8Test: sha512-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1110220330440550SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3504.33498.83481.90-march=armv8.4-a+sve -lgmp-march=armv8.4-a -lgmp-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -O3 -ggdb3 -lnettle -lm -lcrypto

Nettle

Test: poly1305-aes

OpenBenchmarking.orgMbyte/s, More Is BetterNettle 3.8Test: poly1305-aes-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve2004006008001000SE +/- 1.52, N = 3SE +/- 0.05, N = 3SE +/- 5.37, N = 3871.90859.93820.51-march=armv8.4-a -lgmp-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve -lgmp1. (CC) gcc options: -O3 -ggdb3 -lnettle -lm -lcrypto

LuaJIT

Test: Composite

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Composite-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a30060090012001500SE +/- 18.19, N = 3SE +/- 13.31, N = 6SE +/- 0.41, N = 31309.031303.891282.59-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Monte Carlo-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a70140210280350SE +/- 0.56, N = 3SE +/- 0.54, N = 3SE +/- 0.35, N = 3343.90343.85343.27-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Fast Fourier Transform-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve140280420560700SE +/- 0.07, N = 3SE +/- 0.39, N = 3SE +/- 10.69, N = 3668.40661.55615.71-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Sparse Matrix Multiply-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a30060090012001500SE +/- 7.84, N = 3SE +/- 7.20, N = 3SE +/- 3.14, N = 31164.281162.331151.57-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Dense LU Matrix Factorization-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a8001600240032004000SE +/- 93.82, N = 3SE +/- 86.66, N = 3SE +/- 6.02, N = 33547.903521.133355.53-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Jacobi Successive Over-Relaxation-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a2004006008001000SE +/- 1.00, N = 3SE +/- 1.01, N = 3SE +/- 0.68, N = 3902.16902.14901.02-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1530456075SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 365.2962.0262.001. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decrypt-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1530456075SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 367.5162.2862.261. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v112002400360048006000SE +/- 14.75, N = 3SE +/- 9.30, N = 3SE +/- 3.07, N = 35494.315442.655415.191. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decrypt-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v112002400360048006000SE +/- 5.58, N = 3SE +/- 8.64, N = 3SE +/- 1.75, N = 35477.575474.325409.821. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a50100150200250SE +/- 0.26, N = 3SE +/- 0.12, N = 3SE +/- 0.12, N = 3248.89244.21239.701. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decrypt-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a60120180240300SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.20, N = 3258.15246.61246.161. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a60120180240300SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.29, N = 3280.99280.57278.871. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decrypt-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v160120180240300SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3289.03288.51283.411. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve20406080100SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3109.29108.79108.751. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decrypt-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a20406080100SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3109.11108.62108.601. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v180160240320400SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3390.31389.38385.381. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decrypt-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v180160240320400SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3383.95382.51378.981. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Swirl-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a30060090012001500SE +/- 1.33, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3127212571225-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -fopenmp -O3 -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a130260390520650SE +/- 1.20, N = 3SE +/- 0.67, N = 3SE +/- 0.00, N = 3611591577-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -fopenmp -O3 -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve160320480640800SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3741732718-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CC) gcc options: -fopenmp -O3 -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a5001000150020002500SE +/- 1.86, N = 3SE +/- 0.33, N = 3SE +/- 22.36, N = 3241424022339-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -fopenmp -O3 -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussian-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a110220330440550SE +/- 0.58, N = 3SE +/- 0.00, N = 3SE +/- 0.67, N = 3515509494-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -fopenmp -O3 -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Space-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a2004006008001000SE +/- 0.33, N = 3SE +/- 1.00, N = 3SE +/- 1.00, N = 310671016978-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -fopenmp -O3 -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.3Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11428425670SE +/- 0.61, N = 3SE +/- 0.43, N = 3SE +/- 0.62, N = 662.1962.1361.56-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

AOM AV1

Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.3Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1530456075SE +/- 0.22, N = 3SE +/- 0.29, N = 3SE +/- 0.47, N = 365.6263.2561.88-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.3Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a306090120150SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3124.22123.95120.13-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.3Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a306090120150SE +/- 0.20, N = 3SE +/- 0.12, N = 3SE +/- 0.03, N = 3156.71156.63152.46-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

AOM AV1

Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.3Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a4080120160200SE +/- 0.20, N = 3SE +/- 0.17, N = 3SE +/- 0.28, N = 3193.68193.55190.27-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

x264

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2022-02-22Video Input: Bosphorus 4K-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1122334455SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 348.5648.5148.431. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -lm -lpthread -O3 -flto

x264

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2022-02-22Video Input: Bosphorus 1080p-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a4080120160200SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3169.58169.30168.921. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -lm -lpthread -O3 -flto

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rate-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a3691215SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 313.4413.3512.81-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -O3 -march=native -fopenmp

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve200K400K600K800K1000KSE +/- 108.80, N = 3SE +/- 169.09, N = 3SE +/- 416.47, N = 3798137.71789646.92762066.50-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CC) gcc options: -O2 -O3 -lrt" -lrt

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve12002400360048006000SE +/- 2.66, N = 3SE +/- 13.44, N = 3SE +/- 5.76, N = 35561.565538.745508.28-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve1. (CC) gcc options: -O3

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve13M26M39M52M65MSE +/- 655325.68, N = 15SE +/- 721518.45, N = 14SE +/- 645132.01, N = 3599667855748568055823340-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -lgcov -lpthread -O3 -fno-exceptions -std=c++17 -pedantic -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 21.10.9Sample Rate: 44100 - Buffer Size: 512-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1246810SE +/- 0.003428, N = 3SE +/- 0.002564, N = 3SE +/- 0.002556, N = 36.2135006.0729166.0686501. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 21.10.9Sample Rate: 96000 - Buffer Size: 512-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11.00682.01363.02044.02725.034SE +/- 0.002191, N = 3SE +/- 0.003502, N = 3SE +/- 0.001339, N = 34.4748484.4140554.4124871. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 44100 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 21.10.9Sample Rate: 44100 - Buffer Size: 1024-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1246810SE +/- 0.002411, N = 3SE +/- 0.002515, N = 3SE +/- 0.002583, N = 36.5381636.3700356.3601381. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 480000 - Buffer Size: 512

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 21.10.9Sample Rate: 480000 - Buffer Size: 512-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1246810SE +/- 0.002030, N = 3SE +/- 0.001956, N = 3SE +/- 0.002132, N = 36.1244556.0053865.9816621. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 21.10.9Sample Rate: 96000 - Buffer Size: 1024-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11.08182.16363.24544.32725.409SE +/- 0.002826, N = 3SE +/- 0.000903, N = 3SE +/- 0.000548, N = 34.8077974.7298484.7297561. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Stargate Digital Audio Workstation

Sample Rate: 480000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 21.10.9Sample Rate: 480000 - Buffer Size: 1024-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1246810SE +/- 0.002250, N = 3SE +/- 0.002118, N = 3SE +/- 0.001046, N = 36.4501826.3226846.2880061. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1510152025SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 319.3019.3019.56-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -lm -lpthread -O3

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Time-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 319.6119.8520.26-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -pipe -O3 -ffast-math -R/usr/lib -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.71e12 Prime Number Generation-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve246810SE +/- 0.022, N = 3SE +/- 0.022, N = 3SE +/- 0.043, N = 38.4358.4388.533-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -O3

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve0.87661.75322.62983.50644.383SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 33.8923.8953.896-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -fopenmp -O3

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1816243240SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 333.4833.4933.55-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -lm -O3

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.3WAV To FLAC-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1918273645SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 538.3138.5238.60-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a246810SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.004, N = 37.4407.4468.054-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a510152025SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.00, N = 514.3814.4018.32-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a816243240SE +/- 0.30, N = 20SE +/- 0.28, N = 20SE +/- 0.31, N = 1629.9829.9936.59-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CC) gcc options: -O3 -std=c99

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve20406080100SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.18, N = 3102.56104.10106.91-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve1. (CC) gcc options: -O3 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve20406080100SE +/- 0.51, N = 3SE +/- 1.20, N = 3SE +/- 1.04, N = 3103.93106.77111.64-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve1. (CC) gcc options: -O3 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

RNNoise

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a48121620SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 317.2917.3917.62-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CC) gcc options: -O3 -pedantic -fvisibility=hidden

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenBenchmarking.orgms, Fewer Is BetterOpenJPEG 2.4Encode: NASA Curiosity Panorama M34-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a12K24K36K48K60KSE +/- 89.48, N = 3SE +/- 7.80, N = 3SE +/- 19.06, N = 3551965541557205-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a1. (CXX) g++ options: -O3 -rdynamic

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve6000M12000M18000M24000M30000MSE +/- 24491056.96, N = 3SE +/- 25639974.71, N = 3SE +/- 32102278.66, N = 3276812906002760394357027428176880-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve11002200330044005500SE +/- 4.19, N = 3SE +/- 0.78, N = 3SE +/- 0.53, N = 35114.15090.55088.1-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v180K160K240K320K400KSE +/- 10.52, N = 3SE +/- 8.85, N = 3SE +/- 38.59, N = 3356407.8356359.6355966.7-march=armv8.4-a+sve-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v140M80M120M160M200MSE +/- 12018.50, N = 3SE +/- 26666.67, N = 3SE +/- 12018.50, N = 3176363333167733333150673333-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v180M160M240M320M400MSE +/- 30550.50, N = 3SE +/- 20275.88, N = 3SE +/- 64291.01, N = 3352700000335423333301330000-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1150M300M450M600M750MSE +/- 125476.87, N = 3SE +/- 1978807.16, N = 3SE +/- 16666.67, N = 3705233333668636667602443333-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bare-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve0.51281.02561.53842.05122.564SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 32.2792.2772.275-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -O3

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Medium-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1.09872.19743.29614.39485.4935SE +/- 0.0099, N = 3SE +/- 0.0080, N = 3SE +/- 0.0125, N = 34.72904.80924.8833-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Thorough-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a3691215SE +/- 0.0034, N = 3SE +/- 0.0027, N = 3SE +/- 0.0046, N = 38.94279.01319.1435-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Exhaustive-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a816243240SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 334.6835.1935.36-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3 -flto -pthread

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: Lion-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a11002200330044005500SE +/- 7.84, N = 3SE +/- 2.40, N = 3SE +/- 2.65, N = 3529753095354-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: Church Facade-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a2K4K6K8K10KSE +/- 26.24, N = 3SE +/- 7.00, N = 3SE +/- 6.64, N = 3779778437935-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve500K1000K1500K2000K2500KSE +/- 8325.87, N = 3SE +/- 1605.36, N = 3SE +/- 9056.40, N = 32546595.582523377.922513289.20-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve400K800K1200K1600K2000KSE +/- 1178.58, N = 3SE +/- 794.78, N = 3SE +/- 7427.93, N = 31879962.001865840.131861924.13-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 200-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve9K18K27K36K45KSE +/- 6.51, N = 3SE +/- 31.22, N = 3SE +/- 12.55, N = 3422624363443931-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -O3 -fPIC -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 200-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve30K60K90K120K150KSE +/- 69.18, N = 3SE +/- 49.72, N = 3SE +/- 105.70, N = 3120428123807125125-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -O3 -fPIC -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNet-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a6001200180024003000SE +/- 22.16, N = 3SE +/- 24.72, N = 3SE +/- 12.63, N = 32346.322390.262730.40-march=armv8.4-a+sve - MIN: 2268.4 / MAX: 2446.6-march=armv8.4-a+sve -mcpu=neoverse-v1 - MIN: 2289.32 / MAX: 2501.92-march=armv8.4-a - MIN: 2665.42 / MAX: 2834.731. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve60120180240300SE +/- 0.10, N = 3SE +/- 0.24, N = 3SE +/- 0.69, N = 3260.78273.40280.24-march=armv8.4-a - MIN: 259.13 / MAX: 262.38-march=armv8.4-a+sve -mcpu=neoverse-v1 - MIN: 272.22 / MAX: 274.95-march=armv8.4-a+sve - MIN: 277.9 / MAX: 282.311. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve20406080100SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 371.1376.2176.30-march=armv8.4-a - MIN: 70.76 / MAX: 71.58-march=armv8.4-a+sve -mcpu=neoverse-v1 - MIN: 75.89 / MAX: 76.52-march=armv8.4-a+sve - MIN: 76.07 / MAX: 76.531. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a60120180240300SE +/- 0.25, N = 3SE +/- 0.14, N = 3SE +/- 0.08, N = 3205.28205.80257.70-march=armv8.4-a+sve -mcpu=neoverse-v1 - MIN: 204.74 / MAX: 206.1-march=armv8.4-a+sve - MIN: 205.46 / MAX: 206.28-march=armv8.4-a - MIN: 256.95 / MAX: 258.391. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve20K40K60K80K100KSE +/- 8.50, N = 3SE +/- 7.82, N = 3SE +/- 2.72, N = 396726.4096702.8196666.76-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve1. (CC) gcc options: -O2 -funroll-loops -O3 -rdynamic -ldl -laio -lm

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: GPT-2 - Device: CPU - Executor: Standard-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve3K6K9K12K15KSE +/- 10.09, N = 3SE +/- 63.90, N = 3SE +/- 12.91, N = 3124601236412317-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -O3 -ffunction-sections -fdata-sections -march=native -mtune=native -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: bertsquad-12 - Device: CPU - Executor: Standard-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve170340510680850SE +/- 0.44, N = 3SE +/- 0.44, N = 3SE +/- 0.50, N = 3774773772-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -O3 -ffunction-sections -fdata-sections -march=native -mtune=native -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: Standard-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1632486480SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3737373-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve-march=armv8.4-a1. (CXX) g++ options: -O3 -ffunction-sections -fdata-sections -march=native -mtune=native -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve2004006008001000SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 0.17, N = 3938938935-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -O3 -ffunction-sections -fdata-sections -march=native -mtune=native -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: super-resolution-10 - Device: CPU - Executor: Standard-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve12002400360048006000SE +/- 0.67, N = 3SE +/- 0.93, N = 3SE +/- 2.17, N = 3541654135411-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a-march=armv8.4-a+sve1. (CXX) g++ options: -O3 -ffunction-sections -fdata-sections -march=native -mtune=native -flto -fno-fat-lto-objects -ldl -lrt

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v1510152025SE +/- 0.00, N = 5SE +/- 0.03, N = 5SE +/- 0.03, N = 520.4920.5220.76-march=armv8.4-a-march=armv8.4-a+sve-march=armv8.4-a+sve -mcpu=neoverse-v11. (CXX) g++ options: -O3 -rdynamic

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve40M80M120M160M200MSE +/- 226703.11, N = 3SE +/- 200700.56, N = 3SE +/- 298633.53, N = 3204143167194776367192709233-march=armv8.4-a-march=armv8.4-a+sve -mcpu=neoverse-v1-march=armv8.4-a+sve1. (CXX) g++ options: -O3 -fopenmp


Phoronix Test Suite v10.8.5