ARMv8 32 Cores

Ampere eMAG ARMv8 testing with a AmpereComputing OSPREY (4.8.19 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101300-HA-ARMV832CO04&grs&sor.

ARMv8 32 CoresProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen Resolution1234Ampere eMAG ARMv8 @ 3.00GHz (32 Cores)AmpereComputing OSPREY (4.8.19 BIOS)Applied Micro Circuits X-Gene126GB256GB Samsung SSD 860ASPEEDVE228Intel I210Ubuntu 20.045.7.0-050700-generic (aarch64)GNOME Shell 3.36.4X Server 1.20.8aspeedGCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details- Scaling Governor: cppc_cpufreq ondemandPython Details- Python 3.8.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable + tsx_async_abort: Not affected

ARMv8 32 Coresredis: LPOPcloverleaf: Lagrangian-Eulerian Hydrodynamicslzbench: Zstd 8 - Compressionredis: GETlzbench: Crush 0 - Compressioncryptsetup: Serpent-XTS 256b Encryptiononnx: bertsquad-10 - OpenMP CPUtnn: CPU - SqueezeNet v1.1redis: SADDcryptsetup: Twofish-XTS 256b Encryptionlzbench: Zstd 1 - Compressionlzbench: Zstd 8 - Decompressionlzbench: Brotli 0 - Compressionredis: SETcryptsetup: AES-XTS 256b Encryptionlzbench: Zstd 1 - Decompressionlzbench: Libdeflate 1 - Compressionlammps: Rhodopsin Proteinaskap: tConvolve OpenMP - Griddinglzbench: Brotli 2 - Compressionlzbench: Brotli 2 - Decompressiongnupg: 2.7GB Sample File Encryptionaskap: tConvolve OpenMP - Degriddingredis: LPUSHqe: AUSURF112npb: CG.Cfinancebench: Bonds OpenMPnpb: IS.Dlzbench: Brotli 0 - Decompressionbuild-godot: Time To Compilecryptsetup: PBKDF2-sha512dav1d: Chimera 1080pgcrypt: kripke: webp2: Defaultcryptsetup: Twofish-XTS 512b Decryptiononnx: super-resolution-10 - OpenMP CPUqmcpack: simple-H2Odav1d: Summer Nature 1080plulesh: lzbench: Crush 0 - Decompressionlzbench: Libdeflate 1 - Decompressionopenfoam: Motorbike 30Msynthmark: VoiceMark_100webp2: Quality 100, Compression Effort 5financebench: Repo OpenMPonnx: shufflenet-v2-10 - OpenMP CPUcryptsetup: Serpent-XTS 512b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Serpent-XTS 512b Decryptionaskap: tConvolve MT - Degriddingwebp2: Quality 75, Compression Effort 7cryptsetup: PBKDF2-whirlpooldav1d: Chimera 1080p 10-bitcryptsetup: AES-XTS 512b Decryptioncryptsetup: AES-XTS 256b Decryptionnpb: MG.Ccryptsetup: AES-XTS 512b Encryptionrav1e: 10askap: tConvolve MPI - Griddingnpb: LU.Caskap: tConvolve MT - Griddingaskap: tConvolve MPI - Degriddingtnn: CPU - MobileNet v2cryptsetup: Twofish-XTS 256b Decryptiondav1d: Summer Nature 4Kcryptsetup: Twofish-XTS 512b Encryptionnpb: EP.Dnpb: FT.Cwebp2: Quality 95, Compression Effort 7etcpak: ETC1 + Ditheringamg: etcpak: ETC1npb: EP.Cetcpak: ETC2etcpak: DXT1onnx: fcn-resnet101-11 - OpenMP CPUonnx: yolov4 - OpenMP CPUrav1e: 6rav1e: 5rav1e: 1lzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressionaskap: Hogbom Clean OpenMP1234241264.03217.8122240905.582860.446981.299237915.06103.9189537166223333.06443.5596705.7322812.8476311120.1493157.28231808.2542802062.31283851.07292311.98265240.61270627486.86515.421474964179.458104.464491.49894.965679.0974253508127.23180.57118.736175912.828125122361.761.93048.74360.95724408459.10361.3447.15036.96361.80.4291757.113790.582868.111645.97983.782104.642.09105.9511.465556.34649.41133.78156896413333.831511.4322.376299.36111360.2240.1860.0787217165.861224413.21216.0323232094.702861.8960.245233442.45105.9193547169226995.93450.6604695.7052802.90763103169.71229452.3342402043.48282255.79167314.50267242.33370627887.03512.0969.513104.491.35095.195684.9400253506127.39180.96218.788176496.77083361.861.861.93042.93359.81224453759.28362.4448.35028.42362.60.4301753.203790.992862.981642.54981.879104.642.03105.8511.775564.33649.02833.81156939960033.846511.7722.389299.493360.2240.1860.0787217182.679224.422327191547168602705.7427530742402061.27313.2126687.4391.40595.345696.8829252507127.515041.763783.8742.11512.185563.4133.82356958480033.862511.5922.387299.3627217221402.99232.4923232063.532861.845964.458232857.60106.1193547169225671.64448.2605705.7872773.7075308118.6393195.17229879.7942402057.35281522.66667313.47267241.988701566512.438471917409.457105.064191.1085701.7695253507127.70180.33018.723176039.046875122761.962.062.13052.75360.264243780361.9447.15038.22362.70.431756.133792.062869.071643.40983.644104.8105.7511.225554.04650.02633.82956932603333.862511.7522.375299.47611360.2240.1860.0787217175.768OpenBenchmarking.org

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP12450K100K150K200K250KSE +/- 3981.96, N = 3SE +/- 2025.93, N = 3SE +/- 510.50, N = 3241264.03224413.21221402.991. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics213450100150200250SE +/- 1.76, N = 3SE +/- 3.23, N = 12SE +/- 3.88, N = 3SE +/- 2.76, N = 3216.03217.81224.42232.491. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression4321612182430232323221. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET12450K100K150K200K250KSE +/- 1327.90, N = 3SE +/- 576.84, N = 3SE +/- 1044.09, N = 3240905.58232094.70232063.531. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression4213714212835SE +/- 0.33, N = 3282828271. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption4211428425670SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 1.32, N = 361.861.860.4

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU141020304050SE +/- 0.73, N = 3SE +/- 0.73, N = 346451. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.12412004006008001000SE +/- 1.20, N = 3SE +/- 1.69, N = 3SE +/- 2.06, N = 3960.25964.46981.30MIN: 954.37 / MAX: 975.86MIN: 929.88 / MAX: 976.14MIN: 969.28 / MAX: 990.761. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD12450K100K150K200K250KSE +/- 1282.52, N = 3SE +/- 2092.98, N = 3SE +/- 1155.81, N = 3237915.06233442.45232857.601. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption42120406080100SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 2.12, N = 3106.1105.9103.9

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression42314080120160200SE +/- 0.58, N = 31931931911891. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression4321120240360480600SE +/- 0.67, N = 3SE +/- 0.33, N = 35475475475371. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression42314080120160200SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 31691691681661. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET24150K100K150K200K250KSE +/- 1149.17, N = 3SE +/- 2416.80, N = 3SE +/- 373.74, N = 3226995.93225671.64223333.061. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption241100200300400500SE +/- 0.66, N = 3SE +/- 1.52, N = 3SE +/- 3.75, N = 3450.6448.2443.5

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression4231130260390520650SE +/- 0.88, N = 3SE +/- 1.00, N = 3SE +/- 2.33, N = 36056046025961. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression43121632486480SE +/- 0.33, N = 3707070691. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein43121.30212.60423.90635.20846.5105SE +/- 0.024, N = 3SE +/- 0.018, N = 3SE +/- 0.023, N = 3SE +/- 0.028, N = 35.7875.7425.7325.7051. (CXX) g++ options: -O3 -pthread -lm

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding1246001200180024003000SE +/- 19.67, N = 3SE +/- 17.04, N = 3SE +/- 16.68, N = 32812.842802.902773.701. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression214320406080100767675751. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression124370140210280350SE +/- 0.67, N = 33113103083071. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption41306090120150SE +/- 0.17, N = 3SE +/- 0.69, N = 3118.64120.151. (CC) gcc options: -O2

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding4217001400210028003500SE +/- 12.73, N = 3SE +/- 0.00, N = 3SE +/- 12.43, N = 33195.173169.713157.281. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH14250K100K150K200K250KSE +/- 1424.80, N = 3SE +/- 1620.07, N = 3SE +/- 1205.15, N = 3231808.25229879.79229452.331. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF11223419001800270036004500SE +/- 20.00, N = 3SE +/- 20.00, N = 3SE +/- 20.00, N = 3SE +/- 20.00, N = 342404240424042801. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C1342400800120016002000SE +/- 5.44, N = 3SE +/- 6.86, N = 3SE +/- 1.15, N = 3SE +/- 14.91, N = 32062.312061.272057.352043.481. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP42160K120K180K240K300KSE +/- 441.17, N = 3SE +/- 594.11, N = 3SE +/- 1118.39, N = 3281522.67282255.79283851.071. (CXX) g++ options: -O3 -march=native -fopenmp

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D243170140210280350SE +/- 1.13, N = 3SE +/- 1.60, N = 3SE +/- 0.91, N = 3SE +/- 0.27, N = 3314.50313.47313.21311.981. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression423160120180240300SE +/- 0.58, N = 32672672662651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile14250100150200250SE +/- 0.12, N = 3SE +/- 0.37, N = 3SE +/- 0.15, N = 3240.61241.99242.33

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512214150K300K450K600K750KSE +/- 1680.43, N = 3SE +/- 1144.86, N = 3SE +/- 2731.60, N = 3706278706274701566

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p32120406080100SE +/- 0.21, N = 3SE +/- 0.23, N = 3SE +/- 0.23, N = 387.4387.0386.86MIN: 69.27 / MAX: 117.15MIN: 69.11 / MAX: 115.39MIN: 69.37 / MAX: 117.791. (CC) gcc options: -pthread

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9241110220330440550SE +/- 1.49, N = 3SE +/- 0.79, N = 3SE +/- 0.84, N = 3512.10512.44515.421. (CC) gcc options: -O2 -fvisibility=hidden

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.41410M20M30M40M50MSE +/- 139229.10, N = 3SE +/- 224373.99, N = 347496417471917401. (CXX) g++ options: -O3 -fopenmp

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default4123691215SE +/- 0.042, N = 3SE +/- 0.051, N = 3SE +/- 0.022, N = 39.4579.4589.5131. (CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption42120406080100SE +/- 0.20, N = 2SE +/- 0.30, N = 2SE +/- 0.15, N = 2105.0104.4104.4

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU14140280420560700SE +/- 3.48, N = 3SE +/- 1.48, N = 36446411. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O423120406080100SE +/- 0.39, N = 3SE +/- 0.22, N = 3SE +/- 0.25, N = 3SE +/- 0.14, N = 391.1191.3591.4191.501. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -mcpu=native -O3 -fomit-frame-pointer -ffast-math -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p32120406080100SE +/- 0.43, N = 3SE +/- 0.77, N = 3SE +/- 0.90, N = 395.3495.1994.96MIN: 53.76 / MAX: 105.88MIN: 52.27 / MAX: 105.53MIN: 52.23 / MAX: 105.261. (CC) gcc options: -pthread

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3432112002400360048006000SE +/- 8.32, N = 3SE +/- 15.44, N = 3SE +/- 12.68, N = 3SE +/- 34.70, N = 35701.775696.885684.945679.101. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression421360120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.67, N = 32532532532521. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Decompression1432110220330440550SE +/- 0.33, N = 35085075075061. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M1234306090120150SE +/- 0.18, N = 3SE +/- 0.17, N = 3SE +/- 0.26, N = 3SE +/- 0.04, N = 3127.23127.39127.51127.701. (CXX) g++ options: -std=c++11 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_1002144080120160200SE +/- 0.07, N = 3SE +/- 0.21, N = 3SE +/- 0.10, N = 3180.96180.57180.331. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5412510152025SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 318.7218.7418.791. (CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP14240K80K120K160K200KSE +/- 147.83, N = 3SE +/- 81.88, N = 3SE +/- 398.01, N = 3175912.83176039.05176496.771. (CXX) g++ options: -O3 -march=native -fopenmp

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU4130060090012001500SE +/- 0.44, N = 3SE +/- 4.11, N = 3122712231. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption421142842567061.961.861.7

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption4121428425670SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 362.061.961.8

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption421428425670SE +/- 0.10, N = 262.161.9

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding4127001400210028003500SE +/- 3.51, N = 3SE +/- 2.62, N = 3SE +/- 1.09, N = 33052.753048.743042.931. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 724180160240320400SE +/- 0.13, N = 3SE +/- 0.11, N = 3SE +/- 0.20, N = 3359.81360.26360.961. (CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool21450K100K150K200K250KSE +/- 263.56, N = 3SE +/- 524.81, N = 3SE +/- 400.20, N = 3244537244084243780

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit211326395265SE +/- 0.17, N = 3SE +/- 0.02, N = 359.2859.10MIN: 47.29 / MAX: 89.03MIN: 47.26 / MAX: 90.171. (CC) gcc options: -pthread

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption24180160240320400SE +/- 1.15, N = 3SE +/- 1.17, N = 3SE +/- 0.75, N = 3362.4361.9361.3

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption241100200300400500SE +/- 2.04, N = 3SE +/- 1.59, N = 3SE +/- 1.23, N = 3448.3447.1447.1

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C341211002200330044005500SE +/- 21.14, N = 3SE +/- 3.88, N = 3SE +/- 13.57, N = 3SE +/- 8.95, N = 35041.765038.225036.965028.421. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption42180160240320400SE +/- 1.11, N = 3SE +/- 1.08, N = 3SE +/- 0.64, N = 3362.7362.6361.8

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 104210.09680.19360.29040.38720.484SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.4300.4300.429

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding142400800120016002000SE +/- 0.98, N = 3SE +/- 0.98, N = 3SE +/- 0.98, N = 31757.111756.131753.201. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C42138001600240032004000SE +/- 1.17, N = 3SE +/- 2.66, N = 3SE +/- 1.50, N = 3SE +/- 1.99, N = 33792.063790.993790.583783.871. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding4126001200180024003000SE +/- 0.85, N = 3SE +/- 0.64, N = 3SE +/- 3.89, N = 32869.072868.112862.981. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding142400800120016002000SE +/- 0.86, N = 3SE +/- 0.86, N = 3SE +/- 1.48, N = 31645.971643.401642.541. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v22412004006008001000SE +/- 2.18, N = 3SE +/- 2.46, N = 3SE +/- 2.36, N = 3981.88983.64983.78MIN: 893 / MAX: 1070.96MIN: 905.66 / MAX: 1070.74MIN: 885.36 / MAX: 1079.661. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption42120406080100SE +/- 0.13, N = 3SE +/- 0.19, N = 3SE +/- 0.09, N = 3104.8104.6104.6

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K3121020304050SE +/- 0.08, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 342.1142.0942.03MIN: 26.09 / MAX: 45.45MIN: 26.15 / MAX: 45.35MIN: 25.81 / MAX: 45.21. (CC) gcc options: -pthread

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption12420406080100SE +/- 0.20, N = 2SE +/- 0.30, N = 2105.9105.8105.7

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D3214110220330440550SE +/- 0.19, N = 3SE +/- 0.79, N = 3SE +/- 0.60, N = 3SE +/- 0.77, N = 3512.18511.77511.46511.221. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C231412002400360048006000SE +/- 1.95, N = 3SE +/- 0.61, N = 3SE +/- 5.04, N = 3SE +/- 6.36, N = 35564.335563.415556.345554.041. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7214140280420560700SE +/- 0.15, N = 3SE +/- 0.69, N = 3SE +/- 0.37, N = 3649.03649.41650.031. (CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering4321816243240SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 333.8333.8233.8133.781. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.23241120M240M360M480M600MSE +/- 300973.14, N = 3SE +/- 310448.91, N = 3SE +/- 290594.89, N = 3SE +/- 614324.88, N = 35695848005693996005693260335689641331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC14321816243240SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 333.8633.8633.8533.831. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C2431110220330440550SE +/- 0.72, N = 3SE +/- 0.71, N = 3SE +/- 0.53, N = 3SE +/- 0.55, N = 3511.77511.75511.59511.431. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC22314510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 322.3922.3922.3822.381. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1243170140210280350SE +/- 0.14, N = 3SE +/- 0.06, N = 3SE +/- 0.16, N = 3SE +/- 0.22, N = 3299.49299.48299.36299.361. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU413691215SE +/- 0.00, N = 3SE +/- 0.00, N = 311111. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU421816243240SE +/- 0.17, N = 3SE +/- 0.17, N = 3SE +/- 0.17, N = 33636361. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 64210.05040.10080.15120.20160.252SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.2240.2240.224

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 54210.04190.08380.12570.16760.2095SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1860.1860.186

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 14210.01760.03520.05280.07040.088SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0780.0780.078

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression43211632486480727272721. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression432148121620171717171. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP2414080120160200SE +/- 3.33, N = 15SE +/- 6.40, N = 12SE +/- 5.27, N = 15182.68175.77165.861. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp


Phoronix Test Suite v10.8.4