ARMv8 32 Cores

Ampere eMAG ARMv8 testing with an AmpereComputing OSPREY (4.8.19 BIOS) motherboard and ASPEED graphics on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101300-HA-ARMV832CO04&grs&rdt.

ARMv8 32 Cores (system configuration for runs 1, 2, 3 and 4)

Processor: Ampere eMAG ARMv8 @ 3.00GHz (32 Cores)
Motherboard: AmpereComputing OSPREY (4.8.19 BIOS)
Chipset: Applied Micro Circuits X-Gene
Memory: 126GB
Disk: 256GB Samsung SSD 860
Graphics: ASPEED
Monitor: VE228
Network: Intel I210
OS: Ubuntu 20.04
Kernel: 5.7.0-050700-generic (aarch64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: aspeed
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Processor Details: Scaling Governor: cppc_cpufreq ondemand
Python Details: Python 3.8.5
Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable + tsx_async_abort: Not affected

ARMv8 32 Cores: result overview table for runs 1, 2, 3 and 4. The per-test values are listed in the individual result sections that follow.

Redis

Test: LPOP

Requests Per Second, More Is Better (Redis 6.0.9)
Run 1: 241264.03 (SE +/- 3981.96, N = 3)
Run 2: 224413.21 (SE +/- 2025.93, N = 3)
Run 4: 221402.99 (SE +/- 510.50, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
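As a rough aid for reading these figures, the following Python sketch takes the Redis LPOP means and standard errors reported above and computes the percentage gap between the fastest and slowest run. The function names `spread_percent` and `exceeds_noise` are hypothetical, and comparing the gap against the sum of the two standard errors is an assumed rule of thumb, not something the Phoronix Test Suite reports.

```python
# Values copied from the Redis LPOP result above:
# run identifier -> (mean requests per second, standard error)
lpop = {
    "1": (241264.03, 3981.96),
    "2": (224413.21, 2025.93),
    "4": (221402.99, 510.50),
}


def spread_percent(results):
    """Gap between the best and worst mean, as a percentage of the worst."""
    means = [mean for mean, _ in results.values()]
    return (max(means) - min(means)) / min(means) * 100.0


def exceeds_noise(results):
    """Assumed rule of thumb: the gap is larger than the two extreme runs' SEs combined."""
    best = max(results.values(), key=lambda r: r[0])
    worst = min(results.values(), key=lambda r: r[0])
    return (best[0] - worst[0]) > (best[1] + worst[1])


if __name__ == "__main__":
    print(f"spread between runs: {spread_percent(lpop):.2f}%")
    print("gap exceeds combined standard errors:", exceeds_noise(lpop))
```

The same two helpers can be applied to any other result in this export by swapping in that test's run values; for "Fewer Is Better" results the best run is the minimum, so only the spread figure carries over unchanged.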

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

Seconds, Fewer Is Better (CloverLeaf)
Run 1: 217.81 (SE +/- 3.23, N = 12)
Run 2: 216.03 (SE +/- 1.76, N = 3)
Run 3: 224.42 (SE +/- 3.88, N = 3)
Run 4: 232.49 (SE +/- 2.76, N = 3)
(F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

lzbench

Test: Zstd 8 - Process: Compression

MB/s, More Is Better (lzbench 1.8)
Run 1: 22
Run 2: 23
Run 3: 23
Run 4: 23
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Test: GET

Requests Per Second, More Is Better (Redis 6.0.9)
Run 1: 240905.58 (SE +/- 1327.90, N = 3)
Run 2: 232094.70 (SE +/- 576.84, N = 3)
Run 4: 232063.53 (SE +/- 1044.09, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

lzbench

Test: Crush 0 - Process: Compression

MB/s, More Is Better (lzbench 1.8)
Run 1: 28
Run 2: 28
Run 3: 27
Run 4: 28
Standard errors listed: SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cryptsetup

Serpent-XTS 256b Encryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 60.4 (SE +/- 1.32, N = 3)
Run 2: 61.8 (SE +/- 0.09, N = 3)
Run 4: 61.8 (SE +/- 0.06, N = 3)

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.6)
Run 1: 46 (SE +/- 0.73, N = 3)
Run 4: 45 (SE +/- 0.73, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

TNN

Target: CPU - Model: SqueezeNet v1.1

ms, Fewer Is Better (TNN 0.2.3)
Run 1: 981.30 (SE +/- 2.06, N = 3; MIN: 969.28 / MAX: 990.76)
Run 2: 960.25 (SE +/- 1.20, N = 3; MIN: 954.37 / MAX: 975.86)
Run 4: 964.46 (SE +/- 1.69, N = 3; MIN: 929.88 / MAX: 976.14)
(CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Redis

Test: SADD

Requests Per Second, More Is Better (Redis 6.0.9)
Run 1: 237915.06 (SE +/- 1282.52, N = 3)
Run 2: 233442.45 (SE +/- 2092.98, N = 3)
Run 4: 232857.60 (SE +/- 1155.81, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Cryptsetup

Twofish-XTS 256b Encryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 103.9 (SE +/- 2.12, N = 3)
Run 2: 105.9 (SE +/- 0.18, N = 3)
Run 4: 106.1 (SE +/- 0.17, N = 3)

lzbench

Test: Zstd 1 - Process: Compression

MB/s, More Is Better (lzbench 1.8)
Run 1: 189
Run 2: 193
Run 3: 191
Run 4: 193
Standard errors listed: SE +/- 0.58, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Decompression

MB/s, More Is Better (lzbench 1.8)
Run 1: 537
Run 2: 547
Run 3: 547
Run 4: 547
Standard errors listed: SE +/- 0.33, N = 3; SE +/- 0.67, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

MB/s, More Is Better (lzbench 1.8)
Run 1: 166
Run 2: 169
Run 3: 168
Run 4: 169
Standard errors listed: SE +/- 0.33, N = 3; SE +/- 0.58, N = 3; SE +/- 0.67, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Test: SET

Requests Per Second, More Is Better (Redis 6.0.9)
Run 1: 223333.06 (SE +/- 373.74, N = 3)
Run 2: 226995.93 (SE +/- 1149.17, N = 3)
Run 4: 225671.64 (SE +/- 2416.80, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Cryptsetup

AES-XTS 256b Encryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 443.5 (SE +/- 3.75, N = 3)
Run 2: 450.6 (SE +/- 0.66, N = 3)
Run 4: 448.2 (SE +/- 1.52, N = 3)

lzbench

Test: Zstd 1 - Process: Decompression

MB/s, More Is Better (lzbench 1.8)
Run 1: 596
Run 2: 604
Run 3: 602
Run 4: 605
Standard errors listed: SE +/- 1.00, N = 3; SE +/- 2.33, N = 3; SE +/- 0.88, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

MB/s, More Is Better (lzbench 1.8)
Run 1: 70
Run 2: 69
Run 3: 70
Run 4: 70
Standard errors listed: SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

ns/day, More Is Better (LAMMPS Molecular Dynamics Simulator 29Oct2020)
Run 1: 5.732 (SE +/- 0.023, N = 3)
Run 2: 5.705 (SE +/- 0.028, N = 3)
Run 3: 5.742 (SE +/- 0.018, N = 3)
Run 4: 5.787 (SE +/- 0.024, N = 3)
(CXX) g++ options: -O3 -pthread -lm

ASKAP

Test: tConvolve OpenMP - Gridding

Million Grid Points Per Second, More Is Better (ASKAP 1.0)
Run 1: 2812.84 (SE +/- 19.67, N = 3)
Run 2: 2802.90 (SE +/- 17.04, N = 3)
Run 4: 2773.70 (SE +/- 16.68, N = 3)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

lzbench

Test: Brotli 2 - Process: Compression

MB/s, More Is Better (lzbench 1.8)
Run 1: 76
Run 2: 76
Run 3: 75
Run 4: 75
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Decompression

MB/s, More Is Better (lzbench 1.8)
Run 1: 311
Run 2: 310
Run 3: 307
Run 4: 308
Standard errors listed: SE +/- 0.67, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

GnuPG

2.7GB Sample File Encryption

Seconds, Fewer Is Better (GnuPG 2.2.27)
Run 1: 120.15 (SE +/- 0.69, N = 3)
Run 4: 118.64 (SE +/- 0.17, N = 3)
(CC) gcc options: -O2

ASKAP

Test: tConvolve OpenMP - Degridding

Million Grid Points Per Second, More Is Better (ASKAP 1.0)
Run 1: 3157.28 (SE +/- 12.43, N = 3)
Run 2: 3169.71 (SE +/- 0.00, N = 3)
Run 4: 3195.17 (SE +/- 12.73, N = 3)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Redis

Test: LPUSH

Requests Per Second, More Is Better (Redis 6.0.9)
Run 1: 231808.25 (SE +/- 1424.80, N = 3)
Run 2: 229452.33 (SE +/- 1205.15, N = 3)
Run 4: 229879.79 (SE +/- 1620.07, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Quantum ESPRESSO

Input: AUSURF112

Seconds, Fewer Is Better (Quantum ESPRESSO 6.7)
Run 1: 4280 (SE +/- 20.00, N = 3)
Run 2: 4240 (SE +/- 20.00, N = 3)
Run 3: 4240 (SE +/- 20.00, N = 3)
Run 4: 4240 (SE +/- 20.00, N = 3)
(F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: CG.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
Run 1: 2062.31 (SE +/- 5.44, N = 3)
Run 2: 2043.48 (SE +/- 14.91, N = 3)
Run 3: 2061.27 (SE +/- 6.86, N = 3)
Run 4: 2057.35 (SE +/- 1.15, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

FinanceBench

Benchmark: Bonds OpenMP

ms, Fewer Is Better (FinanceBench 2016-07-25)
Run 1: 283851.07 (SE +/- 1118.39, N = 3)
Run 2: 282255.79 (SE +/- 594.11, N = 3)
Run 4: 281522.67 (SE +/- 441.17, N = 3)
(CXX) g++ options: -O3 -march=native -fopenmp

NAS Parallel Benchmarks

Test / Class: IS.D

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
Run 1: 311.98 (SE +/- 0.27, N = 3)
Run 2: 314.50 (SE +/- 1.13, N = 3)
Run 3: 313.21 (SE +/- 0.91, N = 3)
Run 4: 313.47 (SE +/- 1.60, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

lzbench

Test: Brotli 0 - Process: Decompression

MB/s, More Is Better (lzbench 1.8)
Run 1: 265
Run 2: 267
Run 3: 266
Run 4: 267
Standard errors listed: SE +/- 0.58, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Timed Godot Game Engine Compilation

Time To Compile

Seconds, Fewer Is Better (Timed Godot Game Engine Compilation 3.2.3)
Run 1: 240.61 (SE +/- 0.12, N = 3)
Run 2: 242.33 (SE +/- 0.15, N = 3)
Run 4: 241.99 (SE +/- 0.37, N = 3)

Cryptsetup

PBKDF2-sha512

Iterations Per Second, More Is Better (Cryptsetup)
Run 1: 706274 (SE +/- 1144.86, N = 3)
Run 2: 706278 (SE +/- 1680.43, N = 3)
Run 4: 701566 (SE +/- 2731.60, N = 3)

dav1d

Video Input: Chimera 1080p

FPS, More Is Better (dav1d 0.8.1)
Run 1: 86.86 (SE +/- 0.23, N = 3; MIN: 69.37 / MAX: 117.79)
Run 2: 87.03 (SE +/- 0.23, N = 3; MIN: 69.11 / MAX: 115.39)
Run 3: 87.43 (SE +/- 0.21, N = 3; MIN: 69.27 / MAX: 117.15)
(CC) gcc options: -pthread

Gcrypt Library

Seconds, Fewer Is Better (Gcrypt Library 1.9)
Run 1: 515.42 (SE +/- 0.84, N = 3)
Run 2: 512.10 (SE +/- 1.49, N = 3)
Run 4: 512.44 (SE +/- 0.79, N = 3)
(CC) gcc options: -O2 -fvisibility=hidden

Kripke

Throughput FoM, More Is Better (Kripke 1.2.4)
Run 1: 47496417 (SE +/- 139229.10, N = 3)
Run 4: 47191740 (SE +/- 224373.99, N = 3)
(CXX) g++ options: -O3 -fopenmp

WebP2 Image Encode

Encode Settings: Default

Seconds, Fewer Is Better (WebP2 Image Encode 20210126)
Run 1: 9.458 (SE +/- 0.051, N = 3)
Run 2: 9.513 (SE +/- 0.022, N = 3)
Run 4: 9.457 (SE +/- 0.042, N = 3)
(CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux

Cryptsetup

Twofish-XTS 512b Decryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 104.4 (SE +/- 0.15, N = 2)
Run 2: 104.4 (SE +/- 0.30, N = 2)
Run 4: 105.0 (SE +/- 0.20, N = 2)

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.6)
Run 1: 644 (SE +/- 3.48, N = 3)
Run 4: 641 (SE +/- 1.48, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

QMCPACK

Input: simple-H2O

Total Execution Time - Seconds, Fewer Is Better (QMCPACK 3.10)
Run 1: 91.50 (SE +/- 0.14, N = 3)
Run 2: 91.35 (SE +/- 0.22, N = 3)
Run 3: 91.41 (SE +/- 0.25, N = 3)
Run 4: 91.11 (SE +/- 0.39, N = 3)
(CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -mcpu=native -O3 -fomit-frame-pointer -ffast-math -pthread

dav1d

Video Input: Summer Nature 1080p

FPS, More Is Better (dav1d 0.8.1)
Run 1: 94.96 (SE +/- 0.90, N = 3; MIN: 52.23 / MAX: 105.26)
Run 2: 95.19 (SE +/- 0.77, N = 3; MIN: 52.27 / MAX: 105.53)
Run 3: 95.34 (SE +/- 0.43, N = 3; MIN: 53.76 / MAX: 105.88)
(CC) gcc options: -pthread

LULESH

z/s, More Is Better (LULESH 2.0.3)
Run 1: 5679.10 (SE +/- 34.70, N = 3)
Run 2: 5684.94 (SE +/- 12.68, N = 3)
Run 3: 5696.88 (SE +/- 15.44, N = 3)
Run 4: 5701.77 (SE +/- 8.32, N = 3)
(CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

lzbench

Test: Crush 0 - Process: Decompression

MB/s, More Is Better (lzbench 1.8)
Run 1: 253
Run 2: 253
Run 3: 252
Run 4: 253
Standard errors listed: SE +/- 0.33, N = 3; SE +/- 0.67, N = 3; SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Decompression

MB/s, More Is Better (lzbench 1.8)
Run 1: 508
Run 2: 506
Run 3: 507
Run 4: 507
Standard errors listed: SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenFOAM

Input: Motorbike 30M

Seconds, Fewer Is Better (OpenFOAM 8)
Run 1: 127.23 (SE +/- 0.18, N = 3)
Run 2: 127.39 (SE +/- 0.17, N = 3)
Run 3: 127.51 (SE +/- 0.26, N = 3)
Run 4: 127.70 (SE +/- 0.04, N = 3)
(CXX) g++ options: -std=c++11 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Google SynthMark

Test: VoiceMark_100

Voices, More Is Better (Google SynthMark 20201109)
Run 1: 180.57 (SE +/- 0.21, N = 3)
Run 2: 180.96 (SE +/- 0.07, N = 3)
Run 4: 180.33 (SE +/- 0.10, N = 3)
(CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

Seconds, Fewer Is Better (WebP2 Image Encode 20210126)
Run 1: 18.74 (SE +/- 0.06, N = 3)
Run 2: 18.79 (SE +/- 0.10, N = 3)
Run 4: 18.72 (SE +/- 0.06, N = 3)
(CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux

FinanceBench

Benchmark: Repo OpenMP

ms, Fewer Is Better (FinanceBench 2016-07-25)
Run 1: 175912.83 (SE +/- 147.83, N = 3)
Run 2: 176496.77 (SE +/- 398.01, N = 3)
Run 4: 176039.05 (SE +/- 81.88, N = 3)
(CXX) g++ options: -O3 -march=native -fopenmp

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.6)
Run 1: 1223 (SE +/- 4.11, N = 3)
Run 4: 1227 (SE +/- 0.44, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Cryptsetup

Serpent-XTS 512b Encryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 61.7
Run 2: 61.8
Run 4: 61.9

Cryptsetup

Serpent-XTS 256b Decryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 61.9 (SE +/- 0.06, N = 3)
Run 2: 61.8 (SE +/- 0.12, N = 3)
Run 4: 62.0 (SE +/- 0.03, N = 3)

Cryptsetup

Serpent-XTS 512b Decryption

MiB/s, More Is Better (Cryptsetup)
Run 2: 61.9
Run 4: 62.1
Standard errors listed: SE +/- 0.10, N = 2

ASKAP

Test: tConvolve MT - Degridding

Million Grid Points Per Second, More Is Better (ASKAP 1.0)
Run 1: 3048.74 (SE +/- 2.62, N = 3)
Run 2: 3042.93 (SE +/- 1.09, N = 3)
Run 4: 3052.75 (SE +/- 3.51, N = 3)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

Seconds, Fewer Is Better (WebP2 Image Encode 20210126)
Run 1: 360.96 (SE +/- 0.20, N = 3)
Run 2: 359.81 (SE +/- 0.13, N = 3)
Run 4: 360.26 (SE +/- 0.11, N = 3)
(CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux

Cryptsetup

PBKDF2-whirlpool

Iterations Per Second, More Is Better (Cryptsetup)
Run 1: 244084 (SE +/- 524.81, N = 3)
Run 2: 244537 (SE +/- 263.56, N = 3)
Run 4: 243780 (SE +/- 400.20, N = 3)

dav1d

Video Input: Chimera 1080p 10-bit

FPS, More Is Better (dav1d 0.8.1)
Run 1: 59.10 (SE +/- 0.02, N = 3; MIN: 47.26 / MAX: 90.17)
Run 2: 59.28 (SE +/- 0.17, N = 3; MIN: 47.29 / MAX: 89.03)
(CC) gcc options: -pthread

Cryptsetup

AES-XTS 512b Decryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 361.3 (SE +/- 0.75, N = 3)
Run 2: 362.4 (SE +/- 1.15, N = 3)
Run 4: 361.9 (SE +/- 1.17, N = 3)

Cryptsetup

AES-XTS 256b Decryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 447.1 (SE +/- 1.23, N = 3)
Run 2: 448.3 (SE +/- 2.04, N = 3)
Run 4: 447.1 (SE +/- 1.59, N = 3)

NAS Parallel Benchmarks

Test / Class: MG.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
Run 1: 5036.96 (SE +/- 13.57, N = 3)
Run 2: 5028.42 (SE +/- 8.95, N = 3)
Run 3: 5041.76 (SE +/- 21.14, N = 3)
Run 4: 5038.22 (SE +/- 3.88, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Cryptsetup

AES-XTS 512b Encryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 361.8 (SE +/- 0.64, N = 3)
Run 2: 362.6 (SE +/- 1.08, N = 3)
Run 4: 362.7 (SE +/- 1.11, N = 3)

rav1e

Speed: 10

Frames Per Second, More Is Better (rav1e 0.4)
Run 1: 0.429 (SE +/- 0.000, N = 3)
Run 2: 0.430 (SE +/- 0.000, N = 3)
Run 4: 0.430 (SE +/- 0.000, N = 3)

ASKAP

Test: tConvolve MPI - Gridding

Mpix/sec, More Is Better (ASKAP 1.0)
Run 1: 1757.11 (SE +/- 0.98, N = 3)
Run 2: 1753.20 (SE +/- 0.98, N = 3)
Run 4: 1756.13 (SE +/- 0.98, N = 3)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
Run 1: 3790.58 (SE +/- 1.50, N = 3)
Run 2: 3790.99 (SE +/- 2.66, N = 3)
Run 3: 3783.87 (SE +/- 1.99, N = 3)
Run 4: 3792.06 (SE +/- 1.17, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ASKAP

Test: tConvolve MT - Gridding

Million Grid Points Per Second, More Is Better (ASKAP 1.0)
Run 1: 2868.11 (SE +/- 0.64, N = 3)
Run 2: 2862.98 (SE +/- 3.89, N = 3)
Run 4: 2869.07 (SE +/- 0.85, N = 3)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

Mpix/sec, More Is Better (ASKAP 1.0)
Run 1: 1645.97 (SE +/- 0.86, N = 3)
Run 2: 1642.54 (SE +/- 1.48, N = 3)
Run 4: 1643.40 (SE +/- 0.86, N = 3)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

TNN

Target: CPU - Model: MobileNet v2

ms, Fewer Is Better (TNN 0.2.3)
Run 1: 983.78 (SE +/- 2.36, N = 3; MIN: 885.36 / MAX: 1079.66)
Run 2: 981.88 (SE +/- 2.18, N = 3; MIN: 893 / MAX: 1070.96)
Run 4: 983.64 (SE +/- 2.46, N = 3; MIN: 905.66 / MAX: 1070.74)
(CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Cryptsetup

Twofish-XTS 256b Decryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 104.6 (SE +/- 0.09, N = 3)
Run 2: 104.6 (SE +/- 0.19, N = 3)
Run 4: 104.8 (SE +/- 0.13, N = 3)

dav1d

Video Input: Summer Nature 4K

FPS, More Is Better (dav1d 0.8.1)
Run 1: 42.09 (SE +/- 0.10, N = 3; MIN: 26.15 / MAX: 45.35)
Run 2: 42.03 (SE +/- 0.03, N = 3; MIN: 25.81 / MAX: 45.2)
Run 3: 42.11 (SE +/- 0.08, N = 3; MIN: 26.09 / MAX: 45.45)
(CC) gcc options: -pthread

Cryptsetup

Twofish-XTS 512b Encryption

MiB/s, More Is Better (Cryptsetup)
Run 1: 105.9
Run 2: 105.8
Run 4: 105.7
Standard errors listed: SE +/- 0.20, N = 2; SE +/- 0.30, N = 2

NAS Parallel Benchmarks

Test / Class: EP.D

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
Run 1: 511.46 (SE +/- 0.60, N = 3)
Run 2: 511.77 (SE +/- 0.79, N = 3)
Run 3: 512.18 (SE +/- 0.19, N = 3)
Run 4: 511.22 (SE +/- 0.77, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: FT.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
Run 1: 5556.34 (SE +/- 5.04, N = 3)
Run 2: 5564.33 (SE +/- 1.95, N = 3)
Run 3: 5563.41 (SE +/- 0.61, N = 3)
Run 4: 5554.04 (SE +/- 6.36, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

Seconds, Fewer Is Better (WebP2 Image Encode 20210126)
Run 1: 649.41 (SE +/- 0.69, N = 3)
Run 2: 649.03 (SE +/- 0.15, N = 3)
Run 4: 650.03 (SE +/- 0.37, N = 3)
(CXX) g++ options: -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux

Etcpak

Configuration: ETC1 + Dithering

Mpx/s, More Is Better (Etcpak 0.7)
Run 1: 33.78 (SE +/- 0.00, N = 3)
Run 2: 33.81 (SE +/- 0.01, N = 3)
Run 3: 33.82 (SE +/- 0.01, N = 3)
Run 4: 33.83 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

Algebraic Multi-Grid Benchmark

Figure Of Merit, More Is Better (Algebraic Multi-Grid Benchmark 1.2)
Run 1: 568964133 (SE +/- 614324.88, N = 3)
Run 2: 569399600 (SE +/- 310448.91, N = 3)
Run 3: 569584800 (SE +/- 300973.14, N = 3)
Run 4: 569326033 (SE +/- 290594.89, N = 3)
(CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Etcpak

Configuration: ETC1

Mpx/s, More Is Better (Etcpak 0.7)
Run 1: 33.83 (SE +/- 0.01, N = 3)
Run 2: 33.85 (SE +/- 0.01, N = 3)
Run 3: 33.86 (SE +/- 0.01, N = 3)
Run 4: 33.86 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

NAS Parallel Benchmarks

Test / Class: EP.C

Total Mop/s, More Is Better (NAS Parallel Benchmarks 3.4)
Run 1: 511.43 (SE +/- 0.55, N = 3)
Run 2: 511.77 (SE +/- 0.72, N = 3)
Run 3: 511.59 (SE +/- 0.53, N = 3)
Run 4: 511.75 (SE +/- 0.71, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Etcpak

Configuration: ETC2

Mpx/s, More Is Better (Etcpak 0.7)
Run 1: 22.38 (SE +/- 0.00, N = 3)
Run 2: 22.39 (SE +/- 0.00, N = 3)
Run 3: 22.39 (SE +/- 0.00, N = 3)
Run 4: 22.38 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

Etcpak

Configuration: DXT1

Mpx/s, More Is Better (Etcpak 0.7)
Run 1: 299.36 (SE +/- 0.22, N = 3)
Run 2: 299.49 (SE +/- 0.14, N = 3)
Run 3: 299.36 (SE +/- 0.16, N = 3)
Run 4: 299.48 (SE +/- 0.06, N = 3)
(CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.6)
Run 1: 11 (SE +/- 0.00, N = 3)
Run 4: 11 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

Inferences Per Minute, More Is Better (ONNX Runtime 1.6)
Run 1: 36 (SE +/- 0.17, N = 3)
Run 2: 36 (SE +/- 0.17, N = 3)
Run 4: 36 (SE +/- 0.17, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e

Speed: 6

Frames Per Second, More Is Better (rav1e 0.4)
Run 1: 0.224 (SE +/- 0.000, N = 3)
Run 2: 0.224 (SE +/- 0.000, N = 3)
Run 4: 0.224 (SE +/- 0.000, N = 3)

rav1e

Speed: 5

Frames Per Second, More Is Better (rav1e 0.4)
Run 1: 0.186 (SE +/- 0.000, N = 3)
Run 2: 0.186 (SE +/- 0.000, N = 3)
Run 4: 0.186 (SE +/- 0.000, N = 3)

rav1e

Speed: 1

Frames Per Second, More Is Better (rav1e 0.4)
Run 1: 0.078 (SE +/- 0.000, N = 3)
Run 2: 0.078 (SE +/- 0.000, N = 3)
Run 4: 0.078 (SE +/- 0.000, N = 3)

lzbench

Test: XZ 0 - Process: Decompression

MB/s, More Is Better (lzbench 1.8)
Run 1: 72
Run 2: 72
Run 3: 72
Run 4: 72
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Compression

MB/s, More Is Better (lzbench 1.8)
Run 1: 17
Run 2: 17
Run 3: 17
Run 4: 17
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

ASKAP

Test: Hogbom Clean OpenMP

Iterations Per Second, More Is Better (ASKAP 1.0)
Run 1: 165.86 (SE +/- 5.27, N = 15)
Run 2: 182.68 (SE +/- 3.33, N = 15)
Run 4: 175.77 (SE +/- 6.40, N = 12)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp


Phoronix Test Suite v10.8.4