Haswell 2021

Intel Xeon E5-2687W v3 testing with a MSI X99S SLI PLUS (MS-7885) v1.0 (1.E0 BIOS) and NVIDIA GeForce GTX 770 on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101287-HA-HASWELL2015&grs&sor.

Haswell 2021ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen Resolution123Intel Xeon E5-2687W v3 @ 3.50GHz (10 Cores / 20 Threads)MSI X99S SLI PLUS (MS-7885) v1.0 (1.E0 BIOS)Intel Xeon E7 v3/Xeon32GB80GB INTEL SSDSCKGW08NVIDIA GeForce GTX 770Realtek ALC892LG Ultra HDIntel I218-VUbuntu 20.045.9.0-050900rc7daily20200928-generic (x86_64) 20200927GNOME Shell 3.36.4X Server 1.20.8modesetting 1.20.8GCC 9.3.0ext43840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x44Python Details- Python 3.8.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Haswell 2021redis: GETnpb: EP.Cncnn: CPU - yolov4-tinycpuminer-opt: Skeincoincloverleaf: Lagrangian-Eulerian Hydrodynamicsfinancebench: Repo OpenMPonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUcpuminer-opt: Blake-2 Sncnn: CPU - resnet18financebench: Bonds OpenMPcp2k: Fayalite-FIST Datancnn: CPU - googlenetredis: SETaskap: tConvolve OpenMP - Degriddingcpuminer-opt: Myriad-Groestllzbench: Zstd 8 - Decompressionnpb: EP.Dencode-opus: WAV To Opus Encodemnn: MobileNetV2_224webp2: Defaultkripke: qmcpack: simple-H2Oncnn: CPU - blazefacencnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - regnety_400mmnn: mobilenet-v1-1.0dav1d: Summer Nature 1080pgnupg: 2.7GB Sample File Encryptionbuild-eigen: Time To Compiledav1d: Summer Nature 4Kcpuminer-opt: Magiredis: SADDmnn: resnet-v2-50perf-bench: Sched Pipeunpack-firefox: firefox-84.0.source.tar.xzncnn: CPU - mobilenetncnn: CPU - resnet50cpuminer-opt: Garlicoincryptsetup: AES-XTS 256b Decryptionamg: onnx: bertsquad-10 - OpenMP CPUncnn: CPU - mnasnetcryptsetup: AES-XTS 256b Encryptionlulesh: cpuminer-opt: Ringcoinlzbench: Brotli 2 - Decompressionbuild2: Time To Compileaskap: tConvolve MT - Degriddingredis: LPUSHncnn: CPU - efficientnet-b0lzbench: Brotli 0 - Decompressionrav1e: 10onednn: Recurrent Neural Network Inference - u8s8f32 - CPUtnn: CPU - SqueezeNet v1.1cpuminer-opt: LBC, LBRY Creditsncnn: CPU - shufflenet-v2onednn: Recurrent Neural Network Inference - f32 - CPUcpuminer-opt: x25xonednn: IP Shapes 3D - f32 - CPUcryptsetup: Twofish-XTS 256b Encryptioncryptsetup: PBKDF2-whirlpooltnn: CPU - MobileNet v2onnx: shufflenet-v2-10 - OpenMP CPUrav1e: 6onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUcryptsetup: Serpent-XTS 256b Encryptiononednn: IP Shapes 3D - u8s8f32 - CPUencode-wavpack: WAV To WavPackgcrypt: dav1d: Chimera 1080p 10-bitetcpak: ETC1cryptsetup: Serpent-XTS 256b Decryptionetcpak: ETC2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - vgg16rav1e: 1etcpak: DXT1cryptsetup: AES-XTS 512b Decryptionncnn: CPU - alexnetaskap: tConvolve MT - Griddingcryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 256b Decryptiononnx: yolov4 - OpenMP CPUlzbench: Zstd 1 - Decompressioncryptsetup: AES-XTS 512b Encryptiononednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUlzbench: Brotli 0 - Compressiononednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUdav1d: Chimera 1080pmnn: SqueezeNetV1.0lzbench: Zstd 1 - Compressionrav1e: 5webp2: Quality 95, Compression Effort 7openfoam: Motorbike 30Mlzbench: Crush 0 - Decompressionsynthmark: VoiceMark_100onednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUwebp2: Quality 75, Compression Effort 7cryptsetup: Serpent-XTS 512b Decryptiononednn: Convolution Batch Shapes Auto - u8s8f32 - CPUcryptsetup: Serpent-XTS 512b Encryptionmnn: inception-v3etcpak: ETC1 + Ditheringaskap: Hogbom Clean OpenMPbuild-godot: Time To Compilencnn: CPU - squeezenet_ssdwebp2: Quality 100, Lossless Compressiononednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUquantlib: lzbench: Libdeflate 1 - Decompressioncryptsetup: PBKDF2-sha512onnx: super-resolution-10 - OpenMP CPUonednn: Recurrent Neural Network Training - f32 - CPUwebp2: Quality 100, Compression Effort 5onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUcryptsetup: Twofish-XTS 512b Encryptiononnx: fcn-resnet101-11 - OpenMP CPUlzbench: Libdeflate 1 - Compressionlzbench: Brotli 2 - Compressionlzbench: Crush 0 - Compressionlzbench: Zstd 8 - Compressionlzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressionredis: LPOPaskap: tConvolve OpenMP - Griddingaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingcpuminer-opt: Triple SHA-256, Onecoincpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: Deepcoinlammps: Rhodopsin Proteinclomp: Static OMP Speedup1231887226.871050.1731.4334847136.8968017.6536466.6601122050815.20115661.1296871543.68316.601415197.082664.351036314231027.8810.6865.0615.6774610442051.1532.766.2224.086.196382.2181.081109.859142.04203.981602736.8854.5117375524.03820.0329.061736.891697.23028911335455.521702.34654.46641579.48581159.6871756.701247740.008.285032.3752208.25329.721233307.052209.39219.587.41691344.4530142339.21297361.0653.00389549.02.4899417.395273.55168.44235.540532.9140.7315.4151.780.2771083.8611389.612.081285.02346.4346.832813901400.24051.993563.368502.83074459.118.4474010.810545.956219.05431551.1818.393255.76536298.918533.712.7075550.755.865226.752239.235186.69726.55939.7605.0756413.78544.074764045.791689.6995132899340074048.9514.4822211.56345.659182148766596342010121.551778.102071.461699.8561247476537487.984.41513.21775802.501002.0131.9634338140.6469828.3619796.5193722193815.54114077.5651041576.95216.841408307.622716.91056714131008.9110.6175.1465.6644652985351.9032.796.2223.936.112380.8680.221111.128140.67204.181588662.7553.9667369523.89120.2128.861750.881702.63036123675415.551708.54644.46011582.49585160.6741753.611242294.798.305062.3612211.70328.625234077.072214.44220.737.39027345.4532814337.51197751.0702.99080550.32.4793017.322272.41168.62235.968531.5140.3155.4151.970.2771080.5461393.012.121284.71346.8345.832813891401.74040.593563.359472.83861459.328.4394010.810545.663219.10432550.1428.401245.77823298.253533.312.6938551.255.917227.136239.425186.84426.57938.9165.0771413.79854.074514044.151691.5994133011840084050.5414.4782211.32345.759182148766596341271416.381881.941984.731589.7560350486227013.874.26111.01765664.371047.6732.7335707140.5769374.8691416.6841122607315.19116695.3782551546.67816.951436898.962716.91051013961019.3710.7965.1005.7564684042751.9132.756.3123.756.151385.9980.127111.065140.47206.241605411.5854.1247443024.12720.0629.111738.451710.63051336335425.561714.44621.63051590.60582160.0931745.941249824.008.335052.3752221.13327.826234637.032221.93220.827.43079346.2532634338.74197291.0682.99407551.42.4812517.326273.01368.34236.454533.5140.2105.4351.850.2761084.3291394.312.111280.90345.7346.932713861404.24048.113573.368832.83373458.068.4244000.808544.662219.57431549.9358.382265.76890298.540532.612.7193551.755.960227.056239.617186.55726.59940.2315.0700513.78044.079804040.611691.6995132899340054051.4214.4752210.61345.659182148766596341758555.231962.741979.001597.8462923483417312.244.46313.0OpenBenchmarking.org

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123400K800K1200K1600K2000KSE +/- 6745.04, N = 3SE +/- 25400.58, N = 3SE +/- 20496.02, N = 31887226.871775802.501765664.371. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C1322004006008001000SE +/- 16.48, N = 3SE +/- 4.68, N = 3SE +/- 11.00, N = 151050.171047.671002.011. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny123816243240SE +/- 0.35, N = 3SE +/- 0.60, N = 3SE +/- 0.34, N = 331.4331.9632.73MIN: 30.14 / MAX: 34.58MIN: 30.06 / MAX: 35.42MIN: 31.73 / MAX: 35.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cpuminer-Opt

Algorithm: Skeincoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Skeincoin3128K16K24K32K40KSE +/- 140.75, N = 3SE +/- 291.68, N = 15SE +/- 325.79, N = 153570734847343381. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics132306090120150SE +/- 0.22, N = 3SE +/- 0.33, N = 3SE +/- 0.54, N = 3136.89140.57140.641. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP13215K30K45K60K75KSE +/- 89.04, N = 3SE +/- 922.31, N = 4SE +/- 1002.67, N = 368017.6569374.8769828.361. (CXX) g++ options: -O3 -march=native -fopenmp

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU213246810SE +/- 0.07423, N = 6SE +/- 0.08426, N = 4SE +/- 0.09722, N = 36.519376.660116.68411MIN: 6.3MIN: 6.33MIN: 6.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cpuminer-Opt

Algorithm: Blake-2 S

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Blake-2 S32150K100K150K200K250KSE +/- 3030.09, N = 4SE +/- 2626.85, N = 15SE +/- 3265.93, N = 42260732219382205081. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet1831248121620SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.15, N = 315.1915.2015.54MIN: 14.96 / MAX: 16.48MIN: 15.01 / MAX: 15.36MIN: 15 / MAX: 110.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP21320K40K60K80K100KSE +/- 20.05, N = 3SE +/- 1477.66, N = 5SE +/- 1452.56, N = 12114077.57115661.13116695.381. (CXX) g++ options: -O3 -march=native -fopenmp

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data132300600900120015001543.681546.681576.95

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet12348121620SE +/- 0.14, N = 3SE +/- 0.20, N = 3SE +/- 0.31, N = 316.6016.8416.95MIN: 16.2 / MAX: 74.59MIN: 16.12 / MAX: 18.21MIN: 16.16 / MAX: 17.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET312300K600K900K1200K1500KSE +/- 9026.88, N = 3SE +/- 14514.92, N = 8SE +/- 19369.44, N = 151436898.961415197.081408307.621. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding3216001200180024003000SE +/- 0.00, N = 3SE +/- 0.00, N = 15SE +/- 1.79, N = 152716.902716.902664.351. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Myriad-Groestl2312K4K6K8K10KSE +/- 161.69, N = 3SE +/- 125.83, N = 3SE +/- 6.67, N = 31056710510103631. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression12330060090012001500SE +/- 4.06, N = 3SE +/- 7.88, N = 3SE +/- 21.06, N = 31423141313961. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D1322004006008001000SE +/- 14.12, N = 4SE +/- 17.23, N = 3SE +/- 9.01, N = 121027.881019.371008.911. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode2133691215SE +/- 0.04, N = 5SE +/- 0.03, N = 5SE +/- 0.03, N = 510.6210.6910.801. (CXX) g++ options: -fvisibility=hidden -logg -lm

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_2241321.15792.31583.47374.63165.7895SE +/- 0.016, N = 3SE +/- 0.037, N = 3SE +/- 0.008, N = 35.0615.1005.146MIN: 4.98 / MAX: 5.99MIN: 4.42 / MAX: 11.39MIN: 5.03 / MAX: 6.031. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default2131.29512.59023.88535.18046.4755SE +/- 0.018, N = 3SE +/- 0.037, N = 3SE +/- 0.043, N = 35.6645.6775.7561. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.432110M20M30M40M50MSE +/- 186245.67, N = 3SE +/- 199447.85, N = 3SE +/- 200225.59, N = 34684042746529853461044201. (CXX) g++ options: -O3 -fopenmp

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O1231224364860SE +/- 0.64, N = 5SE +/- 0.70, N = 5SE +/- 0.74, N = 351.1551.9051.911. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface3120.62781.25561.88342.51123.139SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 32.752.762.79MIN: 2.69 / MAX: 3.11MIN: 2.7 / MAX: 3.13MIN: 2.69 / MAX: 3.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 36.226.226.31MIN: 6.14 / MAX: 6.38MIN: 6.12 / MAX: 6.83MIN: 6.11 / MAX: 64.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m321612182430SE +/- 0.06, N = 3SE +/- 0.13, N = 3SE +/- 0.03, N = 323.7523.9324.08MIN: 23.54 / MAX: 24.43MIN: 23.59 / MAX: 24.31MIN: 23.89 / MAX: 25.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.0231246810SE +/- 0.007, N = 3SE +/- 0.018, N = 3SE +/- 0.006, N = 36.1126.1516.196MIN: 6.03 / MAX: 12.61MIN: 6.05 / MAX: 6.9MIN: 4.66 / MAX: 19.351. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p31280160240320400SE +/- 1.19, N = 3SE +/- 3.01, N = 3SE +/- 3.16, N = 3385.99382.21380.86MIN: 312.93 / MAX: 423.3MIN: 305.4 / MAX: 420.06MIN: 302.45 / MAX: 418.31. (CC) gcc options: -pthread

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption32120406080100SE +/- 0.07, N = 3SE +/- 0.16, N = 3SE +/- 1.03, N = 380.1380.2281.081. (CC) gcc options: -O2

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile13220406080100SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.09, N = 3109.86111.07111.13

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K123306090120150SE +/- 0.27, N = 3SE +/- 0.53, N = 3SE +/- 0.14, N = 3142.04140.67140.47MIN: 130.3 / MAX: 163.29MIN: 125.57 / MAX: 161.64MIN: 130.58 / MAX: 160.851. (CC) gcc options: -pthread

Cpuminer-Opt

Algorithm: Magi

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Magi32150100150200250SE +/- 2.29, N = 14SE +/- 0.17, N = 3SE +/- 0.12, N = 3206.24204.18203.981. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD312300K600K900K1200K1500KSE +/- 7906.68, N = 3SE +/- 21115.79, N = 4SE +/- 10942.24, N = 31605411.581602736.881588662.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-502311224364860SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 353.9754.1254.51MIN: 53.45 / MAX: 128.98MIN: 53.84 / MAX: 125.64MIN: 53.53 / MAX: 130.531. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

perf-bench

Benchmark: Sched Pipe

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched Pipe31216K32K48K64K80KSE +/- 629.67, N = 3SE +/- 327.32, N = 3SE +/- 929.56, N = 57443073755736951. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz213612182430SE +/- 0.07, N = 4SE +/- 0.17, N = 4SE +/- 0.10, N = 423.8924.0424.13

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet132510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 320.0320.0620.21MIN: 19.87 / MAX: 21.34MIN: 19.95 / MAX: 21.79MIN: 19.81 / MAX: 21.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50213714212835SE +/- 0.10, N = 3SE +/- 0.25, N = 3SE +/- 0.09, N = 328.8629.0629.11MIN: 28.14 / MAX: 29.56MIN: 28.13 / MAX: 134.5MIN: 28.24 / MAX: 30.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cpuminer-Opt

Algorithm: Garlicoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Garlicoin231400800120016002000SE +/- 6.73, N = 3SE +/- 4.72, N = 3SE +/- 7.48, N = 31750.881738.451736.891. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption321400800120016002000SE +/- 1.95, N = 3SE +/- 1.84, N = 3SE +/- 8.30, N = 31710.61702.61697.2

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.232170M140M210M280M350MSE +/- 980611.02, N = 3SE +/- 2849594.82, N = 3SE +/- 2829162.69, N = 33051336333036123673028911331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU132120240360480600SE +/- 0.87, N = 3SE +/- 0.83, N = 3SE +/- 1.36, N = 35455425411. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1231.2512.5023.7535.0046.255SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 35.525.555.56MIN: 5.41 / MAX: 5.78MIN: 5.4 / MAX: 5.65MIN: 5.41 / MAX: 5.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption321400800120016002000SE +/- 2.43, N = 3SE +/- 3.81, N = 3SE +/- 6.83, N = 31714.41708.51702.3

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.312310002000300040005000SE +/- 37.60, N = 3SE +/- 60.83, N = 3SE +/- 60.75, N = 34654.474644.464621.631. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Cpuminer-Opt

Algorithm: Ringcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Ringcoin32130060090012001500SE +/- 11.74, N = 14SE +/- 10.01, N = 3SE +/- 3.43, N = 31590.601582.491579.481. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression231130260390520650SE +/- 0.33, N = 3SE +/- 2.73, N = 3SE +/- 5.17, N = 35855825811. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile1324080120160200SE +/- 0.55, N = 3SE +/- 0.91, N = 3SE +/- 0.85, N = 3159.69160.09160.67

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding123400800120016002000SE +/- 0.84, N = 3SE +/- 0.77, N = 3SE +/- 0.99, N = 31756.701753.611745.941. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH312300K600K900K1200K1500KSE +/- 2768.29, N = 3SE +/- 3115.55, N = 3SE +/- 9099.06, N = 31249824.001247740.001242294.791. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0123246810SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.288.308.33MIN: 8.18 / MAX: 8.87MIN: 8.14 / MAX: 8.79MIN: 8.16 / MAX: 9.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression231110220330440550SE +/- 0.67, N = 3SE +/- 3.00, N = 35065055031. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 103120.53441.06881.60322.13762.672SE +/- 0.013, N = 3SE +/- 0.011, N = 3SE +/- 0.026, N = 32.3752.3752.361

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1235001000150020002500SE +/- 0.62, N = 3SE +/- 1.77, N = 3SE +/- 9.79, N = 32208.252211.702221.13MIN: 2204.85MIN: 2206.98MIN: 2207.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.132170140210280350SE +/- 0.61, N = 3SE +/- 0.16, N = 3SE +/- 0.32, N = 3327.83328.63329.72MIN: 326.73 / MAX: 330.31MIN: 328.14 / MAX: 330.14MIN: 329.09 / MAX: 331.081. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: LBC, LBRY Credits3215K10K15K20K25KSE +/- 96.84, N = 3SE +/- 86.86, N = 3SE +/- 113.58, N = 32346323407233301. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2312246810SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 37.037.057.07MIN: 6.94 / MAX: 7.65MIN: 6.99 / MAX: 7.63MIN: 6.95 / MAX: 7.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1235001000150020002500SE +/- 1.96, N = 3SE +/- 4.53, N = 3SE +/- 8.06, N = 32209.392214.442221.93MIN: 2203.59MIN: 2206.84MIN: 2205.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cpuminer-Opt

Algorithm: x25x

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: x25x32150100150200250SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 1.24, N = 3220.82220.73219.581. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU213246810SE +/- 0.00084, N = 3SE +/- 0.02331, N = 3SE +/- 0.03820, N = 37.390277.416917.43079MIN: 7.35MIN: 7.36MIN: 7.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption32180160240320400SE +/- 0.07, N = 3SE +/- 0.34, N = 3SE +/- 1.43, N = 3346.2345.4344.4

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool231110K220K330K440K550KSE +/- 541.00, N = 3SE +/- 720.67, N = 3SE +/- 2447.10, N = 3532814532634530142

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v223170140210280350SE +/- 0.24, N = 3SE +/- 0.90, N = 3SE +/- 1.18, N = 3337.51338.74339.21MIN: 336.44 / MAX: 346.96MIN: 336.55 / MAX: 341.55MIN: 336.01 / MAX: 348.171. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU2132K4K6K8K10KSE +/- 16.03, N = 3SE +/- 9.85, N = 3SE +/- 28.57, N = 39775973697291. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 62310.24080.48160.72240.96321.204SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.008, N = 31.0701.0681.065

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU2310.67591.35182.02772.70363.3795SE +/- 0.01863, N = 3SE +/- 0.00263, N = 3SE +/- 0.00849, N = 32.990802.994073.00389MIN: 2.83MIN: 2.94MIN: 2.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption321120240360480600SE +/- 0.07, N = 3SE +/- 1.50, N = 3SE +/- 2.47, N = 3551.4550.3549.0

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU2310.56021.12041.68062.24082.801SE +/- 0.00118, N = 3SE +/- 0.00042, N = 3SE +/- 0.00663, N = 32.479302.481252.48994MIN: 2.45MIN: 2.45MIN: 2.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack23148121620SE +/- 0.02, N = 5SE +/- 0.03, N = 5SE +/- 0.09, N = 517.3217.3317.401. (CXX) g++ options: -rdynamic

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.923160120180240300SE +/- 0.33, N = 3SE +/- 0.41, N = 3SE +/- 0.82, N = 3272.41273.01273.551. (CC) gcc options: -O2 -fvisibility=hidden

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit2131530456075SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.08, N = 368.6268.4468.34MIN: 44.12 / MAX: 171.68MIN: 43.96 / MAX: 169.86MIN: 44.14 / MAX: 168.931. (CC) gcc options: -pthread

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC132150100150200250SE +/- 0.05, N = 3SE +/- 0.55, N = 3SE +/- 0.79, N = 3236.45235.97235.541. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption312120240360480600SE +/- 0.81, N = 3SE +/- 0.78, N = 3SE +/- 1.18, N = 3533.5532.9531.5

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2123306090120150SE +/- 0.03, N = 3SE +/- 0.38, N = 3SE +/- 0.53, N = 3140.73140.32140.211. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31231.22182.44363.66544.88726.109SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 35.415.415.43MIN: 5.29 / MAX: 5.77MIN: 5.29 / MAX: 5.54MIN: 5.28 / MAX: 6.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg161321224364860SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 351.7851.8551.97MIN: 51.37 / MAX: 99.9MIN: 51.49 / MAX: 53.93MIN: 51.54 / MAX: 54.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 12130.06230.12460.18690.24920.3115SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.2770.2770.276

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT13122004006008001000SE +/- 2.11, N = 3SE +/- 2.71, N = 3SE +/- 0.37, N = 31084.331083.861080.551. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption32130060090012001500SE +/- 2.16, N = 3SE +/- 2.11, N = 3SE +/- 1.87, N = 31394.31393.01389.6

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet1323691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 312.0812.1112.12MIN: 11.98 / MAX: 12.56MIN: 12.03 / MAX: 12.39MIN: 12.02 / MAX: 12.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding12330060090012001500SE +/- 0.36, N = 3SE +/- 0.31, N = 3SE +/- 0.57, N = 31285.021284.711280.901. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption21380160240320400SE +/- 0.20, N = 3SE +/- 0.25, N = 3SE +/- 1.02, N = 3346.8346.4345.7

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption31280160240320400SE +/- 0.15, N = 3SE +/- 0.24, N = 3SE +/- 0.91, N = 3346.9346.8345.8

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU21370140210280350SE +/- 0.60, N = 3SE +/- 1.01, N = 3SE +/- 0.44, N = 33283283271. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression12330060090012001500SE +/- 2.08, N = 3SE +/- 3.06, N = 3SE +/- 2.73, N = 31390138913861. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption32130060090012001500SE +/- 0.72, N = 3SE +/- 3.43, N = 3SE +/- 1.19, N = 31404.21401.71400.2

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU2319001800270036004500SE +/- 1.11, N = 3SE +/- 6.05, N = 3SE +/- 1.90, N = 34040.594048.114051.99MIN: 4035.29MIN: 4033MIN: 4039.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression32180160240320400SE +/- 0.33, N = 3SE +/- 0.33, N = 33573563561. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU2130.7581.5162.2743.0323.79SE +/- 0.00051, N = 3SE +/- 0.01155, N = 3SE +/- 0.01022, N = 33.359473.368503.36883MIN: 3.3MIN: 3.3MIN: 3.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1320.63871.27741.91612.55483.1935SE +/- 0.00327, N = 3SE +/- 0.00573, N = 3SE +/- 0.00201, N = 32.830742.833732.83861MIN: 2.8MIN: 2.79MIN: 2.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p213100200300400500SE +/- 0.88, N = 3SE +/- 0.72, N = 3SE +/- 2.60, N = 3459.32459.11458.06MIN: 342.72 / MAX: 583.61MIN: 341.89 / MAX: 588.95MIN: 341.78 / MAX: 587.391. (CC) gcc options: -pthread

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.0321246810SE +/- 0.021, N = 3SE +/- 0.029, N = 3SE +/- 0.011, N = 38.4248.4398.447MIN: 8.33 / MAX: 9.34MIN: 8.3 / MAX: 52.09MIN: 8.35 / MAX: 9.561. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression21390180270360450SE +/- 0.58, N = 3SE +/- 1.00, N = 34014014001. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 52130.18230.36460.54690.72920.9115SE +/- 0.005, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 30.8100.8100.808

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7321120240360480600SE +/- 0.96, N = 3SE +/- 0.47, N = 3SE +/- 1.36, N = 3544.66545.66545.961. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M12350100150200250SE +/- 0.54, N = 3SE +/- 0.61, N = 3SE +/- 1.02, N = 3219.05219.10219.571. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression231901802703604504324314311. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100123120240360480600SE +/- 0.84, N = 3SE +/- 0.57, N = 3SE +/- 0.80, N = 3551.18550.14549.941. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU312246810SE +/- 0.00234, N = 3SE +/- 0.00368, N = 3SE +/- 0.00711, N = 38.382268.393258.40124MIN: 8.31MIN: 8.35MIN: 8.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1321.30012.60023.90035.20046.5005SE +/- 0.01646, N = 3SE +/- 0.00979, N = 3SE +/- 0.00194, N = 35.765365.768905.77823MIN: 5.7MIN: 5.71MIN: 5.741. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 723170140210280350SE +/- 0.19, N = 3SE +/- 0.52, N = 3SE +/- 0.36, N = 3298.25298.54298.921. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption123120240360480600SE +/- 0.52, N = 3SE +/- 0.39, N = 3533.7533.3532.6

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU2133691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 312.6912.7112.72MIN: 12.63MIN: 12.64MIN: 12.641. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption321120240360480600SE +/- 0.26, N = 3SE +/- 0.57, N = 3SE +/- 0.60, N = 2551.7551.2550.7

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v31231326395265SE +/- 0.19, N = 3SE +/- 0.21, N = 3SE +/- 0.26, N = 355.8755.9255.96MIN: 55.38 / MAX: 87.21MIN: 55.44 / MAX: 148.38MIN: 55.41 / MAX: 122.631. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering23150100150200250SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.40, N = 3227.14227.06226.751. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP32150100150200250SE +/- 0.19, N = 3SE +/- 0.19, N = 3SE +/- 0.33, N = 3239.62239.43239.241. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile3124080120160200SE +/- 0.38, N = 3SE +/- 0.10, N = 3SE +/- 0.26, N = 3186.56186.70186.84

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123612182430SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 326.5526.5726.59MIN: 26.01 / MAX: 27.38MIN: 26.04 / MAX: 27.24MIN: 26.05 / MAX: 27.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression2132004006008001000SE +/- 0.53, N = 3SE +/- 0.20, N = 3SE +/- 0.05, N = 3938.92939.76940.231. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3121.14242.28483.42724.56965.712SE +/- 0.00135, N = 3SE +/- 0.00123, N = 3SE +/- 0.00166, N = 35.070055.075645.07714MIN: 5.04MIN: 5.05MIN: 5.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU31248121620SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 313.7813.7913.80MIN: 13.71MIN: 13.69MIN: 13.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU2130.9181.8362.7543.6724.59SE +/- 0.01870, N = 3SE +/- 0.00717, N = 3SE +/- 0.02757, N = 34.074514.074764.07980MIN: 3.98MIN: 4MIN: 3.991. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU3219001800270036004500SE +/- 3.06, N = 3SE +/- 1.68, N = 3SE +/- 2.55, N = 34040.614044.154045.79MIN: 4031.1MIN: 4035.82MIN: 4037.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21321400800120016002000SE +/- 4.94, N = 3SE +/- 2.43, N = 3SE +/- 3.77, N = 31691.61691.51689.61. (CXX) g++ options: -O3 -march=native -rdynamic

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Decompression3122004006008001000SE +/- 0.33, N = 39959959941. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512231300K600K900K1200K1500KSE +/- 562.33, N = 3133011813289931328993

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU2139001800270036004500SE +/- 5.93, N = 3SE +/- 9.80, N = 3SE +/- 8.12, N = 34008400740051. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1239001800270036004500SE +/- 5.71, N = 3SE +/- 6.88, N = 3SE +/- 5.47, N = 34048.954050.544051.42MIN: 4036MIN: 4036.39MIN: 4040.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 532148121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 314.4814.4814.481. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU3215001000150020002500SE +/- 2.02, N = 3SE +/- 0.63, N = 3SE +/- 5.90, N = 32210.612211.322211.56MIN: 2204.44MIN: 2206.12MIN: 2201.951. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption23180160240320400SE +/- 0.17, N = 3SE +/- 0.10, N = 3345.7345.6345.6

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU3211326395265SE +/- 0.17, N = 3SE +/- 0.00, N = 35959591. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression32140801201602001821821821. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression3213060901201501481481481. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression321204060801007676761. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression3211530456075SE +/- 0.58, N = 36565651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression321204060801009696961. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression321816243240SE +/- 0.33, N = 33434341. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP132400K800K1200K1600K2000KSE +/- 6269.54, N = 3SE +/- 91181.44, N = 12SE +/- 4394.55, N = 32010121.551758555.231271416.381. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding321400800120016002000SE +/- 12.81, N = 3SE +/- 31.45, N = 15SE +/- 39.18, N = 151962.741881.941778.101. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding123400800120016002000SE +/- 65.46, N = 3SE +/- 45.52, N = 15SE +/- 47.93, N = 122071.461984.731979.001. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding132400800120016002000SE +/- 18.20, N = 3SE +/- 29.00, N = 12SE +/- 28.57, N = 151699.851597.841589.751. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Triple SHA-256, Onecoin31213K26K39K52K65KSE +/- 80.07, N = 3SE +/- 902.62, N = 15SE +/- 1076.82, N = 156292361247603501. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, Pyrite23110K20K30K40K50KSE +/- 748.75, N = 15SE +/- 1271.64, N = 15SE +/- 749.50, N = 34862248341476531. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Deepcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Deepcoin13216003200480064008000SE +/- 132.89, N = 15SE +/- 33.20, N = 3SE +/- 164.38, N = 157487.987312.247013.871. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein3121.00422.00843.01264.01685.021SE +/- 0.121, N = 15SE +/- 0.131, N = 15SE +/- 0.017, N = 34.4634.4154.2611. (CXX) g++ options: -O3 -pthread -lm

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1323691215SE +/- 0.13, N = 3SE +/- 0.09, N = 3SE +/- 1.35, N = 1213.213.011.01. (CC) gcc options: -fopenmp -O3 -lm


Phoronix Test Suite v10.8.4