Haswell 2021

Intel Xeon E5-2687W v3 testing with a MSI X99S SLI PLUS (MS-7885) v1.0 (1.E0 BIOS) and NVIDIA GeForce GTX 770 on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101287-HA-HASWELL2015&grr&rdt.

Haswell 2021ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen Resolution123Intel Xeon E5-2687W v3 @ 3.50GHz (10 Cores / 20 Threads)MSI X99S SLI PLUS (MS-7885) v1.0 (1.E0 BIOS)Intel Xeon E7 v3/Xeon32GB80GB INTEL SSDSCKGW08NVIDIA GeForce GTX 770Realtek ALC892LG Ultra HDIntel I218-VUbuntu 20.045.9.0-050900rc7daily20200928-generic (x86_64) 20200927GNOME Shell 3.36.4X Server 1.20.8modesetting 1.20.8GCC 9.3.0ext43840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x44Python Details- Python 3.8.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Haswell 2021webp2: Quality 100, Lossless Compressionwebp2: Quality 95, Compression Effort 7cp2k: Fayalite-FIST Datawebp2: Quality 75, Compression Effort 7financebench: Bonds OpenMPnpb: EP.Dgcrypt: openfoam: Motorbike 30Mdav1d: Chimera 1080p 10-bitbuild-godot: Time To Compilebuild2: Time To Compileaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: resnet-v2-50mnn: SqueezeNetV1.0cloverleaf: Lagrangian-Eulerian Hydrodynamicsonnx: fcn-resnet101-11 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: yolov4 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUcpuminer-opt: Triple SHA-256, Onecoincpuminer-opt: Deepcoincpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: Skeincoinbuild-eigen: Time To Compileaskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingclomp: Static OMP Speedupfinancebench: Repo OpenMPonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUperf-bench: Sched Pipegnupg: 2.7GB Sample File Encryptioncpuminer-opt: Blake-2 Sonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUqmcpack: simple-H2Orav1e: 5kripke: rav1e: 1ncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetcpuminer-opt: Magicpuminer-opt: Ringcoinrav1e: 6askap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingredis: SETlzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressionquantlib: rav1e: 10etcpak: ETC2redis: LPOPcryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: AES-XTS 256b Encryptioncryptsetup: PBKDF2-whirlpoolcryptsetup: PBKDF2-sha512unpack-firefox: firefox-84.0.source.tar.xzonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUcpuminer-opt: Myriad-Groestlcpuminer-opt: Garlicoincpuminer-opt: LBC, LBRY Creditscpuminer-opt: x25xsynthmark: VoiceMark_100encode-wavpack: WAV To WavPackdav1d: Summer Nature 4Klzbench: Zstd 8 - Decompressionlzbench: Zstd 8 - Compressionlzbench: Crush 0 - Decompressionlzbench: Crush 0 - Compressiondav1d: Chimera 1080paskap: Hogbom Clean OpenMPlzbench: Brotli 2 - Decompressionlzbench: Brotli 2 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Zstd 1 - Compressiontnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1lzbench: Brotli 0 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Libdeflate 1 - Decompressionlzbench: Libdeflate 1 - Compressionetcpak: ETC1 + Ditheringonednn: Deconvolution Batch shapes_1d - f32 - CPUnpb: EP.Cetcpak: ETC1amg: redis: LPUSHredis: SADDencode-opus: WAV To Opus Encodelammps: Rhodopsin Proteinredis: GETonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUwebp2: Quality 100, Compression Effort 5onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUlulesh: dav1d: Summer Nature 1080ponednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUwebp2: Defaultetcpak: DXT1onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPU123939.760545.9561543.683298.918115661.1296871027.88273.551219.0568.44186.697159.6872071.461699.8555.8656.1965.06154.5118.447136.895954532897364007612477487.984765334847109.8591756.701285.0213.268017.6536464051.994048.954045.797375581.0812205082209.392211.562208.2551.1530.810461044200.27724.0826.5531.4329.0612.0815.2051.7816.602.768.285.527.055.416.2220.03203.981579.481.0652664.351778.101415197.0896341689.62.375140.7312010121.55346.4345.6533.7550.71389.61400.2346.8344.4532.9549.01697.21702.3530142132899324.0386.66011103631736.8923330219.58551.18117.395142.0414236543176459.11239.2355811481390401339.212329.721503356995182226.7525.765361050.17235.5403028911331247740.001602736.8810.6864.4151887226.874.074762.8307414.4823.368503.003894654.4664382.212.489947.4169112.707513.78545.6771083.8615.075648.39325938.916545.6631576.952298.253114077.5651041008.91272.411219.1068.62186.844160.6741984.731589.7555.9176.1125.14653.9668.439140.645954132897754008603507013.874862234338111.1281753.611284.7111.069828.3619794040.594050.544044.157369580.2212219382214.442211.322211.7051.9030.810465298530.27723.9326.5731.9628.8612.1215.5451.9716.842.798.305.557.075.416.2220.21204.181582.491.0702716.91881.941408307.6296341691.52.361140.3151271416.38346.8345.7533.3551.21393.01401.7345.8345.4531.5550.31702.61708.5532814133011823.8916.51937105671750.8823407220.73550.14217.322140.6714136543276459.32239.4255851481389401337.511328.625506356994182227.1365.778231002.01235.9683036123671242294.791588662.7510.6174.2611775802.504.074512.8386114.4783.359472.990804644.4601380.862.479307.3902712.693813.79855.6641080.5465.077148.40124940.231544.6621546.678298.540116695.3782551019.37273.013219.5768.34186.557160.0931979.001597.8455.9606.1515.10054.1248.424140.575954232797294005629237312.244834135707111.0651745.941280.9013.069374.8691414048.114051.424040.617443080.1272260732221.932210.612221.1351.9130.808468404270.27623.7526.5932.7329.1112.1115.1951.8516.952.758.335.567.035.436.3120.06206.241590.601.0682716.91962.741436898.9696341691.62.375140.2101758555.23345.7345.6532.6551.71394.31404.2346.9346.2533.5551.41710.61714.4532634132899324.1276.68411105101738.4523463220.82549.93517.326140.4713966543176458.06239.6175821481386400338.741327.826505357995182227.0565.768901047.67236.4543051336331249824.001605411.5810.7964.4631765664.374.079802.8337314.4753.368832.994074621.6305385.992.481257.4307912.719313.78045.7561084.3295.070058.38226OpenBenchmarking.org

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression1232004006008001000SE +/- 0.20, N = 3SE +/- 0.53, N = 3SE +/- 0.05, N = 3939.76938.92940.231. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7123120240360480600SE +/- 1.36, N = 3SE +/- 0.47, N = 3SE +/- 0.96, N = 3545.96545.66544.661. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data123300600900120015001543.681576.951546.68

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 712370140210280350SE +/- 0.36, N = 3SE +/- 0.19, N = 3SE +/- 0.52, N = 3298.92298.25298.541. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP12320K40K60K80K100KSE +/- 1477.66, N = 5SE +/- 20.05, N = 3SE +/- 1452.56, N = 12115661.13114077.57116695.381. (CXX) g++ options: -O3 -march=native -fopenmp

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D1232004006008001000SE +/- 14.12, N = 4SE +/- 9.01, N = 12SE +/- 17.23, N = 31027.881008.911019.371. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.912360120180240300SE +/- 0.82, N = 3SE +/- 0.33, N = 3SE +/- 0.41, N = 3273.55272.41273.011. (CC) gcc options: -O2 -fvisibility=hidden

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M12350100150200250SE +/- 0.54, N = 3SE +/- 0.61, N = 3SE +/- 1.02, N = 3219.05219.10219.571. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit1231530456075SE +/- 0.18, N = 3SE +/- 0.17, N = 3SE +/- 0.08, N = 368.4468.6268.34MIN: 43.96 / MAX: 169.86MIN: 44.12 / MAX: 171.68MIN: 44.14 / MAX: 168.931. (CC) gcc options: -pthread

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile1234080120160200SE +/- 0.10, N = 3SE +/- 0.26, N = 3SE +/- 0.38, N = 3186.70186.84186.56

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile1234080120160200SE +/- 0.55, N = 3SE +/- 0.85, N = 3SE +/- 0.91, N = 3159.69160.67160.09

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding123400800120016002000SE +/- 65.46, N = 3SE +/- 45.52, N = 15SE +/- 47.93, N = 122071.461984.731979.001. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding123400800120016002000SE +/- 18.20, N = 3SE +/- 28.57, N = 15SE +/- 29.00, N = 121699.851589.751597.841. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v31231326395265SE +/- 0.19, N = 3SE +/- 0.21, N = 3SE +/- 0.26, N = 355.8755.9255.96MIN: 55.38 / MAX: 87.21MIN: 55.44 / MAX: 148.38MIN: 55.41 / MAX: 122.631. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.0123246810SE +/- 0.006, N = 3SE +/- 0.007, N = 3SE +/- 0.018, N = 36.1966.1126.151MIN: 4.66 / MAX: 19.35MIN: 6.03 / MAX: 12.61MIN: 6.05 / MAX: 6.91. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_2241231.15792.31583.47374.63165.7895SE +/- 0.016, N = 3SE +/- 0.008, N = 3SE +/- 0.037, N = 35.0615.1465.100MIN: 4.98 / MAX: 5.99MIN: 5.03 / MAX: 6.03MIN: 4.42 / MAX: 11.391. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-501231224364860SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 354.5153.9754.12MIN: 53.53 / MAX: 130.53MIN: 53.45 / MAX: 128.98MIN: 53.84 / MAX: 125.641. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.0123246810SE +/- 0.011, N = 3SE +/- 0.029, N = 3SE +/- 0.021, N = 38.4478.4398.424MIN: 8.35 / MAX: 9.56MIN: 8.3 / MAX: 52.09MIN: 8.33 / MAX: 9.341. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics123306090120150SE +/- 0.22, N = 3SE +/- 0.54, N = 3SE +/- 0.33, N = 3136.89140.64140.571. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU1231326395265SE +/- 0.00, N = 3SE +/- 0.17, N = 35959591. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU123120240360480600SE +/- 0.87, N = 3SE +/- 1.36, N = 3SE +/- 0.83, N = 35455415421. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU12370140210280350SE +/- 1.01, N = 3SE +/- 0.60, N = 3SE +/- 0.44, N = 33283283271. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU1232K4K6K8K10KSE +/- 9.85, N = 3SE +/- 16.03, N = 3SE +/- 28.57, N = 39736977597291. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU1239001800270036004500SE +/- 9.80, N = 3SE +/- 5.93, N = 3SE +/- 8.12, N = 34007400840051. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Triple SHA-256, Onecoin12313K26K39K52K65KSE +/- 902.62, N = 15SE +/- 1076.82, N = 15SE +/- 80.07, N = 36124760350629231. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Deepcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Deepcoin12316003200480064008000SE +/- 132.89, N = 15SE +/- 164.38, N = 15SE +/- 33.20, N = 37487.987013.877312.241. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, Pyrite12310K20K30K40K50KSE +/- 749.50, N = 3SE +/- 748.75, N = 15SE +/- 1271.64, N = 154765348622483411. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Skeincoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Skeincoin1238K16K24K32K40KSE +/- 291.68, N = 15SE +/- 325.79, N = 15SE +/- 140.75, N = 33484734338357071. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile12320406080100SE +/- 0.16, N = 3SE +/- 0.09, N = 3SE +/- 0.15, N = 3109.86111.13111.07

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding123400800120016002000SE +/- 0.84, N = 3SE +/- 0.77, N = 3SE +/- 0.99, N = 31756.701753.611745.941. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding12330060090012001500SE +/- 0.36, N = 3SE +/- 0.31, N = 3SE +/- 0.57, N = 31285.021284.711280.901. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1233691215SE +/- 0.13, N = 3SE +/- 1.35, N = 12SE +/- 0.09, N = 313.211.013.01. (CC) gcc options: -fopenmp -O3 -lm

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP12315K30K45K60K75KSE +/- 89.04, N = 3SE +/- 1002.67, N = 3SE +/- 922.31, N = 468017.6569828.3669374.871. (CXX) g++ options: -O3 -march=native -fopenmp

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1239001800270036004500SE +/- 1.90, N = 3SE +/- 1.11, N = 3SE +/- 6.05, N = 34051.994040.594048.11MIN: 4039.27MIN: 4035.29MIN: 40331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1239001800270036004500SE +/- 5.71, N = 3SE +/- 6.88, N = 3SE +/- 5.47, N = 34048.954050.544051.42MIN: 4036MIN: 4036.39MIN: 4040.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1239001800270036004500SE +/- 2.55, N = 3SE +/- 1.68, N = 3SE +/- 3.06, N = 34045.794044.154040.61MIN: 4037.43MIN: 4035.82MIN: 4031.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

perf-bench

Benchmark: Sched Pipe

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched Pipe12316K32K48K64K80KSE +/- 327.32, N = 3SE +/- 929.56, N = 5SE +/- 629.67, N = 37375573695744301. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption12320406080100SE +/- 1.03, N = 3SE +/- 0.16, N = 3SE +/- 0.07, N = 381.0880.2280.131. (CC) gcc options: -O2

Cpuminer-Opt

Algorithm: Blake-2 S

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Blake-2 S12350K100K150K200K250KSE +/- 3265.93, N = 4SE +/- 2626.85, N = 15SE +/- 3030.09, N = 42205082219382260731. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1235001000150020002500SE +/- 1.96, N = 3SE +/- 4.53, N = 3SE +/- 8.06, N = 32209.392214.442221.93MIN: 2203.59MIN: 2206.84MIN: 2205.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1235001000150020002500SE +/- 5.90, N = 3SE +/- 0.63, N = 3SE +/- 2.02, N = 32211.562211.322210.61MIN: 2201.95MIN: 2206.12MIN: 2204.441. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1235001000150020002500SE +/- 0.62, N = 3SE +/- 1.77, N = 3SE +/- 9.79, N = 32208.252211.702221.13MIN: 2204.85MIN: 2206.98MIN: 2207.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O1231224364860SE +/- 0.64, N = 5SE +/- 0.70, N = 5SE +/- 0.74, N = 351.1551.9051.911. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 51230.18230.36460.54690.72920.9115SE +/- 0.001, N = 3SE +/- 0.005, N = 3SE +/- 0.003, N = 30.8100.8100.808

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.412310M20M30M40M50MSE +/- 200225.59, N = 3SE +/- 199447.85, N = 3SE +/- 186245.67, N = 34610442046529853468404271. (CXX) g++ options: -O3 -fopenmp

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 11230.06230.12460.18690.24920.3115SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.2770.2770.276

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m123612182430SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.06, N = 324.0823.9323.75MIN: 23.89 / MAX: 25.07MIN: 23.59 / MAX: 24.31MIN: 23.54 / MAX: 24.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123612182430SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 326.5526.5726.59MIN: 26.01 / MAX: 27.38MIN: 26.04 / MAX: 27.24MIN: 26.05 / MAX: 27.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny123816243240SE +/- 0.35, N = 3SE +/- 0.60, N = 3SE +/- 0.34, N = 331.4331.9632.73MIN: 30.14 / MAX: 34.58MIN: 30.06 / MAX: 35.42MIN: 31.73 / MAX: 35.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50123714212835SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 329.0628.8629.11MIN: 28.13 / MAX: 134.5MIN: 28.14 / MAX: 29.56MIN: 28.24 / MAX: 30.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet1233691215SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 312.0812.1212.11MIN: 11.98 / MAX: 12.56MIN: 12.02 / MAX: 12.63MIN: 12.03 / MAX: 12.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet1812348121620SE +/- 0.04, N = 3SE +/- 0.15, N = 3SE +/- 0.06, N = 315.2015.5415.19MIN: 15.01 / MAX: 15.36MIN: 15 / MAX: 110.46MIN: 14.96 / MAX: 16.481. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg161231224364860SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 351.7851.9751.85MIN: 51.37 / MAX: 99.9MIN: 51.54 / MAX: 54.71MIN: 51.49 / MAX: 53.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet12348121620SE +/- 0.14, N = 3SE +/- 0.20, N = 3SE +/- 0.31, N = 316.6016.8416.95MIN: 16.2 / MAX: 74.59MIN: 16.12 / MAX: 18.21MIN: 16.16 / MAX: 17.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1230.62781.25561.88342.51123.139SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 32.762.792.75MIN: 2.7 / MAX: 3.13MIN: 2.69 / MAX: 3.4MIN: 2.69 / MAX: 3.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0123246810SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.288.308.33MIN: 8.18 / MAX: 8.87MIN: 8.14 / MAX: 8.79MIN: 8.16 / MAX: 9.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1231.2512.5023.7535.0046.255SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 35.525.555.56MIN: 5.41 / MAX: 5.78MIN: 5.4 / MAX: 5.65MIN: 5.41 / MAX: 5.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2123246810SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 37.057.077.03MIN: 6.99 / MAX: 7.63MIN: 6.95 / MAX: 7.92MIN: 6.94 / MAX: 7.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31231.22182.44363.66544.88726.109SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 35.415.415.43MIN: 5.29 / MAX: 5.77MIN: 5.29 / MAX: 5.54MIN: 5.28 / MAX: 6.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 36.226.226.31MIN: 6.14 / MAX: 6.38MIN: 6.12 / MAX: 6.83MIN: 6.11 / MAX: 64.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123510152025SE +/- 0.02, N = 3SE +/- 0.14, N = 3SE +/- 0.01, N = 320.0320.2120.06MIN: 19.87 / MAX: 21.34MIN: 19.81 / MAX: 21.32MIN: 19.95 / MAX: 21.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cpuminer-Opt

Algorithm: Magi

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Magi12350100150200250SE +/- 0.12, N = 3SE +/- 0.17, N = 3SE +/- 2.29, N = 14203.98204.18206.241. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Ringcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Ringcoin12330060090012001500SE +/- 3.43, N = 3SE +/- 10.01, N = 3SE +/- 11.74, N = 141579.481582.491590.601. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 61230.24080.48160.72240.96321.204SE +/- 0.008, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 31.0651.0701.068

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding1236001200180024003000SE +/- 1.79, N = 15SE +/- 0.00, N = 15SE +/- 0.00, N = 32664.352716.902716.901. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding123400800120016002000SE +/- 39.18, N = 15SE +/- 31.45, N = 15SE +/- 12.81, N = 31778.101881.941962.741. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123300K600K900K1200K1500KSE +/- 14514.92, N = 8SE +/- 19369.44, N = 15SE +/- 9026.88, N = 31415197.081408307.621436898.961. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression123204060801009696961. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression123816243240SE +/- 0.33, N = 33434341. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21123400800120016002000SE +/- 3.77, N = 3SE +/- 2.43, N = 3SE +/- 4.94, N = 31689.61691.51691.61. (CXX) g++ options: -O3 -march=native -rdynamic

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 101230.53441.06881.60322.13762.672SE +/- 0.011, N = 3SE +/- 0.026, N = 3SE +/- 0.013, N = 32.3752.3612.375

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2123306090120150SE +/- 0.03, N = 3SE +/- 0.38, N = 3SE +/- 0.53, N = 3140.73140.32140.211. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123400K800K1200K1600K2000KSE +/- 6269.54, N = 3SE +/- 4394.55, N = 3SE +/- 91181.44, N = 122010121.551271416.381758555.231. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption12380160240320400SE +/- 0.25, N = 3SE +/- 0.20, N = 3SE +/- 1.02, N = 3346.4346.8345.7

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption12380160240320400SE +/- 0.17, N = 3SE +/- 0.10, N = 3345.6345.7345.6

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption123120240360480600SE +/- 0.52, N = 3SE +/- 0.39, N = 3533.7533.3532.6

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption123120240360480600SE +/- 0.60, N = 2SE +/- 0.57, N = 3SE +/- 0.26, N = 3550.7551.2551.7

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption12330060090012001500SE +/- 1.87, N = 3SE +/- 2.11, N = 3SE +/- 2.16, N = 31389.61393.01394.3

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption12330060090012001500SE +/- 1.19, N = 3SE +/- 3.43, N = 3SE +/- 0.72, N = 31400.21401.71404.2

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption12380160240320400SE +/- 0.24, N = 3SE +/- 0.91, N = 3SE +/- 0.15, N = 3346.8345.8346.9

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption12380160240320400SE +/- 1.43, N = 3SE +/- 0.34, N = 3SE +/- 0.07, N = 3344.4345.4346.2

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption123120240360480600SE +/- 0.78, N = 3SE +/- 1.18, N = 3SE +/- 0.81, N = 3532.9531.5533.5

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption123120240360480600SE +/- 2.47, N = 3SE +/- 1.50, N = 3SE +/- 0.07, N = 3549.0550.3551.4

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption123400800120016002000SE +/- 8.30, N = 3SE +/- 1.84, N = 3SE +/- 1.95, N = 31697.21702.61710.6

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption123400800120016002000SE +/- 6.83, N = 3SE +/- 3.81, N = 3SE +/- 2.43, N = 31702.31708.51714.4

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool123110K220K330K440K550KSE +/- 2447.10, N = 3SE +/- 541.00, N = 3SE +/- 720.67, N = 3530142532814532634

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512123300K600K900K1200K1500KSE +/- 562.33, N = 3132899313301181328993

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz123612182430SE +/- 0.17, N = 4SE +/- 0.07, N = 4SE +/- 0.10, N = 424.0423.8924.13

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.08426, N = 4SE +/- 0.07423, N = 6SE +/- 0.09722, N = 36.660116.519376.68411MIN: 6.33MIN: 6.3MIN: 6.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cpuminer-Opt

Algorithm: Myriad-Groestl

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Myriad-Groestl1232K4K6K8K10KSE +/- 6.67, N = 3SE +/- 161.69, N = 3SE +/- 125.83, N = 31036310567105101. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Garlicoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Garlicoin123400800120016002000SE +/- 7.48, N = 3SE +/- 6.73, N = 3SE +/- 4.72, N = 31736.891750.881738.451. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: LBC, LBRY Credits1235K10K15K20K25KSE +/- 113.58, N = 3SE +/- 86.86, N = 3SE +/- 96.84, N = 32333023407234631. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: x25x

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: x25x12350100150200250SE +/- 1.24, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3219.58220.73220.821. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100123120240360480600SE +/- 0.84, N = 3SE +/- 0.57, N = 3SE +/- 0.80, N = 3551.18550.14549.941. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.09, N = 5SE +/- 0.02, N = 5SE +/- 0.03, N = 517.4017.3217.331. (CXX) g++ options: -rdynamic

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K123306090120150SE +/- 0.27, N = 3SE +/- 0.53, N = 3SE +/- 0.14, N = 3142.04140.67140.47MIN: 130.3 / MAX: 163.29MIN: 125.57 / MAX: 161.64MIN: 130.58 / MAX: 160.851. (CC) gcc options: -pthread

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression12330060090012001500SE +/- 4.06, N = 3SE +/- 7.88, N = 3SE +/- 21.06, N = 31423141313961. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression1231530456075SE +/- 0.58, N = 36565651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression123901802703604504314324311. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression123204060801007676761. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p123100200300400500SE +/- 0.72, N = 3SE +/- 0.88, N = 3SE +/- 2.60, N = 3459.11459.32458.06MIN: 341.89 / MAX: 588.95MIN: 342.72 / MAX: 583.61MIN: 341.78 / MAX: 587.391. (CC) gcc options: -pthread

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP12350100150200250SE +/- 0.33, N = 3SE +/- 0.19, N = 3SE +/- 0.19, N = 3239.24239.43239.621. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression123130260390520650SE +/- 5.17, N = 3SE +/- 0.33, N = 3SE +/- 2.73, N = 35815855821. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression1233060901201501481481481. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression12330060090012001500SE +/- 2.08, N = 3SE +/- 3.06, N = 3SE +/- 2.73, N = 31390138913861. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression12390180270360450SE +/- 0.58, N = 3SE +/- 1.00, N = 34014014001. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v212370140210280350SE +/- 1.18, N = 3SE +/- 0.24, N = 3SE +/- 0.90, N = 3339.21337.51338.74MIN: 336.01 / MAX: 348.17MIN: 336.44 / MAX: 346.96MIN: 336.55 / MAX: 341.551. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.112370140210280350SE +/- 0.32, N = 3SE +/- 0.16, N = 3SE +/- 0.61, N = 3329.72328.63327.83MIN: 329.09 / MAX: 331.08MIN: 328.14 / MAX: 330.14MIN: 326.73 / MAX: 330.311. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression123110220330440550SE +/- 3.00, N = 3SE +/- 0.67, N = 35035065051. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression12380160240320400SE +/- 0.33, N = 3SE +/- 0.33, N = 33563563571. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Decompression1232004006008001000SE +/- 0.33, N = 39959949951. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression12340801201602001821821821. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering12350100150200250SE +/- 0.40, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3226.75227.14227.061. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1231.30012.60023.90035.20046.5005SE +/- 0.01646, N = 3SE +/- 0.00194, N = 3SE +/- 0.00979, N = 35.765365.778235.76890MIN: 5.7MIN: 5.74MIN: 5.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C1232004006008001000SE +/- 16.48, N = 3SE +/- 11.00, N = 15SE +/- 4.68, N = 31050.171002.011047.671. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC112350100150200250SE +/- 0.79, N = 3SE +/- 0.55, N = 3SE +/- 0.05, N = 3235.54235.97236.451. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.212370M140M210M280M350MSE +/- 2829162.69, N = 3SE +/- 2849594.82, N = 3SE +/- 980611.02, N = 33028911333036123673051336331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KSE +/- 3115.55, N = 3SE +/- 9099.06, N = 3SE +/- 2768.29, N = 31247740.001242294.791249824.001. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123300K600K900K1200K1500KSE +/- 21115.79, N = 4SE +/- 10942.24, N = 3SE +/- 7906.68, N = 31602736.881588662.751605411.581. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1233691215SE +/- 0.03, N = 5SE +/- 0.04, N = 5SE +/- 0.03, N = 510.6910.6210.801. (CXX) g++ options: -fvisibility=hidden -logg -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1231.00422.00843.01264.01685.021SE +/- 0.131, N = 15SE +/- 0.017, N = 3SE +/- 0.121, N = 154.4154.2614.4631. (CXX) g++ options: -O3 -pthread -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123400K800K1200K1600K2000KSE +/- 6745.04, N = 3SE +/- 25400.58, N = 3SE +/- 20496.02, N = 31887226.871775802.501765664.371. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1230.9181.8362.7543.6724.59SE +/- 0.00717, N = 3SE +/- 0.01870, N = 3SE +/- 0.02757, N = 34.074764.074514.07980MIN: 4MIN: 3.98MIN: 3.991. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.63871.27741.91612.55483.1935SE +/- 0.00327, N = 3SE +/- 0.00201, N = 3SE +/- 0.00573, N = 32.830742.838612.83373MIN: 2.8MIN: 2.8MIN: 2.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 512348121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 314.4814.4814.481. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.7581.5162.2743.0323.79SE +/- 0.01155, N = 3SE +/- 0.00051, N = 3SE +/- 0.01022, N = 33.368503.359473.36883MIN: 3.3MIN: 3.3MIN: 3.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.67591.35182.02772.70363.3795SE +/- 0.00849, N = 3SE +/- 0.01863, N = 3SE +/- 0.00263, N = 33.003892.990802.99407MIN: 2.94MIN: 2.83MIN: 2.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.312310002000300040005000SE +/- 37.60, N = 3SE +/- 60.83, N = 3SE +/- 60.75, N = 34654.474644.464621.631. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p12380160240320400SE +/- 3.01, N = 3SE +/- 3.16, N = 3SE +/- 1.19, N = 3382.21380.86385.99MIN: 305.4 / MAX: 420.06MIN: 302.45 / MAX: 418.3MIN: 312.93 / MAX: 423.31. (CC) gcc options: -pthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.56021.12041.68062.24082.801SE +/- 0.00663, N = 3SE +/- 0.00118, N = 3SE +/- 0.00042, N = 32.489942.479302.48125MIN: 2.45MIN: 2.45MIN: 2.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU123246810SE +/- 0.02331, N = 3SE +/- 0.00084, N = 3SE +/- 0.03820, N = 37.416917.390277.43079MIN: 7.36MIN: 7.35MIN: 7.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 312.7112.6912.72MIN: 12.64MIN: 12.63MIN: 12.641. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU12348121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 313.7913.8013.78MIN: 13.69MIN: 13.72MIN: 13.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default1231.29512.59023.88535.18046.4755SE +/- 0.037, N = 3SE +/- 0.018, N = 3SE +/- 0.043, N = 35.6775.6645.7561. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT11232004006008001000SE +/- 2.71, N = 3SE +/- 0.37, N = 3SE +/- 2.11, N = 31083.861080.551084.331. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1231.14242.28483.42724.56965.712SE +/- 0.00123, N = 3SE +/- 0.00166, N = 3SE +/- 0.00135, N = 35.075645.077145.07005MIN: 5.05MIN: 5.06MIN: 5.041. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU123246810SE +/- 0.00368, N = 3SE +/- 0.00711, N = 3SE +/- 0.00234, N = 38.393258.401248.38226MIN: 8.35MIN: 8.35MIN: 8.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.4