Haswell 2021

Intel Xeon E5-2687W v3 testing with a MSI X99S SLI PLUS (MS-7885) v1.0 (1.E0 BIOS) and NVIDIA GeForce GTX 770 on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101287-HA-HASWELL2015&sor&grw.

Haswell 2021ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen Resolution123Intel Xeon E5-2687W v3 @ 3.50GHz (10 Cores / 20 Threads)MSI X99S SLI PLUS (MS-7885) v1.0 (1.E0 BIOS)Intel Xeon E7 v3/Xeon32GB80GB INTEL SSDSCKGW08NVIDIA GeForce GTX 770Realtek ALC892LG Ultra HDIntel I218-VUbuntu 20.045.9.0-050900rc7daily20200928-generic (x86_64) 20200927GNOME Shell 3.36.4X Server 1.20.8modesetting 1.20.8GCC 9.3.0ext43840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x44Python Details- Python 3.8.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Haswell 2021clomp: Static OMP Speedupcryptsetup: PBKDF2-sha512cryptsetup: PBKDF2-whirlpoolcryptsetup: AES-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: Twofish-XTS 512b Decryptionlzbench: XZ 0 - Compressionlzbench: XZ 0 - Decompressionlzbench: Zstd 1 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Zstd 8 - Compressionlzbench: Zstd 8 - Decompressionlzbench: Crush 0 - Compressionlzbench: Crush 0 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Brotli 0 - Decompressionlzbench: Brotli 2 - Compressionlzbench: Brotli 2 - Decompressionlzbench: Libdeflate 1 - Compressionlzbench: Libdeflate 1 - Decompressionencode-opus: WAV To Opus Encodeencode-wavpack: WAV To WavPacketcpak: DXT1etcpak: ETC1etcpak: ETC2etcpak: ETC1 + Ditheringwebp2: Defaultwebp2: Quality 75, Compression Effort 7webp2: Quality 95, Compression Effort 7webp2: Quality 100, Compression Effort 5webp2: Quality 100, Lossless Compressionsynthmark: VoiceMark_100gcrypt: quantlib: cloverleaf: Lagrangian-Eulerian Hydrodynamicsmnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3onnx: yolov4 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUtnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mlammps: Rhodopsin Proteinnpb: EP.Cnpb: EP.Donednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUaskap: tConvolve MT - Griddingaskap: tConvolve MT - Degriddingaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddingaskap: tConvolve OpenMP - Griddingaskap: tConvolve OpenMP - Degriddingaskap: Hogbom Clean OpenMPamg: kripke: lulesh: openfoam: Motorbike 30Mqmcpack: simple-H2Ocp2k: Fayalite-FIST Dataperf-bench: Sched Pipecpuminer-opt: Magicpuminer-opt: x25xcpuminer-opt: Deepcoincpuminer-opt: Ringcoincpuminer-opt: Blake-2 Scpuminer-opt: Garlicoincpuminer-opt: Skeincoincpuminer-opt: Myriad-Groestlcpuminer-opt: LBC, LBRY Creditscpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: Triple SHA-256, Onecoindav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitrav1e: 1rav1e: 5rav1e: 6rav1e: 10build-godot: Time To Compilebuild2: Time To Compilebuild-eigen: Time To Compilefinancebench: Repo OpenMPfinancebench: Bonds OpenMPredis: LPOPredis: SADDunpack-firefox: firefox-84.0.source.tar.xzredis: LPUSHredis: GETredis: SETgnupg: 2.7GB Sample File Encryption12313.213289935301421702.31697.2549.0532.9344.4346.81400.21389.6550.7533.7345.6346.4349640113906514237643135650314858118299510.68617.3951083.861235.540140.731226.7525.677298.918545.95614.482939.760551.181273.5511689.6136.898.44754.5115.0616.19655.8653285455997364007339.212329.72120.036.225.417.055.528.282.7616.6051.7815.2012.0829.0631.4326.5524.084.4151050.171027.884.074767.416912.830742.4899413.78545.765368.3932512.70756.660115.075644048.952209.394045.792208.253.368504051.992211.563.003891285.021756.701699.852071.461778.102664.35239.235302891133461044204654.4664219.0551.1531543.68373755203.98219.587487.981579.482205081736.893484710363233304765361247459.11142.04382.2168.440.2770.8101.0652.375186.697159.687109.85968017.653646115661.1296872010121.551602736.8824.0381247740.001887226.871415197.0881.08111.013301185328141708.51702.6550.3531.5345.4345.81401.71393.0551.2533.3345.7346.8349640113896514137643235650614858518299410.61717.3221080.546235.968140.315227.1365.664298.253545.66314.478938.916550.142272.4111691.5140.648.43953.9665.1466.11255.9173285415997754008337.511328.62520.216.225.417.075.558.302.7916.8451.9715.5412.1228.8631.9626.5723.934.2611002.011008.914.074517.390272.838612.4793013.79855.778238.4012412.69386.519375.077144050.542214.444044.152211.703.359474040.592211.322.990801284.711753.611589.751984.731881.942716.9239.425303612367465298534644.4601219.1051.9031576.95273695204.18220.737013.871582.492219381750.883433810567234074862260350459.32140.67380.8668.620.2770.8101.0702.361186.844160.674111.12869828.361979114077.5651041271416.381588662.7523.8911242294.791775802.501408307.6280.22113.013289935326341714.41710.6551.4533.5346.2346.91404.21394.3551.7532.6345.6345.7349640013866513967643135750514858218299510.79617.3261084.329236.454140.210227.0565.756298.540544.66214.475940.231549.935273.0131691.6140.578.42454.1245.1006.15155.9603275425997294005338.741327.82620.066.315.437.035.568.332.7516.9551.8515.1912.1129.1132.7326.5923.754.4631047.671019.374.079807.430792.833732.4812513.78045.768908.3822612.71936.684115.070054051.422221.934040.612221.133.368834048.112210.612.994071280.901745.941597.841979.001962.742716.9239.617305133633468404274621.6305219.5751.9131546.67874430206.24220.827312.241590.602260731738.453570710510234634834162923458.06140.47385.9968.340.2760.8081.0682.375186.557160.093111.06569374.869141116695.3782551758555.231605411.5824.1271249824.001765664.371436898.9680.127OpenBenchmarking.org

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1323691215SE +/- 0.13, N = 3SE +/- 0.09, N = 3SE +/- 1.35, N = 1213.213.011.01. (CC) gcc options: -fopenmp -O3 -lm

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512231300K600K900K1200K1500KSE +/- 562.33, N = 3133011813289931328993

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool231110K220K330K440K550KSE +/- 541.00, N = 3SE +/- 720.67, N = 3SE +/- 2447.10, N = 3532814532634530142

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption321400800120016002000SE +/- 2.43, N = 3SE +/- 3.81, N = 3SE +/- 6.83, N = 31714.41708.51702.3

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption321400800120016002000SE +/- 1.95, N = 3SE +/- 1.84, N = 3SE +/- 8.30, N = 31710.61702.61697.2

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption321120240360480600SE +/- 0.07, N = 3SE +/- 1.50, N = 3SE +/- 2.47, N = 3551.4550.3549.0

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption312120240360480600SE +/- 0.81, N = 3SE +/- 0.78, N = 3SE +/- 1.18, N = 3533.5532.9531.5

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption32180160240320400SE +/- 0.07, N = 3SE +/- 0.34, N = 3SE +/- 1.43, N = 3346.2345.4344.4

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption31280160240320400SE +/- 0.15, N = 3SE +/- 0.24, N = 3SE +/- 0.91, N = 3346.9346.8345.8

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption32130060090012001500SE +/- 0.72, N = 3SE +/- 3.43, N = 3SE +/- 1.19, N = 31404.21401.71400.2

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption32130060090012001500SE +/- 2.16, N = 3SE +/- 2.11, N = 3SE +/- 1.87, N = 31394.31393.01389.6

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption321120240360480600SE +/- 0.26, N = 3SE +/- 0.57, N = 3SE +/- 0.60, N = 2551.7551.2550.7

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption123120240360480600SE +/- 0.52, N = 3SE +/- 0.39, N = 3533.7533.3532.6

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption23180160240320400SE +/- 0.17, N = 3SE +/- 0.10, N = 3345.7345.6345.6

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption21380160240320400SE +/- 0.20, N = 3SE +/- 0.25, N = 3SE +/- 1.02, N = 3346.8346.4345.7

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression321816243240SE +/- 0.33, N = 33434341. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression321204060801009696961. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression21390180270360450SE +/- 0.58, N = 3SE +/- 1.00, N = 34014014001. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression12330060090012001500SE +/- 2.08, N = 3SE +/- 3.06, N = 3SE +/- 2.73, N = 31390138913861. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression3211530456075SE +/- 0.58, N = 36565651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression12330060090012001500SE +/- 4.06, N = 3SE +/- 7.88, N = 3SE +/- 21.06, N = 31423141313961. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression321204060801007676761. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression231901802703604504324314311. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression32180160240320400SE +/- 0.33, N = 3SE +/- 0.33, N = 33573563561. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression231110220330440550SE +/- 0.67, N = 3SE +/- 3.00, N = 35065055031. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression3213060901201501481481481. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression231130260390520650SE +/- 0.33, N = 3SE +/- 2.73, N = 3SE +/- 5.17, N = 35855825811. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression32140801201602001821821821. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Decompression3122004006008001000SE +/- 0.33, N = 39959959941. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode2133691215SE +/- 0.04, N = 5SE +/- 0.03, N = 5SE +/- 0.03, N = 510.6210.6910.801. (CXX) g++ options: -fvisibility=hidden -logg -lm

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack23148121620SE +/- 0.02, N = 5SE +/- 0.03, N = 5SE +/- 0.09, N = 517.3217.3317.401. (CXX) g++ options: -rdynamic

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT13122004006008001000SE +/- 2.11, N = 3SE +/- 2.71, N = 3SE +/- 0.37, N = 31084.331083.861080.551. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC132150100150200250SE +/- 0.05, N = 3SE +/- 0.55, N = 3SE +/- 0.79, N = 3236.45235.97235.541. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2123306090120150SE +/- 0.03, N = 3SE +/- 0.38, N = 3SE +/- 0.53, N = 3140.73140.32140.211. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering23150100150200250SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.40, N = 3227.14227.06226.751. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default2131.29512.59023.88535.18046.4755SE +/- 0.018, N = 3SE +/- 0.037, N = 3SE +/- 0.043, N = 35.6645.6775.7561. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 723170140210280350SE +/- 0.19, N = 3SE +/- 0.52, N = 3SE +/- 0.36, N = 3298.25298.54298.921. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7321120240360480600SE +/- 0.96, N = 3SE +/- 0.47, N = 3SE +/- 1.36, N = 3544.66545.66545.961. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 532148121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 314.4814.4814.481. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression2132004006008001000SE +/- 0.53, N = 3SE +/- 0.20, N = 3SE +/- 0.05, N = 3938.92939.76940.231. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100123120240360480600SE +/- 0.84, N = 3SE +/- 0.57, N = 3SE +/- 0.80, N = 3551.18550.14549.941. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.923160120180240300SE +/- 0.33, N = 3SE +/- 0.41, N = 3SE +/- 0.82, N = 3272.41273.01273.551. (CC) gcc options: -O2 -fvisibility=hidden

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21321400800120016002000SE +/- 4.94, N = 3SE +/- 2.43, N = 3SE +/- 3.77, N = 31691.61691.51689.61. (CXX) g++ options: -O3 -march=native -rdynamic

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics132306090120150SE +/- 0.22, N = 3SE +/- 0.33, N = 3SE +/- 0.54, N = 3136.89140.57140.641. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.0321246810SE +/- 0.021, N = 3SE +/- 0.029, N = 3SE +/- 0.011, N = 38.4248.4398.447MIN: 8.33 / MAX: 9.34MIN: 8.3 / MAX: 52.09MIN: 8.35 / MAX: 9.561. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-502311224364860SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 353.9754.1254.51MIN: 53.45 / MAX: 128.98MIN: 53.84 / MAX: 125.64MIN: 53.53 / MAX: 130.531. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_2241321.15792.31583.47374.63165.7895SE +/- 0.016, N = 3SE +/- 0.037, N = 3SE +/- 0.008, N = 35.0615.1005.146MIN: 4.98 / MAX: 5.99MIN: 4.42 / MAX: 11.39MIN: 5.03 / MAX: 6.031. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.0231246810SE +/- 0.007, N = 3SE +/- 0.018, N = 3SE +/- 0.006, N = 36.1126.1516.196MIN: 6.03 / MAX: 12.61MIN: 6.05 / MAX: 6.9MIN: 4.66 / MAX: 19.351. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v31231326395265SE +/- 0.19, N = 3SE +/- 0.21, N = 3SE +/- 0.26, N = 355.8755.9255.96MIN: 55.38 / MAX: 87.21MIN: 55.44 / MAX: 148.38MIN: 55.41 / MAX: 122.631. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU21370140210280350SE +/- 0.60, N = 3SE +/- 1.01, N = 3SE +/- 0.44, N = 33283283271. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU132120240360480600SE +/- 0.87, N = 3SE +/- 0.83, N = 3SE +/- 1.36, N = 35455425411. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU3211326395265SE +/- 0.17, N = 3SE +/- 0.00, N = 35959591. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU2132K4K6K8K10KSE +/- 16.03, N = 3SE +/- 9.85, N = 3SE +/- 28.57, N = 39775973697291. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU2139001800270036004500SE +/- 5.93, N = 3SE +/- 9.80, N = 3SE +/- 8.12, N = 34008400740051. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v223170140210280350SE +/- 0.24, N = 3SE +/- 0.90, N = 3SE +/- 1.18, N = 3337.51338.74339.21MIN: 336.44 / MAX: 346.96MIN: 336.55 / MAX: 341.55MIN: 336.01 / MAX: 348.171. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.132170140210280350SE +/- 0.61, N = 3SE +/- 0.16, N = 3SE +/- 0.32, N = 3327.83328.63329.72MIN: 326.73 / MAX: 330.31MIN: 328.14 / MAX: 330.14MIN: 329.09 / MAX: 331.081. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet132510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 320.0320.0620.21MIN: 19.87 / MAX: 21.34MIN: 19.95 / MAX: 21.79MIN: 19.81 / MAX: 21.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 36.226.226.31MIN: 6.14 / MAX: 6.38MIN: 6.12 / MAX: 6.83MIN: 6.11 / MAX: 64.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31231.22182.44363.66544.88726.109SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 35.415.415.43MIN: 5.29 / MAX: 5.77MIN: 5.29 / MAX: 5.54MIN: 5.28 / MAX: 6.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2312246810SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 37.037.057.07MIN: 6.94 / MAX: 7.65MIN: 6.99 / MAX: 7.63MIN: 6.95 / MAX: 7.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1231.2512.5023.7535.0046.255SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 35.525.555.56MIN: 5.41 / MAX: 5.78MIN: 5.4 / MAX: 5.65MIN: 5.41 / MAX: 5.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0123246810SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.288.308.33MIN: 8.18 / MAX: 8.87MIN: 8.14 / MAX: 8.79MIN: 8.16 / MAX: 9.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface3120.62781.25561.88342.51123.139SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 32.752.762.79MIN: 2.69 / MAX: 3.11MIN: 2.7 / MAX: 3.13MIN: 2.69 / MAX: 3.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet12348121620SE +/- 0.14, N = 3SE +/- 0.20, N = 3SE +/- 0.31, N = 316.6016.8416.95MIN: 16.2 / MAX: 74.59MIN: 16.12 / MAX: 18.21MIN: 16.16 / MAX: 17.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg161321224364860SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 351.7851.8551.97MIN: 51.37 / MAX: 99.9MIN: 51.49 / MAX: 53.93MIN: 51.54 / MAX: 54.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet1831248121620SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.15, N = 315.1915.2015.54MIN: 14.96 / MAX: 16.48MIN: 15.01 / MAX: 15.36MIN: 15 / MAX: 110.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet1323691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 312.0812.1112.12MIN: 11.98 / MAX: 12.56MIN: 12.03 / MAX: 12.39MIN: 12.02 / MAX: 12.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50213714212835SE +/- 0.10, N = 3SE +/- 0.25, N = 3SE +/- 0.09, N = 328.8629.0629.11MIN: 28.14 / MAX: 29.56MIN: 28.13 / MAX: 134.5MIN: 28.24 / MAX: 30.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny123816243240SE +/- 0.35, N = 3SE +/- 0.60, N = 3SE +/- 0.34, N = 331.4331.9632.73MIN: 30.14 / MAX: 34.58MIN: 30.06 / MAX: 35.42MIN: 31.73 / MAX: 35.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123612182430SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 326.5526.5726.59MIN: 26.01 / MAX: 27.38MIN: 26.04 / MAX: 27.24MIN: 26.05 / MAX: 27.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m321612182430SE +/- 0.06, N = 3SE +/- 0.13, N = 3SE +/- 0.03, N = 323.7523.9324.08MIN: 23.54 / MAX: 24.43MIN: 23.59 / MAX: 24.31MIN: 23.89 / MAX: 25.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein3121.00422.00843.01264.01685.021SE +/- 0.121, N = 15SE +/- 0.131, N = 15SE +/- 0.017, N = 34.4634.4154.2611. (CXX) g++ options: -O3 -pthread -lm

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C1322004006008001000SE +/- 16.48, N = 3SE +/- 4.68, N = 3SE +/- 11.00, N = 151050.171047.671002.011. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D1322004006008001000SE +/- 14.12, N = 4SE +/- 17.23, N = 3SE +/- 9.01, N = 121027.881019.371008.911. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU2130.9181.8362.7543.6724.59SE +/- 0.01870, N = 3SE +/- 0.00717, N = 3SE +/- 0.02757, N = 34.074514.074764.07980MIN: 3.98MIN: 4MIN: 3.991. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU213246810SE +/- 0.00084, N = 3SE +/- 0.02331, N = 3SE +/- 0.03820, N = 37.390277.416917.43079MIN: 7.35MIN: 7.36MIN: 7.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1320.63871.27741.91612.55483.1935SE +/- 0.00327, N = 3SE +/- 0.00573, N = 3SE +/- 0.00201, N = 32.830742.833732.83861MIN: 2.8MIN: 2.79MIN: 2.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU2310.56021.12041.68062.24082.801SE +/- 0.00118, N = 3SE +/- 0.00042, N = 3SE +/- 0.00663, N = 32.479302.481252.48994MIN: 2.45MIN: 2.45MIN: 2.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU31248121620SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 313.7813.7913.80MIN: 13.71MIN: 13.69MIN: 13.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1321.30012.60023.90035.20046.5005SE +/- 0.01646, N = 3SE +/- 0.00979, N = 3SE +/- 0.00194, N = 35.765365.768905.77823MIN: 5.7MIN: 5.71MIN: 5.741. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU312246810SE +/- 0.00234, N = 3SE +/- 0.00368, N = 3SE +/- 0.00711, N = 38.382268.393258.40124MIN: 8.31MIN: 8.35MIN: 8.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU2133691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 312.6912.7112.72MIN: 12.63MIN: 12.64MIN: 12.641. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU213246810SE +/- 0.07423, N = 6SE +/- 0.08426, N = 4SE +/- 0.09722, N = 36.519376.660116.68411MIN: 6.3MIN: 6.33MIN: 6.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3121.14242.28483.42724.56965.712SE +/- 0.00135, N = 3SE +/- 0.00123, N = 3SE +/- 0.00166, N = 35.070055.075645.07714MIN: 5.04MIN: 5.05MIN: 5.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1239001800270036004500SE +/- 5.71, N = 3SE +/- 6.88, N = 3SE +/- 5.47, N = 34048.954050.544051.42MIN: 4036MIN: 4036.39MIN: 4040.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1235001000150020002500SE +/- 1.96, N = 3SE +/- 4.53, N = 3SE +/- 8.06, N = 32209.392214.442221.93MIN: 2203.59MIN: 2206.84MIN: 2205.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU3219001800270036004500SE +/- 3.06, N = 3SE +/- 1.68, N = 3SE +/- 2.55, N = 34040.614044.154045.79MIN: 4031.1MIN: 4035.82MIN: 4037.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1235001000150020002500SE +/- 0.62, N = 3SE +/- 1.77, N = 3SE +/- 9.79, N = 32208.252211.702221.13MIN: 2204.85MIN: 2206.98MIN: 2207.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU2130.7581.5162.2743.0323.79SE +/- 0.00051, N = 3SE +/- 0.01155, N = 3SE +/- 0.01022, N = 33.359473.368503.36883MIN: 3.3MIN: 3.3MIN: 3.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU2319001800270036004500SE +/- 1.11, N = 3SE +/- 6.05, N = 3SE +/- 1.90, N = 34040.594048.114051.99MIN: 4035.29MIN: 4033MIN: 4039.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU3215001000150020002500SE +/- 2.02, N = 3SE +/- 0.63, N = 3SE +/- 5.90, N = 32210.612211.322211.56MIN: 2204.44MIN: 2206.12MIN: 2201.951. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU2310.67591.35182.02772.70363.3795SE +/- 0.01863, N = 3SE +/- 0.00263, N = 3SE +/- 0.00849, N = 32.990802.994073.00389MIN: 2.83MIN: 2.94MIN: 2.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding12330060090012001500SE +/- 0.36, N = 3SE +/- 0.31, N = 3SE +/- 0.57, N = 31285.021284.711280.901. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding123400800120016002000SE +/- 0.84, N = 3SE +/- 0.77, N = 3SE +/- 0.99, N = 31756.701753.611745.941. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding132400800120016002000SE +/- 18.20, N = 3SE +/- 29.00, N = 12SE +/- 28.57, N = 151699.851597.841589.751. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding123400800120016002000SE +/- 65.46, N = 3SE +/- 45.52, N = 15SE +/- 47.93, N = 122071.461984.731979.001. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding321400800120016002000SE +/- 12.81, N = 3SE +/- 31.45, N = 15SE +/- 39.18, N = 151962.741881.941778.101. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding3216001200180024003000SE +/- 0.00, N = 3SE +/- 0.00, N = 15SE +/- 1.79, N = 152716.902716.902664.351. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP32150100150200250SE +/- 0.19, N = 3SE +/- 0.19, N = 3SE +/- 0.33, N = 3239.62239.43239.241. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.232170M140M210M280M350MSE +/- 980611.02, N = 3SE +/- 2849594.82, N = 3SE +/- 2829162.69, N = 33051336333036123673028911331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.432110M20M30M40M50MSE +/- 186245.67, N = 3SE +/- 199447.85, N = 3SE +/- 200225.59, N = 34684042746529853461044201. (CXX) g++ options: -O3 -fopenmp

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.312310002000300040005000SE +/- 37.60, N = 3SE +/- 60.83, N = 3SE +/- 60.75, N = 34654.474644.464621.631. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M12350100150200250SE +/- 0.54, N = 3SE +/- 0.61, N = 3SE +/- 1.02, N = 3219.05219.10219.571. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O1231224364860SE +/- 0.64, N = 5SE +/- 0.70, N = 5SE +/- 0.74, N = 351.1551.9051.911. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data132300600900120015001543.681546.681576.95

perf-bench

Benchmark: Sched Pipe

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched Pipe31216K32K48K64K80KSE +/- 629.67, N = 3SE +/- 327.32, N = 3SE +/- 929.56, N = 57443073755736951. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

Cpuminer-Opt

Algorithm: Magi

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Magi32150100150200250SE +/- 2.29, N = 14SE +/- 0.17, N = 3SE +/- 0.12, N = 3206.24204.18203.981. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: x25x

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: x25x32150100150200250SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 1.24, N = 3220.82220.73219.581. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Deepcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Deepcoin13216003200480064008000SE +/- 132.89, N = 15SE +/- 33.20, N = 3SE +/- 164.38, N = 157487.987312.247013.871. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Ringcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Ringcoin32130060090012001500SE +/- 11.74, N = 14SE +/- 10.01, N = 3SE +/- 3.43, N = 31590.601582.491579.481. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Blake-2 S32150K100K150K200K250KSE +/- 3030.09, N = 4SE +/- 2626.85, N = 15SE +/- 3265.93, N = 42260732219382205081. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Garlicoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Garlicoin231400800120016002000SE +/- 6.73, N = 3SE +/- 4.72, N = 3SE +/- 7.48, N = 31750.881738.451736.891. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Skeincoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Skeincoin3128K16K24K32K40KSE +/- 140.75, N = 3SE +/- 291.68, N = 15SE +/- 325.79, N = 153570734847343381. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Myriad-Groestl2312K4K6K8K10KSE +/- 161.69, N = 3SE +/- 125.83, N = 3SE +/- 6.67, N = 31056710510103631. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: LBC, LBRY Credits3215K10K15K20K25KSE +/- 96.84, N = 3SE +/- 86.86, N = 3SE +/- 113.58, N = 32346323407233301. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, Pyrite23110K20K30K40K50KSE +/- 748.75, N = 15SE +/- 1271.64, N = 15SE +/- 749.50, N = 34862248341476531. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Triple SHA-256, Onecoin31213K26K39K52K65KSE +/- 80.07, N = 3SE +/- 902.62, N = 15SE +/- 1076.82, N = 156292361247603501. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p213100200300400500SE +/- 0.88, N = 3SE +/- 0.72, N = 3SE +/- 2.60, N = 3459.32459.11458.06MIN: 342.72 / MAX: 583.61MIN: 341.89 / MAX: 588.95MIN: 341.78 / MAX: 587.391. (CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K123306090120150SE +/- 0.27, N = 3SE +/- 0.53, N = 3SE +/- 0.14, N = 3142.04140.67140.47MIN: 130.3 / MAX: 163.29MIN: 125.57 / MAX: 161.64MIN: 130.58 / MAX: 160.851. (CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p31280160240320400SE +/- 1.19, N = 3SE +/- 3.01, N = 3SE +/- 3.16, N = 3385.99382.21380.86MIN: 312.93 / MAX: 423.3MIN: 305.4 / MAX: 420.06MIN: 302.45 / MAX: 418.31. (CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit2131530456075SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.08, N = 368.6268.4468.34MIN: 44.12 / MAX: 171.68MIN: 43.96 / MAX: 169.86MIN: 44.14 / MAX: 168.931. (CC) gcc options: -pthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 12130.06230.12460.18690.24920.3115SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.2770.2770.276

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 52130.18230.36460.54690.72920.9115SE +/- 0.005, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 30.8100.8100.808

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 62310.24080.48160.72240.96321.204SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.008, N = 31.0701.0681.065

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 103120.53441.06881.60322.13762.672SE +/- 0.013, N = 3SE +/- 0.011, N = 3SE +/- 0.026, N = 32.3752.3752.361

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile3124080120160200SE +/- 0.38, N = 3SE +/- 0.10, N = 3SE +/- 0.26, N = 3186.56186.70186.84

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile1324080120160200SE +/- 0.55, N = 3SE +/- 0.91, N = 3SE +/- 0.85, N = 3159.69160.09160.67

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile13220406080100SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.09, N = 3109.86111.07111.13

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP13215K30K45K60K75KSE +/- 89.04, N = 3SE +/- 922.31, N = 4SE +/- 1002.67, N = 368017.6569374.8769828.361. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP21320K40K60K80K100KSE +/- 20.05, N = 3SE +/- 1477.66, N = 5SE +/- 1452.56, N = 12114077.57115661.13116695.381. (CXX) g++ options: -O3 -march=native -fopenmp

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP132400K800K1200K1600K2000KSE +/- 6269.54, N = 3SE +/- 91181.44, N = 12SE +/- 4394.55, N = 32010121.551758555.231271416.381. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD312300K600K900K1200K1500KSE +/- 7906.68, N = 3SE +/- 21115.79, N = 4SE +/- 10942.24, N = 31605411.581602736.881588662.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz213612182430SE +/- 0.07, N = 4SE +/- 0.17, N = 4SE +/- 0.10, N = 423.8924.0424.13

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH312300K600K900K1200K1500KSE +/- 2768.29, N = 3SE +/- 3115.55, N = 3SE +/- 9099.06, N = 31249824.001247740.001242294.791. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123400K800K1200K1600K2000KSE +/- 6745.04, N = 3SE +/- 25400.58, N = 3SE +/- 20496.02, N = 31887226.871775802.501765664.371. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET312300K600K900K1200K1500KSE +/- 9026.88, N = 3SE +/- 14514.92, N = 8SE +/- 19369.44, N = 151436898.961415197.081408307.621. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption32120406080100SE +/- 0.07, N = 3SE +/- 0.16, N = 3SE +/- 1.03, N = 380.1380.2281.081. (CC) gcc options: -O2


Phoronix Test Suite v10.8.4