E3-1260L 2021

Intel Xeon E3-1260L v5 testing with a ASRock E3V5 WS (P7.10 BIOS) and XFX NVIDIA GeForce GT 220 on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102050-HA-E31260L2045&grr.

E3-1260L 2021ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen Resolution1234Intel Xeon E3-1260L v5 @ 3.90GHz (4 Cores / 8 Threads)ASRock E3V5 WS (P7.10 BIOS)Intel Xeon E3-1200 v5/E3-15008GB120GB INTEL SSDSC2BW12XFX NVIDIA GeForce GT 220Realtek ALC892LG Ultra HDIntel I219-LMUbuntu 20.105.8.0-33-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.8xfxGCC 10.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 - Thermald 2.3Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

E3-1260L 2021webp2: Quality 100, Lossless Compressionwebp2: Quality 95, Compression Effort 7astcenc: Exhaustivewebp2: Quality 75, Compression Effort 7npb: EP.Dopenfoam: Motorbike 30Mbuild2: Time To Compilebuild-godot: Time To Compilegcrypt: npb: LU.Cnpb: BT.Cavifenc: 0dav1d: Chimera 1080p 10-bitpennant: sedovbigmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: resnet-v2-50mnn: SqueezeNetV1.0kripke: onnx: fcn-resnet101-11 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: yolov4 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUavifenc: 2onednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUpennant: leblancbigncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetbuild-eigen: Time To Compileonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUastcenc: Thoroughnpb: SP.Bgnupg: 2.7GB Sample File Encryptioncompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedquantlib: compress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speednpb: FT.Crav1e: 5rav1e: 1dav1d: Summer Nature 4Klzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressionrav1e: 6npb: CG.Cdav1d: Chimera 1080pwebp2: Quality 100, Compression Effort 5qmcpack: simple-H2Ocompress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedcryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: AES-XTS 256b Encryptioncryptsetup: PBKDF2-whirlpoolcryptsetup: PBKDF2-sha512rav1e: 10unpack-firefox: firefox-84.0.source.tar.xzsynthmark: VoiceMark_100redis: LPUSHencode-wavpack: WAV To WavPackcoremark: CoreMark Size 666 - Iterations Per Secondtnn: CPU - MobileNet v2npb: EP.Cnpb: MG.Ctnn: CPU - SqueezeNet v1.1lzbench: Brotli 2 - Decompressionlzbench: Brotli 2 - Compressionlzbench: Brotli 0 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Zstd 8 - Decompressionlzbench: Zstd 8 - Compressionlzbench: Crush 0 - Decompressionlzbench: Crush 0 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Zstd 1 - Compressionencode-ape: WAV To APEonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUredis: LPOPdav1d: Summer Nature 1080pastcenc: Mediumredis: SADDencode-opus: WAV To Opus Encoderedis: SETlzbench: Libdeflate 1 - Compressiononednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUredis: GETamg: onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUwebp2: Defaultonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUastcenc: Fastavifenc: 8avifenc: 10lammps: Rhodopsin Proteinonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUlulesh: ffte: N=256, 3D Complex FFT Routineonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPU12342398.2731178.693643.45639.610504.27386.71363.909363.213233.10014764.5013485.45203.25065.10172.986260.4325.0714.38948.8897.8112483486333312207141042231119.2018915.858927.638917.52105.063917.5631.3040.9546.6219.5121.8889.2221.752.7010.916.749.786.547.9930.4489.8884632.024631.894640.8380.274409.8273.3017847.444.522181.87869.745.786871.191.0380.36278.94105381.3773438.09299.5437.43734.1897915.26691.28393.0390.8719.6705.01835.91834.6394.1390.1720.0704.82051.02043.466590315799792.80723.668603.2531507373.5017.005137084.980366363.114514.127148.35345.55564816955842717498344790160545313.26812.596912.49742447471.58275.7112.301917948.349.7891717799.752089.449454.286592281791.251806211005.388947.4151310.5333.0679711.36399.428.4747.8142.61521.384122.60471172.900219035.0914023988.2505716.41332400.6631180.019643.88642.371503.55387.68367.309363.916232.51114562.2213518.04203.84665.04173.692259.9995.0704.36248.8897.7892491550733311207141032224120.2818919.798933.088925.45105.639917.5231.2340.9646.6619.5321.9088.8621.752.7010.916.739.806.547.9930.4590.2594628.674628.544630.4780.324387.5973.1067844.044.422168.37834.345.746902.751.0500.36279.53105381.3733463.42302.2537.44334.8147837.56682.99394.4390.5719.4705.71838.11838.3394.5390.9720.1705.82055.02049.566590415760542.86523.718603.1321486920.0716.938136732.151599362.724507.427153.82345.25164817055842717508444790160045113.18612.474912.57191501169.13277.6712.311934230.879.7641721937.792099.396974.284982189234.251805828335.377207.4272210.5082.9442811.01879.538.5127.8672.63821.300022.42761169.997818773.2668944958.2296116.2971502.03388.4514704.4413518.01173.68688939.188945.608944.35105.46404635.334645.084643.814391.747852.544.612167.07840.245.706986.40105383448.6234.3957870.76668.81513.267142.1464817055842717498344790159345212.478512.44422099.432664.291281805769005.436837.444492.9660711.80982.62921.354022.41941165.744419060.1889577238.2810916.48822397.7491177.828643.57641.285501.69388.02367.587364.058232.47714707.6813540.52203.68065.31173.527159.4945.0474.33048.3657.7712492555333311208141172229119.6238931.438936.318938.37105.573617.5331.2440.8646.6819.5321.9088.8421.762.6310.986.799.766.537.9830.3690.1874639.214632.414640.3480.224391.9573.1247843.844.402165.57851.045.646982.311.0490.36179.46105381.3893472.03301.0437.41934.7527867.16669.47394.5390.8718.8705.61833.41832.9394.4390.8719.0705.72047.62042.366534015815622.87023.486602.7461513254.5516.933137374.605600363.978510.537142.46345.33864617055742817618344792160445313.22412.747612.46931465596.16275.3412.281942305.799.9091721260.042089.414274.280452165661.081807050335.424047.3894410.5622.9646311.46329.538.5217.8072.62221.324622.43501166.668319047.0219170178.2066716.3802OpenBenchmarking.org

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression1245001000150020002500SE +/- 1.91, N = 3SE +/- 0.23, N = 3SE +/- 0.76, N = 32398.272400.662397.751. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 712430060090012001500SE +/- 0.76, N = 3SE +/- 0.85, N = 3SE +/- 1.46, N = 31178.691180.021177.831. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive124140280420560700SE +/- 0.13, N = 3SE +/- 0.27, N = 3SE +/- 0.29, N = 3643.45643.88643.571. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7124140280420560700SE +/- 0.26, N = 3SE +/- 0.75, N = 3SE +/- 1.61, N = 3639.61642.37641.291. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D1234110220330440550SE +/- 6.77, N = 4SE +/- 5.28, N = 9SE +/- 8.69, N = 3SE +/- 5.41, N = 7504.27503.55502.03501.691. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M123480160240320400SE +/- 0.17, N = 3SE +/- 0.21, N = 3SE +/- 0.53, N = 3SE +/- 0.33, N = 3386.71387.68388.45388.021. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lspecie -lfiniteVolume -lfvOptions -lgenericPatchFields -lmeshTools -lsampling -lOpenFOAM -ldl -lm

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12480160240320400SE +/- 0.47, N = 3SE +/- 4.00, N = 3SE +/- 4.15, N = 3363.91367.31367.59

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile12480160240320400SE +/- 0.35, N = 3SE +/- 0.23, N = 3SE +/- 0.06, N = 3363.21363.92364.06

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.912450100150200250SE +/- 0.59, N = 3SE +/- 0.86, N = 3SE +/- 0.63, N = 3233.10232.51232.481. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C12343K6K9K12K15KSE +/- 14.35, N = 3SE +/- 134.72, N = 10SE +/- 10.10, N = 3SE +/- 15.78, N = 314764.5014562.2214704.4414707.681. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C12343K6K9K12K15KSE +/- 8.84, N = 3SE +/- 19.55, N = 3SE +/- 6.18, N = 3SE +/- 5.42, N = 313485.4513518.0413518.0113540.521. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 01244080120160200SE +/- 0.71, N = 3SE +/- 0.20, N = 3SE +/- 0.48, N = 3203.25203.85203.681. (CXX) g++ options: -O3 -fPIC

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit1241530456075SE +/- 0.57, N = 3SE +/- 0.58, N = 3SE +/- 0.39, N = 365.1065.0465.31MIN: 44.13 / MAX: 149.8MIN: 44.01 / MAX: 149.22MIN: 44.34 / MAX: 148.991. (CC) gcc options: -pthread

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig12344080120160200SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.21, N = 3SE +/- 0.17, N = 3172.99173.69173.69173.531. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v31241428425670SE +/- 0.13, N = 3SE +/- 0.21, N = 3SE +/- 0.97, N = 360.4360.0059.49MIN: 59.98 / MAX: 66.21MIN: 59.43 / MAX: 64.63MIN: 57.28 / MAX: 89.731. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.01241.1412.2823.4234.5645.705SE +/- 0.015, N = 3SE +/- 0.018, N = 3SE +/- 0.029, N = 35.0715.0705.047MIN: 4.59 / MAX: 6.1MIN: 4.58 / MAX: 5.75MIN: 4.56 / MAX: 5.571. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_2241240.98751.9752.96253.954.9375SE +/- 0.010, N = 3SE +/- 0.025, N = 3SE +/- 0.049, N = 34.3894.3624.330MIN: 4.29 / MAX: 6.77MIN: 4.01 / MAX: 4.84MIN: 4.15 / MAX: 4.531. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-501241122334455SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.75, N = 348.8948.8948.37MIN: 48.41 / MAX: 50.57MIN: 48.4 / MAX: 141.21MIN: 46.64 / MAX: 52.551. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.0124246810SE +/- 0.010, N = 3SE +/- 0.025, N = 3SE +/- 0.076, N = 37.8117.7897.771MIN: 7.14 / MAX: 13.45MIN: 7.12 / MAX: 8.86MIN: 7.02 / MAX: 8.971. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.41245M10M15M20M25MSE +/- 307740.69, N = 3SE +/- 267442.30, N = 3SE +/- 248353.03, N = 32483486324915507249255531. (CXX) g++ options: -O3 -fopenmp

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU124816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33333331. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU12470140210280350SE +/- 0.29, N = 3SE +/- 0.44, N = 3SE +/- 0.50, N = 33123113111. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU12450100150200250SE +/- 0.17, N = 3SE +/- 0.29, N = 3SE +/- 0.29, N = 32072072081. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU1243K6K9K12K15KSE +/- 43.67, N = 3SE +/- 12.85, N = 3SE +/- 40.07, N = 31410414103141171. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU1245001000150020002500SE +/- 4.48, N = 3SE +/- 2.17, N = 3SE +/- 1.48, N = 32231222422291. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 2124306090120150SE +/- 0.12, N = 3SE +/- 0.44, N = 3SE +/- 0.11, N = 3119.20120.28119.621. (CXX) g++ options: -O3 -fPIC

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12342K4K6K8K10KSE +/- 2.73, N = 3SE +/- 2.10, N = 3SE +/- 4.02, N = 3SE +/- 2.31, N = 38915.858919.798939.188931.43MIN: 8864.25MIN: 8896.59MIN: 8893.94MIN: 8883.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU12342K4K6K8K10KSE +/- 6.77, N = 3SE +/- 3.96, N = 3SE +/- 2.35, N = 3SE +/- 1.00, N = 38927.638933.088945.608936.31MIN: 8896MIN: 8916.93MIN: 8926.38MIN: 8921.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12342K4K6K8K10KSE +/- 5.71, N = 3SE +/- 2.41, N = 3SE +/- 2.21, N = 3SE +/- 1.31, N = 38917.528925.458944.358938.37MIN: 8892.85MIN: 8909.78MIN: 8929.69MIN: 8921.681. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig123420406080100SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3105.06105.64105.46105.571. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m12448121620SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 317.5617.5217.53MIN: 17.44 / MAX: 18.56MIN: 17.39 / MAX: 18.59MIN: 17.46 / MAX: 18.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd124714212835SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 331.3031.2331.24MIN: 31.14 / MAX: 32.35MIN: 31.12 / MAX: 32.11MIN: 31.14 / MAX: 32.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny124918273645SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 340.9540.9640.86MIN: 40.63 / MAX: 41.77MIN: 40.51 / MAX: 41.85MIN: 40.54 / MAX: 41.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501241122334455SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 346.6246.6646.68MIN: 45.3 / MAX: 48.43MIN: 45.36 / MAX: 47.7MIN: 45.41 / MAX: 48.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet124510152025SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 319.5119.5319.53MIN: 19.38 / MAX: 20.06MIN: 19.41 / MAX: 20.29MIN: 19.4 / MAX: 20.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18124510152025SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 321.8821.9021.90MIN: 21.61 / MAX: 22.98MIN: 21.62 / MAX: 22.75MIN: 21.58 / MAX: 23.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1612420406080100SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 389.2288.8688.84MIN: 88.84 / MAX: 148.87MIN: 88.59 / MAX: 91.44MIN: 88.59 / MAX: 89.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet124510152025SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 321.7521.7521.76MIN: 21.16 / MAX: 22.66MIN: 21.14 / MAX: 22.58MIN: 21.15 / MAX: 22.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1240.60751.2151.82252.433.0375SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.00, N = 32.702.702.63MIN: 2.6 / MAX: 11.85MIN: 2.58 / MAX: 11.85MIN: 2.6 / MAX: 3.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01243691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.09, N = 310.9110.9110.98MIN: 10.85 / MAX: 11.97MIN: 10.67 / MAX: 11.08MIN: 10.67 / MAX: 59.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet124246810SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 36.746.736.79MIN: 6.58 / MAX: 7.21MIN: 6.56 / MAX: 7.06MIN: 6.72 / MAX: 6.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21243691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 39.789.809.76MIN: 9.42 / MAX: 10.68MIN: 9.44 / MAX: 10.88MIN: 9.43 / MAX: 10.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3124246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.546.546.53MIN: 6.44 / MAX: 6.79MIN: 6.45 / MAX: 6.8MIN: 6.42 / MAX: 9.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2124246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 37.997.997.98MIN: 7.82 / MAX: 8.88MIN: 7.82 / MAX: 8.7MIN: 7.83 / MAX: 8.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet124714212835SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 330.4430.4530.36MIN: 29.84 / MAX: 31.45MIN: 29.81 / MAX: 39.56MIN: 29.76 / MAX: 31.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile12420406080100SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.08, N = 389.8990.2690.19

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU123410002000300040005000SE +/- 3.08, N = 3SE +/- 3.81, N = 3SE +/- 1.60, N = 3SE +/- 2.68, N = 34632.024628.674635.334639.21MIN: 4614.43MIN: 4609.9MIN: 4621.25MIN: 4621.741. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU123410002000300040005000SE +/- 4.24, N = 3SE +/- 4.30, N = 3SE +/- 2.60, N = 3SE +/- 3.31, N = 34631.894628.544645.084632.41MIN: 4615.88MIN: 4606.88MIN: 4629.78MIN: 4614.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU123410002000300040005000SE +/- 7.28, N = 3SE +/- 5.57, N = 3SE +/- 5.16, N = 3SE +/- 3.98, N = 34640.834630.474643.814640.34MIN: 4620.66MIN: 4609.21MIN: 4625.36MIN: 4617.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough12420406080100SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 380.2780.3280.221. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B12349001800270036004500SE +/- 0.56, N = 3SE +/- 0.54, N = 3SE +/- 1.21, N = 3SE +/- 2.68, N = 34409.824387.594391.744391.951. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption1241632486480SE +/- 0.81, N = 3SE +/- 0.55, N = 3SE +/- 0.59, N = 373.3073.1173.121. (CC) gcc options: -O2

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed12342K4K6K8K10KSE +/- 12.26, N = 3SE +/- 2.80, N = 3SE +/- 9.72, N = 3SE +/- 7.13, N = 37847.47844.07852.57843.81. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed12341020304050SE +/- 0.09, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.13, N = 344.5244.4244.6144.401. (CC) gcc options: -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.2112345001000150020002500SE +/- 6.62, N = 3SE +/- 27.08, N = 5SE +/- 19.28, N = 12SE +/- 27.91, N = 52181.82168.32167.02165.51. (CXX) g++ options: -O3 -march=native -rdynamic

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed12342K4K6K8K10KSE +/- 11.01, N = 3SE +/- 9.12, N = 3SE +/- 6.77, N = 3SE +/- 14.70, N = 37869.77834.37840.27851.01. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed12341020304050SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 345.7845.7445.7045.641. (CC) gcc options: -O3

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C123415003000450060007500SE +/- 72.82, N = 3SE +/- 15.03, N = 3SE +/- 12.69, N = 3SE +/- 15.55, N = 36871.196902.756986.406982.311. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 51240.23630.47260.70890.94521.1815SE +/- 0.007, N = 3SE +/- 0.002, N = 3SE +/- 0.017, N = 31.0381.0501.049

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 11240.08150.1630.24450.3260.4075SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.3620.3620.361

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K12420406080100SE +/- 0.77, N = 3SE +/- 0.27, N = 3SE +/- 0.42, N = 378.9479.5379.46MIN: 71.73 / MAX: 96.06MIN: 72.51 / MAX: 96.2MIN: 72.52 / MAX: 95.621. (CC) gcc options: -pthread

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression1234204060801001051051051051. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression1234918273645383838381. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 61240.31250.6250.93751.251.5625SE +/- 0.006, N = 3SE +/- 0.008, N = 3SE +/- 0.010, N = 31.3771.3731.389

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C12347001400210028003500SE +/- 0.98, N = 3SE +/- 3.60, N = 3SE +/- 3.85, N = 3SE +/- 1.69, N = 33438.093463.423448.623472.031. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p12470140210280350SE +/- 4.00, N = 3SE +/- 3.63, N = 3SE +/- 3.45, N = 3299.54302.25301.04MIN: 212.6 / MAX: 499.39MIN: 213.91 / MAX: 505.1MIN: 213.89 / MAX: 507.671. (CC) gcc options: -pthread

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5124918273645SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 337.4437.4437.421. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O1234816243240SE +/- 0.16, N = 3SE +/- 0.26, N = 3SE +/- 0.26, N = 3SE +/- 0.29, N = 334.1934.8134.4034.751. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed12342K4K6K8K10KSE +/- 33.42, N = 3SE +/- 34.62, N = 3SE +/- 2.99, N = 3SE +/- 10.98, N = 37915.27837.57870.77867.11. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed123414002800420056007000SE +/- 4.14, N = 3SE +/- 9.69, N = 3SE +/- 2.97, N = 3SE +/- 5.66, N = 36691.286682.996668.816669.471. (CC) gcc options: -O3

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption12490180270360450SE +/- 1.38, N = 3SE +/- 0.26, N = 3SE +/- 0.25, N = 2393.0394.4394.5

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption12480160240320400SE +/- 0.15, N = 2SE +/- 0.15, N = 3SE +/- 0.20, N = 3390.8390.5390.8

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption124160320480640800SE +/- 0.33, N = 3SE +/- 2.15, N = 2SE +/- 0.90, N = 3719.6719.4718.8

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption124150300450600750SE +/- 0.57, N = 3SE +/- 0.99, N = 3SE +/- 0.64, N = 3705.0705.7705.6

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption124400800120016002000SE +/- 2.21, N = 3SE +/- 2.49, N = 3SE +/- 5.05, N = 31835.91838.11833.4

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption124400800120016002000SE +/- 1.43, N = 3SE +/- 2.73, N = 3SE +/- 4.40, N = 31834.61838.31832.9

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption12490180270360450SE +/- 0.09, N = 3SE +/- 0.23, N = 3SE +/- 0.24, N = 3394.1394.5394.4

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption12480160240320400SE +/- 0.84, N = 3SE +/- 0.26, N = 3SE +/- 0.13, N = 3390.1390.9390.8

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption124160320480640800SE +/- 0.32, N = 3SE +/- 0.70, N = 3SE +/- 0.64, N = 3720.0720.1719.0

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption124150300450600750SE +/- 1.02, N = 3SE +/- 0.22, N = 3SE +/- 0.67, N = 3704.8705.8705.7

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption124400800120016002000SE +/- 2.09, N = 3SE +/- 3.87, N = 3SE +/- 5.93, N = 32051.02055.02047.6

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption124400800120016002000SE +/- 3.45, N = 3SE +/- 2.61, N = 3SE +/- 3.88, N = 32043.42049.52042.3

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool124140K280K420K560K700KSE +/- 281.67, N = 3SE +/- 564.33, N = 3665903665904665340

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha512124300K600K900K1200K1500KSE +/- 2100.92, N = 3SE +/- 5508.00, N = 3157997915760541581562

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 101240.64581.29161.93742.58323.229SE +/- 0.022, N = 3SE +/- 0.024, N = 3SE +/- 0.024, N = 32.8072.8652.870

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz124612182430SE +/- 0.07, N = 4SE +/- 0.05, N = 4SE +/- 0.12, N = 423.6723.7223.49

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100124130260390520650SE +/- 0.60, N = 3SE +/- 1.48, N = 3SE +/- 1.02, N = 3603.25603.13602.751. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH124300K600K900K1200K1500KSE +/- 13446.73, N = 3SE +/- 14467.44, N = 9SE +/- 6781.65, N = 31507373.501486920.071513254.551. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12448121620SE +/- 0.07, N = 5SE +/- 0.05, N = 5SE +/- 0.04, N = 517.0116.9416.931. (CXX) g++ options: -rdynamic

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12430K60K90K120K150KSE +/- 574.00, N = 3SE +/- 313.64, N = 3SE +/- 645.39, N = 3137084.98136732.15137374.611. (CC) gcc options: -O2 -lrt" -lrt

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v212480160240320400SE +/- 0.30, N = 3SE +/- 0.12, N = 3SE +/- 0.79, N = 3363.11362.72363.98MIN: 361.8 / MAX: 364.48MIN: 361.88 / MAX: 365.57MIN: 362.08 / MAX: 366.51. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C1234110220330440550SE +/- 0.45, N = 3SE +/- 5.06, N = 8SE +/- 0.41, N = 3SE +/- 1.81, N = 3514.12507.42513.26510.531. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C123415003000450060007500SE +/- 4.14, N = 3SE +/- 3.90, N = 3SE +/- 1.13, N = 3SE +/- 2.21, N = 37148.357153.827142.147142.461. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.112480160240320400SE +/- 0.06, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 3345.56345.25345.34MIN: 344.76 / MAX: 347.02MIN: 344.1 / MAX: 347.01MIN: 344.17 / MAX: 346.641. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression1234140280420560700SE +/- 0.58, N = 36486486486461. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression12344080120160200SE +/- 0.33, N = 31691701701701. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression1234120240360480600SE +/- 0.67, N = 3SE +/- 0.67, N = 35585585585571. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression123490180270360450SE +/- 0.33, N = 34274274274281. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression1234400800120016002000SE +/- 3.28, N = 3SE +/- 4.26, N = 3SE +/- 5.03, N = 3SE +/- 3.38, N = 317491750174917611. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression123420406080100838483831. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression1234100200300400500SE +/- 0.33, N = 3SE +/- 0.33, N = 34474474474471. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression123420406080100SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3909090921. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression123430060090012001500SE +/- 2.65, N = 3SE +/- 0.67, N = 3SE +/- 7.97, N = 3SE +/- 2.85, N = 316051600159316041. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression1234100200300400500SE +/- 1.15, N = 3SE +/- 1.33, N = 3SE +/- 0.33, N = 34534514524531. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1243691215SE +/- 0.09, N = 5SE +/- 0.04, N = 5SE +/- 0.05, N = 513.2713.1913.221. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU12343691215SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 312.6012.4712.4812.75MIN: 11.11MIN: 11.03MIN: 11.03MIN: 11.151. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU12343691215SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 312.5012.5712.4412.47MIN: 11.34MIN: 11.44MIN: 11.38MIN: 11.011. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP124500K1000K1500K2000K2500KSE +/- 7282.72, N = 3SE +/- 21480.12, N = 3SE +/- 20957.17, N = 42447471.581501169.131465596.161. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p12460120180240300SE +/- 2.99, N = 3SE +/- 2.75, N = 3SE +/- 3.22, N = 6275.71277.67275.34MIN: 249.12 / MAX: 309.54MIN: 250.8 / MAX: 311.34MIN: 211.52 / MAX: 312.591. (CC) gcc options: -pthread

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1243691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.3012.3112.281. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD124400K800K1200K1600K2000KSE +/- 24215.71, N = 4SE +/- 8591.83, N = 3SE +/- 13080.73, N = 31917948.341934230.871942305.791. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1243691215SE +/- 0.037, N = 5SE +/- 0.068, N = 5SE +/- 0.044, N = 59.7899.7649.9091. (CXX) g++ options: -fvisibility=hidden -logg -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET124400K800K1200K1600K2000KSE +/- 11364.87, N = 3SE +/- 11294.99, N = 3SE +/- 7748.68, N = 31717799.751721937.791721260.041. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression123450100150200250SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 32082092092081. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU12343691215SE +/- 0.03159, N = 3SE +/- 0.02455, N = 3SE +/- 0.01670, N = 3SE +/- 0.01698, N = 39.449459.396979.432669.41427MIN: 8.38MIN: 8.24MIN: 8.21MIN: 8.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU12340.96551.9312.89653.8624.8275SE +/- 0.00558, N = 3SE +/- 0.00482, N = 3SE +/- 0.00537, N = 3SE +/- 0.00642, N = 34.286594.284984.291284.28045MIN: 3.8MIN: 3.79MIN: 3.8MIN: 3.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET124500K1000K1500K2000K2500KSE +/- 32237.70, N = 3SE +/- 20266.35, N = 3SE +/- 19501.96, N = 32281791.252189234.252165661.081. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2123440M80M120M160M200MSE +/- 27815.64, N = 3SE +/- 72288.18, N = 3SE +/- 87081.74, N = 3SE +/- 65783.92, N = 31806211001805828331805769001807050331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU12341.22332.44663.66994.89326.1165SE +/- 0.00277, N = 3SE +/- 0.00432, N = 3SE +/- 0.00933, N = 3SE +/- 0.00360, N = 35.388945.377205.436835.42404MIN: 5.3MIN: 5.28MIN: 5.35MIN: 5.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1234246810SE +/- 0.00175, N = 3SE +/- 0.00949, N = 3SE +/- 0.01684, N = 3SE +/- 0.02622, N = 37.415137.427227.444497.38944MIN: 6.64MIN: 6.63MIN: 6.63MIN: 6.621. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default1243691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 310.5310.5110.561. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU12340.69031.38062.07092.76123.4515SE +/- 0.01345, N = 3SE +/- 0.00215, N = 3SE +/- 0.00090, N = 3SE +/- 0.01115, N = 33.067972.944282.966072.96463MIN: 2.96MIN: 2.85MIN: 2.89MIN: 2.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU12343691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 311.3611.0211.8111.46MIN: 11.21MIN: 10.88MIN: 11.65MIN: 11.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1243691215SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 39.429.539.531. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

libavif avifenc

Encoder Speed: 8

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 8124246810SE +/- 0.004, N = 3SE +/- 0.011, N = 3SE +/- 0.014, N = 38.4748.5128.5211. (CXX) g++ options: -O3 -fPIC

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 10124246810SE +/- 0.002, N = 3SE +/- 0.053, N = 3SE +/- 0.027, N = 37.8147.8677.8071. (CXX) g++ options: -O3 -fPIC

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein12340.59361.18721.78082.37442.968SE +/- 0.030, N = 3SE +/- 0.001, N = 3SE +/- 0.007, N = 3SE +/- 0.006, N = 32.6152.6382.6292.6221. (CXX) g++ options: -O3 -pthread -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1234510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 321.3821.3021.3521.32MIN: 21.12MIN: 21.2MIN: 21.27MIN: 21.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU1234510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 322.6022.4322.4222.44MIN: 22.5MIN: 22.31MIN: 22.33MIN: 22.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3123430060090012001500SE +/- 0.58, N = 3SE +/- 0.89, N = 3SE +/- 2.48, N = 3SE +/- 0.65, N = 31172.901170.001165.741166.671. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

FFTE

N=256, 3D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine12344K8K12K16K20KSE +/- 29.86, N = 3SE +/- 16.02, N = 3SE +/- 44.69, N = 3SE +/- 42.48, N = 319035.0918773.2719060.1919047.021. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1234246810SE +/- 0.00369, N = 3SE +/- 0.00963, N = 3SE +/- 0.03920, N = 3SE +/- 0.00650, N = 38.250578.229618.281098.20667MIN: 7.78MIN: 7.72MIN: 7.77MIN: 7.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU123448121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 316.4116.3016.4916.38MIN: 15.36MIN: 15.26MIN: 15.39MIN: 15.261. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.4