E3-1260L 2021

Intel Xeon E3-1260L v5 testing with an ASRock E3V5 WS (P7.10 BIOS) and XFX NVIDIA GeForce GT 220 on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102050-HA-E31260L2045&gru&sor.

E3-1260L 2021
Runs: 1, 2, 3, 4

Processor: Intel Xeon E3-1260L v5 @ 3.90GHz (4 Cores / 8 Threads)
Motherboard: ASRock E3V5 WS (P7.10 BIOS)
Chipset: Intel Xeon E3-1200 v5/E3-1500
Memory: 8GB
Disk: 120GB INTEL SSDSC2BW12
Graphics: XFX NVIDIA GeForce GT 220
Audio: Realtek ALC892
Monitor: LG Ultra HD
Network: Intel I219-LM
OS: Ubuntu 20.10
Kernel: 5.8.0-33-generic (x86_64)
Desktop: GNOME Shell 3.38.1
Display Server: X Server 1.20.8
Display Driver: xfx
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: intel_pstate powersave
- CPU Microcode: 0xe2
- Thermald 2.3

Python Details
- Python 3.8.6

Security Details
- itlb_multihit: KVM: Mitigation of VMX disabled
- l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
- mds: Mitigation of Clear buffers; SMT vulnerable
- meltdown: Mitigation of PTI
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
- srbds: Mitigation of Microcode
- tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

E3-1260L 2021amg: dav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitrav1e: 1rav1e: 5rav1e: 6rav1e: 10onnx: yolov4 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUcryptsetup: PBKDF2-sha512cryptsetup: PBKDF2-whirlpoolcoremark: CoreMark Size 666 - Iterations Per Secondlzbench: XZ 0 - Compressionlzbench: XZ 0 - Decompressionlzbench: Zstd 1 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Zstd 8 - Compressionlzbench: Zstd 8 - Decompressionlzbench: Crush 0 - Compressionlzbench: Crush 0 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Brotli 0 - Decompressionlzbench: Brotli 2 - Compressionlzbench: Brotli 2 - Decompressionlzbench: Libdeflate 1 - Compressioncompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedquantlib: ffte: N=256, 3D Complex FFT Routinecryptsetup: AES-XTS 256b Encryptioncryptsetup: AES-XTS 256b Decryptioncryptsetup: Serpent-XTS 256b Encryptioncryptsetup: Serpent-XTS 256b Decryptioncryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Twofish-XTS 256b Decryptioncryptsetup: AES-XTS 512b Encryptioncryptsetup: AES-XTS 512b Decryptioncryptsetup: Serpent-XTS 512b Encryptioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: Twofish-XTS 512b Decryptionlammps: Rhodopsin Proteinredis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETkripke: npb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: LU.Cnpb: MG.Cnpb: SP.Bsynthmark: VoiceMark_100lulesh: pennant: sedovbigpennant: leblancbigonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: 
Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUmnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mtnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1openfoam: Motorbike 30Mavifenc: 0avifenc: 2avifenc: 8avifenc: 10build-godot: Time To Compilebuild2: Time To Compilebuild-eigen: Time To Compileencode-ape: WAV To APEencode-opus: WAV To Opus Encodegcrypt: webp2: Defaultwebp2: Quality 75, Compression Effort 7webp2: Quality 95, Compression Effort 7webp2: Quality 100, Compression Effort 5webp2: Quality 100, Lossless Compressionastcenc: Fastastcenc: Mediumastcenc: Thoroughastcenc: Exhaustiveencode-wavpack: WAV To WavPackgnupg: 2.7GB Sample File Encryptionunpack-firefox: firefox-84.0.source.tar.xzqmcpack: 
simple-H2O1234180621100299.5478.94275.7165.100.3621.0381.3772.807207312331410422311579979665903137084.980366381054531605831749904474275581696482086691.287915.245.787869.744.527847.42181.819035.0914023982043.42051.0704.8720.0390.1394.11834.61835.9705.0719.6390.8393.02.6152447471.581917948.341507373.502281791.251717799.752483486313485.453438.09514.12504.276871.1914764.507148.354409.82603.2531172.9002172.9862105.06399.4494511.36394.286593.0679722.604712.497416.413321.384112.59698.250578915.854632.028917.524631.895.388948927.634640.837.415137.81148.8894.3895.07160.43230.447.996.549.786.7410.912.7021.7589.2221.8819.5146.6240.9531.3017.56363.114345.555386.71203.250119.2018.4747.814363.213363.90989.88813.2689.789233.10010.533639.6101178.69337.4372398.2739.4212.3080.27643.4517.00573.30123.66834.189180582833302.2579.53277.6765.040.3621.0501.3732.865207311331410322241576054665904136732.151599381054511600841750904474275581706482096682.997837.545.747834.344.427844.02168.318773.2668944952049.52055.0705.8720.1390.9394.51838.31838.1705.7719.4390.5394.42.6381501169.131934230.871486920.072189234.251721937.792491550713518.043463.42507.42503.556902.7514562.227153.824387.59603.1321169.9978173.6922105.63999.3969711.01874.284982.9442822.427612.571916.297121.300012.47498.229618919.794628.678925.454628.545.377208933.084630.477.427227.78948.8894.3625.07059.99930.457.996.549.806.7310.912.7021.7588.8621.9019.5346.6640.9631.2317.52362.724345.251387.68203.846120.2818.5127.867363.916367.30990.25913.1869.764232.51110.508642.3711180.01937.4432400.6639.5312.3180.32643.8816.93873.10623.71834.814180576900381054521593831749904474275581706482096668.817870.745.707840.244.617852.52167.019060.1889577232.62913518.013448.62513.26502.036986.4014704.447142.144391.741165.7444173.6868105.46409.4326611.80984.291282.9660722.419412.444216.488221.354012.47858.281098939.184635.338944.354645.085.436838945.604643.817.44449388.4534.395180705033301.0479.46275.3465.310.3611.0491.3892.8702083113314117222915815626653401373
74.605600381054531604831761924474285571706462086669.477867.145.647851.044.407843.82165.519047.0219170172042.32047.6705.7719.0390.8394.41832.91833.4705.6718.8390.8394.52.6221465596.161942305.791513254.552165661.081721260.042492555313540.523472.03510.53501.696982.3114707.687142.464391.95602.7461166.6683173.5271105.57369.4142711.46324.280452.9646322.435012.469316.380221.324612.74768.206678931.434639.218938.374632.415.424048936.314640.347.389447.77148.3654.3305.04759.49430.367.986.539.766.7910.982.6321.7688.8421.9019.5346.6840.8631.2417.53363.978345.338388.02203.680119.6238.5217.807364.058367.58790.18713.2249.909232.47710.562641.2851177.82837.4192397.7499.5312.2880.22643.5716.93373.12423.48634.752OpenBenchmarking.org

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
Run 4: 180705033 (SE +/- 65783.92, N = 3)
Run 1: 180621100 (SE +/- 27815.64, N = 3)
Run 2: 180582833 (SE +/- 72288.18, N = 3)
Run 3: 180576900 (SE +/- 87081.74, N = 3)
(CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
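Each result in this export is annotated with "SE +/- x, N = n". As a minimal sketch, assuming SE here means the standard error of the mean over N trial runs (sample standard deviation divided by the square root of N), the figure could be reproduced as follows. The trial values are hypothetical; the export reports only the mean and the SE, not the individual trials.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-trial values, not taken from the export.
trials = [300.0, 303.0, 306.0]
mean = statistics.mean(trials)
se = standard_error(trials)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(trials)})")
```

A small SE relative to the mean suggests the trials agreed closely; differences between runs that are smaller than the SE are probably noise.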

dav1d

Video Input: Chimera 1080p

dav1d 0.8.1 (FPS, More Is Better)
Run 2: 302.25 (SE +/- 3.63, N = 3; MIN: 213.91 / MAX: 505.1)
Run 4: 301.04 (SE +/- 3.45, N = 3; MIN: 213.89 / MAX: 507.67)
Run 1: 299.54 (SE +/- 4.00, N = 3; MIN: 212.6 / MAX: 499.39)
(CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.1 (FPS, More Is Better)
Run 2: 79.53 (SE +/- 0.27, N = 3; MIN: 72.51 / MAX: 96.2)
Run 4: 79.46 (SE +/- 0.42, N = 3; MIN: 72.52 / MAX: 95.62)
Run 1: 78.94 (SE +/- 0.77, N = 3; MIN: 71.73 / MAX: 96.06)
(CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.1 (FPS, More Is Better)
Run 2: 277.67 (SE +/- 2.75, N = 3; MIN: 250.8 / MAX: 311.34)
Run 1: 275.71 (SE +/- 2.99, N = 3; MIN: 249.12 / MAX: 309.54)
Run 4: 275.34 (SE +/- 3.22, N = 6; MIN: 211.52 / MAX: 312.59)
(CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.1 (FPS, More Is Better)
Run 4: 65.31 (SE +/- 0.39, N = 3; MIN: 44.34 / MAX: 148.99)
Run 1: 65.10 (SE +/- 0.57, N = 3; MIN: 44.13 / MAX: 149.8)
Run 2: 65.04 (SE +/- 0.58, N = 3; MIN: 44.01 / MAX: 149.22)
(CC) gcc options: -pthread

rav1e

Speed: 1

rav1e 0.4 (Frames Per Second, More Is Better)
Run 2: 0.362 (SE +/- 0.000, N = 3)
Run 1: 0.362 (SE +/- 0.001, N = 3)
Run 4: 0.361 (SE +/- 0.001, N = 3)

rav1e

Speed: 5

rav1e 0.4 (Frames Per Second, More Is Better)
Run 2: 1.050 (SE +/- 0.002, N = 3)
Run 4: 1.049 (SE +/- 0.017, N = 3)
Run 1: 1.038 (SE +/- 0.007, N = 3)

rav1e

Speed: 6

rav1e 0.4 (Frames Per Second, More Is Better)
Run 4: 1.389 (SE +/- 0.010, N = 3)
Run 1: 1.377 (SE +/- 0.006, N = 3)
Run 2: 1.373 (SE +/- 0.008, N = 3)

rav1e

Speed: 10

rav1e 0.4 (Frames Per Second, More Is Better)
Run 4: 2.870 (SE +/- 0.024, N = 3)
Run 2: 2.865 (SE +/- 0.024, N = 3)
Run 1: 2.807 (SE +/- 0.022, N = 3)

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 4: 208 (SE +/- 0.29, N = 3)
Run 2: 207 (SE +/- 0.29, N = 3)
Run 1: 207 (SE +/- 0.17, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
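The ONNX Runtime results are given in inferences per minute. As a rough sketch, assuming inferences run back to back with no idle time (an assumption, not something the export states), a throughput figure can be turned into an approximate per-inference latency. The helper name `ms_per_inference` is hypothetical; the input values are the yolov4 results reported above.

```python
def ms_per_inference(inferences_per_minute):
    """Approximate latency in milliseconds, assuming strictly sequential inference."""
    if inferences_per_minute <= 0:
        raise ValueError("throughput must be positive")
    return 60_000.0 / inferences_per_minute


# yolov4 results from this export, keyed by run identifier.
yolov4 = {"4": 208, "2": 207, "1": 207}
for run, ipm in yolov4.items():
    print(f"Run {run}: {ipm} inferences/min is about {ms_per_inference(ipm):.1f} ms each")
```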

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 1: 312 (SE +/- 0.29, N = 3)
Run 4: 311 (SE +/- 0.50, N = 3)
Run 2: 311 (SE +/- 0.44, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 4: 33 (SE +/- 0.00, N = 3)
Run 2: 33 (SE +/- 0.00, N = 3)
Run 1: 33 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 4: 14117 (SE +/- 40.07, N = 3)
Run 1: 14104 (SE +/- 43.67, N = 3)
Run 2: 14103 (SE +/- 12.85, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 1: 2231 (SE +/- 4.48, N = 3)
Run 4: 2229 (SE +/- 1.48, N = 3)
Run 2: 2224 (SE +/- 2.17, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Cryptsetup

PBKDF2-sha512

Cryptsetup (Iterations Per Second, More Is Better)
Run 4: 1581562
Run 1: 1579979
Run 2: 1576054
SE values shown (run attribution not recoverable): SE +/- 2100.92, N = 3; SE +/- 5508.00, N = 3

Cryptsetup

PBKDF2-whirlpool

Cryptsetup (Iterations Per Second, More Is Better)
Run 2: 665904
Run 1: 665903
Run 4: 665340
SE values shown (run attribution not recoverable): SE +/- 564.33, N = 3; SE +/- 281.67, N = 3

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
Run 4: 137374.61 (SE +/- 645.39, N = 3)
Run 1: 137084.98 (SE +/- 574.00, N = 3)
Run 2: 136732.15 (SE +/- 313.64, N = 3)
(CC) gcc options: -O2 -lrt" -lrt

lzbench

Test: XZ 0 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 4: 38
Run 3: 38
Run 2: 38
Run 1: 38
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 4: 105
Run 3: 105
Run 2: 105
Run 1: 105
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 4: 453
Run 1: 453
Run 3: 452
Run 2: 451
SE values shown (run attribution not recoverable): SE +/- 0.33, N = 3; SE +/- 1.15, N = 3; SE +/- 1.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 1605 (SE +/- 2.65, N = 3)
Run 4: 1604 (SE +/- 2.85, N = 3)
Run 2: 1600 (SE +/- 0.67, N = 3)
Run 3: 1593 (SE +/- 7.97, N = 3)
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 2: 84
Run 4: 83
Run 3: 83
Run 1: 83
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 4: 1761 (SE +/- 3.38, N = 3)
Run 2: 1750 (SE +/- 4.26, N = 3)
Run 3: 1749 (SE +/- 5.03, N = 3)
Run 1: 1749 (SE +/- 3.28, N = 3)
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 4: 92
Run 3: 90
Run 2: 90
Run 1: 90
SE values shown (run attribution not recoverable): SE +/- 0.33, N = 3; SE +/- 0.58, N = 3; SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 4: 447
Run 3: 447
Run 2: 447
Run 1: 447
SE values shown (run attribution not recoverable): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 4: 428
Run 3: 427
Run 2: 427
Run 1: 427
SE value shown (run attribution not recoverable): SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 3: 558
Run 2: 558
Run 1: 558
Run 4: 557
SE values shown (run attribution not recoverable): SE +/- 0.67, N = 3; SE +/- 0.67, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 4: 170
Run 3: 170
Run 2: 170
Run 1: 169
SE value shown (run attribution not recoverable): SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 3: 648
Run 2: 648
Run 1: 648
Run 4: 646
SE value shown (run attribution not recoverable): SE +/- 0.58, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 3: 209
Run 2: 209
Run 4: 208
Run 1: 208
SE values shown (run attribution not recoverable): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LZ4 Compression

Compression Level: 1 - Compression Speed

LZ4 Compression 1.9.3 (MB/s, More Is Better)
Run 1: 6691.28 (SE +/- 4.14, N = 3)
Run 2: 6682.99 (SE +/- 9.69, N = 3)
Run 4: 6669.47 (SE +/- 5.66, N = 3)
Run 3: 6668.81 (SE +/- 2.97, N = 3)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Decompression Speed

LZ4 Compression 1.9.3 (MB/s, More Is Better)
Run 1: 7915.2 (SE +/- 33.42, N = 3)
Run 3: 7870.7 (SE +/- 2.99, N = 3)
Run 4: 7867.1 (SE +/- 10.98, N = 3)
Run 2: 7837.5 (SE +/- 34.62, N = 3)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

LZ4 Compression 1.9.3 (MB/s, More Is Better)
Run 1: 45.78 (SE +/- 0.00, N = 3)
Run 2: 45.74 (SE +/- 0.03, N = 3)
Run 3: 45.70 (SE +/- 0.07, N = 3)
Run 4: 45.64 (SE +/- 0.09, N = 3)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

LZ4 Compression 1.9.3 (MB/s, More Is Better)
Run 1: 7869.7 (SE +/- 11.01, N = 3)
Run 4: 7851.0 (SE +/- 14.70, N = 3)
Run 3: 7840.2 (SE +/- 6.77, N = 3)
Run 2: 7834.3 (SE +/- 9.12, N = 3)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

LZ4 Compression 1.9.3 (MB/s, More Is Better)
Run 3: 44.61 (SE +/- 0.02, N = 3)
Run 1: 44.52 (SE +/- 0.09, N = 3)
Run 2: 44.42 (SE +/- 0.17, N = 3)
Run 4: 44.40 (SE +/- 0.13, N = 3)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

LZ4 Compression 1.9.3 (MB/s, More Is Better)
Run 3: 7852.5 (SE +/- 9.72, N = 3)
Run 1: 7847.4 (SE +/- 12.26, N = 3)
Run 2: 7844.0 (SE +/- 2.80, N = 3)
Run 4: 7843.8 (SE +/- 7.13, N = 3)
(CC) gcc options: -O3

QuantLib

QuantLib 1.21 (MFLOPS, More Is Better)
Run 1: 2181.8 (SE +/- 6.62, N = 3)
Run 2: 2168.3 (SE +/- 27.08, N = 5)
Run 3: 2167.0 (SE +/- 19.28, N = 12)
Run 4: 2165.5 (SE +/- 27.91, N = 5)
(CXX) g++ options: -O3 -march=native -rdynamic

FFTE

N=256, 3D Complex FFT Routine

FFTE 7.0 (MFLOPS, More Is Better)
Run 3: 19060.19 (SE +/- 44.69, N = 3)
Run 4: 19047.02 (SE +/- 42.48, N = 3)
Run 1: 19035.09 (SE +/- 29.86, N = 3)
Run 2: 18773.27 (SE +/- 16.02, N = 3)
(F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

Cryptsetup

AES-XTS 256b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 2049.5 (SE +/- 2.61, N = 3)
Run 1: 2043.4 (SE +/- 3.45, N = 3)
Run 4: 2042.3 (SE +/- 3.88, N = 3)

Cryptsetup

AES-XTS 256b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 2055.0 (SE +/- 3.87, N = 3)
Run 1: 2051.0 (SE +/- 2.09, N = 3)
Run 4: 2047.6 (SE +/- 5.93, N = 3)

Cryptsetup

Serpent-XTS 256b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 705.8 (SE +/- 0.22, N = 3)
Run 4: 705.7 (SE +/- 0.67, N = 3)
Run 1: 704.8 (SE +/- 1.02, N = 3)

Cryptsetup

Serpent-XTS 256b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 720.1 (SE +/- 0.70, N = 3)
Run 1: 720.0 (SE +/- 0.32, N = 3)
Run 4: 719.0 (SE +/- 0.64, N = 3)

Cryptsetup

Twofish-XTS 256b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 390.9 (SE +/- 0.26, N = 3)
Run 4: 390.8 (SE +/- 0.13, N = 3)
Run 1: 390.1 (SE +/- 0.84, N = 3)

Cryptsetup

Twofish-XTS 256b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 394.5 (SE +/- 0.23, N = 3)
Run 4: 394.4 (SE +/- 0.24, N = 3)
Run 1: 394.1 (SE +/- 0.09, N = 3)

Cryptsetup

AES-XTS 512b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 1838.3 (SE +/- 2.73, N = 3)
Run 1: 1834.6 (SE +/- 1.43, N = 3)
Run 4: 1832.9 (SE +/- 4.40, N = 3)

Cryptsetup

AES-XTS 512b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 1838.1 (SE +/- 2.49, N = 3)
Run 1: 1835.9 (SE +/- 2.21, N = 3)
Run 4: 1833.4 (SE +/- 5.05, N = 3)

Cryptsetup

Serpent-XTS 512b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 2: 705.7 (SE +/- 0.99, N = 3)
Run 4: 705.6 (SE +/- 0.64, N = 3)
Run 1: 705.0 (SE +/- 0.57, N = 3)

Cryptsetup

Serpent-XTS 512b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 719.6 (SE +/- 0.33, N = 3)
Run 2: 719.4 (SE +/- 2.15, N = 2)
Run 4: 718.8 (SE +/- 0.90, N = 3)

Cryptsetup

Twofish-XTS 512b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 4: 390.8 (SE +/- 0.20, N = 3)
Run 1: 390.8 (SE +/- 0.15, N = 2)
Run 2: 390.5 (SE +/- 0.15, N = 3)

Cryptsetup

Twofish-XTS 512b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 4: 394.5 (SE +/- 0.25, N = 2)
Run 2: 394.4 (SE +/- 0.26, N = 3)
Run 1: 393.0 (SE +/- 1.38, N = 3)

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
Run 2: 2.638 (SE +/- 0.001, N = 3)
Run 3: 2.629 (SE +/- 0.007, N = 3)
Run 4: 2.622 (SE +/- 0.006, N = 3)
Run 1: 2.615 (SE +/- 0.030, N = 3)
(CXX) g++ options: -O3 -pthread -lm

Redis

Test: LPOP

Redis 6.0.9 (Requests Per Second, More Is Better)
Run 1: 2447471.58 (SE +/- 7282.72, N = 3)
Run 2: 1501169.13 (SE +/- 21480.12, N = 3)
Run 4: 1465596.16 (SE +/- 20957.17, N = 4)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
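The LPOP result stands out because run 1 is far ahead of runs 2 and 4, while most other tests in this export differ by about a percent or less between runs. A minimal sketch of how the gap to the fastest run can be expressed as a percentage, using the three LPOP values reported above (the helper name `percent_below_best` is hypothetical):

```python
def percent_below_best(results):
    """Map each run to how far (in percent) it falls below the best result.

    Assumes a "more is better" metric, so the best result is the maximum.
    """
    best = max(results.values())
    return {run: 100.0 * (best - value) / best for run, value in results.items()}


# Redis LPOP requests per second from this export, keyed by run identifier.
lpop = {"1": 2447471.58, "2": 1501169.13, "4": 1465596.16}
for run, pct in percent_below_best(lpop).items():
    print(f"Run {run}: {pct:.1f}% below the fastest run")
```

For "fewer is better" metrics such as the Pennant and oneDNN timings, the same idea applies with `min` in place of `max` and the sign of the difference reversed.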

Redis

Test: SADD

Redis 6.0.9 (Requests Per Second, More Is Better)
Run 4: 1942305.79 (SE +/- 13080.73, N = 3)
Run 2: 1934230.87 (SE +/- 8591.83, N = 3)
Run 1: 1917948.34 (SE +/- 24215.71, N = 4)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

Redis 6.0.9 (Requests Per Second, More Is Better)
Run 4: 1513254.55 (SE +/- 6781.65, N = 3)
Run 1: 1507373.50 (SE +/- 13446.73, N = 3)
Run 2: 1486920.07 (SE +/- 14467.44, N = 9)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

Redis 6.0.9 (Requests Per Second, More Is Better)
Run 1: 2281791.25 (SE +/- 32237.70, N = 3)
Run 2: 2189234.25 (SE +/- 20266.35, N = 3)
Run 4: 2165661.08 (SE +/- 19501.96, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

Redis 6.0.9 (Requests Per Second, More Is Better)
Run 2: 1721937.79 (SE +/- 11294.99, N = 3)
Run 4: 1721260.04 (SE +/- 7748.68, N = 3)
Run 1: 1717799.75 (SE +/- 11364.87, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Kripke

Kripke 1.2.4 (Throughput FoM, More Is Better)
Run 4: 24925553 (SE +/- 248353.03, N = 3)
Run 2: 24915507 (SE +/- 267442.30, N = 3)
Run 1: 24834863 (SE +/- 307740.69, N = 3)
(CXX) g++ options: -O3 -fopenmp

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Run 4: 13540.52 (SE +/- 5.42, N = 3)
Run 2: 13518.04 (SE +/- 19.55, N = 3)
Run 3: 13518.01 (SE +/- 6.18, N = 3)
Run 1: 13485.45 (SE +/- 8.84, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Run 4: 3472.03 (SE +/- 1.69, N = 3)
Run 2: 3463.42 (SE +/- 3.60, N = 3)
Run 3: 3448.62 (SE +/- 3.85, N = 3)
Run 1: 3438.09 (SE +/- 0.98, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Run 1: 514.12 (SE +/- 0.45, N = 3)
Run 3: 513.26 (SE +/- 0.41, N = 3)
Run 4: 510.53 (SE +/- 1.81, N = 3)
Run 2: 507.42 (SE +/- 5.06, N = 8)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Run 1: 504.27 (SE +/- 6.77, N = 4)
Run 2: 503.55 (SE +/- 5.28, N = 9)
Run 3: 502.03 (SE +/- 8.69, N = 3)
Run 4: 501.69 (SE +/- 5.41, N = 7)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Run 3: 6986.40 (SE +/- 12.69, N = 3)
Run 4: 6982.31 (SE +/- 15.55, N = 3)
Run 2: 6902.75 (SE +/- 15.03, N = 3)
Run 1: 6871.19 (SE +/- 72.82, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Run 1: 14764.50 (SE +/- 14.35, N = 3)
Run 4: 14707.68 (SE +/- 15.78, N = 3)
Run 3: 14704.44 (SE +/- 10.10, N = 3)
Run 2: 14562.22 (SE +/- 134.72, N = 10)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Run 2: 7153.82 (SE +/- 3.90, N = 3)
Run 1: 7148.35 (SE +/- 4.14, N = 3)
Run 4: 7142.46 (SE +/- 2.21, N = 3)
Run 3: 7142.14 (SE +/- 1.13, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Run 1: 4409.82 (SE +/- 0.56, N = 3)
Run 4: 4391.95 (SE +/- 2.68, N = 3)
Run 3: 4391.74 (SE +/- 1.21, N = 3)
Run 2: 4387.59 (SE +/- 0.54, N = 3)
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Open MPI 4.0.3

Google SynthMark

Test: VoiceMark_100

Google SynthMark 20201109 (Voices, More Is Better)
Run 1: 603.25 (SE +/- 0.60, N = 3)
Run 2: 603.13 (SE +/- 1.48, N = 3)
Run 4: 602.75 (SE +/- 1.02, N = 3)
(CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

LULESH

LULESH 2.0.3 (z/s, More Is Better)
Run 1: 1172.90 (SE +/- 0.58, N = 3)
Run 2: 1170.00 (SE +/- 0.89, N = 3)
Run 4: 1166.67 (SE +/- 0.65, N = 3)
Run 3: 1165.74 (SE +/- 2.48, N = 3)
(CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Pennant

Test: sedovbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
Run 1: 172.99 (SE +/- 0.13, N = 3)
Run 4: 173.53 (SE +/- 0.17, N = 3)
Run 3: 173.69 (SE +/- 0.21, N = 3)
Run 2: 173.69 (SE +/- 0.04, N = 3)
(CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Test: leblancbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
Run 1: 105.06 (SE +/- 0.05, N = 3)
Run 3: 105.46 (SE +/- 0.01, N = 3)
Run 4: 105.57 (SE +/- 0.03, N = 3)
Run 2: 105.64 (SE +/- 0.01, N = 3)
(CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 2: 9.39697 (SE +/- 0.02455, N = 3; MIN: 8.24)
Run 4: 9.41427 (SE +/- 0.01698, N = 3; MIN: 8.17)
Run 3: 9.43266 (SE +/- 0.01670, N = 3; MIN: 8.21)
Run 1: 9.44945 (SE +/- 0.03159, N = 3; MIN: 8.38)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 2: 11.02 (SE +/- 0.01, N = 3; MIN: 10.88)
Run 1: 11.36 (SE +/- 0.01, N = 3; MIN: 11.21)
Run 4: 11.46 (SE +/- 0.01, N = 3; MIN: 11.3)
Run 3: 11.81 (SE +/- 0.01, N = 3; MIN: 11.65)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 4: 4.28045 (SE +/- 0.00642, N = 3; MIN: 3.8)
Run 2: 4.28498 (SE +/- 0.00482, N = 3; MIN: 3.79)
Run 1: 4.28659 (SE +/- 0.00558, N = 3; MIN: 3.8)
Run 3: 4.29128 (SE +/- 0.00537, N = 3; MIN: 3.8)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 2: 2.94428 (SE +/- 0.00215, N = 3; MIN: 2.85)
Run 4: 2.96463 (SE +/- 0.01115, N = 3; MIN: 2.87)
Run 3: 2.96607 (SE +/- 0.00090, N = 3; MIN: 2.89)
Run 1: 3.06797 (SE +/- 0.01345, N = 3; MIN: 2.96)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 3: 22.42 (SE +/- 0.01, N = 3; MIN: 22.33)
Run 2: 22.43 (SE +/- 0.02, N = 3; MIN: 22.31)
Run 4: 22.44 (SE +/- 0.01, N = 3; MIN: 22.33)
Run 1: 22.60 (SE +/- 0.01, N = 3; MIN: 22.5)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 3: 12.44 (SE +/- 0.01, N = 3; MIN: 11.38)
Run 4: 12.47 (SE +/- 0.05, N = 3; MIN: 11.01)
Run 1: 12.50 (SE +/- 0.06, N = 3; MIN: 11.34)
Run 2: 12.57 (SE +/- 0.03, N = 3; MIN: 11.44)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 2: 16.30 (SE +/- 0.02, N = 3; MIN: 15.26)
Run 4: 16.38 (SE +/- 0.03, N = 3; MIN: 15.26)
Run 1: 16.41 (SE +/- 0.02, N = 3; MIN: 15.36)
Run 3: 16.49 (SE +/- 0.01, N = 3; MIN: 15.39)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 2: 21.30 (SE +/- 0.01, N = 3; MIN: 21.2)
Run 4: 21.32 (SE +/- 0.01, N = 3; MIN: 21.23)
Run 3: 21.35 (SE +/- 0.01, N = 3; MIN: 21.27)
Run 1: 21.38 (SE +/- 0.01, N = 3; MIN: 21.12)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 2: 12.47, SE +/- 0.02, N = 3, MIN: 11.03
Result 3: 12.48, SE +/- 0.03, N = 3, MIN: 11.03
Result 1: 12.60, SE +/- 0.10, N = 3, MIN: 11.11
Result 4: 12.75, SE +/- 0.11, N = 3, MIN: 11.15
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 4: 8.20667, SE +/- 0.00650, N = 3, MIN: 7.72
Result 2: 8.22961, SE +/- 0.00963, N = 3, MIN: 7.72
Result 1: 8.25057, SE +/- 0.00369, N = 3, MIN: 7.78
Result 3: 8.28109, SE +/- 0.03920, N = 3, MIN: 7.77
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 1: 8915.85, SE +/- 2.73, N = 3, MIN: 8864.25
Result 2: 8919.79, SE +/- 2.10, N = 3, MIN: 8896.59
Result 4: 8931.43, SE +/- 2.31, N = 3, MIN: 8883.84
Result 3: 8939.18, SE +/- 4.02, N = 3, MIN: 8893.94
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 2: 4628.67, SE +/- 3.81, N = 3, MIN: 4609.9
Result 1: 4632.02, SE +/- 3.08, N = 3, MIN: 4614.43
Result 3: 4635.33, SE +/- 1.60, N = 3, MIN: 4621.25
Result 4: 4639.21, SE +/- 2.68, N = 3, MIN: 4621.74
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 1: 8917.52, SE +/- 5.71, N = 3, MIN: 8892.85
Result 2: 8925.45, SE +/- 2.41, N = 3, MIN: 8909.78
Result 4: 8938.37, SE +/- 1.31, N = 3, MIN: 8921.68
Result 3: 8944.35, SE +/- 2.21, N = 3, MIN: 8929.69
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 2: 4628.54, SE +/- 4.30, N = 3, MIN: 4606.88
Result 1: 4631.89, SE +/- 4.24, N = 3, MIN: 4615.88
Result 4: 4632.41, SE +/- 3.31, N = 3, MIN: 4614.33
Result 3: 4645.08, SE +/- 2.60, N = 3, MIN: 4629.78
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 2: 5.37720, SE +/- 0.00432, N = 3, MIN: 5.28
Result 1: 5.38894, SE +/- 0.00277, N = 3, MIN: 5.3
Result 4: 5.42404, SE +/- 0.00360, N = 3, MIN: 5.35
Result 3: 5.43683, SE +/- 0.00933, N = 3, MIN: 5.35
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 1: 8927.63, SE +/- 6.77, N = 3, MIN: 8896
Result 2: 8933.08, SE +/- 3.96, N = 3, MIN: 8916.93
Result 4: 8936.31, SE +/- 1.00, N = 3, MIN: 8921.31
Result 3: 8945.60, SE +/- 2.35, N = 3, MIN: 8926.38
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 2: 4630.47, SE +/- 5.57, N = 3, MIN: 4609.21
Result 4: 4640.34, SE +/- 3.98, N = 3, MIN: 4617.82
Result 1: 4640.83, SE +/- 7.28, N = 3, MIN: 4620.66
Result 3: 4643.81, SE +/- 5.16, N = 3, MIN: 4625.36
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Result 4: 7.38944, SE +/- 0.02622, N = 3, MIN: 6.62
Result 1: 7.41513, SE +/- 0.00175, N = 3, MIN: 6.64
Result 2: 7.42722, SE +/- 0.00949, N = 3, MIN: 6.63
Result 3: 7.44449, SE +/- 0.01684, N = 3, MIN: 6.63
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Result 4: 7.771, SE +/- 0.076, N = 3, MIN: 7.02 / MAX: 8.97
Result 2: 7.789, SE +/- 0.025, N = 3, MIN: 7.12 / MAX: 8.86
Result 1: 7.811, SE +/- 0.010, N = 3, MIN: 7.14 / MAX: 13.45
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Result 4: 48.37, SE +/- 0.75, N = 3, MIN: 46.64 / MAX: 52.55
Result 1: 48.89, SE +/- 0.14, N = 3, MIN: 48.41 / MAX: 50.57
Result 2: 48.89, SE +/- 0.14, N = 3, MIN: 48.4 / MAX: 141.21
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Result 4: 4.330, SE +/- 0.049, N = 3, MIN: 4.15 / MAX: 4.53
Result 2: 4.362, SE +/- 0.025, N = 3, MIN: 4.01 / MAX: 4.84
Result 1: 4.389, SE +/- 0.010, N = 3, MIN: 4.29 / MAX: 6.77
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Result 4: 5.047, SE +/- 0.029, N = 3, MIN: 4.56 / MAX: 5.57
Result 2: 5.070, SE +/- 0.018, N = 3, MIN: 4.58 / MAX: 5.75
Result 1: 5.071, SE +/- 0.015, N = 3, MIN: 4.59 / MAX: 6.1
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Result 4: 59.49, SE +/- 0.97, N = 3, MIN: 57.28 / MAX: 89.73
Result 2: 60.00, SE +/- 0.21, N = 3, MIN: 59.43 / MAX: 64.63
Result 1: 60.43, SE +/- 0.13, N = 3, MIN: 59.98 / MAX: 66.21
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

NCNN

Target: CPU - Model: mobilenet

NCNN 20201218 (ms, Fewer Is Better)
Result 4: 30.36, SE +/- 0.06, N = 3, MIN: 29.76 / MAX: 31.32
Result 1: 30.44, SE +/- 0.07, N = 3, MIN: 29.84 / MAX: 31.45
Result 2: 30.45, SE +/- 0.10, N = 3, MIN: 29.81 / MAX: 39.56
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218 (ms, Fewer Is Better)
Result 4: 7.98, SE +/- 0.01, N = 3, MIN: 7.83 / MAX: 8.11
Result 1: 7.99, SE +/- 0.01, N = 3, MIN: 7.82 / MAX: 8.88
Result 2: 7.99, SE +/- 0.01, N = 3, MIN: 7.82 / MAX: 8.7
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218 (ms, Fewer Is Better)
Result 4: 6.53, SE +/- 0.01, N = 3, MIN: 6.42 / MAX: 9.19
Result 1: 6.54, SE +/- 0.01, N = 3, MIN: 6.44 / MAX: 6.79
Result 2: 6.54, SE +/- 0.01, N = 3, MIN: 6.45 / MAX: 6.8
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20201218 (ms, Fewer Is Better)
Result 4: 9.76, SE +/- 0.01, N = 3, MIN: 9.43 / MAX: 10.04
Result 1: 9.78, SE +/- 0.01, N = 3, MIN: 9.42 / MAX: 10.68
Result 2: 9.80, SE +/- 0.01, N = 3, MIN: 9.44 / MAX: 10.88
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20201218 (ms, Fewer Is Better)
Result 2: 6.73, SE +/- 0.07, N = 3, MIN: 6.56 / MAX: 7.06
Result 1: 6.74, SE +/- 0.06, N = 3, MIN: 6.58 / MAX: 7.21
Result 4: 6.79, SE +/- 0.01, N = 3, MIN: 6.72 / MAX: 6.94
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20201218 (ms, Fewer Is Better)
Result 1: 10.91, SE +/- 0.01, N = 3, MIN: 10.85 / MAX: 11.97
Result 2: 10.91, SE +/- 0.01, N = 3, MIN: 10.67 / MAX: 11.08
Result 4: 10.98, SE +/- 0.09, N = 3, MIN: 10.67 / MAX: 59.78
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20201218 (ms, Fewer Is Better)
Result 4: 2.63, SE +/- 0.00, N = 3, MIN: 2.6 / MAX: 3.38
Result 1: 2.70, SE +/- 0.07, N = 3, MIN: 2.6 / MAX: 11.85
Result 2: 2.70, SE +/- 0.08, N = 3, MIN: 2.58 / MAX: 11.85
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20201218 (ms, Fewer Is Better)
Result 1: 21.75, SE +/- 0.04, N = 3, MIN: 21.16 / MAX: 22.66
Result 2: 21.75, SE +/- 0.05, N = 3, MIN: 21.14 / MAX: 22.58
Result 4: 21.76, SE +/- 0.01, N = 3, MIN: 21.15 / MAX: 22.67
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20201218 (ms, Fewer Is Better)
Result 4: 88.84, SE +/- 0.10, N = 3, MIN: 88.59 / MAX: 89.77
Result 2: 88.86, SE +/- 0.08, N = 3, MIN: 88.59 / MAX: 91.44
Result 1: 89.22, SE +/- 0.11, N = 3, MIN: 88.84 / MAX: 148.87
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20201218 (ms, Fewer Is Better)
Result 1: 21.88, SE +/- 0.03, N = 3, MIN: 21.61 / MAX: 22.98
Result 2: 21.90, SE +/- 0.01, N = 3, MIN: 21.62 / MAX: 22.75
Result 4: 21.90, SE +/- 0.03, N = 3, MIN: 21.58 / MAX: 23.1
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20201218 (ms, Fewer Is Better)
Result 1: 19.51, SE +/- 0.02, N = 3, MIN: 19.38 / MAX: 20.06
Result 2: 19.53, SE +/- 0.00, N = 3, MIN: 19.41 / MAX: 20.29
Result 4: 19.53, SE +/- 0.02, N = 3, MIN: 19.4 / MAX: 20.43
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20201218 (ms, Fewer Is Better)
Result 1: 46.62, SE +/- 0.04, N = 3, MIN: 45.3 / MAX: 48.43
Result 2: 46.66, SE +/- 0.03, N = 3, MIN: 45.36 / MAX: 47.7
Result 4: 46.68, SE +/- 0.02, N = 3, MIN: 45.41 / MAX: 48.98
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20201218 (ms, Fewer Is Better)
Result 4: 40.86, SE +/- 0.02, N = 3, MIN: 40.54 / MAX: 41.98
Result 1: 40.95, SE +/- 0.04, N = 3, MIN: 40.63 / MAX: 41.77
Result 2: 40.96, SE +/- 0.08, N = 3, MIN: 40.51 / MAX: 41.85
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20201218 (ms, Fewer Is Better)
Result 2: 31.23, SE +/- 0.02, N = 3, MIN: 31.12 / MAX: 32.11
Result 4: 31.24, SE +/- 0.01, N = 3, MIN: 31.14 / MAX: 32.24
Result 1: 31.30, SE +/- 0.05, N = 3, MIN: 31.14 / MAX: 32.35
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20201218 (ms, Fewer Is Better)
Result 2: 17.52, SE +/- 0.04, N = 3, MIN: 17.39 / MAX: 18.59
Result 4: 17.53, SE +/- 0.01, N = 3, MIN: 17.46 / MAX: 18.5
Result 1: 17.56, SE +/- 0.03, N = 3, MIN: 17.44 / MAX: 18.56
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TNN

Target: CPU - Model: MobileNet v2

TNN 0.2.3 (ms, Fewer Is Better)
Result 2: 362.72, SE +/- 0.12, N = 3, MIN: 361.88 / MAX: 365.57
Result 1: 363.11, SE +/- 0.30, N = 3, MIN: 361.8 / MAX: 364.48
Result 4: 363.98, SE +/- 0.79, N = 3, MIN: 362.08 / MAX: 366.5
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.2.3 (ms, Fewer Is Better)
Result 2: 345.25, SE +/- 0.23, N = 3, MIN: 344.1 / MAX: 347.01
Result 4: 345.34, SE +/- 0.13, N = 3, MIN: 344.17 / MAX: 346.64
Result 1: 345.56, SE +/- 0.06, N = 3, MIN: 344.76 / MAX: 347.02
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

OpenFOAM

Input: Motorbike 30M

OpenFOAM 8 (Seconds, Fewer Is Better)
Result 1: 386.71, SE +/- 0.17, N = 3
Result 2: 387.68, SE +/- 0.21, N = 3
Result 4: 388.02, SE +/- 0.33, N = 3
Result 3: 388.45, SE +/- 0.53, N = 3
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lspecie -lfiniteVolume -lfvOptions -lgenericPatchFields -lmeshTools -lsampling -lOpenFOAM -ldl -lm

libavif avifenc

Encoder Speed: 0

libavif avifenc 0.7.3 (Seconds, Fewer Is Better)
Result 1: 203.25, SE +/- 0.71, N = 3
Result 4: 203.68, SE +/- 0.48, N = 3
Result 2: 203.85, SE +/- 0.20, N = 3
1. (CXX) g++ options: -O3 -fPIC

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.7.3 (Seconds, Fewer Is Better)
Result 1: 119.20, SE +/- 0.12, N = 3
Result 4: 119.62, SE +/- 0.11, N = 3
Result 2: 120.28, SE +/- 0.44, N = 3
1. (CXX) g++ options: -O3 -fPIC

libavif avifenc

Encoder Speed: 8

libavif avifenc 0.7.3 (Seconds, Fewer Is Better)
Result 1: 8.474, SE +/- 0.004, N = 3
Result 2: 8.512, SE +/- 0.011, N = 3
Result 4: 8.521, SE +/- 0.014, N = 3
1. (CXX) g++ options: -O3 -fPIC

libavif avifenc

Encoder Speed: 10

libavif avifenc 0.7.3 (Seconds, Fewer Is Better)
Result 4: 7.807, SE +/- 0.027, N = 3
Result 1: 7.814, SE +/- 0.002, N = 3
Result 2: 7.867, SE +/- 0.053, N = 3
1. (CXX) g++ options: -O3 -fPIC

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)
Result 1: 363.21, SE +/- 0.35, N = 3
Result 2: 363.92, SE +/- 0.23, N = 3
Result 4: 364.06, SE +/- 0.06, N = 3

Build2

Time To Compile

Build2 0.13 (Seconds, Fewer Is Better)
Result 1: 363.91, SE +/- 0.47, N = 3
Result 2: 367.31, SE +/- 4.00, N = 3
Result 4: 367.59, SE +/- 4.15, N = 3

Timed Eigen Compilation

Time To Compile

Timed Eigen Compilation 3.3.9 (Seconds, Fewer Is Better)
Result 1: 89.89, SE +/- 0.03, N = 3
Result 4: 90.19, SE +/- 0.08, N = 3
Result 2: 90.26, SE +/- 0.13, N = 3

Monkey Audio Encoding

WAV To APE

Monkey Audio Encoding 3.99.6 (Seconds, Fewer Is Better)
Result 2: 13.19, SE +/- 0.04, N = 5
Result 4: 13.22, SE +/- 0.05, N = 5
Result 1: 13.27, SE +/- 0.09, N = 5
1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.3.1 (Seconds, Fewer Is Better)
Result 2: 9.764, SE +/- 0.068, N = 5
Result 1: 9.789, SE +/- 0.037, N = 5
Result 4: 9.909, SE +/- 0.044, N = 5
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

Gcrypt Library

Gcrypt Library 1.9 (Seconds, Fewer Is Better)
Result 4: 232.48, SE +/- 0.63, N = 3
Result 2: 232.51, SE +/- 0.86, N = 3
Result 1: 233.10, SE +/- 0.59, N = 3
1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

WebP2 Image Encode

Encode Settings: Default

WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
Result 2: 10.51, SE +/- 0.03, N = 3
Result 1: 10.53, SE +/- 0.02, N = 3
Result 4: 10.56, SE +/- 0.01, N = 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
Result 1: 639.61, SE +/- 0.26, N = 3
Result 4: 641.29, SE +/- 1.61, N = 3
Result 2: 642.37, SE +/- 0.75, N = 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
Result 4: 1177.83, SE +/- 1.46, N = 3
Result 1: 1178.69, SE +/- 0.76, N = 3
Result 2: 1180.02, SE +/- 0.85, N = 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
Result 4: 37.42, SE +/- 0.01, N = 3
Result 1: 37.44, SE +/- 0.00, N = 3
Result 2: 37.44, SE +/- 0.02, N = 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
Result 4: 2397.75, SE +/- 0.76, N = 3
Result 1: 2398.27, SE +/- 1.91, N = 3
Result 2: 2400.66, SE +/- 0.23, N = 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

ASTC Encoder

Preset: Fast

ASTC Encoder 2.0 (Seconds, Fewer Is Better)
Result 1: 9.42, SE +/- 0.07, N = 3
Result 2: 9.53, SE +/- 0.07, N = 3
Result 4: 9.53, SE +/- 0.07, N = 3
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Medium

ASTC Encoder 2.0 (Seconds, Fewer Is Better)
Result 4: 12.28, SE +/- 0.01, N = 3
Result 1: 12.30, SE +/- 0.01, N = 3
Result 2: 12.31, SE +/- 0.01, N = 3
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.0 (Seconds, Fewer Is Better)
Result 4: 80.22, SE +/- 0.06, N = 3
Result 1: 80.27, SE +/- 0.05, N = 3
Result 2: 80.32, SE +/- 0.05, N = 3
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.0 (Seconds, Fewer Is Better)
Result 1: 643.45, SE +/- 0.13, N = 3
Result 4: 643.57, SE +/- 0.29, N = 3
Result 2: 643.88, SE +/- 0.27, N = 3
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

WavPack Audio Encoding

WAV To WavPack

WavPack Audio Encoding 5.3 (Seconds, Fewer Is Better)
Result 4: 16.93, SE +/- 0.04, N = 5
Result 2: 16.94, SE +/- 0.05, N = 5
Result 1: 17.01, SE +/- 0.07, N = 5
1. (CXX) g++ options: -rdynamic

GnuPG

2.7GB Sample File Encryption

GnuPG 2.2.27 (Seconds, Fewer Is Better)
Result 2: 73.11, SE +/- 0.55, N = 3
Result 4: 73.12, SE +/- 0.59, N = 3
Result 1: 73.30, SE +/- 0.81, N = 3
1. (CC) gcc options: -O2

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

Unpacking Firefox 84.0 (Seconds, Fewer Is Better)
Result 4: 23.49, SE +/- 0.12, N = 4
Result 1: 23.67, SE +/- 0.07, N = 4
Result 2: 23.72, SE +/- 0.05, N = 4

QMCPACK

Input: simple-H2O

QMCPACK 3.10 (Total Execution Time - Seconds, Fewer Is Better)
Result 1: 34.19, SE +/- 0.16, N = 3
Result 3: 34.40, SE +/- 0.26, N = 3
Result 4: 34.75, SE +/- 0.29, N = 3
Result 2: 34.81, SE +/- 0.26, N = 3
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm


Phoronix Test Suite v10.8.4