core-i7-8086k-2021

Intel Core i7-8086K testing with a ASUS PRIME Z370-A (1802 BIOS) and ASUS Intel UHD 630 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102185-HA-COREI780895&sor&grs.

core-i7-8086k-2021 ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen Resolution121a34Intel Core i7-8086K @ 5.00GHz (6 Cores / 12 Threads)ASUS PRIME Z370-A (1802 BIOS)Intel 8th Gen Core8GB118GB INTEL SSDPEK1W120GAASUS Intel UHD 630 3GB (1200MHz)Realtek ALC1220G237HLIntel I219-VUbuntu 20.045.9.0-050900rc8daily20201009-generic (x86_64) 20201008GNOME Shell 3.36.4X Server 1.20.84.6 Mesa 20.0.81.2.131GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 - Thermald 1.9.1 Python Details- 1, 2, 3, 4: Python 2.7.18 + Python 3.8.5Security Details- itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

core-i7-8086k-2021 redis: LPOPonednn: IP Shapes 3D - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUredis: GETaskap: tConvolve OpenMP - Griddingonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUaskap: tConvolve OpenMP - Degriddingonednn: Convolution Batch Shapes Auto - f32 - CPUrav1e: 10onednn: IP Shapes 1D - f32 - CPUmnn: MobileNetV2_224redis: LPUSHnpb: EP.Dngspice: C7552jpegxl-decode: Allnpb: EP.Clzbench: XZ 0 - Compressionredis: SETvkmark: 1920 x 1080onednn: Recurrent Neural Network Inference - f32 - CPUqmcpack: simple-H2Oonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUngspice: C2670lzbench: Crush 0 - Compressiononednn: Recurrent Neural Network Inference - u8s8f32 - CPUrav1e: 5redis: SADDonednn: Deconvolution Batch shapes_1d - f32 - CPUjpegxl: JPEG - 5jpegxl: PNG - 8askap: Hogbom Clean OpenMPonnx: super-resolution-10 - OpenMP CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUwarsow: 1920 x 1080lammps: Rhodopsin Proteinaskap: tConvolve MPI - Degriddingonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUdav1d: Chimera 1080plzbench: Brotli 0 - Compressionlulesh: lzbench: Zstd 8 - Compressionquantlib: lzbench: Brotli 2 - Compressionjpegxl: JPEG - 8tnn: CPU - MobileNet v2jpegxl-decode: 1lzbench: Zstd 1 - Compressionmnn: mobilenet-v1-1.0jpegxl: JPEG - 7dav1d: Summer Nature 1080pgnupg: 2.7GB Sample File Encryptionlzbench: XZ 0 - Decompressionmnn: inception-v3etcpak: ETC1 + Ditheringjpegxl: PNG - 7onnx: bertsquad-10 - OpenMP CPUrav1e: 6lzbench: Zstd 1 - Decompressionmnn: resnet-v2-50lzbench: Zstd 8 - Decompressionamg: onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUetcpak: DXT1unpack-firefox: firefox-84.0.source.tar.xzkripke: webp2: Quality 95, Compression Effort 7v-ray: CPUopenfoam: Motorbike 30Maskap: tConvolve MPI - Griddingjpegxl: PNG - 5synthmark: VoiceMark_100onnx: shufflenet-v2-10 - OpenMP CPUgcrypt: etcpak: ETC1lzbench: Brotli 0 - Decompressionpennant: leblancbigpennant: sedovbiglzbench: Libdeflate 1 - Compressiondav1d: Summer Nature 4Kmnn: SqueezeNetV1.0askap: tConvolve MT - Degriddingparaview: Wavelet Contour - 1920 x 1080dav1d: Chimera 1080p 10-bitcloverleaf: Lagrangian-Eulerian Hydrodynamicswebp2: Defaultparaview: Wavelet Contour - 1920 x 1080onnx: yolov4 - OpenMP CPUaskap: tConvolve MT - Griddingwebp2: Quality 75, Compression Effort 7gromacs: water_GMX50_bareencode-ape: WAV To APEcryptsetup: Serpent-XTS 256b Encryptionetcpak: ETC2cryptsetup: Twofish-XTS 256b Encryptionencode-wavpack: WAV To WavPackwebp2: Quality 100, Compression Effort 5encode-opus: WAV To Opus Encoderav1e: 1toybrot: C++ Taskscryptsetup: AES-XTS 256b Decryptioncryptsetup: Serpent-XTS 512b Encryptiononednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUlzbench: Crush 0 - Decompressionwebp2: Quality 100, Lossless Compressioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: AES-XTS 256b Encryptiontnn: CPU - SqueezeNet v1.1cryptsetup: AES-XTS 512b Decryptioncp2k: Fayalite-FIST Datacryptsetup: Serpent-XTS 256b Decryptionparaview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080lzbench: Brotli 2 - Decompressiononednn: Recurrent Neural Network Training - u8s8f32 - CPUcryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 256b Decryptionopenfoam: Motorbike 60Mcryptsetup: AES-XTS 512b Encryptionnpb: LU.Ccryptsetup: PBKDF2-whirlpoolbuild-godot: Time To Compilelzbench: Libdeflate 1 - Decompressiontoybrot: C++ Threadscryptsetup: PBKDF2-sha512onednn: IP Shapes 1D - u8s8f32 - CPUtoybrot: OpenMPonnx: fcn-resnet101-11 - OpenMP CPU121a343323281.833138734.001248.232218.83.8723.2152023847.68945.8190.134181.53942.53492369135.507192230.3825.6972220.83108.8841272223.461.3302635219.7561.730.86189.275469684.65.0291967.97532.045221585.76841082898.622228.80298.62746.205923.83361.59485.4360.75513343.382357.3329.725641.731205540.89122472502456331499.70616.78936331390600.5857483273.4958.66783.25216469179.825374.77572768.21744105.401266139.305.5082018.8038.0798.49171.565.726396.7193521145.05325.1650.7819.805906.6213.104502.613.03018.3717.6650.464619762634.2908.35851173.138924.7503.62617.1270.6222356.31046.02925.423.61377.784840506.8507.21218.522358.224235.45855749184.259129561838197471961735622085673.468.272343.616702908151.921235.6616.75652.104722226.2620.06853.7844.501013.2252059021.04949.6990.163178.47941.53492342227.427112248.6625.8632255.93108.3291282254.881.3112641940.86.4169161.030.87189.27547258.8911285.45.0491954.986.84973526.975261570.79681072901.022328.68301.24045.975893.86561.34484.0060.28713443.135357.5199.685621.742205740.63722422501188674.239041496.86216.78436317627600.3907462273.321929.5058.42779.63516544179.221373.76272668.33411105.4707265139.465.5222017.1038.0898.50171.345.719396.7913511144.51324.8870.7819.814907.7212.969503.013.00018.3297.6740.464621002630.7906.53.930884025.104023.675861175.087924.5502.82621.1270.9432359.41045.369925.223.63378.0688404023.42506.5506.61219.632355.824224.80856679184.25512946187819734822.1415461728628.306433.6181716.84242.0713120.07704.507542216.002220.332222.976.360658.935906.787794.236923.932054024.784025.374025.942.142752070909.2510.30473.909502927900.671240.4417.88122.182442225.0220.75433.8964.633683.1732072916.33945.8089.729177.78927.39492377147.507072250.3425.5192253.87107.1671282256.421.3192634387.836.3340460.990.87188.68046728.9734085.45.0341948.486.79490527.855251572.44301072875.722428.58299.73845.845883.86061.10481.6560.57313343.311354.9349.655621.740204740.80422472489440674.262661495.40916.87336175097597.4937445274.381920.0258.38782.58016483179.214374.17972968.49462105.8152265139.085.5212017.9537.9698.41171.055.711395.6223511144.37325.7480.7829.800908.9212.567503.813.00318.3617.6820.463620092633.5907.53.935294029.874030.565861173.720925.4503.42617.5271.0162358.11044.646925.923.60377.6108404028.19506.2507.21219.872355.624210.11856679184.35812956188419747242.1422061733622109775.010.40133.893532919971.421322.7817.86372.160422322.0420.76843.8414.602303.2522051125.08928.5288.280177.79946.58502385714.257092253.5525.9362256.59107.3411292254.151.3262607688.026.3482661.570.87187.15047068.9875685.54.9961948.616.85494527.215271572.55791082901.822428.55299.92545.805933.85061.11482.1360.41513443.452356.9109.655601.730206040.80622562487054334.253931503.94816.86536141050598.2767447274.671923.1458.42782.34116540179.061375.34072968.49578105.7825265138.975.5272012.0138.0698.19171.585.709396.6563511141.91325.6230.7809.825908.3212.984503.813.00018.3527.6670.464621052636.0907.63.938134032.074030.495861174.594923.9503.62620.2271.0102359.61044.592924.723.62377.9488414028.18506.8507.21219.282358.024209.08855749184.41312956188019747192.141486174362OpenBenchmarking.org

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP1423700K1400K2100K2800K3500KSE +/- 19183.06, N = 3SE +/- 8626.43, N = 3SE +/- 11005.15, N = 3SE +/- 32269.04, N = 33323281.832109775.002085673.462070909.251. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU21a343691215SE +/- 0.01851, N = 3SE +/- 0.02438, N = 3SE +/- 0.01907, N = 3SE +/- 0.01545, N = 38.272348.3064310.3047010.40130MIN: 7.96MIN: 8.05MIN: 10.11MIN: 10.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU21a430.87961.75922.63883.51844.398SE +/- 0.02046, N = 3SE +/- 0.00712, N = 3SE +/- 0.01176, N = 3SE +/- 0.00680, N = 33.616703.618173.893533.90950MIN: 3.52MIN: 3.55MIN: 3.82MIN: 3.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET1342700K1400K2100K2800K3500KSE +/- 18933.24, N = 3SE +/- 15356.90, N = 3SE +/- 18481.68, N = 3SE +/- 44568.10, N = 33138734.002927900.672919971.422908151.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding413230060090012001500SE +/- 14.50, N = 3SE +/- 9.83, N = 3SE +/- 8.41, N = 3SE +/- 14.95, N = 51322.781248.231240.441235.661. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU21a4348121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 316.7616.8417.8617.88MIN: 16.26MIN: 16.55MIN: 17.69MIN: 17.631. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1a2430.4910.9821.4731.9642.455SE +/- 0.00510, N = 3SE +/- 0.02847, N = 3SE +/- 0.00374, N = 3SE +/- 0.00718, N = 32.071312.104722.160422.18244MIN: 2.01MIN: 2MIN: 2.12MIN: 2.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding42315001000150020002500SE +/- 6.77, N = 3SE +/- 4.57, N = 5SE +/- 6.22, N = 3SE +/- 0.00, N = 32322.042226.262225.022218.801. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU21a34510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 320.0720.0820.7520.77MIN: 19.96MIN: 19.98MIN: 20.67MIN: 20.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1031420.87661.75322.62983.50644.383SE +/- 0.026, N = 3SE +/- 0.019, N = 3SE +/- 0.014, N = 3SE +/- 0.061, N = 33.8963.8723.8413.784

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU21a431.04262.08523.12784.17045.213SE +/- 0.00886, N = 3SE +/- 0.01402, N = 3SE +/- 0.00329, N = 3SE +/- 0.00530, N = 34.501014.507544.602304.63368MIN: 4.37MIN: 4.39MIN: 4.51MIN: 4.541. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_22431240.73171.46342.19512.92683.6585SE +/- 0.041, N = 3SE +/- 0.012, N = 3SE +/- 0.020, N = 3SE +/- 0.025, N = 33.1733.2153.2253.252MIN: 3.06 / MAX: 7.08MIN: 3.13 / MAX: 20.95MIN: 3.1 / MAX: 4.26MIN: 3.17 / MAX: 7.81. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH3241400K800K1200K1600K2000KSE +/- 13361.81, N = 3SE +/- 9300.78, N = 3SE +/- 19842.96, N = 3SE +/- 25186.84, N = 52072916.332059021.042051125.082023847.681. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D21342004006008001000SE +/- 0.61, N = 3SE +/- 0.11, N = 3SE +/- 1.74, N = 3SE +/- 8.82, N = 12949.69945.81945.80928.521. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552431220406080100SE +/- 0.53, N = 3SE +/- 0.50, N = 3SE +/- 0.30, N = 3SE +/- 0.25, N = 388.2889.7390.1390.161. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

JPEG XL Decoding

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: All12434080120160200SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.03, N = 3181.53178.47177.79177.78

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C41232004006008001000SE +/- 1.34, N = 3SE +/- 0.53, N = 3SE +/- 2.85, N = 3SE +/- 13.08, N = 3946.58942.53941.53927.391. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression43211122334455504949491. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET4312500K1000K1500K2000K2500KSE +/- 3546.38, N = 3SE +/- 12358.77, N = 3SE +/- 13128.97, N = 3SE +/- 10631.93, N = 32385714.252377147.502369135.502342227.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 10801243160320480640800SE +/- 2.73, N = 3SE +/- 1.86, N = 3SE +/- 1.86, N = 37197117097071. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1a12345001000150020002500SE +/- 1.62, N = 3SE +/- 1.59, N = 3SE +/- 1.21, N = 3SE +/- 1.92, N = 3SE +/- 3.29, N = 32216.002230.382248.662250.342253.55MIN: 2211.59MIN: 2226.39MIN: 2245.47MIN: 2246.36MIN: 2246.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O3124612182430SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.21, N = 325.5225.7025.8625.941. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1a13245001000150020002500SE +/- 4.85, N = 3SE +/- 5.92, N = 3SE +/- 1.66, N = 3SE +/- 1.19, N = 3SE +/- 1.03, N = 32220.332220.832253.872255.932256.59MIN: 2210.09MIN: 2210.57MIN: 2248.87MIN: 2252MIN: 2253.051. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670342120406080100SE +/- 0.36, N = 3SE +/- 0.54, N = 3SE +/- 0.77, N = 3SE +/- 0.22, N = 3107.17107.34108.33108.881. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression4321306090120150SE +/- 0.33, N = 3SE +/- 1.00, N = 31291281281271. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1a14235001000150020002500SE +/- 2.04, N = 3SE +/- 3.64, N = 3SE +/- 3.51, N = 3SE +/- 5.08, N = 3SE +/- 3.29, N = 32222.972223.462254.152254.882256.42MIN: 2219.12MIN: 2215.76MIN: 2247.34MIN: 2245.82MIN: 2250.831. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 514320.29930.59860.89791.19721.4965SE +/- 0.005, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.005, N = 31.3301.3261.3191.311

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD2134600K1200K1800K2400K3000KSE +/- 12039.46, N = 3SE +/- 27951.55, N = 3SE +/- 17037.49, N = 3SE +/- 28568.59, N = 132641940.802635219.752634387.832607688.021. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU341a2246810SE +/- 0.00815, N = 3SE +/- 0.01283, N = 3SE +/- 0.02142, N = 3SE +/- 0.01672, N = 36.334046.348266.360656.41691MIN: 6.28MIN: 6.29MIN: 6.27MIN: 6.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 514231428425670SE +/- 0.23, N = 3SE +/- 0.19, N = 3SE +/- 0.47, N = 3SE +/- 0.30, N = 361.7361.5761.0360.991. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 843210.19580.39160.58740.78320.979SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.870.870.870.861. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP21344080120160200SE +/- 0.24, N = 3SE +/- 0.32, N = 3SE +/- 0.21, N = 3SE +/- 0.31, N = 3189.28189.28188.68187.151. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU241310002000300040005000SE +/- 9.84, N = 3SE +/- 4.67, N = 3SE +/- 8.06, N = 3SE +/- 9.22, N = 347254706469646721. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU21a343691215SE +/- 0.02003, N = 3SE +/- 0.00635, N = 3SE +/- 0.00861, N = 3SE +/- 0.01901, N = 38.891128.935908.973408.98756MIN: 8.82MIN: 8.89MIN: 8.92MIN: 8.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Warsow

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1080432120406080100SE +/- 0.12, N = 3SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.47, N = 385.585.485.484.6

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein23141.1362.2723.4084.5445.68SE +/- 0.005, N = 3SE +/- 0.021, N = 3SE +/- 0.013, N = 3SE +/- 0.025, N = 35.0495.0345.0294.9961. (CXX) g++ options: -O3 -pthread -lm

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding1243400800120016002000SE +/- 0.00, N = 3SE +/- 6.50, N = 3SE +/- 11.14, N = 3SE +/- 0.00, N = 31967.971954.981948.611948.481. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1a324246810SE +/- 0.01649, N = 3SE +/- 0.00909, N = 3SE +/- 0.06379, N = 3SE +/- 0.06163, N = 36.787796.794906.849736.85494MIN: 6.69MIN: 6.7MIN: 6.72MIN: 6.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p1342120240360480600SE +/- 0.61, N = 3SE +/- 0.26, N = 3SE +/- 0.66, N = 3SE +/- 1.05, N = 3532.04527.85527.21526.97MIN: 393.83 / MAX: 800.33MIN: 391.98 / MAX: 817.62MIN: 391.75 / MAX: 803.53MIN: 392.13 / MAX: 800.131. (CC) gcc options: -pthread

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression4231110220330440550SE +/- 1.00, N = 3SE +/- 1.33, N = 35275265255221. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3143230060090012001500SE +/- 1.22, N = 3SE +/- 1.89, N = 3SE +/- 0.47, N = 3SE +/- 0.70, N = 31585.771572.561572.441570.801. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression413220406080100SE +/- 0.58, N = 31081081071071. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.2142136001200180024003000SE +/- 5.47, N = 3SE +/- 9.53, N = 3SE +/- 8.50, N = 3SE +/- 10.28, N = 32901.82901.02898.62875.71. (CXX) g++ options: -O3 -march=native -rdynamic

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression432150100150200250SE +/- 0.58, N = 32242242232221. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 81234714212835SE +/- 0.08, N = 3SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 0.17, N = 328.8028.6828.5828.551. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2134270140210280350SE +/- 0.27, N = 3SE +/- 0.54, N = 3SE +/- 1.01, N = 3SE +/- 0.93, N = 3298.63299.74299.93301.24MIN: 297.04 / MAX: 300.97MIN: 297.93 / MAX: 302.27MIN: 297.54 / MAX: 306.69MIN: 298.47 / MAX: 304.261. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

JPEG XL Decoding

CPU Threads: 1

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: 112341020304050SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 346.2045.9745.8445.80

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression4123130260390520650SE +/- 0.67, N = 3SE +/- 1.00, N = 35935925895881. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.014320.86961.73922.60883.47844.348SE +/- 0.008, N = 3SE +/- 0.004, N = 3SE +/- 0.023, N = 3SE +/- 0.010, N = 33.8333.8503.8603.865MIN: 3.76 / MAX: 6.28MIN: 3.8 / MAX: 4.36MIN: 3.77 / MAX: 20.88MIN: 3.81 / MAX: 5.881. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 712431428425670SE +/- 0.15, N = 3SE +/- 0.46, N = 3SE +/- 0.21, N = 3SE +/- 0.10, N = 361.5961.3461.1161.101. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p1243110220330440550SE +/- 0.56, N = 3SE +/- 0.94, N = 3SE +/- 0.31, N = 3SE +/- 1.00, N = 3485.43484.00482.13481.65MIN: 440.26 / MAX: 533.5MIN: 431.31 / MAX: 528.36MIN: 428.5 / MAX: 525.3MIN: 426.68 / MAX: 527.141. (CC) gcc options: -pthread

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption24311428425670SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.44, N = 360.2960.4260.5760.761. (CC) gcc options: -O2

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression4231306090120150SE +/- 0.33, N = 31341341331331. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v323141020304050SE +/- 0.27, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 343.1443.3143.3843.45MIN: 42.66 / MAX: 61.01MIN: 42.98 / MAX: 58.83MIN: 43.06 / MAX: 61.01MIN: 43.05 / MAX: 60.531. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering214380160240320400SE +/- 0.12, N = 3SE +/- 0.18, N = 3SE +/- 0.35, N = 3SE +/- 2.23, N = 3357.52357.33356.91354.931. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 712433691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 39.729.689.659.651. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU1324120240360480600SE +/- 0.50, N = 3SE +/- 0.76, N = 3SE +/- 1.36, N = 3SE +/- 0.73, N = 35645625625601. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 623140.3920.7841.1761.5681.96SE +/- 0.008, N = 3SE +/- 0.005, N = 3SE +/- 0.012, N = 3SE +/- 0.009, N = 31.7421.7401.7311.730

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression4213400800120016002000SE +/- 1.45, N = 3SE +/- 0.67, N = 3SE +/- 3.18, N = 3SE +/- 12.33, N = 320602057205520471. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-502341918273645SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 340.6440.8040.8140.89MIN: 40.34 / MAX: 57.72MIN: 40.51 / MAX: 56.52MIN: 40.64 / MAX: 58.22MIN: 40.57 / MAX: 57.961. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression43125001000150020002500SE +/- 6.36, N = 3SE +/- 5.36, N = 3SE +/- 4.18, N = 322562247224722421. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2123450M100M150M200M250MSE +/- 323790.40, N = 3SE +/- 614809.90, N = 3SE +/- 1277684.47, N = 3SE +/- 1517817.32, N = 32502456332501188672489440672487054331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1a2430.95911.91822.87733.83644.7955SE +/- 0.00597, N = 3SE +/- 0.00463, N = 3SE +/- 0.00315, N = 3SE +/- 0.00176, N = 34.236924.239044.253934.26266MIN: 4.2MIN: 4.2MIN: 4.22MIN: 4.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1412330060090012001500SE +/- 0.57, N = 3SE +/- 3.68, N = 3SE +/- 2.90, N = 3SE +/- 3.85, N = 31503.951499.711496.861495.411. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz214348121620SE +/- 0.06, N = 4SE +/- 0.05, N = 4SE +/- 0.02, N = 4SE +/- 0.06, N = 416.7816.7916.8716.87

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.412348M16M24M32M40MSE +/- 116888.91, N = 3SE +/- 126361.09, N = 3SE +/- 107755.07, N = 3SE +/- 114405.39, N = 3363313903631762736175097361410501. (CXX) g++ options: -O3 -fopenmp

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 73421130260390520650SE +/- 0.46, N = 3SE +/- 1.37, N = 3SE +/- 0.37, N = 3SE +/- 1.42, N = 3597.49598.28600.39600.591. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPU124316003200480064008000SE +/- 17.10, N = 3SE +/- 28.75, N = 3SE +/- 42.90, N = 3SE +/- 32.98, N = 37483746274477445

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M213460120180240300SE +/- 0.06, N = 3SE +/- 0.20, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 3273.32273.49274.38274.671. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lgenericPatchFields -lscotchDecomp -lptscotchDecomp -lmeshTools -ldynamicMesh -lOpenFOAM -ldl -lm

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding243400800120016002000SE +/- 10.92, N = 3SE +/- 6.24, N = 3SE +/- 9.37, N = 21929.501923.141920.021. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 514231326395265SE +/- 0.10, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 0.20, N = 358.6658.4258.4258.381. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_10013422004006008001000SE +/- 0.34, N = 3SE +/- 0.39, N = 3SE +/- 0.30, N = 3SE +/- 3.25, N = 3783.25782.58782.34779.641. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU24314K8K12K16K20KSE +/- 15.42, N = 3SE +/- 16.98, N = 3SE +/- 45.62, N = 3SE +/- 8.66, N = 3165441654016483164691. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.943214080120160200SE +/- 0.14, N = 3SE +/- 0.52, N = 3SE +/- 0.37, N = 3SE +/- 0.39, N = 3179.06179.21179.22179.831. (CC) gcc options: -O2 -fvisibility=hidden

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1413280160240320400SE +/- 0.15, N = 3SE +/- 0.58, N = 3SE +/- 0.53, N = 3SE +/- 1.79, N = 3375.34374.78374.18373.761. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression4312160320480640800SE +/- 0.67, N = 3SE +/- 1.00, N = 3SE +/- 2.03, N = 37297297277261. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig12341530456075SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 368.2268.3368.4968.501. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig124320406080100SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3105.40105.47105.78105.821. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression143260120180240300SE +/- 0.58, N = 3SE +/- 1.00, N = 3SE +/- 1.53, N = 3SE +/- 1.20, N = 32662652652651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K2134306090120150SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 3139.46139.30139.08138.97MIN: 131.42 / MAX: 157.06MIN: 131.22 / MAX: 157.1MIN: 130.77 / MAX: 156.83MIN: 131.06 / MAX: 156.711. (CC) gcc options: -pthread

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.013241.24362.48723.73084.97446.218SE +/- 0.023, N = 3SE +/- 0.037, N = 3SE +/- 0.014, N = 3SE +/- 0.015, N = 35.5085.5215.5225.527MIN: 5.39 / MAX: 7.3MIN: 5.38 / MAX: 7.49MIN: 5.41 / MAX: 7.94MIN: 5.39 / MAX: 23.381. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding1324400800120016002000SE +/- 3.32, N = 3SE +/- 3.48, N = 3SE +/- 2.65, N = 3SE +/- 0.00, N = 32018.802017.952017.102012.011. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10802143918273645SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 338.0838.0738.0637.96

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit213420406080100SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 398.5098.4998.4198.19MIN: 64.06 / MAX: 228.81MIN: 63.95 / MAX: 228.95MIN: 64.05 / MAX: 227.43MIN: 63.95 / MAX: 228.891. (CC) gcc options: -pthread

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics32144080120160200SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3171.05171.34171.56171.581. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default43211.28842.57683.86525.15366.442SE +/- 0.009, N = 3SE +/- 0.027, N = 3SE +/- 0.022, N = 3SE +/- 0.017, N = 35.7095.7115.7195.7261. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080214390180270360450SE +/- 0.59, N = 3SE +/- 1.00, N = 3SE +/- 0.91, N = 3SE +/- 0.27, N = 3396.79396.72396.66395.62

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU143280160240320400SE +/- 0.88, N = 3SE +/- 0.60, N = 3SE +/- 0.60, N = 3SE +/- 1.30, N = 33523513513511. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding12342004006008001000SE +/- 0.49, N = 3SE +/- 0.14, N = 3SE +/- 0.24, N = 3SE +/- 0.00, N = 31145.051144.511144.371141.911. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7214370140210280350SE +/- 0.58, N = 3SE +/- 0.94, N = 3SE +/- 0.66, N = 3SE +/- 0.98, N = 3324.89325.17325.62325.751. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare32140.1760.3520.5280.7040.88SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 30.7820.7810.7810.7801. (CXX) g++ options: -O3 -pthread

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE31243691215SE +/- 0.015, N = 5SE +/- 0.014, N = 5SE +/- 0.016, N = 5SE +/- 0.016, N = 59.8009.8059.8149.8251. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption34212004006008001000SE +/- 0.48, N = 3SE +/- 0.87, N = 3SE +/- 0.61, N = 3SE +/- 1.39, N = 3908.9908.3907.7906.6

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2142350100150200250SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.42, N = 3213.10212.98212.97212.571. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption4321110220330440550SE +/- 0.38, N = 3SE +/- 0.06, N = 3SE +/- 0.66, N = 3SE +/- 1.12, N = 3503.8503.8503.0502.6

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack24313691215SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.02, N = 513.0013.0013.0013.031. (CXX) g++ options: -rdynamic

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 52431510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 318.3318.3518.3618.371. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1423246810SE +/- 0.010, N = 5SE +/- 0.011, N = 5SE +/- 0.012, N = 5SE +/- 0.012, N = 57.6657.6677.6747.6821. (CXX) g++ options: -fvisibility=hidden -logg -lm

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 142130.10440.20880.31320.41760.522SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.4640.4640.4640.463

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks132413K26K39K52K65KSE +/- 13.67, N = 3SE +/- 63.76, N = 3SE +/- 100.54, N = 3SE +/- 124.65, N = 3619766200962100621051. (CXX) g++ options: -O3 -lpthread

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption41326001200180024003000SE +/- 6.10, N = 3SE +/- 1.38, N = 3SE +/- 2.27, N = 3SE +/- 0.85, N = 32636.02634.22633.52630.7

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption14322004006008001000SE +/- 0.40, N = 3SE +/- 0.93, N = 3SE +/- 1.20, N = 3SE +/- 0.45, N = 3908.3907.6907.5906.5

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU21a340.88611.77222.65833.54444.4305SE +/- 0.00431, N = 3SE +/- 0.00444, N = 3SE +/- 0.00396, N = 3SE +/- 0.00145, N = 33.930883.932053.935293.93813MIN: 3.89MIN: 3.89MIN: 3.9MIN: 3.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1a2349001800270036004500SE +/- 2.30, N = 3SE +/- 1.81, N = 3SE +/- 1.28, N = 3SE +/- 1.09, N = 34024.784025.104029.874032.07MIN: 4018.57MIN: 4019.74MIN: 4025.1MIN: 4027.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU21a439001800270036004500SE +/- 2.32, N = 3SE +/- 1.13, N = 3SE +/- 1.40, N = 3SE +/- 2.36, N = 34023.674025.374030.494030.56MIN: 4017.93MIN: 4021.62MIN: 4026.5MIN: 4023.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression43211302603905206505865865865851. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression134230060090012001500SE +/- 0.06, N = 3SE +/- 0.28, N = 3SE +/- 0.60, N = 3SE +/- 0.50, N = 31173.141173.721174.591175.091. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption31242004006008001000SE +/- 1.18, N = 3SE +/- 0.64, N = 3SE +/- 0.80, N = 3SE +/- 0.85, N = 2925.4924.7924.5923.9

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption4132110220330440550SE +/- 0.35, N = 3SE +/- 0.09, N = 3SE +/- 0.15, N = 3SE +/- 0.64, N = 3503.6503.6503.4502.8

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption24316001200180024003000SE +/- 0.93, N = 3SE +/- 8.59, N = 3SE +/- 3.07, N = 3SE +/- 6.46, N = 32621.12620.22617.52617.1

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1124360120180240300SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 3270.62270.94271.01271.02MIN: 270.11 / MAX: 271.33MIN: 270.24 / MAX: 271.7MIN: 270.36 / MAX: 271.8MIN: 270.32 / MAX: 271.671. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption42315001000150020002500SE +/- 6.54, N = 3SE +/- 5.47, N = 3SE +/- 1.04, N = 3SE +/- 3.61, N = 32359.62359.42358.12356.3

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data432120040060080010001044.591044.651045.371046.02

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption31242004006008001000SE +/- 0.92, N = 3SE +/- 0.49, N = 3SE +/- 0.96, N = 3SE +/- 0.50, N = 3925.9925.4925.2924.7

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 10802413612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 323.6323.6223.6123.60

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080241380160240320400SE +/- 0.38, N = 3SE +/- 0.12, N = 3SE +/- 0.19, N = 3SE +/- 0.59, N = 3378.07377.95377.78377.61

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression43212004006008001000SE +/- 0.67, N = 38418408408401. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU21a439001800270036004500SE +/- 1.97, N = 3SE +/- 0.29, N = 3SE +/- 1.14, N = 3SE +/- 2.50, N = 34023.424025.944028.184028.19MIN: 4017.95MIN: 4021.96MIN: 4024.52MIN: 4022.481. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption4123110220330440550SE +/- 0.41, N = 3SE +/- 0.10, N = 2SE +/- 0.26, N = 3SE +/- 0.43, N = 3506.8506.8506.5506.2

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption4312110220330440550SE +/- 0.35, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.34, N = 3507.2507.2507.2506.6

OpenFOAM

Input: Motorbike 60M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60M142330060090012001500SE +/- 0.86, N = 3SE +/- 1.05, N = 3SE +/- 0.28, N = 3SE +/- 0.79, N = 31218.521219.281219.631219.871. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lgenericPatchFields -lscotchDecomp -lptscotchDecomp -lmeshTools -ldynamicMesh -lOpenFOAM -ldl -lm

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption14235001000150020002500SE +/- 0.52, N = 3SE +/- 5.53, N = 3SE +/- 3.53, N = 3SE +/- 0.71, N = 32358.22358.02355.82355.6

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C12345K10K15K20K25KSE +/- 10.13, N = 3SE +/- 49.20, N = 3SE +/- 8.06, N = 3SE +/- 10.80, N = 324235.4524224.8024210.1124209.081. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool3241200K400K600K800K1000KSE +/- 930.00, N = 3SE +/- 930.00, N = 3856679856679855749855749

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile21344080120160200SE +/- 0.19, N = 3SE +/- 0.32, N = 3SE +/- 0.26, N = 3SE +/- 0.12, N = 3184.26184.26184.36184.41

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Decompression431230060090012001500SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 312951295129512941. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads124313K26K39K52K65KSE +/- 13.45, N = 3SE +/- 28.06, N = 3SE +/- 32.42, N = 3SE +/- 42.55, N = 3618386187861880618841. (CXX) g++ options: -O3 -lpthread

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha5123412400K800K1200K1600K2000KSE +/- 2147.17, N = 3SE +/- 1237.33, N = 31974724197471919747191973482

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU4231a0.48210.96421.44631.92842.4105SE +/- 0.00130, N = 3SE +/- 0.00277, N = 3SE +/- 0.00203, N = 3SE +/- 0.00109, N = 32.141482.141542.142202.14275MIN: 2.13MIN: 2.12MIN: 2.12MIN: 2.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP231413K26K39K52K65KSE +/- 1.67, N = 3SE +/- 4.04, N = 3SE +/- 6.00, N = 3SE +/- 8.11, N = 3617286173361735617431. (CXX) g++ options: -O3 -lpthread

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU43211428425670SE +/- 0.17, N = 3SE +/- 0.17, N = 3SE +/- 0.00, N = 3SE +/- 0.17, N = 3626262621. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt


Phoronix Test Suite v10.8.4