core-i7-8086k-2021

Intel Core i7-8086K testing with a ASUS PRIME Z370-A (1802 BIOS) and ASUS Intel UHD 630 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102185-HA-COREI780895&grs.

core-i7-8086k-2021 ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen Resolution121a34Intel Core i7-8086K @ 5.00GHz (6 Cores / 12 Threads)ASUS PRIME Z370-A (1802 BIOS)Intel 8th Gen Core8GB118GB INTEL SSDPEK1W120GAASUS Intel UHD 630 3GB (1200MHz)Realtek ALC1220G237HLIntel I219-VUbuntu 20.045.9.0-050900rc8daily20201009-generic (x86_64) 20201008GNOME Shell 3.36.4X Server 1.20.84.6 Mesa 20.0.81.2.131GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 - Thermald 1.9.1 Python Details- 1, 2, 3, 4: Python 2.7.18 + Python 3.8.5Security Details- itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

core-i7-8086k-2021 redis: LPOPonednn: IP Shapes 3D - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUredis: GETaskap: tConvolve OpenMP - Griddingonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUaskap: tConvolve OpenMP - Degriddingonednn: Convolution Batch Shapes Auto - f32 - CPUrav1e: 10onednn: IP Shapes 1D - f32 - CPUmnn: MobileNetV2_224redis: LPUSHnpb: EP.Dngspice: C7552jpegxl-decode: Allnpb: EP.Clzbench: XZ 0 - Compressionredis: SETvkmark: 1920 x 1080onednn: Recurrent Neural Network Inference - f32 - CPUqmcpack: simple-H2Oonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUngspice: C2670lzbench: Crush 0 - Compressiononednn: Recurrent Neural Network Inference - u8s8f32 - CPUrav1e: 5redis: SADDonednn: Deconvolution Batch shapes_1d - f32 - CPUjpegxl: JPEG - 5jpegxl: PNG - 8askap: Hogbom Clean OpenMPonnx: super-resolution-10 - OpenMP CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUwarsow: 1920 x 1080lammps: Rhodopsin Proteinaskap: tConvolve MPI - Degriddingonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUdav1d: Chimera 1080plzbench: Brotli 0 - Compressionlulesh: lzbench: Zstd 8 - Compressionquantlib: lzbench: Brotli 2 - Compressionjpegxl: JPEG - 8tnn: CPU - MobileNet v2jpegxl-decode: 1lzbench: Zstd 1 - Compressionmnn: mobilenet-v1-1.0jpegxl: JPEG - 7dav1d: Summer Nature 1080pgnupg: 2.7GB Sample File Encryptionlzbench: XZ 0 - Decompressionmnn: inception-v3etcpak: ETC1 + Ditheringjpegxl: PNG - 7onnx: bertsquad-10 - OpenMP CPUrav1e: 6lzbench: Zstd 1 - Decompressionmnn: resnet-v2-50lzbench: Zstd 8 - Decompressionamg: onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUetcpak: DXT1unpack-firefox: firefox-84.0.source.tar.xzkripke: webp2: Quality 95, Compression Effort 7v-ray: CPUopenfoam: Motorbike 30Maskap: tConvolve MPI - Griddingjpegxl: PNG - 5synthmark: VoiceMark_100onnx: shufflenet-v2-10 - OpenMP CPUgcrypt: etcpak: ETC1lzbench: Brotli 0 - Decompressionpennant: leblancbigpennant: sedovbiglzbench: Libdeflate 1 - Compressiondav1d: Summer Nature 4Kmnn: SqueezeNetV1.0askap: tConvolve MT - Degriddingparaview: Wavelet Contour - 1920 x 1080dav1d: Chimera 1080p 10-bitcloverleaf: Lagrangian-Eulerian Hydrodynamicswebp2: Defaultparaview: Wavelet Contour - 1920 x 1080onnx: yolov4 - OpenMP CPUaskap: tConvolve MT - Griddingwebp2: Quality 75, Compression Effort 7gromacs: water_GMX50_bareencode-ape: WAV To APEcryptsetup: Serpent-XTS 256b Encryptionetcpak: ETC2cryptsetup: Twofish-XTS 256b Encryptionencode-wavpack: WAV To WavPackwebp2: Quality 100, Compression Effort 5encode-opus: WAV To Opus Encoderav1e: 1toybrot: C++ Taskscryptsetup: AES-XTS 256b Decryptioncryptsetup: Serpent-XTS 512b Encryptiononednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUlzbench: Crush 0 - Decompressionwebp2: Quality 100, Lossless Compressioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: AES-XTS 256b Encryptiontnn: CPU - SqueezeNet v1.1cryptsetup: AES-XTS 512b Decryptioncp2k: Fayalite-FIST Datacryptsetup: Serpent-XTS 256b Decryptionparaview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080lzbench: Brotli 2 - Decompressiononednn: Recurrent Neural Network Training - u8s8f32 - CPUcryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 256b Decryptionopenfoam: Motorbike 60Mcryptsetup: AES-XTS 512b Encryptionnpb: LU.Ccryptsetup: PBKDF2-whirlpoolbuild-godot: Time To Compilelzbench: Libdeflate 1 - Decompressiontoybrot: C++ Threadscryptsetup: PBKDF2-sha512onednn: IP Shapes 1D - u8s8f32 - CPUtoybrot: OpenMPonnx: fcn-resnet101-11 - OpenMP CPU121a343323281.833138734.001248.232218.83.8723.2152023847.68945.8190.134181.53942.53492369135.507192230.3825.6972220.83108.8841272223.461.3302635219.7561.730.86189.275469684.65.0291967.97532.045221585.76841082898.622228.80298.62746.205923.83361.59485.4360.75513343.382357.3329.725641.731205540.89122472502456331499.70616.78936331390600.5857483273.4958.66783.25216469179.825374.77572768.21744105.401266139.305.5082018.8038.0798.49171.565.726396.7193521145.05325.1650.7819.805906.6213.104502.613.03018.3717.6650.464619762634.2908.35851173.138924.7503.62617.1270.6222356.31046.02925.423.61377.784840506.8507.21218.522358.224235.45855749184.259129561838197471961735622085673.468.272343.616702908151.921235.6616.75652.104722226.2620.06853.7844.501013.2252059021.04949.6990.163178.47941.53492342227.427112248.6625.8632255.93108.3291282254.881.3112641940.86.4169161.030.87189.27547258.8911285.45.0491954.986.84973526.975261570.79681072901.022328.68301.24045.975893.86561.34484.0060.28713443.135357.5199.685621.742205740.63722422501188674.239041496.86216.78436317627600.3907462273.321929.5058.42779.63516544179.221373.76272668.33411105.4707265139.465.5222017.1038.0898.50171.345.719396.7913511144.51324.8870.7819.814907.7212.969503.013.00018.3297.6740.464621002630.7906.53.930884025.104023.675861175.087924.5502.82621.1270.9432359.41045.369925.223.63378.0688404023.42506.5506.61219.632355.824224.80856679184.25512946187819734822.1415461728628.306433.6181716.84242.0713120.07704.507542216.002220.332222.976.360658.935906.787794.236923.932054024.784025.374025.942.142752070909.2510.30473.909502927900.671240.4417.88122.182442225.0220.75433.8964.633683.1732072916.33945.8089.729177.78927.39492377147.507072250.3425.5192253.87107.1671282256.421.3192634387.836.3340460.990.87188.68046728.9734085.45.0341948.486.79490527.855251572.44301072875.722428.58299.73845.845883.86061.10481.6560.57313343.311354.9349.655621.740204740.80422472489440674.262661495.40916.87336175097597.4937445274.381920.0258.38782.58016483179.214374.17972968.49462105.8152265139.085.5212017.9537.9698.41171.055.711395.6223511144.37325.7480.7829.800908.9212.567503.813.00318.3617.6820.463620092633.5907.53.935294029.874030.565861173.720925.4503.42617.5271.0162358.11044.646925.923.60377.6108404028.19506.2507.21219.872355.624210.11856679184.35812956188419747242.1422061733622109775.010.40133.893532919971.421322.7817.86372.160422322.0420.76843.8414.602303.2522051125.08928.5288.280177.79946.58502385714.257092253.5525.9362256.59107.3411292254.151.3262607688.026.3482661.570.87187.15047068.9875685.54.9961948.616.85494527.215271572.55791082901.822428.55299.92545.805933.85061.11482.1360.41513443.452356.9109.655601.730206040.80622562487054334.253931503.94816.86536141050598.2767447274.671923.1458.42782.34116540179.061375.34072968.49578105.7825265138.975.5272012.0138.0698.19171.585.709396.6563511141.91325.6230.7809.825908.3212.984503.813.00018.3527.6670.464621052636.0907.63.938134032.074030.495861174.594923.9503.62620.2271.0102359.61044.592924.723.62377.9488414028.18506.8507.21219.282358.024209.08855749184.41312956188019747192.141486174362OpenBenchmarking.org

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP1234700K1400K2100K2800K3500KSE +/- 19183.06, N = 3SE +/- 11005.15, N = 3SE +/- 32269.04, N = 3SE +/- 8626.43, N = 33323281.832085673.462070909.252109775.001. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1a2343691215SE +/- 0.02438, N = 3SE +/- 0.01851, N = 3SE +/- 0.01907, N = 3SE +/- 0.01545, N = 38.306438.2723410.3047010.40130MIN: 8.05MIN: 7.96MIN: 10.11MIN: 10.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1a2340.87961.75922.63883.51844.398SE +/- 0.00712, N = 3SE +/- 0.02046, N = 3SE +/- 0.00680, N = 3SE +/- 0.01176, N = 33.618173.616703.909503.89353MIN: 3.55MIN: 3.52MIN: 3.84MIN: 3.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET1234700K1400K2100K2800K3500KSE +/- 18933.24, N = 3SE +/- 44568.10, N = 3SE +/- 15356.90, N = 3SE +/- 18481.68, N = 33138734.002908151.922927900.672919971.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding123430060090012001500SE +/- 9.83, N = 3SE +/- 14.95, N = 5SE +/- 8.41, N = 3SE +/- 14.50, N = 31248.231235.661240.441322.781. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1a23448121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 316.8416.7617.8817.86MIN: 16.55MIN: 16.26MIN: 17.63MIN: 17.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1a2340.4910.9821.4731.9642.455SE +/- 0.00510, N = 3SE +/- 0.02847, N = 3SE +/- 0.00718, N = 3SE +/- 0.00374, N = 32.071312.104722.182442.16042MIN: 2.01MIN: 2MIN: 2.13MIN: 2.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding12345001000150020002500SE +/- 0.00, N = 3SE +/- 4.57, N = 5SE +/- 6.22, N = 3SE +/- 6.77, N = 32218.802226.262225.022322.041. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU1a234510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 320.0820.0720.7520.77MIN: 19.98MIN: 19.96MIN: 20.67MIN: 20.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1012340.87661.75322.62983.50644.383SE +/- 0.019, N = 3SE +/- 0.061, N = 3SE +/- 0.026, N = 3SE +/- 0.014, N = 33.8723.7843.8963.841

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1a2341.04262.08523.12784.17045.213SE +/- 0.01402, N = 3SE +/- 0.00886, N = 3SE +/- 0.00530, N = 3SE +/- 0.00329, N = 34.507544.501014.633684.60230MIN: 4.39MIN: 4.37MIN: 4.54MIN: 4.511. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_22412340.73171.46342.19512.92683.6585SE +/- 0.012, N = 3SE +/- 0.020, N = 3SE +/- 0.041, N = 3SE +/- 0.025, N = 33.2153.2253.1733.252MIN: 3.13 / MAX: 20.95MIN: 3.1 / MAX: 4.26MIN: 3.06 / MAX: 7.08MIN: 3.17 / MAX: 7.81. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH1234400K800K1200K1600K2000KSE +/- 25186.84, N = 5SE +/- 9300.78, N = 3SE +/- 13361.81, N = 3SE +/- 19842.96, N = 32023847.682059021.042072916.332051125.081. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D12342004006008001000SE +/- 0.11, N = 3SE +/- 0.61, N = 3SE +/- 1.74, N = 3SE +/- 8.82, N = 12945.81949.69945.80928.521. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552123420406080100SE +/- 0.30, N = 3SE +/- 0.25, N = 3SE +/- 0.50, N = 3SE +/- 0.53, N = 390.1390.1689.7388.281. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

JPEG XL Decoding

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: All12344080120160200SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 3181.53178.47177.78177.79

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C12342004006008001000SE +/- 0.53, N = 3SE +/- 2.85, N = 3SE +/- 13.08, N = 3SE +/- 1.34, N = 3942.53941.53927.39946.581. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression12341122334455494949501. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET1234500K1000K1500K2000K2500KSE +/- 13128.97, N = 3SE +/- 10631.93, N = 3SE +/- 12358.77, N = 3SE +/- 3546.38, N = 32369135.502342227.422377147.502385714.251. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 10801234160320480640800SE +/- 2.73, N = 3SE +/- 1.86, N = 3SE +/- 1.86, N = 37197117077091. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU121a345001000150020002500SE +/- 1.59, N = 3SE +/- 1.21, N = 3SE +/- 1.62, N = 3SE +/- 1.92, N = 3SE +/- 3.29, N = 32230.382248.662216.002250.342253.55MIN: 2226.39MIN: 2245.47MIN: 2211.59MIN: 2246.36MIN: 2246.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O1234612182430SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.21, N = 325.7025.8625.5225.941. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU121a345001000150020002500SE +/- 5.92, N = 3SE +/- 1.19, N = 3SE +/- 4.85, N = 3SE +/- 1.66, N = 3SE +/- 1.03, N = 32220.832255.932220.332253.872256.59MIN: 2210.57MIN: 2252MIN: 2210.09MIN: 2248.87MIN: 2253.051. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670123420406080100SE +/- 0.22, N = 3SE +/- 0.77, N = 3SE +/- 0.36, N = 3SE +/- 0.54, N = 3108.88108.33107.17107.341. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression1234306090120150SE +/- 1.00, N = 3SE +/- 0.33, N = 31271281281291. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU121a345001000150020002500SE +/- 3.64, N = 3SE +/- 5.08, N = 3SE +/- 2.04, N = 3SE +/- 3.29, N = 3SE +/- 3.51, N = 32223.462254.882222.972256.422254.15MIN: 2215.76MIN: 2245.82MIN: 2219.12MIN: 2250.83MIN: 2247.341. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 512340.29930.59860.89791.19721.4965SE +/- 0.005, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 31.3301.3111.3191.326

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD1234600K1200K1800K2400K3000KSE +/- 27951.55, N = 3SE +/- 12039.46, N = 3SE +/- 17037.49, N = 3SE +/- 28568.59, N = 132635219.752641940.802634387.832607688.021. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1a234246810SE +/- 0.02142, N = 3SE +/- 0.01672, N = 3SE +/- 0.00815, N = 3SE +/- 0.01283, N = 36.360656.416916.334046.34826MIN: 6.27MIN: 6.31MIN: 6.28MIN: 6.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 512341428425670SE +/- 0.23, N = 3SE +/- 0.47, N = 3SE +/- 0.30, N = 3SE +/- 0.19, N = 361.7361.0360.9961.571. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 812340.19580.39160.58740.78320.979SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.860.870.870.871. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP12344080120160200SE +/- 0.32, N = 3SE +/- 0.24, N = 3SE +/- 0.21, N = 3SE +/- 0.31, N = 3189.28189.28188.68187.151. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU123410002000300040005000SE +/- 8.06, N = 3SE +/- 9.84, N = 3SE +/- 9.22, N = 3SE +/- 4.67, N = 346964725467247061. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1a2343691215SE +/- 0.00635, N = 3SE +/- 0.02003, N = 3SE +/- 0.00861, N = 3SE +/- 0.01901, N = 38.935908.891128.973408.98756MIN: 8.89MIN: 8.82MIN: 8.92MIN: 8.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Warsow

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1080123420406080100SE +/- 0.47, N = 3SE +/- 0.09, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 384.685.485.485.5

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein12341.1362.2723.4084.5445.68SE +/- 0.013, N = 3SE +/- 0.005, N = 3SE +/- 0.021, N = 3SE +/- 0.025, N = 35.0295.0495.0344.9961. (CXX) g++ options: -O3 -pthread -lm

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding1234400800120016002000SE +/- 0.00, N = 3SE +/- 6.50, N = 3SE +/- 0.00, N = 3SE +/- 11.14, N = 31967.971954.981948.481948.611. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1a234246810SE +/- 0.01649, N = 3SE +/- 0.06379, N = 3SE +/- 0.00909, N = 3SE +/- 0.06163, N = 36.787796.849736.794906.85494MIN: 6.69MIN: 6.72MIN: 6.7MIN: 6.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p1234120240360480600SE +/- 0.61, N = 3SE +/- 1.05, N = 3SE +/- 0.26, N = 3SE +/- 0.66, N = 3532.04526.97527.85527.21MIN: 393.83 / MAX: 800.33MIN: 392.13 / MAX: 800.13MIN: 391.98 / MAX: 817.62MIN: 391.75 / MAX: 803.531. (CC) gcc options: -pthread

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression1234110220330440550SE +/- 1.33, N = 3SE +/- 1.00, N = 35225265255271. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3123430060090012001500SE +/- 1.22, N = 3SE +/- 0.70, N = 3SE +/- 0.47, N = 3SE +/- 1.89, N = 31585.771570.801572.441572.561. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression123420406080100SE +/- 0.58, N = 31081071071081. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.2112346001200180024003000SE +/- 8.50, N = 3SE +/- 9.53, N = 3SE +/- 10.28, N = 3SE +/- 5.47, N = 32898.62901.02875.72901.81. (CXX) g++ options: -O3 -march=native -rdynamic

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression123450100150200250SE +/- 0.58, N = 32222232242241. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 81234714212835SE +/- 0.08, N = 3SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 0.17, N = 328.8028.6828.5828.551. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2123470140210280350SE +/- 0.27, N = 3SE +/- 0.93, N = 3SE +/- 0.54, N = 3SE +/- 1.01, N = 3298.63301.24299.74299.93MIN: 297.04 / MAX: 300.97MIN: 298.47 / MAX: 304.26MIN: 297.93 / MAX: 302.27MIN: 297.54 / MAX: 306.691. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

JPEG XL Decoding

CPU Threads: 1

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: 112341020304050SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 346.2045.9745.8445.80

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression1234130260390520650SE +/- 0.67, N = 3SE +/- 1.00, N = 35925895885931. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.012340.86961.73922.60883.47844.348SE +/- 0.008, N = 3SE +/- 0.010, N = 3SE +/- 0.023, N = 3SE +/- 0.004, N = 33.8333.8653.8603.850MIN: 3.76 / MAX: 6.28MIN: 3.81 / MAX: 5.88MIN: 3.77 / MAX: 20.88MIN: 3.8 / MAX: 4.361. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 712341428425670SE +/- 0.15, N = 3SE +/- 0.46, N = 3SE +/- 0.10, N = 3SE +/- 0.21, N = 361.5961.3461.1061.111. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p1234110220330440550SE +/- 0.56, N = 3SE +/- 0.94, N = 3SE +/- 1.00, N = 3SE +/- 0.31, N = 3485.43484.00481.65482.13MIN: 440.26 / MAX: 533.5MIN: 431.31 / MAX: 528.36MIN: 426.68 / MAX: 527.14MIN: 428.5 / MAX: 525.31. (CC) gcc options: -pthread

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption12341428425670SE +/- 0.44, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 360.7660.2960.5760.421. (CC) gcc options: -O2

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression1234306090120150SE +/- 0.33, N = 31331341331341. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v312341020304050SE +/- 0.06, N = 3SE +/- 0.27, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 343.3843.1443.3143.45MIN: 43.06 / MAX: 61.01MIN: 42.66 / MAX: 61.01MIN: 42.98 / MAX: 58.83MIN: 43.05 / MAX: 60.531. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering123480160240320400SE +/- 0.18, N = 3SE +/- 0.12, N = 3SE +/- 2.23, N = 3SE +/- 0.35, N = 3357.33357.52354.93356.911. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 712343691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 39.729.689.659.651. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU1234120240360480600SE +/- 0.50, N = 3SE +/- 1.36, N = 3SE +/- 0.76, N = 3SE +/- 0.73, N = 35645625625601. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 612340.3920.7841.1761.5681.96SE +/- 0.012, N = 3SE +/- 0.008, N = 3SE +/- 0.005, N = 3SE +/- 0.009, N = 31.7311.7421.7401.730

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression1234400800120016002000SE +/- 3.18, N = 3SE +/- 0.67, N = 3SE +/- 12.33, N = 3SE +/- 1.45, N = 320552057204720601. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-501234918273645SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 340.8940.6440.8040.81MIN: 40.57 / MAX: 57.96MIN: 40.34 / MAX: 57.72MIN: 40.51 / MAX: 56.52MIN: 40.64 / MAX: 58.221. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression12345001000150020002500SE +/- 4.18, N = 3SE +/- 5.36, N = 3SE +/- 6.36, N = 322472242224722561. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2123450M100M150M200M250MSE +/- 323790.40, N = 3SE +/- 614809.90, N = 3SE +/- 1277684.47, N = 3SE +/- 1517817.32, N = 32502456332501188672489440672487054331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1a2340.95911.91822.87733.83644.7955SE +/- 0.00597, N = 3SE +/- 0.00463, N = 3SE +/- 0.00176, N = 3SE +/- 0.00315, N = 34.236924.239044.262664.25393MIN: 4.2MIN: 4.2MIN: 4.23MIN: 4.221. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1123430060090012001500SE +/- 3.68, N = 3SE +/- 2.90, N = 3SE +/- 3.85, N = 3SE +/- 0.57, N = 31499.711496.861495.411503.951. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz123448121620SE +/- 0.05, N = 4SE +/- 0.06, N = 4SE +/- 0.06, N = 4SE +/- 0.02, N = 416.7916.7816.8716.87

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.412348M16M24M32M40MSE +/- 116888.91, N = 3SE +/- 126361.09, N = 3SE +/- 107755.07, N = 3SE +/- 114405.39, N = 3363313903631762736175097361410501. (CXX) g++ options: -O3 -fopenmp

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 71234130260390520650SE +/- 1.42, N = 3SE +/- 0.37, N = 3SE +/- 0.46, N = 3SE +/- 1.37, N = 3600.59600.39597.49598.281. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPU123416003200480064008000SE +/- 17.10, N = 3SE +/- 28.75, N = 3SE +/- 32.98, N = 3SE +/- 42.90, N = 37483746274457447

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M123460120180240300SE +/- 0.20, N = 3SE +/- 0.06, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 3273.49273.32274.38274.671. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lgenericPatchFields -lscotchDecomp -lptscotchDecomp -lmeshTools -ldynamicMesh -lOpenFOAM -ldl -lm

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding234400800120016002000SE +/- 10.92, N = 3SE +/- 9.37, N = 2SE +/- 6.24, N = 31929.501920.021923.141. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 512341326395265SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.20, N = 3SE +/- 0.15, N = 358.6658.4258.3858.421. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_10012342004006008001000SE +/- 0.34, N = 3SE +/- 3.25, N = 3SE +/- 0.39, N = 3SE +/- 0.30, N = 3783.25779.64782.58782.341. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU12344K8K12K16K20KSE +/- 8.66, N = 3SE +/- 15.42, N = 3SE +/- 45.62, N = 3SE +/- 16.98, N = 3164691654416483165401. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.912344080120160200SE +/- 0.39, N = 3SE +/- 0.37, N = 3SE +/- 0.52, N = 3SE +/- 0.14, N = 3179.83179.22179.21179.061. (CC) gcc options: -O2 -fvisibility=hidden

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1123480160240320400SE +/- 0.58, N = 3SE +/- 1.79, N = 3SE +/- 0.53, N = 3SE +/- 0.15, N = 3374.78373.76374.18375.341. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression1234160320480640800SE +/- 1.00, N = 3SE +/- 2.03, N = 3SE +/- 0.67, N = 37277267297291. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig12341530456075SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 368.2268.3368.4968.501. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig123420406080100SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.05, N = 3105.40105.47105.82105.781. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression123460120180240300SE +/- 0.58, N = 3SE +/- 1.20, N = 3SE +/- 1.53, N = 3SE +/- 1.00, N = 32662652652651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K1234306090120150SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 3139.30139.46139.08138.97MIN: 131.22 / MAX: 157.1MIN: 131.42 / MAX: 157.06MIN: 130.77 / MAX: 156.83MIN: 131.06 / MAX: 156.711. (CC) gcc options: -pthread

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.012341.24362.48723.73084.97446.218SE +/- 0.023, N = 3SE +/- 0.014, N = 3SE +/- 0.037, N = 3SE +/- 0.015, N = 35.5085.5225.5215.527MIN: 5.39 / MAX: 7.3MIN: 5.41 / MAX: 7.94MIN: 5.38 / MAX: 7.49MIN: 5.39 / MAX: 23.381. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding1234400800120016002000SE +/- 3.32, N = 3SE +/- 2.65, N = 3SE +/- 3.48, N = 3SE +/- 0.00, N = 32018.802017.102017.952012.011. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10801234918273645SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 338.0738.0837.9638.06

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit123420406080100SE +/- 0.02, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 398.4998.5098.4198.19MIN: 63.95 / MAX: 228.95MIN: 64.06 / MAX: 228.81MIN: 64.05 / MAX: 227.43MIN: 63.95 / MAX: 228.891. (CC) gcc options: -pthread

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics12344080120160200SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 3171.56171.34171.05171.581. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default12341.28842.57683.86525.15366.442SE +/- 0.017, N = 3SE +/- 0.022, N = 3SE +/- 0.027, N = 3SE +/- 0.009, N = 35.7265.7195.7115.7091. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080123490180270360450SE +/- 1.00, N = 3SE +/- 0.59, N = 3SE +/- 0.27, N = 3SE +/- 0.91, N = 3396.72396.79395.62396.66

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU123480160240320400SE +/- 0.88, N = 3SE +/- 1.30, N = 3SE +/- 0.60, N = 3SE +/- 0.60, N = 33523513513511. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding12342004006008001000SE +/- 0.49, N = 3SE +/- 0.14, N = 3SE +/- 0.24, N = 3SE +/- 0.00, N = 31145.051144.511144.371141.911. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7123470140210280350SE +/- 0.94, N = 3SE +/- 0.58, N = 3SE +/- 0.98, N = 3SE +/- 0.66, N = 3325.17324.89325.75325.621. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare12340.1760.3520.5280.7040.88SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 30.7810.7810.7820.7801. (CXX) g++ options: -O3 -pthread

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE12343691215SE +/- 0.014, N = 5SE +/- 0.016, N = 5SE +/- 0.015, N = 5SE +/- 0.016, N = 59.8059.8149.8009.8251. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption12342004006008001000SE +/- 1.39, N = 3SE +/- 0.61, N = 3SE +/- 0.48, N = 3SE +/- 0.87, N = 3906.6907.7908.9908.3

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2123450100150200250SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.42, N = 3SE +/- 0.00, N = 3213.10212.97212.57212.981. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption1234110220330440550SE +/- 1.12, N = 3SE +/- 0.66, N = 3SE +/- 0.06, N = 3SE +/- 0.38, N = 3502.6503.0503.8503.8

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12343691215SE +/- 0.02, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 513.0313.0013.0013.001. (CXX) g++ options: -rdynamic

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 51234510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 318.3718.3318.3618.351. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1234246810SE +/- 0.010, N = 5SE +/- 0.012, N = 5SE +/- 0.012, N = 5SE +/- 0.011, N = 57.6657.6747.6827.6671. (CXX) g++ options: -fvisibility=hidden -logg -lm

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 112340.10440.20880.31320.41760.522SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.4640.4640.4630.464

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks123413K26K39K52K65KSE +/- 13.67, N = 3SE +/- 100.54, N = 3SE +/- 63.76, N = 3SE +/- 124.65, N = 3619766210062009621051. (CXX) g++ options: -O3 -lpthread

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption12346001200180024003000SE +/- 1.38, N = 3SE +/- 0.85, N = 3SE +/- 2.27, N = 3SE +/- 6.10, N = 32634.22630.72633.52636.0

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption12342004006008001000SE +/- 0.40, N = 3SE +/- 0.45, N = 3SE +/- 1.20, N = 3SE +/- 0.93, N = 3908.3906.5907.5907.6

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1a2340.88611.77222.65833.54444.4305SE +/- 0.00444, N = 3SE +/- 0.00431, N = 3SE +/- 0.00396, N = 3SE +/- 0.00145, N = 33.932053.930883.935293.93813MIN: 3.89MIN: 3.89MIN: 3.9MIN: 3.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1a2349001800270036004500SE +/- 2.30, N = 3SE +/- 1.81, N = 3SE +/- 1.28, N = 3SE +/- 1.09, N = 34024.784025.104029.874032.07MIN: 4018.57MIN: 4019.74MIN: 4025.1MIN: 4027.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1a2349001800270036004500SE +/- 1.13, N = 3SE +/- 2.32, N = 3SE +/- 2.36, N = 3SE +/- 1.40, N = 34025.374023.674030.564030.49MIN: 4021.62MIN: 4017.93MIN: 4023.71MIN: 4026.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression12341302603905206505855865865861. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression123430060090012001500SE +/- 0.06, N = 3SE +/- 0.50, N = 3SE +/- 0.28, N = 3SE +/- 0.60, N = 31173.141175.091173.721174.591. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption12342004006008001000SE +/- 0.64, N = 3SE +/- 0.80, N = 3SE +/- 1.18, N = 3SE +/- 0.85, N = 2924.7924.5925.4923.9

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption1234110220330440550SE +/- 0.09, N = 3SE +/- 0.64, N = 3SE +/- 0.15, N = 3SE +/- 0.35, N = 3503.6502.8503.4503.6

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption12346001200180024003000SE +/- 6.46, N = 3SE +/- 0.93, N = 3SE +/- 3.07, N = 3SE +/- 8.59, N = 32617.12621.12617.52620.2

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1123460120180240300SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3270.62270.94271.02271.01MIN: 270.11 / MAX: 271.33MIN: 270.24 / MAX: 271.7MIN: 270.32 / MAX: 271.67MIN: 270.36 / MAX: 271.81. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption12345001000150020002500SE +/- 3.61, N = 3SE +/- 5.47, N = 3SE +/- 1.04, N = 3SE +/- 6.54, N = 32356.32359.42358.12359.6

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data123420040060080010001046.021045.371044.651044.59

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption12342004006008001000SE +/- 0.49, N = 3SE +/- 0.96, N = 3SE +/- 0.92, N = 3SE +/- 0.50, N = 3925.4925.2925.9924.7

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 10801234612182430SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 323.6123.6323.6023.62

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080123480160240320400SE +/- 0.19, N = 3SE +/- 0.38, N = 3SE +/- 0.59, N = 3SE +/- 0.12, N = 3377.78378.07377.61377.95

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression12342004006008001000SE +/- 0.67, N = 38408408408411. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1a2349001800270036004500SE +/- 0.29, N = 3SE +/- 1.97, N = 3SE +/- 2.50, N = 3SE +/- 1.14, N = 34025.944023.424028.194028.18MIN: 4021.96MIN: 4017.95MIN: 4022.48MIN: 4024.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption1234110220330440550SE +/- 0.10, N = 2SE +/- 0.26, N = 3SE +/- 0.43, N = 3SE +/- 0.41, N = 3506.8506.5506.2506.8

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption1234110220330440550SE +/- 0.03, N = 3SE +/- 0.34, N = 3SE +/- 0.07, N = 3SE +/- 0.35, N = 3507.2506.6507.2507.2

OpenFOAM

Input: Motorbike 60M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60M123430060090012001500SE +/- 0.86, N = 3SE +/- 0.28, N = 3SE +/- 0.79, N = 3SE +/- 1.05, N = 31218.521219.631219.871219.281. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lgenericPatchFields -lscotchDecomp -lptscotchDecomp -lmeshTools -ldynamicMesh -lOpenFOAM -ldl -lm

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption12345001000150020002500SE +/- 0.52, N = 3SE +/- 3.53, N = 3SE +/- 0.71, N = 3SE +/- 5.53, N = 32358.22355.82355.62358.0

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C12345K10K15K20K25KSE +/- 10.13, N = 3SE +/- 49.20, N = 3SE +/- 8.06, N = 3SE +/- 10.80, N = 324235.4524224.8024210.1124209.081. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool1234200K400K600K800K1000KSE +/- 930.00, N = 3SE +/- 930.00, N = 3855749856679856679855749

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile12344080120160200SE +/- 0.32, N = 3SE +/- 0.19, N = 3SE +/- 0.26, N = 3SE +/- 0.12, N = 3184.26184.26184.36184.41

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Decompression123430060090012001500SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 312951294129512951. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads123413K26K39K52K65KSE +/- 13.45, N = 3SE +/- 28.06, N = 3SE +/- 42.55, N = 3SE +/- 32.42, N = 3618386187861884618801. (CXX) g++ options: -O3 -lpthread

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha5121234400K800K1200K1600K2000KSE +/- 1237.33, N = 3SE +/- 2147.17, N = 31974719197348219747241974719

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1a2340.48210.96421.44631.92842.4105SE +/- 0.00109, N = 3SE +/- 0.00277, N = 3SE +/- 0.00203, N = 3SE +/- 0.00130, N = 32.142752.141542.142202.14148MIN: 2.12MIN: 2.12MIN: 2.12MIN: 2.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP123413K26K39K52K65KSE +/- 6.00, N = 3SE +/- 1.67, N = 3SE +/- 4.04, N = 3SE +/- 8.11, N = 3617356172861733617431. (CXX) g++ options: -O3 -lpthread

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU12341428425670SE +/- 0.17, N = 3SE +/- 0.00, N = 3SE +/- 0.17, N = 3SE +/- 0.17, N = 3626262621. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt


Phoronix Test Suite v10.8.4