core-i7-8086k-2021

Intel Core i7-8086K testing with a ASUS PRIME Z370-A (1802 BIOS) and ASUS Intel UHD 630 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102185-HA-COREI780895&grs&rdt.

core-i7-8086k-2021 ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen Resolution4121a3Intel Core i7-8086K @ 5.00GHz (6 Cores / 12 Threads)ASUS PRIME Z370-A (1802 BIOS)Intel 8th Gen Core8GB118GB INTEL SSDPEK1W120GAASUS Intel UHD 630 3GB (1200MHz)Realtek ALC1220G237HLIntel I219-VUbuntu 20.045.9.0-050900rc8daily20201009-generic (x86_64) 20201008GNOME Shell 3.36.4X Server 1.20.84.6 Mesa 20.0.81.2.131GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 - Thermald 1.9.1 Python Details- 4, 1, 2, 3: Python 2.7.18 + Python 3.8.5Security Details- itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

core-i7-8086k-2021 redis: LPOPonednn: IP Shapes 3D - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUredis: GETaskap: tConvolve OpenMP - Griddingonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUaskap: tConvolve OpenMP - Degriddingonednn: Convolution Batch Shapes Auto - f32 - CPUrav1e: 10onednn: IP Shapes 1D - f32 - CPUmnn: MobileNetV2_224redis: LPUSHnpb: EP.Dngspice: C7552jpegxl-decode: Allnpb: EP.Clzbench: XZ 0 - Compressionredis: SETvkmark: 1920 x 1080onednn: Recurrent Neural Network Inference - f32 - CPUqmcpack: simple-H2Oonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUngspice: C2670lzbench: Crush 0 - Compressiononednn: Recurrent Neural Network Inference - u8s8f32 - CPUrav1e: 5redis: SADDonednn: Deconvolution Batch shapes_1d - f32 - CPUjpegxl: JPEG - 5jpegxl: PNG - 8askap: Hogbom Clean OpenMPonnx: super-resolution-10 - OpenMP CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUwarsow: 1920 x 1080lammps: Rhodopsin Proteinaskap: tConvolve MPI - Degriddingonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUdav1d: Chimera 1080plzbench: Brotli 0 - Compressionlulesh: lzbench: Zstd 8 - Compressionquantlib: lzbench: Brotli 2 - Compressionjpegxl: JPEG - 8tnn: CPU - MobileNet v2jpegxl-decode: 1lzbench: Zstd 1 - Compressionmnn: mobilenet-v1-1.0jpegxl: JPEG - 7dav1d: Summer Nature 1080pgnupg: 2.7GB Sample File Encryptionlzbench: XZ 0 - Decompressionmnn: inception-v3etcpak: ETC1 + Ditheringjpegxl: PNG - 7onnx: bertsquad-10 - OpenMP CPUrav1e: 6lzbench: Zstd 1 - Decompressionmnn: resnet-v2-50lzbench: Zstd 8 - Decompressionamg: onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUetcpak: DXT1unpack-firefox: firefox-84.0.source.tar.xzkripke: webp2: Quality 95, Compression Effort 7v-ray: CPUopenfoam: Motorbike 30Maskap: tConvolve MPI - Griddingjpegxl: PNG - 5synthmark: VoiceMark_100onnx: shufflenet-v2-10 - OpenMP CPUgcrypt: etcpak: ETC1lzbench: Brotli 0 - Decompressionpennant: leblancbigpennant: sedovbiglzbench: Libdeflate 1 - Compressiondav1d: Summer Nature 4Kmnn: SqueezeNetV1.0askap: tConvolve MT - Degriddingparaview: Wavelet Contour - 1920 x 1080dav1d: Chimera 1080p 10-bitcloverleaf: Lagrangian-Eulerian Hydrodynamicswebp2: Defaultparaview: Wavelet Contour - 1920 x 1080onnx: yolov4 - OpenMP CPUaskap: tConvolve MT - Griddingwebp2: Quality 75, Compression Effort 7gromacs: water_GMX50_bareencode-ape: WAV To APEcryptsetup: Serpent-XTS 256b Encryptionetcpak: ETC2cryptsetup: Twofish-XTS 256b Encryptionencode-wavpack: WAV To WavPackwebp2: Quality 100, Compression Effort 5encode-opus: WAV To Opus Encoderav1e: 1toybrot: C++ Taskscryptsetup: AES-XTS 256b Decryptioncryptsetup: Serpent-XTS 512b Encryptiononednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUlzbench: Crush 0 - Decompressionwebp2: Quality 100, Lossless Compressioncryptsetup: Serpent-XTS 512b Decryptioncryptsetup: Twofish-XTS 512b Encryptioncryptsetup: AES-XTS 256b Encryptiontnn: CPU - SqueezeNet v1.1cryptsetup: AES-XTS 512b Decryptioncp2k: Fayalite-FIST Datacryptsetup: Serpent-XTS 256b Decryptionparaview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080lzbench: Brotli 2 - Decompressiononednn: Recurrent Neural Network Training - u8s8f32 - CPUcryptsetup: Twofish-XTS 512b Decryptioncryptsetup: Twofish-XTS 256b Decryptionopenfoam: Motorbike 60Mcryptsetup: AES-XTS 512b Encryptionnpb: LU.Ccryptsetup: PBKDF2-whirlpoolbuild-godot: Time To Compilelzbench: Libdeflate 1 - Decompressiontoybrot: C++ Threadscryptsetup: PBKDF2-sha512onednn: IP Shapes 1D - u8s8f32 - CPUtoybrot: OpenMPonnx: fcn-resnet101-11 - OpenMP CPU4121a32109775.010.40133.893532919971.421322.7817.86372.160422322.0420.76843.8414.602303.2522051125.08928.5288.280177.79946.58502385714.257092253.5525.9362256.59107.3411292254.151.3262607688.026.3482661.570.87187.15047068.9875685.54.9961948.616.85494527.215271572.55791082901.822428.55299.92545.805933.85061.11482.1360.41513443.452356.9109.655601.730206040.80622562487054334.253931503.94816.86536141050598.2767447274.671923.1458.42782.34116540179.061375.34072968.49578105.7825265138.975.5272012.0138.0698.19171.585.709396.6563511141.91325.6230.7809.825908.3212.984503.813.00018.3527.6670.464621052636.0907.63.938134032.074030.495861174.594923.9503.62620.2271.0102359.61044.592924.723.62377.9488414028.18506.8507.21219.282358.024209.08855749184.41312956188019747192.1414861743623323281.833138734.001248.232218.83.8723.2152023847.68945.8190.134181.53942.53492369135.507192230.3825.6972220.83108.8841272223.461.3302635219.7561.730.86189.275469684.65.0291967.97532.045221585.76841082898.622228.80298.62746.205923.83361.59485.4360.75513343.382357.3329.725641.731205540.89122472502456331499.70616.78936331390600.5857483273.4958.66783.25216469179.825374.77572768.21744105.401266139.305.5082018.8038.0798.49171.565.726396.7193521145.05325.1650.7819.805906.6213.104502.613.03018.3717.6650.464619762634.2908.35851173.138924.7503.62617.1270.6222356.31046.02925.423.61377.784840506.8507.21218.522358.224235.45855749184.259129561838197471961735622085673.468.272343.616702908151.921235.6616.75652.104722226.2620.06853.7844.501013.2252059021.04949.6990.163178.47941.53492342227.427112248.6625.8632255.93108.3291282254.881.3112641940.86.4169161.030.87189.27547258.8911285.45.0491954.986.84973526.975261570.79681072901.022328.68301.24045.975893.86561.34484.0060.28713443.135357.5199.685621.742205740.63722422501188674.239041496.86216.78436317627600.3907462273.321929.5058.42779.63516544179.221373.76272668.33411105.4707265139.465.5222017.1038.0898.50171.345.719396.7913511144.51324.8870.7819.814907.7212.969503.013.00018.3297.6740.464621002630.7906.53.930884025.104023.675861175.087924.5502.82621.1270.9432359.41045.369925.223.63378.0688404023.42506.5506.61219.632355.824224.80856679184.25512946187819734822.1415461728628.306433.6181716.84242.0713120.07704.507542216.002220.332222.976.360658.935906.787794.236923.932054024.784025.374025.942.142752070909.2510.30473.909502927900.671240.4417.88122.182442225.0220.75433.8964.633683.1732072916.33945.8089.729177.78927.39492377147.507072250.3425.5192253.87107.1671282256.421.3192634387.836.3340460.990.87188.68046728.9734085.45.0341948.486.79490527.855251572.44301072875.722428.58299.73845.845883.86061.10481.6560.57313343.311354.9349.655621.740204740.80422472489440674.262661495.40916.87336175097597.4937445274.381920.0258.38782.58016483179.214374.17972968.49462105.8152265139.085.5212017.9537.9698.41171.055.711395.6223511144.37325.7480.7829.800908.9212.567503.813.00318.3617.6820.463620092633.5907.53.935294029.874030.565861173.720925.4503.42617.5271.0162358.11044.646925.923.60377.6108404028.19506.2507.21219.872355.624210.11856679184.35812956188419747242.142206173362OpenBenchmarking.org

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP4123700K1400K2100K2800K3500KSE +/- 8626.43, N = 3SE +/- 19183.06, N = 3SE +/- 11005.15, N = 3SE +/- 32269.04, N = 32109775.003323281.832085673.462070909.251. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU421a33691215SE +/- 0.01545, N = 3SE +/- 0.01851, N = 3SE +/- 0.02438, N = 3SE +/- 0.01907, N = 310.401308.272348.3064310.30470MIN: 10.25MIN: 7.96MIN: 8.05MIN: 10.111. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU421a30.87961.75922.63883.51844.398SE +/- 0.01176, N = 3SE +/- 0.02046, N = 3SE +/- 0.00712, N = 3SE +/- 0.00680, N = 33.893533.616703.618173.90950MIN: 3.82MIN: 3.52MIN: 3.55MIN: 3.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET4123700K1400K2100K2800K3500KSE +/- 18481.68, N = 3SE +/- 18933.24, N = 3SE +/- 44568.10, N = 3SE +/- 15356.90, N = 32919971.423138734.002908151.922927900.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding412330060090012001500SE +/- 14.50, N = 3SE +/- 9.83, N = 3SE +/- 14.95, N = 5SE +/- 8.41, N = 31322.781248.231235.661240.441. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU421a348121620SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 317.8616.7616.8417.88MIN: 17.69MIN: 16.26MIN: 16.55MIN: 17.631. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU421a30.4910.9821.4731.9642.455SE +/- 0.00374, N = 3SE +/- 0.02847, N = 3SE +/- 0.00510, N = 3SE +/- 0.00718, N = 32.160422.104722.071312.18244MIN: 2.12MIN: 2MIN: 2.01MIN: 2.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding41235001000150020002500SE +/- 6.77, N = 3SE +/- 0.00, N = 3SE +/- 4.57, N = 5SE +/- 6.22, N = 32322.042218.802226.262225.021. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU421a3510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 320.7720.0720.0820.75MIN: 20.69MIN: 19.96MIN: 19.98MIN: 20.671. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1041230.87661.75322.62983.50644.383SE +/- 0.014, N = 3SE +/- 0.019, N = 3SE +/- 0.061, N = 3SE +/- 0.026, N = 33.8413.8723.7843.896

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU421a31.04262.08523.12784.17045.213SE +/- 0.00329, N = 3SE +/- 0.00886, N = 3SE +/- 0.01402, N = 3SE +/- 0.00530, N = 34.602304.501014.507544.63368MIN: 4.51MIN: 4.37MIN: 4.39MIN: 4.541. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_22441230.73171.46342.19512.92683.6585SE +/- 0.025, N = 3SE +/- 0.012, N = 3SE +/- 0.020, N = 3SE +/- 0.041, N = 33.2523.2153.2253.173MIN: 3.17 / MAX: 7.8MIN: 3.13 / MAX: 20.95MIN: 3.1 / MAX: 4.26MIN: 3.06 / MAX: 7.081. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH4123400K800K1200K1600K2000KSE +/- 19842.96, N = 3SE +/- 25186.84, N = 5SE +/- 9300.78, N = 3SE +/- 13361.81, N = 32051125.082023847.682059021.042072916.331. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D41232004006008001000SE +/- 8.82, N = 12SE +/- 0.11, N = 3SE +/- 0.61, N = 3SE +/- 1.74, N = 3928.52945.81949.69945.801. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552412320406080100SE +/- 0.53, N = 3SE +/- 0.30, N = 3SE +/- 0.25, N = 3SE +/- 0.50, N = 388.2890.1390.1689.731. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

JPEG XL Decoding

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: All41234080120160200SE +/- 0.12, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3177.79181.53178.47177.78

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C41232004006008001000SE +/- 1.34, N = 3SE +/- 0.53, N = 3SE +/- 2.85, N = 3SE +/- 13.08, N = 3946.58942.53941.53927.391. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression41231122334455504949491. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET4123500K1000K1500K2000K2500KSE +/- 3546.38, N = 3SE +/- 13128.97, N = 3SE +/- 10631.93, N = 3SE +/- 12358.77, N = 32385714.252369135.502342227.422377147.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 10804123160320480640800SE +/- 1.86, N = 3SE +/- 2.73, N = 3SE +/- 1.86, N = 37097197117071. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU4121a35001000150020002500SE +/- 3.29, N = 3SE +/- 1.59, N = 3SE +/- 1.21, N = 3SE +/- 1.62, N = 3SE +/- 1.92, N = 32253.552230.382248.662216.002250.34MIN: 2246.46MIN: 2226.39MIN: 2245.47MIN: 2211.59MIN: 2246.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O4123612182430SE +/- 0.21, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 325.9425.7025.8625.521. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU4121a35001000150020002500SE +/- 1.03, N = 3SE +/- 5.92, N = 3SE +/- 1.19, N = 3SE +/- 4.85, N = 3SE +/- 1.66, N = 32256.592220.832255.932220.332253.87MIN: 2253.05MIN: 2210.57MIN: 2252MIN: 2210.09MIN: 2248.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670412320406080100SE +/- 0.54, N = 3SE +/- 0.22, N = 3SE +/- 0.77, N = 3SE +/- 0.36, N = 3107.34108.88108.33107.171. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression4123306090120150SE +/- 1.00, N = 3SE +/- 0.33, N = 31291271281281. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU4121a35001000150020002500SE +/- 3.51, N = 3SE +/- 3.64, N = 3SE +/- 5.08, N = 3SE +/- 2.04, N = 3SE +/- 3.29, N = 32254.152223.462254.882222.972256.42MIN: 2247.34MIN: 2215.76MIN: 2245.82MIN: 2219.12MIN: 2250.831. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 541230.29930.59860.89791.19721.4965SE +/- 0.004, N = 3SE +/- 0.005, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 31.3261.3301.3111.319

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD4123600K1200K1800K2400K3000KSE +/- 28568.59, N = 13SE +/- 27951.55, N = 3SE +/- 12039.46, N = 3SE +/- 17037.49, N = 32607688.022635219.752641940.802634387.831. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU421a3246810SE +/- 0.01283, N = 3SE +/- 0.01672, N = 3SE +/- 0.02142, N = 3SE +/- 0.00815, N = 36.348266.416916.360656.33404MIN: 6.29MIN: 6.31MIN: 6.27MIN: 6.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 541231428425670SE +/- 0.19, N = 3SE +/- 0.23, N = 3SE +/- 0.47, N = 3SE +/- 0.30, N = 361.5761.7361.0360.991. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 841230.19580.39160.58740.78320.979SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.870.860.870.871. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP41234080120160200SE +/- 0.31, N = 3SE +/- 0.32, N = 3SE +/- 0.24, N = 3SE +/- 0.21, N = 3187.15189.28189.28188.681. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU412310002000300040005000SE +/- 4.67, N = 3SE +/- 8.06, N = 3SE +/- 9.84, N = 3SE +/- 9.22, N = 347064696472546721. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU421a33691215SE +/- 0.01901, N = 3SE +/- 0.02003, N = 3SE +/- 0.00635, N = 3SE +/- 0.00861, N = 38.987568.891128.935908.97340MIN: 8.91MIN: 8.82MIN: 8.89MIN: 8.921. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Warsow

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1080412320406080100SE +/- 0.12, N = 3SE +/- 0.47, N = 3SE +/- 0.09, N = 3SE +/- 0.17, N = 385.584.685.485.4

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein41231.1362.2723.4084.5445.68SE +/- 0.025, N = 3SE +/- 0.013, N = 3SE +/- 0.005, N = 3SE +/- 0.021, N = 34.9965.0295.0495.0341. (CXX) g++ options: -O3 -pthread -lm

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding4123400800120016002000SE +/- 11.14, N = 3SE +/- 0.00, N = 3SE +/- 6.50, N = 3SE +/- 0.00, N = 31948.611967.971954.981948.481. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU421a3246810SE +/- 0.06163, N = 3SE +/- 0.06379, N = 3SE +/- 0.01649, N = 3SE +/- 0.00909, N = 36.854946.849736.787796.79490MIN: 6.72MIN: 6.72MIN: 6.69MIN: 6.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p4123120240360480600SE +/- 0.66, N = 3SE +/- 0.61, N = 3SE +/- 1.05, N = 3SE +/- 0.26, N = 3527.21532.04526.97527.85MIN: 391.75 / MAX: 803.53MIN: 393.83 / MAX: 800.33MIN: 392.13 / MAX: 800.13MIN: 391.98 / MAX: 817.621. (CC) gcc options: -pthread

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression4123110220330440550SE +/- 1.33, N = 3SE +/- 1.00, N = 35275225265251. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3412330060090012001500SE +/- 1.89, N = 3SE +/- 1.22, N = 3SE +/- 0.70, N = 3SE +/- 0.47, N = 31572.561585.771570.801572.441. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression412320406080100SE +/- 0.58, N = 31081081071071. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.2141236001200180024003000SE +/- 5.47, N = 3SE +/- 8.50, N = 3SE +/- 9.53, N = 3SE +/- 10.28, N = 32901.82898.62901.02875.71. (CXX) g++ options: -O3 -march=native -rdynamic

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression412350100150200250SE +/- 0.58, N = 32242222232241. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 84123714212835SE +/- 0.17, N = 3SE +/- 0.08, N = 3SE +/- 0.14, N = 3SE +/- 0.12, N = 328.5528.8028.6828.581. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2412370140210280350SE +/- 1.01, N = 3SE +/- 0.27, N = 3SE +/- 0.93, N = 3SE +/- 0.54, N = 3299.93298.63301.24299.74MIN: 297.54 / MAX: 306.69MIN: 297.04 / MAX: 300.97MIN: 298.47 / MAX: 304.26MIN: 297.93 / MAX: 302.271. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

JPEG XL Decoding

CPU Threads: 1

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: 141231020304050SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 345.8046.2045.9745.84

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression4123130260390520650SE +/- 0.67, N = 3SE +/- 1.00, N = 35935925895881. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.041230.86961.73922.60883.47844.348SE +/- 0.004, N = 3SE +/- 0.008, N = 3SE +/- 0.010, N = 3SE +/- 0.023, N = 33.8503.8333.8653.860MIN: 3.8 / MAX: 4.36MIN: 3.76 / MAX: 6.28MIN: 3.81 / MAX: 5.88MIN: 3.77 / MAX: 20.881. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 741231428425670SE +/- 0.21, N = 3SE +/- 0.15, N = 3SE +/- 0.46, N = 3SE +/- 0.10, N = 361.1161.5961.3461.101. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p4123110220330440550SE +/- 0.31, N = 3SE +/- 0.56, N = 3SE +/- 0.94, N = 3SE +/- 1.00, N = 3482.13485.43484.00481.65MIN: 428.5 / MAX: 525.3MIN: 440.26 / MAX: 533.5MIN: 431.31 / MAX: 528.36MIN: 426.68 / MAX: 527.141. (CC) gcc options: -pthread

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption41231428425670SE +/- 0.06, N = 3SE +/- 0.44, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 360.4260.7660.2960.571. (CC) gcc options: -O2

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression4123306090120150SE +/- 0.33, N = 31341331341331. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v341231020304050SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.27, N = 3SE +/- 0.09, N = 343.4543.3843.1443.31MIN: 43.05 / MAX: 60.53MIN: 43.06 / MAX: 61.01MIN: 42.66 / MAX: 61.01MIN: 42.98 / MAX: 58.831. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering412380160240320400SE +/- 0.35, N = 3SE +/- 0.18, N = 3SE +/- 0.12, N = 3SE +/- 2.23, N = 3356.91357.33357.52354.931. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 741233691215SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 39.659.729.689.651. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU4123120240360480600SE +/- 0.73, N = 3SE +/- 0.50, N = 3SE +/- 1.36, N = 3SE +/- 0.76, N = 35605645625621. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 641230.3920.7841.1761.5681.96SE +/- 0.009, N = 3SE +/- 0.012, N = 3SE +/- 0.008, N = 3SE +/- 0.005, N = 31.7301.7311.7421.740

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression4123400800120016002000SE +/- 1.45, N = 3SE +/- 3.18, N = 3SE +/- 0.67, N = 3SE +/- 12.33, N = 320602055205720471. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-504123918273645SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 340.8140.8940.6440.80MIN: 40.64 / MAX: 58.22MIN: 40.57 / MAX: 57.96MIN: 40.34 / MAX: 57.72MIN: 40.51 / MAX: 56.521. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression41235001000150020002500SE +/- 6.36, N = 3SE +/- 4.18, N = 3SE +/- 5.36, N = 322562247224222471. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2412350M100M150M200M250MSE +/- 1517817.32, N = 3SE +/- 323790.40, N = 3SE +/- 614809.90, N = 3SE +/- 1277684.47, N = 32487054332502456332501188672489440671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU421a30.95911.91822.87733.83644.7955SE +/- 0.00315, N = 3SE +/- 0.00463, N = 3SE +/- 0.00597, N = 3SE +/- 0.00176, N = 34.253934.239044.236924.26266MIN: 4.22MIN: 4.2MIN: 4.2MIN: 4.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1412330060090012001500SE +/- 0.57, N = 3SE +/- 3.68, N = 3SE +/- 2.90, N = 3SE +/- 3.85, N = 31503.951499.711496.861495.411. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz412348121620SE +/- 0.02, N = 4SE +/- 0.05, N = 4SE +/- 0.06, N = 4SE +/- 0.06, N = 416.8716.7916.7816.87

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.441238M16M24M32M40MSE +/- 114405.39, N = 3SE +/- 116888.91, N = 3SE +/- 126361.09, N = 3SE +/- 107755.07, N = 3361410503633139036317627361750971. (CXX) g++ options: -O3 -fopenmp

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 74123130260390520650SE +/- 1.37, N = 3SE +/- 1.42, N = 3SE +/- 0.37, N = 3SE +/- 0.46, N = 3598.28600.59600.39597.491. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPU412316003200480064008000SE +/- 42.90, N = 3SE +/- 17.10, N = 3SE +/- 28.75, N = 3SE +/- 32.98, N = 37447748374627445

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M412360120180240300SE +/- 0.10, N = 3SE +/- 0.20, N = 3SE +/- 0.06, N = 3SE +/- 0.20, N = 3274.67273.49273.32274.381. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lgenericPatchFields -lscotchDecomp -lptscotchDecomp -lmeshTools -ldynamicMesh -lOpenFOAM -ldl -lm

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding423400800120016002000SE +/- 6.24, N = 3SE +/- 10.92, N = 3SE +/- 9.37, N = 21923.141929.501920.021. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 541231326395265SE +/- 0.15, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.20, N = 358.4258.6658.4258.381. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_10041232004006008001000SE +/- 0.30, N = 3SE +/- 0.34, N = 3SE +/- 3.25, N = 3SE +/- 0.39, N = 3782.34783.25779.64782.581. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU41234K8K12K16K20KSE +/- 16.98, N = 3SE +/- 8.66, N = 3SE +/- 15.42, N = 3SE +/- 45.62, N = 3165401646916544164831. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.941234080120160200SE +/- 0.14, N = 3SE +/- 0.39, N = 3SE +/- 0.37, N = 3SE +/- 0.52, N = 3179.06179.83179.22179.211. (CC) gcc options: -O2 -fvisibility=hidden

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1412380160240320400SE +/- 0.15, N = 3SE +/- 0.58, N = 3SE +/- 1.79, N = 3SE +/- 0.53, N = 3375.34374.78373.76374.181. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression4123160320480640800SE +/- 0.67, N = 3SE +/- 1.00, N = 3SE +/- 2.03, N = 37297277267291. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig41231530456075SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 368.5068.2268.3368.491. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig412320406080100SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 3105.78105.40105.47105.821. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression412360120180240300SE +/- 1.00, N = 3SE +/- 0.58, N = 3SE +/- 1.20, N = 3SE +/- 1.53, N = 32652662652651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K4123306090120150SE +/- 0.12, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.17, N = 3138.97139.30139.46139.08MIN: 131.06 / MAX: 156.71MIN: 131.22 / MAX: 157.1MIN: 131.42 / MAX: 157.06MIN: 130.77 / MAX: 156.831. (CC) gcc options: -pthread

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.041231.24362.48723.73084.97446.218SE +/- 0.015, N = 3SE +/- 0.023, N = 3SE +/- 0.014, N = 3SE +/- 0.037, N = 35.5275.5085.5225.521MIN: 5.39 / MAX: 23.38MIN: 5.39 / MAX: 7.3MIN: 5.41 / MAX: 7.94MIN: 5.38 / MAX: 7.491. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding4123400800120016002000SE +/- 0.00, N = 3SE +/- 3.32, N = 3SE +/- 2.65, N = 3SE +/- 3.48, N = 32012.012018.802017.102017.951. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10804123918273645SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 338.0638.0738.0837.96

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit412320406080100SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 398.1998.4998.5098.41MIN: 63.95 / MAX: 228.89MIN: 63.95 / MAX: 228.95MIN: 64.06 / MAX: 228.81MIN: 64.05 / MAX: 227.431. (CC) gcc options: -pthread

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics41234080120160200SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 3171.58171.56171.34171.051. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default41231.28842.57683.86525.15366.442SE +/- 0.009, N = 3SE +/- 0.017, N = 3SE +/- 0.022, N = 3SE +/- 0.027, N = 35.7095.7265.7195.7111. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080412390180270360450SE +/- 0.91, N = 3SE +/- 1.00, N = 3SE +/- 0.59, N = 3SE +/- 0.27, N = 3396.66396.72396.79395.62

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU412380160240320400SE +/- 0.60, N = 3SE +/- 0.88, N = 3SE +/- 1.30, N = 3SE +/- 0.60, N = 33513523513511. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding41232004006008001000SE +/- 0.00, N = 3SE +/- 0.49, N = 3SE +/- 0.14, N = 3SE +/- 0.24, N = 31141.911145.051144.511144.371. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7412370140210280350SE +/- 0.66, N = 3SE +/- 0.94, N = 3SE +/- 0.58, N = 3SE +/- 0.98, N = 3325.62325.17324.89325.751. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare41230.1760.3520.5280.7040.88SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 30.7800.7810.7810.7821. (CXX) g++ options: -O3 -pthread

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE41233691215SE +/- 0.016, N = 5SE +/- 0.014, N = 5SE +/- 0.016, N = 5SE +/- 0.015, N = 59.8259.8059.8149.8001. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption41232004006008001000SE +/- 0.87, N = 3SE +/- 1.39, N = 3SE +/- 0.61, N = 3SE +/- 0.48, N = 3908.3906.6907.7908.9

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2412350100150200250SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.42, N = 3212.98213.10212.97212.571. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption4123110220330440550SE +/- 0.38, N = 3SE +/- 1.12, N = 3SE +/- 0.66, N = 3SE +/- 0.06, N = 3503.8502.6503.0503.8

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack41233691215SE +/- 0.00, N = 5SE +/- 0.02, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 513.0013.0313.0013.001. (CXX) g++ options: -rdynamic

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 54123510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 318.3518.3718.3318.361. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode4123246810SE +/- 0.011, N = 5SE +/- 0.010, N = 5SE +/- 0.012, N = 5SE +/- 0.012, N = 57.6677.6657.6747.6821. (CXX) g++ options: -fvisibility=hidden -logg -lm

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 141230.10440.20880.31320.41760.522SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.4640.4640.4640.463

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks412313K26K39K52K65KSE +/- 124.65, N = 3SE +/- 13.67, N = 3SE +/- 100.54, N = 3SE +/- 63.76, N = 3621056197662100620091. (CXX) g++ options: -O3 -lpthread

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption41236001200180024003000SE +/- 6.10, N = 3SE +/- 1.38, N = 3SE +/- 0.85, N = 3SE +/- 2.27, N = 32636.02634.22630.72633.5

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption41232004006008001000SE +/- 0.93, N = 3SE +/- 0.40, N = 3SE +/- 0.45, N = 3SE +/- 1.20, N = 3907.6908.3906.5907.5

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU421a30.88611.77222.65833.54444.4305SE +/- 0.00145, N = 3SE +/- 0.00431, N = 3SE +/- 0.00444, N = 3SE +/- 0.00396, N = 33.938133.930883.932053.93529MIN: 3.91MIN: 3.89MIN: 3.89MIN: 3.91. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU421a39001800270036004500SE +/- 1.09, N = 3SE +/- 1.81, N = 3SE +/- 2.30, N = 3SE +/- 1.28, N = 34032.074025.104024.784029.87MIN: 4027.84MIN: 4019.74MIN: 4018.57MIN: 4025.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU421a39001800270036004500SE +/- 1.40, N = 3SE +/- 2.32, N = 3SE +/- 1.13, N = 3SE +/- 2.36, N = 34030.494023.674025.374030.56MIN: 4026.5MIN: 4017.93MIN: 4021.62MIN: 4023.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression41231302603905206505865855865861. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression412330060090012001500SE +/- 0.60, N = 3SE +/- 0.06, N = 3SE +/- 0.50, N = 3SE +/- 0.28, N = 31174.591173.141175.091173.721. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -ljpeg -lgif -lpthread

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption41232004006008001000SE +/- 0.85, N = 2SE +/- 0.64, N = 3SE +/- 0.80, N = 3SE +/- 1.18, N = 3923.9924.7924.5925.4

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption4123110220330440550SE +/- 0.35, N = 3SE +/- 0.09, N = 3SE +/- 0.64, N = 3SE +/- 0.15, N = 3503.6503.6502.8503.4

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption41236001200180024003000SE +/- 8.59, N = 3SE +/- 6.46, N = 3SE +/- 0.93, N = 3SE +/- 3.07, N = 32620.22617.12621.12617.5

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1412360120180240300SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3271.01270.62270.94271.02MIN: 270.36 / MAX: 271.8MIN: 270.11 / MAX: 271.33MIN: 270.24 / MAX: 271.7MIN: 270.32 / MAX: 271.671. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption41235001000150020002500SE +/- 6.54, N = 3SE +/- 3.61, N = 3SE +/- 5.47, N = 3SE +/- 1.04, N = 32359.62356.32359.42358.1

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data412320040060080010001044.591046.021045.371044.65

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption41232004006008001000SE +/- 0.50, N = 3SE +/- 0.49, N = 3SE +/- 0.96, N = 3SE +/- 0.92, N = 3924.7925.4925.2925.9

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 10804123612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 323.6223.6123.6323.60

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080412380160240320400SE +/- 0.12, N = 3SE +/- 0.19, N = 3SE +/- 0.38, N = 3SE +/- 0.59, N = 3377.95377.78378.07377.61

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression41232004006008001000SE +/- 0.67, N = 38418408408401. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU421a39001800270036004500SE +/- 1.14, N = 3SE +/- 1.97, N = 3SE +/- 0.29, N = 3SE +/- 2.50, N = 34028.184023.424025.944028.19MIN: 4024.52MIN: 4017.95MIN: 4021.96MIN: 4022.481. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption4123110220330440550SE +/- 0.41, N = 3SE +/- 0.10, N = 2SE +/- 0.26, N = 3SE +/- 0.43, N = 3506.8506.8506.5506.2

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption4123110220330440550SE +/- 0.35, N = 3SE +/- 0.03, N = 3SE +/- 0.34, N = 3SE +/- 0.07, N = 3507.2507.2506.6507.2

OpenFOAM

Input: Motorbike 60M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60M412330060090012001500SE +/- 1.05, N = 3SE +/- 0.86, N = 3SE +/- 0.28, N = 3SE +/- 0.79, N = 31219.281218.521219.631219.871. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lgenericPatchFields -lscotchDecomp -lptscotchDecomp -lmeshTools -ldynamicMesh -lOpenFOAM -ldl -lm

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption41235001000150020002500SE +/- 5.53, N = 3SE +/- 0.52, N = 3SE +/- 3.53, N = 3SE +/- 0.71, N = 32358.02358.22355.82355.6

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C41235K10K15K20K25KSE +/- 10.80, N = 3SE +/- 10.13, N = 3SE +/- 49.20, N = 3SE +/- 8.06, N = 324209.0824235.4524224.8024210.111. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool4123200K400K600K800K1000KSE +/- 930.00, N = 3SE +/- 930.00, N = 3855749855749856679856679

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile41234080120160200SE +/- 0.12, N = 3SE +/- 0.32, N = 3SE +/- 0.19, N = 3SE +/- 0.26, N = 3184.41184.26184.26184.36

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Decompression412330060090012001500SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 312951295129412951. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads412313K26K39K52K65KSE +/- 32.42, N = 3SE +/- 13.45, N = 3SE +/- 28.06, N = 3SE +/- 42.55, N = 3618806183861878618841. (CXX) g++ options: -O3 -lpthread

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha5124123400K800K1200K1600K2000KSE +/- 1237.33, N = 3SE +/- 2147.17, N = 31974719197471919734821974724

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU421a30.48210.96421.44631.92842.4105SE +/- 0.00130, N = 3SE +/- 0.00277, N = 3SE +/- 0.00109, N = 3SE +/- 0.00203, N = 32.141482.141542.142752.14220MIN: 2.13MIN: 2.12MIN: 2.12MIN: 2.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP412313K26K39K52K65KSE +/- 8.11, N = 3SE +/- 6.00, N = 3SE +/- 1.67, N = 3SE +/- 4.04, N = 3617436173561728617331. (CXX) g++ options: -O3 -lpthread

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU41231428425670SE +/- 0.17, N = 3SE +/- 0.17, N = 3SE +/- 0.00, N = 3SE +/- 0.17, N = 3626262621. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt


Phoronix Test Suite v10.8.4