core-i7-8086k-2021

Intel Core i7-8086K testing with an ASUS PRIME Z370-A (1802 BIOS) and ASUS Intel UHD 630 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102185-HA-COREI780895

Run Management

Result Identifier    Date Run       Test Duration
1                    April 16       8 Hours, 13 Minutes
2                    April 16       8 Hours, 34 Minutes
1a                   April 16       32 Minutes
3                    April 17       8 Hours, 33 Minutes
4                    February 17    8 Hours, 59 Minutes
Average                             6 Hours, 58 Minutes



core-i7-8086k-2021 system details (runs 1, 2, 1a, 3, 4)

Processor: Intel Core i7-8086K @ 5.00GHz (6 Cores / 12 Threads)
Motherboard: ASUS PRIME Z370-A (1802 BIOS)
Chipset: Intel 8th Gen Core
Memory: 8GB
Disk: 118GB INTEL SSDPEK1W120GA
Graphics: ASUS Intel UHD 630 3GB (1200MHz)
Audio: Realtek ALC1220
Monitor: G237HL
Network: Intel I219-V
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc8daily20201009-generic (x86_64) 20201008
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
OpenGL: 4.6 Mesa 20.0.8
Vulkan: 1.2.131
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 - Thermald 1.9.1

Python Details: 1, 2, 3, 4: Python 2.7.18 + Python 3.8.5

Security Details: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

[Result overview table for core-i7-8086k-2021: one row per test, one column per run (1, 2, 1a, 3, 4). The cell values were fused together during extraction and cannot be separated reliably.]

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
Run 1: 250245633 (SE +/- 323790.40, N = 3; Min: 249603600 / Avg: 250245633.33 / Max: 250639900)
Run 2: 250118867 (SE +/- 614809.90, N = 3; Min: 248897700 / Avg: 250118866.67 / Max: 250854100)
Run 3: 248944067 (SE +/- 1277684.47, N = 3; Min: 246388700 / Avg: 248944066.67 / Max: 250224700)
Run 4: 248705433 (SE +/- 1517817.32, N = 3; Min: 245672300 / Avg: 248705433.33 / Max: 250328700)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
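
The SE figures can be cross-checked from the Min / Avg / Max values. The sketch below is not taken from the Phoronix Test Suite; it assumes that each result is the mean of N = 3 samples and that SE means the sample standard deviation divided by the square root of N. The helper names `third_sample` and `standard_error` are hypothetical. With three samples, the one that is neither the minimum nor the maximum follows from the average, so the AMG run 1 figures are enough to recompute the standard error.

```python
import math


def third_sample(minimum, average, maximum):
    """Recover the remaining sample of a 3-sample run from its min, mean, and max."""
    return 3 * average - minimum - maximum


def standard_error(samples):
    """Sample standard deviation (n - 1 denominator) divided by sqrt(n)."""
    n = len(samples)
    mean = sum(samples) / n
    variance = sum((x - mean) ** 2 for x in samples) / (n - 1)
    return math.sqrt(variance / n)


# AMG run 1 as reported: Min 249603600 / Avg 250245633.33 / Max 250639900
run1 = [
    249603600,
    third_sample(249603600, 250245633.33, 250639900),
    250639900,
]
se_run1 = standard_error(run1)
print(f"Recomputed SE for AMG run 1: {se_run1:.2f} (reported: 323790.40)")
```

Because the published average is rounded to two decimals, the recomputed value can differ slightly from the reported one.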

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1 - Video Input: Chimera 1080p (FPS, More Is Better)
Run 1: 532.04 (SE +/- 0.61, N = 3; Min: 530.98 / Avg: 532.04 / Max: 533.09; MIN: 393.83 / MAX: 800.33)
Run 2: 526.97 (SE +/- 1.05, N = 3; Min: 525.18 / Avg: 526.97 / Max: 528.81; MIN: 392.13 / MAX: 800.13)
Run 3: 527.85 (SE +/- 0.26, N = 3; Min: 527.48 / Avg: 527.85 / Max: 528.34; MIN: 391.98 / MAX: 817.62)
Run 4: 527.21 (SE +/- 0.66, N = 3; Min: 525.9 / Avg: 527.21 / Max: 527.89; MIN: 391.75 / MAX: 803.53)

dav1d 0.8.1 - Video Input: Summer Nature 4K (FPS, More Is Better)
Run 1: 139.30 (SE +/- 0.09, N = 3; Min: 139.19 / Avg: 139.3 / Max: 139.47; MIN: 131.22 / MAX: 157.1)
Run 2: 139.46 (SE +/- 0.03, N = 3; Min: 139.41 / Avg: 139.46 / Max: 139.52; MIN: 131.42 / MAX: 157.06)
Run 3: 139.08 (SE +/- 0.17, N = 3; Min: 138.79 / Avg: 139.08 / Max: 139.39; MIN: 130.77 / MAX: 156.83)
Run 4: 138.97 (SE +/- 0.12, N = 3; Min: 138.74 / Avg: 138.97 / Max: 139.13; MIN: 131.06 / MAX: 156.71)

dav1d 0.8.1 - Video Input: Summer Nature 1080p (FPS, More Is Better)
Run 1: 485.43 (SE +/- 0.56, N = 3; Min: 484.32 / Avg: 485.43 / Max: 486.15; MIN: 440.26 / MAX: 533.5)
Run 2: 484.00 (SE +/- 0.94, N = 3; Min: 482.26 / Avg: 484 / Max: 485.48; MIN: 431.31 / MAX: 528.36)
Run 3: 481.65 (SE +/- 1.00, N = 3; Min: 479.69 / Avg: 481.65 / Max: 482.98; MIN: 426.68 / MAX: 527.14)
Run 4: 482.13 (SE +/- 0.31, N = 3; Min: 481.51 / Avg: 482.13 / Max: 482.52; MIN: 428.5 / MAX: 525.3)

dav1d 0.8.1 - Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
Run 1: 98.49 (SE +/- 0.02, N = 3; Min: 98.45 / Avg: 98.49 / Max: 98.52; MIN: 63.95 / MAX: 228.95)
Run 2: 98.50 (SE +/- 0.11, N = 3; Min: 98.28 / Avg: 98.5 / Max: 98.66; MIN: 64.06 / MAX: 228.81)
Run 3: 98.41 (SE +/- 0.08, N = 3; Min: 98.26 / Avg: 98.41 / Max: 98.52; MIN: 64.05 / MAX: 227.43)
Run 4: 98.19 (SE +/- 0.13, N = 3; Min: 97.96 / Avg: 98.19 / Max: 98.4; MIN: 63.95 / MAX: 228.89)

1. (CC) gcc options: -pthread

ParaView

This test runs benchmarks of ParaView, an open-source data analytics and visualization application. ParaView describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.

ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
Run 1: 23.61 (SE +/- 0.01, N = 3; Min: 23.59 / Avg: 23.61 / Max: 23.63)
Run 2: 23.63 (SE +/- 0.03, N = 3; Min: 23.58 / Avg: 23.63 / Max: 23.66)
Run 3: 23.60 (SE +/- 0.04, N = 3; Min: 23.53 / Avg: 23.6 / Max: 23.66)
Run 4: 23.62 (SE +/- 0.01, N = 3; Min: 23.61 / Avg: 23.62 / Max: 23.63)

ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
Run 1: 38.07 (SE +/- 0.10, N = 3; Min: 37.91 / Avg: 38.07 / Max: 38.24)
Run 2: 38.08 (SE +/- 0.06, N = 3; Min: 37.96 / Avg: 38.08 / Max: 38.15)
Run 3: 37.96 (SE +/- 0.03, N = 3; Min: 37.92 / Avg: 37.96 / Max: 38.01)
Run 4: 38.06 (SE +/- 0.09, N = 3; Min: 37.94 / Avg: 38.06 / Max: 38.23)

Warsow

This is a benchmark of Warsow, a popular open-source first-person shooter. This game uses the QFusion engine. Learn more via the OpenBenchmarking.org test page.

Warsow 2.5 Beta - Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
Run 1: 84.6 (SE +/- 0.47, N = 3; Min: 83.7 / Avg: 84.63 / Max: 85.2)
Run 2: 85.4 (SE +/- 0.09, N = 3; Min: 85.3 / Avg: 85.43 / Max: 85.6)
Run 3: 85.4 (SE +/- 0.17, N = 3; Min: 85.1 / Avg: 85.4 / Max: 85.7)
Run 4: 85.5 (SE +/- 0.12, N = 3; Min: 85.3 / Avg: 85.5 / Max: 85.7)

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 - Speed: 1 (Frames Per Second, More Is Better)
Run 1: 0.464 (SE +/- 0.000, N = 3; Min: 0.46 / Avg: 0.46 / Max: 0.47)
Run 2: 0.464 (SE +/- 0.000, N = 3; Min: 0.46 / Avg: 0.46 / Max: 0.47)
Run 3: 0.463 (SE +/- 0.001, N = 3; Min: 0.46 / Avg: 0.46 / Max: 0.47)
Run 4: 0.464 (SE +/- 0.001, N = 3; Min: 0.46 / Avg: 0.46 / Max: 0.47)

rav1e 0.4 - Speed: 5 (Frames Per Second, More Is Better)
Run 1: 1.330 (SE +/- 0.005, N = 3; Min: 1.32 / Avg: 1.33 / Max: 1.34)
Run 2: 1.311 (SE +/- 0.005, N = 3; Min: 1.31 / Avg: 1.31 / Max: 1.32)
Run 3: 1.319 (SE +/- 0.002, N = 3; Min: 1.32 / Avg: 1.32 / Max: 1.32)
Run 4: 1.326 (SE +/- 0.004, N = 3; Min: 1.32 / Avg: 1.33 / Max: 1.33)

rav1e 0.4 - Speed: 6 (Frames Per Second, More Is Better)
Run 1: 1.731 (SE +/- 0.012, N = 3; Min: 1.71 / Avg: 1.73 / Max: 1.75)
Run 2: 1.742 (SE +/- 0.008, N = 3; Min: 1.73 / Avg: 1.74 / Max: 1.76)
Run 3: 1.740 (SE +/- 0.005, N = 3; Min: 1.73 / Avg: 1.74 / Max: 1.75)
Run 4: 1.730 (SE +/- 0.009, N = 3; Min: 1.71 / Avg: 1.73 / Max: 1.74)

rav1e 0.4 - Speed: 10 (Frames Per Second, More Is Better)
Run 1: 3.872 (SE +/- 0.019, N = 3; Min: 3.85 / Avg: 3.87 / Max: 3.91)
Run 2: 3.784 (SE +/- 0.061, N = 3; Min: 3.66 / Avg: 3.78 / Max: 3.85)
Run 3: 3.896 (SE +/- 0.026, N = 3; Min: 3.84 / Avg: 3.9 / Max: 3.93)
Run 4: 3.841 (SE +/- 0.014, N = 3; Min: 3.81 / Avg: 3.84 / Max: 3.86)

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.6 - Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 1: 352 (SE +/- 0.88, N = 3; Min: 350 / Avg: 351.67 / Max: 353)
Run 2: 351 (SE +/- 1.30, N = 3; Min: 349 / Avg: 351.17 / Max: 353.5)
Run 3: 351 (SE +/- 0.60, N = 3; Min: 350.5 / Avg: 351.33 / Max: 352.5)
Run 4: 351 (SE +/- 0.60, N = 3; Min: 350 / Avg: 351.17 / Max: 352)

ONNX Runtime 1.6 - Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 1: 564 (SE +/- 0.50, N = 3; Min: 563 / Avg: 564 / Max: 564.5)
Run 2: 562 (SE +/- 1.36, N = 3; Min: 559.5 / Avg: 562.17 / Max: 564)
Run 3: 562 (SE +/- 0.76, N = 3; Min: 560 / Avg: 561.5 / Max: 562.5)
Run 4: 560 (SE +/- 0.73, N = 3; Min: 559 / Avg: 560.17 / Max: 561.5)

ONNX Runtime 1.6 - Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 1: 62 (SE +/- 0.17, N = 3; Min: 61.5 / Avg: 61.67 / Max: 62)
Run 2: 62 (SE +/- 0.00, N = 3; Min: 61.5 / Avg: 61.5 / Max: 61.5)
Run 3: 62 (SE +/- 0.17, N = 3; Min: 61.5 / Avg: 61.83 / Max: 62)
Run 4: 62 (SE +/- 0.17, N = 3; Min: 61.5 / Avg: 61.67 / Max: 62)

ONNX Runtime 1.6 - Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 1: 16469 (SE +/- 8.66, N = 3; Min: 16453 / Avg: 16469.33 / Max: 16482.5)
Run 2: 16544 (SE +/- 15.42, N = 3; Min: 16528 / Avg: 16543.67 / Max: 16574.5)
Run 3: 16483 (SE +/- 45.62, N = 3; Min: 16434 / Avg: 16482.83 / Max: 16574)
Run 4: 16540 (SE +/- 16.98, N = 3; Min: 16508 / Avg: 16540.33 / Max: 16565.5)

ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 1: 4696 (SE +/- 8.06, N = 3; Min: 4681 / Avg: 4696.17 / Max: 4708.5)
Run 2: 4725 (SE +/- 9.84, N = 3; Min: 4705 / Avg: 4724.67 / Max: 4735)
Run 3: 4672 (SE +/- 9.22, N = 3; Min: 4653.5 / Avg: 4671.5 / Max: 4684)
Run 4: 4706 (SE +/- 4.67, N = 3; Min: 4697 / Avg: 4706.33 / Max: 4711)

1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some previous ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
Run 1: 189.28 (SE +/- 0.32, N = 3; Min: 188.68 / Avg: 189.28 / Max: 189.75)
Run 2: 189.28 (SE +/- 0.24, N = 3; Min: 189.04 / Avg: 189.28 / Max: 189.75)
Run 3: 188.68 (SE +/- 0.21, N = 3; Min: 188.32 / Avg: 188.68 / Max: 189.04)
Run 4: 187.15 (SE +/- 0.31, N = 3; Min: 186.57 / Avg: 187.15 / Max: 187.62)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.
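
To make the "Iterations Per Second" unit of the PBKDF2 results concrete, here is a minimal sketch that times PBKDF2-HMAC-SHA512 with Python's standard `hashlib`. It is only an illustration of the metric, not how cryptsetup measures it; the password, salt, iteration count, and the function name `pbkdf2_iterations_per_second` are all assumptions, and the figures it prints are not comparable to the results on this page.

```python
import hashlib
import time


def pbkdf2_iterations_per_second(hash_name="sha512", iterations=200_000):
    """Time one PBKDF2 derivation and return iterations divided by elapsed seconds."""
    password = b"benchmark-password"  # arbitrary illustrative input
    salt = b"benchmark-salt"          # arbitrary illustrative input
    start = time.perf_counter()
    hashlib.pbkdf2_hmac(hash_name, password, salt, iterations)
    elapsed = time.perf_counter() - start
    return iterations / elapsed


rate = pbkdf2_iterations_per_second()
print(f"PBKDF2-sha512: {rate:,.0f} iterations per second")
```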

Cryptsetup - PBKDF2-sha512 (Iterations Per Second, More Is Better)
Run 1: 1974719
Run 2: 1973482 (SE +/- 1237.33, N = 3; Min: 1971007 / Avg: 1973481.67 / Max: 1974719)
Run 3: 1974724 (SE +/- 2147.17, N = 3; Min: 1971007 / Avg: 1974723.67 / Max: 1978445)
Run 4: 1974719

Cryptsetup - PBKDF2-whirlpool (Iterations Per Second, More Is Better)
Run 1: 855749 (SE +/- 930.00, N = 3; Min: 853889 / Avg: 855749 / Max: 856679)
Run 2: 856679
Run 3: 856679
Run 4: 855749 (SE +/- 930.00, N = 3; Min: 853889 / Avg: 855749 / Max: 856679)

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.
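
As a rough illustration of what an in-memory compression benchmark measures, the sketch below times compression and decompression of a buffer held entirely in memory and reports throughput relative to the uncompressed size. It is not lzbench: it uses Python's standard `lzma` and `zlib` modules, a synthetic repetitive buffer instead of a kernel tarball, and assumes 1 MB = 1,000,000 bytes. The helper names `timed` and `mb_per_s` are hypothetical, and the numbers it prints are not comparable to the results on this page.

```python
import lzma
import time
import zlib


def timed(func, payload):
    """Run func(payload) once and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = func(payload)
    return result, time.perf_counter() - start


def mb_per_s(num_bytes, seconds):
    """Throughput in MB/s, taking 1 MB = 1,000,000 bytes (an assumption)."""
    return num_bytes / seconds / 1e6


# Hypothetical in-memory input; lzbench itself uses a Linux kernel source tarball.
data = b"static int example(void) { return 0; }\n" * 50_000

codecs = {
    "xz preset 0": (lambda d: lzma.compress(d, preset=0), lzma.decompress),
    "zlib level 1": (lambda d: zlib.compress(d, 1), zlib.decompress),
}

results = {}
for name, (compress, decompress) in codecs.items():
    packed, c_time = timed(compress, data)
    restored, d_time = timed(decompress, packed)
    assert restored == data  # the round trip must be lossless
    # Both rates are measured against the uncompressed size.
    results[name] = (mb_per_s(len(data), c_time), mb_per_s(len(data), d_time))
    print(f"{name}: compression {results[name][0]:.0f} MB/s, "
          f"decompression {results[name][1]:.0f} MB/s")
```

A single timed pass like this is noisy; repeating each measurement and averaging, as the N = 3 results here do, gives steadier numbers.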

lzbench 1.8 - Test: XZ 0 - Process: Compression (MB/s, More Is Better)
Run 1: 49
Run 2: 49
Run 3: 49
Run 4: 50

lzbench 1.8 - Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
Run 1: 133
Run 2: 134
Run 3: 133
Run 4: 134
Spread reported for one run only (the extracted text does not show which): SE +/- 0.33, N = 3; Min: 133 / Avg: 133.67 / Max: 134

lzbench 1.8 - Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
Run 1: 592 (SE +/- 0.67, N = 3; Min: 591 / Avg: 592.33 / Max: 593)
Run 2: 589
Run 3: 588 (SE +/- 1.00, N = 3; Min: 587 / Avg: 588 / Max: 590)
Run 4: 593

lzbench 1.8 - Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
Run 1: 2055 (SE +/- 3.18, N = 3; Min: 2049 / Avg: 2055.33 / Max: 2059)
Run 2: 2057 (SE +/- 0.67, N = 3; Min: 2056 / Avg: 2057.33 / Max: 2058)
Run 3: 2047 (SE +/- 12.33, N = 3; Min: 2022 / Avg: 2046.67 / Max: 2059)
Run 4: 2060 (SE +/- 1.45, N = 3; Min: 2058 / Avg: 2060.33 / Max: 2063)

lzbench 1.8 - Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
Run 1: 108
Run 2: 107
Run 3: 107
Run 4: 108
Spread reported for one run only (the extracted text does not show which): SE +/- 0.58, N = 3; Min: 107 / Avg: 108 / Max: 109

lzbench 1.8 - Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
Run 1: 2247 (SE +/- 4.18, N = 3; Min: 2239 / Avg: 2247.33 / Max: 2252)
Run 2: 2242
Run 3: 2247 (SE +/- 5.36, N = 3; Min: 2236 / Avg: 2246.67 / Max: 2253)
Run 4: 2256 (SE +/- 6.36, N = 3; Min: 2245 / Avg: 2256.33 / Max: 2267)

lzbench 1.8 - Test: Crush 0 - Process: Compression (MB/s, More Is Better)
Run 1: 127
Run 2: 128 (SE +/- 1.00, N = 3; Min: 127 / Avg: 128 / Max: 130)
Run 3: 128 (SE +/- 0.33, N = 3; Min: 128 / Avg: 128.33 / Max: 129)
Run 4: 129

lzbench 1.8 - Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
Run 1: 585
Run 2: 586
Run 3: 586
Run 4: 586

lzbench 1.8 - Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
Run 1: 522 (SE +/- 1.33, N = 3; Min: 519 / Avg: 521.67 / Max: 523)
Run 2: 526
Run 3: 525 (SE +/- 1.00, N = 3; Min: 524 / Avg: 525 / Max: 527)
Run 4: 527

lzbench 1.8 - Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
Run 1: 727 (SE +/- 1.00, N = 3; Min: 725 / Avg: 727 / Max: 728)
Run 2: 726 (SE +/- 2.03, N = 3; Min: 722 / Avg: 725.67 / Max: 729)
Run 3: 729
Run 4: 729
Spread reported for one of runs 3 and 4 (the extracted text does not show which): SE +/- 0.67, N = 3; Min: 728 / Avg: 728.67 / Max: 730

lzbench 1.8 - Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
Run 1: 222
Run 2: 223 (SE +/- 0.58, N = 3; Min: 222 / Avg: 223 / Max: 224)
Run 3: 224
Run 4: 224

lzbench 1.8 - Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
Run 1: 840
Run 2: 840
Run 3: 840
Run 4: 841 (SE +/- 0.67, N = 3; Min: 840 / Avg: 841.33 / Max: 842)

lzbench 1.8 - Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
Run 1: 266 (SE +/- 0.58, N = 3; Min: 265 / Avg: 266 / Max: 267)
Run 2: 265 (SE +/- 1.20, N = 3; Min: 263 / Avg: 265.33 / Max: 267)
Run 3: 265 (SE +/- 1.53, N = 3; Min: 262 / Avg: 265 / Max: 267)
Run 4: 265 (SE +/- 1.00, N = 3; Min: 264 / Avg: 265 / Max: 267)

lzbench 1.8 - Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
Run 1: 1295 (SE +/- 0.67, N = 3; Min: 1294 / Avg: 1294.67 / Max: 1296)
Run 2: 1294 (SE +/- 0.33, N = 3; Min: 1294 / Avg: 1294.33 / Max: 1295)
Run 3: 1295 (SE +/- 0.58, N = 3; Min: 1294 / Avg: 1295 / Max: 1296)
Run 4: 1295 (SE +/- 0.33, N = 3; Min: 1295 / Avg: 1295.33 / Max: 1296)

1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.21 (MFLOPS, More Is Better)
Run 1: 2898.6 (SE +/- 8.50, N = 3; Min: 2889.9 / Avg: 2898.6 / Max: 2915.6)
Run 2: 2901.0 (SE +/- 9.53, N = 3; Min: 2882.2 / Avg: 2901 / Max: 2913.1)
Run 3: 2875.7 (SE +/- 10.28, N = 3; Min: 2859.7 / Avg: 2875.73 / Max: 2894.9)
Run 4: 2901.8 (SE +/- 5.47, N = 3; Min: 2890.9 / Avg: 2901.8 / Max: 2908.1)
1. (CXX) g++ options: -O3 -march=native -rdynamic

Cryptsetup

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption12346001200180024003000SE +/- 6.46, N = 3SE +/- 0.93, N = 3SE +/- 3.07, N = 3SE +/- 8.59, N = 32617.12621.12617.52620.2
OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption12345001000150020002500Min: 2604.8 / Avg: 2617.07 / Max: 2626.7Min: 2619.9 / Avg: 2621.07 / Max: 2622.9Min: 2612.5 / Avg: 2617.53 / Max: 2623.1Min: 2605.9 / Avg: 2620.17 / Max: 2635.6

Cryptsetup - AES-XTS 256b Decryption (MiB/s, More Is Better)
Run 1: 2634.2 (SE +/- 1.38, N = 3; Min: 2632.3 / Avg: 2634.23 / Max: 2636.9)
Run 2: 2630.7 (SE +/- 0.85, N = 3; Min: 2629 / Avg: 2630.7 / Max: 2631.6)
Run 3: 2633.5 (SE +/- 2.27, N = 3; Min: 2629.2 / Avg: 2633.5 / Max: 2636.9)
Run 4: 2636.0 (SE +/- 6.10, N = 3; Min: 2623.9 / Avg: 2636 / Max: 2643.4)

Cryptsetup - Serpent-XTS 256b Encryption (MiB/s, More Is Better)
Run 1: 906.6 (SE +/- 1.39, N = 3; Min: 903.9 / Avg: 906.6 / Max: 908.5)
Run 2: 907.7 (SE +/- 0.61, N = 3; Min: 906.7 / Avg: 907.7 / Max: 908.8)
Run 3: 908.9 (SE +/- 0.48, N = 3; Min: 908 / Avg: 908.93 / Max: 909.6)
Run 4: 908.3 (SE +/- 0.87, N = 3; Min: 906.6 / Avg: 908.33 / Max: 909.4)

Cryptsetup - Serpent-XTS 256b Decryption (MiB/s, More Is Better)
Run 1: 925.4 (SE +/- 0.49, N = 3; Min: 924.4 / Avg: 925.37 / Max: 926)
Run 2: 925.2 (SE +/- 0.96, N = 3; Min: 923.4 / Avg: 925.17 / Max: 926.7)
Run 3: 925.9 (SE +/- 0.92, N = 3; Min: 924.1 / Avg: 925.9 / Max: 927.1)
Run 4: 924.7 (SE +/- 0.50, N = 3; Min: 923.7 / Avg: 924.67 / Max: 925.4)

Cryptsetup - Twofish-XTS 256b Encryption (MiB/s, More Is Better)
Run 1: 502.6 (SE +/- 1.12, N = 3; Min: 500.4 / Avg: 502.63 / Max: 503.8)
Run 2: 503.0 (SE +/- 0.66, N = 3; Min: 501.7 / Avg: 502.97 / Max: 503.9)
Run 3: 503.8 (SE +/- 0.06, N = 3; Min: 503.7 / Avg: 503.8 / Max: 503.9)
Run 4: 503.8 (SE +/- 0.38, N = 3; Min: 503 / Avg: 503.77 / Max: 504.2)

Cryptsetup - Twofish-XTS 256b Decryption (MiB/s, More Is Better)
Run 1: 507.2 (SE +/- 0.03, N = 3; Min: 507.1 / Avg: 507.17 / Max: 507.2)
Run 2: 506.6 (SE +/- 0.34, N = 3; Min: 506.2 / Avg: 506.63 / Max: 507.3)
Run 3: 507.2 (SE +/- 0.07, N = 3; Min: 507.1 / Avg: 507.17 / Max: 507.3)
Run 4: 507.2 (SE +/- 0.35, N = 3; Min: 506.5 / Avg: 507.17 / Max: 507.7)

Cryptsetup - AES-XTS 512b Encryption (MiB/s, More Is Better)
Run 1: 2358.2 (SE +/- 0.52, N = 3; Min: 2357.2 / Avg: 2358.23 / Max: 2358.8)
Run 2: 2355.8 (SE +/- 3.53, N = 3; Min: 2351.7 / Avg: 2355.77 / Max: 2362.8)
Run 3: 2355.6 (SE +/- 0.71, N = 3; Min: 2354.2 / Avg: 2355.6 / Max: 2356.5)
Run 4: 2358.0 (SE +/- 5.53, N = 3; Min: 2347.4 / Avg: 2357.97 / Max: 2366.1)

Cryptsetup - AES-XTS 512b Decryption (MiB/s, More Is Better)
Run 1: 2356.3 (SE +/- 3.61, N = 3; Min: 2349.2 / Avg: 2356.3 / Max: 2361)
Run 2: 2359.4 (SE +/- 5.47, N = 3; Min: 2353.5 / Avg: 2359.37 / Max: 2370.3)
Run 3: 2358.1 (SE +/- 1.04, N = 3; Min: 2356.2 / Avg: 2358.1 / Max: 2359.8)
Run 4: 2359.6 (SE +/- 6.54, N = 3; Min: 2347.1 / Avg: 2359.57 / Max: 2369.2)

Cryptsetup - Serpent-XTS 512b Encryption (MiB/s, More Is Better)
Run 1: 908.3 (SE +/- 0.40, N = 3; Min: 907.6 / Avg: 908.3 / Max: 909)
Run 2: 906.5 (SE +/- 0.45, N = 3; Min: 905.9 / Avg: 906.53 / Max: 907.4)
Run 3: 907.5 (SE +/- 1.20, N = 3; Min: 905.1 / Avg: 907.5 / Max: 908.7)
Run 4: 907.6 (SE +/- 0.93, N = 3; Min: 905.9 / Avg: 907.63 / Max: 909.1)

Cryptsetup - Serpent-XTS 512b Decryption (MiB/s, More Is Better)
Run 1: 924.7 (SE +/- 0.64, N = 3; Min: 923.4 / Avg: 924.67 / Max: 925.5)
Run 2: 924.5 (SE +/- 0.80, N = 3; Min: 923.3 / Avg: 924.47 / Max: 926)
Run 3: 925.4 (SE +/- 1.18, N = 3; Min: 923.1 / Avg: 925.43 / Max: 926.9)
Run 4: 923.9 (SE +/- 0.85, N = 2; Min: 923 / Avg: 923.85 / Max: 924.7)

Cryptsetup - Twofish-XTS 512b Encryption (MiB/s, More Is Better)
Run 1: 503.6 (SE +/- 0.09, N = 3; Min: 503.4 / Avg: 503.57 / Max: 503.7)
Run 2: 502.8 (SE +/- 0.64, N = 3; Min: 501.6 / Avg: 502.77 / Max: 503.8)
Run 3: 503.4 (SE +/- 0.15, N = 3; Min: 503.1 / Avg: 503.37 / Max: 503.6)
Run 4: 503.6 (SE +/- 0.35, N = 3; Min: 502.9 / Avg: 503.57 / Max: 504.1)

Cryptsetup - Twofish-XTS 512b Decryption (MiB/s, More Is Better)
Run 1: 506.8 (SE +/- 0.10, N = 2; Min: 506.7 / Avg: 506.8 / Max: 506.9)
Run 2: 506.5 (SE +/- 0.26, N = 3; Min: 506.1 / Avg: 506.5 / Max: 507)
Run 3: 506.2 (SE +/- 0.43, N = 3; Min: 505.4 / Avg: 506.17 / Max: 506.9)
Run 4: 506.8 (SE +/- 0.41, N = 3; Min: 506 / Avg: 506.77 / Max: 507.4)

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some previous ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
Run 1: 1145.05 (SE +/- 0.49, N = 3; Min: 1144.37 / Avg: 1145.05 / Max: 1146.01)
Run 2: 1144.51 (SE +/- 0.14, N = 3; Min: 1144.37 / Avg: 1144.51 / Max: 1144.78)
Run 3: 1144.37 (SE +/- 0.24, N = 3; Min: 1143.96 / Avg: 1144.37 / Max: 1144.78)
Run 4: 1141.91 (SE +/- 0.00, N = 3; Min: 1141.91 / Avg: 1141.91 / Max: 1141.91)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
Run 1: 2018.80 (SE +/- 3.32, N = 3; Min: 2013.28 / Avg: 2018.8 / Max: 2024.76)
Run 2: 2017.10 (SE +/- 2.65, N = 3; Min: 2013.28 / Avg: 2017.1 / Max: 2022.2)
Run 3: 2017.95 (SE +/- 3.48, N = 3; Min: 2013.28 / Avg: 2017.95 / Max: 2024.76)
Run 4: 2012.01 (SE +/- 0.00, N = 3; Min: 2012.01 / Avg: 2012.01 / Max: 2012.01)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
Run 1: 1248.23 (SE +/- 9.83, N = 3; Min: 1238.4 / Avg: 1248.23 / Max: 1267.89)
Run 2: 1235.66 (SE +/- 14.95, N = 5; Min: 1204.78 / Avg: 1235.66 / Max: 1292.5)
Run 3: 1240.44 (SE +/- 8.41, N = 3; Min: 1226.99 / Avg: 1240.44 / Max: 1255.92)
Run 4: 1322.78 (SE +/- 14.50, N = 3; Min: 1305.18 / Avg: 1322.78 / Max: 1351.55)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
Run 1: 2218.80 (SE +/- 0.00, N = 3; Min: 2218.8 / Avg: 2218.8 / Max: 2218.8)
Run 2: 2226.26 (SE +/- 4.57, N = 5; Min: 2218.8 / Avg: 2226.26 / Max: 2237.45)
Run 3: 2225.02 (SE +/- 6.22, N = 3; Min: 2218.8 / Avg: 2225.02 / Max: 2237.45)
Run 4: 2322.04 (SE +/- 6.77, N = 3; Min: 2315.27 / Avg: 2322.04 / Max: 2335.58)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ParaView

This test runs benchmarks of ParaView, an open-source data analytics and visualization application. ParaView describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.

ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better)
Run 1: 396.72 (SE +/- 1.00, N = 3; Min: 395.04 / Avg: 396.72 / Max: 398.51)
Run 2: 396.79 (SE +/- 0.59, N = 3; Min: 395.63 / Avg: 396.79 / Max: 397.52)
Run 3: 395.62 (SE +/- 0.27, N = 3; Min: 395.16 / Avg: 395.62 / Max: 396.11)
Run 4: 396.66 (SE +/- 0.91, N = 3; Min: 395.35 / Avg: 396.66 / Max: 398.42)

ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (MiVoxels / Sec, More Is Better)
Run 1: 377.78 (SE +/- 0.19, N = 3; Min: 377.43 / Avg: 377.78 / Max: 378.1)
Run 2: 378.07 (SE +/- 0.38, N = 3; Min: 377.31 / Avg: 378.07 / Max: 378.52)
Run 3: 377.61 (SE +/- 0.59, N = 3; Min: 376.56 / Avg: 377.61 / Max: 378.58)
Run 4: 377.95 (SE +/- 0.12, N = 3; Min: 377.74 / Avg: 377.95 / Max: 378.16)

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

JPEG XL 0.3.1 - Input: PNG - Encode Speed: 5 (MP/s, More Is Better)
Run 1: 58.66 (SE +/- 0.10, N = 3; Min: 58.5 / Avg: 58.66 / Max: 58.83)
Run 2: 58.42 (SE +/- 0.04, N = 3; Min: 58.35 / Avg: 58.42 / Max: 58.49)
Run 3: 58.38 (SE +/- 0.20, N = 3; Min: 57.99 / Avg: 58.38 / Max: 58.63)
Run 4: 58.42 (SE +/- 0.15, N = 3; Min: 58.24 / Avg: 58.42 / Max: 58.71)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1 - Input: PNG - Encode Speed: 7 (MP/s, More Is Better)
Run 1: 9.72 (SE +/- 0.02, N = 3; Min: 9.68 / Avg: 9.72 / Max: 9.74)
Run 2: 9.68 (SE +/- 0.03, N = 3; Min: 9.64 / Avg: 9.68 / Max: 9.73)
Run 3: 9.65 (SE +/- 0.05, N = 3; Min: 9.56 / Avg: 9.65 / Max: 9.7)
Run 4: 9.65 (SE +/- 0.03, N = 3; Min: 9.59 / Avg: 9.65 / Max: 9.7)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1 - Input: PNG - Encode Speed: 8 (MP/s, More Is Better)
Run 1: 0.86 (SE +/- 0.00, N = 3; Min: 0.86 / Avg: 0.86 / Max: 0.86)
Run 2: 0.87 (SE +/- 0.00, N = 3; Min: 0.86 / Avg: 0.87 / Max: 0.87)
Run 3: 0.87 (SE +/- 0.00, N = 3; Min: 0.87 / Avg: 0.87 / Max: 0.87)
Run 4: 0.87 (SE +/- 0.00, N = 3; Min: 0.87 / Avg: 0.87 / Max: 0.87)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 5 (MP/s, More Is Better)
Run 1: 61.73 (SE +/- 0.23, N = 3; Min: 61.29 / Avg: 61.73 / Max: 62.08)
Run 2: 61.03 (SE +/- 0.47, N = 3; Min: 60.09 / Avg: 61.03 / Max: 61.54)
Run 3: 60.99 (SE +/- 0.30, N = 3; Min: 60.39 / Avg: 60.99 / Max: 61.3)
Run 4: 61.57 (SE +/- 0.19, N = 3; Min: 61.36 / Avg: 61.57 / Max: 61.94)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 7 (MP/s, More Is Better)
Run 1: 61.59 (SE +/- 0.15, N = 3; Min: 61.35 / Avg: 61.59 / Max: 61.86)
Run 2: 61.34 (SE +/- 0.46, N = 3; Min: 60.45 / Avg: 61.34 / Max: 62.02)
Run 3: 61.10 (SE +/- 0.10, N = 3; Min: 60.9 / Avg: 61.1 / Max: 61.2)
Run 4: 61.11 (SE +/- 0.21, N = 3; Min: 60.77 / Avg: 61.11 / Max: 61.5)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 8 (MP/s, More Is Better)
Run 1: 28.80 (SE +/- 0.08, N = 3; Min: 28.66 / Avg: 28.8 / Max: 28.95)
Run 2: 28.68 (SE +/- 0.14, N = 3; Min: 28.41 / Avg: 28.68 / Max: 28.85)
Run 3: 28.58 (SE +/- 0.12, N = 3; Min: 28.39 / Avg: 28.58 / Max: 28.79)
Run 4: 28.55 (SE +/- 0.17, N = 3; Min: 28.23 / Avg: 28.55 / Max: 28.79)
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL Decoding

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile measures JPEG XL decode performance to a PNG output file; the pts/jpegxl test covers encode performance. Learn more via the OpenBenchmarking.org test page.

JPEG XL Decoding 0.3.1 - CPU Threads: 1 (MP/s, More Is Better)
Run 1: 46.20 (SE +/- 0.05, N = 3; Min: 46.14 / Avg: 46.2 / Max: 46.3)
Run 2: 45.97 (SE +/- 0.03, N = 3; Min: 45.92 / Avg: 45.97 / Max: 46.02)
Run 3: 45.84 (SE +/- 0.04, N = 3; Min: 45.79 / Avg: 45.84 / Max: 45.92)
Run 4: 45.80 (SE +/- 0.01, N = 3; Min: 45.78 / Avg: 45.8 / Max: 45.81)

JPEG XL Decoding 0.3.1 - CPU Threads: All (MP/s, More Is Better)
Run 1: 181.53 (SE +/- 0.01, N = 3; Min: 181.51 / Avg: 181.53 / Max: 181.55)
Run 2: 178.47 (SE +/- 0.08, N = 3; Min: 178.38 / Avg: 178.47 / Max: 178.62)
Run 3: 177.78 (SE +/- 0.03, N = 3; Min: 177.72 / Avg: 177.78 / Max: 177.84)
Run 4: 177.79 (SE +/- 0.12, N = 3; Min: 177.66 / Avg: 177.79 / Max: 178.02)

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some previous ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
Run 1: 1967.97 (SE +/- 0.00, N = 3; Min: 1967.97 / Avg: 1967.97 / Max: 1967.97)
Run 2: 1954.98 (SE +/- 6.50, N = 3; Min: 1948.48 / Avg: 1954.98 / Max: 1967.97)
Run 3: 1948.48 (SE +/- 0.00, N = 3; Min: 1948.48 / Avg: 1948.48 / Max: 1948.48)
Run 4: 1948.61 (SE +/- 11.14, N = 3; Min: 1929.38 / Avg: 1948.61 / Max: 1967.97)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
Run 2: 1929.50 (SE +/- 10.92, N = 3; Min: 1910.65 / Avg: 1929.5 / Max: 1948.48)
Run 3: 1920.02 (SE +/- 9.37, N = 2; Min: 1910.65 / Avg: 1920.02 / Max: 1929.38)
Run 4: 1923.14 (SE +/- 6.24, N = 3; Min: 1910.65 / Avg: 1923.14 / Max: 1929.38)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Etcpak

Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Etcpak 0.7 - Configuration: DXT1 (Mpx/s, More Is Better)
Run 1: 1499.71 (SE +/- 3.68, N = 3; Min: 1492.35 / Avg: 1499.71 / Max: 1503.67)
Run 2: 1496.86 (SE +/- 2.90, N = 3; Min: 1493.47 / Avg: 1496.86 / Max: 1502.63)
Run 3: 1495.41 (SE +/- 3.85, N = 3; Min: 1491.33 / Avg: 1495.41 / Max: 1503.1)
Run 4: 1503.95 (SE +/- 0.57, N = 3; Min: 1503.01 / Avg: 1503.95 / Max: 1504.99)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak 0.7 - Configuration: ETC1 (Mpx/s, More Is Better)
Run 1: 374.78 (SE +/- 0.58, N = 3; Min: 373.63 / Avg: 374.78 / Max: 375.53)
Run 2: 373.76 (SE +/- 1.79, N = 3; Min: 370.19 / Avg: 373.76 / Max: 375.64)
Run 3: 374.18 (SE +/- 0.53, N = 3; Min: 373.56 / Avg: 374.18 / Max: 375.24)
Run 4: 375.34 (SE +/- 0.15, N = 3; Min: 375.05 / Avg: 375.34 / Max: 375.55)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak 0.7 - Configuration: ETC2 (Mpx/s, More Is Better)
Run 1: 213.10 (SE +/- 0.02, N = 3; Min: 213.07 / Avg: 213.1 / Max: 213.13)
Run 2: 212.97 (SE +/- 0.04, N = 3; Min: 212.89 / Avg: 212.97 / Max: 213.01)
Run 3: 212.57 (SE +/- 0.42, N = 3; Min: 211.72 / Avg: 212.57 / Max: 213.02)
Run 4: 212.98 (SE +/- 0.00, N = 3; Min: 212.98 / Avg: 212.98 / Max: 212.99)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak 0.7 - Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
Run 1: 357.33 (SE +/- 0.18, N = 3; Min: 357 / Avg: 357.33 / Max: 357.6)
Run 2: 357.52 (SE +/- 0.12, N = 3; Min: 357.4 / Avg: 357.52 / Max: 357.75)
Run 3: 354.93 (SE +/- 2.23, N = 3; Min: 350.48 / Avg: 354.93 / Max: 357.38)
Run 4: 356.91 (SE +/- 0.35, N = 3; Min: 356.23 / Avg: 356.91 / Max: 357.39)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2021 - Input: water_GMX50_bare (Ns Per Day, More Is Better)
Run 1: 0.781 (SE +/- 0.001, N = 3; Min: 0.78 / Avg: 0.78 / Max: 0.78)
Run 2: 0.781 (SE +/- 0.001, N = 3; Min: 0.78 / Avg: 0.78 / Max: 0.78)
Run 3: 0.782 (SE +/- 0.003, N = 3; Min: 0.78 / Avg: 0.78 / Max: 0.79)
Run 4: 0.780 (SE +/- 0.002, N = 3; Min: 0.78 / Avg: 0.78 / Max: 0.78)
1. (CXX) g++ options: -O3 -pthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
Run 1: 5.029 (SE +/- 0.013, N = 3; Min: 5.02 / Avg: 5.03 / Max: 5.06)
Run 2: 5.049 (SE +/- 0.005, N = 3; Min: 5.04 / Avg: 5.05 / Max: 5.06)
Run 3: 5.034 (SE +/- 0.021, N = 3; Min: 4.99 / Avg: 5.03 / Max: 5.06)
Run 4: 4.996 (SE +/- 0.025, N = 3; Min: 4.97 / Avg: 5 / Max: 5.05)
1. (CXX) g++ options: -O3 -pthread -lm

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: LPOP (Requests Per Second, More Is Better)
Run 1: 3323281.83 (SE +/- 19183.06, N = 3; Min: 3285266.75 / Avg: 3323281.83 / Max: 3346773.75)
Run 2: 2085673.46 (SE +/- 11005.15, N = 3; Min: 2064462.5 / Avg: 2085673.46 / Max: 2101369.25)
Run 3: 2070909.25 (SE +/- 32269.04, N = 3; Min: 2007693.25 / Avg: 2070909.25 / Max: 2113772.25)
Run 4: 2109775.00 (SE +/- 8626.43, N = 3; Min: 2098636 / Avg: 2109775 / Max: 2126754.5)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
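Run 1 stands well apart from the other three runs in the LPOP result, by far more than any of the reported standard errors. A minimal sketch of that arithmetic follows, using the averages above. The helper name `percent_above` and the choice of the mean of runs 2 to 4 as the baseline are illustrative assumptions. The page gives no cause for the gap.

```python
from statistics import mean

# Average requests per second for Redis 6.0.9 LPOP, copied from the result above.
lpop = {"1": 3323281.83, "2": 2085673.46, "3": 2070909.25, "4": 2109775.00}


def percent_above(value, baseline):
    """Return how far value sits above baseline, as a percentage."""
    return (value / baseline - 1.0) * 100.0


# Baseline (an assumption for illustration): the mean of runs 2, 3, and 4.
baseline = mean(v for run, v in lpop.items() if run != "1")
gap = percent_above(lpop["1"], baseline)

print(f"Run 1 is {gap:.1f}% above the mean of runs 2-4")
```

The same comparison can be applied to the GET result, where run 1 is also the highest of the four runs.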

Redis 6.0.9 - Test: SADD (Requests Per Second, More Is Better)
Run 1: 2635219.75 (SE +/- 27951.55, N = 3; Min: 2579413 / Avg: 2635219.75 / Max: 2665964.25)
Run 2: 2641940.80 (SE +/- 12039.46, N = 3; Min: 2629511.5 / Avg: 2641940.83 / Max: 2666015.5)
Run 3: 2634387.83 (SE +/- 17037.49, N = 3; Min: 2608972.75 / Avg: 2634387.83 / Max: 2666752)
Run 4: 2607688.02 (SE +/- 28568.59, N = 13; Min: 2279462 / Avg: 2607688.02 / Max: 2671859)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 6.0.9 - Test: LPUSH (Requests Per Second, More Is Better)
Run 1: 2023847.68 (SE +/- 25186.84, N = 5; Min: 1923878.38 / Avg: 2023847.68 / Max: 2057711.88)
Run 2: 2059021.04 (SE +/- 9300.78, N = 3; Min: 2040419.5 / Avg: 2059021.04 / Max: 2068345)
Run 3: 2072916.33 (SE +/- 13361.81, N = 3; Min: 2048380.12 / Avg: 2072916.33 / Max: 2094354.75)
Run 4: 2051125.08 (SE +/- 19842.96, N = 3; Min: 2012888.88 / Avg: 2051125.08 / Max: 2079447.75)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 6.0.9 - Test: GET (Requests Per Second, More Is Better)
Run 1: 3138734.00 (SE +/- 18933.24, N = 3; Min: 3100884.5 / Avg: 3138734 / Max: 3158640.75)
Run 2: 2908151.92 (SE +/- 44568.10, N = 3; Min: 2820874.5 / Avg: 2908151.92 / Max: 2967473)
Run 3: 2927900.67 (SE +/- 15356.90, N = 3; Min: 2900297 / Avg: 2927900.67 / Max: 2953365.75)
Run 4: 2919971.42 (SE +/- 18481.68, N = 3; Min: 2900241.25 / Avg: 2919971.42 / Max: 2956906)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 6.0.9 - Test: SET (Requests Per Second, More Is Better)
Run 1: 2369135.50 (SE +/- 13128.97, N = 3; Min: 2349706.75 / Avg: 2369135.5 / Max: 2394147)
Run 2: 2342227.42 (SE +/- 10631.93, N = 3; Min: 2321262.75 / Avg: 2342227.42 / Max: 2355788)
Run 3: 2377147.50 (SE +/- 12358.77, N = 3; Min: 2353510 / Avg: 2377147.5 / Max: 2395224.75)
Run 4: 2385714.25 (SE +/- 3546.38, N = 3; Min: 2378694.5 / Avg: 2385714.25 / Max: 2390103.25)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

Kripke 1.2.4 (Throughput FoM, More Is Better)
Run 1: 36331390 (SE +/- 116888.91, N = 3; Min: 36180830 / Avg: 36331390 / Max: 36561550)
Run 2: 36317627 (SE +/- 126361.09, N = 3; Min: 36151450 / Avg: 36317626.67 / Max: 36565610)
Run 3: 36175097 (SE +/- 107755.07, N = 3; Min: 35980170 / Avg: 36175096.67 / Max: 36352160)
Run 4: 36141050 (SE +/- 114405.39, N = 3; Min: 35913410 / Avg: 36141050 / Max: 36274890)
1. (CXX) g++ options: -O3 -fopenmp

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
Run 1: 942.53 (SE +/- 0.53, N = 3; Min: 941.67 / Avg: 942.53 / Max: 943.5)
Run 2: 941.53 (SE +/- 2.85, N = 3; Min: 937.51 / Avg: 941.53 / Max: 947.04)
Run 3: 927.39 (SE +/- 13.08, N = 3; Min: 901.97 / Avg: 927.39 / Max: 945.44)
Run 4: 946.58 (SE +/- 1.34, N = 3; Min: 943.92 / Avg: 946.58 / Max: 948.12)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
Run 1: 945.81 (SE +/- 0.11, N = 3; Min: 945.59 / Avg: 945.81 / Max: 945.95)
Run 2: 949.69 (SE +/- 0.61, N = 3; Min: 948.72 / Avg: 949.69 / Max: 950.83)
Run 3: 945.80 (SE +/- 1.74, N = 3; Min: 942.56 / Avg: 945.8 / Max: 948.51)
Run 4: 928.52 (SE +/- 8.82, N = 12; Min: 876.5 / Avg: 928.52 / Max: 950.5)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
Run 1: 24235.45 (SE +/- 10.13, N = 3; Min: 24222.54 / Avg: 24235.45 / Max: 24255.42)
Run 2: 24224.80 (SE +/- 49.20, N = 3; Min: 24127 / Avg: 24224.8 / Max: 24283.13)
Run 3: 24210.11 (SE +/- 8.06, N = 3; Min: 24199.61 / Avg: 24210.11 / Max: 24225.95)
Run 4: 24209.08 (SE +/- 10.80, N = 3; Min: 24188.25 / Avg: 24209.08 / Max: 24224.43)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

VKMark

VKMark is a collection of Vulkan tests/benchmarks. Learn more via the OpenBenchmarking.org test page.

VKMark 2020-05-21 - Resolution: 1920 x 1080 (VKMark Score, More Is Better)
Run 1: 719 (no SE or Min/Avg/Max reported)
Run 2: 711 (SE +/- 2.73, N = 3; Min: 707 / Avg: 710.67 / Max: 716)
Run 3: 707 (SE +/- 1.86, N = 3; Min: 703 / Avg: 706.67 / Max: 709)
Run 4: 709 (SE +/- 1.86, N = 3; Min: 707 / Avg: 709.33 / Max: 713)
1. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, More Is Better)
Run 1: 783.25 (SE +/- 0.34, N = 3; Min: 782.72 / Avg: 783.25 / Max: 783.89)
Run 2: 779.64 (SE +/- 3.25, N = 3; Min: 773.14 / Avg: 779.64 / Max: 782.95)
Run 3: 782.58 (SE +/- 0.39, N = 3; Min: 782 / Avg: 782.58 / Max: 783.33)
Run 4: 782.34 (SE +/- 0.30, N = 3; Min: 781.84 / Avg: 782.34 / Max: 782.87)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

Chaos Group V-RAY 5 - Mode: CPU (vsamples, More Is Better)
Run 1: 7483 (SE +/- 17.10, N = 3; Min: 7455 / Avg: 7483 / Max: 7514)
Run 2: 7462 (SE +/- 28.75, N = 3; Min: 7422 / Avg: 7462.33 / Max: 7518)
Run 3: 7445 (SE +/- 32.98, N = 3; Min: 7384 / Avg: 7445.33 / Max: 7497)
Run 4: 7447 (SE +/- 42.90, N = 3; Min: 7363 / Avg: 7446.67 / Max: 7505)

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3 (z/s, More Is Better)
Run 1: 1585.77 (SE +/- 1.22, N = 3; Min: 1583.49 / Avg: 1585.77 / Max: 1587.66)
Run 2: 1570.80 (SE +/- 0.70, N = 3; Min: 1569.46 / Avg: 1570.8 / Max: 1571.83)
Run 3: 1572.44 (SE +/- 0.47, N = 3; Min: 1571.85 / Avg: 1572.44 / Max: 1573.38)
Run 4: 1572.56 (SE +/- 1.89, N = 3; Min: 1568.85 / Avg: 1572.56 / Max: 1575.02)
1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1 - Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
Run 1: 105.40 (SE +/- 0.04, N = 3; Min: 105.34 / Avg: 105.4 / Max: 105.47)
Run 2: 105.47 (SE +/- 0.03, N = 3; Min: 105.42 / Avg: 105.47 / Max: 105.5)
Run 3: 105.82 (SE +/- 0.12, N = 3; Min: 105.69 / Avg: 105.82 / Max: 106.05)
Run 4: 105.78 (SE +/- 0.05, N = 3; Min: 105.72 / Avg: 105.78 / Max: 105.89)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant 1.0.1 - Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
Run 1: 68.22 (SE +/- 0.05, N = 3; Min: 68.12 / Avg: 68.22 / Max: 68.27)
Run 2: 68.33 (SE +/- 0.05, N = 3; Min: 68.28 / Avg: 68.33 / Max: 68.43)
Run 3: 68.49 (SE +/- 0.06, N = 3; Min: 68.38 / Avg: 68.49 / Max: 68.6)
Run 4: 68.50 (SE +/- 0.10, N = 3; Min: 68.32 / Avg: 68.5 / Max: 68.66)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threading Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

toyBrot Fractal Generator 2020-11-18 - Implementation: OpenMP (ms, Fewer Is Better)
Run 1: 61735 (SE +/- 6.00, N = 3; Min: 61729 / Avg: 61735 / Max: 61747)
Run 2: 61728 (SE +/- 1.67, N = 3; Min: 61726 / Avg: 61727.67 / Max: 61731)
Run 3: 61733 (SE +/- 4.04, N = 3; Min: 61725 / Avg: 61733 / Max: 61738)
Run 4: 61743 (SE +/- 8.11, N = 3; Min: 61728 / Avg: 61742.67 / Max: 61756)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Tasks (ms, Fewer Is Better)
Run 1: 61976 (SE +/- 13.67, N = 3; Min: 61962 / Avg: 61975.67 / Max: 62003)
Run 2: 62100 (SE +/- 100.54, N = 3; Min: 61900 / Avg: 62100 / Max: 62218)
Run 3: 62009 (SE +/- 63.76, N = 3; Min: 61922 / Avg: 62008.67 / Max: 62133)
Run 4: 62105 (SE +/- 124.65, N = 3; Min: 61905 / Avg: 62105.33 / Max: 62334)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Threads (ms, Fewer Is Better)
Run 1: 61838 (SE +/- 13.45, N = 3; Min: 61819 / Avg: 61838 / Max: 61864)
Run 2: 61878 (SE +/- 28.06, N = 3; Min: 61823 / Avg: 61877.67 / Max: 61916)
Run 3: 61884 (SE +/- 42.55, N = 3; Min: 61800 / Avg: 61883.67 / Max: 61939)
Run 4: 61880 (SE +/- 32.42, N = 3; Min: 61844 / Avg: 61880.33 / Max: 61945)
1. (CXX) g++ options: -O3 -lpthread

oneDNN

This test benchmarks Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Run 1: 2230.38 (SE +/- 1.59, N = 3; MIN: 2226.39; Min: 2227.6 / Avg: 2230.38 / Max: 2233.12)
Run 2: 2248.66 (SE +/- 1.21, N = 3; MIN: 2245.47; Min: 2247.17 / Avg: 2248.66 / Max: 2251.06)
Run 1a: 2216.00 (SE +/- 1.62, N = 3; MIN: 2211.59; Min: 2212.76 / Avg: 2216 / Max: 2217.79)
Run 3: 2250.34 (SE +/- 1.92, N = 3; MIN: 2246.36; Min: 2247.87 / Avg: 2250.34 / Max: 2254.13)
Run 4: 2253.55 (SE +/- 3.29, N = 3; MIN: 2246.46; Min: 2248.26 / Avg: 2253.55 / Max: 2259.58)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Run 1: 2223.46 (SE +/- 3.64, N = 3; MIN: 2215.76; Min: 2218.42 / Avg: 2223.46 / Max: 2230.53)
Run 2: 2254.88 (SE +/- 5.08, N = 3; MIN: 2245.82; Min: 2247.43 / Avg: 2254.88 / Max: 2264.59)
Run 1a: 2222.97 (SE +/- 2.04, N = 3; MIN: 2219.12; Min: 2220.86 / Avg: 2222.97 / Max: 2227.06)
Run 3: 2256.42 (SE +/- 3.29, N = 3; MIN: 2250.83; Min: 2252.71 / Avg: 2256.42 / Max: 2262.98)
Run 4: 2254.15 (SE +/- 3.51, N = 3; MIN: 2247.34; Min: 2248.97 / Avg: 2254.15 / Max: 2260.84)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better
Run 1: 2220.83 (SE +/- 5.92, N = 3; MIN: 2210.57; Min: 2213.39 / Avg: 2220.83 / Max: 2232.53)
Run 2: 2255.93 (SE +/- 1.19, N = 3; MIN: 2252; Min: 2253.7 / Avg: 2255.93 / Max: 2257.76)
Run 1a: 2220.33 (SE +/- 4.85, N = 3; MIN: 2210.09; Min: 2211.59 / Avg: 2220.33 / Max: 2228.33)
Run 3: 2253.87 (SE +/- 1.66, N = 3; MIN: 2248.87; Min: 2250.54 / Avg: 2253.87 / Max: 2255.59)
Run 4: 2256.59 (SE +/- 1.03, N = 3; MIN: 2253.05; Min: 2254.52 / Avg: 2256.59 / Max: 2257.68)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.1 - Model: SqueezeNetV1.0
ms, Fewer Is Better
Run 1: 5.508 (SE +/- 0.023, N = 3; MIN: 5.39 / MAX: 7.3; Min: 5.46 / Avg: 5.51 / Max: 5.53)
Run 2: 5.522 (SE +/- 0.014, N = 3; MIN: 5.41 / MAX: 7.94; Min: 5.51 / Avg: 5.52 / Max: 5.55)
Run 3: 5.521 (SE +/- 0.037, N = 3; MIN: 5.38 / MAX: 7.49; Min: 5.46 / Avg: 5.52 / Max: 5.59)
Run 4: 5.527 (SE +/- 0.015, N = 3; MIN: 5.39 / MAX: 23.38; Min: 5.51 / Avg: 5.53 / Max: 5.56)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1 - Model: resnet-v2-50
ms, Fewer Is Better
Run 1: 40.89 (SE +/- 0.06, N = 3; MIN: 40.57 / MAX: 57.96; Min: 40.83 / Avg: 40.89 / Max: 41)
Run 2: 40.64 (SE +/- 0.09, N = 3; MIN: 40.34 / MAX: 57.72; Min: 40.49 / Avg: 40.64 / Max: 40.82)
Run 3: 40.80 (SE +/- 0.09, N = 3; MIN: 40.51 / MAX: 56.52; Min: 40.64 / Avg: 40.8 / Max: 40.92)
Run 4: 40.81 (SE +/- 0.05, N = 3; MIN: 40.64 / MAX: 58.22; Min: 40.74 / Avg: 40.81 / Max: 40.89)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1 - Model: MobileNetV2_224
ms, Fewer Is Better
Run 1: 3.215 (SE +/- 0.012, N = 3; MIN: 3.13 / MAX: 20.95; Min: 3.2 / Avg: 3.22 / Max: 3.24)
Run 2: 3.225 (SE +/- 0.020, N = 3; MIN: 3.1 / MAX: 4.26; Min: 3.2 / Avg: 3.23 / Max: 3.27)
Run 3: 3.173 (SE +/- 0.041, N = 3; MIN: 3.06 / MAX: 7.08; Min: 3.11 / Avg: 3.17 / Max: 3.25)
Run 4: 3.252 (SE +/- 0.025, N = 3; MIN: 3.17 / MAX: 7.8; Min: 3.22 / Avg: 3.25 / Max: 3.3)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1 - Model: mobilenet-v1-1.0
ms, Fewer Is Better
Run 1: 3.833 (SE +/- 0.008, N = 3; MIN: 3.76 / MAX: 6.28; Min: 3.82 / Avg: 3.83 / Max: 3.85)
Run 2: 3.865 (SE +/- 0.010, N = 3; MIN: 3.81 / MAX: 5.88; Min: 3.85 / Avg: 3.86 / Max: 3.88)
Run 3: 3.860 (SE +/- 0.023, N = 3; MIN: 3.77 / MAX: 20.88; Min: 3.81 / Avg: 3.86 / Max: 3.89)
Run 4: 3.850 (SE +/- 0.004, N = 3; MIN: 3.8 / MAX: 4.36; Min: 3.85 / Avg: 3.85 / Max: 3.86)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.1 - Model: inception-v3
ms, Fewer Is Better
Run 1: 43.38 (SE +/- 0.06, N = 3; MIN: 43.06 / MAX: 61.01; Min: 43.27 / Avg: 43.38 / Max: 43.48)
Run 2: 43.14 (SE +/- 0.27, N = 3; MIN: 42.66 / MAX: 61.01; Min: 42.78 / Avg: 43.13 / Max: 43.66)
Run 3: 43.31 (SE +/- 0.09, N = 3; MIN: 42.98 / MAX: 58.83; Min: 43.14 / Avg: 43.31 / Max: 43.46)
Run 4: 43.45 (SE +/- 0.10, N = 3; MIN: 43.05 / MAX: 60.53; Min: 43.32 / Avg: 43.45 / Max: 43.65)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

TNN

TNN is an open-source deep learning inference framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3 - Target: CPU - Model: MobileNet v2
ms, Fewer Is Better
Run 1: 298.63 (SE +/- 0.27, N = 3; MIN: 297.04 / MAX: 300.97; Min: 298.29 / Avg: 298.63 / Max: 299.16)
Run 2: 301.24 (SE +/- 0.93, N = 3; MIN: 298.47 / MAX: 304.26; Min: 300 / Avg: 301.24 / Max: 303.07)
Run 3: 299.74 (SE +/- 0.54, N = 3; MIN: 297.93 / MAX: 302.27; Min: 298.98 / Avg: 299.74 / Max: 300.77)
Run 4: 299.93 (SE +/- 1.01, N = 3; MIN: 297.54 / MAX: 306.69; Min: 298.52 / Avg: 299.93 / Max: 301.88)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1
ms, Fewer Is Better
Run 1: 270.62 (SE +/- 0.03, N = 3; MIN: 270.11 / MAX: 271.33; Min: 270.56 / Avg: 270.62 / Max: 270.65)
Run 2: 270.94 (SE +/- 0.01, N = 3; MIN: 270.24 / MAX: 271.7; Min: 270.93 / Avg: 270.94 / Max: 270.96)
Run 3: 271.02 (SE +/- 0.11, N = 3; MIN: 270.32 / MAX: 271.67; Min: 270.89 / Avg: 271.02 / Max: 271.24)
Run 4: 271.01 (SE +/- 0.08, N = 3; MIN: 270.36 / MAX: 271.8; Min: 270.89 / Avg: 271.01 / Max: 271.17)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

oneDNN

This test benchmarks Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 4.50754 (SE +/- 0.01402, N = 3; MIN: 4.39; Min: 4.49 / Avg: 4.51 / Max: 4.53)
Run 2: 4.50101 (SE +/- 0.00886, N = 3; MIN: 4.37; Min: 4.48 / Avg: 4.5 / Max: 4.51)
Run 3: 4.63368 (SE +/- 0.00530, N = 3; MIN: 4.54; Min: 4.63 / Avg: 4.63 / Max: 4.64)
Run 4: 4.60230 (SE +/- 0.00329, N = 3; MIN: 4.51; Min: 4.6 / Avg: 4.6 / Max: 4.61)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 8.30643 (SE +/- 0.02438, N = 3; MIN: 8.05; Min: 8.26 / Avg: 8.31 / Max: 8.35)
Run 2: 8.27234 (SE +/- 0.01851, N = 3; MIN: 7.96; Min: 8.24 / Avg: 8.27 / Max: 8.3)
Run 3: 10.30470 (SE +/- 0.01907, N = 3; MIN: 10.11; Min: 10.27 / Avg: 10.3 / Max: 10.33)
Run 4: 10.40130 (SE +/- 0.01545, N = 3; MIN: 10.25; Min: 10.37 / Avg: 10.4 / Max: 10.42)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
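The IP Shapes 3D f32 averages differ far more between runs than most results on this page. A minimal sketch for quantifying that spread follows; the helper name and the choice to express the spread relative to the fastest run are illustrative assumptions, and the run labels assume the fused legend "1a234" reads as runs 1a, 2, 3 and 4.

```python
def spread_percent(values):
    """Gap between the slowest and fastest result, as a percentage of the fastest."""
    values = list(values)
    lowest, highest = min(values), max(values)
    return (highest - lowest) / lowest * 100.0

# Average times in ms (fewer is better), copied from the result above.
ip_shapes_3d_f32 = {"1a": 8.30643, "2": 8.27234, "3": 10.30470, "4": 10.40130}

print(f"{spread_percent(ip_shapes_3d_f32.values()):.1f}%")  # 25.7%
```

For comparison, the same calculation on the IP Shapes 1D u8s8f32 averages gives well under 1%.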

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 2.14275 (SE +/- 0.00109, N = 3; MIN: 2.12; Min: 2.14 / Avg: 2.14 / Max: 2.14)
Run 2: 2.14154 (SE +/- 0.00277, N = 3; MIN: 2.12; Min: 2.14 / Avg: 2.14 / Max: 2.15)
Run 3: 2.14220 (SE +/- 0.00203, N = 3; MIN: 2.12; Min: 2.14 / Avg: 2.14 / Max: 2.15)
Run 4: 2.14148 (SE +/- 0.00130, N = 3; MIN: 2.13; Min: 2.14 / Avg: 2.14 / Max: 2.14)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 2.07131 (SE +/- 0.00510, N = 3; MIN: 2.01; Min: 2.07 / Avg: 2.07 / Max: 2.08)
Run 2: 2.10472 (SE +/- 0.02847, N = 3; MIN: 2; Min: 2.06 / Avg: 2.1 / Max: 2.16)
Run 3: 2.18244 (SE +/- 0.00718, N = 3; MIN: 2.13; Min: 2.17 / Avg: 2.18 / Max: 2.2)
Run 4: 2.16042 (SE +/- 0.00374, N = 3; MIN: 2.12; Min: 2.15 / Avg: 2.16 / Max: 2.17)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 20.08 (SE +/- 0.02, N = 3; MIN: 19.98; Min: 20.05 / Avg: 20.08 / Max: 20.11)
Run 2: 20.07 (SE +/- 0.01, N = 3; MIN: 19.96; Min: 20.05 / Avg: 20.07 / Max: 20.1)
Run 3: 20.75 (SE +/- 0.02, N = 3; MIN: 20.67; Min: 20.73 / Avg: 20.75 / Max: 20.8)
Run 4: 20.77 (SE +/- 0.02, N = 3; MIN: 20.69; Min: 20.75 / Avg: 20.77 / Max: 20.8)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 6.36065 (SE +/- 0.02142, N = 3; MIN: 6.27; Min: 6.32 / Avg: 6.36 / Max: 6.39)
Run 2: 6.41691 (SE +/- 0.01672, N = 3; MIN: 6.31; Min: 6.39 / Avg: 6.42 / Max: 6.44)
Run 3: 6.33404 (SE +/- 0.00815, N = 3; MIN: 6.28; Min: 6.32 / Avg: 6.33 / Max: 6.35)
Run 4: 6.34826 (SE +/- 0.01283, N = 3; MIN: 6.29; Min: 6.33 / Avg: 6.35 / Max: 6.37)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 8.93590 (SE +/- 0.00635, N = 3; MIN: 8.89; Min: 8.93 / Avg: 8.94 / Max: 8.95)
Run 2: 8.89112 (SE +/- 0.02003, N = 3; MIN: 8.82; Min: 8.86 / Avg: 8.89 / Max: 8.93)
Run 3: 8.97340 (SE +/- 0.00861, N = 3; MIN: 8.92; Min: 8.96 / Avg: 8.97 / Max: 8.99)
Run 4: 8.98756 (SE +/- 0.01901, N = 3; MIN: 8.91; Min: 8.95 / Avg: 8.99 / Max: 9.01)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 16.84 (SE +/- 0.03, N = 3; MIN: 16.55; Min: 16.79 / Avg: 16.84 / Max: 16.89)
Run 2: 16.76 (SE +/- 0.03, N = 3; MIN: 16.26; Min: 16.7 / Avg: 16.76 / Max: 16.8)
Run 3: 17.88 (SE +/- 0.01, N = 3; MIN: 17.63; Min: 17.86 / Avg: 17.88 / Max: 17.91)
Run 4: 17.86 (SE +/- 0.00, N = 3; MIN: 17.69; Min: 17.86 / Avg: 17.86 / Max: 17.87)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 6.78779 (SE +/- 0.01649, N = 3; MIN: 6.69; Min: 6.76 / Avg: 6.79 / Max: 6.82)
Run 2: 6.84973 (SE +/- 0.06379, N = 3; MIN: 6.72; Min: 6.77 / Avg: 6.85 / Max: 6.98)
Run 3: 6.79490 (SE +/- 0.00909, N = 3; MIN: 6.7; Min: 6.78 / Avg: 6.79 / Max: 6.81)
Run 4: 6.85494 (SE +/- 0.06163, N = 3; MIN: 6.72; Min: 6.77 / Avg: 6.85 / Max: 6.98)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 4.23692 (SE +/- 0.00597, N = 3; MIN: 4.2; Min: 4.23 / Avg: 4.24 / Max: 4.25)
Run 2: 4.23904 (SE +/- 0.00463, N = 3; MIN: 4.2; Min: 4.23 / Avg: 4.24 / Max: 4.25)
Run 3: 4.26266 (SE +/- 0.00176, N = 3; MIN: 4.23; Min: 4.26 / Avg: 4.26 / Max: 4.27)
Run 4: 4.25393 (SE +/- 0.00315, N = 3; MIN: 4.22; Min: 4.25 / Avg: 4.25 / Max: 4.26)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 4024.78 (SE +/- 2.30, N = 3; MIN: 4018.57; Min: 4020.67 / Avg: 4024.78 / Max: 4028.61)
Run 2: 4025.10 (SE +/- 1.81, N = 3; MIN: 4019.74; Min: 4021.62 / Avg: 4025.1 / Max: 4027.73)
Run 3: 4029.87 (SE +/- 1.28, N = 3; MIN: 4025.1; Min: 4027.73 / Avg: 4029.87 / Max: 4032.15)
Run 4: 4032.07 (SE +/- 1.09, N = 3; MIN: 4027.84; Min: 4030.11 / Avg: 4032.07 / Max: 4033.88)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 4025.94 (SE +/- 0.29, N = 3; MIN: 4021.96; Min: 4025.39 / Avg: 4025.94 / Max: 4026.35)
Run 2: 4023.42 (SE +/- 1.97, N = 3; MIN: 4017.95; Min: 4019.83 / Avg: 4023.42 / Max: 4026.61)
Run 3: 4028.19 (SE +/- 2.50, N = 3; MIN: 4022.48; Min: 4024.99 / Avg: 4028.19 / Max: 4033.12)
Run 4: 4028.18 (SE +/- 1.14, N = 3; MIN: 4024.52; Min: 4026.32 / Avg: 4028.18 / Max: 4030.24)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 3.61817 (SE +/- 0.00712, N = 3; MIN: 3.55; Min: 3.6 / Avg: 3.62 / Max: 3.63)
Run 2: 3.61670 (SE +/- 0.02046, N = 3; MIN: 3.52; Min: 3.58 / Avg: 3.62 / Max: 3.65)
Run 3: 3.90950 (SE +/- 0.00680, N = 3; MIN: 3.84; Min: 3.9 / Avg: 3.91 / Max: 3.92)
Run 4: 3.89353 (SE +/- 0.01176, N = 3; MIN: 3.82; Min: 3.87 / Avg: 3.89 / Max: 3.91)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better
Run 1a: 4025.37 (SE +/- 1.13, N = 3; MIN: 4021.62; Min: 4023.24 / Avg: 4025.37 / Max: 4027.11)
Run 2: 4023.67 (SE +/- 2.32, N = 3; MIN: 4017.93; Min: 4020.4 / Avg: 4023.67 / Max: 4028.16)
Run 3: 4030.56 (SE +/- 2.36, N = 3; MIN: 4023.71; Min: 4027.12 / Avg: 4030.56 / Max: 4035.07)
Run 4: 4030.49 (SE +/- 1.40, N = 3; MIN: 4026.5; Min: 4028.89 / Avg: 4030.49 / Max: 4033.28)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Run 1a: 3.93205 (SE +/- 0.00444, N = 3; MIN: 3.89)
Run 2: 3.93088 (SE +/- 0.00431, N = 3; MIN: 3.89)
Run 3: 3.93529 (SE +/- 0.00396, N = 3; MIN: 3.9)
Run 4: 3.93813 (SE +/- 0.00145, N = 3; MIN: 3.91)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1a234246810Min: 3.92 / Avg: 3.93 / Max: 3.94Min: 3.92 / Avg: 3.93 / Max: 3.94Min: 3.93 / Avg: 3.94 / Max: 3.94