HP Zbook

Intel Core i9-10885H testing with a HP 8736 (S91 Ver. 01.02.01 BIOS) and NVIDIA Quadro RTX 5000 with Max-Q Design 16GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101076-HA-HPZBOOK6247

Tests in this result file, by category:

Audio Encoding 2 Tests
AV1 2 Tests
Bioinformatics 2 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
Chess Test Suite 4 Tests
Timed Code Compilation 4 Tests
C/C++ Compiler Tests 13 Tests
Compression Tests 2 Tests
CPU Massive 23 Tests
Creator Workloads 22 Tests
Database Test Suite 3 Tests
Encoding 4 Tests
Fortran Tests 2 Tests
Game Development 4 Tests
HPC - High Performance Computing 19 Tests
Imaging 5 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 12 Tests
Molecular Dynamics 2 Tests
MPI Benchmarks 3 Tests
Multi-Core 22 Tests
NVIDIA GPU Compute 24 Tests
Intel oneAPI 3 Tests
OpenCL 6 Tests
OpenGL Demos Test Suite 2 Tests
OpenMPI Tests 4 Tests
Productivity 2 Tests
Programmer / Developer System Benchmarks 10 Tests
Python Tests 4 Tests
Renderers 2 Tests
Scientific Computing 5 Tests
Server 6 Tests
Server CPU Tests 11 Tests
Single-Threaded 6 Tests
Speech 3 Tests
Telephony 3 Tests
Texture Compression 3 Tests
Unigine Test Suite 2 Tests
Video Encoding 2 Tests
Vulkan Compute 6 Tests
Common Workstation Benchmarks 3 Tests

Run Management

Result Identifier / Date Run / Test Duration
r1 / January 04 2021 / 21 Hours, 19 Minutes
r2 / January 05 2021 / 21 Hours, 8 Minutes
r3 / January 06 2021 / 20 Hours, 49 Minutes
Average test duration: 21 Hours, 5 Minutes
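
The final duration in the run table (21 Hours, 5 Minutes) is consistent with the arithmetic mean of the three run durations. A minimal Python sketch of that check, using only the durations listed above; the helper name `mean_duration` is made up for illustration and is not part of the Phoronix Test Suite:

```python
from datetime import timedelta

# Test durations as listed in the run table above.
durations = {
    "r1": timedelta(hours=21, minutes=19),
    "r2": timedelta(hours=21, minutes=8),
    "r3": timedelta(hours=20, minutes=49),
}


def mean_duration(values):
    """Arithmetic mean of a collection of timedeltas."""
    values = list(values)
    return sum(values, timedelta()) / len(values)


avg = mean_duration(durations.values())
# Truncate to whole minutes, then split into hours and minutes.
total_minutes = int(avg.total_seconds() // 60)
hours, minutes = divmod(total_minutes, 60)
print(f"{hours} Hours, {minutes} Minutes")
```

Truncating to whole minutes is an assumption about how the page rounds; the exact mean is 21 hours, 5 minutes and 20 seconds.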


HP Zbook: System Details (OpenBenchmarking.org, Phoronix Test Suite)

Processor: Intel Core i9-10885H @ 5.30GHz (8 Cores / 16 Threads)
Motherboard: HP 8736 (S91 Ver. 01.02.01 BIOS)
Chipset: Intel Comet Lake PCH
Memory: 32GB
Disk: 2048GB KXG50PNV2T04 KIOXIA
Graphics: NVIDIA Quadro RTX 5000 with Max-Q Design 16GB (600/6000MHz)
Audio: Intel Comet Lake PCH cAVS
Network: Intel Wi-Fi 6 AX201
OS: Ubuntu 20.04
Kernel: 5.6.0-1034-oem (x86_64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: NVIDIA 450.80.02
OpenGL: 4.6.0
OpenCL: OpenCL 1.2 CUDA 11.0.228
Vulkan: 1.2.133
Compiler: GCC 9.3.0 + CUDA 10.1
File-System: ext4
Screen Resolution: 1920x1080

HP Zbook Benchmarks: System Logs

- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- NONE / errors=remount-ro,relatime,rw / Block Size: 4096
- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe0 - Thermald 1.9.1
- GPU Compute Cores: 3072
- Python 3.8.3
- itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.90 - Blend File: Barbershop - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
r2: 1190.05 (SE +/- 0.85, N = 3)
r3: 1192.80 (SE +/- 2.01, N = 3)
r1: 1192.96 (SE +/- 0.44, N = 3)
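
Each result in this file carries a standard error (SE) and a trial count (N). The page does not say how SE is computed; the sketch below assumes the usual definition, the sample standard deviation divided by the square root of N. The trial timings are hypothetical placeholders, not values from this result file, and `standard_error` is an illustrative helper rather than a Phoronix Test Suite function.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-trial timings in seconds (NOT taken from the result file).
trials = [1189.0, 1190.0, 1191.0]
mean = statistics.mean(trials)
se = standard_error(trials)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(trials)})")
```

Under this definition, a small SE relative to the mean indicates that the individual trials of a run agreed closely with each other.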

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 2 + RDO Post-Processing (Seconds, Fewer Is Better)
r2: 840.32 (SE +/- 0.35, N = 3)
r1: 840.35 (SE +/- 0.74, N = 3)
r3: 841.23 (SE +/- 0.62, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Blender

Blender 2.90 - Blend File: Barbershop - Compute: CUDA (Seconds, Fewer Is Better)
r2: 731.67 (SE +/- 0.26, N = 3)
r3: 733.02 (SE +/- 0.41, N = 3)
r1: 734.81 (SE +/- 0.24, N = 3)

Blender 2.90 - Blend File: Pabellon Barcelona - Compute: CUDA (Seconds, Fewer Is Better)
r3: 608.62 (SE +/- 0.06, N = 3)
r1: 608.80 (SE +/- 0.04, N = 3)
r2: 609.56 (SE +/- 0.02, N = 3)

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 2020-09-17 - Model: inception-v3 (ms, Fewer Is Better)
r1: 62.57 (SE +/- 0.15, N = 10; MIN: 60.82 / MAX: 96.05)
r2: 63.18 (SE +/- 0.18, N = 11; MIN: 61.02 / MAX: 104.39)
r3: 63.56 (SE +/- 0.22, N = 10; MIN: 60.92 / MAX: 102.85)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 2020-09-17 - Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
r1: 10.65 (SE +/- 0.01, N = 10; MIN: 10.33 / MAX: 34.53)
r3: 10.66 (SE +/- 0.01, N = 10; MIN: 10.33 / MAX: 32.25)
r2: 10.68 (SE +/- 0.01, N = 11; MIN: 10.35 / MAX: 33.35)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 2020-09-17 - Model: MobileNetV2_224 (ms, Fewer Is Better)
r1: 5.239 (SE +/- 0.210, N = 10; MIN: 3.19 / MAX: 26.27)
r3: 5.285 (SE +/- 0.209, N = 10; MIN: 3.27 / MAX: 26.82)
r2: 5.291 (SE +/- 0.185, N = 11; MIN: 3.3 / MAX: 27.38)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 2020-09-17 - Model: resnet-v2-50 (ms, Fewer Is Better)
r1: 58.16 (SE +/- 0.40, N = 10; MIN: 36.86 / MAX: 81.73)
r2: 58.53 (SE +/- 0.35, N = 11; MIN: 37.33 / MAX: 83.74)
r3: 58.79 (SE +/- 0.40, N = 10; MIN: 36.87 / MAX: 85.77)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 2020-09-17 - Model: SqueezeNetV1.0 (ms, Fewer Is Better)
r1: 8.899 (SE +/- 0.373, N = 10; MIN: 4.96 / MAX: 31.21)
r3: 8.944 (SE +/- 0.373, N = 10; MIN: 5.01 / MAX: 31.89)
r2: 8.982 (SE +/- 0.316, N = 11; MIN: 5.05 / MAX: 31.35)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha 0.1.2 - Device AI Score (Score, More Is Better)
r1: 1546
r3: 1544
r2: 1544

AI Benchmark Alpha 0.1.2 - Device Training Score (Score, More Is Better)
r1: 816
r3: 814
r2: 814

AI Benchmark Alpha 0.1.2 - Device Inference Score (Score, More Is Better)
r3: 730
r2: 730
r1: 730

RedShift Demo

This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

RedShift Demo 3.0 (Seconds, Fewer Is Better)
r3: 459
r2: 460
r1: 461
SE +/- 0.33, N = 3; SE +/- 0.88, N = 3 (only two SE values are listed for the three runs, so they are not assigned to specific runs here)

DDraceNetwork

This is a test of DDraceNetwork, an open-source cooperative platformer. OpenGL 3.3 is used for rendering, with fallbacks for older OpenGL versions. Learn more via the OpenBenchmarking.org test page.

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.0 - Zoom: Default - Demo: RaiNyMore2 (Frames Per Second, More Is Better)
r1: 170.36 (SE +/- 9.09, N = 15; MIN: 2.43 / MAX: 499.5)
r2: 169.30 (SE +/- 9.59, N = 15; MIN: 2.38 / MAX: 499.5)
r3: 151.49 (SE +/- 11.09, N = 15; MIN: 2.37 / MAX: 499.75)
(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Exhaustive (Seconds, Fewer Is Better)
r1: 447.99 (SE +/- 0.52, N = 3)
r2: 449.37 (SE +/- 0.81, N = 3)
r3: 449.90 (SE +/- 0.54, N = 3)
(CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.26 - Backend: OpenCL (Nodes Per Second, More Is Better)
r3: 13416 (SE +/- 44.68, N = 3)
r1: 13277 (SE +/- 160.45, N = 3)
r2: 13173 (SE +/- 176.76, N = 3)
(CXX) g++ options: -flto -pthread

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

BRL-CAD 7.30.8 - VGR Performance Metric (VGR Performance Metric, More Is Better)
r3: 64033
r1: 63909
r2: 63822
(CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

VkFFT

VkFFT is a Fast Fourier Transform (FFT) library that is GPU-accelerated by means of the Vulkan API. The VkFFT benchmark measures FFT performance across many different sizes before returning an overall benchmark score. Learn more via the OpenBenchmarking.org test page.

VkFFT 1.1.1 (Benchmark Score, More Is Better)
r1: 25820 (SE +/- 62.93, N = 3)
r3: 25683 (SE +/- 108.37, N = 3)
r2: 25647 (SE +/- 58.68, N = 3)
(CXX) g++ options: -O3 -pthread

DDraceNetwork

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 (Frames Per Second, More Is Better)
r1: 158.21 (MIN: 7.02 / MAX: 449.03)
r3: 130.66 (MIN: 6.67 / MAX: 498.75)
r2: 100.58 (MIN: 6.72 / MAX: 493.34)
SE +/- 9.86, N = 15; SE +/- 13.14, N = 12 (only two SE values are listed for the three runs, so they are not assigned to specific runs here)
(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2020.3 - Water Benchmark (Ns Per Day, More Is Better)
r1: 0.617 (SE +/- 0.003, N = 3)
r3: 0.614 (SE +/- 0.002, N = 3)
r2: 0.610 (SE +/- 0.004, N = 3)
(CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
r3: 16180674 (SE +/- 142852.80, N = 3)
r1: 15984719 (SE +/- 174263.56, N = 3)
r2: 15974611 (SE +/- 148124.86, N = 3)

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 12 - Total Time (Nodes Per Second, More Is Better)
r2: 9839292 (SE +/- 85742.14, N = 3)
r1: 9703133 (SE +/- 85083.98, N = 8)
r3: 9629353 (SE +/- 67987.28, N = 12)
(CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Unigine Heaven

This test calculates the average frame-rate within the Heaven demo for the Unigine engine. This engine is extremely demanding on the system's graphics card. Learn more via the OpenBenchmarking.org test page.

Unigine Heaven 4.0 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL (Frames Per Second, More Is Better)
r2: 139.91 (SE +/- 0.96, N = 3)
r3: 139.18 (SE +/- 0.56, N = 3)
r1: 139.13 (SE +/- 0.71, N = 3)

Blender

Blender 2.90 - Blend File: Classroom - Compute: CUDA (Seconds, Fewer Is Better)
r1: 250.78 (SE +/- 0.03, N = 3)
r3: 251.80 (SE +/- 0.05, N = 3)
r2: 251.90 (SE +/- 0.04, N = 3)

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1 - Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
r1: 86.08 (SE +/- 0.99, N = 4; MIN: 54.34 / MAX: 256.39)
r3: 85.95 (SE +/- 1.03, N = 4; MIN: 54.21 / MAX: 255.72)
r2: 85.83 (SE +/- 1.05, N = 4; MIN: 54.27 / MAX: 257.58)
(CC) gcc options: -pthread

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.

Build2 0.13 - Time To Compile (Seconds, Fewer Is Better)
r1: 210.05 (SE +/- 0.40, N = 3)
r2: 210.71 (SE +/- 0.49, N = 3)
r3: 210.95 (SE +/- 0.85, N = 3)

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark (Score, More Is Better)
r1: 419.58 (SE +/- 1.54, N = 3)
r2: 419.36 (SE +/- 0.84, N = 3)
r3: 417.03 (SE +/- 0.70, N = 3)

Blender

Blender 2.90 - Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
r1: 196.21 (SE +/- 0.02, N = 3)
r2: 196.28 (SE +/- 0.03, N = 3)
r3: 196.41 (SE +/- 0.08, N = 3)

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient, a new scientific benchmark from Sandia National Labs focused on supercomputer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
r1: 3.96177 (SE +/- 0.00082, N = 3)
r2: 3.96068 (SE +/- 0.00692, N = 3)
r3: 3.95457 (SE +/- 0.01196, N = 3)
(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Unigine Superposition

This test calculates the average frame-rate within the Superposition demo for the Unigine engine, released in 2017. This engine is extremely demanding on the system's graphics card. Learn more via the OpenBenchmarking.org test page.

Unigine Superposition 1.0 - Resolution: 1920 x 1080 - Mode: Fullscreen - Quality: Ultra - Renderer: OpenGL (Frames Per Second, More Is Better)
r2: 25.4 (SE +/- 0.03, N = 3; MAX: 29.4)
r3: 25.3 (SE +/- 0.03, N = 3; MAX: 29.7)
r1: 25.1 (SE +/- 0.06, N = 3; MAX: 29.3)

Unigine Superposition 1.0 - Resolution: 1920 x 1080 - Mode: Fullscreen - Quality: High - Renderer: OpenGL (Frames Per Second, More Is Better)
r2: 66.5 (SE +/- 0.12, N = 3; MAX: 80.8)
r3: 66.2 (SE +/- 0.09, N = 3; MAX: 80.3)
r1: 65.9 (SE +/- 0.19, N = 3; MAX: 81.6)

Unigine Superposition 1.0 - Resolution: 1920 x 1080 - Mode: Fullscreen - Quality: Medium - Renderer: OpenGL (Frames Per Second, More Is Better)
r2: 90.6 (SE +/- 0.15, N = 3; MAX: 114.4)
r3: 90.5 (SE +/- 0.15, N = 3; MAX: 113)
r1: 90.4 (SE +/- 0.15, N = 3; MAX: 114.5)

Unigine Superposition 1.0 - Resolution: 1920 x 1080 - Mode: Fullscreen - Quality: Low - Renderer: OpenGL (Frames Per Second, More Is Better)
r2: 178.1 (SE +/- 0.71, N = 3; MAX: 259.4)
r1: 177.7 (SE +/- 0.23, N = 3; MAX: 260.1)
r3: 177.4 (SE +/- 0.52, N = 3; MAX: 263.9)

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33 - Operation: Swirl (Iterations Per Minute, More Is Better)
r3: 207 (SE +/- 1.72, N = 8)
r2: 207 (SE +/- 1.60, N = 10)
r1: 207 (SE +/- 1.72, N = 8)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

CLOMP 1.2 - Static OMP Speedup (Speedup, More Is Better)
r1: 3.7 (SE +/- 0.03, N = 3)
r3: 3.6 (SE +/- 0.03, N = 15)
r2: 2.5 (SE +/- 0.03, N = 15)
(CC) gcc options: -fopenmp -O3 -lm
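
Most tests in this file agree closely across r1, r2 and r3, but CLOMP is one of the outliers. A minimal Python sketch for quantifying run-to-run spread, using the CLOMP and HPCG values reported in this file; the function name and the choice of metric (gap between best and worst run as a percentage of the best) are illustrative assumptions, not something the Phoronix Test Suite defines:

```python
def percent_spread(results):
    """Gap between best and worst run, as a percentage of the best run.

    Assumes a 'More Is Better' metric, so the best run is the maximum.
    """
    best = max(results.values())
    worst = min(results.values())
    return (best - worst) / best * 100.0


# Values as reported in this result file.
clomp = {"r1": 3.7, "r3": 3.6, "r2": 2.5}
hpcg = {"r1": 3.96177, "r2": 3.96068, "r3": 3.95457}

clomp_spread = percent_spread(clomp)
hpcg_spread = percent_spread(hpcg)
print(f"CLOMP spread: {clomp_spread:.1f}%")
print(f"HPCG spread:  {hpcg_spread:.2f}%")
```

By this measure the CLOMP runs differ by roughly a third, while the HPCG runs differ by well under one percent.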

Blender

Blender 2.90 - Blend File: Fishy Cat - Compute: CUDA (Seconds, Fewer Is Better)
r2: 167.96 (SE +/- 0.11, N = 3)
r3: 168.08 (SE +/- 0.05, N = 3)
r1: 168.87 (SE +/- 0.10, N = 3)

Warsow

This is a benchmark of Warsow, a popular open-source first-person shooter. This game uses the QFusion engine. Learn more via the OpenBenchmarking.org test page.

Warsow 2.5 Beta - Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
r3: 968.6 (SE +/- 1.81, N = 3)
r2: 967.9 (SE +/- 1.46, N = 3)
r1: 955.6 (SE +/- 13.76, N = 12)

LuxCoreRender OpenCL

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on OpenCL accelerators/GPUs. The alternative luxcorerender test profile is for CPU execution due to a difference in tests, etc. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender OpenCL 2.3 - Scene: LuxCore Benchmark (M samples/sec, More Is Better)
r2: 2.31 (SE +/- 0.01, N = 3; MIN: 0.27 / MAX: 2.63)
r3: 2.29 (SE +/- 0.01, N = 3; MIN: 0.27 / MAX: 2.64)
r1: 2.26 (SE +/- 0.04, N = 12; MIN: 0.14 / MAX: 2.63)

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2020-08-23 - Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
r1: 4660197 (SE +/- 8775.31, N = 3)
r2: 4670567 (SE +/- 8796.49, N = 3)
r3: 4677473 (SE +/- 8398.83, N = 3)

TensorFlow Lite 2020-08-23 - Model: Inception V4 (Microseconds, Fewer Is Better)
r1: 5163190 (SE +/- 5618.75, N = 3)
r2: 5168183 (SE +/- 7685.69, N = 3)
r3: 5178263 (SE +/- 8609.77, N = 3)

LuxCoreRender OpenCL

LuxCoreRender OpenCL 2.3 - Scene: Food (M samples/sec, More Is Better)
r2: 1.32 (SE +/- 0.01, N = 3; MIN: 0.29 / MAX: 1.57)
r3: 1.30 (SE +/- 0.02, N = 3; MIN: 0.26 / MAX: 1.57)
r1: 1.27 (SE +/- 0.04, N = 12; MIN: 0.13 / MAX: 1.57)

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.4 - Time To Compile (Seconds, Fewer Is Better)
r3: 151.48 (SE +/- 0.75, N = 3)
r1: 151.66 (SE +/- 0.33, N = 3)
r2: 152.21 (SE +/- 0.24, N = 3)

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OctaneBench 2020.1 - Total Score (Score, More Is Better)
r3: 189.32
r2: 189.10
r1: 189.09

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Person Detection 0106 FP32 - Device: CPU (ms, Fewer Is Better)
r1: 5069.44 (SE +/- 15.43, N = 3)
r3: 5073.09 (SE +/- 14.45, N = 5)
r2: 5079.89 (SE +/- 9.68, N = 9)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Person Detection 0106 FP32 - Device: CPU (FPS, More Is Better)
r3: 0.79 (SE +/- 0.01, N = 5)
r2: 0.79 (SE +/- 0.01, N = 9)
r1: 0.79 (SE +/- 0.01, N = 3)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

dav1d

dav1d 0.8.1 - Video Input: Chimera 1080p (FPS, More Is Better)
r1: 489.84 (SE +/- 5.73, N = 14; MIN: 317.1 / MAX: 898.12)
r3: 487.57 (SE +/- 3.24, N = 13; MIN: 316.7 / MAX: 911.47)
r2: 486.46 (SE +/- 3.02, N = 14; MIN: 316.37 / MAX: 900.57)
(CC) gcc options: -pthread

LuxCoreRender OpenCL

LuxCoreRender OpenCL 2.3 - Scene: DLSC (M samples/sec, More Is Better)
r2: 2.77 (SE +/- 0.00, N = 3; MIN: 2.57 / MAX: 2.84)
r3: 2.76 (SE +/- 0.00, N = 3; MIN: 2.56 / MAX: 2.84)
r1: 2.70 (SE +/- 0.06, N = 12; MIN: 0.69 / MAX: 2.81)

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
r1: 7.0735 (SE +/- 0.0830, N = 3; MIN: 6.66 / MAX: 12.73)
r3: 6.9976 (SE +/- 0.0756, N = 5; MIN: 6.56 / MAX: 12.56)
r2: 6.9794 (SE +/- 0.0728, N = 4; MIN: 6.57 / MAX: 12.32)

Blender

Blender 2.90 - Blend File: Classroom - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
r2: 116.15 (SE +/- 0.23, N = 3)
r3: 116.26 (SE +/- 0.13, N = 3)
r1: 116.76 (SE +/- 0.13, N = 3)

Basis Universal

Basis Universal 1.12 - Settings: UASTC Level 3 (Seconds, Fewer Is Better)
r1: 110.84 (SE +/- 0.55, N = 3)
r2: 110.93 (SE +/- 0.55, N = 3)
r3: 111.04 (SE +/- 0.53, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

FAHBench 2.3.2 (Ns Per Day, More Is Better)
r3: 186.62 (SE +/- 0.11, N = 3)
r2: 186.48 (SE +/- 0.14, N = 3)
r1: 186.46 (SE +/- 0.23, N = 3)

Timed HMMer Search

This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of the Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Timed HMMer Search 3.3.1 - Pfam Database Search (Seconds, Fewer Is Better)
r3: 105.51 (SE +/- 0.02, N = 3)
r1: 105.53 (SE +/- 0.06, N = 3)
r2: 105.57 (SE +/- 0.04, N = 3)
(CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Embree

Embree 3.9.0 - Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better)
r2: 6.0989 (SE +/- 0.0737, N = 3; MIN: 5.88 / MAX: 10.98)
r1: 6.0806 (SE +/- 0.0766, N = 3; MIN: 5.86 / MAX: 11.02)
r3: 6.0641 (SE +/- 0.0667, N = 3; MIN: 5.86 / MAX: 10.95)

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

RealSR-NCNN 20200818 - Scale: 4x - TAA: Yes (Seconds, Fewer Is Better)
r1: 99.81 (SE +/- 0.31, N = 3)
r2: 100.62 (SE +/- 0.48, N = 3)
r3: 100.75 (SE +/- 0.35, N = 3)

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

Timed FFmpeg Compilation 4.2.2 - Time To Compile (Seconds, Fewer Is Better)
r3: 100.20 (SE +/- 0.30, N = 3)
r1: 100.26 (SE +/- 0.78, N = 3)
r2: 100.40 (SE +/- 0.39, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
r1: 7140.50 (SE +/- 2.95, N = 3; MIN: 7021.68)
r3: 7151.58 (SE +/- 6.73, N = 3; MIN: 7027.2)
r2: 7159.42 (SE +/- 4.70, N = 3; MIN: 7041.4)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
r1: 7155.41 (SE +/- 12.55, N = 3; MIN: 7025.22)
r2: 7159.48 (SE +/- 1.75, N = 3; MIN: 7040.61)
r3: 7169.03 (SE +/- 6.55, N = 3; MIN: 7046.49)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
r1: 7144.23 (SE +/- 3.89, N = 3; MIN: 7028.46)
r3: 7147.09 (SE +/- 2.23, N = 3; MIN: 7033.98)
r2: 7154.66 (SE +/- 0.92, N = 3; MIN: 7035.88)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Blender

Blender 2.90 - Blend File: BMW27 - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
r2: 38.07 (SE +/- 0.02, N = 3)
r3: 38.07 (SE +/- 0.05, N = 3)
r1: 41.47 (SE +/- 3.33, N = 15)

Blender 2.90 - Blend File: BMW27 - Compute: CUDA (Seconds, Fewer Is Better)
r2: 90.82 (SE +/- 0.16, N = 3)
r3: 90.93 (SE +/- 0.10, N = 3)
r1: 91.00 (SE +/- 0.14, N = 3)
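
Four Blender scenes in this file were rendered with both the CUDA and the NVIDIA OptiX back ends, so the two can be compared directly. A minimal Python sketch, using the render times reported in this file and averaging the three runs per scene; the averaging choice and the helper name `cuda_over_optix` are illustrative assumptions, not part of the Phoronix Test Suite:

```python
from statistics import mean

# Render times in seconds (r1, r2, r3 in any order), as reported in this file.
blender_seconds = {
    "Barbershop": {"CUDA": [731.67, 733.02, 734.81],
                   "OptiX": [1190.05, 1192.80, 1192.96]},
    "Pabellon Barcelona": {"CUDA": [608.62, 608.80, 609.56],
                           "OptiX": [196.21, 196.28, 196.41]},
    "Classroom": {"CUDA": [250.78, 251.80, 251.90],
                  "OptiX": [116.15, 116.26, 116.76]},
    "BMW27": {"CUDA": [90.82, 90.93, 91.00],
              "OptiX": [38.07, 38.07, 41.47]},
}


def cuda_over_optix(times):
    """Mean CUDA time divided by mean OptiX time (>1 means OptiX was faster)."""
    return mean(times["CUDA"]) / mean(times["OptiX"])


ratios = {scene: cuda_over_optix(t) for scene, t in blender_seconds.items()}
for scene, ratio in ratios.items():
    print(f"{scene}: CUDA/OptiX = {ratio:.2f}")
```

A ratio above 1 means OptiX finished sooner. With these numbers that holds for Pabellon Barcelona, Classroom and BMW27, while Barbershop is the exception where CUDA was faster.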

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.5 - Compression Level: 19 (MB/s, More Is Better)
r3: 28.8 (SE +/- 0.06, N = 3)
r1: 28.8 (SE +/- 0.07, N = 3)
r2: 28.7 (SE +/- 0.06, N = 3)
(CC) gcc options: -O3 -pthread -lz -llzma

GEGL

GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.

GEGL - Operation: Cartoon (Seconds, Fewer Is Better)
r1: 86.79 (SE +/- 0.12, N = 3)
r3: 86.99 (SE +/- 0.09, N = 3)
r2: 87.32 (SE +/- 0.19, N = 3)

OpenVINO

OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (ms, Fewer Is Better)
r1: 1.17 (SE +/- 0.00, N = 3)
r2: 1.19 (SE +/- 0.00, N = 4)
r3: 1.19 (SE +/- 0.00, N = 6)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (FPS, More Is Better)
r1: 3442.78 (SE +/- 33.67, N = 3)
r3: 3405.92 (SE +/- 34.05, N = 6)
r2: 3403.45 (SE +/- 38.35, N = 4)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
r1: 3795.02 (SE +/- 2.45, N = 3; MIN: 3682.24)
r2: 3797.05 (SE +/- 2.65, N = 3; MIN: 3673.18)
r3: 3797.72 (SE +/- 3.77, N = 3; MIN: 3684.19)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
r1: 3795.81 (SE +/- 6.76, N = 3; MIN: 3687.23)
r3: 3798.12 (SE +/- 3.22, N = 3; MIN: 3685.27)
r2: 3800.41 (SE +/- 4.34, N = 3; MIN: 3681.23)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
r3: 3792.87 (SE +/- 1.33, N = 3; MIN: 3672.83)
r1: 3797.32 (SE +/- 1.61, N = 3; MIN: 3686.53)
r2: 3799.45 (SE +/- 1.20, N = 3; MIN: 3692.97)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
r2: 18.91 (SE +/- 0.24, N = 3; MIN: 13.5 / MAX: 30.63)
r1: 19.16 (SE +/- 0.06, N = 3; MIN: 18.07 / MAX: 22.36)
r3: 19.38 (SE +/- 0.10, N = 3; MIN: 14.45 / MAX: 42.2)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
r2: 27.51 (SE +/- 0.03, N = 3; MIN: 26.93 / MAX: 43.6)
r3: 27.63 (SE +/- 0.05, N = 3; MIN: 27.02 / MAX: 46.56)
r1: 27.64 (SE +/- 0.14, N = 3; MIN: 27 / MAX: 40.14)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
r2: 35.59 (SE +/- 0.05, N = 3; MIN: 34.42 / MAX: 51.24)
r3: 35.66 (SE +/- 0.02, N = 3; MIN: 34.45 / MAX: 49.15)
r1: 35.95 (SE +/- 0.48, N = 3; MIN: 34.4 / MAX: 55.63)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: resnet50 (ms, Fewer Is Better)
r3: 37.22 (SE +/- 0.05, N = 3; MIN: 33.9 / MAX: 52.84)
r2: 37.30 (SE +/- 0.03, N = 3; MIN: 33.91 / MAX: 56.28)
r1: 37.81 (SE +/- 0.51, N = 3; MIN: 34.04 / MAX: 52.8)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
r2: 15.46 (SE +/- 0.04, N = 3; MIN: 14.35 / MAX: 27.24)
r3: 15.49 (SE +/- 0.03, N = 3; MIN: 14.41 / MAX: 24.83)
r1: 15.50 (SE +/- 0.08, N = 3; MIN: 14.41 / MAX: 55.15)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
r1: 18.62 (SE +/- 0.02, N = 3; MIN: 17.08 / MAX: 32.57)
r3: 18.66 (SE +/- 0.02, N = 3; MIN: 17.05 / MAX: 30.94)
r2: 18.71 (SE +/- 0.04, N = 3; MIN: 17.06 / MAX: 33.58)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: vgg16 (ms, Fewer Is Better)
r3: 71.86 (SE +/- 0.12, N = 3; MIN: 70.48 / MAX: 88)
r2: 71.91 (SE +/- 0.03, N = 3; MIN: 70.43 / MAX: 92.47)
r1: 72.09 (SE +/- 0.20, N = 3; MIN: 70.5 / MAX: 88.28)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: googlenet (ms, Fewer Is Better)
r1: 19.98 (SE +/- 0.01, N = 3; MIN: 18.95 / MAX: 23.24)
r2: 20.01 (SE +/- 0.08, N = 3; MIN: 18.96 / MAX: 24.67)
r3: 20.21 (SE +/- 0.06, N = 3; MIN: 19.11 / MAX: 32.7)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: blazeface (ms, Fewer Is Better)
r1: 2.54 (SE +/- 0.00, N = 3; MIN: 2.35 / MAX: 2.74)
r3: 2.57 (SE +/- 0.02, N = 3; MIN: 2.45 / MAX: 2.83)
r2: 2.60 (SE +/- 0.05, N = 3; MIN: 2.45 / MAX: 10.37)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
r2: 9.05 (SE +/- 0.96, N = 3; MIN: 6.99 / MAX: 21.76)
r3: 9.06 (SE +/- 0.96, N = 3; MIN: 7.04 / MAX: 12.38)
r1: 10.00 (SE +/- 0.05, N = 3; MIN: 9.46 / MAX: 24.32)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: mnasnet (ms, Fewer Is Better)
r2: 5.96 (SE +/- 0.75, N = 3; MIN: 4.32 / MAX: 14.32)
r3: 5.96 (SE +/- 0.74, N = 3; MIN: 4.33 / MAX: 28.21)
r1: 6.67 (SE +/- 0.02, N = 3; MIN: 5.99 / MAX: 21.18)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
r2: 6.95 (SE +/- 0.94, N = 3; MIN: 5.01 / MAX: 9.68)
r3: 7.03 (SE +/- 0.95, N = 3; MIN: 5.04 / MAX: 20.64)
r1: 7.93 (SE +/- 0.03, N = 3; MIN: 7.52 / MAX: 16.61)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
r1: 5.74 (SE +/- 0.65, N = 3; MIN: 4.3 / MAX: 7.75)
r2: 5.81 (SE +/- 0.65, N = 3; MIN: 4.43 / MAX: 17.76)
r3: 5.81 (SE +/- 0.62, N = 3; MIN: 4.48 / MAX: 10.59)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
r2: 7.22 (SE +/- 0.73, N = 3; MIN: 5.54 / MAX: 12.03)
r3: 7.23 (SE +/- 0.73, N = 3; MIN: 5.55 / MAX: 12.3)
r1: 7.31 (SE +/- 0.67, N = 3; MIN: 5.51 / MAX: 16.43)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: CPU - Model: mobilenet (ms, Fewer Is Better)
r3: 26.53 (SE +/- 0.02, N = 3; MIN: 25.78 / MAX: 41.25)
r1: 26.62 (SE +/- 0.17, N = 3; MIN: 25.69 / MAX: 38.05)
r2: 26.63 (SE +/- 0.01, N = 3; MIN: 25.7 / MAX: 41.21)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: regnety_400m (ms, Fewer Is Better)
r2: 17.15 (SE +/- 1.83, N = 3; MIN: 13.3 / MAX: 38.12)
r3: 17.60 (SE +/- 1.77, N = 3; MIN: 13.79 / MAX: 32.97)
r1: 19.16 (SE +/- 0.09, N = 3; MIN: 17.94 / MAX: 21.24)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: squeezenet_ssd (ms, Fewer Is Better)
r2: 27.52 (SE +/- 0.02, N = 3; MIN: 26.95 / MAX: 42.6)
r3: 27.55 (SE +/- 0.02, N = 3; MIN: 26.92 / MAX: 41.99)
r1: 27.58 (SE +/- 0.02, N = 3; MIN: 26.94 / MAX: 43.23)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
r2: 35.51 (SE +/- 0.01, N = 3; MIN: 33.05 / MAX: 50.05)
r1: 35.52 (SE +/- 0.03, N = 3; MIN: 34.38 / MAX: 51.44)
r3: 35.53 (SE +/- 0.05, N = 3; MIN: 32.99 / MAX: 52.01)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
r1: 37.25 (SE +/- 0.03, N = 3; MIN: 34.07 / MAX: 48.19)
r3: 37.26 (SE +/- 0.01, N = 3; MIN: 33.79 / MAX: 52.48)
r2: 37.34 (SE +/- 0.06, N = 3; MIN: 33.97 / MAX: 56.32)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
r1: 15.44 (SE +/- 0.03, N = 3; MIN: 14.41 / MAX: 26.42)
r3: 15.50 (SE +/- 0.05, N = 3; MIN: 14.41 / MAX: 26.23)
r2: 15.53 (SE +/- 0.04, N = 3; MIN: 14.41 / MAX: 25.62)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
r2: 18.33 (SE +/- 0.34, N = 3; MIN: 14.43 / MAX: 32.39)
r3: 18.38 (SE +/- 0.27, N = 3; MIN: 14.4 / MAX: 32.57)
r1: 18.62 (SE +/- 0.00, N = 3; MIN: 17.13 / MAX: 20.97)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
r2: 71.82 (SE +/- 0.04, N = 3; MIN: 70.37 / MAX: 86.67)
r3: 71.86 (SE +/- 0.02, N = 3; MIN: 70.4 / MAX: 88.5)
r1: 71.96 (SE +/- 0.05, N = 3; MIN: 70.52 / MAX: 88.3)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
r2: 18.20 (SE +/- 1.77, N = 3; MIN: 14.26 / MAX: 31.74)
r3: 18.26 (SE +/- 1.84, N = 3; MIN: 14.28 / MAX: 36.09)
r1: 20.05 (SE +/- 0.06, N = 3; MIN: 18.94 / MAX: 32.96)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
r2: 2.29 (SE +/- 0.26, N = 3; MIN: 1.68 / MAX: 8.91)
r3: 2.29 (SE +/- 0.25, N = 3; MIN: 1.69 / MAX: 12.73)
r1: 2.55 (SE +/- 0.02, N = 3; MIN: 2.43 / MAX: 2.76)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
r3: 8.99 (SE +/- 0.94, N = 3; MIN: 6.99 / MAX: 13.79)
r2: 9.02 (SE +/- 0.95, N = 3; MIN: 7 / MAX: 19.29)
r1: 10.01 (SE +/- 0.10, N = 3; MIN: 9.44 / MAX: 29.57)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
r2: 5.86 (SE +/- 0.71, N = 3; MIN: 4.3 / MAX: 15.47)
r3: 5.91 (SE +/- 0.76, N = 3; MIN: 4.32 / MAX: 7.94)
r1: 6.63 (SE +/- 0.00, N = 3; MIN: 6.21 / MAX: 8.85)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
r2: 6.98 (SE +/- 0.96, N = 3; MIN: 4.98 / MAX: 27.09)
r3: 7.05 (SE +/- 0.93, N = 3; MIN: 5.04 / MAX: 20.37)
r1: 7.92 (SE +/- 0.07, N = 3; MIN: 7.27 / MAX: 20.3)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
r2: 5.73 (SE +/- 0.65, N = 3; MIN: 4.33 / MAX: 10.47)
r1: 5.74 (SE +/- 0.62, N = 3; MIN: 4.43 / MAX: 9.64)
r3: 5.81 (SE +/- 0.64, N = 3; MIN: 4.41 / MAX: 25.12)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
r3: 7.19 (SE +/- 0.73, N = 3; MIN: 5.52 / MAX: 9.67)
r2: 7.22 (SE +/- 0.79, N = 3; MIN: 5.41 / MAX: 20.72)
r1: 7.23 (SE +/- 0.74, N = 3; MIN: 5.54 / MAX: 9.59)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20201218 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
r3: 26.51 (SE +/- 0.07, N = 3; MIN: 25.69 / MAX: 45.35)
r1: 26.52 (SE +/- 0.02, N = 3; MIN: 25.69 / MAX: 43.81)
r2: 26.53 (SE +/- 0.02, N = 3; MIN: 25.76 / MAX: 43.91)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
r2: 7.5656 (SE +/- 0.0719, N = 3; MIN: 7.18 / MAX: 12.51)
r1: 7.5555 (SE +/- 0.0643, N = 3; MIN: 7.18 / MAX: 12.55)
r3: 7.5496 (SE +/- 0.0754, N = 3; MIN: 7.19 / MAX: 12.66)

RawTherapee

RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.

RawTherapee - Total Benchmark Time (Seconds, Fewer Is Better)
r1: 80.59 (SE +/- 0.53, N = 3)
r3: 80.71 (SE +/- 0.45, N = 3)
r2: 80.93 (SE +/- 0.46, N = 3)
RawTherapee, version 5.8, command line.

OpenVINO

This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU (ms, Fewer Is Better)
r1: 1.21 (SE +/- 0.00, N = 3)
r3: 1.22 (SE +/- 0.00, N = 4)
r2: 1.23 (SE +/- 0.00, N = 5)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU (FPS, More Is Better)
r1: 3363.55 (SE +/- 35.01, N = 3)
r3: 3347.93 (SE +/- 40.89, N = 4)
r2: 3307.53 (SE +/- 33.23, N = 5)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Face Detection 0106 FP32 - Device: CPU (ms, Fewer Is Better)
r1: 3202.53 (SE +/- 2.58, N = 3)
r2: 3207.35 (SE +/- 1.22, N = 4)
r3: 3212.10 (SE +/- 2.51, N = 3)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Face Detection 0106 FP32 - Device: CPU (FPS, More Is Better)
r3: 1.27 (SE +/- 0.02, N = 3)
r2: 1.27 (SE +/- 0.02, N = 4)
r1: 1.26 (SE +/- 0.01, N = 3)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1 - Video Input: Summer Nature 4K (FPS, More Is Better)
r1: 112.75 (SE +/- 1.06, N = 6; MIN: 99.69 / MAX: 158.99)
r3: 112.65 (SE +/- 1.07, N = 6; MIN: 99.62 / MAX: 158.58)
r2: 112.03 (SE +/- 1.08, N = 6; MIN: 99.17 / MAX: 157.08)
(CC) gcc options: -pthread

OpenVINO

This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Person Detection 0106 FP16 - Device: CPU (ms, Fewer Is Better)
r1: 4961.99 (SE +/- 4.97, N = 3)
r2: 4978.25 (SE +/- 19.24, N = 3)
r3: 5006.34 (SE +/- 4.20, N = 3)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Person Detection 0106 FP16 - Device: CPU (FPS, More Is Better)
r3: 0.80 (SE +/- 0.01, N = 3)
r2: 0.80 (SE +/- 0.01, N = 3)
r1: 0.80 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

Timed Eigen Compilation 3.3.9 - Time To Compile (Seconds, Fewer Is Better)
r2: 67.54 (SE +/- 0.30, N = 3)
r3: 68.70 (SE +/- 0.22, N = 3)
r1: 68.74 (SE +/- 0.16, N = 3)

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
r3: 9695.2 (SE +/- 0.78, N = 3)
r1: 9679.8 (SE +/- 1.80, N = 5)
r2: 9664.8 (SE +/- 15.38, N = 3)
(CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 9 - Compression Speed (MB/s, More Is Better)
r3: 57.01 (SE +/- 0.66, N = 3)
r2: 56.07 (SE +/- 0.36, N = 3)
r1: 55.72 (SE +/- 0.59, N = 5)
(CC) gcc options: -O3

OpenVINO

This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Face Detection 0106 FP16 - Device: CPU (ms, Fewer Is Better)
r3: 3164.51 (SE +/- 7.78, N = 3)
r1: 3165.24 (SE +/- 4.35, N = 3)
r2: 3166.57 (SE +/- 3.88, N = 3)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Face Detection 0106 FP16 - Device: CPU (FPS, More Is Better)
r3: 1.28 (SE +/- 0.01, N = 3)
r2: 1.28 (SE +/- 0.01, N = 3)
r1: 1.28 (SE +/- 0.01, N = 3)
(CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
r2: 9.2596 (SE +/- 0.0236, N = 3; MIN: 8.82 / MAX: 14.99)
r3: 9.1967 (SE +/- 0.1308, N = 3; MIN: 8.85 / MAX: 15)
r1: 9.1343 (SE +/- 0.0822, N = 3; MIN: 8.81 / MAX: 15.06)

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
r3: 9685.2 (SE +/- 0.67, N = 3)
r1: 9676.3 (SE +/- 1.84, N = 5)
r2: 9653.7 (SE +/- 16.28, N = 3)
(CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 3 - Compression Speed (MB/s, More Is Better)
r3: 58.89 (SE +/- 0.48, N = 3)
r1: 57.88 (SE +/- 0.61, N = 5)
r2: 57.36 (SE +/- 0.58, N = 3)
(CC) gcc options: -O3

Node.js V8 Web Tooling Benchmark

This test runs the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers, such as Babel, TypeScript, and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

Node.js V8 Web Tooling Benchmark (runs/s, More Is Better)
r3: 13.18 (SE +/- 0.11, N = 3)
r2: 13.17 (SE +/- 0.11, N = 3)
r1: 13.06 (SE +/- 0.14, N = 3)
Nodejs v10.19.0

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1 - Throughput Test: Kostya (GB/s, More Is Better)
r1: 0.76 (SE +/- 0.00, N = 3)
r3: 0.75 (SE +/- 0.00, N = 3)
r2: 0.75 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

DDraceNetwork

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.0 - Zoom: Default - Demo: Multeasymap - Total Frame Time (Milliseconds, Fewer Is Better)
r3: Min: 2 / Avg: 2.39 / Max: 7.28
r1: Min: 2 / Avg: 2.43 / Max: 6.55
r2: Min: 2 / Avg: 2.46 / Max: 6.5
(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.0 - Zoom: Default - Demo: Multeasymap (Frames Per Second, More Is Better)
r1: 413.88 (SE +/- 0.79, N = 3; MIN: 119.86 / MAX: 499.75)
r2: 412.43 (SE +/- 2.87, N = 3; MIN: 103.17 / MAX: 499.75)
r3: 412.38 (SE +/- 4.35, N = 3; MIN: 127.91 / MAX: 499.75)
(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

LevelDB 1.22 - Benchmark: Seek Random (Microseconds Per Op, Fewer Is Better)
r2: 12.63 (SE +/- 0.10, N = 15)
r3: 12.64 (SE +/- 0.11, N = 14)
r1: 12.69 (SE +/- 0.11, N = 15)
(CXX) g++ options: -O3 -lsnappy -lpthread

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.90 - Blend File: Fishy Cat - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
r2: 60.18 (SE +/- 0.04, N = 3)
r3: 60.25 (SE +/- 0.12, N = 3)
r1: 60.35 (SE +/- 0.03, N = 3)

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better)
r1: 0.939 (SE +/- 0.002, N = 3)
r2: 0.938 (SE +/- 0.000, N = 3)
r3: 0.935 (SE +/- 0.001, N = 3)

IndigoBench 4.4 - Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better)
r3: 2.156 (SE +/- 0.002, N = 3)
r2: 2.150 (SE +/- 0.001, N = 3)
r1: 2.147 (SE +/- 0.002, N = 3)

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2020-08-23 - Model: SqueezeNet (Microseconds, Fewer Is Better)
r1: 354892 (SE +/- 2566.21, N = 3)
r2: 356034 (SE +/- 2576.61, N = 3)
r3: 356258 (SE +/- 2539.06, N = 3)

TensorFlow Lite 2020-08-23 - Model: Mobilenet Quant (Microseconds, Fewer Is Better)
r1: 236716 (SE +/- 1686.36, N = 3)
r2: 237129 (SE +/- 1668.46, N = 3)
r3: 237406 (SE +/- 1810.35, N = 3)

TensorFlow Lite 2020-08-23 - Model: NASNet Mobile (Microseconds, Fewer Is Better)
r1: 302594 (SE +/- 3140.84, N = 3)
r3: 304079 (SE +/- 1284.72, N = 3)
r2: 304756 (SE +/- 2025.87, N = 3)

TensorFlow Lite 2020-08-23 - Model: Mobilenet Float (Microseconds, Fewer Is Better)
r1: 239119 (SE +/- 1996.41, N = 3)
r2: 239224 (SE +/- 1820.00, N = 3)
r3: 239537 (SE +/- 1638.46, N = 3)

DDraceNetwork

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time (Milliseconds, Fewer Is Better)
r1: Min: 2 / Avg: 2.3 / Max: 10.06
r2: Min: 2 / Avg: 2.32 / Max: 5.18
r3: Min: 2 / Avg: 2.32 / Max: 8.68
(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap (Frames Per Second, More Is Better)
r1: 435.20 (SE +/- 0.25, N = 3; MIN: 99.45 / MAX: 499.75)
r3: 434.24 (SE +/- 2.45, N = 3; MIN: 115.25 / MAX: 499.75)
r2: 429.37 (SE +/- 2.73, N = 3; MIN: 112.88 / MAX: 499.75)
(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33 - Operation: Sharpen (Iterations Per Minute, More Is Better)
r3: 73 (SE +/- 0.67, N = 3)
r2: 72 (SE +/- 0.58, N = 3)
r1: 72 (SE +/- 0.33, N = 3)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: Enhanced (Iterations Per Minute, More Is Better)
r3: 115 (SE +/- 0.67, N = 3)
r2: 115 (SE +/- 0.67, N = 3)
r1: 115 (SE +/- 0.67, N = 3)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
r3: 147 (SE +/- 1.20, N = 3)
r2: 147 (SE +/- 1.00, N = 3)
r1: 146 (SE +/- 1.33, N = 3)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: Resizing (Iterations Per Minute, More Is Better)
r1: 552 (SE +/- 2.73, N = 3)
r3: 551 (SE +/- 5.36, N = 3)
r2: 551 (SE +/- 5.00, N = 3)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)
r3: 776 (SE +/- 4.51, N = 3)
r1: 775 (SE +/- 5.03, N = 3)
r2: 774 (SE +/- 5.70, N = 3)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: Rotate (Iterations Per Minute, More Is Better)
r1: 902 (SE +/- 2.52, N = 3)
r3: 900 (SE +/- 1.86, N = 3)
r2: 875 (SE +/- 3.18, N = 3)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
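
Most results in this file differ between runs by well under one percent, so the Rotate result above stands out. One simple way to quantify that is the spread between the best and worst run relative to the worst. The sketch below uses the three Rotate values from this file. The function name and the choice of the minimum as the baseline are assumptions made for illustration, not part of the Phoronix Test Suite.

```python
def percent_spread(results):
    """Return (max - min) / min as a percentage across run averages."""
    values = list(results)
    lowest, highest = min(values), max(values)
    return (highest - lowest) / lowest * 100.0


# GraphicsMagick "Rotate" averages (Iterations Per Minute) for the three runs.
rotate = {"r1": 902, "r3": 900, "r2": 875}
print(f"Rotate spread: {percent_spread(rotate.values()):.2f}%")
```

A spread of a few percent is still small, but it is an order of magnitude larger than for the other GraphicsMagick operations listed here, which is the kind of difference the "Do Not Show Results With Little Change/Spread" view option is meant to surface.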

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Thorough (Seconds, Fewer Is Better)
r1: 54.29 (SE +/- 0.54, N = 3)
r2: 54.38 (SE +/- 0.54, N = 3)
r3: 54.65 (SE +/- 0.42, N = 3)
(CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: ETC1S (Seconds, Fewer Is Better)
r1: 57.82 (SE +/- 0.38, N = 3)
r3: 58.06 (SE +/- 0.56, N = 3)
r2: 58.06 (SE +/- 0.15, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

GEGL

GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.

GEGL - Operation: Wavelet Blur (Seconds, Fewer Is Better)
r3: 57.84 (SE +/- 0.25, N = 3)
r2: 57.95 (SE +/- 0.39, N = 3)
r1: 57.99 (SE +/- 0.25, N = 3)

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 1 (Frames Per Second, More Is Better)
r3: 0.347 (SE +/- 0.003, N = 3)
r1: 0.347 (SE +/- 0.002, N = 3)
r2: 0.346 (SE +/- 0.002, N = 3)

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

DeepSpeech 0.6 - Acceleration: CPU (Seconds, Fewer Is Better)
r3: 81.04 (SE +/- 0.04, N = 3)
r2: 81.07 (SE +/- 0.04, N = 3)
r1: 81.30 (SE +/- 0.21, N = 3)

LuxCoreRender OpenCL

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on OpenCL accelerators/GPUs. The alternative luxcorerender test profile is for CPU execution due to a difference in tests, etc. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender OpenCL 2.3 - Scene: Rainbow Colors and Prism (M samples/sec, More Is Better)
r3: 5.41 (SE +/- 0.02, N = 3; MIN: 4.58 / MAX: 5.7)
r2: 5.39 (SE +/- 0.02, N = 3; MIN: 4.6 / MAX: 5.67)
r1: 5.30 (SE +/- 0.12, N = 12; MIN: 1.66 / MAX: 5.7)

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 5 (Frames Per Second, More Is Better)
r1: 1.069 (SE +/- 0.005, N = 3)
r3: 1.064 (SE +/- 0.003, N = 3)
r2: 1.064 (SE +/- 0.004, N = 3)

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 2 (Seconds, Fewer Is Better)
r1: 55.50 (SE +/- 0.55, N = 3)
r2: 55.74 (SE +/- 0.41, N = 3)
r3: 55.77 (SE +/- 0.58, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Medium (Seconds, Fewer Is Better)
r3: 7.58 (SE +/- 0.16, N = 15)
r2: 7.61 (SE +/- 0.11, N = 15)
r1: 7.68 (SE +/- 0.14, N = 15)
(CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

GEGL

GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.

GEGL - Operation: Color Enhance (Seconds, Fewer Is Better)
r3: 54.10 (SE +/- 0.28, N = 3)
r1: 54.11 (SE +/- 0.22, N = 3)
r2: 54.31 (SE +/- 0.04, N = 3)

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1 - Throughput Test: LargeRandom (GB/s, More Is Better)
r3: 0.5 (SE +/- 0.00, N = 3)
r2: 0.5 (SE +/- 0.00, N = 3)
r1: 0.5 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

simdjson 0.7.1 - Throughput Test: PartialTweets (GB/s, More Is Better)
r2: 0.87 (SE +/- 0.01, N = 3)
r3: 0.86 (SE +/- 0.00, N = 3)
r1: 0.86 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

simdjson 0.7.1 - Throughput Test: DistinctUserID (GB/s, More Is Better)
r1: 0.89 (SE +/- 0.00, N = 3)
r3: 0.88 (SE +/- 0.00, N = 3)
r2: 0.88 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest 3.30 - Timed Time - Size 1,000 (Seconds, Fewer Is Better)
r1: 49.55 (SE +/- 0.25, N = 3)
r3: 50.26 (SE +/- 0.13, N = 3)
r2: 50.27 (SE +/- 0.17, N = 3)
(CC) gcc options: -O2 -ldl -lz -lpthread

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA (Seconds, Fewer Is Better)
r1: 10.50 (SE +/- 0.08, N = 12)
r2: 10.56 (SE +/- 0.10, N = 15)
r3: 10.61 (SE +/- 0.10, N = 14)
(CC) gcc options: -std=c99 -O3 -lm -lpthread

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
r3: 223892.44 (SE +/- 2209.16, N = 3)
r1: 223414.81 (SE +/- 2532.07, N = 3)
r2: 223304.98 (SE +/- 1894.03, N = 3)
(CC) gcc options: -O2 -lrt" -lrt

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

LevelDB 1.22 - Benchmark: Random Read (Microseconds Per Op, Fewer Is Better)
r3: 9.573 (SE +/- 0.214, N = 15)
r1: 9.620 (SE +/- 0.250, N = 12)
r2: 9.692 (SE +/- 0.206, N = 15)
(CXX) g++ options: -O3 -lsnappy -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1 - Video Input: Summer Nature 1080p (FPS, More Is Better)
r1: 460.02 (SE +/- 3.60, N = 14; MIN: 375.05 / MAX: 590.01)
r3: 459.71 (SE +/- 3.80, N = 13; MIN: 374.63 / MAX: 587.93)
r2: 459.61 (SE +/- 3.46, N = 13; MIN: 374.03 / MAX: 582.97)
(CC) gcc options: -pthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 6 (Frames Per Second, More Is Better)
r1: 1.444 (SE +/- 0.010, N = 3); r3: 1.443 (SE +/- 0.012, N = 3); r2: 1.440 (SE +/- 0.006, N = 3)

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML - FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL (FPS, More Is Better)
r1: 110.07 (SE +/- 0.19, N = 3); r3: 109.99 (SE +/- 0.40, N = 3); r2: 109.98 (SE +/- 0.42, N = 3)

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

VkResample 1.0 - Upscale: 2x - Precision: Double (ms, Fewer Is Better)
r1: 256.87 (SE +/- 0.20, N = 3); r2: 257.06 (SE +/- 0.11, N = 3); r3: 257.62 (SE +/- 0.20, N = 3)
1. (CXX) g++ options: -O3 -pthread

GEGL

GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.

GEGL - Operation: Rotate 90 Degrees (Seconds, Fewer Is Better)
r2: 37.54 (SE +/- 0.36, N = 3); r3: 37.69 (SE +/- 0.43, N = 3); r1: 37.70 (SE +/- 0.31, N = 3)

GEGL - Operation: Antialias (Seconds, Fewer Is Better)
r1: 36.56 (SE +/- 0.45, N = 3); r2: 36.56 (SE +/- 0.35, N = 3); r3: 36.65 (SE +/- 0.38, N = 3)

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis (Seconds, Fewer Is Better)
r1: 26.47 (SE +/- 0.29, N = 4); r2: 27.18 (SE +/- 0.12, N = 4); r3: 27.71 (SE +/- 0.04, N = 4)
1. (CC) gcc options: -O2 -std=c99 -lpthread -lm

Cryptsetup

This is a test profile for running the cryptsetup benchmark to report on the system's cryptography performance. Learn more via the OpenBenchmarking.org test page.

Cryptsetup - Twofish-XTS 512b Encryption (MiB/s, More Is Better)
r2: 486.4 (SE +/- 0.97, N = 3); r3: 483.8 (SE +/- 2.12, N = 3); r1: 483.0 (SE +/- 0.30, N = 2)

Cryptsetup - Twofish-XTS 512b Decryption (MiB/s, More Is Better)
r2: 485.7 (SE +/- 1.44, N = 3); r3: 483.0 (SE +/- 2.34, N = 3); r1: 482.7 (SE +/- 0.10, N = 3)

Cryptsetup - Serpent-XTS 512b Decryption (MiB/s, More Is Better)
r2: 878.1 (SE +/- 1.17, N = 3); r3: 873.5 (SE +/- 4.24, N = 3); r1: 871.7 (SE +/- 1.28, N = 3)

Cryptsetup - Serpent-XTS 512b Encryption (MiB/s, More Is Better)
r2: 882.1 (SE +/- 0.87, N = 3); r1: 878.0 (SE +/- 0.83, N = 3); r3: 874.4 (SE +/- 4.25, N = 3)

Cryptsetup - AES-XTS 512b Decryption (MiB/s, More Is Better)
r2: 3388.5 (SE +/- 10.03, N = 3); r3: 3362.9 (SE +/- 13.02, N = 3); r1: 3348.3 (SE +/- 1.21, N = 3)

Cryptsetup - AES-XTS 512b Encryption (MiB/s, More Is Better)
r2: 3381.9 (SE +/- 15.69, N = 3); r1: 3346.8 (SE +/- 3.15, N = 3); r3: 3336.0 (SE +/- 25.61, N = 3)

Cryptsetup - Twofish-XTS 256b Decryption (MiB/s, More Is Better)
r2: 486.3 (SE +/- 1.43, N = 3); r3: 483.0 (SE +/- 2.21, N = 3); r1: 482.5 (SE +/- 0.34, N = 3)

Cryptsetup - Twofish-XTS 256b Encryption (MiB/s, More Is Better)
r2: 487.4 (SE +/- 1.08, N = 3); r3: 483.6 (SE +/- 2.51, N = 3); r1: 482.0 (SE +/- 0.75, N = 3)

Cryptsetup - Serpent-XTS 256b Decryption (MiB/s, More Is Better)
r2: 876.6 (SE +/- 1.50, N = 3); r1: 872.3 (SE +/- 1.62, N = 3); r3: 870.9 (SE +/- 4.03, N = 3)

Cryptsetup - Serpent-XTS 256b Encryption (MiB/s, More Is Better)
r2: 881.4 (SE +/- 1.25, N = 3); r3: 874.1 (SE +/- 2.67, N = 3); r1: 874.1 (SE +/- 0.92, N = 3)

Cryptsetup - AES-XTS 256b Decryption (MiB/s, More Is Better)
r2: 4055.1 (SE +/- 17.20, N = 3); r3: 4026.9 (SE +/- 15.07, N = 3); r1: 4002.4 (SE +/- 4.92, N = 3)

Cryptsetup - AES-XTS 256b Encryption (MiB/s, More Is Better)
r2: 4080.5 (SE +/- 25.91, N = 3); r3: 4023.0 (SE +/- 20.10, N = 3); r1: 4005.6 (SE +/- 1.66, N = 3)

Cryptsetup - PBKDF2-whirlpool (Iterations Per Second, More Is Better)
r2: 830020 (SE +/- 2314.28, N = 3); r1: 816282 (SE +/- 4903.32, N = 3); r3: 810352 (SE +/- 2497.33, N = 3)

Cryptsetup - PBKDF2-sha512 (Iterations Per Second, More Is Better)
r2: 1943008 (SE +/- 1201.00, N = 3); r1: 1919349 (SE +/- 7117.07, N = 3); r3: 1886103 (SE +/- 12877.64, N = 3)

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

LevelDB 1.22 - Benchmark: Sequential Fill (Microseconds Per Op, Fewer Is Better)
r1: 47.24 (SE +/- 0.54, N = 4); r2: 47.29 (SE +/- 0.58, N = 4); r3: 47.42 (SE +/- 0.48, N = 5)
1. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB 1.22 - Benchmark: Sequential Fill (MB/s, More Is Better)
r1: 37.5 (SE +/- 0.44, N = 4); r2: 37.4 (SE +/- 0.46, N = 4); r3: 37.3 (SE +/- 0.39, N = 5)
1. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB 1.22 - Benchmark: Random Delete (Microseconds Per Op, Fewer Is Better)
r1: 47.23 (SE +/- 0.49, N = 5); r2: 47.30 (SE +/- 0.57, N = 4); r3: 47.39 (SE +/- 0.56, N = 4)
1. (CXX) g++ options: -O3 -lsnappy -lpthread

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
r2: 295.55 (SE +/- 0.81, N = 3; MIN: 292.39 / MAX: 306.56); r3: 299.40 (SE +/- 0.36, N = 3; MIN: 297.92 / MAX: 315.55); r1: 321.42 (SE +/- 2.78, N = 8; MIN: 300.42 / MAX: 371.06)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
r3: 340.59 (SE +/- 3.74, N = 3); r2: 340.46 (SE +/- 3.68, N = 3); r1: 340.42 (SE +/- 3.78, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application. This test will use any system-installed Darktable program or, on Windows, will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Darktable 3.0.1 - Test: Masskrug - Acceleration: CPU-only (Seconds, Fewer Is Better)
r1: 7.128 (SE +/- 0.097, N = 12); r2: 7.150 (SE +/- 0.096, N = 12); r3: 7.155 (SE +/- 0.099, N = 12)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
r2: 7.04404 (SE +/- 0.11582, N = 12; MIN: 4.11); r3: 7.14574 (SE +/- 0.02993, N = 3; MIN: 5.45); r1: 7.16575 (SE +/- 0.05152, N = 3; MIN: 5.58)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
r3: 3.11291 (SE +/- 0.06527, N = 12; MIN: 1.86); r2: 3.16769 (SE +/- 0.02081, N = 3; MIN: 2.39); r1: 3.17762 (SE +/- 0.01732, N = 3; MIN: 2.58)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

GEGL

GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.

GEGL - Operation: Scale (Seconds, Fewer Is Better)
r1: 6.954 (SE +/- 0.055, N = 12); r2: 6.973 (SE +/- 0.059, N = 13); r3: 7.000 (SE +/- 0.056, N = 14)

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 1 - Decompression Speed (MB/s, More Is Better)
r2: 9839.9 (SE +/- 2.38, N = 3); r1: 9823.2 (SE +/- 3.96, N = 3); r3: 9810.0 (SE +/- 10.11, N = 3)
1. (CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 1 - Compression Speed (MB/s, More Is Better)
r2: 8127.78 (SE +/- 4.75, N = 3); r1: 8120.67 (SE +/- 6.52, N = 3); r3: 8079.18 (SE +/- 11.24, N = 3)
1. (CC) gcc options: -O3

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.5 - Compression Level: 3 (MB/s, More Is Better)
r3: 2835.1 (SE +/- 4.18, N = 3); r1: 2833.6 (SE +/- 7.25, N = 3); r2: 2831.0 (SE +/- 8.65, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

GEGL

GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.

GEGL - Operation: Reflect (Seconds, Fewer Is Better)
r1: 28.18 (SE +/- 0.29, N = 3); r3: 28.31 (SE +/- 0.22, N = 3); r2: 28.50 (SE +/- 0.30, N = 3)

GEGL - Operation: Tile Glass (Seconds, Fewer Is Better)
r3: 28.06 (SE +/- 0.39, N = 3); r2: 28.24 (SE +/- 0.27, N = 3); r1: 28.24 (SE +/- 0.36, N = 3)

GEGL - Operation: Crop (Seconds, Fewer Is Better)
r3: 8.826 (SE +/- 0.077, N = 8); r2: 8.839 (SE +/- 0.073, N = 9); r1: 8.900 (SE +/- 0.065, N = 11)

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 10 (Frames Per Second, More Is Better)
r1: 3.422 (SE +/- 0.044, N = 3); r3: 3.420 (SE +/- 0.027, N = 3); r2: 3.404 (SE +/- 0.035, N = 3)

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

NAMD CUDA 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
r1: 0.22103 (SE +/- 0.00131, N = 3); r3: 0.22171 (SE +/- 0.00272, N = 4); r2: 0.22238 (SE +/- 0.00245, N = 5)
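NAMD reports days of wall-clock time per simulated nanosecond (lower is better), while other molecular dynamics results in this file, such as LAMMPS, use the reciprocal unit ns/day (higher is better). A minimal sketch of the conversion follows; the function name is hypothetical. The two tests simulate different systems, so the converted figures are not directly comparable.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Convert a days/ns figure (lower is better) to ns/day (higher is better)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


# NAMD CUDA results for the three runs, in days/ns.
namd = {"r1": 0.22103, "r3": 0.22171, "r2": 0.22238}
for run, value in namd.items():
    print(f"{run}: {days_per_ns_to_ns_per_day(value):.3f} ns/day")
```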

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: SADD (Requests Per Second, More Is Better)
r1: 2660539.42 (SE +/- 28020.60, N = 3); r3: 2634908.83 (SE +/- 27994.25, N = 3); r2: 2628039.25 (SE +/- 23332.27, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

PHPBench 0.8.1 - PHP Benchmark Suite (Score, More Is Better)
r1: 837911 (SE +/- 4346.11, N = 3); r2: 832417 (SE +/- 2600.83, N = 3); r3: 829705 (SE +/- 587.84, N = 3)

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

RNNoise 2020-06-28 (Seconds, Fewer Is Better)
r2: 21.32 (SE +/- 0.04, N = 3); r3: 22.04 (SE +/- 0.02, N = 3); r1: 22.08 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden -lm
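Because the input is described as a 26-minute audio file, each timing can also be read as a speed relative to real time. The sketch below assumes the clip is exactly 26 minutes (1560 seconds) long, which the description only states approximately; the names are hypothetical.

```python
AUDIO_SECONDS = 26 * 60  # assumed clip length: exactly 26 minutes


def realtime_factor(processing_seconds, audio_seconds=AUDIO_SECONDS):
    """How many seconds of audio are processed per second of wall-clock time."""
    if processing_seconds <= 0:
        raise ValueError("processing time must be positive")
    return audio_seconds / processing_seconds


# RNNoise denoise times for the three runs, in seconds.
rnnoise = {"r2": 21.32, "r3": 22.04, "r1": 22.08}
for run, seconds in rnnoise.items():
    print(f"{run}: {realtime_factor(seconds):.1f}x real time")
```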

Unpacking Firefox

This simple test profile measures how long it takes to extract the .tar.xz source package of the Mozilla Firefox Web Browser. Learn more via the OpenBenchmarking.org test page.

Unpacking Firefox 84.0 - Extracting: firefox-84.0.source.tar.xz (Seconds, Fewer Is Better)
r1: 16.03 (SE +/- 0.08, N = 4); r3: 16.10 (SE +/- 0.14, N = 4); r2: 16.14 (SE +/- 0.09, N = 4)

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
r1: 5.198 (SE +/- 0.111, N = 15); r3: 5.179 (SE +/- 0.110, N = 15); r2: 5.169 (SE +/- 0.109, N = 15)
1. (CXX) g++ options: -O3 -pthread -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
r1: 8.96782 (SE +/- 0.04374, N = 3; MIN: 8.14); r2: 9.00692 (SE +/- 0.01715, N = 3; MIN: 8.15); r3: 9.06628 (SE +/- 0.11418, N = 3; MIN: 8)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
r3: 9.73732 (SE +/- 0.03582, N = 3; MIN: 8.75); r2: 9.76468 (SE +/- 0.03928, N = 3; MIN: 8.72); r1: 9.77594 (SE +/- 0.04555, N = 3; MIN: 8.77)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Inkscape

Inkscape is an open-source vector graphics editor. This test profile times how long it takes to complete various operations by Inkscape. Learn more via the OpenBenchmarking.org test page.

Inkscape - Operation: SVG Files To PNG (Seconds, Fewer Is Better)
r1: 21.00 (SE +/- 0.04, N = 3); r2: 21.05 (SE +/- 0.02, N = 3); r3: 21.07 (SE +/- 0.03, N = 3)
1. Inkscape 0.92.5 (2060ec1f9f, 2020-04-08)

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: LPOP (Requests Per Second, More Is Better)
r1: 3394660.20 (SE +/- 36042.05, N = 3); r3: 2809233.48 (SE +/- 181152.66, N = 12); r2: 2104092.33 (SE +/- 3702.86, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
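The LPOP result stands out because the three runs differ far more than in most other tests in this file. One simple way to quantify that is the gap between the best and worst run, expressed as a percentage of the worst. The sketch below applies this to the LPOP and SADD figures; the function name and the choice of metric are illustrative assumptions, not something the result page itself reports.

```python
def spread_percent(values):
    """Gap between the highest and lowest value, as a percentage of the lowest."""
    values = list(values)
    lowest, highest = min(values), max(values)
    if lowest <= 0:
        raise ValueError("values must be positive")
    return (highest - lowest) / lowest * 100


# Requests per second for runs r1, r3, r2 as reported for each Redis test.
lpop = [3394660.20, 2809233.48, 2104092.33]
sadd = [2660539.42, 2634908.83, 2628039.25]
print(f"LPOP spread: {spread_percent(lpop):.1f}%")
print(f"SADD spread: {spread_percent(sadd):.1f}%")
```

A spread this large between otherwise identical runs suggests the LPOP numbers should be treated as noisy rather than as a real difference between r1, r2 and r3.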

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

Crafty 25.2 - Elapsed Time (Nodes Per Second, More Is Better)
r2: 9584148 (SE +/- 7176.35, N = 3); r3: 9560012 (SE +/- 16578.83, N = 3); r1: 9497414 (SE +/- 45086.65, N = 3)
1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
r2: 264.95 (SE +/- 0.11, N = 3; MIN: 264.07 / MAX: 268.01); r3: 272.68 (SE +/- 0.12, N = 3; MIN: 271.53 / MAX: 277.6); r1: 272.91 (SE +/- 1.46, N = 3; MIN: 264.43 / MAX: 277.05)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

Monkey Audio Encoding 3.99.6 - WAV To APE (Seconds, Fewer Is Better)
r1: 10.51 (SE +/- 0.03, N = 5); r3: 10.59 (SE +/- 0.01, N = 5); r2: 10.86 (SE +/- 0.04, N = 5)
1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Betsy GPU Compressor

Betsy is an open-source GPU compressor of various GPU compression techniques. Betsy is written in GLSL for Vulkan/OpenGL (compute shader) support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

Betsy GPU Compressor 1.1 Beta - Codec: ETC2 RGB - Quality: Highest (Seconds, Fewer Is Better)
r3: 7.903 (SE +/- 0.023, N = 3); r2: 7.912 (SE +/- 0.018, N = 3); r1: 8.016 (SE +/- 0.064, N = 13)
1. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Darktable

Darktable is an open-source photography / workflow application. This test will use any system-installed Darktable program or, on Windows, will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Darktable 3.0.1 - Test: Boat - Acceleration: CPU-only (Seconds, Fewer Is Better)
r3: 15.86 (SE +/- 0.03, N = 3); r2: 15.87 (SE +/- 0.04, N = 3); r1: 15.91 (SE +/- 0.02, N = 3)

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

RealSR-NCNN 20200818 - Scale: 4x - TAA: No (Seconds, Fewer Is Better)
r2: 14.66 (SE +/- 0.09, N = 3); r3: 14.69 (SE +/- 0.11, N = 3); r1: 14.73 (SE +/- 0.01, N = 3)

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.1.1 - Benchmark: SHA-512 (H/s, More Is Better)
r1: 1023100000 (SE +/- 11546345.54, N = 15); r2: 1020000000 (SE +/- 2594224.35, N = 3); r3: 1016800000 (SE +/- 1852025.92, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
r2: 4.71457 (SE +/- 0.06823, N = 15; MIN: 3.29); r3: 4.73728 (SE +/- 0.07477, N = 15; MIN: 3.29); r1: 4.74772 (SE +/- 0.10403, N = 12; MIN: 3.29)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
r2: 9.77701 (SE +/- 0.15643, N = 15; MIN: 6.67); r3: 9.81238 (SE +/- 0.22537, N = 12; MIN: 6.65); r1: 9.87893 (SE +/- 0.23621, N = 12; MIN: 6.66)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

yquake2 7.45 - Renderer: OpenGL 1.x - Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
r3: 59.9 (SE +/- 0.03, N = 3); r2: 59.9 (SE +/- 0.07, N = 3); r1: 59.9 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2 7.45 - Renderer: OpenGL 3.x - Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
r3: 60; r2: 60; r1: 60
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

Opus Codec Encoding 1.3.1 - WAV To Opus Encode (Seconds, Fewer Is Better)
r2: 7.602 (SE +/- 0.004, N = 5); r3: 7.616 (SE +/- 0.008, N = 5); r1: 7.624 (SE +/- 0.009, N = 5)
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
r1: 4.36381 (SE +/- 0.00310, N = 3; MIN: 4.23); r2: 4.37852 (SE +/- 0.00806, N = 3; MIN: 4.25); r3: 4.38535 (SE +/- 0.00559, N = 3; MIN: 4.25)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
r1: 4.45564 (SE +/- 0.00967, N = 3; MIN: 4.02); r3: 4.46656 (SE +/- 0.00726, N = 3; MIN: 4.01); r2: 4.47062 (SE +/- 0.01661, N = 3; MIN: 4.02)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: LPUSH (Requests Per Second, More Is Better)
r2: 2094056.31 (SE +/- 21753.96, N = 4); r3: 2083566.29 (SE +/- 8925.21, N = 3); r1: 2041750.08 (SE +/- 25221.07, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Betsy GPU Compressor

Betsy is an open-source GPU compressor of various GPU compression techniques. Betsy is written in GLSL for Vulkan/OpenGL (compute shader) support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

Betsy GPU Compressor 1.1 Beta - Codec: ETC1 - Quality: Highest (Seconds, Fewer Is Better)
r2: 5.789 (SE +/- 0.008, N = 3); r3: 5.792 (SE +/- 0.024, N = 3); r1: 5.854 (SE +/- 0.068, N = 12)
1. (CXX) g++ options: -O3 -O2 -lpthread -ldl

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

yquake2 7.45 - Renderer: Software CPU - Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
r2: 60.7 (SE +/- 0.07, N = 3); r1: 60.7 (SE +/- 0.07, N = 3); r3: 60.6 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

VkResample 1.0 - Upscale: 2x - Precision: Single (ms, Fewer Is Better)
r1: 24.99 (SE +/- 0.02, N = 3); r2: 25.19 (SE +/- 0.05, N = 3); r3: 25.23 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -pthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Fast (Seconds, Fewer Is Better)
r1: 5.44 (SE +/- 0.05, N = 3); r2: 5.59 (SE +/- 0.08, N = 3); r3: 5.63 (SE +/- 0.04, N = 12)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML - FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL (FPS, More Is Better)
r3: 478.73 (SE +/- 2.86, N = 3); r2: 477.39 (SE +/- 1.92, N = 3); r1: 463.34 (SE +/- 0.36, N = 3)

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: SET (Requests Per Second, More Is Better)
r3: 2433543.80 (SE +/- 6859.51, N = 3); r2: 2413657.00 (SE +/- 3903.32, N = 3); r1: 2375800.25 (SE +/- 17218.21, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 6.0.9 - Test: GET (Requests Per Second, More Is Better)
r1: 3248596.08 (SE +/- 41615.25, N = 3); r2: 3012560.83 (SE +/- 13828.40, N = 3); r3: 3009326.75 (SE +/- 8077.93, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
r1: 2.72558 (SE +/- 0.00400, N = 3; MIN: 2.54); r3: 2.74874 (SE +/- 0.00352, N = 3; MIN: 2.54); r2: 2.77670 (SE +/- 0.01530, N = 3; MIN: 2.56)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
r2: 12.44 (SE +/- 0.04, N = 3; MIN: 12.09); r1: 12.47 (SE +/- 0.01, N = 3; MIN: 12.08); r3: 12.61 (SE +/- 0.03, N = 3; MIN: 12.2)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Integer Compute INT (GIOPS, More Is Better)
r3: 5540.44 (SE +/- 81.16, N = 15); r2: 5519.39 (SE +/- 81.08, N = 15); r1: 5504.35 (SE +/- 71.93, N = 15)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 0 (Seconds, Fewer Is Better)
r1: 7.288 (SE +/- 0.079, N = 3); r2: 7.345 (SE +/- 0.061, N = 3); r3: 7.353 (SE +/- 0.095, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

LevelDB 1.22 - Benchmark: Hot Read (Microseconds Per Op, Fewer Is Better)
r1: 6.946 (SE +/- 0.013, N = 3); r2: 7.099 (SE +/- 0.075, N = 3); r3: 7.128 (SE +/- 0.049, N = 3)
1. (CXX) g++ options: -O3 -lsnappy -lpthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
r3: 7.027 (SE +/- 0.016, N = 3); r2: 7.055 (SE +/- 0.013, N = 3); r1: 7.115 (SE +/- 0.065, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.1.1 - Benchmark: TrueCrypt RIPEMD160 + XTS (H/s, More Is Better)
r2: 301433 (SE +/- 851.14, N = 3); r1: 301233 (SE +/- 1322.04, N = 3); r3: 298133 (SE +/- 545.69, N = 3)

Hashcat 6.1.1 - Benchmark: SHA1 (H/s, More Is Better)
r1: 8585766667 (SE +/- 31347213.24, N = 3); r2: 8544500000 (SE +/- 17380832.35, N = 3); r3: 8535333333 (SE +/- 18653000.95, N = 3)

Hashcat 6.1.1 - Benchmark: MD5 (H/s, More Is Better)
r1: 24334866667 (SE +/- 110495102.96, N = 3); r2: 24260200000 (SE +/- 81107726.72, N = 3); r3: 24196900000 (SE +/- 49256167.13, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
r2: 17.90 (SE +/- 0.08, N = 3; MIN: 17.18); r1: 18.01 (SE +/- 0.02, N = 3; MIN: 17.22); r3: 18.03 (SE +/- 0.02, N = 3; MIN: 17.24)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
r3: 21.62 (SE +/- 0.02, N = 3; MIN: 21.51); r1: 21.69 (SE +/- 0.06, N = 3; MIN: 21.47); r2: 21.70 (SE +/- 0.06, N = 3; MIN: 21.48)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13 - Benchmark: Write (GB/s, More Is Better)
r1: 215.7 (SE +/- 0.47, N = 3); r2: 215.6 (SE +/- 0.26, N = 3); r3: 214.8 (SE +/- 0.50, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13 - Benchmark: Copy (GB/s, More Is Better)
r1: 236.6 (SE +/- 0.22, N = 3); r2: 235.4 (SE +/- 0.24, N = 3); r3: 235.1 (SE +/- 0.27, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13 - Benchmark: Read (GB/s, More Is Better)
r1: 330.3 (SE +/- 0.18, N = 3); r3: 329.9 (SE +/- 0.03, N = 3); r2: 329.9 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML - FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL (FPS, More Is Better)
r2: 1823.06 (SE +/- 3.54, N = 3); r1: 1819.24 (SE +/- 7.57, N = 3); r3: 1817.78 (SE +/- 8.53, N = 3)

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

Waifu2x-NCNN Vulkan 20200818 - Scale: 2x - Denoise: 3 - TAA: Yes (Seconds, Fewer Is Better)
r1: 6.020 (SE +/- 0.004, N = 3); r3: 6.093 (SE +/- 0.011, N = 3); r2: 6.102 (SE +/- 0.007, N = 3)

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML - FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL (FPS, More Is Better)
r3: 1247.93 (SE +/- 4.92, N = 3); r1: 1246.78 (SE +/- 3.10, N = 3); r2: 1244.95 (SE +/- 2.03, N = 3)

Darktable

Darktable is an open-source photography / workflow application. This test uses any system-installed Darktable program or, on Windows, automatically downloads the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Darktable 3.0.1 - Test: Server Room - Acceleration: CPU-only (Seconds, Fewer Is Better)
  r2: 4.174 (SE +/- 0.004, N = 3)
  r3: 4.178 (SE +/- 0.006, N = 3)
  r1: 4.181 (SE +/- 0.010, N = 3)

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software that runs on the CPU, with optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

NeatBench 5 - Acceleration: GPU (FPS, More Is Better)
  r3: 27.6 (SE +/- 0.60, N = 15)
  r1: 27.5 (SE +/- 0.57, N = 15)
  r2: 27.1 (SE +/- 0.47, N = 15)

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

LevelDB 1.22 - Benchmark: Random Fill (Microseconds Per Op, Fewer Is Better)
  r3: 40.98 (SE +/- 0.07, N = 3)
  r2: 41.03 (SE +/- 0.20, N = 3)
  r1: 41.04 (SE +/- 0.19, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB 1.22 - Benchmark: Random Fill (MB/s, More Is Better)
  r3: 43.2 (SE +/- 0.07, N = 3)
  r2: 43.1 (SE +/- 0.19, N = 3)
  r1: 43.1 (SE +/- 0.21, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB 1.22 - Benchmark: Overwrite (Microseconds Per Op, Fewer Is Better)
  r3: 40.76 (SE +/- 0.04, N = 3)
  r1: 40.93 (SE +/- 0.15, N = 3)
  r2: 40.96 (SE +/- 0.08, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB 1.22 - Benchmark: Overwrite (MB/s, More Is Better)
  r3: 43.4 (SE +/- 0.03, N = 3)
  r2: 43.2 (SE +/- 0.07, N = 3)
  r1: 43.2 (SE +/- 0.15, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -lpthread

MandelGPU

MandelGPU is an OpenCL benchmark. This test runs the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

MandelGPU 1.3pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
  r2: 252826584.8 (SE +/- 157365.45, N = 3)
  r3: 252822614.4 (SE +/- 1449538.54, N = 3)
  r1: 251986408.7 (SE +/- 1032565.22, N = 3)
  1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
  r1: 5940.64 (SE +/- 83.30, N = 15)
  r3: 5892.70 (SE +/- 47.53, N = 3)
  r2: 5858.32 (SE +/- 64.05, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

LevelDB 1.22 - Benchmark: Fill Sync (Microseconds Per Op, Fewer Is Better)
  r1: 3361.78 (SE +/- 33.91, N = 3)
  r3: 3386.08 (SE +/- 60.32, N = 3)
  r2: 3424.92 (SE +/- 25.98, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB 1.22 - Benchmark: Fill Sync (MB/s, More Is Better)
  r3: 0.5 (SE +/- 0.00, N = 3)
  r2: 0.5 (SE +/- 0.00, N = 3)
  r1: 0.5 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -lpthread

ArrayFire

ArrayFire is a GPU and CPU numeric processing library. This test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

ArrayFire 3.7 - Test: Conjugate Gradient OpenCL (ms, Fewer Is Better)
  r2: 2.531 (SE +/- 0.022, N = 3)
  r3: 2.548 (SE +/- 0.018, N = 3)
  r1: 2.549 (SE +/- 0.015, N = 3)
  1. (CXX) g++ options: -rdynamic

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.1.1 - Benchmark: 7-Zip (H/s, More Is Better)
  r1: 373667 (SE +/- 1589.90, N = 3)
  r2: 370400 (SE +/- 1858.31, N = 3)
  r3: 366433 (SE +/- 3670.30, N = 3)

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ with support for OpenCL and OpenMP. This test profile uses ViennaCL's OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.4.2 - OpenCL LU Factorization (GFLOPS, More Is Better)
  r1: 68.29 (SE +/- 0.36, N = 3)
  r3: 65.92 (SE +/- 0.44, N = 3)
  r2: 64.23 (SE +/- 0.08, N = 3)
  1. (CXX) g++ options: -rdynamic -lOpenCL
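
The run-to-run difference for a result like this one can be expressed as a percentage. The Python sketch below assumes spread is defined as (max - min) / min; that definition and the function name spread_percent are illustrative choices, not necessarily what the Phoronix Test Suite uses for its "Little Change/Spread" filter. The three averages are the r1, r3 and r2 values reported for ViennaCL OpenCL LU Factorization.

```python
def spread_percent(values):
    """Relative spread between the best and worst value, as a percentage.

    Assumed definition: (max - min) / min * 100.
    """
    lowest = min(values)
    highest = max(values)
    if lowest <= 0:
        raise ValueError("values must be positive")
    return (highest - lowest) / lowest * 100.0


# Averages reported above for ViennaCL OpenCL LU Factorization (GFLOPS).
viennacl_lu = {"r1": 68.29, "r3": 65.92, "r2": 64.23}

spread = spread_percent(viennacl_lu.values())
print(f"ViennaCL LU spread across runs: {spread:.2f}%")
```

Comparing this figure with the reported SE values gives a rough sense of whether the gap between runs exceeds the noise within each run.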

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
  r3: 324.78 (SE +/- 0.28, N = 3)
  r1: 324.63 (SE +/- 0.32, N = 3)
  r2: 324.58 (SE +/- 0.28, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application. This test uses any system-installed Darktable program or, on Windows, automatically downloads the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Darktable 3.0.1 - Test: Server Rack - Acceleration: CPU-only (Seconds, Fewer Is Better)
  r1: 0.181 (SE +/- 0.000, N = 3)
  r2: 0.181 (SE +/- 0.000, N = 3)
  r3: 0.181 (SE +/- 0.000, N = 3)

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-06-06 - Benchmark: Black-Scholes OpenCL (ms, Fewer Is Better)
  r2: 17.48 (SE +/- 0.00, N = 3)
  r3: 17.48 (SE +/- 0.00, N = 3)
  r1: 17.48 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -O3 -lOpenCL
