Benchmarks for a future article.
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Crypto Repeat Enabled Run 500 1000 1500 2000 2500 SE +/- 0.23, N = 3 SE +/- 1.85, N = 3 SE +/- 1.86, N = 3 2218.61 2217.94 2217.75 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: CPU Cache Run Repeat Enabled 6 12 18 24 30 SE +/- 0.21, N = 3 SE +/- 0.29, N = 3 SE +/- 0.19, N = 3 23.42 23.02 22.88 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: CPU Stress Run Enabled Repeat 1100 2200 3300 4400 5500 SE +/- 21.71, N = 3 SE +/- 5.24, N = 3 SE +/- 12.12, N = 3 5001.07 4991.24 4964.68 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Matrix Math Run Repeat Enabled 12K 24K 36K 48K 60K SE +/- 210.46, N = 3 SE +/- 122.73, N = 3 SE +/- 126.41, N = 3 57377.45 57137.41 56981.91 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Vector Math Run Repeat Enabled 13K 26K 39K 52K 65K SE +/- 57.06, N = 3 SE +/- 100.32, N = 3 SE +/- 97.11, N = 3 62436.51 62160.59 61555.92 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Memory Copying Repeat Enabled Run 300 600 900 1200 1500 SE +/- 3.20, N = 3 SE +/- 2.01, N = 3 SE +/- 2.42, N = 3 1533.93 1533.48 1526.56 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Context Switching Enabled Repeat Run 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 56528.36, N = 3 SE +/- 63682.03, N = 3 SE +/- 58907.92, N = 4 5101477.04 5095907.11 4881747.94 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
srsLTE srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software defined (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org eNb Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Run Enabled Repeat 70 140 210 280 350 SE +/- 1.22, N = 3 SE +/- 0.55, N = 3 SE +/- 0.32, N = 3 336.9 335.8 335.7 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Sysbench This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU Enabled Run Repeat 7K 14K 21K 28K 35K SE +/- 0.29, N = 3 SE +/- 5.32, N = 3 SE +/- 4.23, N = 3 34101.32 34093.33 34092.54 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 Repeat Enabled Run 60M 120M 180M 240M 300M SE +/- 57627.28, N = 3 SE +/- 43622.17, N = 3 SE +/- 73645.19, N = 3 259622700 259341600 259329700 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 4K Enabled Repeat Run 40 80 120 160 200 SE +/- 0.18, N = 3 SE +/- 0.27, N = 3 SE +/- 0.27, N = 3 189.80 189.52 189.50 MIN: 176.26 / MAX: 202.54 MIN: 175.59 / MAX: 203 MIN: 177.16 / MAX: 202.89 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 1080p Run Repeat Enabled 150 300 450 600 750 SE +/- 1.92, N = 3 SE +/- 0.91, N = 3 SE +/- 0.85, N = 3 709.28 708.63 707.73 MIN: 632.86 / MAX: 774.84 MIN: 636.52 / MAX: 768.39 MIN: 632.84 / MAX: 768.08 1. (CC) gcc options: -pthread
OSPray Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: SciVis Repeat Enabled Run 5 10 15 20 25 SE +/- 0.16, N = 3 SE +/- 0.16, N = 3 SE +/- 0.00, N = 3 22.06 22.06 21.74 MIN: 21.28 / MAX: 22.22 MIN: 21.28 / MAX: 22.22 MIN: 20 / MAX: 22.22
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: SciVis Enabled Run Repeat 0.7515 1.503 2.2545 3.006 3.7575 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.34 3.33 3.32 MIN: 3.29 / MAX: 3.4 MIN: 3.27 / MAX: 3.41 MIN: 3.28 / MAX: 3.4
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: Path Tracer Run Repeat Enabled 0.3578 0.7156 1.0734 1.4312 1.789 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.59 1.59 1.59 MIN: 1.58 / MAX: 1.6 MIN: 1.58 / MAX: 1.6 MIN: 1.58 / MAX: 1.6
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: SciVis Run Repeat Enabled 6 12 18 24 30 25 25 25 MIN: 23.81 / MAX: 25.64 MIN: 24.39 / MAX: 25.64 MIN: 23.81 / MAX: 25.64
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: Path Tracer Enabled Run Repeat 0.3938 0.7876 1.1814 1.5752 1.969 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.75 1.74 1.73 MIN: 1.73 / MAX: 1.78 MIN: 1.65 / MAX: 1.78 MIN: 1.72 / MAX: 1.77
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: SciVis Run Repeat Enabled 5 10 15 20 25 20 20 20 MIN: 19.61 / MAX: 20.41 MIN: 19.23 / MAX: 20.41 MIN: 18.87 / MAX: 20.41
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: Path Tracer Run Enabled Repeat 1.0283 2.0566 3.0849 4.1132 5.1415 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 4.57 4.57 4.55 MIN: 4.42 / MAX: 4.74 MIN: 4.44 / MAX: 4.76 MIN: 4.44 / MAX: 4.78
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: Path Tracer Run Repeat Enabled 70 140 210 280 350 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 333.33 333.33 333.33 MIN: 250 MIN: 250 MIN: 250
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping Run Repeat Enabled 120 240 360 480 600 SE +/- 0.40, N = 3 SE +/- 1.14, N = 3 SE +/- 1.36, N = 3 561.83 556.31 550.73 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 17.86 17.79 17.73
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU Repeat Run Enabled 1.2465 2.493 3.7395 4.986 6.2325 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 5.54 5.52 5.52
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Realtime Enabled Run Repeat 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 29.19 29.14 29.13 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Two-Pass Repeat Enabled Run 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 22.89 22.80 22.77 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 8 Realtime Enabled Repeat Run 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 SE +/- 0.12, N = 3 114.16 113.96 113.06 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K Run Repeat Enabled 0.036 0.072 0.108 0.144 0.18 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.16 0.16 0.16 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K Repeat Enabled Run 0.891 1.782 2.673 3.564 4.455 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.96 3.96 3.94 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K Enabled Repeat Run 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 16.56 16.54 16.41 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K Enabled Run Repeat 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 7.59 7.56 7.56 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K Repeat Enabled Run 10 20 30 40 50 SE +/- 0.21, N = 3 SE +/- 0.06, N = 3 SE +/- 0.43, N = 15 43.24 42.80 40.83 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K Enabled Run Repeat 12 24 36 48 60 SE +/- 0.39, N = 14 SE +/- 0.03, N = 3 SE +/- 0.63, N = 15 52.82 51.88 51.82 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p Run Repeat Enabled 0.1125 0.225 0.3375 0.45 0.5625 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.50 0.50 0.50 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p Repeat Enabled Run 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 7.82 7.81 7.80 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p Enabled Repeat Run 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.21, N = 3 31.11 31.05 30.84 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p Repeat Enabled Run 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 23.20 23.17 23.12 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p Run Repeat Enabled 30 60 90 120 150 SE +/- 0.15, N = 3 SE +/- 1.83, N = 3 SE +/- 1.42, N = 13 138.16 137.23 135.81 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p Repeat Enabled Run 30 60 90 120 150 SE +/- 1.46, N = 15 SE +/- 2.05, N = 3 SE +/- 1.88, N = 4 156.45 152.40 152.03 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Crown Enabled Run Repeat 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.13, N = 3 14.81 14.64 14.64 MIN: 14.58 / MAX: 15.49 MIN: 14.37 / MAX: 15.3 MIN: 13.95 / MAX: 15.33
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Enabled Run Repeat 4 8 12 16 20 SE +/- 0.12, N = 3 SE +/- 0.15, N = 3 SE +/- 0.14, N = 3 13.95 13.92 13.86 MIN: 13.68 / MAX: 14.36 MIN: 13.63 / MAX: 14.38 MIN: 13.61 / MAX: 14.36
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Obj Enabled Repeat Run 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 12.80 12.79 12.50 MIN: 12.73 / MAX: 13.04 MIN: 12.68 / MAX: 13.03 MIN: 12.24 / MAX: 12.8
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Enabled Run Repeat 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 16.39 16.38 16.30 MIN: 16.23 / MAX: 16.83 MIN: 16.13 / MAX: 16.84 MIN: 15.75 / MAX: 16.79
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Obj Enabled Repeat Run 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 14.47 14.45 14.19 MIN: 14.33 / MAX: 14.85 MIN: 14.33 / MAX: 14.87 MIN: 14.03 / MAX: 14.6
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Repeat Run Enabled 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 6.42 6.41 6.41 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium Run Repeat Enabled 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 27.92 27.92 27.89 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Repeat Enabled Run 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 17.71 17.71 17.64 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Repeat Enabled Run 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 31.89 31.73 31.65 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast Repeat Enabled Run 15 30 45 60 75 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 65.72 65.54 65.40 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast Repeat Enabled Run 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 0.13, N = 3 SE +/- 0.21, N = 3 120.19 120.06 119.79 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p Enabled Repeat Run 0.9574 1.9148 2.8722 3.8296 4.787 SE +/- 0.010, N = 3 SE +/- 0.028, N = 3 SE +/- 0.023, N = 3 4.255 4.254 4.232 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p Enabled Repeat Run 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 37.85 37.76 37.61 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Repeat Run Enabled 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 9.29 9.27 9.26 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Enabled Run Repeat 30 60 90 120 150 SE +/- 0.28, N = 3 SE +/- 0.14, N = 3 SE +/- 0.08, N = 3 135.79 135.41 135.17 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Enabled Repeat Run 60 120 180 240 300 SE +/- 0.21, N = 3 SE +/- 0.14, N = 3 SE +/- 0.33, N = 3 266.00 262.54 262.47 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Enabled Repeat Run 40 80 120 160 200 SE +/- 1.51, N = 3 SE +/- 2.07, N = 4 SE +/- 1.87, N = 3 185.98 184.70 183.50 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Enabled Repeat Run 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.23, N = 3 SE +/- 0.55, N = 3 191.74 191.19 190.50 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Enabled Repeat Run 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 SE +/- 0.23, N = 3 157.07 156.26 155.96 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 0 Run Repeat Enabled 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 9.35 9.35 9.34 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 5 Enabled Repeat Run 8 16 24 32 40 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 32.91 32.90 32.84 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding Run Repeat Enabled 30 60 90 120 150 SE +/- 1.20, N = 3 SE +/- 0.83, N = 3 SE +/- 0.41, N = 3 119.15 119.08 118.95 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Repeat Enabled Run 4 8 12 16 20 SE +/- 0.12, N = 9 SE +/- 0.21, N = 3 SE +/- 0.13, N = 3 15.14 15.04 14.82 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p Run Enabled Repeat 15 30 45 60 75 SE +/- 0.29, N = 3 SE +/- 0.48, N = 3 SE +/- 0.17, N = 3 67.60 67.47 67.44 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
simdjson This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: Kostya Run Repeat Enabled 0.81 1.62 2.43 3.24 4.05 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.60 3.60 3.59 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: LargeRandom Run Repeat Enabled 0.2723 0.5446 0.8169 1.0892 1.3615 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.21 1.21 1.21 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: PartialTweets Repeat Run Enabled 1.1813 2.3626 3.5439 4.7252 5.9065 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 5.25 5.24 5.22 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: DistinctUserID Repeat Run Enabled 1.287 2.574 3.861 5.148 6.435 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 5.72 5.68 5.60 1. (CXX) g++ options: -O3 -pthread
ViennaCL ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY Run Enabled Repeat 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 25.3 25.1 25.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY Run Enabled Repeat 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 39.1 38.8 38.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT Run Enabled Repeat 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 41.7 41.3 41.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY Run Enabled Repeat 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 23.8 23.6 23.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY Run Enabled Repeat 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 36.6 36.3 36.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT Run Enabled Repeat 9 18 27 36 45 SE +/- 0.12, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 38.6 38.4 38.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N Enabled Repeat Run 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 46.7 46.7 46.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T Run Repeat Enabled 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.17, N = 3 48.4 48.4 48.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN Run Repeat Enabled 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 SE +/- 1.10, N = 3 47.0 46.8 45.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT Run Repeat Enabled 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.17, N = 3 SE +/- 0.13, N = 3 45.1 44.7 44.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT Run Repeat Enabled 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 SE +/- 0.45, N = 3 47.5 47.3 46.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN Run Repeat Enabled 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.00, N = 2 SE +/- 0.57, N = 3 48.7 48.7 48.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
GNU GMP GMPbench GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GMPbench Score, More Is Better GNU GMP GMPbench 6.2.1 Total Time Enabled Run Repeat 1400 2800 4200 5600 7000 6449.1 6438.8 6432.5 1. (CC) gcc options: -O3 -fomit-frame-pointer -lm
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Repeat Enabled Run 100 200 300 400 500 SE +/- 1.59, N = 3 SE +/- 1.59, N = 3 SE +/- 2.42, N = 3 457 457 456 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Run Repeat Enabled 200 400 600 800 1000 SE +/- 1.17, N = 3 SE +/- 1.88, N = 3 SE +/- 0.76, N = 3 854 853 852 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Run Repeat Enabled 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 85 84 84 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Enabled Repeat Run 4K 8K 12K 16K 20K SE +/- 14.97, N = 3 SE +/- 41.78, N = 3 SE +/- 28.64, N = 3 17685 17671 17624 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU Run Enabled Repeat 2K 4K 6K 8K 10K SE +/- 2.20, N = 3 SE +/- 21.10, N = 3 SE +/- 36.97, N = 3 7835 7801 7800 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenVKL OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmark Repeat Run Enabled 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 1.00, N = 3 197 196 194 MIN: 1 / MAX: 806 MIN: 1 / MAX: 807 MIN: 1 / MAX: 790
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume Repeat Run Enabled 5M 10M 15M 20M 25M SE +/- 283788.09, N = 3 SE +/- 120334.61, N = 3 SE +/- 44284.02, N = 3 25063015 24792840 24682690 MIN: 1580423 / MAX: 105264432 MIN: 1602573 / MAX: 99801972 MIN: 1560523 / MAX: 94703364
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkStructuredVolume Run Enabled Repeat 16M 32M 48M 64M 80M SE +/- 1052274.46, N = 3 SE +/- 576939.06, N = 15 SE +/- 357980.91, N = 3 76029329 74635152 72560377 MIN: 1954651 / MAX: 659359440 MIN: 1917397 / MAX: 657699912 MIN: 1941493 / MAX: 544635720
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkUnstructuredVolume Repeat Enabled Run 600K 1200K 1800K 2400K 3000K SE +/- 3937.22, N = 3 SE +/- 884.39, N = 3 SE +/- 2813.39, N = 3 2709587 2708582 2706127 MIN: 32743 / MAX: 9064631 MIN: 32681 / MAX: 9049793 MIN: 32688 / MAX: 9055017
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate Enabled Repeat Run 200 400 600 800 1000 SE +/- 6.00, N = 3 SE +/- 4.10, N = 3 SE +/- 1.20, N = 3 1073 1064 1041 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen Run Repeat Enabled 40 80 120 160 200 165 165 165 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced Repeat Enabled Run 50 100 150 200 250 218 218 217 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing Repeat Enabled Run 200 400 600 800 1000 SE +/- 1.33, N = 3 SE +/- 1.20, N = 3 SE +/- 3.53, N = 3 1103 1101 1089 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian Repeat Enabled Run 70 140 210 280 350 SE +/- 0.33, N = 3 310 310 309 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space Repeat Enabled Run 300 600 900 1200 1500 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 1275 1272 1256 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP Repeat Enabled Run 50 100 150 200 250 SE +/- 0.26, N = 3 SE +/- 0.44, N = 3 SE +/- 0.15, N = 3 210.97 209.65 208.62 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Iterations Per Second, More Is Better Cryptsetup PBKDF2-whirlpool Enabled Repeat Run 200K 400K 600K 800K 1000K SE +/- 1682.92, N = 3 SE +/- 805.40, N = 3 SE +/- 924.33, N = 3 856220 855283 852965
OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar Repeat Enabled Run 1.0643 2.1286 3.1929 4.2572 5.3215 SE +/- 0.002, N = 3 SE +/- 0.004, N = 3 SE +/- 0.002, N = 3 4.730 4.725 4.716
LuxCoreRender LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.3 Scene: DLSC Repeat Enabled Run 0.441 0.882 1.323 1.764 2.205 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 1.96 1.96 1.94 MIN: 1.92 / MAX: 2.01 MIN: 1.92 / MAX: 2.02 MIN: 1.89 / MAX: 1.99
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.3 Scene: Rainbow Colors and Prism Run Enabled Repeat 0.4815 0.963 1.4445 1.926 2.4075 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 2.14 2.14 2.13 MIN: 2.07 / MAX: 2.17 MIN: 2.09 / MAX: 2.16 MIN: 2.08 / MAX: 2.16
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed Enabled Repeat Run 2K 4K 6K 8K 10K SE +/- 31.07, N = 3 SE +/- 6.63, N = 3 SE +/- 31.08, N = 3 10096.9 10047.5 10026.8 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed Run Repeat Enabled 15 30 45 60 75 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 66.00 66.00 65.95 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed Enabled Run Repeat 2K 4K 6K 8K 10K SE +/- 7.93, N = 3 SE +/- 13.00, N = 3 SE +/- 5.72, N = 3 10672.3 10670.1 10655.3 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed Run Repeat Enabled 14 28 42 56 70 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 64.72 64.67 64.67 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed Repeat Enabled Run 2K 4K 6K 8K 10K SE +/- 9.28, N = 3 SE +/- 7.05, N = 3 SE +/- 4.62, N = 3 10683.2 10679.1 10675.7 1. (CC) gcc options: -O3
Zstd Compression This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed Enabled Repeat Run 70 140 210 280 350 SE +/- 2.05, N = 3 SE +/- 1.09, N = 3 SE +/- 0.86, N = 3 314.5 312.9 310.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed Run Enabled Repeat 1000 2000 3000 4000 5000 SE +/- 1.52, N = 3 SE +/- 3.90, N = 3 SE +/- 2.15, N = 3 4774.4 4763.0 4760.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed Run Repeat Enabled 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.19, N = 3 SE +/- 0.09, N = 3 33.1 33.1 33.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed Repeat Run Enabled 900 1800 2700 3600 4500 SE +/- 4.33, N = 3 SE +/- 8.51, N = 3 SE +/- 11.78, N = 3 4401.8 4394.0 4390.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed Repeat Run Enabled 300 600 900 1200 1500 SE +/- 28.36, N = 15 SE +/- 22.02, N = 15 SE +/- 14.33, N = 15 1327.1 1303.8 1302.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed Repeat Enabled Run 1000 2000 3000 4000 5000 SE +/- 5.59, N = 15 SE +/- 12.94, N = 15 SE +/- 11.09, N = 15 4875.4 4865.2 4864.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed Run Enabled Repeat 80 160 240 320 400 SE +/- 1.58, N = 3 SE +/- 3.48, N = 3 SE +/- 1.95, N = 3 389.4 386.2 382.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed Enabled Run Repeat 1100 2200 3300 4400 5500 SE +/- 6.01, N = 3 SE +/- 2.66, N = 3 SE +/- 16.73, N = 3 5071.8 5065.8 5042.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed Enabled Run Repeat 7 14 21 28 35 SE +/- 0.15, N = 3 SE +/- 0.07, N = 3 SE +/- 0.15, N = 3 31.4 31.3 31.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed Repeat Run Enabled 900 1800 2700 3600 4500 SE +/- 4.83, N = 3 SE +/- 3.52, N = 3 SE +/- 38.86, N = 3 4315.8 4308.3 4252.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 Enabled Repeat Run 700 1400 2100 2800 3500 SE +/- 17.21, N = 3 SE +/- 18.56, N = 3 SE +/- 24.43, N = 3 3323.0 3312.4 3311.3 1. (CXX) g++ options: -O3 -march=native -rdynamic
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 Run Enabled Repeat 1500 3000 4500 6000 7500 SE +/- 30.66, N = 3 SE +/- 25.44, N = 3 SE +/- 30.65, N = 3 7147.5 7133.5 7121.2 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 Run Enabled Repeat 6K 12K 18K 24K 30K SE +/- 300.18, N = 3 SE +/- 33.51, N = 3 SE +/- 336.17, N = 3 26855 26220 26165 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
LuaJIT This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better LuaJIT 2.1-git Test: Composite Run Enabled Repeat 400 800 1200 1600 2000 SE +/- 3.15, N = 3 SE +/- 1.28, N = 3 SE +/- 1.32, N = 3 1846.72 1832.95 1828.04 1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector
Botan Botan is a cross-platform open-source C++ crypto library that supports most all publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: KASUMI Repeat Run Enabled 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 105.76 105.75 105.70 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: AES-256 Run Repeat Enabled 2K 4K 6K 8K 10K SE +/- 0.38, N = 3 SE +/- 4.98, N = 3 SE +/- 25.80, N = 3 7852.89 7848.75 7826.31 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: Twofish Enabled Run Repeat 90 180 270 360 450 SE +/- 0.53, N = 3 SE +/- 0.64, N = 3 SE +/- 0.37, N = 3 427.59 426.80 426.30 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: Blowfish Run Enabled Repeat 120 240 360 480 600 SE +/- 0.32, N = 3 SE +/- 0.30, N = 3 SE +/- 0.15, N = 3 545.86 545.75 545.73 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: CAST-256 Repeat Enabled Run 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 162.52 162.51 162.50 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
LuaRadio LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Five Back to Back FIR Filters Enabled Run Repeat 300 600 900 1200 1500 SE +/- 1.82, N = 3 SE +/- 4.43, N = 3 SE +/- 8.83, N = 3 1541.1 1536.0 1528.4
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: FM Deemphasis Filter Run Enabled Repeat 120 240 360 480 600 SE +/- 0.06, N = 3 SE +/- 0.23, N = 3 SE +/- 0.10, N = 3 547.0 547.0 546.9
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Hilbert Transform Repeat Enabled Run 20 40 60 80 100 SE +/- 0.25, N = 3 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 107.2 107.2 107.1
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Complex Phase Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.32, N = 3 SE +/- 2.01, N = 3 SE +/- 1.86, N = 3 787.1 784.4 783.3
GNU Radio GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Five Back to Back FIR Filters Enabled Run Repeat 300 600 900 1200 1500 SE +/- 5.99, N = 3 SE +/- 14.23, N = 3 SE +/- 14.53, N = 3 1527.4 1506.3 1499.4 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Signal Source (Cosine) Enabled Repeat Run 700 1400 2100 2800 3500 SE +/- 2.52, N = 3 SE +/- 4.70, N = 3 SE +/- 1.51, N = 3 3244.4 3240.7 3238.8 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FIR Filter Enabled Run Repeat 160 320 480 640 800 SE +/- 0.83, N = 3 SE +/- 0.98, N = 3 SE +/- 0.71, N = 3 734.0 733.7 733.4 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: IIR Filter Enabled Run Repeat 200 400 600 800 1000 SE +/- 0.78, N = 3 SE +/- 0.58, N = 3 SE +/- 0.46, N = 3 837.7 837.4 837.4 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FM Deemphasis Filter Enabled Run Repeat 200 400 600 800 1000 SE +/- 0.57, N = 3 SE +/- 0.47, N = 3 SE +/- 2.55, N = 3 1084.4 1084.2 1082.2 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Hilbert Transform Enabled Repeat Run 130 260 390 520 650 SE +/- 1.46, N = 3 SE +/- 0.84, N = 3 SE +/- 0.59, N = 3 594.4 593.7 591.4 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Decryption Enabled Repeat Run 1200 2400 3600 4800 6000 SE +/- 1.34, N = 3 SE +/- 8.66, N = 3 SE +/- 9.91, N = 3 5436.5 5427.0 5425.7
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Encryption Enabled Run Repeat 200 400 600 800 1000 SE +/- 0.12, N = 3 SE +/- 0.25, N = 3 SE +/- 1.34, N = 3 775.2 774.9 774.0
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Decryption Enabled Repeat Run 160 320 480 640 800 SE +/- 0.06, N = 3 SE +/- 0.26, N = 3 SE +/- 0.20, N = 3 730.3 729.7 729.6
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Encryption Enabled Run Repeat 110 220 330 440 550 SE +/- 0.27, N = 3 SE +/- 0.10, N = 3 SE +/- 0.88, N = 3 488.6 487.6 487.2
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Decryption Enabled Run Repeat 110 220 330 440 550 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 SE +/- 0.38, N = 3 488.7 488.3 488.2
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Encryption Enabled Repeat Run 1000 2000 3000 4000 5000 SE +/- 2.57, N = 3 SE +/- 6.39, N = 3 SE +/- 8.27, N = 3 4832.7 4824.1 4823.2
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Decryption Enabled Run Repeat 1000 2000 3000 4000 5000 SE +/- 3.18, N = 3 SE +/- 8.27, N = 3 SE +/- 5.46, N = 3 4806.1 4800.5 4799.9
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Encryption Enabled Run Repeat 200 400 600 800 1000 SE +/- 0.20, N = 3 SE +/- 0.50, N = 2 775.4 775.1 774.9
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Encryption Enabled Run Repeat 110 220 330 440 550 SE +/- 0.26, N = 3 SE +/- 0.12, N = 3 SE +/- 0.30, N = 2 488.4 487.3 487.1
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Decryption Enabled Run Repeat 110 220 330 440 550 SE +/- 0.10, N = 2 SE +/- 0.24, N = 3 488.4 488.0 487.9
Botan Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI Enabled Repeat Run 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 105.78 105.73 105.73 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt Enabled Repeat Run 20 40 60 80 100 SE +/- 0.19, N = 3 SE +/- 0.15, N = 3 SE +/- 0.17, N = 3 103.65 103.64 103.50 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 Enabled Run Repeat 2K 4K 6K 8K 10K SE +/- 0.15, N = 3 SE +/- 0.75, N = 3 SE +/- 74.80, N = 3 7854.71 7854.05 7778.53 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt Enabled Run Repeat 2K 4K 6K 8K 10K SE +/- 0.09, N = 3 SE +/- 22.19, N = 3 SE +/- 77.84, N = 3 7840.11 7816.27 7761.24 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish Enabled Repeat Run 90 180 270 360 450 SE +/- 0.04, N = 3 SE +/- 0.25, N = 3 SE +/- 0.19, N = 3 427.84 427.37 427.28 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt Run Enabled Repeat 90 180 270 360 450 SE +/- 0.15, N = 3 SE +/- 0.22, N = 3 SE +/- 0.37, N = 3 426.12 426.10 425.88 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish Enabled Repeat Run 120 240 360 480 600 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.28, N = 3 545.45 545.42 544.75 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt Repeat Enabled Run 120 240 360 480 600 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 SE +/- 0.33, N = 3 535.87 535.85 535.21 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 Enabled Run Repeat 40 80 120 160 200 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 162.63 162.60 162.60 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt Repeat Enabled Run 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 162.33 162.32 162.31 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 Repeat Run Enabled 200 400 600 800 1000 SE +/- 0.57, N = 3 SE +/- 0.89, N = 3 SE +/- 1.45, N = 3 941.53 940.83 938.78 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt Run Repeat Enabled 200 400 600 800 1000 SE +/- 0.44, N = 3 SE +/- 1.47, N = 3 SE +/- 1.07, N = 3 936.65 936.21 935.22 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Sysbench This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory Enabled Run Repeat 5K 10K 15K 20K 25K SE +/- 74.39, N = 3 SE +/- 87.98, N = 3 SE +/- 57.62, N = 3 25461.39 25429.68 25202.79 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding Enabled Run Repeat 300 600 900 1200 1500 SE +/- 0.30, N = 3 SE +/- 0.92, N = 3 SE +/- 0.19, N = 3 1200.82 1200.48 1200.37 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding Run Enabled Repeat 400 800 1200 1600 2000 SE +/- 1.19, N = 3 SE +/- 0.63, N = 3 SE +/- 0.24, N = 3 1745.70 1745.70 1743.56 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding Repeat Enabled Run 300 600 900 1200 1500 SE +/- 5.15, N = 3 SE +/- 14.15, N = 4 SE +/- 4.97, N = 3 1246.17 1236.02 1223.27 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding Repeat Run Enabled 400 800 1200 1600 2000 SE +/- 0.00, N = 3 SE +/- 5.37, N = 3 SE +/- 4.65, N = 4 2080.12 2074.75 2072.06 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
JPEG XL The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 5 Repeat Enabled Run 20 40 60 80 100 SE +/- 0.07, N = 3 SE +/- 0.12, N = 3 SE +/- 0.11, N = 3 75.02 74.76 74.75 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 Repeat Enabled Run 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 10.11 10.10 10.09 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 5 Run Repeat Enabled 20 40 60 80 100 SE +/- 0.07, N = 3 SE +/- 0.11, N = 3 SE +/- 0.36, N = 3 74.94 74.94 74.61 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 Run Repeat Enabled 20 40 60 80 100 SE +/- 0.49, N = 3 SE +/- 0.31, N = 3 SE +/- 0.18, N = 3 75.57 74.86 74.70 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
JPEG XL Decoding The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: 1 Repeat Enabled Run 13 26 39 52 65 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.13, N = 3 56.71 56.70 56.37
Etcpak Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 Enabled Repeat Run 500 1000 1500 2000 2500 SE +/- 0.83, N = 3 SE +/- 0.55, N = 3 SE +/- 0.42, N = 3 2387.98 2387.90 2380.09 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 Repeat Enabled Run 80 160 240 320 400 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 369.33 369.31 369.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 Enabled Run Repeat 50 100 150 200 250 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 SE +/- 0.28, N = 3 209.63 209.56 209.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
rays1bench This is a test of rays1bench, a simple path-tracer / ray-tracing that supports SSE and AVX instructions, multi-threading, and other features. This test profile is measuring the performance of the "large scene" in rays1bench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org mrays/s, More Is Better rays1bench 2020-01-09 Large Scene Enabled Run Repeat 16 32 48 64 80 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 71.82 71.69 71.69
OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen Run Enabled Repeat 200 400 600 800 1000 SE +/- 7.88, N = 3 SE +/- 8.54, N = 3 SE +/- 4.04, N = 3 841 840 831 1. (CXX) g++ options: -flto -pthread
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance Repeat Run Enabled 400K 800K 1200K 1600K 2000K SE +/- 1711.59, N = 5 SE +/- 1321.50, N = 5 SE +/- 1321.50, N = 5 1726583 1724418 1723339 1. (CC) gcc options: -O3 -march=native
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time Repeat Enabled Run 5M 10M 15M 20M 25M SE +/- 77091.76, N = 3 SE +/- 188309.24, N = 3 SE +/- 240951.84, N = 3 22712289 22389342 22203685 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time Enabled Repeat Run 6M 12M 18M 24M 30M SE +/- 213446.67, N = 3 SE +/- 139588.95, N = 3 SE +/- 88725.14, N = 3 28813121 28469418 28372238 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Run Enabled Repeat 400K 800K 1200K 1600K 2000K SE +/- 10588.25, N = 3 SE +/- 13383.24, N = 3 SE +/- 12333.33, N = 3 2088333 2065333 2054667 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Selenium This test profile uses the Selenium WebDriver for running various browser benchmarks in different available web browsers such as Firefox and Google Chrome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Runs / Minute, More Is Better Selenium Benchmark: StyleBench - Browser: Firefox Repeat Run Enabled 30 60 90 120 150 SE +/- 1.81, N = 15 SE +/- 1.76, N = 15 SE +/- 2.02, N = 15 119 118 117 1. firefox 86.0
OpenBenchmarking.org Runs / Minute, More Is Better Selenium Benchmark: StyleBench - Browser: Google Chrome Run Enabled Repeat 10 20 30 40 50 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 SE +/- 0.13, N = 3 46.19 45.95 45.78 1. chrome 89.0.4389.90
OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Firefox Repeat Run Enabled 30 60 90 120 150 SE +/- 1.09, N = 3 SE +/- 1.34, N = 3 SE +/- 0.67, N = 3 137.9 136.7 134.0 1. firefox 86.0
srsLTE srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software defined (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Samples / Second, More Is Better srsLTE 20.10.1 Test: OFDM_Test Repeat Enabled Run 40M 80M 120M 160M 200M SE +/- 2512855.83, N = 3 SE +/- 260341.66, N = 3 SE +/- 1473091.99, N = 3 178033333 176933333 176800000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 Run Enabled Repeat 20M 40M 60M 80M 100M SE +/- 8504.90, N = 3 SE +/- 4910.31, N = 3 SE +/- 978008.35, N = 3 82631000 82626333 81644000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 2 - Buffer Length: 256 - Filter Length: 57 Run Enabled Repeat 30M 60M 90M 120M 150M SE +/- 33829.64, N = 3 SE +/- 1078368.11, N = 3 SE +/- 1484620.42, N = 6 157906667 156866667 155633333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 Enabled Run Repeat 70M 140M 210M 280M 350M SE +/- 158359.65, N = 3 SE +/- 968234.36, N = 3 SE +/- 817176.71, N = 3 321016667 319286667 319096667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 Enabled Repeat Run 130M 260M 390M 520M 650M SE +/- 463656.96, N = 3 SE +/- 286724.80, N = 3 SE +/- 8429373.91, N = 3 629876667 629466667 621093333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 12 - Buffer Length: 256 - Filter Length: 57 Enabled Run Repeat 140M 280M 420M 560M 700M SE +/- 2697671.84, N = 3 SE +/- 162924.66, N = 3 SE +/- 489534.93, N = 3 662150000 659116667 658753333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
PHPBench PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite Run Repeat Enabled 200K 400K 600K 800K 1000K SE +/- 3145.44, N = 3 SE +/- 1962.09, N = 3 SE +/- 4727.53, N = 3 1020071 1017890 1010404
Selenium This test profile uses the Selenium WebDriver for running various browser benchmarks in different available web browsers such as Firefox and Google Chrome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Firefox Enabled Run Repeat 20 40 60 80 100 SE +/- 0.91, N = 3 SE +/- 0.45, N = 3 SE +/- 0.71, N = 3 102.78 100.00 99.74 1. firefox 86.0
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C Run Repeat Enabled 300 600 900 1200 1500 SE +/- 10.80, N = 3 SE +/- 14.97, N = 15 SE +/- 14.97, N = 15 1581.87 1565.92 1554.98 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C Enabled Repeat Run 7K 14K 21K 28K 35K SE +/- 10.27, N = 3 SE +/- 75.77, N = 3 SE +/- 35.20, N = 3 31194.25 31174.97 30717.61 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
srsLTE srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software defined (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org UE Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Enabled Run Repeat 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.40, N = 3 SE +/- 0.09, N = 3 132.7 132.6 132.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
BRL-CAD BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric Repeat Enabled Run 30K 60K 90K 120K 150K 153258 152552 152408 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
Google SynthMark SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.45, N = 3 SE +/- 0.35, N = 3 SE +/- 2.29, N = 3 928.21 928.09 926.29 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU Enabled Repeat Run 3K 6K 9K 12K 15K SE +/- 14.74, N = 3 SE +/- 42.28, N = 3 SE +/- 23.78, N = 3 12346 12308 12279
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Repeat Run Enabled 0.2923 0.5846 0.8769 1.1692 1.4615 SE +/- 0.00320, N = 3 SE +/- 0.00312, N = 3 SE +/- 0.00186, N = 3 1.29529 1.29607 1.29902
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Enabled Run Repeat 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 15.30 15.32 15.32 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Run Repeat Enabled 2 4 6 8 10 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 SE +/- 0.005, N = 3 6.080 6.083 6.095 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
Caffe This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 Enabled Repeat Run 7K 14K 21K 28K 35K SE +/- 21.53, N = 3 SE +/- 61.74, N = 3 SE +/- 28.35, N = 3 34712 34770 34786 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 Run Repeat Enabled 20K 40K 60K 80K 100K SE +/- 62.17, N = 3 SE +/- 16.20, N = 3 SE +/- 39.14, N = 3 88718 88720 88739 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
PyBench This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times Run Enabled Repeat 150 300 450 600 750 SE +/- 1.00, N = 3 SE +/- 2.85, N = 3 SE +/- 2.08, N = 3 711 714 716
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Repeat Enabled Run 0.9097 1.8194 2.7291 3.6388 4.5485 SE +/- 0.00225, N = 3 SE +/- 0.00318, N = 3 SE +/- 0.00367, N = 3 4.03238 4.04078 4.04332 MIN: 3.91 MIN: 3.91 MIN: 3.92 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Repeat Run Enabled 3 6 9 12 15 SE +/- 0.00778, N = 3 SE +/- 0.00258, N = 3 SE +/- 0.00888, N = 3 9.49268 10.56970 10.57120 MIN: 9.43 MIN: 10.51 MIN: 10.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Enabled Run Repeat 0.1585 0.317 0.4755 0.634 0.7925 SE +/- 0.001249, N = 3 SE +/- 0.002770, N = 3 SE +/- 0.001477, N = 3 0.703196 0.704534 0.704665 MIN: 0.66 MIN: 0.66 MIN: 0.66 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Repeat Enabled Run 0.6462 1.2924 1.9386 2.5848 3.231 SE +/- 0.01085, N = 3 SE +/- 0.00634, N = 3 SE +/- 0.00257, N = 3 2.77561 2.86126 2.87200 MIN: 2.69 MIN: 2.81 MIN: 2.8 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU Repeat Run Enabled 2 4 6 8 10 SE +/- 0.00649, N = 3 SE +/- 0.00312, N = 3 SE +/- 0.00601, N = 3 8.39297 8.39591 8.39732 MIN: 8.25 MIN: 8.3 MIN: 8.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU Repeat Enabled Run 1.0461 2.0922 3.1383 4.1844 5.2305 SE +/- 0.01779, N = 3 SE +/- 0.03141, N = 3 SE +/- 0.03164, N = 3 4.38277 4.60823 4.64952 MIN: 4.04 MIN: 4.13 MIN: 4.15 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Repeat Run Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 13.86 13.93 13.95 MIN: 13.78 MIN: 13.85 MIN: 13.88 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Run Enabled Repeat 2 4 6 8 10 SE +/- 0.06939, N = 15 SE +/- 0.07203, N = 15 SE +/- 0.09537, N = 15 6.07807 6.16825 6.19314 MIN: 3.69 MIN: 3.67 MIN: 3.68 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Repeat Run Enabled 0.9426 1.8852 2.8278 3.7704 4.713 SE +/- 0.00355, N = 3 SE +/- 0.00910, N = 3 SE +/- 0.00971, N = 3 4.17427 4.17526 4.18919 MIN: 4.13 MIN: 4.13 MIN: 4.12 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Repeat Run Enabled 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 12.06 12.10 12.12 MIN: 11.93 MIN: 12.03 MIN: 12.06 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Run Enabled Repeat 0.1851 0.3702 0.5553 0.7404 0.9255 SE +/- 0.003248, N = 3 SE +/- 0.003255, N = 3 SE +/- 0.003677, N = 3 0.821323 0.822350 0.822519 MIN: 0.8 MIN: 0.8 MIN: 0.8 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Run Enabled Repeat 0.3246 0.6492 0.9738 1.2984 1.623 SE +/- 0.01056, N = 3 SE +/- 0.01018, N = 3 SE +/- 0.00726, N = 3 1.43273 1.43641 1.44288 MIN: 1.36 MIN: 1.37 MIN: 1.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Repeat Run Enabled 700 1400 2100 2800 3500 SE +/- 4.19, N = 3 SE +/- 3.51, N = 3 SE +/- 2.69, N = 3 2988.39 3066.19 3091.74 MIN: 2975.82 MIN: 3055.74 MIN: 3081.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Repeat Run Enabled 400 800 1200 1600 2000 SE +/- 1.82, N = 3 SE +/- 3.10, N = 3 SE +/- 1.39, N = 3 1760.75 1802.18 1812.22 MIN: 1753.88 MIN: 1794.84 MIN: 1806.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Repeat Run Enabled 700 1400 2100 2800 3500 SE +/- 5.44, N = 3 SE +/- 5.99, N = 3 SE +/- 1.66, N = 3 3001.60 3066.56 3092.80 MIN: 2989.75 MIN: 3053.3 MIN: 3086.05 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 15.81 15.82 15.82 MIN: 15.79 MIN: 15.79 MIN: 15.79 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 16.45 16.45 16.45 MIN: 16.25 MIN: 16.24 MIN: 16.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 16.70 16.73 16.74 MIN: 16.63 MIN: 16.63 MIN: 16.62 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Repeat Run Enabled 400 800 1200 1600 2000 SE +/- 1.53, N = 3 SE +/- 1.75, N = 3 SE +/- 2.19, N = 3 1762.42 1800.95 1814.66 MIN: 1756.24 MIN: 1793.72 MIN: 1808.15 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Repeat Enabled Run 0.7736 1.5472 2.3208 3.0944 3.868 SE +/- 0.00297, N = 3 SE +/- 0.00397, N = 3 SE +/- 0.00271, N = 3 3.41721 3.42660 3.43837 MIN: 3.34 MIN: 3.35 MIN: 3.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Repeat Run Enabled 700 1400 2100 2800 3500 SE +/- 6.46, N = 3 SE +/- 17.72, N = 3 SE +/- 1.99, N = 3 2998.59 3075.31 3092.81 MIN: 2981.81 MIN: 3049.81 MIN: 3084.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Repeat Run Enabled 400 800 1200 1600 2000 SE +/- 4.58, N = 3 SE +/- 3.82, N = 3 SE +/- 1.95, N = 3 1762.48 1802.00 1816.65 MIN: 1751.25 MIN: 1791.17 MIN: 1809.76 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Repeat Enabled Run 0.2866 0.5732 0.8598 1.1464 1.433 SE +/- 0.00115, N = 3 SE +/- 0.00109, N = 3 SE +/- 0.00054, N = 3 1.26745 1.26765 1.27367 MIN: 1.22 MIN: 1.22 MIN: 1.23 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 0.787 1.574 2.361 3.148 3.935 SE +/- 0.00395, N = 3 SE +/- 0.00650, N = 3 SE +/- 0.00666, N = 3 3.48539 3.49474 3.49756 MIN: 3.42 MIN: 3.42 MIN: 3.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
FinanceBench FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP Run Repeat Enabled 6K 12K 18K 24K 30K SE +/- 74.39, N = 3 SE +/- 46.34, N = 3 SE +/- 151.48, N = 3 26676.77 26809.90 26926.41 1. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP Run Repeat Enabled 9K 18K 27K 36K 45K SE +/- 79.47, N = 3 SE +/- 12.45, N = 3 SE +/- 58.76, N = 3 41115.48 41346.52 41370.77 1. (CXX) g++ options: -O3 -march=native -fopenmp
Mobile Neural Network MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: SqueezeNetV1.0 Enabled Run Repeat 0.8525 1.705 2.5575 3.41 4.2625 SE +/- 0.008, N = 3 SE +/- 0.018, N = 3 SE +/- 0.032, N = 3 3.751 3.753 3.789 MIN: 3.69 / MAX: 11.5 MIN: 3.69 / MAX: 4.53 MIN: 3.72 / MAX: 11.42 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: resnet-v2-50 Repeat Run Enabled 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 SE +/- 0.11, N = 3 19.70 19.79 19.79 MIN: 19.53 / MAX: 21.51 MIN: 19.59 / MAX: 21.26 MIN: 19.55 / MAX: 21.46 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: MobileNetV2_224 Enabled Repeat Run 0.448 0.896 1.344 1.792 2.24 SE +/- 0.006, N = 3 SE +/- 0.013, N = 3 SE +/- 0.004, N = 3 1.971 1.980 1.991 MIN: 1.92 / MAX: 2.79 MIN: 1.92 / MAX: 2.88 MIN: 1.94 / MAX: 2.84 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: mobilenet-v1-1.0 Enabled Repeat Run 0.4356 0.8712 1.3068 1.7424 2.178 SE +/- 0.007, N = 3 SE +/- 0.005, N = 3 SE +/- 0.012, N = 3 1.921 1.923 1.936 MIN: 1.87 / MAX: 2.71 MIN: 1.88 / MAX: 2.74 MIN: 1.88 / MAX: 3.19 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: inception-v3 Repeat Enabled Run 6 12 18 24 30 SE +/- 0.15, N = 3 SE +/- 0.23, N = 3 SE +/- 0.12, N = 3 22.90 23.06 23.19 MIN: 22.59 / MAX: 25.48 MIN: 22.7 / MAX: 25.12 MIN: 22.9 / MAX: 25.71 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet Enabled Run Repeat 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.23, N = 3 SE +/- 0.18, N = 4 15.72 15.94 16.00 MIN: 15.49 / MAX: 16.88 MIN: 15.49 / MAX: 16.78 MIN: 15.6 / MAX: 16.91 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 Enabled Run Repeat 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 SE +/- 0.13, N = 4 4.42 4.46 4.49 MIN: 4.24 / MAX: 6.41 MIN: 4.23 / MAX: 5.67 MIN: 4.23 / MAX: 5.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 Run Enabled Repeat 0.8213 1.6426 2.4639 3.2852 4.1065 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 4 3.60 3.63 3.65 MIN: 3.48 / MAX: 4.48 MIN: 3.48 / MAX: 4.52 MIN: 3.49 / MAX: 4.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 Enabled Repeat Run 0.8528 1.7056 2.5584 3.4112 4.264 SE +/- 0.01, N = 3 SE +/- 0.01, N = 4 SE +/- 0.03, N = 3 3.76 3.77 3.79 MIN: 3.7 / MAX: 4.31 MIN: 3.7 / MAX: 5.82 MIN: 3.69 / MAX: 6.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet Enabled Repeat Run 0.8078 1.6156 2.4234 3.2312 4.039 SE +/- 0.09, N = 3 SE +/- 0.11, N = 4 SE +/- 0.08, N = 3 3.50 3.53 3.59 MIN: 3.34 / MAX: 4.22 MIN: 3.34 / MAX: 4.34 MIN: 3.38 / MAX: 4.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 Enabled Repeat Run 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.05, N = 3 SE +/- 0.11, N = 4 SE +/- 0.12, N = 3 5.63 5.68 5.73 MIN: 5.52 / MAX: 6.51 MIN: 5.5 / MAX: 6.87 MIN: 5.53 / MAX: 6.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface Enabled Repeat Run 0.315 0.63 0.945 1.26 1.575 SE +/- 0.04, N = 3 SE +/- 0.03, N = 4 SE +/- 0.04, N = 3 1.35 1.35 1.40 MIN: 1.26 / MAX: 1.47 MIN: 1.27 / MAX: 1.48 MIN: 1.28 / MAX: 1.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet Repeat Enabled Run 3 6 9 12 15 SE +/- 0.34, N = 4 SE +/- 0.36, N = 3 SE +/- 0.39, N = 3 12.15 12.16 12.60 MIN: 11.72 / MAX: 13.4 MIN: 11.71 / MAX: 13.31 MIN: 11.72 / MAX: 13.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 Enabled Repeat Run 13 26 39 52 65 SE +/- 0.10, N = 3 SE +/- 0.11, N = 4 SE +/- 1.26, N = 3 56.10 56.35 57.57 MIN: 55.69 / MAX: 58.35 MIN: 55.59 / MAX: 62.94 MIN: 55.66 / MAX: 733.01 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 Enabled Repeat Run 3 6 9 12 15 SE +/- 0.32, N = 3 SE +/- 0.26, N = 4 SE +/- 0.03, N = 3 12.60 12.74 12.93 MIN: 11.81 / MAX: 13.19 MIN: 11.82 / MAX: 13.44 MIN: 12.69 / MAX: 13.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet Repeat Run Enabled 3 6 9 12 15 SE +/- 0.01, N = 4 SE +/- 0.17, N = 3 SE +/- 0.18, N = 3 11.19 11.35 11.36 MIN: 10.99 / MAX: 11.53 MIN: 10.99 / MAX: 12.87 MIN: 11 / MAX: 11.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 Enabled Repeat Run 6 12 18 24 30 SE +/- 0.50, N = 3 SE +/- 0.04, N = 4 SE +/- 0.05, N = 3 24.05 24.59 24.65 MIN: 22.83 / MAX: 32.85 MIN: 24.27 / MAX: 31.94 MIN: 24.32 / MAX: 25.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny Enabled Repeat Run 6 12 18 24 30 SE +/- 0.17, N = 3 SE +/- 0.14, N = 4 SE +/- 0.20, N = 3 23.05 23.11 23.13 MIN: 22.55 / MAX: 23.56 MIN: 22.55 / MAX: 23.74 MIN: 22.55 / MAX: 24.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd Enabled Run Repeat 4 8 12 16 20 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 4 18.10 18.16 18.18 MIN: 17.75 / MAX: 18.89 MIN: 17.79 / MAX: 26.2 MIN: 17.83 / MAX: 18.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m Repeat Enabled Run 3 6 9 12 15 SE +/- 0.07, N = 4 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 11.04 11.06 11.09 MIN: 10.75 / MAX: 12.06 MIN: 10.79 / MAX: 12.12 MIN: 10.88 / MAX: 12.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
TNN TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 Enabled Repeat Run 60 120 180 240 300 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 263.34 263.47 263.60 MIN: 261.29 / MAX: 268.2 MIN: 261.32 / MAX: 269.11 MIN: 261.99 / MAX: 268.68 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 Enabled Repeat Run 60 120 180 240 300 SE +/- 0.05, N = 3 SE +/- 0.36, N = 3 SE +/- 0.68, N = 3 264.38 264.77 265.45 MIN: 262.68 / MAX: 267.22 MIN: 263.03 / MAX: 266.49 MIN: 263.15 / MAX: 269.59 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Selenium This test profile uses the Selenium WebDriver for running various browser benchmarks in different available web browsers such as Firefox and Google Chrome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: ARES-6 - Browser: Firefox Run Repeat Enabled 9 18 27 36 45 SE +/- 0.06, N = 3 SE +/- 0.14, N = 3 SE +/- 0.20, N = 3 37.86 38.09 38.12 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Firefox Repeat Enabled Run 200 400 600 800 1000 SE +/- 0.73, N = 3 SE +/- 0.82, N = 3 SE +/- 1.71, N = 3 842.2 844.1 844.6 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Google Chrome Repeat Run Enabled 130 260 390 520 650 SE +/- 0.93, N = 3 SE +/- 1.66, N = 3 SE +/- 1.50, N = 3 610.4 610.7 616.6 1. chrome 89.0.4389.90
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Firefox Repeat Enabled Run 6 12 18 24 30 SE +/- 0.07, N = 3 SE +/- 0.12, N = 3 SE +/- 0.34, N = 3 25.0 25.1 25.4 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Firefox Enabled Repeat Run 70 140 210 280 350 SE +/- 3.30, N = 3 SE +/- 3.18, N = 3 SE +/- 3.27, N = 3 337.8 337.9 337.9 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome Enabled Repeat Run 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.35, N = 3 SE +/- 0.16, N = 3 26.57 26.65 26.95 1. chrome 89.0.4389.90
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome Repeat Enabled Run 60 120 180 240 300 SE +/- 0.19, N = 3 SE +/- 0.22, N = 3 SE +/- 0.15, N = 3 280.30 280.38 280.72 1. chrome 89.0.4389.90
Selenium This test profile uses the Selenium WebDriver for running various browser benchmarks in different available web browsers such as Firefox and Google Chrome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Firefox Repeat Enabled Run 600 1200 1800 2400 3000 SE +/- 3.33, N = 3 SE +/- 10.17, N = 3 SE +/- 9.87, N = 3 2776 2779 2788 1. firefox 86.0
OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome Enabled Repeat Run 600 1200 1800 2400 3000 SE +/- 6.24, N = 3 SE +/- 2.31, N = 3 2813 2823 2825 1. chrome 89.0.4389.90
WireGuard + Linux Networking Stack Stress Test This is a benchmark of the WireGuard secure VPN tunnel and Linux networking stack stress test. The test runs on the local host but does require root permissions to run. The way it works is it creates three namespaces. ns0 has a loopback device. ns1 and ns2 each have wireguard devices. Those two wireguard devices send traffic through the loopback device of ns0. The end result of this is that tests wind up testing encryption and decryption at the same time -- a pretty CPU and scheduler-heavy workflow. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WireGuard + Linux Networking Stack Stress Test Repeat Enabled Run 30 60 90 120 150 SE +/- 0.50, N = 3 SE +/- 0.56, N = 3 SE +/- 1.20, N = 3 129.89 130.03 130.53
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics Repeat Enabled Run 40 80 120 160 200 SE +/- 0.22, N = 3 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 159.40 159.78 166.03 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Enabled Repeat Run 50 100 150 200 250 SE +/- 1.22, N = 3 SE +/- 1.60, N = 3 SE +/- 2.75, N = 3 208.70 208.79 209.92 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Repeat Enabled Run 16 32 48 64 80 SE +/- 0.33, N = 3 SE +/- 0.41, N = 14 SE +/- 1.89, N = 14 70.25 70.42 72.24 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Run Enabled Repeat 20 40 60 80 100 SE +/- 0.20, N = 3 SE +/- 0.42, N = 3 SE +/- 0.64, N = 3 103.49 103.84 104.96 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver Enabled Repeat Run 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 21.08 21.10 21.30 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Enabled Repeat Run 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 17.00 17.01 17.02 1. (CXX) g++ options: -O2 -lOpenCL
Dolfyn Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics Run Repeat Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 15.03 15.05 15.06
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Enabled Run Repeat 13 26 39 52 65 SE +/- 0.41, N = 3 SE +/- 0.74, N = 3 SE +/- 0.47, N = 3 57.25 57.33 57.64 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction Repeat Run Enabled 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.29, N = 3 33.69 33.77 34.13 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Repeat Run Enabled 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 118.12 118.35 118.39 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
Monte Carlo Simulations of Ionised Nebulae Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 Repeat Run Enabled 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 173 173 174 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lrt -lz
OpenFOAM OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M Run Repeat Enabled 50 100 150 200 250 SE +/- 0.22, N = 3 SE +/- 0.44, N = 3 SE +/- 0.17, N = 3 227.09 227.61 228.01 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
Java Gradle Build This test runs Java software project builds using the Gradle build system. It is intended to give developers an idea as to the build performance for development activities and build servers. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Java Gradle Build Gradle Build: Reactor Repeat Enabled Run 40 80 120 160 200 SE +/- 1.79, N = 12 SE +/- 2.16, N = 12 SE +/- 1.63, N = 12 171.07 173.55 174.16
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 Enabled Run Repeat 7 14 21 28 35 SE +/- 0.19, N = 3 SE +/- 0.06, N = 3 SE +/- 0.14, N = 3 30.97 31.05 31.10 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 Run Enabled Repeat 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 11.03 11.05 11.06 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Enabled Repeat Run 0.6467 1.2934 1.9401 2.5868 3.2335 SE +/- 0.003, N = 3 SE +/- 0.007, N = 3 SE +/- 0.008, N = 3 2.855 2.867 2.874 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Enabled Repeat Run 12 24 36 48 60 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 SE +/- 0.03, N = 3 51.51 51.57 52.21 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Repeat Run Enabled 1.1502 2.3004 3.4506 4.6008 5.751 SE +/- 0.002, N = 3 SE +/- 0.005, N = 3 SE +/- 0.017, N = 3 5.085 5.092 5.112 1. (CXX) g++ options: -O3 -fPIC -lm
Timed Mesa Compilation This test profile times how long it takes to compile Mesa with Meson/Ninja. For minimizing build dependencies and avoid versioning conflicts, test this is just the core Mesa build without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile Repeat Run Enabled 10 20 30 40 50 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 45.87 45.94 46.06
Build2 This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Repeat Run Enabled 30 60 90 120 150 SE +/- 0.19, N = 3 SE +/- 0.34, N = 3 SE +/- 0.35, N = 3 121.75 122.58 122.76
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Run Repeat Enabled 15 30 45 60 75 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 67.12 67.13 67.13 1. (CC) gcc options: -lm -lpthread -O3
POV-Ray This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Enabled Run Repeat 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 38.19 38.30 38.35 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.4 1e12 Prime Number Generation Repeat Enabled Run 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 18.03 18.04 18.07 1. (CXX) g++ options: -O3 -lpthread
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples Enabled Repeat Run 2 4 6 8 10 SE +/- 0.006, N = 3 SE +/- 0.005, N = 3 SE +/- 0.001, N = 3 8.880 8.883 8.883 1. (CXX) g++ options: -fopenmp -O3
YafaRay YafaRay is an open-source physically based montecarlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene Repeat Run Enabled 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 0.46, N = 3 SE +/- 0.08, N = 3 121.24 121.33 121.45 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
Timed Wasmer Compilation This test times how long it takes to compile Wasmer. Wasmer is written in the Rust programming language and is a WebAssembly runtime implementation that supports WASI and EmScripten. This test profile builds Wasmer with the Cranelift and Singlepast compiler features enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile Enabled Run Repeat 12 24 36 48 60 SE +/- 0.35, N = 3 SE +/- 0.26, N = 3 SE +/- 0.27, N = 3 52.85 52.88 53.06 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 Enabled Repeat Run 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 29.80 29.81 29.86 1. (CC) gcc options: -pthread -fvisibility=hidden -O2
Cython Benchmark Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Cython Benchmark 0.29.21 Test: N-Queens Run Enabled Repeat 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 20.35 20.37 20.39
DeepSpeech Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better DeepSpeech 0.6 Acceleration: CPU Repeat Run Enabled 14 28 42 56 70 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 59.56 60.30 60.45
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 Enabled Run Repeat 2 4 6 8 10 SE +/- 0.004, N = 3 SE +/- 0.005, N = 3 SE +/- 0.014, N = 3 6.521 6.526 6.532 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm
Opus Codec Encoding Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode Repeat Enabled Run 2 4 6 8 10 SE +/- 0.011, N = 5 SE +/- 0.011, N = 5 SE +/- 0.010, N = 5 7.222 7.230 7.232 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
Ngspice Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 Repeat Run Enabled 20 40 60 80 100 SE +/- 0.69, N = 3 SE +/- 0.30, N = 3 SE +/- 0.32, N = 3 107.26 108.10 108.73 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 Repeat Run Enabled 20 40 60 80 100 SE +/- 0.29, N = 3 SE +/- 0.23, N = 3 SE +/- 0.57, N = 3 86.23 86.39 87.39 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
OpenBenchmarking.org Seconds, Fewer Is Better Perl Benchmarks Test: Interpreter Repeat Enabled Run 0.0002 0.0004 0.0006 0.0008 0.001 SE +/- 0.00000063, N = 3 SE +/- 0.00000726, N = 3 SE +/- 0.00000526, N = 3 0.00067438 0.00067785 0.00068233
RNNoise RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 Enabled Run Repeat 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 20.89 20.89 20.90 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Tachyon This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time Run Enabled Repeat 20 40 60 80 100 SE +/- 0.43, N = 15 SE +/- 0.54, N = 3 SE +/- 0.53, N = 15 74.02 74.88 75.02 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default Enabled Repeat Run 0.8021 1.6042 2.4063 3.2084 4.0105 SE +/- 0.015, N = 3 SE +/- 0.012, N = 3 SE +/- 0.004, N = 3 3.533 3.545 3.565 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 Run Repeat Enabled 40 80 120 160 200 SE +/- 0.18, N = 3 SE +/- 0.25, N = 3 SE +/- 0.34, N = 3 190.33 190.53 191.35 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 Repeat Enabled Run 3 6 9 12 15 SE +/- 0.008, N = 3 SE +/- 0.004, N = 3 SE +/- 0.016, N = 3 9.603 9.618 9.625 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Medium Enabled Run Repeat 1.1336 2.2672 3.4008 4.5344 5.668 SE +/- 0.0020, N = 3 SE +/- 0.0044, N = 3 SE +/- 0.0024, N = 3 5.0307 5.0345 5.0381 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Thorough Repeat Enabled Run 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 11.67 11.67 11.68 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive Enabled Repeat Run 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 87.81 87.85 87.86 1. (CXX) g++ options: -O3 -flto -pthread
Basis Universal Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S Run Repeat Enabled 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 21.36 21.39 21.45 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 Run Repeat Enabled 2 4 6 8 10 SE +/- 0.002, N = 3 SE +/- 0.005, N = 3 SE +/- 0.004, N = 3 6.248 6.250 6.257 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 Repeat Enabled Run 6 12 18 24 30 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 27.60 27.61 27.61 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 Repeat Run Enabled 12 24 36 48 60 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 51.54 51.55 51.56 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Darktable Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Boat - Acceleration: CPU-only Run Enabled Repeat 0.968 1.936 2.904 3.872 4.84 SE +/- 0.016, N = 3 SE +/- 0.021, N = 3 SE +/- 0.022, N = 3 4.291 4.300 4.302
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Masskrug - Acceleration: CPU-only Run Repeat Enabled 1.0289 2.0578 3.0867 4.1156 5.1445 SE +/- 0.003, N = 3 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 4.558 4.571 4.573
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Server Rack - Acceleration: CPU-only Repeat Enabled Run 0.0371 0.0742 0.1113 0.1484 0.1855 SE +/- 0.000, N = 3 SE +/- 0.002, N = 15 SE +/- 0.001, N = 3 0.163 0.165 0.165
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Server Room - Acceleration: CPU-only Repeat Enabled Run 0.7814 1.5628 2.3442 3.1256 3.907 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.007, N = 3 3.458 3.468 3.473
GEGL GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Crop Enabled Run Repeat 2 4 6 8 10 SE +/- 0.024, N = 3 SE +/- 0.023, N = 3 SE +/- 0.023, N = 3 6.479 6.482 6.485
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Scale Enabled Run Repeat 1.0634 2.1268 3.1902 4.2536 5.317 SE +/- 0.017, N = 3 SE +/- 0.008, N = 3 SE +/- 0.013, N = 3 4.711 4.723 4.726
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Color Enhance Repeat Enabled Run 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 42.87 42.89 42.93
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Rotate 90 Degrees Repeat Enabled Run 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 29.51 29.54 29.56
GIMP GIMP is an open-source image manipulaton program. This test profile will use the system-provided GIMP program otherwise on Windows relys upon a pre-packaged Windows binary from upstream GIMP.org. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: resize Enabled Repeat Run 2 4 6 8 10 SE +/- 0.048, N = 3 SE +/- 0.024, N = 3 SE +/- 0.023, N = 3 6.212 6.213 6.222
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: rotate Enabled Repeat Run 3 6 9 12 15 SE +/- 0.017, N = 3 SE +/- 0.007, N = 3 SE +/- 0.008, N = 3 9.027 9.036 9.044
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: auto-levels Repeat Enabled Run 3 6 9 12 15 SE +/- 0.021, N = 3 SE +/- 0.005, N = 3 SE +/- 0.035, N = 3 9.304 9.323 9.334
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: unsharp-mask Repeat Run Enabled 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 11.17 11.20 11.21
Hugin Hugin is an open-source, cross-platform panorama photo stitcher software package. This test profile times how long it takes to run the assistant and panorama photo stitching on a set of images. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Hugin Panorama Photo Assistant + Stitching Time Repeat Run Enabled 9 18 27 36 45 SE +/- 0.24, N = 3 SE +/- 0.18, N = 3 SE +/- 0.08, N = 3 38.34 38.67 38.73
OCRMyPDF OCRMyPDF is an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs with the text now selectable/searchable/copy-paste capable. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OCRMyPDF 10.3.1+dfsg Processing 60 Page PDF Document Repeat Enabled Run 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 15.49 15.51 15.56
OpenSCAD OpenSCAD is a programmer-focused solid 3D CAD modeller. OpenSCAD is free software and allows creating 3D CAD objects in a script-based modelling environment. This test profile will use the system-provided OpenSCAD program otherwise and time how long it takes tn render different SCAD assets to PNG output. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Pistol Enabled Repeat Run 20 40 60 80 100 SE +/- 0.08, N = 3 SE +/- 0.24, N = 3 SE +/- 0.05, N = 3 77.85 78.07 78.21 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Retro Car Run Repeat Enabled 0.824 1.648 2.472 3.296 4.12 SE +/- 0.002, N = 3 SE +/- 0.005, N = 3 SE +/- 0.006, N = 3 3.656 3.660 3.662 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Mini-ITX Case Repeat Enabled Run 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 35.14 35.22 35.30 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Projector Mount Swivel Run Enabled Repeat 2 4 6 8 10 SE +/- 0.008, N = 3 SE +/- 0.005, N = 3 SE +/- 0.042, N = 3 6.925 6.934 6.949 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Leonardo Phone Case Slim Repeat Run Enabled 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 14.20 14.24 14.32 1. OpenSCAD version 2021.01
librsvg RSVG/librsvg is an SVG vector graphics library. This test profile times how long it takes to complete various operations by rsvg-convert. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better librsvg Operation: SVG Files To PNG Repeat Enabled Run 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 16.30 16.40 16.46 1. rsvg-convert version 2.50.3
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only Enabled Repeat Run 30 60 90 120 150 SE +/- 0.22, N = 3 SE +/- 0.19, N = 3 SE +/- 0.11, N = 3 132.84 132.87 132.99
Git This test measures the time needed to carry out some sample Git operations on an example, static repository that happens to be a copy of the GNOME GTK tool-kit repository. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Git Time To Complete Common Git Commands Repeat Enabled Run 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 40.88 40.97 40.99 1. git version 2.30.2
Tesseract OCR Tesseract-OCR is the open-source optical character recognition (OCR) engine for the conversion of text within images to raw text output. This test profile relies upon a system-supplied Tesseract installation. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tesseract OCR 4.1.1 Time To OCR 7 Images Run Enabled Repeat 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 18.46 18.50 18.51
Enabled Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: DEBUGINFOD_URLS=Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x39 - Thermald 2.4.3Java Notes: OpenJDK Runtime Environment (build 11.0.11-ea+4-Ubuntu-0ubuntu2)Python Notes: Python 3.9.2Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 26 March 2021 14:03 by user ronix.
Repeat Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: DEBUGINFOD_URLS=Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x39 - Thermald 2.4.3Java Notes: OpenJDK Runtime Environment (build 11.0.11-ea+4-Ubuntu-0ubuntu2)Python Notes: Python 3.9.2Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 27 March 2021 10:46 by user ronix.
Run Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0610 BIOS), Chipset: Intel Tiger Lake-H, Memory: 32GB, Disk: 1000GB Western Digital WD_BLACK SN850 1TB + 2000GB, Graphics: AMD Radeon RX 6800/6800 XT / 6900 16GB (2575/1000MHz), Audio: Intel Tiger Lake-H HD Audio, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Device 2725
OS: Ubuntu 21.04, Kernel: 5.12.0-051200rc3daily20210315-generic (x86_64) 20210314, Desktop: GNOME Shell 3.38.3, Display Server: X Server 1.20.10 + Wayland, OpenGL: 4.6 Mesa 21.1.0-devel (git-616720d 2021-03-16 hirsute-oibaf-ppa) (LLVM 12.0.0), Compiler: GCC 10.2.1 20210312, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: DEBUGINFOD_URLS=Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x39 - Thermald 2.4.3Java Notes: OpenJDK Runtime Environment (build 11.0.11-ea+4-Ubuntu-0ubuntu2)Python Notes: Python 3.9.2Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 28 March 2021 07:45 by user ronix.