Benchmarks for a future article.
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 Run Repeat Enabled 60M 120M 180M 240M 300M SE +/- 73645.19, N = 3 SE +/- 57627.28, N = 3 SE +/- 43622.17, N = 3 259329700 259622700 259341600 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Realtime Run Repeat Enabled 7 14 21 28 35 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 29.14 29.13 29.19 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Two-Pass Run Repeat Enabled 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 22.77 22.89 22.80 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 8 Realtime Run Repeat Enabled 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 113.06 113.96 114.16 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K Run Repeat Enabled 0.036 0.072 0.108 0.144 0.18 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.16 0.16 0.16 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K Run Repeat Enabled 0.891 1.782 2.673 3.564 4.455 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.94 3.96 3.96 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K Run Repeat Enabled 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 16.41 16.54 16.56 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K Run Repeat Enabled 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 7.56 7.56 7.59 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K Run Repeat Enabled 10 20 30 40 50 SE +/- 0.43, N = 15 SE +/- 0.21, N = 3 SE +/- 0.06, N = 3 40.83 43.24 42.80 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K Run Repeat Enabled 12 24 36 48 60 SE +/- 0.03, N = 3 SE +/- 0.63, N = 15 SE +/- 0.39, N = 14 51.88 51.82 52.82 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p Run Repeat Enabled 0.1125 0.225 0.3375 0.45 0.5625 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.50 0.50 0.50 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p Run Repeat Enabled 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 7.80 7.82 7.81 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p Run Repeat Enabled 7 14 21 28 35 SE +/- 0.21, N = 3 SE +/- 0.10, N = 3 SE +/- 0.04, N = 3 30.84 31.05 31.11 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p Run Repeat Enabled 6 12 18 24 30 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 23.12 23.20 23.17 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p Run Repeat Enabled 30 60 90 120 150 SE +/- 0.15, N = 3 SE +/- 1.83, N = 3 SE +/- 1.42, N = 13 138.16 137.23 135.81 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p Run Repeat Enabled 30 60 90 120 150 SE +/- 1.88, N = 4 SE +/- 1.46, N = 15 SE +/- 2.05, N = 3 152.03 156.45 152.40 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP Run Repeat Enabled 50 100 150 200 250 SE +/- 0.15, N = 3 SE +/- 0.26, N = 3 SE +/- 0.44, N = 3 208.62 210.97 209.65 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding Run Repeat Enabled 300 600 900 1200 1500 SE +/- 0.92, N = 3 SE +/- 0.19, N = 3 SE +/- 0.30, N = 3 1200.48 1200.37 1200.82 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding Run Repeat Enabled 400 800 1200 1600 2000 SE +/- 1.19, N = 3 SE +/- 0.24, N = 3 SE +/- 0.63, N = 3 1745.70 1743.56 1745.70 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding Run Repeat Enabled 300 600 900 1200 1500 SE +/- 4.97, N = 3 SE +/- 5.15, N = 3 SE +/- 14.15, N = 4 1223.27 1246.17 1236.02 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding Run Repeat Enabled 400 800 1200 1600 2000 SE +/- 5.37, N = 3 SE +/- 0.00, N = 3 SE +/- 4.65, N = 4 2074.75 2080.12 2072.06 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Medium Run Repeat Enabled 1.1336 2.2672 3.4008 4.5344 5.668 SE +/- 0.0044, N = 3 SE +/- 0.0024, N = 3 SE +/- 0.0020, N = 3 5.0345 5.0381 5.0307 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Thorough Run Repeat Enabled 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 11.68 11.67 11.67 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive Run Repeat Enabled 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 87.86 87.85 87.81 1. (CXX) g++ options: -O3 -flto -pthread
Basis Universal Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S Run Repeat Enabled 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 21.36 21.39 21.45 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 Run Repeat Enabled 2 4 6 8 10 SE +/- 0.002, N = 3 SE +/- 0.005, N = 3 SE +/- 0.004, N = 3 6.248 6.250 6.257 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 Run Repeat Enabled 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 27.61 27.60 27.61 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 Run Repeat Enabled 12 24 36 48 60 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 51.55 51.54 51.56 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only Run Repeat Enabled 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 SE +/- 0.22, N = 3 132.99 132.87 132.84
Botan Botan is a cross-platform open-source C++ crypto library that supports most all publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: KASUMI Run Repeat Enabled 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 105.75 105.76 105.70 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: AES-256 Run Repeat Enabled 2K 4K 6K 8K 10K SE +/- 0.38, N = 3 SE +/- 4.98, N = 3 SE +/- 25.80, N = 3 7852.89 7848.75 7826.31 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: Twofish Run Repeat Enabled 90 180 270 360 450 SE +/- 0.64, N = 3 SE +/- 0.37, N = 3 SE +/- 0.53, N = 3 426.80 426.30 427.59 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: Blowfish Run Repeat Enabled 120 240 360 480 600 SE +/- 0.32, N = 3 SE +/- 0.15, N = 3 SE +/- 0.30, N = 3 545.86 545.73 545.75 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: CAST-256 Run Repeat Enabled 40 80 120 160 200 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 162.50 162.52 162.51 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI Run Repeat Enabled 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 105.73 105.73 105.78 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt Run Repeat Enabled 20 40 60 80 100 SE +/- 0.17, N = 3 SE +/- 0.15, N = 3 SE +/- 0.19, N = 3 103.50 103.64 103.65 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 Run Repeat Enabled 2K 4K 6K 8K 10K SE +/- 0.75, N = 3 SE +/- 74.80, N = 3 SE +/- 0.15, N = 3 7854.05 7778.53 7854.71 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt Run Repeat Enabled 2K 4K 6K 8K 10K SE +/- 22.19, N = 3 SE +/- 77.84, N = 3 SE +/- 0.09, N = 3 7816.27 7761.24 7840.11 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish Run Repeat Enabled 90 180 270 360 450 SE +/- 0.19, N = 3 SE +/- 0.25, N = 3 SE +/- 0.04, N = 3 427.28 427.37 427.84 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt Run Repeat Enabled 90 180 270 360 450 SE +/- 0.15, N = 3 SE +/- 0.37, N = 3 SE +/- 0.22, N = 3 426.12 425.88 426.10 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish Run Repeat Enabled 120 240 360 480 600 SE +/- 0.28, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 544.75 545.42 545.45 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt Run Repeat Enabled 120 240 360 480 600 SE +/- 0.33, N = 3 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 535.21 535.87 535.85 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 Run Repeat Enabled 40 80 120 160 200 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 162.60 162.60 162.63 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt Run Repeat Enabled 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 162.31 162.33 162.32 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 Run Repeat Enabled 200 400 600 800 1000 SE +/- 0.89, N = 3 SE +/- 0.57, N = 3 SE +/- 1.45, N = 3 940.83 941.53 938.78 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt Run Repeat Enabled 200 400 600 800 1000 SE +/- 0.44, N = 3 SE +/- 1.47, N = 3 SE +/- 1.07, N = 3 936.65 936.21 935.22 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
BRL-CAD BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric Run Repeat Enabled 30K 60K 90K 120K 150K 152408 153258 152552 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
Build2 This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Run Repeat Enabled 30 60 90 120 150 SE +/- 0.34, N = 3 SE +/- 0.19, N = 3 SE +/- 0.35, N = 3 122.58 121.75 122.76
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Run Repeat Enabled 15 30 45 60 75 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 67.12 67.13 67.13 1. (CC) gcc options: -lm -lpthread -O3
Caffe This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 Run Repeat Enabled 7K 14K 21K 28K 35K SE +/- 28.35, N = 3 SE +/- 61.74, N = 3 SE +/- 21.53, N = 3 34786 34770 34712 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 Run Repeat Enabled 20K 40K 60K 80K 100K SE +/- 62.17, N = 3 SE +/- 16.20, N = 3 SE +/- 39.14, N = 3 88718 88720 88739 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU Run Repeat Enabled 3K 6K 9K 12K 15K SE +/- 23.78, N = 3 SE +/- 42.28, N = 3 SE +/- 14.74, N = 3 12279 12308 12346
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics Run Repeat Enabled 40 80 120 160 200 SE +/- 0.07, N = 3 SE +/- 0.22, N = 3 SE +/- 0.09, N = 3 166.03 159.40 159.78 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OpenBenchmarking.org Iterations Per Second, More Is Better Cryptsetup PBKDF2-whirlpool Run Repeat Enabled 200K 400K 600K 800K 1000K SE +/- 924.33, N = 3 SE +/- 805.40, N = 3 SE +/- 1384.00, N = 3 852965 855283 852505
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Encryption Run Repeat Enabled 1200 2400 3600 4800 6000 SE +/- 7.67, N = 3 SE +/- 10.24, N = 3 SE +/- 13.50, N = 3 5430.1 5425.9 5430.2
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Decryption Run Repeat Enabled 1200 2400 3600 4800 6000 SE +/- 9.91, N = 3 SE +/- 8.66, N = 3 SE +/- 15.52, N = 3 5425.7 5427.0 5430.1
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Encryption Run Repeat Enabled 200 400 600 800 1000 SE +/- 0.25, N = 3 SE +/- 1.34, N = 3 SE +/- 1.04, N = 3 774.9 774.0 773.1
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Decryption Run Repeat Enabled 160 320 480 640 800 SE +/- 0.20, N = 3 SE +/- 0.26, N = 3 SE +/- 0.35, N = 3 729.6 729.7 729.5
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Encryption Run Repeat Enabled 110 220 330 440 550 SE +/- 0.10, N = 3 SE +/- 0.88, N = 3 SE +/- 0.47, N = 3 487.6 487.2 487.4
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Decryption Run Repeat Enabled 110 220 330 440 550 SE +/- 0.12, N = 3 SE +/- 0.38, N = 3 SE +/- 0.00, N = 2 488.3 488.2 488.4
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Encryption Run Repeat Enabled 1000 2000 3000 4000 5000 SE +/- 8.27, N = 3 SE +/- 6.39, N = 3 SE +/- 3.80, N = 3 4823.2 4824.1 4819.2
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Decryption Run Repeat Enabled 1000 2000 3000 4000 5000 SE +/- 8.27, N = 3 SE +/- 5.46, N = 3 SE +/- 4.47, N = 3 4800.5 4799.9 4793.8
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Encryption Run Repeat Enabled 200 400 600 800 1000 SE +/- 0.50, N = 2 SE +/- 0.49, N = 3 775.1 774.9 774.4
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Encryption Run Repeat Enabled 110 220 330 440 550 SE +/- 0.12, N = 3 SE +/- 0.30, N = 2 SE +/- 0.41, N = 3 487.3 487.1 488.0
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Decryption Run Repeat Enabled 110 220 330 440 550 SE +/- 0.24, N = 3 SE +/- 0.27, N = 3 488.0 487.9 487.9
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Decryption Run Repeat Enabled 160 320 480 640 800 SE +/- 0.27, N = 3 SE +/- 0.25, N = 2 729.5 729.6 729.3
Cython Benchmark Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Cython Benchmark 0.29.21 Test: N-Queens Run Repeat Enabled 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 20.35 20.39 20.37
Darktable Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Boat - Acceleration: CPU-only Run Repeat Enabled 0.968 1.936 2.904 3.872 4.84 SE +/- 0.016, N = 3 SE +/- 0.022, N = 3 SE +/- 0.021, N = 3 4.291 4.302 4.300
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Masskrug - Acceleration: CPU-only Run Repeat Enabled 1.0289 2.0578 3.0867 4.1156 5.1445 SE +/- 0.003, N = 3 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 4.558 4.571 4.573
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Server Rack - Acceleration: CPU-only Run Repeat Enabled 0.0371 0.0742 0.1113 0.1484 0.1855 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.002, N = 15 0.165 0.163 0.165
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Server Room - Acceleration: CPU-only Run Repeat Enabled 0.7814 1.5628 2.3442 3.1256 3.907 SE +/- 0.007, N = 3 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 3.473 3.458 3.468
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 4K Run Repeat Enabled 40 80 120 160 200 SE +/- 0.27, N = 3 SE +/- 0.27, N = 3 SE +/- 0.18, N = 3 189.50 189.52 189.80 MIN: 177.16 / MAX: 202.89 MIN: 175.59 / MAX: 203 MIN: 176.26 / MAX: 202.54 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 1080p Run Repeat Enabled 150 300 450 600 750 SE +/- 1.92, N = 3 SE +/- 0.91, N = 3 SE +/- 0.85, N = 3 709.28 708.63 707.73 MIN: 632.86 / MAX: 774.84 MIN: 636.52 / MAX: 768.39 MIN: 632.84 / MAX: 768.08 1. (CC) gcc options: -pthread
DeepSpeech Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better DeepSpeech 0.6 Acceleration: CPU Run Repeat Enabled 14 28 42 56 70 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 60.30 59.56 60.45
Dolfyn Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics Run Repeat Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 15.03 15.05 15.06
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Crown Run Repeat Enabled 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.13, N = 3 SE +/- 0.04, N = 3 14.64 14.64 14.81 MIN: 14.37 / MAX: 15.3 MIN: 13.95 / MAX: 15.33 MIN: 14.58 / MAX: 15.49
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Run Repeat Enabled 4 8 12 16 20 SE +/- 0.15, N = 3 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 13.92 13.86 13.95 MIN: 13.63 / MAX: 14.38 MIN: 13.61 / MAX: 14.36 MIN: 13.68 / MAX: 14.36
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Obj Run Repeat Enabled 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 12.50 12.79 12.80 MIN: 12.24 / MAX: 12.8 MIN: 12.68 / MAX: 13.03 MIN: 12.73 / MAX: 13.04
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Run Repeat Enabled 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 16.38 16.30 16.39 MIN: 16.13 / MAX: 16.84 MIN: 15.75 / MAX: 16.79 MIN: 16.23 / MAX: 16.83
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Obj Run Repeat Enabled 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 14.19 14.45 14.47 MIN: 14.03 / MAX: 14.6 MIN: 14.33 / MAX: 14.87 MIN: 14.33 / MAX: 14.85
Etcpak Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 Run Repeat Enabled 500 1000 1500 2000 2500 SE +/- 0.42, N = 3 SE +/- 0.55, N = 3 SE +/- 0.83, N = 3 2380.09 2387.90 2387.98 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 Run Repeat Enabled 80 160 240 320 400 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 369.26 369.33 369.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 Run Repeat Enabled 50 100 150 200 250 SE +/- 0.07, N = 3 SE +/- 0.28, N = 3 SE +/- 0.07, N = 3 209.56 209.36 209.63 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 Run Repeat Enabled 1500 3000 4500 6000 7500 SE +/- 30.66, N = 3 SE +/- 30.65, N = 3 SE +/- 25.44, N = 3 7147.5 7121.2 7133.5 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 Run Repeat Enabled 6K 12K 18K 24K 30K SE +/- 300.18, N = 3 SE +/- 336.17, N = 3 SE +/- 33.51, N = 3 26855 26165 26220 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FinanceBench FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP Run Repeat Enabled 6K 12K 18K 24K 30K SE +/- 74.39, N = 3 SE +/- 46.34, N = 3 SE +/- 151.48, N = 3 26676.77 26809.90 26926.41 1. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP Run Repeat Enabled 9K 18K 27K 36K 45K SE +/- 79.47, N = 3 SE +/- 12.45, N = 3 SE +/- 58.76, N = 3 41115.48 41346.52 41370.77 1. (CXX) g++ options: -O3 -march=native -fopenmp
GEGL GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Crop Run Repeat Enabled 2 4 6 8 10 SE +/- 0.023, N = 3 SE +/- 0.023, N = 3 SE +/- 0.024, N = 3 6.482 6.485 6.479
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Scale Run Repeat Enabled 1.0634 2.1268 3.1902 4.2536 5.317 SE +/- 0.008, N = 3 SE +/- 0.013, N = 3 SE +/- 0.017, N = 3 4.723 4.726 4.711
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Color Enhance Run Repeat Enabled 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 42.93 42.87 42.89
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Rotate 90 Degrees Run Repeat Enabled 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 29.56 29.51 29.54
GIMP GIMP is an open-source image manipulaton program. This test profile will use the system-provided GIMP program otherwise on Windows relys upon a pre-packaged Windows binary from upstream GIMP.org. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: resize Run Repeat Enabled 2 4 6 8 10 SE +/- 0.023, N = 3 SE +/- 0.024, N = 3 SE +/- 0.048, N = 3 6.222 6.213 6.212
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: rotate Run Repeat Enabled 3 6 9 12 15 SE +/- 0.008, N = 3 SE +/- 0.007, N = 3 SE +/- 0.017, N = 3 9.044 9.036 9.027
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: auto-levels Run Repeat Enabled 3 6 9 12 15 SE +/- 0.035, N = 3 SE +/- 0.021, N = 3 SE +/- 0.005, N = 3 9.334 9.304 9.323
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: unsharp-mask Run Repeat Enabled 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 11.20 11.17 11.21
Git This test measures the time needed to carry out some sample Git operations on an example, static repository that happens to be a copy of the GNOME GTK tool-kit repository. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Git Time To Complete Common Git Commands Run Repeat Enabled 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 40.99 40.88 40.97 1. git version 2.30.2
GNU GMP GMPbench GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GMPbench Score, More Is Better GNU GMP GMPbench 6.2.1 Total Time Run Repeat Enabled 1400 2800 4200 5600 7000 6438.8 6432.5 6449.1 1. (CC) gcc options: -O3 -fomit-frame-pointer -lm
GNU Radio GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Five Back to Back FIR Filters Run Repeat Enabled 300 600 900 1200 1500 SE +/- 14.23, N = 3 SE +/- 14.53, N = 3 SE +/- 5.99, N = 3 1506.3 1499.4 1527.4 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Signal Source (Cosine) Run Repeat Enabled 700 1400 2100 2800 3500 SE +/- 1.51, N = 3 SE +/- 4.70, N = 3 SE +/- 2.52, N = 3 3238.8 3240.7 3244.4 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FIR Filter Run Repeat Enabled 160 320 480 640 800 SE +/- 0.98, N = 3 SE +/- 0.71, N = 3 SE +/- 0.83, N = 3 733.7 733.4 734.0 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: IIR Filter Run Repeat Enabled 200 400 600 800 1000 SE +/- 0.58, N = 3 SE +/- 0.46, N = 3 SE +/- 0.78, N = 3 837.4 837.4 837.7 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FM Deemphasis Filter Run Repeat Enabled 200 400 600 800 1000 SE +/- 0.47, N = 3 SE +/- 2.55, N = 3 SE +/- 0.57, N = 3 1084.2 1082.2 1084.4 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Hilbert Transform Run Repeat Enabled 130 260 390 520 650 SE +/- 0.59, N = 3 SE +/- 0.84, N = 3 SE +/- 1.46, N = 3 591.4 593.7 594.4 1. 3.8.2.0
Google SynthMark SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 Run Repeat Enabled 200 400 600 800 1000 SE +/- 2.29, N = 3 SE +/- 0.35, N = 3 SE +/- 0.45, N = 3 926.29 928.09 928.21 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate Run Repeat Enabled 200 400 600 800 1000 SE +/- 1.20, N = 3 SE +/- 4.10, N = 3 SE +/- 6.00, N = 3 1041 1064 1073 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen Run Repeat Enabled 40 80 120 160 200 165 165 165 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced Run Repeat Enabled 50 100 150 200 250 217 218 218 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing Run Repeat Enabled 200 400 600 800 1000 SE +/- 3.53, N = 3 SE +/- 1.33, N = 3 SE +/- 1.20, N = 3 1089 1103 1101 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian Run Repeat Enabled 70 140 210 280 350 SE +/- 0.33, N = 3 309 310 310 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space Run Repeat Enabled 300 600 900 1200 1500 SE +/- 0.88, N = 3 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 1256 1275 1272 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Hugin Hugin is an open-source, cross-platform panorama photo stitcher software package. This test profile times how long it takes to run the assistant and panorama photo stitching on a set of images. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Hugin Panorama Photo Assistant + Stitching Time Run Repeat Enabled 9 18 27 36 45 SE +/- 0.18, N = 3 SE +/- 0.24, N = 3 SE +/- 0.08, N = 3 38.67 38.34 38.73
OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar Run Repeat Enabled 1.0643 2.1286 3.1929 4.2572 5.3215 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.004, N = 3 4.716 4.730 4.725
Java Gradle Build This test runs Java software project builds using the Gradle build system. It is intended to give developers an idea as to the build performance for development activities and build servers. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Java Gradle Build Gradle Build: Reactor Run Repeat Enabled 40 80 120 160 200 SE +/- 1.63, N = 12 SE +/- 1.79, N = 12 SE +/- 2.16, N = 12 174.16 171.07 173.55
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Run Repeat Enabled 400K 800K 1200K 1600K 2000K SE +/- 10588.25, N = 3 SE +/- 12333.33, N = 3 SE +/- 13383.24, N = 3 2088333 2054667 2065333 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
JPEG XL The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 5 Run Repeat Enabled 20 40 60 80 100 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 SE +/- 0.12, N = 3 74.75 75.02 74.76 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 Run Repeat Enabled 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 10.09 10.11 10.10 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 5 Run Repeat Enabled 20 40 60 80 100 SE +/- 0.07, N = 3 SE +/- 0.11, N = 3 SE +/- 0.36, N = 3 74.94 74.94 74.61 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 Run Repeat Enabled 20 40 60 80 100 SE +/- 0.49, N = 3 SE +/- 0.31, N = 3 SE +/- 0.18, N = 3 75.57 74.86 74.70 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
JPEG XL Decoding The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: 1 Run Repeat Enabled 13 26 39 52 65 SE +/- 0.13, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 56.37 56.71 56.70
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Run Repeat Enabled 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 6.41 6.42 6.41 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium Run Repeat Enabled 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 27.92 27.92 27.89 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Run Repeat Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 17.64 17.71 17.71 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Run Repeat Enabled 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 31.65 31.89 31.73 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast Run Repeat Enabled 15 30 45 60 75 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 65.40 65.72 65.54 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast Run Repeat Enabled 30 60 90 120 150 SE +/- 0.21, N = 3 SE +/- 0.23, N = 3 SE +/- 0.13, N = 3 119.79 120.19 120.06 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 Run Repeat Enabled 2 4 6 8 10 SE +/- 0.005, N = 3 SE +/- 0.014, N = 3 SE +/- 0.004, N = 3 6.526 6.532 6.521 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm
OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen Run Repeat Enabled 200 400 600 800 1000 SE +/- 7.88, N = 3 SE +/- 4.04, N = 3 SE +/- 8.54, N = 3 841 831 840 1. (CXX) g++ options: -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 Run Repeat Enabled 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 31.05 31.10 30.97 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 Run Repeat Enabled 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 11.03 11.06 11.05 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Run Repeat Enabled 0.6467 1.2934 1.9401 2.5868 3.2335 SE +/- 0.008, N = 3 SE +/- 0.007, N = 3 SE +/- 0.003, N = 3 2.874 2.867 2.855 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Run Repeat Enabled 12 24 36 48 60 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 SE +/- 0.11, N = 3 52.21 51.57 51.51 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Run Repeat Enabled 1.1502 2.3004 3.4506 4.6008 5.751 SE +/- 0.005, N = 3 SE +/- 0.002, N = 3 SE +/- 0.017, N = 3 5.092 5.085 5.112 1. (CXX) g++ options: -O3 -fPIC -lm
librsvg RSVG/librsvg is an SVG vector graphics library. This test profile times how long it takes to complete various operations by rsvg-convert. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better librsvg Operation: SVG Files To PNG Run Repeat Enabled 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 16.46 16.30 16.40 1. rsvg-convert version 2.50.3
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 Run Repeat Enabled 20M 40M 60M 80M 100M SE +/- 8504.90, N = 3 SE +/- 978008.35, N = 3 SE +/- 4910.31, N = 3 82631000 81644000 82626333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 2 - Buffer Length: 256 - Filter Length: 57 Run Repeat Enabled 30M 60M 90M 120M 150M SE +/- 33829.64, N = 3 SE +/- 1484620.42, N = 6 SE +/- 1078368.11, N = 3 157906667 155633333 156866667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 Run Repeat Enabled 70M 140M 210M 280M 350M SE +/- 968234.36, N = 3 SE +/- 817176.71, N = 3 SE +/- 158359.65, N = 3 319286667 319096667 321016667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 Run Repeat Enabled 130M 260M 390M 520M 650M SE +/- 8429373.91, N = 3 SE +/- 286724.80, N = 3 SE +/- 463656.96, N = 3 621093333 629466667 629876667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 12 - Buffer Length: 256 - Filter Length: 57 Run Repeat Enabled 140M 280M 420M 560M 700M SE +/- 162924.66, N = 3 SE +/- 489534.93, N = 3 SE +/- 2697671.84, N = 3 659116667 658753333 662150000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
LuaJIT This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better LuaJIT 2.1-git Test: Composite Run Repeat Enabled 400 800 1200 1600 2000 SE +/- 3.15, N = 3 SE +/- 1.32, N = 3 SE +/- 1.28, N = 3 1846.72 1828.04 1832.95 1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector
LuaRadio LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Five Back to Back FIR Filters Run Repeat Enabled 300 600 900 1200 1500 SE +/- 4.43, N = 3 SE +/- 8.83, N = 3 SE +/- 1.82, N = 3 1536.0 1528.4 1541.1
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: FM Deemphasis Filter Run Repeat Enabled 120 240 360 480 600 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 SE +/- 0.23, N = 3 547.0 546.9 547.0
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Hilbert Transform Run Repeat Enabled 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.25, N = 3 SE +/- 0.06, N = 3 107.1 107.2 107.2
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Complex Phase Run Repeat Enabled 200 400 600 800 1000 SE +/- 1.86, N = 3 SE +/- 2.01, N = 3 SE +/- 0.32, N = 3 783.3 784.4 787.1
LuxCoreRender LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.3 Scene: DLSC Run Repeat Enabled 0.441 0.882 1.323 1.764 2.205 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 1.94 1.96 1.96 MIN: 1.89 / MAX: 1.99 MIN: 1.92 / MAX: 2.01 MIN: 1.92 / MAX: 2.02
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.3 Scene: Rainbow Colors and Prism Run Repeat Enabled 0.4815 0.963 1.4445 1.926 2.4075 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 2.14 2.13 2.14 MIN: 2.07 / MAX: 2.17 MIN: 2.08 / MAX: 2.16 MIN: 2.09 / MAX: 2.16
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed Run Repeat Enabled 2K 4K 6K 8K 10K SE +/- 31.08, N = 3 SE +/- 6.63, N = 3 SE +/- 31.07, N = 3 10026.8 10047.5 10096.9 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed Run Repeat Enabled 15 30 45 60 75 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 66.00 66.00 65.95 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed Run Repeat Enabled 2K 4K 6K 8K 10K SE +/- 13.00, N = 3 SE +/- 5.72, N = 3 SE +/- 7.93, N = 3 10670.1 10655.3 10672.3 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed Run Repeat Enabled 14 28 42 56 70 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 64.72 64.67 64.67 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed Run Repeat Enabled 2K 4K 6K 8K 10K SE +/- 4.62, N = 3 SE +/- 9.28, N = 3 SE +/- 7.05, N = 3 10675.7 10683.2 10679.1 1. (CC) gcc options: -O3
Mobile Neural Network MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: SqueezeNetV1.0 Run Repeat Enabled 0.8525 1.705 2.5575 3.41 4.2625 SE +/- 0.018, N = 3 SE +/- 0.032, N = 3 SE +/- 0.008, N = 3 3.753 3.789 3.751 MIN: 3.69 / MAX: 4.53 MIN: 3.72 / MAX: 11.42 MIN: 3.69 / MAX: 11.5 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: resnet-v2-50 Run Repeat Enabled 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.11, N = 3 19.79 19.70 19.79 MIN: 19.59 / MAX: 21.26 MIN: 19.53 / MAX: 21.51 MIN: 19.55 / MAX: 21.46 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: MobileNetV2_224 Run Repeat Enabled 0.448 0.896 1.344 1.792 2.24 SE +/- 0.004, N = 3 SE +/- 0.013, N = 3 SE +/- 0.006, N = 3 1.991 1.980 1.971 MIN: 1.94 / MAX: 2.84 MIN: 1.92 / MAX: 2.88 MIN: 1.92 / MAX: 2.79 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: mobilenet-v1-1.0 Run Repeat Enabled 0.4356 0.8712 1.3068 1.7424 2.178 SE +/- 0.012, N = 3 SE +/- 0.005, N = 3 SE +/- 0.007, N = 3 1.936 1.923 1.921 MIN: 1.88 / MAX: 3.19 MIN: 1.88 / MAX: 2.74 MIN: 1.87 / MAX: 2.71 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: inception-v3 Run Repeat Enabled 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.15, N = 3 SE +/- 0.23, N = 3 23.19 22.90 23.06 MIN: 22.9 / MAX: 25.71 MIN: 22.59 / MAX: 25.48 MIN: 22.7 / MAX: 25.12 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Monte Carlo Simulations of Ionised Nebulae Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 Run Repeat Enabled 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 173 173 174 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lrt -lz
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Run Repeat Enabled 0.2923 0.5846 0.8769 1.1692 1.4615 SE +/- 0.00312, N = 3 SE +/- 0.00320, N = 3 SE +/- 0.00186, N = 3 1.29607 1.29529 1.29902
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C Run Repeat Enabled 300 600 900 1200 1500 SE +/- 10.80, N = 3 SE +/- 14.97, N = 15 SE +/- 14.97, N = 15 1581.87 1565.92 1554.98 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C Run Repeat Enabled 7K 14K 21K 28K 35K SE +/- 35.20, N = 3 SE +/- 75.77, N = 3 SE +/- 10.27, N = 3 30717.61 31174.97 31194.25 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
NCNN NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet Run Repeat Enabled 4 8 12 16 20 SE +/- 0.23, N = 3 SE +/- 0.18, N = 4 SE +/- 0.03, N = 3 15.94 16.00 15.72 MIN: 15.49 / MAX: 16.78 MIN: 15.6 / MAX: 16.91 MIN: 15.49 / MAX: 16.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 Run Repeat Enabled 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.09, N = 3 SE +/- 0.13, N = 4 SE +/- 0.05, N = 3 4.46 4.49 4.42 MIN: 4.23 / MAX: 5.67 MIN: 4.23 / MAX: 5.82 MIN: 4.24 / MAX: 6.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 Run Repeat Enabled 0.8213 1.6426 2.4639 3.2852 4.1065 SE +/- 0.04, N = 3 SE +/- 0.07, N = 4 SE +/- 0.06, N = 3 3.60 3.65 3.63 MIN: 3.48 / MAX: 4.48 MIN: 3.49 / MAX: 4.52 MIN: 3.48 / MAX: 4.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 Run Repeat Enabled 0.8528 1.7056 2.5584 3.4112 4.264 SE +/- 0.03, N = 3 SE +/- 0.01, N = 4 SE +/- 0.01, N = 3 3.79 3.77 3.76 MIN: 3.69 / MAX: 6.18 MIN: 3.7 / MAX: 5.82 MIN: 3.7 / MAX: 4.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet Run Repeat Enabled 0.8078 1.6156 2.4234 3.2312 4.039 SE +/- 0.08, N = 3 SE +/- 0.11, N = 4 SE +/- 0.09, N = 3 3.59 3.53 3.50 MIN: 3.38 / MAX: 4.48 MIN: 3.34 / MAX: 4.34 MIN: 3.34 / MAX: 4.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 Run Repeat Enabled 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.12, N = 3 SE +/- 0.11, N = 4 SE +/- 0.05, N = 3 5.73 5.68 5.63 MIN: 5.53 / MAX: 6.75 MIN: 5.5 / MAX: 6.87 MIN: 5.52 / MAX: 6.51 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface Run Repeat Enabled 0.315 0.63 0.945 1.26 1.575 SE +/- 0.04, N = 3 SE +/- 0.03, N = 4 SE +/- 0.04, N = 3 1.40 1.35 1.35 MIN: 1.28 / MAX: 1.81 MIN: 1.27 / MAX: 1.48 MIN: 1.26 / MAX: 1.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet Run Repeat Enabled 3 6 9 12 15 SE +/- 0.39, N = 3 SE +/- 0.34, N = 4 SE +/- 0.36, N = 3 12.60 12.15 12.16 MIN: 11.72 / MAX: 13.76 MIN: 11.72 / MAX: 13.4 MIN: 11.71 / MAX: 13.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 Run Repeat Enabled 13 26 39 52 65 SE +/- 1.26, N = 3 SE +/- 0.11, N = 4 SE +/- 0.10, N = 3 57.57 56.35 56.10 MIN: 55.66 / MAX: 733.01 MIN: 55.59 / MAX: 62.94 MIN: 55.69 / MAX: 58.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 Run Repeat Enabled 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.26, N = 4 SE +/- 0.32, N = 3 12.93 12.74 12.60 MIN: 12.69 / MAX: 13.39 MIN: 11.82 / MAX: 13.44 MIN: 11.81 / MAX: 13.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet Run Repeat Enabled 3 6 9 12 15 SE +/- 0.17, N = 3 SE +/- 0.01, N = 4 SE +/- 0.18, N = 3 11.35 11.19 11.36 MIN: 10.99 / MAX: 12.87 MIN: 10.99 / MAX: 11.53 MIN: 11 / MAX: 11.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 Run Repeat Enabled 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.04, N = 4 SE +/- 0.50, N = 3 24.65 24.59 24.05 MIN: 24.32 / MAX: 25.71 MIN: 24.27 / MAX: 31.94 MIN: 22.83 / MAX: 32.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny Run Repeat Enabled 6 12 18 24 30 SE +/- 0.20, N = 3 SE +/- 0.14, N = 4 SE +/- 0.17, N = 3 23.13 23.11 23.05 MIN: 22.55 / MAX: 24.22 MIN: 22.55 / MAX: 23.74 MIN: 22.55 / MAX: 23.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd Run Repeat Enabled 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.06, N = 4 SE +/- 0.10, N = 3 18.16 18.18 18.10 MIN: 17.79 / MAX: 26.2 MIN: 17.83 / MAX: 18.68 MIN: 17.75 / MAX: 18.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m Run Repeat Enabled 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.07, N = 4 SE +/- 0.09, N = 3 11.09 11.04 11.06 MIN: 10.88 / MAX: 12.08 MIN: 10.75 / MAX: 12.06 MIN: 10.79 / MAX: 12.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Ngspice Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 Run Repeat Enabled 20 40 60 80 100 SE +/- 0.30, N = 3 SE +/- 0.69, N = 3 SE +/- 0.32, N = 3 108.10 107.26 108.73 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 Run Repeat Enabled 20 40 60 80 100 SE +/- 0.23, N = 3 SE +/- 0.29, N = 3 SE +/- 0.57, N = 3 86.39 86.23 87.39 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
OCRMyPDF OCRMyPDF is an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs with the text now selectable/searchable/copy-paste capable. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OCRMyPDF 10.3.1+dfsg Processing 60 Page PDF Document Run Repeat Enabled 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 15.56 15.49 15.51
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Run Repeat Enabled 0.9097 1.8194 2.7291 3.6388 4.5485 SE +/- 0.00367, N = 3 SE +/- 0.00225, N = 3 SE +/- 0.00318, N = 3 4.04332 4.03238 4.04078 MIN: 3.92 MIN: 3.91 MIN: 3.91 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Run Repeat Enabled 3 6 9 12 15 SE +/- 0.00258, N = 3 SE +/- 0.00778, N = 3 SE +/- 0.00888, N = 3 10.56970 9.49268 10.57120 MIN: 10.51 MIN: 9.43 MIN: 10.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Run Repeat Enabled 0.1585 0.317 0.4755 0.634 0.7925 SE +/- 0.002770, N = 3 SE +/- 0.001477, N = 3 SE +/- 0.001249, N = 3 0.704534 0.704665 0.703196 MIN: 0.66 MIN: 0.66 MIN: 0.66 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Run Repeat Enabled 0.6462 1.2924 1.9386 2.5848 3.231 SE +/- 0.00257, N = 3 SE +/- 0.01085, N = 3 SE +/- 0.00634, N = 3 2.87200 2.77561 2.86126 MIN: 2.8 MIN: 2.69 MIN: 2.81 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 2 4 6 8 10 SE +/- 0.00312, N = 3 SE +/- 0.00649, N = 3 SE +/- 0.00601, N = 3 8.39591 8.39297 8.39732 MIN: 8.3 MIN: 8.25 MIN: 8.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 1.0461 2.0922 3.1383 4.1844 5.2305 SE +/- 0.03164, N = 3 SE +/- 0.01779, N = 3 SE +/- 0.03141, N = 3 4.64952 4.38277 4.60823 MIN: 4.15 MIN: 4.04 MIN: 4.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 13.93 13.86 13.95 MIN: 13.85 MIN: 13.78 MIN: 13.88 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Run Repeat Enabled 2 4 6 8 10 SE +/- 0.06939, N = 15 SE +/- 0.09537, N = 15 SE +/- 0.07203, N = 15 6.07807 6.19314 6.16825 MIN: 3.69 MIN: 3.68 MIN: 3.67 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Run Repeat Enabled 0.9426 1.8852 2.8278 3.7704 4.713 SE +/- 0.00910, N = 3 SE +/- 0.00355, N = 3 SE +/- 0.00971, N = 3 4.17526 4.17427 4.18919 MIN: 4.13 MIN: 4.13 MIN: 4.12 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Run Repeat Enabled 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 12.10 12.06 12.12 MIN: 12.03 MIN: 11.93 MIN: 12.06 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Run Repeat Enabled 0.1851 0.3702 0.5553 0.7404 0.9255 SE +/- 0.003248, N = 3 SE +/- 0.003677, N = 3 SE +/- 0.003255, N = 3 0.821323 0.822519 0.822350 MIN: 0.8 MIN: 0.8 MIN: 0.8 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Run Repeat Enabled 0.3246 0.6492 0.9738 1.2984 1.623 SE +/- 0.01056, N = 3 SE +/- 0.00726, N = 3 SE +/- 0.01018, N = 3 1.43273 1.44288 1.43641 MIN: 1.36 MIN: 1.36 MIN: 1.37 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Run Repeat Enabled 700 1400 2100 2800 3500 SE +/- 3.51, N = 3 SE +/- 4.19, N = 3 SE +/- 2.69, N = 3 3066.19 2988.39 3091.74 MIN: 3055.74 MIN: 2975.82 MIN: 3081.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Run Repeat Enabled 400 800 1200 1600 2000 SE +/- 3.10, N = 3 SE +/- 1.82, N = 3 SE +/- 1.39, N = 3 1802.18 1760.75 1812.22 MIN: 1794.84 MIN: 1753.88 MIN: 1806.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Run Repeat Enabled 700 1400 2100 2800 3500 SE +/- 5.99, N = 3 SE +/- 5.44, N = 3 SE +/- 1.66, N = 3 3066.56 3001.60 3092.80 MIN: 3053.3 MIN: 2989.75 MIN: 3086.05 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 15.81 15.82 15.82 MIN: 15.79 MIN: 15.79 MIN: 15.79 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 16.45 16.45 16.45 MIN: 16.25 MIN: 16.24 MIN: 16.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 16.70 16.73 16.74 MIN: 16.63 MIN: 16.63 MIN: 16.62 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Run Repeat Enabled 400 800 1200 1600 2000 SE +/- 1.75, N = 3 SE +/- 1.53, N = 3 SE +/- 2.19, N = 3 1800.95 1762.42 1814.66 MIN: 1793.72 MIN: 1756.24 MIN: 1808.15 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Run Repeat Enabled 0.7736 1.5472 2.3208 3.0944 3.868 SE +/- 0.00271, N = 3 SE +/- 0.00297, N = 3 SE +/- 0.00397, N = 3 3.43837 3.41721 3.42660 MIN: 3.36 MIN: 3.34 MIN: 3.35 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 700 1400 2100 2800 3500 SE +/- 17.72, N = 3 SE +/- 6.46, N = 3 SE +/- 1.99, N = 3 3075.31 2998.59 3092.81 MIN: 3049.81 MIN: 2981.81 MIN: 3084.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 400 800 1200 1600 2000 SE +/- 3.82, N = 3 SE +/- 4.58, N = 3 SE +/- 1.95, N = 3 1802.00 1762.48 1816.65 MIN: 1791.17 MIN: 1751.25 MIN: 1809.76 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Run Repeat Enabled 0.2866 0.5732 0.8598 1.1464 1.433 SE +/- 0.00054, N = 3 SE +/- 0.00115, N = 3 SE +/- 0.00109, N = 3 1.27367 1.26745 1.26765 MIN: 1.23 MIN: 1.22 MIN: 1.22 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU Run Repeat Enabled 0.787 1.574 2.361 3.148 3.935 SE +/- 0.00395, N = 3 SE +/- 0.00650, N = 3 SE +/- 0.00666, N = 3 3.48539 3.49474 3.49756 MIN: 3.42 MIN: 3.42 MIN: 3.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Run Repeat Enabled 100 200 300 400 500 SE +/- 2.42, N = 3 SE +/- 1.59, N = 3 SE +/- 1.59, N = 3 456 457 457 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Run Repeat Enabled 200 400 600 800 1000 SE +/- 1.17, N = 3 SE +/- 1.88, N = 3 SE +/- 0.76, N = 3 854 853 852 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Run Repeat Enabled 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 85 84 84 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Run Repeat Enabled 4K 8K 12K 16K 20K SE +/- 28.64, N = 3 SE +/- 41.78, N = 3 SE +/- 14.97, N = 3 17624 17671 17685 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU Run Repeat Enabled 2K 4K 6K 8K 10K SE +/- 2.20, N = 3 SE +/- 36.97, N = 3 SE +/- 21.10, N = 3 7835 7800 7801 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenFOAM OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M Run Repeat Enabled 50 100 150 200 250 SE +/- 0.22, N = 3 SE +/- 0.44, N = 3 SE +/- 0.17, N = 3 227.09 227.61 228.01 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
OpenSCAD OpenSCAD is a programmer-focused solid 3D CAD modeller. OpenSCAD is free software and allows creating 3D CAD objects in a script-based modelling environment. This test profile will use the system-provided OpenSCAD program otherwise and time how long it takes tn render different SCAD assets to PNG output. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Pistol Run Repeat Enabled 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.24, N = 3 SE +/- 0.08, N = 3 78.21 78.07 77.85 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Retro Car Run Repeat Enabled 0.824 1.648 2.472 3.296 4.12 SE +/- 0.002, N = 3 SE +/- 0.005, N = 3 SE +/- 0.006, N = 3 3.656 3.660 3.662 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Mini-ITX Case Run Repeat Enabled 8 16 24 32 40 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 35.30 35.14 35.22 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Projector Mount Swivel Run Repeat Enabled 2 4 6 8 10 SE +/- 0.008, N = 3 SE +/- 0.042, N = 3 SE +/- 0.005, N = 3 6.925 6.949 6.934 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Leonardo Phone Case Slim Run Repeat Enabled 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 14.24 14.20 14.32 1. OpenSCAD version 2021.01
OpenVKL OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmark Run Repeat Enabled 40 80 120 160 200 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 1.00, N = 3 196 197 194 MIN: 1 / MAX: 807 MIN: 1 / MAX: 806 MIN: 1 / MAX: 790
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume Run Repeat Enabled 5M 10M 15M 20M 25M SE +/- 120334.61, N = 3 SE +/- 283788.09, N = 3 SE +/- 44284.02, N = 3 24792840 25063015 24682690 MIN: 1602573 / MAX: 99801972 MIN: 1580423 / MAX: 105264432 MIN: 1560523 / MAX: 94703364
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkStructuredVolume Run Repeat Enabled 16M 32M 48M 64M 80M SE +/- 1052274.46, N = 3 SE +/- 357980.91, N = 3 SE +/- 576939.06, N = 15 76029329 72560377 74635152 MIN: 1954651 / MAX: 659359440 MIN: 1941493 / MAX: 544635720 MIN: 1917397 / MAX: 657699912
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkUnstructuredVolume Run Repeat Enabled 600K 1200K 1800K 2400K 3000K SE +/- 2813.39, N = 3 SE +/- 3937.22, N = 3 SE +/- 884.39, N = 3 2706127 2709587 2708582 MIN: 32688 / MAX: 9055017 MIN: 32743 / MAX: 9064631 MIN: 32681 / MAX: 9049793
Opus Codec Encoding Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode Run Repeat Enabled 2 4 6 8 10 SE +/- 0.010, N = 5 SE +/- 0.011, N = 5 SE +/- 0.011, N = 5 7.232 7.222 7.230 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
OSPray Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: SciVis Run Repeat Enabled 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.16, N = 3 SE +/- 0.16, N = 3 21.74 22.06 22.06 MIN: 20 / MAX: 22.22 MIN: 21.28 / MAX: 22.22 MIN: 21.28 / MAX: 22.22
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: SciVis Run Repeat Enabled 0.7515 1.503 2.2545 3.006 3.7575 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.33 3.32 3.34 MIN: 3.27 / MAX: 3.41 MIN: 3.28 / MAX: 3.4 MIN: 3.29 / MAX: 3.4
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: Path Tracer Run Repeat Enabled 0.3578 0.7156 1.0734 1.4312 1.789 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.59 1.59 1.59 MIN: 1.58 / MAX: 1.6 MIN: 1.58 / MAX: 1.6 MIN: 1.58 / MAX: 1.6
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: SciVis Run Repeat Enabled 6 12 18 24 30 25 25 25 MIN: 23.81 / MAX: 25.64 MIN: 24.39 / MAX: 25.64 MIN: 23.81 / MAX: 25.64
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: Path Tracer Run Repeat Enabled 0.3938 0.7876 1.1814 1.5752 1.969 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.74 1.73 1.75 MIN: 1.65 / MAX: 1.78 MIN: 1.72 / MAX: 1.77 MIN: 1.73 / MAX: 1.78
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: SciVis Run Repeat Enabled 5 10 15 20 25 20 20 20 MIN: 19.61 / MAX: 20.41 MIN: 19.23 / MAX: 20.41 MIN: 18.87 / MAX: 20.41
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: Path Tracer Run Repeat Enabled 1.0283 2.0566 3.0849 4.1132 5.1415 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 4.57 4.55 4.57 MIN: 4.42 / MAX: 4.74 MIN: 4.44 / MAX: 4.78 MIN: 4.44 / MAX: 4.76
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: Path Tracer Run Repeat Enabled 70 140 210 280 350 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 333.33 333.33 333.33 MIN: 250 MIN: 250 MIN: 250
OpenBenchmarking.org Seconds, Fewer Is Better Perl Benchmarks Test: Interpreter Run Repeat Enabled 0.0002 0.0004 0.0006 0.0008 0.001 SE +/- 0.00000526, N = 3 SE +/- 0.00000063, N = 3 SE +/- 0.00000726, N = 3 0.00068233 0.00067438 0.00067785
PHPBench PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite Run Repeat Enabled 200K 400K 600K 800K 1000K SE +/- 3145.44, N = 3 SE +/- 1962.09, N = 3 SE +/- 4727.53, N = 3 1020071 1017890 1010404
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU Run Repeat Enabled 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 17.86 17.79 17.73
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU Run Repeat Enabled 1.2465 2.493 3.7395 4.986 6.2325 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 5.52 5.54 5.52
POV-Ray This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Run Repeat Enabled 9 18 27 36 45 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 38.30 38.35 38.19 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.4 1e12 Prime Number Generation Run Repeat Enabled 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 18.07 18.03 18.04 1. (CXX) g++ options: -O3 -lpthread
PyBench This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times Run Repeat Enabled 150 300 450 600 750 SE +/- 1.00, N = 3 SE +/- 2.08, N = 3 SE +/- 2.85, N = 3 711 716 714
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 Run Repeat Enabled 700 1400 2100 2800 3500 SE +/- 24.43, N = 3 SE +/- 18.56, N = 3 SE +/- 17.21, N = 3 3311.3 3312.4 3323.0 1. (CXX) g++ options: -O3 -march=native -rdynamic
rays1bench This is a test of rays1bench, a simple path-tracer / ray-tracing that supports SSE and AVX instructions, multi-threading, and other features. This test profile is measuring the performance of the "large scene" in rays1bench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org mrays/s, More Is Better rays1bench 2020-01-09 Large Scene Run Repeat Enabled 16 32 48 64 80 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 71.69 71.69 71.82
RNNoise RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 Run Repeat Enabled 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 20.89 20.90 20.89 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Run Repeat Enabled 50 100 150 200 250 SE +/- 2.75, N = 3 SE +/- 1.60, N = 3 SE +/- 1.22, N = 3 209.92 208.79 208.70 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Run Repeat Enabled 16 32 48 64 80 SE +/- 1.89, N = 14 SE +/- 0.33, N = 3 SE +/- 0.41, N = 14 72.24 70.25 70.42 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Run Repeat Enabled 20 40 60 80 100 SE +/- 0.20, N = 3 SE +/- 0.64, N = 3 SE +/- 0.42, N = 3 103.49 104.96 103.84 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver Run Repeat Enabled 5 10 15 20 25 SE +/- 0.13, N = 3 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 21.30 21.10 21.08 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Run Repeat Enabled 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 17.02 17.01 17.00 1. (CXX) g++ options: -O2 -lOpenCL
Selenium This test profile uses the Selenium WebDriver for running various browser benchmarks in different available web browsers such as Firefox and Google Chrome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: ARES-6 - Browser: Firefox Run Repeat Enabled 9 18 27 36 45 SE +/- 0.06, N = 3 SE +/- 0.14, N = 3 SE +/- 0.20, N = 3 37.86 38.09 38.12 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Firefox Run Repeat Enabled 200 400 600 800 1000 SE +/- 1.71, N = 3 SE +/- 0.73, N = 3 SE +/- 0.82, N = 3 844.6 842.2 844.1 1. firefox 86.0
OpenBenchmarking.org Runs / Minute, More Is Better Selenium Benchmark: StyleBench - Browser: Firefox Run Repeat Enabled 30 60 90 120 150 SE +/- 1.76, N = 15 SE +/- 1.81, N = 15 SE +/- 2.02, N = 15 118 119 117 1. firefox 86.0
OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Firefox Run Repeat Enabled 20 40 60 80 100 SE +/- 0.45, N = 3 SE +/- 0.71, N = 3 SE +/- 0.91, N = 3 100.00 99.74 102.78 1. firefox 86.0
OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Firefox Run Repeat Enabled 30 60 90 120 150 SE +/- 1.34, N = 3 SE +/- 1.09, N = 3 SE +/- 0.67, N = 3 136.7 137.9 134.0 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Google Chrome Run Repeat Enabled 130 260 390 520 650 SE +/- 1.66, N = 3 SE +/- 0.93, N = 3 SE +/- 1.50, N = 3 610.7 610.4 616.6 1. chrome 89.0.4389.90
OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Firefox Run Repeat Enabled 600 1200 1800 2400 3000 SE +/- 9.87, N = 3 SE +/- 3.33, N = 3 SE +/- 10.17, N = 3 2788 2776 2779 1. firefox 86.0
OpenBenchmarking.org Runs / Minute, More Is Better Selenium Benchmark: StyleBench - Browser: Google Chrome Run Repeat Enabled 10 20 30 40 50 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 SE +/- 0.13, N = 3 46.19 45.78 45.95 1. chrome 89.0.4389.90
OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome Run Repeat Enabled 600 1200 1800 2400 3000 SE +/- 2.31, N = 3 SE +/- 6.24, N = 3 2825 2823 2813 1. chrome 89.0.4389.90
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Firefox Run Repeat Enabled 6 12 18 24 30 SE +/- 0.34, N = 3 SE +/- 0.07, N = 3 SE +/- 0.12, N = 3 25.4 25.0 25.1 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Firefox Run Repeat Enabled 70 140 210 280 350 SE +/- 3.27, N = 3 SE +/- 3.18, N = 3 SE +/- 3.30, N = 3 337.9 337.9 337.8 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome Run Repeat Enabled 6 12 18 24 30 SE +/- 0.16, N = 3 SE +/- 0.35, N = 3 SE +/- 0.05, N = 3 26.95 26.65 26.57 1. chrome 89.0.4389.90
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome Run Repeat Enabled 60 120 180 240 300 SE +/- 0.15, N = 3 SE +/- 0.19, N = 3 SE +/- 0.22, N = 3 280.72 280.30 280.38 1. chrome 89.0.4389.90
simdjson This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: Kostya Run Repeat Enabled 0.81 1.62 2.43 3.24 4.05 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.60 3.60 3.59 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: LargeRandom Run Repeat Enabled 0.2723 0.5446 0.8169 1.0892 1.3615 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.21 1.21 1.21 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: PartialTweets Run Repeat Enabled 1.1813 2.3626 3.5439 4.7252 5.9065 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 5.24 5.25 5.22 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: DistinctUserID Run Repeat Enabled 1.287 2.574 3.861 5.148 6.435 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 5.68 5.72 5.60 1. (CXX) g++ options: -O3 -pthread
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples Run Repeat Enabled 2 4 6 8 10 SE +/- 0.001, N = 3 SE +/- 0.005, N = 3 SE +/- 0.006, N = 3 8.883 8.883 8.880 1. (CXX) g++ options: -fopenmp -O3
srsLTE srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software defined (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Samples / Second, More Is Better srsLTE 20.10.1 Test: OFDM_Test Run Repeat Enabled 40M 80M 120M 160M 200M SE +/- 1473091.99, N = 3 SE +/- 2512855.83, N = 3 SE +/- 260341.66, N = 3 176800000 178033333 176933333 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
OpenBenchmarking.org eNb Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Run Repeat Enabled 70 140 210 280 350 SE +/- 1.22, N = 3 SE +/- 0.32, N = 3 SE +/- 0.55, N = 3 336.9 335.7 335.8 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
OpenBenchmarking.org UE Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Run Repeat Enabled 30 60 90 120 150 SE +/- 0.40, N = 3 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 132.6 132.4 132.7 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time Run Repeat Enabled 5M 10M 15M 20M 25M SE +/- 240951.84, N = 3 SE +/- 77091.76, N = 3 SE +/- 188309.24, N = 3 22203685 22712289 22389342 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time Run Repeat Enabled 6M 12M 18M 24M 30M SE +/- 88725.14, N = 3 SE +/- 139588.95, N = 3 SE +/- 213446.67, N = 3 28372238 28469418 28813121 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Crypto Run Repeat Enabled 500 1000 1500 2000 2500 SE +/- 1.86, N = 3 SE +/- 0.23, N = 3 SE +/- 1.85, N = 3 2217.75 2218.61 2217.94 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: CPU Cache Run Repeat Enabled 6 12 18 24 30 SE +/- 0.21, N = 3 SE +/- 0.29, N = 3 SE +/- 0.19, N = 3 23.42 23.02 22.88 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: CPU Stress Run Repeat Enabled 1100 2200 3300 4400 5500 SE +/- 21.71, N = 3 SE +/- 12.12, N = 3 SE +/- 5.24, N = 3 5001.07 4964.68 4991.24 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Matrix Math Run Repeat Enabled 12K 24K 36K 48K 60K SE +/- 210.46, N = 3 SE +/- 122.73, N = 3 SE +/- 126.41, N = 3 57377.45 57137.41 56981.91 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Vector Math Run Repeat Enabled 13K 26K 39K 52K 65K SE +/- 57.06, N = 3 SE +/- 100.32, N = 3 SE +/- 97.11, N = 3 62436.51 62160.59 61555.92 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Memory Copying Run Repeat Enabled 300 600 900 1200 1500 SE +/- 2.42, N = 3 SE +/- 3.20, N = 3 SE +/- 2.01, N = 3 1526.56 1533.93 1533.48 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Context Switching Run Repeat Enabled 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 58907.92, N = 4 SE +/- 63682.03, N = 3 SE +/- 56528.36, N = 3 4881747.94 5095907.11 5101477.04 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p Run Repeat Enabled 0.9574 1.9148 2.8722 3.8296 4.787 SE +/- 0.023, N = 3 SE +/- 0.028, N = 3 SE +/- 0.010, N = 3 4.232 4.254 4.255 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p Run Repeat Enabled 9 18 27 36 45 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 37.61 37.76 37.85 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Run Repeat Enabled 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 9.27 9.29 9.26 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Run Repeat Enabled 30 60 90 120 150 SE +/- 0.14, N = 3 SE +/- 0.08, N = 3 SE +/- 0.28, N = 3 135.41 135.17 135.79 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Run Repeat Enabled 60 120 180 240 300 SE +/- 0.33, N = 3 SE +/- 0.14, N = 3 SE +/- 0.21, N = 3 262.47 262.54 266.00 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Run Repeat Enabled 40 80 120 160 200 SE +/- 1.87, N = 3 SE +/- 2.07, N = 4 SE +/- 1.51, N = 3 183.50 184.70 185.98 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Run Repeat Enabled 40 80 120 160 200 SE +/- 0.55, N = 3 SE +/- 0.23, N = 3 SE +/- 0.25, N = 3 190.50 191.19 191.74 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Run Repeat Enabled 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 0.10, N = 3 SE +/- 0.11, N = 3 155.96 156.26 157.07 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Sysbench This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory Run Repeat Enabled 5K 10K 15K 20K 25K SE +/- 87.98, N = 3 SE +/- 57.62, N = 3 SE +/- 74.39, N = 3 25429.68 25202.79 25461.39 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU Run Repeat Enabled 7K 14K 21K 28K 35K SE +/- 5.32, N = 3 SE +/- 4.23, N = 3 SE +/- 0.29, N = 3 34093.33 34092.54 34101.32 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Tachyon This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time Run Repeat Enabled 20 40 60 80 100 SE +/- 0.43, N = 15 SE +/- 0.53, N = 15 SE +/- 0.54, N = 3 74.02 75.02 74.88 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
Tesseract OCR Tesseract-OCR is the open-source optical character recognition (OCR) engine for the conversion of text within images to raw text output. This test profile relies upon a system-supplied Tesseract installation. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tesseract OCR 4.1.1 Time To OCR 7 Images Run Repeat Enabled 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 18.46 18.51 18.50
Timed Mesa Compilation This test profile times how long it takes to compile Mesa with Meson/Ninja. For minimizing build dependencies and avoid versioning conflicts, test this is just the core Mesa build without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile Run Repeat Enabled 10 20 30 40 50 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 45.94 45.87 46.06
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Run Repeat Enabled 13 26 39 52 65 SE +/- 0.74, N = 3 SE +/- 0.47, N = 3 SE +/- 0.41, N = 3 57.33 57.64 57.25 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm
Timed Wasmer Compilation This test times how long it takes to compile Wasmer. Wasmer is written in the Rust programming language and is a WebAssembly runtime implementation that supports WASI and EmScripten. This test profile builds Wasmer with the Cranelift and Singlepast compiler features enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile Run Repeat Enabled 12 24 36 48 60 SE +/- 0.26, N = 3 SE +/- 0.27, N = 3 SE +/- 0.35, N = 3 52.88 53.06 52.85 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc
TNN TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 Run Repeat Enabled 60 120 180 240 300 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 263.60 263.47 263.34 MIN: 261.99 / MAX: 268.68 MIN: 261.32 / MAX: 269.11 MIN: 261.29 / MAX: 268.2 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 Run Repeat Enabled 60 120 180 240 300 SE +/- 0.68, N = 3 SE +/- 0.36, N = 3 SE +/- 0.05, N = 3 265.45 264.77 264.38 MIN: 263.15 / MAX: 269.59 MIN: 263.03 / MAX: 266.49 MIN: 262.68 / MAX: 267.22 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance Run Repeat Enabled 400K 800K 1200K 1600K 2000K SE +/- 1321.50, N = 5 SE +/- 1711.59, N = 5 SE +/- 1321.50, N = 5 1724418 1726583 1723339 1. (CC) gcc options: -O3 -march=native
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping Run Repeat Enabled 120 240 360 480 600 SE +/- 0.40, N = 3 SE +/- 1.14, N = 3 SE +/- 1.36, N = 3 561.83 556.31 550.73 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
ViennaCL ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY Run Repeat Enabled 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 25.3 25.1 25.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY Run Repeat Enabled 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 39.1 38.7 38.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT Run Repeat Enabled 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 41.7 41.3 41.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY Run Repeat Enabled 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 23.8 23.6 23.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY Run Repeat Enabled 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 36.6 36.3 36.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT Run Repeat Enabled 9 18 27 36 45 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 38.6 38.4 38.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N Run Repeat Enabled 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 46.6 46.7 46.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T Run Repeat Enabled 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.17, N = 3 48.4 48.4 48.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN Run Repeat Enabled 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 SE +/- 1.10, N = 3 47.0 46.8 45.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT Run Repeat Enabled 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.17, N = 3 SE +/- 0.13, N = 3 45.1 44.7 44.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT Run Repeat Enabled 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 SE +/- 0.45, N = 3 47.5 47.3 46.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN Run Repeat Enabled 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.00, N = 2 SE +/- 0.57, N = 3 48.7 48.7 48.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 0 Run Repeat Enabled 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 9.35 9.35 9.34 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 5 Run Repeat Enabled 8 16 24 32 40 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 32.84 32.90 32.91 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Run Repeat Enabled 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 15.32 15.32 15.30 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Run Repeat Enabled 2 4 6 8 10 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 SE +/- 0.005, N = 3 6.080 6.083 6.095 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default Run Repeat Enabled 0.8021 1.6042 2.4063 3.2084 4.0105 SE +/- 0.004, N = 3 SE +/- 0.012, N = 3 SE +/- 0.015, N = 3 3.565 3.545 3.533 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 Run Repeat Enabled 40 80 120 160 200 SE +/- 0.18, N = 3 SE +/- 0.25, N = 3 SE +/- 0.34, N = 3 190.33 190.53 191.35 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 Run Repeat Enabled 3 6 9 12 15 SE +/- 0.016, N = 3 SE +/- 0.008, N = 3 SE +/- 0.004, N = 3 9.625 9.603 9.618 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WireGuard + Linux Networking Stack Stress Test This is a benchmark of the WireGuard secure VPN tunnel and Linux networking stack stress test. The test runs on the local host but does require root permissions to run. The way it works is it creates three namespaces. ns0 has a loopback device. ns1 and ns2 each have wireguard devices. Those two wireguard devices send traffic through the loopback device of ns0. The end result of this is that tests wind up testing encryption and decryption at the same time -- a pretty CPU and scheduler-heavy workflow. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WireGuard + Linux Networking Stack Stress Test Run Repeat Enabled 30 60 90 120 150 SE +/- 1.20, N = 3 SE +/- 0.50, N = 3 SE +/- 0.56, N = 3 130.53 129.89 130.03
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding Run Repeat Enabled 30 60 90 120 150 SE +/- 1.20, N = 3 SE +/- 0.83, N = 3 SE +/- 0.41, N = 3 119.15 119.08 118.95 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Run Repeat Enabled 4 8 12 16 20 SE +/- 0.13, N = 3 SE +/- 0.12, N = 9 SE +/- 0.21, N = 3 14.82 15.14 15.04 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p Run Repeat Enabled 15 30 45 60 75 SE +/- 0.29, N = 3 SE +/- 0.17, N = 3 SE +/- 0.48, N = 3 67.60 67.44 67.47 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction Run Repeat Enabled 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.29, N = 3 33.77 33.69 34.13 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Run Repeat Enabled 30 60 90 120 150 SE +/- 0.03, N = 3 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 118.35 118.12 118.39 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 Run Repeat Enabled 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 29.86 29.81 29.80 1. (CC) gcc options: -pthread -fvisibility=hidden -O2
YafaRay YafaRay is an open-source physically based montecarlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene Run Repeat Enabled 30 60 90 120 150 SE +/- 0.46, N = 3 SE +/- 0.18, N = 3 SE +/- 0.08, N = 3 121.33 121.24 121.45 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
Zstd Compression This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed Run Repeat Enabled 70 140 210 280 350 SE +/- 0.86, N = 3 SE +/- 1.09, N = 3 SE +/- 2.05, N = 3 310.2 312.9 314.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed Run Repeat Enabled 1000 2000 3000 4000 5000 SE +/- 1.52, N = 3 SE +/- 2.15, N = 3 SE +/- 3.90, N = 3 4774.4 4760.7 4763.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed Run Repeat Enabled 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.19, N = 3 SE +/- 0.09, N = 3 33.1 33.1 33.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed Run Repeat Enabled 900 1800 2700 3600 4500 SE +/- 8.51, N = 3 SE +/- 4.33, N = 3 SE +/- 11.78, N = 3 4394.0 4401.8 4390.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed Run Repeat Enabled 300 600 900 1200 1500 SE +/- 22.02, N = 15 SE +/- 28.36, N = 15 SE +/- 14.33, N = 15 1303.8 1327.1 1302.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed Run Repeat Enabled 1000 2000 3000 4000 5000 SE +/- 11.09, N = 15 SE +/- 5.59, N = 15 SE +/- 12.94, N = 15 4864.6 4875.4 4865.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed Run Repeat Enabled 80 160 240 320 400 SE +/- 1.58, N = 3 SE +/- 1.95, N = 3 SE +/- 3.48, N = 3 389.4 382.8 386.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed Run Repeat Enabled 1100 2200 3300 4400 5500 SE +/- 2.66, N = 3 SE +/- 16.73, N = 3 SE +/- 6.01, N = 3 5065.8 5042.4 5071.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed Run Repeat Enabled 7 14 21 28 35 SE +/- 0.07, N = 3 SE +/- 0.15, N = 3 SE +/- 0.15, N = 3 31.3 31.3 31.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed Run Repeat Enabled 900 1800 2700 3600 4500 SE +/- 3.52, N = 3 SE +/- 4.83, N = 3 SE +/- 38.86, N = 3 4308.3 4315.8 4252.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Enabled Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: DEBUGINFOD_URLS=Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x39 - Thermald 2.4.3Java Notes: OpenJDK Runtime Environment (build 11.0.11-ea+4-Ubuntu-0ubuntu2)Python Notes: Python 3.9.2Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 26 March 2021 14:03 by user ronix.
Repeat Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: DEBUGINFOD_URLS=Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x39 - Thermald 2.4.3Java Notes: OpenJDK Runtime Environment (build 11.0.11-ea+4-Ubuntu-0ubuntu2)Python Notes: Python 3.9.2Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 27 March 2021 10:46 by user ronix.
Run Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0610 BIOS), Chipset: Intel Tiger Lake-H, Memory: 32GB, Disk: 1000GB Western Digital WD_BLACK SN850 1TB + 2000GB, Graphics: AMD Radeon RX 6800/6800 XT / 6900 16GB (2575/1000MHz), Audio: Intel Tiger Lake-H HD Audio, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Device 2725
OS: Ubuntu 21.04, Kernel: 5.12.0-051200rc3daily20210315-generic (x86_64) 20210314, Desktop: GNOME Shell 3.38.3, Display Server: X Server 1.20.10 + Wayland, OpenGL: 4.6 Mesa 21.1.0-devel (git-616720d 2021-03-16 hirsute-oibaf-ppa) (LLVM 12.0.0), Compiler: GCC 10.2.1 20210312, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseEnvironment Notes: DEBUGINFOD_URLS=Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x39 - Thermald 2.4.3Java Notes: OpenJDK Runtime Environment (build 11.0.11-ea+4-Ubuntu-0ubuntu2)Python Notes: Python 3.9.2Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 28 March 2021 07:45 by user ronix.