Benchmarks for a future article.
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 Enabled Repeat Run 60M 120M 180M 240M 300M SE +/- 43622.17, N = 3 SE +/- 57627.28, N = 3 SE +/- 73645.19, N = 3 259341600 259622700 259329700 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
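Each result on this page carries an "SE +/-" value and a trial count N. As a rough guide to reading them, the sketch below computes a standard error of the mean from raw trial values and the percentage spread between run averages. It assumes the conventional definition (sample standard deviation divided by the square root of N); whether the Phoronix Test Suite uses exactly this formula is an assumption here. The function names and the raw trial values are hypothetical.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


def relative_spread(means):
    """Largest gap between run averages, as a percentage of the smallest one."""
    return (max(means) - min(means)) / min(means) * 100.0


# Hypothetical raw trial values (not taken from the result file).
trials = [10.0, 12.0, 14.0]
print(standard_error(trials))

# The three AMG figures of merit reported above.
print(relative_spread([259341600, 259622700, 259329700]))
```

When the spread between runs is of the same order as the standard errors, the runs are best read as equivalent within noise.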
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Realtime Enabled Repeat Run 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 29.19 29.13 29.14 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 6 Two-Pass Enabled Repeat Run 5 10 15 20 25 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 22.80 22.89 22.77 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.1-rc Encoder Mode: Speed 8 Realtime Enabled Repeat Run 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 SE +/- 0.12, N = 3 114.16 113.96 113.06 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K Enabled Repeat Run 0.036 0.072 0.108 0.144 0.18 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.16 0.16 0.16 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K Enabled Repeat Run 0.891 1.782 2.673 3.564 4.455 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.96 3.96 3.94 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K Enabled Repeat Run 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 16.56 16.54 16.41 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K Enabled Repeat Run 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 7.59 7.56 7.56 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K Enabled Repeat Run 10 20 30 40 50 SE +/- 0.06, N = 3 SE +/- 0.21, N = 3 SE +/- 0.43, N = 15 42.80 43.24 40.83 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K Enabled Repeat Run 12 24 36 48 60 SE +/- 0.39, N = 14 SE +/- 0.63, N = 15 SE +/- 0.03, N = 3 52.82 51.82 51.88 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p Enabled Repeat Run 0.1125 0.225 0.3375 0.45 0.5625 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.50 0.50 0.50 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p Enabled Repeat Run 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 7.81 7.82 7.80 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p Enabled Repeat Run 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.21, N = 3 31.11 31.05 30.84 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p Enabled Repeat Run 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 23.17 23.20 23.12 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p Enabled Repeat Run 30 60 90 120 150 SE +/- 1.42, N = 13 SE +/- 1.83, N = 3 SE +/- 0.15, N = 3 135.81 137.23 138.16 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p Enabled Repeat Run 30 60 90 120 150 SE +/- 2.05, N = 3 SE +/- 1.46, N = 15 SE +/- 1.88, N = 4 152.40 156.45 152.03 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP Enabled Repeat Run 50 100 150 200 250 SE +/- 0.44, N = 3 SE +/- 0.26, N = 3 SE +/- 0.15, N = 3 209.65 210.97 208.62 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding Enabled Repeat Run 300 600 900 1200 1500 SE +/- 0.30, N = 3 SE +/- 0.19, N = 3 SE +/- 0.92, N = 3 1200.82 1200.37 1200.48 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding Enabled Repeat Run 400 800 1200 1600 2000 SE +/- 0.63, N = 3 SE +/- 0.24, N = 3 SE +/- 1.19, N = 3 1745.70 1743.56 1745.70 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding Enabled Repeat Run 300 600 900 1200 1500 SE +/- 14.15, N = 4 SE +/- 5.15, N = 3 SE +/- 4.97, N = 3 1236.02 1246.17 1223.27 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding Enabled Repeat Run 400 800 1200 1600 2000 SE +/- 4.65, N = 4 SE +/- 0.00, N = 3 SE +/- 5.37, N = 3 2072.06 2080.12 2074.75 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile runs a coding test of both compression and decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Medium Enabled Repeat Run 1.1336 2.2672 3.4008 4.5344 5.668 SE +/- 0.0020, N = 3 SE +/- 0.0024, N = 3 SE +/- 0.0044, N = 3 5.0307 5.0381 5.0345 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Thorough Enabled Repeat Run 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 11.67 11.67 11.68 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive Enabled Repeat Run 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 87.81 87.85 87.86 1. (CXX) g++ options: -O3 -flto -pthread
Basis Universal Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S Enabled Repeat Run 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 21.45 21.39 21.36 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 Enabled Repeat Run 2 4 6 8 10 SE +/- 0.004, N = 3 SE +/- 0.005, N = 3 SE +/- 0.002, N = 3 6.257 6.250 6.248 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 Enabled Repeat Run 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 27.61 27.60 27.61 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 Enabled Repeat Run 12 24 36 48 60 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 51.56 51.54 51.55 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only Enabled Repeat Run 30 60 90 120 150 SE +/- 0.22, N = 3 SE +/- 0.19, N = 3 SE +/- 0.11, N = 3 132.84 132.87 132.99
Botan Botan is a cross-platform open-source C++ crypto library that supports most publicly known cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: KASUMI Enabled Repeat Run 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 105.70 105.76 105.75 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: AES-256 Enabled Repeat Run 2K 4K 6K 8K 10K SE +/- 25.80, N = 3 SE +/- 4.98, N = 3 SE +/- 0.38, N = 3 7826.31 7848.75 7852.89 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: Twofish Enabled Repeat Run 90 180 270 360 450 SE +/- 0.53, N = 3 SE +/- 0.37, N = 3 SE +/- 0.64, N = 3 427.59 426.30 426.80 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: Blowfish Enabled Repeat Run 120 240 360 480 600 SE +/- 0.30, N = 3 SE +/- 0.15, N = 3 SE +/- 0.32, N = 3 545.75 545.73 545.86 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.13.0 Test: CAST-256 Enabled Repeat Run 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 162.51 162.52 162.50 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI Repeat Enabled Run 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 105.73 105.78 105.73 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt Repeat Enabled Run 20 40 60 80 100 SE +/- 0.15, N = 3 SE +/- 0.19, N = 3 SE +/- 0.17, N = 3 103.64 103.65 103.50 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 Repeat Enabled Run 2K 4K 6K 8K 10K SE +/- 74.80, N = 3 SE +/- 0.15, N = 3 SE +/- 0.75, N = 3 7778.53 7854.71 7854.05 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt Repeat Enabled Run 2K 4K 6K 8K 10K SE +/- 77.84, N = 3 SE +/- 0.09, N = 3 SE +/- 22.19, N = 3 7761.24 7840.11 7816.27 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish Repeat Enabled Run 90 180 270 360 450 SE +/- 0.25, N = 3 SE +/- 0.04, N = 3 SE +/- 0.19, N = 3 427.37 427.84 427.28 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt Repeat Enabled Run 90 180 270 360 450 SE +/- 0.37, N = 3 SE +/- 0.22, N = 3 SE +/- 0.15, N = 3 425.88 426.10 426.12 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish Repeat Enabled Run 120 240 360 480 600 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.28, N = 3 545.42 545.45 544.75 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt Repeat Enabled Run 120 240 360 480 600 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 SE +/- 0.33, N = 3 535.87 535.85 535.21 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 Repeat Enabled Run 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 162.60 162.63 162.60 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt Repeat Enabled Run 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 162.33 162.32 162.31 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 Repeat Enabled Run 200 400 600 800 1000 SE +/- 0.57, N = 3 SE +/- 1.45, N = 3 SE +/- 0.89, N = 3 941.53 938.78 940.83 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt Repeat Enabled Run 200 400 600 800 1000 SE +/- 1.47, N = 3 SE +/- 1.07, N = 3 SE +/- 0.44, N = 3 936.21 935.22 936.65 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with a built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric Enabled Repeat Run 30K 60K 90K 120K 150K 152552 153258 152408 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
Build2 This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Enabled Repeat Run 30 60 90 120 150 SE +/- 0.35, N = 3 SE +/- 0.19, N = 3 SE +/- 0.34, N = 3 122.76 121.75 122.58
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Enabled Repeat Run 15 30 45 60 75 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 67.13 67.13 67.12 1. (CC) gcc options: -lm -lpthread -O3
Caffe This is a benchmark of the Caffe deep learning framework. It currently supports the AlexNet and GoogleNet models and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 Enabled Repeat Run 7K 14K 21K 28K 35K SE +/- 21.53, N = 3 SE +/- 61.74, N = 3 SE +/- 28.35, N = 3 34712 34770 34786 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 Enabled Repeat Run 20K 40K 60K 80K 100K SE +/- 39.14, N = 3 SE +/- 16.20, N = 3 SE +/- 62.17, N = 3 88739 88720 88718 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU Enabled Repeat Run 3K 6K 9K 12K 15K SE +/- 14.74, N = 3 SE +/- 42.28, N = 3 SE +/- 23.78, N = 3 12346 12308 12279
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and is benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics Enabled Repeat Run 40 80 120 160 200 SE +/- 0.09, N = 3 SE +/- 0.22, N = 3 SE +/- 0.07, N = 3 159.78 159.40 166.03 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OpenBenchmarking.org Iterations Per Second, More Is Better Cryptsetup PBKDF2-whirlpool Enabled Repeat Run 200K 400K 600K 800K 1000K SE +/- 1384.00, N = 3 SE +/- 805.40, N = 3 SE +/- 924.33, N = 3 852505 855283 852965
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Encryption Enabled Repeat Run 1200 2400 3600 4800 6000 SE +/- 13.50, N = 3 SE +/- 10.24, N = 3 SE +/- 7.67, N = 3 5430.2 5425.9 5430.1
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Decryption Enabled Repeat Run 1200 2400 3600 4800 6000 SE +/- 15.52, N = 3 SE +/- 8.66, N = 3 SE +/- 9.91, N = 3 5430.1 5427.0 5425.7
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Encryption Enabled Repeat Run 200 400 600 800 1000 SE +/- 1.04, N = 3 SE +/- 1.34, N = 3 SE +/- 0.25, N = 3 773.1 774.0 774.9
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Decryption Enabled Repeat Run 160 320 480 640 800 SE +/- 0.35, N = 3 SE +/- 0.26, N = 3 SE +/- 0.20, N = 3 729.5 729.7 729.6
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Encryption Enabled Repeat Run 110 220 330 440 550 SE +/- 0.47, N = 3 SE +/- 0.88, N = 3 SE +/- 0.10, N = 3 487.4 487.2 487.6
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Decryption Enabled Repeat Run 110 220 330 440 550 SE +/- 0.00, N = 2 SE +/- 0.38, N = 3 SE +/- 0.12, N = 3 488.4 488.2 488.3
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Encryption Enabled Repeat Run 1000 2000 3000 4000 5000 SE +/- 3.80, N = 3 SE +/- 6.39, N = 3 SE +/- 8.27, N = 3 4819.2 4824.1 4823.2
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Decryption Enabled Repeat Run 1000 2000 3000 4000 5000 SE +/- 4.47, N = 3 SE +/- 5.46, N = 3 SE +/- 8.27, N = 3 4793.8 4799.9 4800.5
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Encryption Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.49, N = 3 SE +/- 0.50, N = 2 774.4 774.9 775.1
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Encryption Enabled Repeat Run 110 220 330 440 550 SE +/- 0.41, N = 3 SE +/- 0.30, N = 2 SE +/- 0.12, N = 3 488.0 487.1 487.3
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Decryption Enabled Repeat Run 110 220 330 440 550 SE +/- 0.27, N = 3 SE +/- 0.24, N = 3 487.9 487.9 488.0
OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Decryption Enabled Repeat Run 160 320 480 640 800 SE +/- 0.25, N = 2 SE +/- 0.27, N = 3 729.3 729.6 729.5
Cython Benchmark Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark of the system's Cython performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Cython Benchmark 0.29.21 Test: N-Queens Enabled Repeat Run 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 20.37 20.39 20.35
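For readers unfamiliar with the N-Queens workload named above, the following is a minimal pure-Python sketch of the problem: counting the ways to place n queens on an n x n board so that none attack each other. It is an illustration only and is not the code bundled with Cython; the function name is hypothetical.

```python
def count_n_queens(n):
    """Count placements of n mutually non-attacking queens on an n x n board."""

    def place(row, cols, diag1, diag2):
        # One queen is placed per row; a full board counts as one solution.
        if row == n:
            return 1
        total = 0
        for col in range(n):
            d1, d2 = row - col, row + col
            # Skip squares attacked along a column or either diagonal.
            if col in cols or d1 in diag1 or d2 in diag2:
                continue
            total += place(row + 1, cols | {col}, diag1 | {d1}, diag2 | {d2})
        return total

    return place(0, frozenset(), frozenset(), frozenset())


print(count_n_queens(8))  # 92
```

The work grows quickly with n and runs on a single thread, which is why this kind of search is a common interpreter and compiler benchmark.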
Darktable Darktable is an open-source photography / workflow application. This test will use any system-installed Darktable program or, on Windows, will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Boat - Acceleration: CPU-only Enabled Repeat Run 0.968 1.936 2.904 3.872 4.84 SE +/- 0.021, N = 3 SE +/- 0.022, N = 3 SE +/- 0.016, N = 3 4.300 4.302 4.291
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Masskrug - Acceleration: CPU-only Enabled Repeat Run 1.0289 2.0578 3.0867 4.1156 5.1445 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 4.573 4.571 4.558
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Server Rack - Acceleration: CPU-only Enabled Repeat Run 0.0371 0.0742 0.1113 0.1484 0.1855 SE +/- 0.002, N = 15 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.165 0.163 0.165
OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Server Room - Acceleration: CPU-only Enabled Repeat Run 0.7814 1.5628 2.3442 3.1256 3.907 SE +/- 0.004, N = 3 SE +/- 0.001, N = 3 SE +/- 0.007, N = 3 3.468 3.458 3.473
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 4K Enabled Repeat Run 40 80 120 160 200 SE +/- 0.18, N = 3 SE +/- 0.27, N = 3 SE +/- 0.27, N = 3 189.80 189.52 189.50 MIN: 176.26 / MAX: 202.54 MIN: 175.59 / MAX: 203 MIN: 177.16 / MAX: 202.89 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 1080p Enabled Repeat Run 150 300 450 600 750 SE +/- 0.85, N = 3 SE +/- 0.91, N = 3 SE +/- 1.92, N = 3 707.73 708.63 709.28 MIN: 632.84 / MAX: 768.08 MIN: 636.52 / MAX: 768.39 MIN: 632.86 / MAX: 774.84 1. (CC) gcc options: -pthread
DeepSpeech Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three-minute audio recording. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better DeepSpeech 0.6 Acceleration: CPU Enabled Repeat Run 14 28 42 56 70 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 60.45 59.56 60.30
Dolfyn Dolfyn is a Computational Fluid Dynamics (CFD) code built on modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics Enabled Repeat Run 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 15.06 15.05 15.03
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Crown Enabled Repeat Run 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.13, N = 3 SE +/- 0.05, N = 3 14.81 14.64 14.64 MIN: 14.58 / MAX: 15.49 MIN: 13.95 / MAX: 15.33 MIN: 14.37 / MAX: 15.3
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Enabled Repeat Run 4 8 12 16 20 SE +/- 0.12, N = 3 SE +/- 0.14, N = 3 SE +/- 0.15, N = 3 13.95 13.86 13.92 MIN: 13.68 / MAX: 14.36 MIN: 13.61 / MAX: 14.36 MIN: 13.63 / MAX: 14.38
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Obj Enabled Repeat Run 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 12.80 12.79 12.50 MIN: 12.73 / MAX: 13.04 MIN: 12.68 / MAX: 13.03 MIN: 12.24 / MAX: 12.8
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Enabled Repeat Run 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 SE +/- 0.07, N = 3 16.39 16.30 16.38 MIN: 16.23 / MAX: 16.83 MIN: 15.75 / MAX: 16.79 MIN: 16.13 / MAX: 16.84
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Obj Enabled Repeat Run 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 14.47 14.45 14.19 MIN: 14.33 / MAX: 14.85 MIN: 14.33 / MAX: 14.87 MIN: 14.03 / MAX: 14.6
Etcpak Etcpak is the self-proclaimed "fastest ETC compressor on the planet", focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 Enabled Repeat Run 500 1000 1500 2000 2500 SE +/- 0.83, N = 3 SE +/- 0.55, N = 3 SE +/- 0.42, N = 3 2387.98 2387.90 2380.09 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 Enabled Repeat Run 80 160 240 320 400 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 369.31 369.33 369.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 Enabled Repeat Run 50 100 150 200 250 SE +/- 0.07, N = 3 SE +/- 0.28, N = 3 SE +/- 0.07, N = 3 209.63 209.36 209.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 Enabled Repeat Run 1500 3000 4500 6000 7500 SE +/- 25.44, N = 3 SE +/- 30.65, N = 3 SE +/- 30.66, N = 3 7133.5 7121.2 7147.5 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 Enabled Repeat Run 6K 12K 18K 24K 30K SE +/- 33.51, N = 3 SE +/- 336.17, N = 3 SE +/- 300.18, N = 3 26220 26165 26855 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
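As a reminder of what the FFTW results above measure, here is a minimal sketch of the one-dimensional discrete Fourier transform computed directly from its definition. This is the O(n^2) textbook form, not the fast algorithm FFTW implements, and the function name is hypothetical.

```python
import cmath


def dft(x):
    """Direct DFT: X[k] = sum over t of x[t] * exp(-2*pi*i*k*t/n)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


# A constant signal concentrates all of its energy in the k = 0 bin.
print(dft([1, 1, 1, 1]))
```

A 2D transform such as the "2D FFT Size 4096" case applies the same one-dimensional transform along every row and then along every column.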
FinanceBench FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Scholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP Enabled Repeat Run 6K 12K 18K 24K 30K SE +/- 151.48, N = 3 SE +/- 46.34, N = 3 SE +/- 74.39, N = 3 26926.41 26809.90 26676.77 1. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP Enabled Repeat Run 9K 18K 27K 36K 45K SE +/- 58.76, N = 3 SE +/- 12.45, N = 3 SE +/- 79.47, N = 3 41370.77 41346.52 41115.48 1. (CXX) g++ options: -O3 -march=native -fopenmp
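The FinanceBench description mentions a Black-Scholes-Merton process with an analytic European option engine. As a minimal sketch of that kind of calculation, the code below prices a European call with the closed-form Black-Scholes formula, assuming no dividends. It is not FinanceBench's code; the function names and the input values are hypothetical.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_call(spot, strike, rate, vol, years):
    """Closed-form price of a European call on a non-dividend-paying asset."""
    sqrt_t = math.sqrt(years)
    d1 = (math.log(spot / strike) + (rate + 0.5 * vol * vol) * years) / (vol * sqrt_t)
    d2 = d1 - vol * sqrt_t
    return spot * norm_cdf(d1) - strike * math.exp(-rate * years) * norm_cdf(d2)


# Hypothetical inputs: at-the-money call, 5% rate, 20% volatility, one year.
print(round(black_scholes_call(100.0, 100.0, 0.05, 0.2, 1.0), 4))  # 10.4506
```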
GEGL GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Crop Enabled Repeat Run 2 4 6 8 10 SE +/- 0.024, N = 3 SE +/- 0.023, N = 3 SE +/- 0.023, N = 3 6.479 6.485 6.482
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Scale Enabled Repeat Run 1.0634 2.1268 3.1902 4.2536 5.317 SE +/- 0.017, N = 3 SE +/- 0.013, N = 3 SE +/- 0.008, N = 3 4.711 4.726 4.723
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Color Enhance Enabled Repeat Run 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 42.89 42.87 42.93
OpenBenchmarking.org Seconds, Fewer Is Better GEGL Operation: Rotate 90 Degrees Enabled Repeat Run 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 29.54 29.51 29.56
GIMP GIMP is an open-source image manipulation program. This test profile will use the system-provided GIMP program or, on Windows, relies upon a pre-packaged Windows binary from upstream GIMP.org. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: resize Enabled Repeat Run 2 4 6 8 10 SE +/- 0.048, N = 3 SE +/- 0.024, N = 3 SE +/- 0.023, N = 3 6.212 6.213 6.222
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: rotate Enabled Repeat Run 3 6 9 12 15 SE +/- 0.017, N = 3 SE +/- 0.007, N = 3 SE +/- 0.008, N = 3 9.027 9.036 9.044
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: auto-levels Enabled Repeat Run 3 6 9 12 15 SE +/- 0.005, N = 3 SE +/- 0.021, N = 3 SE +/- 0.035, N = 3 9.323 9.304 9.334
OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.22 Test: unsharp-mask Enabled Repeat Run 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 11.21 11.17 11.20
Git This test measures the time needed to carry out some sample Git operations on an example, static repository that happens to be a copy of the GNOME GTK tool-kit repository. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Git Time To Complete Common Git Commands Enabled Repeat Run 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 40.97 40.88 40.99 1. git version 2.30.2
GNU GMP GMPbench GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GMPbench Score, More Is Better GNU GMP GMPbench 6.2.1 Total Time Repeat Enabled Run 1400 2800 4200 5600 7000 6432.5 6449.1 6438.8 1. (CC) gcc options: -O3 -fomit-frame-pointer -lm
GNU Radio GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Five Back to Back FIR Filters Enabled Repeat Run 300 600 900 1200 1500 SE +/- 5.99, N = 3 SE +/- 14.53, N = 3 SE +/- 14.23, N = 3 1527.4 1499.4 1506.3 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Signal Source (Cosine) Enabled Repeat Run 700 1400 2100 2800 3500 SE +/- 2.52, N = 3 SE +/- 4.70, N = 3 SE +/- 1.51, N = 3 3244.4 3240.7 3238.8 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FIR Filter Enabled Repeat Run 160 320 480 640 800 SE +/- 0.83, N = 3 SE +/- 0.71, N = 3 SE +/- 0.98, N = 3 734.0 733.4 733.7 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: IIR Filter Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.78, N = 3 SE +/- 0.46, N = 3 SE +/- 0.58, N = 3 837.7 837.4 837.4 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FM Deemphasis Filter Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.57, N = 3 SE +/- 2.55, N = 3 SE +/- 0.47, N = 3 1084.4 1082.2 1084.2 1. 3.8.2.0
OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Hilbert Transform Enabled Repeat Run 130 260 390 520 650 SE +/- 1.46, N = 3 SE +/- 0.84, N = 3 SE +/- 0.59, N = 3 594.4 593.7 591.4 1. 3.8.2.0
Google SynthMark SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.45, N = 3 SE +/- 0.35, N = 3 SE +/- 2.29, N = 3 928.21 928.09 926.29 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate Enabled Repeat Run 200 400 600 800 1000 SE +/- 6.00, N = 3 SE +/- 4.10, N = 3 SE +/- 1.20, N = 3 1073 1064 1041 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen Enabled Repeat Run 40 80 120 160 200 165 165 165 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced Enabled Repeat Run 50 100 150 200 250 218 218 217 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing Enabled Repeat Run 200 400 600 800 1000 SE +/- 1.20, N = 3 SE +/- 1.33, N = 3 SE +/- 3.53, N = 3 1101 1103 1089 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian Enabled Repeat Run 70 140 210 280 350 SE +/- 0.33, N = 3 310 310 309 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space Enabled Repeat Run 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 SE +/- 0.88, N = 3 1272 1275 1256 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Hugin Hugin is an open-source, cross-platform panorama photo stitcher software package. This test profile times how long it takes to run the assistant and panorama photo stitching on a set of images. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Hugin Panorama Photo Assistant + Stitching Time Enabled Repeat Run 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.24, N = 3 SE +/- 0.18, N = 3 38.73 38.34 38.67
OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar Enabled Repeat Run 1.0643 2.1286 3.1929 4.2572 5.3215 SE +/- 0.004, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 4.725 4.730 4.716
Java Gradle Build This test runs Java software project builds using the Gradle build system. It is intended to give developers an idea as to the build performance for development activities and build servers. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Java Gradle Build Gradle Build: Reactor Enabled Repeat Run 40 80 120 160 200 SE +/- 2.16, N = 12 SE +/- 1.79, N = 12 SE +/- 1.63, N = 12 173.55 171.07 174.16
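The Gradle result is one of the few here reported over N = 12 runs rather than 3, and its standard error is comparatively large. As a rough sketch, assuming the reported "SE" is the conventional standard error of the mean (sample standard deviation divided by the square root of N) and that the first SE on the line belongs to the first value, the run-to-run spread can be recovered as follows. The function name `implied_spread` is illustrative only.

```python
import math


def implied_spread(mean, standard_error, n):
    """Recover the implied standard deviation and coefficient of variation.

    Assumption: standard_error = sample_std_dev / sqrt(n).
    """
    std_dev = standard_error * math.sqrt(n)
    coeff_var_pct = 100.0 * std_dev / mean
    return std_dev, coeff_var_pct


# First Gradle Reactor result from the line above: 173.55 s, SE +/- 2.16, N = 12.
sd, cv = implied_spread(mean=173.55, standard_error=2.16, n=12)
print(f"implied standard deviation: {sd:.2f} s, coefficient of variation: {cv:.2f}%")
```

A spread of a few percent between individual runs is consistent with the three Gradle averages differing by a few seconds without any real change between configurations.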
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Enabled Repeat Run 400K 800K 1200K 1600K 2000K SE +/- 13383.24, N = 3 SE +/- 12333.33, N = 3 SE +/- 10588.25, N = 3 2065333 2054667 2088333 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
JPEG XL The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 5 Enabled Repeat Run 20 40 60 80 100 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 SE +/- 0.11, N = 3 74.76 75.02 74.75 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 Enabled Repeat Run 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 10.10 10.11 10.09 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 5 Enabled Repeat Run 20 40 60 80 100 SE +/- 0.36, N = 3 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 74.61 74.94 74.94 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 Enabled Repeat Run 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.31, N = 3 SE +/- 0.49, N = 3 74.70 74.86 75.57 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
JPEG XL Decoding The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile measures JPEG XL decode performance to a PNG output file; the pts/jpegxl test covers encode performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: 1 Enabled Repeat Run 13 26 39 52 65 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.13, N = 3 56.70 56.71 56.37
Kvazaar This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar won the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Enabled Repeat Run 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 6.41 6.42 6.41 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium Enabled Repeat Run 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 27.89 27.92 27.92 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Enabled Repeat Run 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 17.71 17.71 17.64 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Enabled Repeat Run 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 31.73 31.89 31.65 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast Enabled Repeat Run 15 30 45 60 75 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 65.54 65.72 65.40 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast Enabled Repeat Run 30 60 90 120 150 SE +/- 0.13, N = 3 SE +/- 0.23, N = 3 SE +/- 0.21, N = 3 120.06 120.19 119.79 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 Enabled Repeat Run 2 4 6 8 10 SE +/- 0.004, N = 3 SE +/- 0.014, N = 3 SE +/- 0.005, N = 3 6.521 6.532 6.526 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm
OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen Enabled Repeat Run 200 400 600 800 1000 SE +/- 8.54, N = 3 SE +/- 4.04, N = 3 SE +/- 7.88, N = 3 840 831 841 1. (CXX) g++ options: -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 Enabled Repeat Run 7 14 21 28 35 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 SE +/- 0.06, N = 3 30.97 31.10 31.05 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 Enabled Repeat Run 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 11.05 11.06 11.03 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Enabled Repeat Run 0.6467 1.2934 1.9401 2.5868 3.2335 SE +/- 0.003, N = 3 SE +/- 0.007, N = 3 SE +/- 0.008, N = 3 2.855 2.867 2.874 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Enabled Repeat Run 12 24 36 48 60 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 SE +/- 0.03, N = 3 51.51 51.57 52.21 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Enabled Repeat Run 1.1502 2.3004 3.4506 4.6008 5.751 SE +/- 0.017, N = 3 SE +/- 0.002, N = 3 SE +/- 0.005, N = 3 5.112 5.085 5.092 1. (CXX) g++ options: -O3 -fPIC -lm
librsvg RSVG/librsvg is an SVG vector graphics library. This test profile times how long it takes to complete various operations by rsvg-convert. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better librsvg Operation: SVG Files To PNG Enabled Repeat Run 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 16.40 16.30 16.46 1. rsvg-convert version 2.50.3
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 Enabled Repeat Run 20M 40M 60M 80M 100M SE +/- 4910.31, N = 3 SE +/- 978008.35, N = 3 SE +/- 8504.90, N = 3 82626333 81644000 82631000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 2 - Buffer Length: 256 - Filter Length: 57 Enabled Repeat Run 30M 60M 90M 120M 150M SE +/- 1078368.11, N = 3 SE +/- 1484620.42, N = 6 SE +/- 33829.64, N = 3 156866667 155633333 157906667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 Enabled Repeat Run 70M 140M 210M 280M 350M SE +/- 158359.65, N = 3 SE +/- 817176.71, N = 3 SE +/- 968234.36, N = 3 321016667 319096667 319286667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 Enabled Repeat Run 130M 260M 390M 520M 650M SE +/- 463656.96, N = 3 SE +/- 286724.80, N = 3 SE +/- 8429373.91, N = 3 629876667 629466667 621093333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 12 - Buffer Length: 256 - Filter Length: 57 Enabled Repeat Run 140M 280M 420M 560M 700M SE +/- 2697671.84, N = 3 SE +/- 489534.93, N = 3 SE +/- 162924.66, N = 3 662150000 658753333 659116667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
LuaJIT This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better LuaJIT 2.1-git Test: Composite Enabled Repeat Run 400 800 1200 1600 2000 SE +/- 1.28, N = 3 SE +/- 1.32, N = 3 SE +/- 3.15, N = 3 1832.95 1828.04 1846.72 1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector
LuaRadio LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Five Back to Back FIR Filters Enabled Repeat Run 300 600 900 1200 1500 SE +/- 1.82, N = 3 SE +/- 8.83, N = 3 SE +/- 4.43, N = 3 1541.1 1528.4 1536.0
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: FM Deemphasis Filter Enabled Repeat Run 120 240 360 480 600 SE +/- 0.23, N = 3 SE +/- 0.10, N = 3 SE +/- 0.06, N = 3 547.0 546.9 547.0
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Hilbert Transform Enabled Repeat Run 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.25, N = 3 SE +/- 0.10, N = 3 107.2 107.2 107.1
OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Complex Phase Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.32, N = 3 SE +/- 2.01, N = 3 SE +/- 1.86, N = 3 787.1 784.4 783.3
LuxCoreRender LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.3 Scene: DLSC Enabled Repeat Run 0.441 0.882 1.323 1.764 2.205 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.96 1.96 1.94 MIN: 1.92 / MAX: 2.02 MIN: 1.92 / MAX: 2.01 MIN: 1.89 / MAX: 1.99
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.3 Scene: Rainbow Colors and Prism Enabled Repeat Run 0.4815 0.963 1.4445 1.926 2.4075 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.14 2.13 2.14 MIN: 2.09 / MAX: 2.16 MIN: 2.08 / MAX: 2.16 MIN: 2.07 / MAX: 2.17
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed Enabled Repeat Run 2K 4K 6K 8K 10K SE +/- 31.07, N = 3 SE +/- 6.63, N = 3 SE +/- 31.08, N = 3 10096.9 10047.5 10026.8 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed Enabled Repeat Run 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 65.95 66.00 66.00 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed Enabled Repeat Run 2K 4K 6K 8K 10K SE +/- 7.93, N = 3 SE +/- 5.72, N = 3 SE +/- 13.00, N = 3 10672.3 10655.3 10670.1 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed Enabled Repeat Run 14 28 42 56 70 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.05, N = 3 64.67 64.67 64.72 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed Enabled Repeat Run 2K 4K 6K 8K 10K SE +/- 7.05, N = 3 SE +/- 9.28, N = 3 SE +/- 4.62, N = 3 10679.1 10683.2 10675.7 1. (CC) gcc options: -O3
Mobile Neural Network MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: SqueezeNetV1.0 Enabled Repeat Run 0.8525 1.705 2.5575 3.41 4.2625 SE +/- 0.008, N = 3 SE +/- 0.032, N = 3 SE +/- 0.018, N = 3 3.751 3.789 3.753 MIN: 3.69 / MAX: 11.5 MIN: 3.72 / MAX: 11.42 MIN: 3.69 / MAX: 4.53 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: resnet-v2-50 Enabled Repeat Run 5 10 15 20 25 SE +/- 0.11, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 19.79 19.70 19.79 MIN: 19.55 / MAX: 21.46 MIN: 19.53 / MAX: 21.51 MIN: 19.59 / MAX: 21.26 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: MobileNetV2_224 Enabled Repeat Run 0.448 0.896 1.344 1.792 2.24 SE +/- 0.006, N = 3 SE +/- 0.013, N = 3 SE +/- 0.004, N = 3 1.971 1.980 1.991 MIN: 1.92 / MAX: 2.79 MIN: 1.92 / MAX: 2.88 MIN: 1.94 / MAX: 2.84 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: mobilenet-v1-1.0 Enabled Repeat Run 0.4356 0.8712 1.3068 1.7424 2.178 SE +/- 0.007, N = 3 SE +/- 0.005, N = 3 SE +/- 0.012, N = 3 1.921 1.923 1.936 MIN: 1.87 / MAX: 2.71 MIN: 1.88 / MAX: 2.74 MIN: 1.88 / MAX: 3.19 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: inception-v3 Enabled Repeat Run 6 12 18 24 30 SE +/- 0.23, N = 3 SE +/- 0.15, N = 3 SE +/- 0.12, N = 3 23.06 22.90 23.19 MIN: 22.7 / MAX: 25.12 MIN: 22.59 / MAX: 25.48 MIN: 22.9 / MAX: 25.71 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Monte Carlo Simulations of Ionised Nebulae MOCASSIN (Monte Carlo Simulations of Ionised Nebulae) is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 Enabled Repeat Run 40 80 120 160 200 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 174 173 173 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lrt -lz
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Enabled Repeat Run 0.2923 0.5846 0.8769 1.1692 1.4615 SE +/- 0.00186, N = 3 SE +/- 0.00320, N = 3 SE +/- 0.00312, N = 3 1.29902 1.29529 1.29607
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and varying the problem size. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C Enabled Repeat Run 300 600 900 1200 1500 SE +/- 14.97, N = 15 SE +/- 14.97, N = 15 SE +/- 10.80, N = 3 1554.98 1565.92 1581.87 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C Enabled Repeat Run 7K 14K 21K 28K 35K SE +/- 10.27, N = 3 SE +/- 75.77, N = 3 SE +/- 35.20, N = 3 31194.25 31174.97 30717.61 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
NCNN NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet Enabled Repeat Run 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.18, N = 4 SE +/- 0.23, N = 3 15.72 16.00 15.94 MIN: 15.49 / MAX: 16.88 MIN: 15.6 / MAX: 16.91 MIN: 15.49 / MAX: 16.78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 Enabled Repeat Run 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.05, N = 3 SE +/- 0.13, N = 4 SE +/- 0.09, N = 3 4.42 4.49 4.46 MIN: 4.24 / MAX: 6.41 MIN: 4.23 / MAX: 5.82 MIN: 4.23 / MAX: 5.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 Enabled Repeat Run 0.8213 1.6426 2.4639 3.2852 4.1065 SE +/- 0.06, N = 3 SE +/- 0.07, N = 4 SE +/- 0.04, N = 3 3.63 3.65 3.60 MIN: 3.48 / MAX: 4.52 MIN: 3.49 / MAX: 4.52 MIN: 3.48 / MAX: 4.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 Enabled Repeat Run 0.8528 1.7056 2.5584 3.4112 4.264 SE +/- 0.01, N = 3 SE +/- 0.01, N = 4 SE +/- 0.03, N = 3 3.76 3.77 3.79 MIN: 3.7 / MAX: 4.31 MIN: 3.7 / MAX: 5.82 MIN: 3.69 / MAX: 6.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet Enabled Repeat Run 0.8078 1.6156 2.4234 3.2312 4.039 SE +/- 0.09, N = 3 SE +/- 0.11, N = 4 SE +/- 0.08, N = 3 3.50 3.53 3.59 MIN: 3.34 / MAX: 4.22 MIN: 3.34 / MAX: 4.34 MIN: 3.38 / MAX: 4.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 Enabled Repeat Run 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.05, N = 3 SE +/- 0.11, N = 4 SE +/- 0.12, N = 3 5.63 5.68 5.73 MIN: 5.52 / MAX: 6.51 MIN: 5.5 / MAX: 6.87 MIN: 5.53 / MAX: 6.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface Enabled Repeat Run 0.315 0.63 0.945 1.26 1.575 SE +/- 0.04, N = 3 SE +/- 0.03, N = 4 SE +/- 0.04, N = 3 1.35 1.35 1.40 MIN: 1.26 / MAX: 1.47 MIN: 1.27 / MAX: 1.48 MIN: 1.28 / MAX: 1.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet Enabled Repeat Run 3 6 9 12 15 SE +/- 0.36, N = 3 SE +/- 0.34, N = 4 SE +/- 0.39, N = 3 12.16 12.15 12.60 MIN: 11.71 / MAX: 13.31 MIN: 11.72 / MAX: 13.4 MIN: 11.72 / MAX: 13.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 Enabled Repeat Run 13 26 39 52 65 SE +/- 0.10, N = 3 SE +/- 0.11, N = 4 SE +/- 1.26, N = 3 56.10 56.35 57.57 MIN: 55.69 / MAX: 58.35 MIN: 55.59 / MAX: 62.94 MIN: 55.66 / MAX: 733.01 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 Enabled Repeat Run 3 6 9 12 15 SE +/- 0.32, N = 3 SE +/- 0.26, N = 4 SE +/- 0.03, N = 3 12.60 12.74 12.93 MIN: 11.81 / MAX: 13.19 MIN: 11.82 / MAX: 13.44 MIN: 12.69 / MAX: 13.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet Enabled Repeat Run 3 6 9 12 15 SE +/- 0.18, N = 3 SE +/- 0.01, N = 4 SE +/- 0.17, N = 3 11.36 11.19 11.35 MIN: 11 / MAX: 11.98 MIN: 10.99 / MAX: 11.53 MIN: 10.99 / MAX: 12.87 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 Enabled Repeat Run 6 12 18 24 30 SE +/- 0.50, N = 3 SE +/- 0.04, N = 4 SE +/- 0.05, N = 3 24.05 24.59 24.65 MIN: 22.83 / MAX: 32.85 MIN: 24.27 / MAX: 31.94 MIN: 24.32 / MAX: 25.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny Enabled Repeat Run 6 12 18 24 30 SE +/- 0.17, N = 3 SE +/- 0.14, N = 4 SE +/- 0.20, N = 3 23.05 23.11 23.13 MIN: 22.55 / MAX: 23.56 MIN: 22.55 / MAX: 23.74 MIN: 22.55 / MAX: 24.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd Enabled Repeat Run 4 8 12 16 20 SE +/- 0.10, N = 3 SE +/- 0.06, N = 4 SE +/- 0.03, N = 3 18.10 18.18 18.16 MIN: 17.75 / MAX: 18.89 MIN: 17.83 / MAX: 18.68 MIN: 17.79 / MAX: 26.2 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m Enabled Repeat Run 3 6 9 12 15 SE +/- 0.09, N = 3 SE +/- 0.07, N = 4 SE +/- 0.07, N = 3 11.06 11.04 11.09 MIN: 10.79 / MAX: 12.12 MIN: 10.75 / MAX: 12.06 MIN: 10.88 / MAX: 12.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Ngspice Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile makes use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 Enabled Repeat Run 20 40 60 80 100 SE +/- 0.32, N = 3 SE +/- 0.69, N = 3 SE +/- 0.30, N = 3 108.73 107.26 108.10 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 Enabled Repeat Run 20 40 60 80 100 SE +/- 0.57, N = 3 SE +/- 0.29, N = 3 SE +/- 0.23, N = 3 87.39 86.23 86.39 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
OCRMyPDF OCRMyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs in which the text is selectable, searchable, and copy-paste capable. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OCRMyPDF 10.3.1+dfsg Processing 60 Page PDF Document Enabled Repeat Run 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 15.51 15.49 15.56
oneDNN This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Enabled Repeat Run 0.9097 1.8194 2.7291 3.6388 4.5485 SE +/- 0.00318, N = 3 SE +/- 0.00225, N = 3 SE +/- 0.00367, N = 3 4.04078 4.03238 4.04332 MIN: 3.91 MIN: 3.91 MIN: 3.92 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Enabled Repeat Run 3 6 9 12 15 SE +/- 0.00888, N = 3 SE +/- 0.00778, N = 3 SE +/- 0.00258, N = 3 10.57120 9.49268 10.56970 MIN: 10.5 MIN: 9.43 MIN: 10.51 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
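The IP Shapes 3D f32 result is one of the few in this set where the three configurations visibly diverge. The sketch below shows one way to put a number on that gap, assuming the values and standard errors appear in the same order as the labels (so 10.57120 is the first configuration and 9.49268 the second). Dividing the difference by the combined standard error is only a rough heuristic for separating a real difference from noise, not a formal significance test, especially with N = 3. The function name `compare` is illustrative.

```python
import math


def compare(baseline, other, se_baseline, se_other):
    """Return the percent change from baseline and the difference
    expressed in units of the combined standard error."""
    pct_change = 100.0 * (other - baseline) / baseline
    # Combine the two standard errors in quadrature.
    combined_se = math.hypot(se_baseline, se_other)
    se_ratio = abs(other - baseline) / combined_se
    return pct_change, se_ratio


# Values taken from the oneDNN IP Shapes 3D f32 line above (ms, fewer is better).
pct, ratio = compare(10.57120, 9.49268, 0.00888, 0.00778)
print(f"change: {pct:.2f}%, difference / combined SE: {ratio:.1f}")
```

A negative percent change means the second configuration was faster, since this metric is "Fewer Is Better". A ratio far above 2 suggests the gap is well outside the reported run-to-run noise.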
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Enabled Repeat Run 0.1585 0.317 0.4755 0.634 0.7925 SE +/- 0.001249, N = 3 SE +/- 0.001477, N = 3 SE +/- 0.002770, N = 3 0.703196 0.704665 0.704534 MIN: 0.66 MIN: 0.66 MIN: 0.66 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Enabled Repeat Run 0.6462 1.2924 1.9386 2.5848 3.231 SE +/- 0.00634, N = 3 SE +/- 0.01085, N = 3 SE +/- 0.00257, N = 3 2.86126 2.77561 2.87200 MIN: 2.81 MIN: 2.69 MIN: 2.8 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU Enabled Repeat Run 2 4 6 8 10 SE +/- 0.00601, N = 3 SE +/- 0.00649, N = 3 SE +/- 0.00312, N = 3 8.39732 8.39297 8.39591 MIN: 8.25 MIN: 8.25 MIN: 8.3 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU Enabled Repeat Run 1.0461 2.0922 3.1383 4.1844 5.2305 SE +/- 0.03141, N = 3 SE +/- 0.01779, N = 3 SE +/- 0.03164, N = 3 4.60823 4.38277 4.64952 MIN: 4.13 MIN: 4.04 MIN: 4.15 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Enabled Repeat Run 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 13.95 13.86 13.93 MIN: 13.88 MIN: 13.78 MIN: 13.85 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Enabled Repeat Run 2 4 6 8 10 SE +/- 0.07203, N = 15 SE +/- 0.09537, N = 15 SE +/- 0.06939, N = 15 6.16825 6.19314 6.07807 MIN: 3.67 MIN: 3.68 MIN: 3.69 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Enabled Repeat Run 0.9426 1.8852 2.8278 3.7704 4.713 SE +/- 0.00971, N = 3 SE +/- 0.00355, N = 3 SE +/- 0.00910, N = 3 4.18919 4.17427 4.17526 MIN: 4.12 MIN: 4.13 MIN: 4.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Enabled Repeat Run 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 SE +/- 0.00, N = 3 12.12 12.06 12.10 MIN: 12.06 MIN: 11.93 MIN: 12.03 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Enabled Repeat Run 0.1851 0.3702 0.5553 0.7404 0.9255 SE +/- 0.003255, N = 3 SE +/- 0.003677, N = 3 SE +/- 0.003248, N = 3 0.822350 0.822519 0.821323 MIN: 0.8 MIN: 0.8 MIN: 0.8 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Enabled Repeat Run 0.3246 0.6492 0.9738 1.2984 1.623 SE +/- 0.01018, N = 3 SE +/- 0.00726, N = 3 SE +/- 0.01056, N = 3 1.43641 1.44288 1.43273 MIN: 1.37 MIN: 1.36 MIN: 1.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Enabled Repeat Run 700 1400 2100 2800 3500 SE +/- 2.69, N = 3 SE +/- 4.19, N = 3 SE +/- 3.51, N = 3 3091.74 2988.39 3066.19 MIN: 3081.74 MIN: 2975.82 MIN: 3055.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Enabled Repeat Run 400 800 1200 1600 2000 SE +/- 1.39, N = 3 SE +/- 1.82, N = 3 SE +/- 3.10, N = 3 1812.22 1760.75 1802.18 MIN: 1806.43 MIN: 1753.88 MIN: 1794.84 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Enabled Repeat Run 700 1400 2100 2800 3500 SE +/- 1.66, N = 3 SE +/- 5.44, N = 3 SE +/- 5.99, N = 3 3092.80 3001.60 3066.56 MIN: 3086.05 MIN: 2989.75 MIN: 3053.3 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU Enabled Repeat Run 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 15.82 15.82 15.81 MIN: 15.79 MIN: 15.79 MIN: 15.79 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU Enabled Repeat Run 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 16.45 16.45 16.45 MIN: 16.25 MIN: 16.24 MIN: 16.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU Enabled Repeat Run 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 16.74 16.73 16.70 MIN: 16.62 MIN: 16.63 MIN: 16.63 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Enabled Repeat Run 400 800 1200 1600 2000 SE +/- 2.19, N = 3 SE +/- 1.53, N = 3 SE +/- 1.75, N = 3 1814.66 1762.42 1800.95 MIN: 1808.15 MIN: 1756.24 MIN: 1793.72 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Enabled Repeat Run 0.7736 1.5472 2.3208 3.0944 3.868 SE +/- 0.00397, N = 3 SE +/- 0.00297, N = 3 SE +/- 0.00271, N = 3 3.42660 3.41721 3.43837 MIN: 3.35 MIN: 3.34 MIN: 3.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Enabled Repeat Run 700 1400 2100 2800 3500 SE +/- 1.99, N = 3 SE +/- 6.46, N = 3 SE +/- 17.72, N = 3 3092.81 2998.59 3075.31 MIN: 3084.13 MIN: 2981.81 MIN: 3049.81 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Enabled Repeat Run 400 800 1200 1600 2000 SE +/- 1.95, N = 3 SE +/- 4.58, N = 3 SE +/- 3.82, N = 3 1816.65 1762.48 1802.00 MIN: 1809.76 MIN: 1751.25 MIN: 1791.17 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Enabled Repeat Run 0.2866 0.5732 0.8598 1.1464 1.433 SE +/- 0.00109, N = 3 SE +/- 0.00115, N = 3 SE +/- 0.00054, N = 3 1.26765 1.26745 1.27367 MIN: 1.22 MIN: 1.22 MIN: 1.23 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU Enabled Repeat Run 0.787 1.574 2.361 3.148 3.935 SE +/- 0.00666, N = 3 SE +/- 0.00650, N = 3 SE +/- 0.00395, N = 3 3.49756 3.49474 3.48539 MIN: 3.43 MIN: 3.42 MIN: 3.42 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
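One way to judge whether two of the oneDNN results above differ by more than run-to-run noise is to express the gap between two means in units of their combined standard error. The sketch below is only a rough illustration: it assumes the "SE +/-" figures are standard errors of the mean, that the runs are independent, and that a normal approximation is tolerable even though N = 3 is small. The helper name `se_gap` is hypothetical, and the numbers are the first and second result columns copied from the listing.

```python
import math


def se_gap(mean_a, se_a, mean_b, se_b):
    """Gap between two means, in units of their combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / combined_se


# Recurrent Neural Network Training, f32 (ms): first vs. second result column
rnn_gap = se_gap(3091.74, 2.69, 2988.39, 4.19)

# Matrix Multiply Batch Shapes Transformer, bf16bf16bf16 (ms): first vs. second column
matmul_gap = se_gap(3.49756, 0.00666, 3.49474, 0.00650)

print(f"RNN training gap:    {rnn_gap:.2f} combined SEs")
print(f"Matrix multiply gap: {matmul_gap:.2f} combined SEs")
```

A gap of several combined standard errors, as in the RNN training case, suggests a real difference between the two runs, while a gap well below one, as in the matrix multiply case, is consistent with noise.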
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Enabled Repeat Run 100 200 300 400 500 SE +/- 1.59, N = 3 SE +/- 1.59, N = 3 SE +/- 2.42, N = 3 457 457 456 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.76, N = 3 SE +/- 1.88, N = 3 SE +/- 1.17, N = 3 852 853 854 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Enabled Repeat Run 20 40 60 80 100 SE +/- 0.17, N = 3 SE +/- 0.00, N = 3 84 84 85 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Enabled Repeat Run 4K 8K 12K 16K 20K SE +/- 14.97, N = 3 SE +/- 41.78, N = 3 SE +/- 28.64, N = 3 17685 17671 17624 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU Enabled Repeat Run 2K 4K 6K 8K 10K SE +/- 21.10, N = 3 SE +/- 36.97, N = 3 SE +/- 2.20, N = 3 7801 7800 7835 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenFOAM OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M Enabled Repeat Run 50 100 150 200 250 SE +/- 0.17, N = 3 SE +/- 0.44, N = 3 SE +/- 0.22, N = 3 228.01 227.61 227.09 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
OpenSCAD OpenSCAD is a programmer-focused solid 3D CAD modeller. OpenSCAD is free software and allows creating 3D CAD objects in a script-based modelling environment. This test profile uses the system-provided OpenSCAD program and times how long it takes to render different SCAD assets to PNG output. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Pistol Enabled Repeat Run 20 40 60 80 100 SE +/- 0.08, N = 3 SE +/- 0.24, N = 3 SE +/- 0.05, N = 3 77.85 78.07 78.21 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Retro Car Enabled Repeat Run 0.824 1.648 2.472 3.296 4.12 SE +/- 0.006, N = 3 SE +/- 0.005, N = 3 SE +/- 0.002, N = 3 3.662 3.660 3.656 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Mini-ITX Case Enabled Repeat Run 8 16 24 32 40 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 35.22 35.14 35.30 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Projector Mount Swivel Enabled Repeat Run 2 4 6 8 10 SE +/- 0.005, N = 3 SE +/- 0.042, N = 3 SE +/- 0.008, N = 3 6.934 6.949 6.925 1. OpenSCAD version 2021.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Leonardo Phone Case Slim Enabled Repeat Run 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 14.32 14.20 14.24 1. OpenSCAD version 2021.01
OpenVKL OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmark Enabled Repeat Run 40 80 120 160 200 SE +/- 1.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 194 197 196 MIN: 1 / MAX: 790 MIN: 1 / MAX: 806 MIN: 1 / MAX: 807
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume Enabled Repeat Run 5M 10M 15M 20M 25M SE +/- 44284.02, N = 3 SE +/- 283788.09, N = 3 SE +/- 120334.61, N = 3 24682690 25063015 24792840 MIN: 1560523 / MAX: 94703364 MIN: 1580423 / MAX: 105264432 MIN: 1602573 / MAX: 99801972
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkStructuredVolume Enabled Repeat Run 16M 32M 48M 64M 80M SE +/- 576939.06, N = 15 SE +/- 357980.91, N = 3 SE +/- 1052274.46, N = 3 74635152 72560377 76029329 MIN: 1917397 / MAX: 657699912 MIN: 1941493 / MAX: 544635720 MIN: 1954651 / MAX: 659359440
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkUnstructuredVolume Enabled Repeat Run 600K 1200K 1800K 2400K 3000K SE +/- 884.39, N = 3 SE +/- 3937.22, N = 3 SE +/- 2813.39, N = 3 2708582 2709587 2706127 MIN: 32681 / MAX: 9049793 MIN: 32743 / MAX: 9064631 MIN: 32688 / MAX: 9055017
Opus Codec Encoding Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode Enabled Repeat Run 2 4 6 8 10 SE +/- 0.011, N = 5 SE +/- 0.011, N = 5 SE +/- 0.010, N = 5 7.230 7.222 7.232 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
OSPray Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: SciVis Enabled Repeat Run 5 10 15 20 25 SE +/- 0.16, N = 3 SE +/- 0.16, N = 3 SE +/- 0.00, N = 3 22.06 22.06 21.74 MIN: 21.28 / MAX: 22.22 MIN: 21.28 / MAX: 22.22 MIN: 20 / MAX: 22.22
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: SciVis Enabled Repeat Run 0.7515 1.503 2.2545 3.006 3.7575 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.34 3.32 3.33 MIN: 3.29 / MAX: 3.4 MIN: 3.28 / MAX: 3.4 MIN: 3.27 / MAX: 3.41
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: Path Tracer Enabled Repeat Run 0.3578 0.7156 1.0734 1.4312 1.789 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.59 1.59 1.59 MIN: 1.58 / MAX: 1.6 MIN: 1.58 / MAX: 1.6 MIN: 1.58 / MAX: 1.6
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: SciVis Enabled Repeat Run 6 12 18 24 30 25 25 25 MIN: 23.81 / MAX: 25.64 MIN: 24.39 / MAX: 25.64 MIN: 23.81 / MAX: 25.64
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: Path Tracer Enabled Repeat Run 0.3938 0.7876 1.1814 1.5752 1.969 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.75 1.73 1.74 MIN: 1.73 / MAX: 1.78 MIN: 1.72 / MAX: 1.77 MIN: 1.65 / MAX: 1.78
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: SciVis Enabled Repeat Run 5 10 15 20 25 20 20 20 MIN: 18.87 / MAX: 20.41 MIN: 19.23 / MAX: 20.41 MIN: 19.61 / MAX: 20.41
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: Path Tracer Enabled Repeat Run 1.0283 2.0566 3.0849 4.1132 5.1415 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 4.57 4.55 4.57 MIN: 4.44 / MAX: 4.76 MIN: 4.44 / MAX: 4.78 MIN: 4.42 / MAX: 4.74
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: Path Tracer Enabled Repeat Run 70 140 210 280 350 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 333.33 333.33 333.33 MIN: 250 MIN: 250 MIN: 250
OpenBenchmarking.org Seconds, Fewer Is Better Perl Benchmarks Test: Interpreter Enabled Repeat Run 0.0002 0.0004 0.0006 0.0008 0.001 SE +/- 0.00000726, N = 3 SE +/- 0.00000063, N = 3 SE +/- 0.00000526, N = 3 0.00067785 0.00067438 0.00068233
PHPBench PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite Enabled Repeat Run 200K 400K 600K 800K 1000K SE +/- 4727.53, N = 3 SE +/- 1962.09, N = 3 SE +/- 3145.44, N = 3 1010404 1017890 1020071
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU Enabled Repeat Run 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 17.73 17.79 17.86
OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU Enabled Repeat Run 1.2465 2.493 3.7395 4.986 6.2325 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 5.52 5.54 5.52
POV-Ray This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Enabled Repeat Run 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 38.19 38.35 38.30 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Primesieve Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.4 1e12 Prime Number Generation Enabled Repeat Run 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 18.04 18.03 18.07 1. (CXX) g++ options: -O3 -lpthread
PyBench This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times Enabled Repeat Run 150 300 450 600 750 SE +/- 2.85, N = 3 SE +/- 2.08, N = 3 SE +/- 1.00, N = 3 714 716 711
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and the built-in benchmark used here reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 Enabled Repeat Run 700 1400 2100 2800 3500 SE +/- 17.21, N = 3 SE +/- 18.56, N = 3 SE +/- 24.43, N = 3 3323.0 3312.4 3311.3 1. (CXX) g++ options: -O3 -march=native -rdynamic
rays1bench This is a test of rays1bench, a simple path tracer / ray tracer that supports SSE and AVX instructions, multi-threading, and other features. This test profile measures the performance of the "large scene" in rays1bench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org mrays/s, More Is Better rays1bench 2020-01-09 Large Scene Enabled Repeat Run 16 32 48 64 80 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 71.82 71.69 71.69
RNNoise RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26-minute-long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 Enabled Repeat Run 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 20.89 20.90 20.89 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Enabled Repeat Run 50 100 150 200 250 SE +/- 1.22, N = 3 SE +/- 1.60, N = 3 SE +/- 2.75, N = 3 208.70 208.79 209.92 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Enabled Repeat Run 16 32 48 64 80 SE +/- 0.41, N = 14 SE +/- 0.33, N = 3 SE +/- 1.89, N = 14 70.42 70.25 72.24 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Enabled Repeat Run 20 40 60 80 100 SE +/- 0.42, N = 3 SE +/- 0.64, N = 3 SE +/- 0.20, N = 3 103.84 104.96 103.49 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver Enabled Repeat Run 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 21.08 21.10 21.30 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Enabled Repeat Run 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 17.00 17.01 17.02 1. (CXX) g++ options: -O2 -lOpenCL
Selenium This test profile uses the Selenium WebDriver for running various browser benchmarks in different available web browsers such as Firefox and Google Chrome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: ARES-6 - Browser: Firefox Enabled Repeat Run 9 18 27 36 45 SE +/- 0.20, N = 3 SE +/- 0.14, N = 3 SE +/- 0.06, N = 3 38.12 38.09 37.86 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Firefox Enabled Repeat Run 200 400 600 800 1000 SE +/- 0.82, N = 3 SE +/- 0.73, N = 3 SE +/- 1.71, N = 3 844.1 842.2 844.6 1. firefox 86.0
OpenBenchmarking.org Runs / Minute, More Is Better Selenium Benchmark: StyleBench - Browser: Firefox Enabled Repeat Run 30 60 90 120 150 SE +/- 2.02, N = 15 SE +/- 1.81, N = 15 SE +/- 1.76, N = 15 117 119 118 1. firefox 86.0
OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Firefox Enabled Repeat Run 20 40 60 80 100 SE +/- 0.91, N = 3 SE +/- 0.71, N = 3 SE +/- 0.45, N = 3 102.78 99.74 100.00 1. firefox 86.0
OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Firefox Enabled Repeat Run 30 60 90 120 150 SE +/- 0.67, N = 3 SE +/- 1.09, N = 3 SE +/- 1.34, N = 3 134.0 137.9 136.7 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Google Chrome Enabled Repeat Run 130 260 390 520 650 SE +/- 1.50, N = 3 SE +/- 0.93, N = 3 SE +/- 1.66, N = 3 616.6 610.4 610.7 1. chrome 89.0.4389.90
OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Firefox Enabled Repeat Run 600 1200 1800 2400 3000 SE +/- 10.17, N = 3 SE +/- 3.33, N = 3 SE +/- 9.87, N = 3 2779 2776 2788 1. firefox 86.0
OpenBenchmarking.org Runs / Minute, More Is Better Selenium Benchmark: StyleBench - Browser: Google Chrome Enabled Repeat Run 10 20 30 40 50 SE +/- 0.13, N = 3 SE +/- 0.13, N = 3 SE +/- 0.16, N = 3 45.95 45.78 46.19 1. chrome 89.0.4389.90
OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome Enabled Repeat Run 600 1200 1800 2400 3000 SE +/- 6.24, N = 3 SE +/- 2.31, N = 3 2813 2823 2825 1. chrome 89.0.4389.90
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Firefox Enabled Repeat Run 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 SE +/- 0.34, N = 3 25.1 25.0 25.4 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Firefox Enabled Repeat Run 70 140 210 280 350 SE +/- 3.30, N = 3 SE +/- 3.18, N = 3 SE +/- 3.27, N = 3 337.8 337.9 337.9 1. firefox 86.0
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome Enabled Repeat Run 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.35, N = 3 SE +/- 0.16, N = 3 26.57 26.65 26.95 1. chrome 89.0.4389.90
OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome Enabled Repeat Run 60 120 180 240 300 SE +/- 0.22, N = 3 SE +/- 0.19, N = 3 SE +/- 0.15, N = 3 280.38 280.30 280.72 1. chrome 89.0.4389.90
simdjson This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: Kostya Enabled Repeat Run 0.81 1.62 2.43 3.24 4.05 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.59 3.60 3.60 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: LargeRandom Enabled Repeat Run 0.2723 0.5446 0.8169 1.0892 1.3615 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.21 1.21 1.21 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: PartialTweets Enabled Repeat Run 1.1813 2.3626 3.5439 4.7252 5.9065 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 5.22 5.25 5.24 1. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: DistinctUserID Enabled Repeat Run 1.287 2.574 3.861 5.148 6.435 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 5.60 5.72 5.68 1. (CXX) g++ options: -O3 -pthread
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples Enabled Repeat Run 2 4 6 8 10 SE +/- 0.006, N = 3 SE +/- 0.005, N = 3 SE +/- 0.001, N = 3 8.880 8.883 8.883 1. (CXX) g++ options: -fopenmp -O3
srsLTE srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software-defined radio (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Samples / Second, More Is Better srsLTE 20.10.1 Test: OFDM_Test Enabled Repeat Run 40M 80M 120M 160M 200M SE +/- 260341.66, N = 3 SE +/- 2512855.83, N = 3 SE +/- 1473091.99, N = 3 176933333 178033333 176800000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
OpenBenchmarking.org eNb Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Enabled Repeat Run 70 140 210 280 350 SE +/- 0.55, N = 3 SE +/- 0.32, N = 3 SE +/- 1.22, N = 3 335.8 335.7 336.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
OpenBenchmarking.org UE Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Enabled Repeat Run 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.09, N = 3 SE +/- 0.40, N = 3 132.7 132.4 132.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time Enabled Repeat Run 5M 10M 15M 20M 25M SE +/- 188309.24, N = 3 SE +/- 77091.76, N = 3 SE +/- 240951.84, N = 3 22389342 22712289 22203685 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time Enabled Repeat Run 6M 12M 18M 24M 30M SE +/- 213446.67, N = 3 SE +/- 139588.95, N = 3 SE +/- 88725.14, N = 3 28813121 28469418 28372238 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Crypto Enabled Repeat Run 500 1000 1500 2000 2500 SE +/- 1.85, N = 3 SE +/- 0.23, N = 3 SE +/- 1.86, N = 3 2217.94 2218.61 2217.75 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: CPU Cache Enabled Repeat Run 6 12 18 24 30 SE +/- 0.19, N = 3 SE +/- 0.29, N = 3 SE +/- 0.21, N = 3 22.88 23.02 23.42 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: CPU Stress Enabled Repeat Run 1100 2200 3300 4400 5500 SE +/- 5.24, N = 3 SE +/- 12.12, N = 3 SE +/- 21.71, N = 3 4991.24 4964.68 5001.07 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Matrix Math Enabled Repeat Run 12K 24K 36K 48K 60K SE +/- 126.41, N = 3 SE +/- 122.73, N = 3 SE +/- 210.46, N = 3 56981.91 57137.41 57377.45 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Vector Math Enabled Repeat Run 13K 26K 39K 52K 65K SE +/- 97.11, N = 3 SE +/- 100.32, N = 3 SE +/- 57.06, N = 3 61555.92 62160.59 62436.51 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Memory Copying Enabled Repeat Run 300 600 900 1200 1500 SE +/- 2.01, N = 3 SE +/- 3.20, N = 3 SE +/- 2.42, N = 3 1533.48 1533.93 1526.56 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.11.07 Test: Context Switching Enabled Repeat Run 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 56528.36, N = 3 SE +/- 63682.03, N = 3 SE +/- 58907.92, N = 4 5101477.04 5095907.11 4881747.94 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lsctp -lz -ldl -lpthread -lc
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p Enabled Repeat Run 0.9574 1.9148 2.8722 3.8296 4.787 SE +/- 0.010, N = 3 SE +/- 0.028, N = 3 SE +/- 0.023, N = 3 4.255 4.254 4.232 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p Enabled Repeat Run 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 37.85 37.76 37.61 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Enabled Repeat Run 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 9.26 9.29 9.27 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Enabled Repeat Run 30 60 90 120 150 SE +/- 0.28, N = 3 SE +/- 0.08, N = 3 SE +/- 0.14, N = 3 135.79 135.17 135.41 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Enabled Repeat Run 60 120 180 240 300 SE +/- 0.21, N = 3 SE +/- 0.14, N = 3 SE +/- 0.33, N = 3 266.00 262.54 262.47 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Enabled Repeat Run 40 80 120 160 200 SE +/- 1.51, N = 3 SE +/- 2.07, N = 4 SE +/- 1.87, N = 3 185.98 184.70 183.50 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Enabled Repeat Run 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.23, N = 3 SE +/- 0.55, N = 3 191.74 191.19 190.50 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Enabled Repeat Run 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 SE +/- 0.23, N = 3 157.07 156.26 155.96 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Sysbench This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory Enabled Repeat Run 5K 10K 15K 20K 25K SE +/- 74.39, N = 3 SE +/- 57.62, N = 3 SE +/- 87.98, N = 3 25461.39 25202.79 25429.68 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU Enabled Repeat Run 7K 14K 21K 28K 35K SE +/- 0.29, N = 3 SE +/- 4.23, N = 3 SE +/- 5.32, N = 3 34101.32 34092.54 34093.33 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Tachyon This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time Enabled Repeat Run 20 40 60 80 100 SE +/- 0.54, N = 3 SE +/- 0.53, N = 15 SE +/- 0.43, N = 15 74.88 75.02 74.02 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
Tesseract OCR Tesseract-OCR is the open-source optical character recognition (OCR) engine for the conversion of text within images to raw text output. This test profile relies upon a system-supplied Tesseract installation. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tesseract OCR 4.1.1 Time To OCR 7 Images Enabled Repeat Run 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 18.50 18.51 18.46
Timed Mesa Compilation This test profile times how long it takes to compile Mesa with Meson/Ninja. To minimize build dependencies and avoid versioning conflicts, this test builds only the core of Mesa, without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile Enabled Repeat Run 10 20 30 40 50 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 46.06 45.87 45.94
Timed MrBayes Analysis This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Enabled Repeat Run 13 26 39 52 65 SE +/- 0.41, N = 3 SE +/- 0.47, N = 3 SE +/- 0.74, N = 3 57.25 57.64 57.33 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm
Timed Wasmer Compilation This test times how long it takes to compile Wasmer. Wasmer is written in the Rust programming language and is a WebAssembly runtime implementation that supports WASI and Emscripten. This test profile builds Wasmer with the Cranelift and Singlepass compiler features enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile Enabled Repeat Run 12 24 36 48 60 SE +/- 0.35, N = 3 SE +/- 0.27, N = 3 SE +/- 0.26, N = 3 52.85 53.06 52.88 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc
TNN TNN is an open-source deep learning inference framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 Enabled Repeat Run 60 120 180 240 300 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 263.34 263.47 263.60 MIN: 261.29 / MAX: 268.2 MIN: 261.32 / MAX: 269.11 MIN: 261.99 / MAX: 268.68 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 Enabled Repeat Run 60 120 180 240 300 SE +/- 0.05, N = 3 SE +/- 0.36, N = 3 SE +/- 0.68, N = 3 264.38 264.77 265.45 MIN: 262.68 / MAX: 267.22 MIN: 263.03 / MAX: 266.49 MIN: 263.15 / MAX: 269.59 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance Enabled Repeat Run 400K 800K 1200K 1600K 2000K SE +/- 1321.50, N = 5 SE +/- 1711.59, N = 5 SE +/- 1321.50, N = 5 1723339 1726583 1724418 1. (CC) gcc options: -O3 -march=native
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping Enabled Repeat Run 120 240 360 480 600 SE +/- 1.36, N = 3 SE +/- 1.14, N = 3 SE +/- 0.40, N = 3 550.73 556.31 561.83 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
ViennaCL ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY Repeat Enabled Run 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 25.1 25.1 25.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY Repeat Enabled Run 9 18 27 36 45 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 38.7 38.8 39.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT Repeat Enabled Run 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 41.3 41.3 41.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY Repeat Enabled Run 6 12 18 24 30 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 23.6 23.6 23.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY Repeat Enabled Run 8 16 24 32 40 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 36.3 36.3 36.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT Repeat Enabled Run 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.12, N = 3 38.4 38.4 38.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N Repeat Enabled Run 11 22 33 44 55 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 46.7 46.7 46.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T Repeat Enabled Run 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.17, N = 3 SE +/- 0.06, N = 3 48.4 48.2 48.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN Repeat Enabled Run 11 22 33 44 55 SE +/- 0.19, N = 3 SE +/- 1.10, N = 3 SE +/- 0.03, N = 3 46.8 45.2 47.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT Repeat Enabled Run 10 20 30 40 50 SE +/- 0.17, N = 3 SE +/- 0.13, N = 3 SE +/- 0.03, N = 3 44.7 44.4 45.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT Repeat Enabled Run 11 22 33 44 55 SE +/- 0.00, N = 3 SE +/- 0.45, N = 3 SE +/- 0.06, N = 3 47.3 46.8 47.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN Repeat Enabled Run 11 22 33 44 55 SE +/- 0.00, N = 2 SE +/- 0.57, N = 3 SE +/- 0.03, N = 3 48.7 48.1 48.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 0 Enabled Repeat Run 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 9.34 9.35 9.35 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 5 Enabled Repeat Run 8 16 24 32 40 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 32.91 32.90 32.84 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Enabled Repeat Run 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 15.30 15.32 15.32 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Enabled Repeat Run 2 4 6 8 10 SE +/- 0.005, N = 3 SE +/- 0.003, N = 3 SE +/- 0.004, N = 3 6.095 6.083 6.080 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation support, and full multi-threading support. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default Enabled Repeat Run 0.8021 1.6042 2.4063 3.2084 4.0105 SE +/- 0.015, N = 3 SE +/- 0.012, N = 3 SE +/- 0.004, N = 3 3.533 3.545 3.565 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 Enabled Repeat Run 40 80 120 160 200 SE +/- 0.34, N = 3 SE +/- 0.25, N = 3 SE +/- 0.18, N = 3 191.35 190.53 190.33 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 Enabled Repeat Run 3 6 9 12 15 SE +/- 0.004, N = 3 SE +/- 0.008, N = 3 SE +/- 0.016, N = 3 9.618 9.603 9.625 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WireGuard + Linux Networking Stack Stress Test This is a benchmark of the WireGuard secure VPN tunnel and Linux networking stack stress test. The test runs on the local host but requires root permissions. It creates three namespaces: ns0 has a loopback device, and ns1 and ns2 each have WireGuard devices. Those two WireGuard devices send traffic through the loopback device of ns0. As a result, the test exercises encryption and decryption at the same time, a CPU- and scheduler-heavy workload. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WireGuard + Linux Networking Stack Stress Test Enabled Repeat Run 30 60 90 120 150 SE +/- 0.56, N = 3 SE +/- 0.50, N = 3 SE +/- 1.20, N = 3 130.03 129.89 130.53
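The three-namespace layout described for the WireGuard test can be sketched as a dry run that only builds the `ip` commands and prints them, without executing anything or needing root. This is a rough illustration, not the test profile's actual script. The helper name `netns_setup_commands` and the device name `wg0` are hypothetical. Creating each WireGuard device in ns0 and then moving it into its peer namespace is an assumption about how the traffic ends up going through ns0's loopback. Key generation, addressing, and peer configuration are left out.

```python
def netns_setup_commands(namespaces=("ns0", "ns1", "ns2")):
    """Build (but do not run) ip(8) commands for the three-namespace layout."""
    hub, *peers = namespaces
    # One network namespace per name.
    cmds = [["ip", "netns", "add", ns] for ns in namespaces]
    # The hub namespace only needs its loopback device brought up.
    cmds.append(["ip", "-n", hub, "link", "set", "lo", "up"])
    for peer in peers:
        # Assumption: create the WireGuard device in the hub namespace,
        # then move it into the peer namespace.
        cmds.append(["ip", "-n", hub, "link", "add", "wg0", "type", "wireguard"])
        cmds.append(["ip", "-n", hub, "link", "set", "wg0", "netns", peer])
    return cmds


for cmd in netns_setup_commands():
    print(" ".join(cmd))
```

Returning argument lists rather than shell strings keeps the sketch easy to inspect and would let a caller pass each entry to `subprocess.run` on a disposable machine.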
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding Enabled Repeat Run 30 60 90 120 150 SE +/- 0.41, N = 3 SE +/- 0.83, N = 3 SE +/- 1.20, N = 3 118.95 119.08 119.15 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 This is a simple test of the x265 encoder run on the CPU, with 1080p and 4K input options, measuring H.265 video encode performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Enabled Repeat Run 4 8 12 16 20 SE +/- 0.21, N = 3 SE +/- 0.12, N = 9 SE +/- 0.13, N = 3 15.04 15.14 14.82 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p Enabled Repeat Run 15 30 45 60 75 SE +/- 0.48, N = 3 SE +/- 0.17, N = 3 SE +/- 0.29, N = 3 67.47 67.44 67.60 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction Enabled Repeat Run 8 16 24 32 40 SE +/- 0.29, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 34.13 33.69 33.77 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Enabled Repeat Run 30 60 90 120 150 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 118.39 118.12 118.35 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 Enabled Repeat Run 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 29.80 29.81 29.86 1. (CC) gcc options: -pthread -fvisibility=hidden -O2
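For readers who want a feel for what a timed XZ compression at level 9 involves, here is a minimal sketch using Python's standard `lzma` module. It is not how the test profile runs. The sample payload is a made-up stand-in for the Ubuntu image, and `time_xz` is a hypothetical helper.

```python
import lzma
import time


def time_xz(data: bytes, preset: int = 9):
    """Compress data as an .xz stream and return (elapsed seconds, compressed bytes)."""
    start = time.perf_counter()
    compressed = lzma.compress(data, format=lzma.FORMAT_XZ, preset=preset)
    elapsed = time.perf_counter() - start
    return elapsed, compressed


# Hypothetical stand-in payload; the real test compresses a file-system image.
sample = b"OpenBenchmarking sample payload\n" * 4096
elapsed, compressed = time_xz(sample)
print(f"{len(sample)} -> {len(compressed)} bytes in {elapsed:.4f} s")

# Round-trip check so the timing is known to cover a valid compression.
assert lzma.decompress(compressed) == sample
```

Absolute timings from a sketch like this are not comparable with the figures above, since the input, the implementation path, and the threading differ.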
YafaRay YafaRay is an open-source, physically based Monte Carlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene Enabled Repeat Run 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.18, N = 3 SE +/- 0.46, N = 3 121.45 121.24 121.33 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
Zstd Compression This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed Enabled Repeat Run 70 140 210 280 350 SE +/- 2.05, N = 3 SE +/- 1.09, N = 3 SE +/- 0.86, N = 3 314.5 312.9 310.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed Enabled Repeat Run 1000 2000 3000 4000 5000 SE +/- 3.90, N = 3 SE +/- 2.15, N = 3 SE +/- 1.52, N = 3 4763.0 4760.7 4774.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed Enabled Repeat Run 8 16 24 32 40 SE +/- 0.09, N = 3 SE +/- 0.19, N = 3 SE +/- 0.06, N = 3 33.0 33.1 33.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed Enabled Repeat Run 900 1800 2700 3600 4500 SE +/- 11.78, N = 3 SE +/- 4.33, N = 3 SE +/- 8.51, N = 3 4390.6 4401.8 4394.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed Enabled Repeat Run 300 600 900 1200 1500 SE +/- 14.33, N = 15 SE +/- 28.36, N = 15 SE +/- 22.02, N = 15 1302.6 1327.1 1303.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed Enabled Repeat Run 1000 2000 3000 4000 5000 SE +/- 12.94, N = 15 SE +/- 5.59, N = 15 SE +/- 11.09, N = 15 4865.2 4875.4 4864.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed Enabled Repeat Run 80 160 240 320 400 SE +/- 3.48, N = 3 SE +/- 1.95, N = 3 SE +/- 1.58, N = 3 386.2 382.8 389.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed Enabled Repeat Run 1100 2200 3300 4400 5500 SE +/- 6.01, N = 3 SE +/- 16.73, N = 3 SE +/- 2.66, N = 3 5071.8 5042.4 5065.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed Enabled Repeat Run 7 14 21 28 35 SE +/- 0.15, N = 3 SE +/- 0.15, N = 3 SE +/- 0.07, N = 3 31.4 31.3 31.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed Enabled Repeat Run 900 1800 2700 3600 4500 SE +/- 38.86, N = 3 SE +/- 4.83, N = 3 SE +/- 3.52, N = 3 4252.3 4315.8 4308.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Enabled
Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: DEBUGINFOD_URLS=
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-p9aljy/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x39 - Thermald 2.4.3
Java Notes: OpenJDK Runtime Environment (build 11.0.11-ea+4-Ubuntu-0ubuntu2)
Python Notes: Python 3.9.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 26 March 2021 14:03 by user ronix.
Repeat Kernel, Environment, Compiler, Processor, Java, Python, and Security Notes: identical to those listed for Enabled.
Testing initiated at 27 March 2021 10:46 by user ronix.
Run Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0610 BIOS), Chipset: Intel Tiger Lake-H, Memory: 32GB, Disk: 1000GB Western Digital WD_BLACK SN850 1TB + 2000GB, Graphics: AMD Radeon RX 6800/6800 XT / 6900 16GB (2575/1000MHz), Audio: Intel Tiger Lake-H HD Audio, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Device 2725
OS: Ubuntu 21.04, Kernel: 5.12.0-051200rc3daily20210315-generic (x86_64) 20210314, Desktop: GNOME Shell 3.38.3, Display Server: X Server 1.20.10 + Wayland, OpenGL: 4.6 Mesa 21.1.0-devel (git-616720d 2021-03-16 hirsute-oibaf-ppa) (LLVM 12.0.0), Compiler: GCC 10.2.1 20210312, File-System: ext4, Screen Resolution: 3840x2160
Kernel, Environment, Compiler, Processor, Java, Python, and Security Notes: identical to those listed for Enabled.
Testing initiated at 28 March 2021 07:45 by user ronix.