Intel Xeon E5-1680 v3 testing with a ASUS X99-A (3902 BIOS) and eVGA NVIDIA NVE7 1GB on Ubuntu 20.04 via the Phoronix Test Suite.
1 Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: MQ-DEADLINE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x43Python Notes: Python 3.8.5Security Notes: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
2 3 Processor: Intel Xeon E5-1680 v3 @ 3.80GHz (8 Cores / 16 Threads), Motherboard: ASUS X99-A (3902 BIOS), Chipset: Intel Xeon E7 v3/Xeon, Memory: 16GB, Disk: PNY CS900 240GB, Graphics: eVGA NVIDIA NVE7 1GB, Audio: Realtek ALC1150, Monitor: G237HL, Network: Intel I218-V
OS: Ubuntu 20.04, Kernel: 5.4.0-47-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 4.3 Mesa 20.0.8, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compile Bench Compilebench tries to age a filesystem by simulating some of the disk IO common in creating, compiling, patching, stating and reading kernel trees. It indirectly measures how well filesystems can maintain directory locality as the disk fills up and directories age. This current test is setup to use the makej mode with 10 initial directories Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Compile 1 2 3 170 340 510 680 850 SE +/- 8.09, N = 3 SE +/- 5.81, N = 3 SE +/- 52.39, N = 12 760.84 773.58 653.68
yquake2 This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better yquake2 7.45 Renderer: Software CPU - Resolution: 1920 x 1080 1 2 3 20 40 60 80 100 SE +/- 0.98, N = 5 SE +/- 0.35, N = 3 SE +/- 0.26, N = 3 75.4 73.6 74.8 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Ping Pong 1 2 3 1.0148 2.0296 3.0444 4.0592 5.074 SE +/- 0.037, N = 25 SE +/- 0.032, N = 5 SE +/- 0.055, N = 6 4.499 4.487 4.510 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Under Load 1 2 3 6 12 18 24 30 SE +/- 0.84, N = 25 SE +/- 1.08, N = 20 SE +/- 1.22, N = 25 25.47 25.96 24.48 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen 1 2 3 200 400 600 800 1000 SE +/- 2.89, N = 3 SE +/- 5.03, N = 3 SE +/- 11.79, N = 3 1046 1050 1038 1. (CXX) g++ options: -flto -pthread
CLOMP CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup 1 2 3 3 6 9 12 15 SE +/- 0.14, N = 4 SE +/- 0.15, N = 4 SE +/- 0.09, N = 3 9.9 10.2 10.2 1. (CC) gcc options: -fopenmp -O3 -lm
Dolfyn Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 1 2 3 6 12 18 24 30 SE +/- 0.10, N = 3 SE +/- 0.16, N = 3 SE +/- 0.17, N = 3 23.21 23.40 23.32
FFTE FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 1 2 3 7K 14K 21K 28K 35K SE +/- 340.92, N = 3 SE +/- 209.55, N = 3 SE +/- 89.40, N = 3 30109.61 30542.46 30708.45 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed 1 2 3 1600 3200 4800 6400 8000 SE +/- 8.20, N = 3 SE +/- 10.99, N = 3 SE +/- 10.24, N = 3 7499.0 7494.0 7503.5 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed 1 2 3 10 20 30 40 50 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 42.97 42.99 43.04 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed 1 2 3 1600 3200 4800 6400 8000 SE +/- 4.72, N = 3 SE +/- 6.48, N = 3 SE +/- 4.32, N = 3 7267.3 7256.9 7267.9 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed 1 2 3 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.56, N = 3 42.06 42.11 41.60 1. (CC) gcc options: -O3
OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed 1 2 3 1600 3200 4800 6400 8000 SE +/- 5.97, N = 3 SE +/- 4.05, N = 3 SE +/- 14.52, N = 3 7283.6 7259.4 7278.7 1. (CC) gcc options: -O3
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl 1 2 3 60 120 180 240 300 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 262 264 264 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate 1 2 3 130 260 390 520 650 SE +/- 0.33, N = 3 598 612 613 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen 1 2 3 20 40 60 80 100 90 90 90 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced 1 2 3 20 40 60 80 100 101 101 101 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing 1 2 3 140 280 420 560 700 SE +/- 1.15, N = 3 SE +/- 1.15, N = 3 SE +/- 0.67, N = 3 648 651 656 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian 1 2 3 40 80 120 160 200 SE +/- 0.33, N = 3 175 176 176 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space 1 2 3 120 240 360 480 600 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 565 575 578 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 2 3 1.0313 2.0626 3.0939 4.1252 5.1565 SE +/- 0.01478, N = 3 SE +/- 0.03023, N = 3 SE +/- 0.02200, N = 3 4.53012 4.58350 4.55778 MIN: 4.43 MIN: 4.48 MIN: 4.45 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.01639, N = 3 SE +/- 0.02409, N = 3 SE +/- 0.03424, N = 3 6.87752 6.78791 6.81139 MIN: 6.77 MIN: 6.64 MIN: 6.68 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.7107 1.4214 2.1321 2.8428 3.5535 SE +/- 0.01227, N = 3 SE +/- 0.01228, N = 3 SE +/- 0.01108, N = 3 3.13341 3.15874 3.14475 MIN: 3.07 MIN: 3.1 MIN: 3.07 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.5742 1.1484 1.7226 2.2968 2.871 SE +/- 0.01421, N = 3 SE +/- 0.01518, N = 3 SE +/- 0.01253, N = 3 2.55204 2.53457 2.52494 MIN: 2.5 MIN: 2.49 MIN: 2.48 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 12.55 12.50 12.48 MIN: 12.41 MIN: 12.37 MIN: 12.34 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.00863, N = 3 SE +/- 0.01093, N = 3 SE +/- 0.03639, N = 3 6.33479 6.40014 6.37270 MIN: 6.26 MIN: 6.3 MIN: 6.26 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.01399, N = 3 SE +/- 0.02794, N = 3 SE +/- 0.00829, N = 3 9.17727 9.23463 9.19921 MIN: 9.08 MIN: 9.09 MIN: 9.09 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 10.82 10.75 10.81 MIN: 10.52 MIN: 10.48 MIN: 10.47 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.10075, N = 5 SE +/- 0.12917, N = 3 SE +/- 0.01396, N = 3 7.87695 7.90354 7.66474 MIN: 7.57 MIN: 7.65 MIN: 7.58 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.01219, N = 3 SE +/- 0.02072, N = 3 SE +/- 0.01601, N = 3 6.20319 6.21295 6.21070 MIN: 6.12 MIN: 6.11 MIN: 6.1 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 1000 2000 3000 4000 5000 SE +/- 1.69, N = 3 SE +/- 1.21, N = 3 SE +/- 2.17, N = 3 4531.11 4525.42 4533.64 MIN: 4524.03 MIN: 4516.35 MIN: 4525.25 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 2.75, N = 3 SE +/- 3.28, N = 3 SE +/- 1.82, N = 3 2474.88 2468.97 2468.80 MIN: 2467.18 MIN: 2460.79 MIN: 2463.37 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 2 3 1000 2000 3000 4000 5000 SE +/- 0.35, N = 3 SE +/- 3.64, N = 3 SE +/- 7.29, N = 3 4529.12 4523.86 4536.47 MIN: 4522.47 MIN: 4513.41 MIN: 4520.13 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 0.91, N = 3 SE +/- 0.84, N = 3 SE +/- 1.99, N = 3 2472.67 2470.37 2468.25 MIN: 2467.6 MIN: 2466.62 MIN: 2462.29 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 2 3 0.6781 1.3562 2.0343 2.7124 3.3905 SE +/- 0.00785, N = 3 SE +/- 0.00758, N = 3 SE +/- 0.00748, N = 3 3.01373 2.98814 2.97447 MIN: 2.93 MIN: 2.91 MIN: 2.9 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 1000 2000 3000 4000 5000 SE +/- 2.65, N = 3 SE +/- 1.37, N = 3 SE +/- 3.38, N = 3 4533.01 4523.92 4526.33 MIN: 4522.75 MIN: 4517.94 MIN: 4516.82 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 10.28, N = 3 SE +/- 2.48, N = 3 SE +/- 0.45, N = 3 2480.10 2469.55 2466.09 MIN: 2465 MIN: 2462.7 MIN: 2461.63 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.7897 1.5794 2.3691 3.1588 3.9485 SE +/- 0.00979, N = 3 SE +/- 0.02742, N = 3 SE +/- 0.01650, N = 3 3.50993 3.49680 3.46034 MIN: 3.44 MIN: 3.33 MIN: 3.32 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Crown 1 2 3 3 6 9 12 15 SE +/- 0.0256, N = 3 SE +/- 0.0397, N = 3 SE +/- 0.0193, N = 3 9.6807 9.7353 9.7629 MIN: 9.6 / MAX: 9.86 MIN: 9.62 / MAX: 9.95 MIN: 9.68 / MAX: 9.94
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 10.01 10.05 10.05 MIN: 9.97 / MAX: 10.12 MIN: 9.94 / MAX: 10.21 MIN: 9.86 / MAX: 10.24
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Obj 1 2 3 3 6 9 12 15 SE +/- 0.0214, N = 3 SE +/- 0.0255, N = 3 SE +/- 0.0092, N = 3 9.4002 9.4191 9.3499 MIN: 9.33 / MAX: 9.52 MIN: 9.36 / MAX: 9.54 MIN: 9.3 / MAX: 9.45
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 12.03 12.14 12.10 MIN: 11.97 / MAX: 12.2 MIN: 11.91 / MAX: 12.45 MIN: 11.87 / MAX: 12.37
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Obj 1 2 3 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 10.81 10.80 10.79 MIN: 10.7 / MAX: 11.01 MIN: 10.71 / MAX: 11.01 MIN: 10.7 / MAX: 10.96
Kvazaar This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium 1 2 3 0.8145 1.629 2.4435 3.258 4.0725 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.62 3.61 3.55 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 15.65 15.67 15.43 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 10.28 10.28 10.10 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast 1 2 3 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 18.56 18.52 18.29 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast 1 2 3 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 36.78 36.78 36.02 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast 1 2 3 16 32 48 64 80 SE +/- 0.11, N = 3 SE +/- 0.21, N = 3 SE +/- 0.22, N = 3 69.70 69.21 68.68 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 5 1 2 3 0.1841 0.3682 0.5523 0.7364 0.9205 SE +/- 0.002, N = 3 SE +/- 0.005, N = 3 SE +/- 0.005, N = 3 0.815 0.816 0.818
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 6 1 2 3 0.2459 0.4918 0.7377 0.9836 1.2295 SE +/- 0.001, N = 3 SE +/- 0.006, N = 3 SE +/- 0.003, N = 3 1.088 1.093 1.086
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 10 1 2 3 0.5456 1.0912 1.6368 2.1824 2.728 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.007, N = 3 2.415 2.425 2.419
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K 1 2 3 2 4 6 8 10 SE +/- 0.09, N = 8 SE +/- 0.12, N = 4 SE +/- 0.08, N = 3 8.85 8.75 8.87 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p 1 2 3 8 16 24 32 40 SE +/- 0.53, N = 3 SE +/- 0.16, N = 3 SE +/- 0.18, N = 3 35.51 36.22 35.60 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time 1 2 3 3M 6M 9M 12M 15M SE +/- 166711.74, N = 3 SE +/- 71363.03, N = 3 SE +/- 194479.46, N = 4 14124404 14141342 13900575 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
Build2 This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile 1 2 3 40 80 120 160 200 SE +/- 2.07, N = 5 SE +/- 2.37, N = 4 SE +/- 1.61, N = 3 173.72 172.59 178.60
Opus Codec Encoding Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode 1 2 3 3 6 9 12 15 SE +/- 0.024, N = 5 SE +/- 0.033, N = 5 SE +/- 0.008, N = 5 9.678 9.643 9.616 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
RNNoise RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.14, N = 3 SE +/- 0.06, N = 3 25.71 25.67 25.52 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Node.js V8 Web Tooling Benchmark Running the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers like Babel and TypeScript and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark 1 2 3 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 9.77 9.71 9.50 1. Nodejs
v10.19.0
Apache CouchDB This is a bulk insertion benchmark of Apache CouchDB. CouchDB is a document-oriented NoSQL database implemented in Erlang. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 1 2 3 30 60 90 120 150 SE +/- 0.34, N = 3 SE +/- 0.62, N = 3 SE +/- 2.21, N = 3 134.60 134.09 135.30 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
KeyDB A benchmark of KeyDB as a multi-threaded fork of the Redis server. The KeyDB benchmark is conducted using memtier-benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 1 2 3 100K 200K 300K 400K 500K SE +/- 1362.98, N = 3 SE +/- 111.39, N = 3 SE +/- 531.93, N = 3 470540.93 475933.28 475766.15 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark 1 2 3 0.1699 0.3398 0.5097 0.6796 0.8495 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.754 0.753 0.755 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 7.00 7.11 7.04 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium 1 2 3 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 6.59 6.58 6.59 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough 1 2 3 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 41.52 41.51 41.50 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive 1 2 3 70 140 210 280 350 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 334.10 334.17 334.21 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Basis Universal Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: ETC1S 1 2 3 15 30 45 60 75 SE +/- 0.44, N = 3 SE +/- 0.19, N = 3 SE +/- 0.05, N = 3 65.82 66.20 66.22 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 0 1 2 3 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 10.24 10.20 10.20 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 1 2 3 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 47.38 47.42 47.51 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 3 1 2 3 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 88.87 88.99 88.93 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 11131.75, N = 3 SE +/- 31572.89, N = 15 SE +/- 24311.83, N = 15 1639601.21 1625787.13 1635637.53 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 300K 600K 900K 1200K 1500K SE +/- 16275.71, N = 5 SE +/- 10203.98, N = 15 SE +/- 34112.41, N = 12 1289264.92 1306966.33 1243644.16 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 22966.71, N = 3 SE +/- 26369.12, N = 15 SE +/- 28672.73, N = 3 1834428.00 1750688.54 1852979.38 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 300K 600K 900K 1200K 1500K SE +/- 34189.05, N = 15 SE +/- 26544.26, N = 15 SE +/- 32485.29, N = 15 1481853.31 1456546.48 1464928.24 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Caffe This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 1 2 3 13K 26K 39K 52K 65K SE +/- 170.12, N = 3 SE +/- 51.35, N = 3 SE +/- 27.33, N = 3 59307 59205 59221 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 1 2 3 30K 60K 90K 120K 150K SE +/- 18.73, N = 3 SE +/- 244.34, N = 3 SE +/- 86.87, N = 3 118086 118507 118149 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 2 3 30K 60K 90K 120K 150K SE +/- 103.00, N = 3 SE +/- 78.70, N = 3 SE +/- 108.38, N = 3 150994 150853 151020 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 1 2 3 60K 120K 180K 240K 300K SE +/- 189.29, N = 3 SE +/- 155.18, N = 3 SE +/- 180.52, N = 3 301960 301686 301760 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet 1 2 3 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.01, N = 3 SE +/- 0.20, N = 3 20.50 20.23 20.44 MIN: 20.12 / MAX: 38.51 MIN: 20.1 / MAX: 22.33 MIN: 20.12 / MAX: 27.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 6.19 6.23 6.32 MIN: 6 / MAX: 15.89 MIN: 6.01 / MAX: 16.48 MIN: 6.04 / MAX: 17.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 5.40 5.42 5.45 MIN: 5.25 / MAX: 17.21 MIN: 5.27 / MAX: 16.73 MIN: 5.3 / MAX: 16.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 6.88 6.95 6.97 MIN: 6.77 / MAX: 17.73 MIN: 6.77 / MAX: 18.58 MIN: 6.8 / MAX: 18.91 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet 1 2 3 1.2668 2.5336 3.8004 5.0672 6.334 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 5.45 5.63 5.60 MIN: 5.28 / MAX: 16.44 MIN: 5.37 / MAX: 17.72 MIN: 5.39 / MAX: 16.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.35, N = 3 8.17 8.34 8.67 MIN: 8.07 / MAX: 8.37 MIN: 8.06 / MAX: 16.21 MIN: 8.06 / MAX: 89.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface 1 2 3 0.5873 1.1746 1.7619 2.3492 2.9365 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 2.55 2.55 2.61 MIN: 2.48 / MAX: 2.62 MIN: 2.51 / MAX: 2.59 MIN: 2.55 / MAX: 3.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet 1 2 3 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 15.39 15.58 15.47 MIN: 15.11 / MAX: 19.35 MIN: 15.19 / MAX: 16.78 MIN: 15.2 / MAX: 16.43 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 1 2 3 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 47.95 48.14 48.18 MIN: 47.64 / MAX: 50.16 MIN: 47.78 / MAX: 64.77 MIN: 47.69 / MAX: 57.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 14.02 14.07 14.00 MIN: 13.88 / MAX: 14.41 MIN: 13.87 / MAX: 19.55 MIN: 13.86 / MAX: 15.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 11.16 11.19 11.19 MIN: 11.09 / MAX: 11.53 MIN: 11.11 / MAX: 11.28 MIN: 11.11 / MAX: 20.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 1 2 3 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 27.37 27.46 27.47 MIN: 26.85 / MAX: 35.85 MIN: 27.01 / MAX: 44.87 MIN: 27.05 / MAX: 45.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny 1 2 3 7 14 21 28 35 SE +/- 0.25, N = 3 SE +/- 0.07, N = 3 SE +/- 0.76, N = 3 29.91 28.20 28.92 MIN: 27.91 / MAX: 34.95 MIN: 27.88 / MAX: 28.62 MIN: 27.88 / MAX: 35.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd 1 2 3 6 12 18 24 30 SE +/- 0.11, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 24.81 24.72 24.73 MIN: 24.62 / MAX: 48.23 MIN: 24.64 / MAX: 26.71 MIN: 24.64 / MAX: 27.27 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 20.68 20.87 20.95 MIN: 20.41 / MAX: 37.64 MIN: 20.52 / MAX: 52.3 MIN: 20.63 / MAX: 34.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP16 - Device: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 0.54, N = 3 SE +/- 0.91, N = 3 SE +/- 1.10, N = 3 2114.40 2115.29 2116.99
OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP32 - Device: CPU 1 2 3 0.423 0.846 1.269 1.692 2.115 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.88 1.88 1.88
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP32 - Device: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 0.83, N = 3 SE +/- 0.51, N = 3 SE +/- 1.55, N = 3 2113.87 2115.31 2116.84
OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP16 - Device: CPU 1 2 3 0.2925 0.585 0.8775 1.17 1.4625 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 1.30 1.27 1.30
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP16 - Device: CPU 1 2 3 700 1400 2100 2800 3500 SE +/- 2.92, N = 3 SE +/- 2.89, N = 3 SE +/- 0.75, N = 3 3070.21 3078.70 3073.95
OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP32 - Device: CPU 1 2 3 0.2925 0.585 0.8775 1.17 1.4625 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.3 1.3 1.3
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP32 - Device: CPU 1 2 3 700 1400 2100 2800 3500 SE +/- 1.72, N = 3 SE +/- 1.16, N = 3 SE +/- 5.64, N = 3 3074.79 3069.32 3073.87
OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU 1 2 3 1100 2200 3300 4400 5500 SE +/- 13.21, N = 3 SE +/- 8.75, N = 3 SE +/- 7.38, N = 3 5069.47 5044.33 5066.61
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU 1 2 3 0.1733 0.3466 0.5199 0.6932 0.8665 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.77 0.77 0.77
OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU 1 2 3 1100 2200 3300 4400 5500 SE +/- 9.42, N = 3 SE +/- 9.51, N = 3 SE +/- 7.20, N = 3 5077.11 5066.12 5060.17
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU 1 2 3 0.1733 0.3466 0.5199 0.6932 0.8665 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.77 0.77 0.77
OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar 1 2 3 0.6674 1.3348 2.0022 2.6696 3.337 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 SE +/- 0.002, N = 3 2.966 2.963 2.946
PHPBench PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite 1 2 3 130K 260K 390K 520K 650K SE +/- 676.23, N = 3 SE +/- 872.14, N = 3 SE +/- 1083.96, N = 3 626480 626218 624167
OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression 1 2 3 0.7178 1.4356 2.1534 2.8712 3.589 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.18 3.19 3.19
BRL-CAD BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric 1 2 3 20K 40K 60K 80K 100K 88817 88995 88885 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
1 Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: MQ-DEADLINE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x43Python Notes: Python 3.8.5Security Notes: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 26 December 2020 10:52 by user phoronix.
2 Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: MQ-DEADLINE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x43Python Notes: Python 3.8.5Security Notes: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 26 December 2020 20:07 by user phoronix.
3 Processor: Intel Xeon E5-1680 v3 @ 3.80GHz (8 Cores / 16 Threads), Motherboard: ASUS X99-A (3902 BIOS), Chipset: Intel Xeon E7 v3/Xeon, Memory: 16GB, Disk: PNY CS900 240GB, Graphics: eVGA NVIDIA NVE7 1GB, Audio: Realtek ALC1150, Monitor: G237HL, Network: Intel I218-V
OS: Ubuntu 20.04, Kernel: 5.4.0-47-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 4.3 Mesa 20.0.8, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vDisk Notes: MQ-DEADLINE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x43Python Notes: Python 3.8.5Security Notes: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 27 December 2020 05:50 by user phoronix.