Intel Core i3-7100 testing with a Gigabyte B250M-DS3H-CF (F9 BIOS) and Gigabyte Intel HD 630 3GB on Ubuntu 20.10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2012250-HA-COREI371034 Core i3 7100 Xmas Eve - Phoronix Test Suite Core i3 7100 Xmas Eve Intel Core i3-7100 testing with a Gigabyte B250M-DS3H-CF (F9 BIOS) and Gigabyte Intel HD 630 3GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012250-HA-COREI371034&rdt&grt .
Core i3 7100 Xmas Eve Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Core i3-7100 @ 3.90GHz (2 Cores / 4 Threads) Gigabyte B250M-DS3H-CF (F9 BIOS) Intel Xeon E3-1200 v6/7th + B250 8GB 250GB Western Digital WDS250G1B0A- Gigabyte Intel HD 630 3GB (1100MHz) Realtek ALC887-VD VA2431 Realtek RTL8111/8168/8411 Ubuntu 20.10 5.8.0-28-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 modesetting 1.20.9 4.6 Mesa 20.2.1 1.2.145 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - MQ-DEADLINE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
Core i3 7100 Xmas Eve asmfish: 1024 Hash Memory, 26 Depth astcenc: Fast astcenc: Medium astcenc: Thorough astcenc: Exhaustive build2: Time To Compile clomp: Static OMP Speedup coremark: CoreMark Size 666 - Iterations Per Second crafty: Elapsed Time hpcc: G-HPL hpcc: G-Ffte hpcc: EP-DGEMM hpcc: G-Ptrans hpcc: EP-STREAM Triad hpcc: G-Rand Access hpcc: Rand Ring Latency hpcc: Rand Ring Bandwidth hpcc: Max Ping Pong Bandwidth encode-ape: WAV To APE ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m node-web-tooling: onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU encode-opus: WAV To Opus Encode phpbench: PHP Benchmark Suite rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 simdjson: Kostya simdjson: LargeRand simdjson: PartialTweets simdjson: DistinctUserID sqlite: 1 stockfish: Total Time build-eigen: Time To Compile build-ffmpeg: Time To Compile hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA vkfft: vkmark: 1280 x 1024 vkmark: 1920 x 1080 vkresample: 2x - Double vkresample: 2x - Single encode-wavpack: WAV To WavPack 1 2 3 6144009 7.97 20.42 135.29 1087.58 644.789 1.4 77661.557996 7332207 86.83917 1.75523 47.15443 1.31124 9.87479 0.01270 0.21035 5.14701 8711.352 12.781 43.12 10.63 9.18 15.31 9.86 15.44 3.88 31.51 111.37 30.82 24.95 68.5 57.17 47.11 18.60 43.07 10.62 9.19 15.33 9.84 15.32 3.86 31.55 111.29 30.75 24.93 68.43 57.15 47.19 18.60 9.44 15.7995 15.5361 6.97730 4.69411 20.7537 20.0429 24.5757 24.0199 20.0642 13.3641 12994.0 7188.49 13022.3 7192.68 6.03889 12989.9 7184.04 12.3520 9.593 657887 0.253 0.779 1.035 2.404 0.49 0.37 0.6 0.62 30.315 4232953 92.847 232.085 114.826 14.000 1332 905 614 1011.320 523.034 16.693 6257439 7.97 20.42 135.42 1104.70 648.380 1.4 77631.826919 7369776 85.29487 1.76210 47.50403 1.46076 9.88819 0.01267 0.20996 5.18462 8813.097 12.752 43.24 10.65 9.18 15.33 9.91 15.41 3.87 31.51 111.63 30.75 24.96 68.42 57.23 47.28 18.62 43.20 10.64 9.15 15.32 9.87 15.36 3.87 31.53 111.32 30.76 24.93 68.36 57.18 47.24 18.67 9.34 15.6580 14.8365 6.97030 4.81695 20.6337 19.9684 24.6236 23.9439 20.0247 13.3447 13403.5 7424.10 13494.4 7437.08 6.03112 13484.9 7447.30 12.3525 9.597 658126 0.248 0.779 1.038 2.406 0.49 0.37 0.6 0.62 31.916 4043871 92.730 232.002 114.834 13.867 1335 906 614 1010.740 532.957 16.696 6240412 7.96 20.42 135.31 1088.46 645.349 1.4 78242.226116 7347021 86.81380 1.75386 47.69460 1.46079 9.82052 0.01277 0.20908 4.90255 8644.793 12.768 43.24 10.67 9.19 15.31 9.86 15.36 3.88 31.52 111.59 30.74 24.94 68.42 57.26 47.15 18.60 43.26 10.66 9.15 15.32 9.86 15.37 3.87 31.48 111.09 30.72 24.90 68.42 57.19 47.26 18.58 9.34 15.7434 16.1817 7.01559 4.70981 20.7444 19.8612 24.6031 24.0531 19.9289 13.3239 13012.2 7201.09 13017.5 7207.99 6.05271 13005.4 7201.51 12.3401 9.599 656295 0.251 0.776 1.036 2.406 0.49 0.37 0.60 0.62 31.702 4101980 92.920 231.806 114.826 14.004 1332 881 606 1006.890 538.286 16.709 OpenBenchmarking.org
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth 1 2 3 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 29250.15, N = 3 SE +/- 82994.07, N = 3 SE +/- 17217.84, N = 3 6144009 6257439 6240412
ASTC Encoder Preset: Fast OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 7.97 7.97 7.96 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium 1 2 3 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 20.42 20.42 20.42 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough 1 2 3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 135.29 135.42 135.31 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive 1 2 3 200 400 600 800 1000 SE +/- 0.37, N = 3 SE +/- 8.12, N = 3 SE +/- 0.17, N = 3 1087.58 1104.70 1088.46 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile 1 2 3 140 280 420 560 700 SE +/- 0.27, N = 3 SE +/- 0.56, N = 3 SE +/- 0.41, N = 3 644.79 648.38 645.35
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup 1 2 3 0.315 0.63 0.945 1.26 1.575 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.4 1.4 1.4 1. (CC) gcc options: -fopenmp -O3 -lm
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 1 2 3 20K 40K 60K 80K 100K SE +/- 672.82, N = 3 SE +/- 652.47, N = 3 SE +/- 631.76, N = 3 77661.56 77631.83 78242.23 1. (CC) gcc options: -O2 -lrt" -lrt
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time 1 2 3 1.6M 3.2M 4.8M 6.4M 8M SE +/- 19203.96, N = 3 SE +/- 11879.59, N = 3 SE +/- 3975.85, N = 3 7332207 7369776 7347021 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
HPC Challenge Test / Class: G-HPL OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-HPL 1 2 3 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.24, N = 3 SE +/- 0.15, N = 3 86.84 85.29 86.81 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
HPC Challenge Test / Class: G-Ffte OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ffte 1 2 3 0.3965 0.793 1.1895 1.586 1.9825 SE +/- 0.00362, N = 3 SE +/- 0.00193, N = 3 SE +/- 0.00359, N = 3 1.75523 1.76210 1.75386 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
HPC Challenge Test / Class: EP-DGEMM OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: EP-DGEMM 1 2 3 11 22 33 44 55 SE +/- 0.52, N = 3 SE +/- 0.13, N = 3 SE +/- 0.10, N = 3 47.15 47.50 47.69 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
HPC Challenge Test / Class: G-Ptrans OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ptrans 1 2 3 0.3287 0.6574 0.9861 1.3148 1.6435 SE +/- 0.15709, N = 3 SE +/- 0.00835, N = 3 SE +/- 0.00604, N = 3 1.31124 1.46076 1.46079 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
HPC Challenge Test / Class: EP-STREAM Triad OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: EP-STREAM Triad 1 2 3 3 6 9 12 15 SE +/- 0.01832, N = 3 SE +/- 0.00967, N = 3 SE +/- 0.01214, N = 3 9.87479 9.88819 9.82052 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
HPC Challenge Test / Class: G-Random Access OpenBenchmarking.org GUP/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Random Access 1 2 3 0.0029 0.0058 0.0087 0.0116 0.0145 SE +/- 0.00008, N = 3 SE +/- 0.00011, N = 3 SE +/- 0.00004, N = 3 0.01270 0.01267 0.01277 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
HPC Challenge Test / Class: Random Ring Latency OpenBenchmarking.org usecs, Fewer Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Latency 1 2 3 0.0473 0.0946 0.1419 0.1892 0.2365 SE +/- 0.00110, N = 3 SE +/- 0.00200, N = 3 SE +/- 0.00069, N = 3 0.21035 0.20996 0.20908 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
HPC Challenge Test / Class: Random Ring Bandwidth OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Bandwidth 1 2 3 1.1665 2.333 3.4995 4.666 5.8325 SE +/- 0.09024, N = 3 SE +/- 0.08185, N = 3 SE +/- 0.08913, N = 3 5.14701 5.18462 4.90255 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
HPC Challenge Test / Class: Max Ping Pong Bandwidth OpenBenchmarking.org MB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Max Ping Pong Bandwidth 1 2 3 2K 4K 6K 8K 10K SE +/- 98.36, N = 3 SE +/- 66.91, N = 3 SE +/- 80.85, N = 3 8711.35 8813.10 8644.79 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
Monkey Audio Encoding WAV To APE OpenBenchmarking.org Seconds, Fewer Is Better Monkey Audio Encoding 3.99.6 WAV To APE 1 2 3 3 6 9 12 15 SE +/- 0.04, N = 5 SE +/- 0.07, N = 5 SE +/- 0.07, N = 5 12.78 12.75 12.77 1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet 1 2 3 10 20 30 40 50 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 43.12 43.24 43.24 MIN: 42.84 / MAX: 53.15 MIN: 42.94 / MAX: 54.26 MIN: 42.96 / MAX: 53.6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 10.63 10.65 10.67 MIN: 10.46 / MAX: 12.77 MIN: 10.45 / MAX: 20.53 MIN: 10.49 / MAX: 25.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 9.18 9.18 9.19 MIN: 9.02 / MAX: 10.67 MIN: 9.03 / MAX: 10.57 MIN: 9.02 / MAX: 19.6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 15.31 15.33 15.31 MIN: 15.18 / MAX: 15.94 MIN: 15.14 / MAX: 26.25 MIN: 15.14 / MAX: 25.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 9.86 9.91 9.86 MIN: 9.73 / MAX: 20.02 MIN: 9.73 / MAX: 22 MIN: 9.74 / MAX: 10.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 1 2 3 4 8 12 16 20 SE +/- 0.04, N = 2 SE +/- 0.05, N = 2 SE +/- 0.01, N = 3 15.44 15.41 15.36 MIN: 15.2 / MAX: 25.72 MIN: 15.24 / MAX: 25.3 MIN: 15.2 / MAX: 16.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface 1 2 3 0.873 1.746 2.619 3.492 4.365 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 3.88 3.87 3.88 MIN: 3.82 / MAX: 4.53 MIN: 3.81 / MAX: 4.48 MIN: 3.82 / MAX: 4.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet 1 2 3 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 31.51 31.51 31.52 MIN: 31.3 / MAX: 41.74 MIN: 31.31 / MAX: 41.35 MIN: 31.29 / MAX: 41.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 1 2 3 20 40 60 80 100 SE +/- 0.11, N = 3 SE +/- 0.25, N = 3 SE +/- 0.22, N = 3 111.37 111.63 111.59 MIN: 110.74 / MAX: 121.81 MIN: 110.94 / MAX: 123.3 MIN: 110.86 / MAX: 122.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 1 2 3 7 14 21 28 35 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 30.82 30.75 30.74 MIN: 30.55 / MAX: 41.04 MIN: 30.52 / MAX: 41.61 MIN: 30.51 / MAX: 41.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 24.95 24.96 24.94 MIN: 24.75 / MAX: 34.42 MIN: 24.78 / MAX: 34.8 MIN: 24.72 / MAX: 35.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 1 2 3 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 68.50 68.42 68.42 MIN: 68.08 / MAX: 122.19 MIN: 68.09 / MAX: 79.66 MIN: 68.07 / MAX: 79.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny 1 2 3 13 26 39 52 65 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 57.17 57.23 57.26 MIN: 56.85 / MAX: 67.25 MIN: 56.88 / MAX: 68 MIN: 56.9 / MAX: 67.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd 1 2 3 11 22 33 44 55 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 47.11 47.28 47.15 MIN: 46.79 / MAX: 57.46 MIN: 46.79 / MAX: 58.17 MIN: 46.78 / MAX: 57.9 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 18.60 18.62 18.60 MIN: 18.49 / MAX: 28.74 MIN: 18.42 / MAX: 26.27 MIN: 18.41 / MAX: 29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mobilenet 1 2 3 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.11, N = 3 SE +/- 0.09, N = 3 43.07 43.20 43.26 MIN: 42.83 / MAX: 53.49 MIN: 42.87 / MAX: 53.52 MIN: 42.86 / MAX: 53.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 10.62 10.64 10.66 MIN: 10.45 / MAX: 12.13 MIN: 10.46 / MAX: 20.48 MIN: 10.45 / MAX: 21.04 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 9.19 9.15 9.15 MIN: 9.01 / MAX: 19.17 MIN: 9.02 / MAX: 10.69 MIN: 9.01 / MAX: 10.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 15.33 15.32 15.32 MIN: 15.18 / MAX: 25.5 MIN: 15.18 / MAX: 27.16 MIN: 15.15 / MAX: 25.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mnasnet 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 9.84 9.87 9.86 MIN: 9.75 / MAX: 10.52 MIN: 9.75 / MAX: 19.76 MIN: 9.72 / MAX: 20.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 2 15.32 15.36 15.37 MIN: 15.19 / MAX: 16 MIN: 15.2 / MAX: 25.21 MIN: 15.21 / MAX: 25.63 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: blazeface 1 2 3 0.8708 1.7416 2.6124 3.4832 4.354 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 3.86 3.87 3.87 MIN: 3.81 / MAX: 4.17 MIN: 3.82 / MAX: 4.01 MIN: 3.81 / MAX: 4.04 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: googlenet 1 2 3 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 31.55 31.53 31.48 MIN: 31.32 / MAX: 41.79 MIN: 31.34 / MAX: 41.26 MIN: 31.29 / MAX: 41.94 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: vgg16 1 2 3 20 40 60 80 100 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 111.29 111.32 111.09 MIN: 110.65 / MAX: 122.03 MIN: 110.69 / MAX: 121.89 MIN: 110.52 / MAX: 121.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet18 1 2 3 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 30.75 30.76 30.72 MIN: 30.52 / MAX: 40.83 MIN: 30.56 / MAX: 41.08 MIN: 30.5 / MAX: 40.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: alexnet 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 24.93 24.93 24.90 MIN: 24.74 / MAX: 35 MIN: 24.67 / MAX: 34.83 MIN: 24.72 / MAX: 35.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet50 1 2 3 15 30 45 60 75 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 68.43 68.36 68.42 MIN: 68.13 / MAX: 78.66 MIN: 68.07 / MAX: 79.76 MIN: 68.07 / MAX: 79.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 13 26 39 52 65 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 57.15 57.18 57.19 MIN: 56.82 / MAX: 67.66 MIN: 56.79 / MAX: 67.52 MIN: 56.86 / MAX: 67.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: squeezenet_ssd 1 2 3 11 22 33 44 55 SE +/- 0.11, N = 3 SE +/- 0.12, N = 3 SE +/- 0.11, N = 3 47.19 47.24 47.26 MIN: 46.77 / MAX: 98.44 MIN: 46.79 / MAX: 55.85 MIN: 46.76 / MAX: 57.51 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 18.60 18.67 18.58 MIN: 18.4 / MAX: 28.6 MIN: 18.47 / MAX: 29.58 MIN: 18.48 / MAX: 28.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark 1 2 3 3 6 9 12 15 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 9.44 9.34 9.34 1. Nodejs
v12.18.2
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 2 3 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 15.80 15.66 15.74 MIN: 14.73 MIN: 14.75 MIN: 14.72 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 15.54 14.84 16.18 MIN: 15.01 MIN: 14.45 MIN: 15.82 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.01584, N = 3 SE +/- 0.01316, N = 3 SE +/- 0.03839, N = 3 6.97730 6.97030 7.01559 MIN: 6.86 MIN: 6.87 MIN: 6.87 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 2 3 1.0838 2.1676 3.2514 4.3352 5.419 SE +/- 0.05317, N = 3 SE +/- 0.00803, N = 3 SE +/- 0.00674, N = 3 4.69411 4.81695 4.70981 MIN: 4.48 MIN: 4.39 MIN: 4.56 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 20.75 20.63 20.74 MIN: 20.42 MIN: 20.39 MIN: 20.47 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 20.04 19.97 19.86 MIN: 19.64 MIN: 19.66 MIN: 19.64 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 2 3 6 12 18 24 30 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 24.58 24.62 24.60 MIN: 24.35 MIN: 24.37 MIN: 24.43 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 24.02 23.94 24.05 MIN: 23.78 MIN: 23.7 MIN: 23.81 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.24, N = 3 SE +/- 0.26, N = 3 SE +/- 0.04, N = 3 20.06 20.02 19.93 MIN: 19.52 MIN: 19.56 MIN: 19.62 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 13.36 13.34 13.32 MIN: 13.2 MIN: 13.14 MIN: 13.18 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 3K 6K 9K 12K 15K SE +/- 4.10, N = 3 SE +/- 76.51, N = 3 SE +/- 7.57, N = 3 12994.0 13403.5 13012.2 MIN: 12966.6 MIN: 13048.6 MIN: 12979.3 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 1600 3200 4800 6400 8000 SE +/- 4.08, N = 3 SE +/- 4.88, N = 3 SE +/- 2.75, N = 3 7188.49 7424.10 7201.09 MIN: 7171.82 MIN: 7267.22 MIN: 7180.72 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 2 3 3K 6K 9K 12K 15K SE +/- 34.32, N = 3 SE +/- 5.06, N = 3 SE +/- 6.69, N = 3 13022.3 13494.4 13017.5 MIN: 12970.1 MIN: 13309.7 MIN: 12988.3 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 2 3 1600 3200 4800 6400 8000 SE +/- 5.80, N = 3 SE +/- 4.96, N = 3 SE +/- 7.86, N = 3 7192.68 7437.08 7207.99 MIN: 7171.23 MIN: 7270.53 MIN: 7180.31 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.00365, N = 3 SE +/- 0.00495, N = 3 SE +/- 0.00775, N = 3 6.03889 6.03112 6.05271 MIN: 5.92 MIN: 5.89 MIN: 5.92 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 3K 6K 9K 12K 15K SE +/- 2.60, N = 3 SE +/- 28.75, N = 3 SE +/- 5.12, N = 3 12989.9 13484.9 13005.4 MIN: 12963.9 MIN: 13078.8 MIN: 12979.2 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 1600 3200 4800 6400 8000 SE +/- 4.85, N = 3 SE +/- 7.22, N = 3 SE +/- 5.32, N = 3 7184.04 7447.30 7201.51 MIN: 7162.14 MIN: 7277.81 MIN: 7179.7 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.35 12.35 12.34 MIN: 12.13 MIN: 12.09 MIN: 12.1 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode 1 2 3 3 6 9 12 15 SE +/- 0.021, N = 5 SE +/- 0.022, N = 5 SE +/- 0.022, N = 5 9.593 9.597 9.599 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite 1 2 3 140K 280K 420K 560K 700K SE +/- 1373.01, N = 3 SE +/- 1397.02, N = 3 SE +/- 1397.20, N = 3 657887 658126 656295
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 1 1 2 3 0.0569 0.1138 0.1707 0.2276 0.2845 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 0.253 0.248 0.251
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 5 1 2 3 0.1753 0.3506 0.5259 0.7012 0.8765 SE +/- 0.003, N = 3 SE +/- 0.003, N = 3 SE +/- 0.001, N = 3 0.779 0.779 0.776
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 6 1 2 3 0.2336 0.4672 0.7008 0.9344 1.168 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 1.035 1.038 1.036
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 10 1 2 3 0.5414 1.0828 1.6242 2.1656 2.707 SE +/- 0.008, N = 3 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 2.404 2.406 2.406
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya 1 2 3 0.1103 0.2206 0.3309 0.4412 0.5515 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.49 0.49 0.49 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom 1 2 3 0.0833 0.1666 0.2499 0.3332 0.4165 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.37 0.37 0.37 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets 1 2 3 0.135 0.27 0.405 0.54 0.675 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.60 0.60 0.60 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID 1 2 3 0.1395 0.279 0.4185 0.558 0.6975 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.62 0.62 0.62 1. (CXX) g++ options: -O3 -pthread
SQLite Threads / Copies: 1 OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.30.1 Threads / Copies: 1 1 2 3 7 14 21 28 35 SE +/- 0.36, N = 15 SE +/- 0.04, N = 3 SE +/- 0.36, N = 3 30.32 31.92 31.70 1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time 1 2 3 900K 1800K 2700K 3600K 4500K SE +/- 40004.50, N = 3 SE +/- 24874.87, N = 3 SE +/- 14449.66, N = 3 4232953 4043871 4101980 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile 1 2 3 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 92.85 92.73 92.92
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile 1 2 3 50 100 150 200 250 SE +/- 0.36, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 232.09 232.00 231.81
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 30 60 90 120 150 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 114.83 114.83 114.83 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.11, N = 3 14.00 13.87 14.00 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.1.1 1 2 3 300 600 900 1200 1500 SE +/- 1.67, N = 3 SE +/- 1.20, N = 3 1332 1335 1332 1. (CXX) g++ options: -O3 -pthread
VKMark Resolution: 1280 x 1024 OpenBenchmarking.org VKMark Score, More Is Better VKMark 2020-05-21 Resolution: 1280 x 1024 1 2 3 200 400 600 800 1000 SE +/- 1.00, N = 3 SE +/- 0.58, N = 3 SE +/- 3.28, N = 3 905 906 881 1. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF
VKMark Resolution: 1920 x 1080 OpenBenchmarking.org VKMark Score, More Is Better VKMark 2020-05-21 Resolution: 1920 x 1080 1 2 3 130 260 390 520 650 SE +/- 1.20, N = 3 SE +/- 0.58, N = 3 SE +/- 7.51, N = 3 614 614 606 1. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF
VkResample Upscale: 2x - Precision: Double OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Double 1 2 3 200 400 600 800 1000 SE +/- 5.17, N = 3 SE +/- 4.98, N = 3 SE +/- 2.44, N = 3 1011.32 1010.74 1006.89 1. (CXX) g++ options: -O3 -pthread
VkResample Upscale: 2x - Precision: Single OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Single 1 2 3 120 240 360 480 600 SE +/- 3.39, N = 3 SE +/- 4.10, N = 3 SE +/- 4.21, N = 3 523.03 532.96 538.29 1. (CXX) g++ options: -O3 -pthread
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack 1 2 3 4 8 12 16 20 SE +/- 0.00, N = 5 SE +/- 0.00, N = 5 SE +/- 0.02, N = 5 16.69 16.70 16.71 1. (CXX) g++ options: -rdynamic
Phoronix Test Suite v10.8.4