8100 Intel Core i3 Linux Intel Core i3-8100 testing with a ASRock Z370M-ITX/ac (P4.10 BIOS) and Intel 8th Gen Core Gaussian Mixture Model 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009309-FI-8100INTEL87&grs&rdt .
8100 Intel Core i3 Linux Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Core i3-8100 @ 3.60GHz (4 Cores) ASRock Z370M-ITX/ac (P4.10 BIOS) Intel 8th Gen Core 4-core Desktop 8GB 60GB DREVO X1 SSD Intel 8th Gen Core Gaussian Mixture Model 3GB (1100MHz) Realtek ALC892 VA2431 Intel I219-V + Intel I211 + Intel Dual Band-AC 3168NGW Ubuntu 20.04 5.9.0-050900rc1daily20200819-generic (x86_64) 20200818 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.8 1.2.131 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 Python Details - Python 3.8.2 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
8100 Intel Core i3 Linux couchdb: 100 - 1000 - 24 mlpack: scikit_linearridgeregression gromacs: Water Benchmark lczero: BLAS lczero: Eigen ncnn: CPU - vgg16 byte: Dhrystone 2 mlpack: scikit_ica caffe: GoogleNet - CPU - 200 keydb: ffte: N=256, 3D Complex FFT Routine mlpack: scikit_qda mafft: Multiple Sequence Alignment - LSU RNA ncnn: Vulkan GPU - mobilenet ncnn: CPU - alexnet ncnn: CPU - blazeface webp: Quality 100, Lossless, Highest Compression ncnn: CPU - resnet18 webp: Quality 100, Lossless webp: Quality 100, Highest Compression dolfyn: Computational Fluid Dynamics ncnn: CPU - efficientnet-b0 ncnn: CPU - googlenet hint: FLOAT ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - squeezenet tnn: CPU - MobileNet v2 ncnn: Vulkan GPU - shufflenet-v2 ncnn: CPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: CPU - yolov4-tiny ncnn: CPU - mobilenet libraw: Post-Processing Benchmark lczero: Rand webp: Quality 100 ncnn: CPU - squeezenet ncnn: Vulkan GPU - mnasnet ncnn: CPU-v3-v3 - mobilenet-v3 caffe: AlexNet - CPU - 200 ncnn: Vulkan GPU - vgg16 ncnn: CPU - resnet50 ncnn: Vulkan GPU - efficientnet-b0 webp: Default mlpack: scikit_svm caffe: GoogleNet - CPU - 100 realsr-ncnn: 4x - No ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - googlenet tnn: CPU - SqueezeNet v1.1 hmmer: Pfam Database Search caffe: AlexNet - CPU - 100 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: CPU - mnasnet ncnn: CPU-v2-v2 - mobilenet-v2 influxdb: 64 - 10000 - 2,5000,1 - 10000 influxdb: 4 - 10000 - 2,5000,1 - 10000 ncnn: Vulkan GPU - blazeface espeak: Text-To-Speech Synthesis 1 2 3 188.832 4.27 0.509 939 902 90.48 35583808.2 58.73 173170 273760.53 19448.140985465 75.66 12.055 38.0 23.84 1.77 50.981 25.69 21.527 8.783 22.800 10.30 24.72 362585787.01509 45.28 43.34 360.909 8.63 4.47 13.65 42.21 28.36 24.48 240816 2.935 24.75 12.60 6.35 69577 197.87 48.51 25.49 1.867 29.09 85352 250.642 76.05 34.66 372.017 126.136 34832 29.92 73.36 12.36 6.50 7.15 953921.4 843649.2 2.95 34.775 192.701 4.19 0.520 944 893 88.71 35652755.4 59.29 170836 270281.26 19239.678420561 76.53 12.143 37.75 23.82 1.76 51.099 25.67 21.457 8.750 22.733 10.33 24.73 361612815.49080 45.27 43.27 361.397 8.61 4.47 13.64 42.16 28.32 24.44 241071 2.930 24.71 12.58 6.35 69582 198.06 48.48 25.46 1.865 29.07 85366 250.420 76.09 34.64 371.816 126.135 34817 29.92 73.37 12.36 6.50 7.15 793797.4 827692.2 2.77 35.514 198.547 4.29 0.515 959 911 88.76 35085353.6 59.60 171312 270228.34 19206.929427191 75.90 12.043 37.71 24.00 1.76 51.207 25.58 21.439 8.754 22.744 10.31 24.79 362494358.43670 45.16 43.23 361.823 8.61 4.48 13.62 42.25 28.38 24.43 240592 2.933 24.72 12.58 6.36 69483 197.78 48.54 25.46 1.866 29.06 85438 250.580 76.10 34.64 371.851 126.190 34817 29.93 73.36 12.36 6.50 7.15 684847.8 342313.8 2.78 51.926 OpenBenchmarking.org
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 1 2 3 40 80 120 160 200 SE +/- 0.52, N = 3 SE +/- 1.00, N = 3 SE +/- 2.75, N = 3 188.83 192.70 198.55 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression 1 2 3 0.9653 1.9306 2.8959 3.8612 4.8265 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 4.27 4.19 4.29
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark 1 2 3 0.117 0.234 0.351 0.468 0.585 SE +/- 0.009, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.509 0.520 0.515 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS 1 2 3 200 400 600 800 1000 SE +/- 4.58, N = 3 SE +/- 4.63, N = 3 SE +/- 6.08, N = 3 939 944 959 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen 1 2 3 200 400 600 800 1000 SE +/- 0.88, N = 3 SE +/- 3.28, N = 3 SE +/- 7.51, N = 3 902 893 911 1. (CXX) g++ options: -flto -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 1 2 3 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 90.48 88.71 88.76 MIN: 90.08 / MAX: 113.36 MIN: 88.5 / MAX: 99.21 MIN: 88.55 / MAX: 100.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 1 2 3 8M 16M 24M 32M 40M SE +/- 106176.28, N = 3 SE +/- 106435.80, N = 3 SE +/- 422950.56, N = 3 35583808.2 35652755.4 35085353.6
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica 1 2 3 13 26 39 52 65 SE +/- 0.76, N = 5 SE +/- 0.33, N = 3 SE +/- 0.04, N = 3 58.73 59.29 59.60
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 1 2 3 40K 80K 120K 160K 200K SE +/- 2420.78, N = 4 SE +/- 72.06, N = 3 SE +/- 546.11, N = 3 173170 170836 171312 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 1 2 3 60K 120K 180K 240K 300K SE +/- 1001.21, N = 3 SE +/- 1044.91, N = 3 SE +/- 2203.01, N = 3 273760.53 270281.26 270228.34 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 1 2 3 4K 8K 12K 16K 20K SE +/- 38.14, N = 3 SE +/- 131.88, N = 3 SE +/- 40.31, N = 3 19448.14 19239.68 19206.93 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda 1 2 3 20 40 60 80 100 SE +/- 0.95, N = 3 SE +/- 0.99, N = 4 SE +/- 0.37, N = 3 75.66 76.53 75.90
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 3 6 9 12 15 SE +/- 0.10, N = 15 SE +/- 0.17, N = 3 SE +/- 0.17, N = 14 12.06 12.14 12.04 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet 1 2 3 9 18 27 36 45 SE +/- 0.22, N = 3 SE +/- 0.31, N = 3 SE +/- 0.28, N = 3 38.00 37.75 37.71 MIN: 35.24 / MAX: 45.19 MIN: 35.25 / MAX: 52.25 MIN: 36.17 / MAX: 39.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 23.84 23.82 24.00 MIN: 23.77 / MAX: 33.67 MIN: 23.75 / MAX: 26.11 MIN: 23.93 / MAX: 33.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface 1 2 3 0.3983 0.7966 1.1949 1.5932 1.9915 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.77 1.76 1.76 MIN: 1.74 / MAX: 1.9 MIN: 1.74 / MAX: 1.88 MIN: 1.74 / MAX: 1.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression 1 2 3 12 24 36 48 60 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 50.98 51.10 51.21 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 1 2 3 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 25.69 25.67 25.58 MIN: 25.51 / MAX: 72.98 MIN: 25.5 / MAX: 66.83 MIN: 25.51 / MAX: 27.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless 1 2 3 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 21.53 21.46 21.44 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression 1 2 3 2 4 6 8 10 SE +/- 0.005, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 8.783 8.750 8.754 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 1 2 3 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 22.80 22.73 22.74
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 10.30 10.33 10.31 MIN: 10.26 / MAX: 10.44 MIN: 10.26 / MAX: 21.52 MIN: 10.27 / MAX: 10.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet 1 2 3 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 24.72 24.73 24.79 MIN: 24.62 / MAX: 25.88 MIN: 24.65 / MAX: 34.69 MIN: 24.64 / MAX: 65.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT 1 2 3 80M 160M 240M 320M 400M SE +/- 273136.79, N = 3 SE +/- 982928.80, N = 3 SE +/- 317316.98, N = 3 362585787.02 361612815.49 362494358.44 1. (CC) gcc options: -O3 -march=native -lm
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet 1 2 3 10 20 30 40 50 SE +/- 0.10, N = 3 SE +/- 0.49, N = 3 SE +/- 0.16, N = 3 45.28 45.27 45.16 MIN: 43.52 / MAX: 47.31 MIN: 43.01 / MAX: 48.32 MIN: 43.29 / MAX: 47.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet 1 2 3 10 20 30 40 50 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 43.34 43.27 43.23 MIN: 42.81 / MAX: 43.82 MIN: 42.83 / MAX: 44.11 MIN: 42.79 / MAX: 43.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 80 160 240 320 400 SE +/- 0.34, N = 3 SE +/- 0.57, N = 3 SE +/- 0.40, N = 3 360.91 361.40 361.82 MIN: 359.32 / MAX: 362.5 MIN: 359.36 / MAX: 363.38 MIN: 360.41 / MAX: 366.13 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 8.63 8.61 8.61 MIN: 8.56 / MAX: 8.72 MIN: 8.58 / MAX: 8.68 MIN: 8.57 / MAX: 8.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 1 2 3 1.008 2.016 3.024 4.032 5.04 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 4.47 4.47 4.48 MIN: 4.44 / MAX: 5.65 MIN: 4.43 / MAX: 5.57 MIN: 4.43 / MAX: 5.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 13.65 13.64 13.62 MIN: 12.7 / MAX: 14.54 MIN: 13.3 / MAX: 13.68 MIN: 13.26 / MAX: 13.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny 1 2 3 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 42.21 42.16 42.25 MIN: 42.07 / MAX: 52.08 MIN: 42.02 / MAX: 44.35 MIN: 42.09 / MAX: 52.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet 1 2 3 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 28.36 28.32 28.38 MIN: 28.21 / MAX: 37.77 MIN: 28.2 / MAX: 30.13 MIN: 28.2 / MAX: 54.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark 1 2 3 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 24.48 24.44 24.43 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
LeelaChessZero Backend: Random OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Random 1 2 3 50K 100K 150K 200K 250K SE +/- 271.35, N = 3 SE +/- 399.00, N = 3 SE +/- 308.93, N = 3 240816 241071 240592 1. (CXX) g++ options: -flto -pthread
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 1 2 3 0.6604 1.3208 1.9812 2.6416 3.302 SE +/- 0.006, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 2.935 2.930 2.933 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 24.75 24.71 24.72 MIN: 24.65 / MAX: 34.64 MIN: 24.62 / MAX: 34.65 MIN: 24.64 / MAX: 27.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.60 12.58 12.58 MIN: 12.57 / MAX: 12.67 MIN: 11.94 / MAX: 12.67 MIN: 12.55 / MAX: 12.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.35 6.35 6.36 MIN: 6.28 / MAX: 8.14 MIN: 6.28 / MAX: 7.54 MIN: 6.29 / MAX: 7.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 1 2 3 15K 30K 45K 60K 75K SE +/- 36.98, N = 3 SE +/- 65.90, N = 3 SE +/- 27.67, N = 3 69577 69582 69483 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 1 2 3 40 80 120 160 200 SE +/- 0.08, N = 3 SE +/- 0.20, N = 3 SE +/- 0.21, N = 3 197.87 198.06 197.78 MIN: 196.37 / MAX: 199.96 MIN: 196.4 / MAX: 200.15 MIN: 196.24 / MAX: 199.59 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 1 2 3 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 48.51 48.48 48.54 MIN: 48.35 / MAX: 55.69 MIN: 48.31 / MAX: 60.48 MIN: 48.37 / MAX: 57.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 25.49 25.46 25.46 MIN: 24.77 / MAX: 25.57 MIN: 25.1 / MAX: 25.53 MIN: 24.48 / MAX: 25.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default 1 2 3 0.4201 0.8402 1.2603 1.6804 2.1005 SE +/- 0.002, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 1.867 1.865 1.866 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm 1 2 3 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 29.09 29.07 29.06
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 2 3 20K 40K 60K 80K 100K SE +/- 35.92, N = 3 SE +/- 58.89, N = 3 SE +/- 58.71, N = 3 85352 85366 85438 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No 1 2 3 50 100 150 200 250 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 250.64 250.42 250.58
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 SE +/- 0.07, N = 3 76.05 76.09 76.10 MIN: 74.54 / MAX: 107.02 MIN: 67.88 / MAX: 109.29 MIN: 74.55 / MAX: 87.04 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 1 2 3 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 34.66 34.64 34.64 MIN: 33.97 / MAX: 34.82 MIN: 34.15 / MAX: 34.75 MIN: 33.94 / MAX: 34.9 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 80 160 240 320 400 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 372.02 371.82 371.85 MIN: 371.32 / MAX: 372.59 MIN: 371.21 / MAX: 372.33 MIN: 371.39 / MAX: 372.51 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 126.14 126.14 126.19 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 1 2 3 7K 14K 21K 28K 35K SE +/- 13.09, N = 3 SE +/- 4.70, N = 3 SE +/- 17.49, N = 3 34832 34817 34817 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 1 2 3 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 29.92 29.92 29.93 MIN: 29.59 / MAX: 30.03 MIN: 29.35 / MAX: 30.2 MIN: 29.35 / MAX: 30.14 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 1 2 3 16 32 48 64 80 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 73.36 73.37 73.36 MIN: 72.89 / MAX: 73.6 MIN: 72.72 / MAX: 74.16 MIN: 72.79 / MAX: 73.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 12.36 12.36 12.36 MIN: 12.12 / MAX: 12.6 MIN: 12 / MAX: 12.62 MIN: 12.11 / MAX: 12.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.50 6.50 6.50 MIN: 6.45 / MAX: 7.93 MIN: 6.45 / MAX: 7.53 MIN: 6.47 / MAX: 7.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 7.15 7.15 7.15 MIN: 7.09 / MAX: 8.64 MIN: 7.08 / MAX: 8.66 MIN: 7.08 / MAX: 9.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 3 200K 400K 600K 800K 1000K SE +/- 1324.57, N = 3 SE +/- 43294.05, N = 9 SE +/- 5759.46, N = 3 953921.4 793797.4 684847.8
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 3 200K 400K 600K 800K 1000K SE +/- 2560.05, N = 3 SE +/- 5278.70, N = 3 SE +/- 16232.59, N = 6 843649.2 827692.2 342313.8
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface 1 2 3 0.6638 1.3276 1.9914 2.6552 3.319 SE +/- 0.09, N = 3 SE +/- 0.21, N = 3 SE +/- 0.14, N = 3 2.95 2.77 2.78 MIN: 2.55 / MAX: 3.98 MIN: 2.35 / MAX: 16.04 MIN: 2.35 / MAX: 3.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis 1 2 3 12 24 36 48 60 SE +/- 0.44, N = 4 SE +/- 0.22, N = 4 SE +/- 1.30, N = 20 34.78 35.51 51.93 1. (CC) gcc options: -O2 -std=c99
Phoronix Test Suite v10.8.4