8100 Intel Core i3 Linux Intel Core i3-8100 testing with a ASRock Z370M-ITX/ac (P4.10 BIOS) and Intel 8th Gen Core Gaussian Mixture Model 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009309-FI-8100INTEL87 .
8100 Intel Core i3 Linux Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Core i3-8100 @ 3.60GHz (4 Cores) ASRock Z370M-ITX/ac (P4.10 BIOS) Intel 8th Gen Core 4-core Desktop 8GB 60GB DREVO X1 SSD Intel 8th Gen Core Gaussian Mixture Model 3GB (1100MHz) Realtek ALC892 VA2431 Intel I219-V + Intel I211 + Intel Dual Band-AC 3168NGW Ubuntu 20.04 5.9.0-050900rc1daily20200819-generic (x86_64) 20200818 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.8 1.2.131 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 Python Details - Python 3.8.2 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
8100 Intel Core i3 Linux realsr-ncnn: 4x - No lczero: BLAS lczero: Eigen lczero: Rand dolfyn: Computational Fluid Dynamics ffte: N=256, 3D Complex FFT Routine hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA webp: Default webp: Quality 100 webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression byte: Dhrystone 2 libraw: Post-Processing Benchmark espeak: Text-To-Speech Synthesis couchdb: 100 - 1000 - 24 keydb: gromacs: Water Benchmark caffe: AlexNet - CPU - 100 caffe: AlexNet - CPU - 200 caffe: GoogleNet - CPU - 100 caffe: GoogleNet - CPU - 200 ncnn: CPU - squeezenet ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 hint: FLOAT mlpack: scikit_ica mlpack: scikit_qda mlpack: scikit_svm mlpack: scikit_linearridgeregression influxdb: 4 - 10000 - 2,5000,1 - 10000 influxdb: 64 - 10000 - 2,5000,1 - 10000 1 2 3 250.642 939 902 240816 22.800 19448.140985465 126.136 12.055 1.867 2.935 21.527 8.783 50.981 35583808.2 24.48 34.775 188.832 273760.53 0.509 34832 69577 85352 173170 24.75 28.36 7.15 6.35 4.47 6.50 10.30 1.77 24.72 90.48 25.69 23.84 48.51 42.21 43.34 38.0 12.36 13.65 8.63 12.60 25.49 2.95 34.66 197.87 29.92 45.28 73.36 76.05 360.909 372.017 362585787.01509 58.73 75.66 29.09 4.27 843649.2 953921.4 250.420 944 893 241071 22.733 19239.678420561 126.135 12.143 1.865 2.930 21.457 8.750 51.099 35652755.4 24.44 35.514 192.701 270281.26 0.520 34817 69582 85366 170836 24.71 28.32 7.15 6.35 4.47 6.50 10.33 1.76 24.73 88.71 25.67 23.82 48.48 42.16 43.27 37.75 12.36 13.64 8.61 12.58 25.46 2.77 34.64 198.06 29.92 45.27 73.37 76.09 361.397 371.816 361612815.49080 59.29 76.53 29.07 4.19 827692.2 793797.4 250.580 959 911 240592 22.744 19206.929427191 126.190 12.043 1.866 2.933 21.439 8.754 51.207 35085353.6 24.43 51.926 198.547 270228.34 0.515 34817 69483 85438 171312 24.72 28.38 7.15 6.36 4.48 6.50 10.31 1.76 24.79 88.76 25.58 24.00 48.54 42.25 43.23 37.71 12.36 13.62 8.61 12.58 25.46 2.78 34.64 197.78 29.93 45.16 73.36 76.10 361.823 371.851 362494358.43670 59.60 75.90 29.06 4.29 342313.8 684847.8 OpenBenchmarking.org
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No 1 2 3 50 100 150 200 250 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 250.64 250.42 250.58
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS 1 2 3 200 400 600 800 1000 SE +/- 4.58, N = 3 SE +/- 4.63, N = 3 SE +/- 6.08, N = 3 939 944 959 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen 1 2 3 200 400 600 800 1000 SE +/- 0.88, N = 3 SE +/- 3.28, N = 3 SE +/- 7.51, N = 3 902 893 911 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Random OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Random 1 2 3 50K 100K 150K 200K 250K SE +/- 271.35, N = 3 SE +/- 399.00, N = 3 SE +/- 308.93, N = 3 240816 241071 240592 1. (CXX) g++ options: -flto -pthread
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 1 2 3 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 22.80 22.73 22.74
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 1 2 3 4K 8K 12K 16K 20K SE +/- 38.14, N = 3 SE +/- 131.88, N = 3 SE +/- 40.31, N = 3 19448.14 19239.68 19206.93 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 126.14 126.14 126.19 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 3 6 9 12 15 SE +/- 0.10, N = 15 SE +/- 0.17, N = 3 SE +/- 0.17, N = 14 12.06 12.14 12.04 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default 1 2 3 0.4201 0.8402 1.2603 1.6804 2.1005 SE +/- 0.002, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 1.867 1.865 1.866 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 1 2 3 0.6604 1.3208 1.9812 2.6416 3.302 SE +/- 0.006, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 2.935 2.930 2.933 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless 1 2 3 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 21.53 21.46 21.44 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression 1 2 3 2 4 6 8 10 SE +/- 0.005, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 8.783 8.750 8.754 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression 1 2 3 12 24 36 48 60 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 50.98 51.10 51.21 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 1 2 3 8M 16M 24M 32M 40M SE +/- 106176.28, N = 3 SE +/- 106435.80, N = 3 SE +/- 422950.56, N = 3 35583808.2 35652755.4 35085353.6
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark 1 2 3 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 24.48 24.44 24.43 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis 1 2 3 12 24 36 48 60 SE +/- 0.44, N = 4 SE +/- 0.22, N = 4 SE +/- 1.30, N = 20 34.78 35.51 51.93 1. (CC) gcc options: -O2 -std=c99
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 1 2 3 40 80 120 160 200 SE +/- 0.52, N = 3 SE +/- 1.00, N = 3 SE +/- 2.75, N = 3 188.83 192.70 198.55 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 1 2 3 60K 120K 180K 240K 300K SE +/- 1001.21, N = 3 SE +/- 1044.91, N = 3 SE +/- 2203.01, N = 3 273760.53 270281.26 270228.34 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark 1 2 3 0.117 0.234 0.351 0.468 0.585 SE +/- 0.009, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.509 0.520 0.515 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 1 2 3 7K 14K 21K 28K 35K SE +/- 13.09, N = 3 SE +/- 4.70, N = 3 SE +/- 17.49, N = 3 34832 34817 34817 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 1 2 3 15K 30K 45K 60K 75K SE +/- 36.98, N = 3 SE +/- 65.90, N = 3 SE +/- 27.67, N = 3 69577 69582 69483 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 2 3 20K 40K 60K 80K 100K SE +/- 35.92, N = 3 SE +/- 58.89, N = 3 SE +/- 58.71, N = 3 85352 85366 85438 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 1 2 3 40K 80K 120K 160K 200K SE +/- 2420.78, N = 4 SE +/- 72.06, N = 3 SE +/- 546.11, N = 3 173170 170836 171312 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 24.75 24.71 24.72 MIN: 24.65 / MAX: 34.64 MIN: 24.62 / MAX: 34.65 MIN: 24.64 / MAX: 27.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet 1 2 3 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 28.36 28.32 28.38 MIN: 28.21 / MAX: 37.77 MIN: 28.2 / MAX: 30.13 MIN: 28.2 / MAX: 54.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 7.15 7.15 7.15 MIN: 7.09 / MAX: 8.64 MIN: 7.08 / MAX: 8.66 MIN: 7.08 / MAX: 9.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.35 6.35 6.36 MIN: 6.28 / MAX: 8.14 MIN: 6.28 / MAX: 7.54 MIN: 6.29 / MAX: 7.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 1 2 3 1.008 2.016 3.024 4.032 5.04 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 4.47 4.47 4.48 MIN: 4.44 / MAX: 5.65 MIN: 4.43 / MAX: 5.57 MIN: 4.43 / MAX: 5.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.50 6.50 6.50 MIN: 6.45 / MAX: 7.93 MIN: 6.45 / MAX: 7.53 MIN: 6.47 / MAX: 7.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 10.30 10.33 10.31 MIN: 10.26 / MAX: 10.44 MIN: 10.26 / MAX: 21.52 MIN: 10.27 / MAX: 10.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface 1 2 3 0.3983 0.7966 1.1949 1.5932 1.9915 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.77 1.76 1.76 MIN: 1.74 / MAX: 1.9 MIN: 1.74 / MAX: 1.88 MIN: 1.74 / MAX: 1.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet 1 2 3 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 24.72 24.73 24.79 MIN: 24.62 / MAX: 25.88 MIN: 24.65 / MAX: 34.69 MIN: 24.64 / MAX: 65.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 1 2 3 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 90.48 88.71 88.76 MIN: 90.08 / MAX: 113.36 MIN: 88.5 / MAX: 99.21 MIN: 88.55 / MAX: 100.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 1 2 3 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 25.69 25.67 25.58 MIN: 25.51 / MAX: 72.98 MIN: 25.5 / MAX: 66.83 MIN: 25.51 / MAX: 27.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 23.84 23.82 24.00 MIN: 23.77 / MAX: 33.67 MIN: 23.75 / MAX: 26.11 MIN: 23.93 / MAX: 33.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 1 2 3 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 48.51 48.48 48.54 MIN: 48.35 / MAX: 55.69 MIN: 48.31 / MAX: 60.48 MIN: 48.37 / MAX: 57.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny 1 2 3 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 42.21 42.16 42.25 MIN: 42.07 / MAX: 52.08 MIN: 42.02 / MAX: 44.35 MIN: 42.09 / MAX: 52.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet 1 2 3 10 20 30 40 50 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 43.34 43.27 43.23 MIN: 42.81 / MAX: 43.82 MIN: 42.83 / MAX: 44.11 MIN: 42.79 / MAX: 43.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet 1 2 3 9 18 27 36 45 SE +/- 0.22, N = 3 SE +/- 0.31, N = 3 SE +/- 0.28, N = 3 38.00 37.75 37.71 MIN: 35.24 / MAX: 45.19 MIN: 35.25 / MAX: 52.25 MIN: 36.17 / MAX: 39.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 12.36 12.36 12.36 MIN: 12.12 / MAX: 12.6 MIN: 12 / MAX: 12.62 MIN: 12.11 / MAX: 12.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 13.65 13.64 13.62 MIN: 12.7 / MAX: 14.54 MIN: 13.3 / MAX: 13.68 MIN: 13.26 / MAX: 13.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 8.63 8.61 8.61 MIN: 8.56 / MAX: 8.72 MIN: 8.58 / MAX: 8.68 MIN: 8.57 / MAX: 8.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.60 12.58 12.58 MIN: 12.57 / MAX: 12.67 MIN: 11.94 / MAX: 12.67 MIN: 12.55 / MAX: 12.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 25.49 25.46 25.46 MIN: 24.77 / MAX: 25.57 MIN: 25.1 / MAX: 25.53 MIN: 24.48 / MAX: 25.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface 1 2 3 0.6638 1.3276 1.9914 2.6552 3.319 SE +/- 0.09, N = 3 SE +/- 0.21, N = 3 SE +/- 0.14, N = 3 2.95 2.77 2.78 MIN: 2.55 / MAX: 3.98 MIN: 2.35 / MAX: 16.04 MIN: 2.35 / MAX: 3.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 1 2 3 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 34.66 34.64 34.64 MIN: 33.97 / MAX: 34.82 MIN: 34.15 / MAX: 34.75 MIN: 33.94 / MAX: 34.9 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 1 2 3 40 80 120 160 200 SE +/- 0.08, N = 3 SE +/- 0.20, N = 3 SE +/- 0.21, N = 3 197.87 198.06 197.78 MIN: 196.37 / MAX: 199.96 MIN: 196.4 / MAX: 200.15 MIN: 196.24 / MAX: 199.59 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 1 2 3 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 29.92 29.92 29.93 MIN: 29.59 / MAX: 30.03 MIN: 29.35 / MAX: 30.2 MIN: 29.35 / MAX: 30.14 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet 1 2 3 10 20 30 40 50 SE +/- 0.10, N = 3 SE +/- 0.49, N = 3 SE +/- 0.16, N = 3 45.28 45.27 45.16 MIN: 43.52 / MAX: 47.31 MIN: 43.01 / MAX: 48.32 MIN: 43.29 / MAX: 47.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 1 2 3 16 32 48 64 80 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 73.36 73.37 73.36 MIN: 72.89 / MAX: 73.6 MIN: 72.72 / MAX: 74.16 MIN: 72.79 / MAX: 73.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 SE +/- 0.07, N = 3 76.05 76.09 76.10 MIN: 74.54 / MAX: 107.02 MIN: 67.88 / MAX: 109.29 MIN: 74.55 / MAX: 87.04 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 80 160 240 320 400 SE +/- 0.34, N = 3 SE +/- 0.57, N = 3 SE +/- 0.40, N = 3 360.91 361.40 361.82 MIN: 359.32 / MAX: 362.5 MIN: 359.36 / MAX: 363.38 MIN: 360.41 / MAX: 366.13 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 80 160 240 320 400 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 372.02 371.82 371.85 MIN: 371.32 / MAX: 372.59 MIN: 371.21 / MAX: 372.33 MIN: 371.39 / MAX: 372.51 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT 1 2 3 80M 160M 240M 320M 400M SE +/- 273136.79, N = 3 SE +/- 982928.80, N = 3 SE +/- 317316.98, N = 3 362585787.02 361612815.49 362494358.44 1. (CC) gcc options: -O3 -march=native -lm
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica 1 2 3 13 26 39 52 65 SE +/- 0.76, N = 5 SE +/- 0.33, N = 3 SE +/- 0.04, N = 3 58.73 59.29 59.60
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda 1 2 3 20 40 60 80 100 SE +/- 0.95, N = 3 SE +/- 0.99, N = 4 SE +/- 0.37, N = 3 75.66 76.53 75.90
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm 1 2 3 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 29.09 29.07 29.06
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression 1 2 3 0.9653 1.9306 2.8959 3.8612 4.8265 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 4.27 4.19 4.29
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 3 200K 400K 600K 800K 1000K SE +/- 2560.05, N = 3 SE +/- 5278.70, N = 3 SE +/- 16232.59, N = 6 843649.2 827692.2 342313.8
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 3 200K 400K 600K 800K 1000K SE +/- 1324.57, N = 3 SE +/- 43294.05, N = 9 SE +/- 5759.46, N = 3 953921.4 793797.4 684847.8
Phoronix Test Suite v10.8.4