Intel Core i7-8565U testing with a Dell 0KTW76 (1.0.0 BIOS) and Intel UHD 620 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2009295-PTS-8565UMON93
HTML result view exported from: https://openbenchmarking.org/result/2009295-PTS-8565UMON93&grs&sor&export=pdf .
8565U Monday

Result identifiers: 1, 2, 3

Processor: Intel Core i7-8565U @ 4.60GHz (4 Cores / 8 Threads)
Motherboard: Dell 0KTW76 (1.0.0 BIOS)
Chipset: Intel Cannon Point-LP
Memory: 16GB
Disk: PC401 NVMe SK hynix 256GB
Graphics: Intel UHD 620 3GB (1150MHz)
Audio: Realtek ALC3271
Network: Qualcomm Atheros QCA6174 802.11ac
OS: Ubuntu 20.04
Kernel: 5.4.0-48-generic (x86_64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
OpenGL: 4.6 Mesa 20.0.4
OpenCL: OpenCL 2.1
Vulkan: 1.2.131
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6
Python Details - Python 3.8.2
Security Details -
  itlb_multihit: KVM: Mitigation of Split huge pages
  l1tf: Not affected
  mds: Mitigation of Clear buffers; SMT vulnerable
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
  srbds: Mitigation of Microcode
  tsx_async_abort: Not affected

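The Security Details entries above share their names with the per-vulnerability status files that Linux kernels expose under /sys/devices/system/cpu/vulnerabilities. The following is a minimal sketch for collecting the same kind of information on another machine, assuming that sysfs directory is present. The `read_mitigations` helper is hypothetical and is not part of the Phoronix Test Suite, and the raw kernel strings may be worded slightly differently from the report.

```python
from pathlib import Path

# Assumed location of the kernel's mitigation status files on Linux.
VULN_DIR = Path("/sys/devices/system/cpu/vulnerabilities")


def read_mitigations(base=VULN_DIR):
    """Return {name: status} for every readable file under `base`.

    Returns an empty dict when the directory does not exist
    (for example on non-Linux systems or very old kernels).
    """
    base = Path(base)
    if not base.is_dir():
        return {}
    result = {}
    for entry in sorted(base.iterdir()):
        try:
            result[entry.name] = entry.read_text().strip()
        except OSError:
            # Skip entries that cannot be read as text files.
            continue
    return result


if __name__ == "__main__":
    for name, status in read_mitigations().items():
        print(f"{name}: {status}")
```

Passing a different `base` directory makes the helper easy to exercise against a saved copy of the files from another system.
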
8565U Monday: results overview

Test | 1 | 2 | 3
lczero: Eigen | 393 | 359 | 358
lczero: BLAS | 402 | 375 | 376
couchdb: 100 - 1000 - 24 | 219.597 | 232.583 | 235.273
mafft: Multiple Sequence Alignment - LSU RNA | 14.227 | 15.013 | 14.902
hmmer: Pfam Database Search | 115.828 | 119.817 | 121.430
lczero: Rand | 162558 | 155574 | 155116
dolfyn: Computational Fluid Dynamics | 19.864 | 20.439 | 20.760
mlpack: scikit_linearridgeregression | 4.91 | 5.12 | 5.11
mlpack: scikit_ica | 105.60 | 102.04 | 102.85
ncnn: CPU - alexnet | 32.37 | 32.56 | 31.70
mlpack: scikit_qda | 128.63 | 130.89 | 129.82
ncnn: CPU - blazeface | 3.01 | 3.02 | 2.97
ncnn: Vulkan GPU - efficientnet-b0 | 79.43 | 80.51 | 80.43
ncnn: Vulkan GPU - shufflenet-v2 | 27.27 | 27.22 | 26.95
ncnn: Vulkan GPU - alexnet | 111.97 | 110.69 | 111.37
gromacs: Water Benchmark | 0.350 | 0.349 | 0.346
tnn: CPU - MobileNet v2 | 354.311 | 354.240 | 358.291
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 | 39.98 | 39.69 | 39.53
realsr-ncnn: 4x - No | 16.827 | 16.861 | 17.010
ncnn: CPU - efficientnet-b0 | 14.42 | 14.48 | 14.34
ncnn: CPU - shufflenet-v2 | 6.44 | 6.45 | 6.50
mlpack: scikit_svm | 27.01 | 26.85 | 26.77
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 | 44.16 | 43.77 | 44.12
ncnn: CPU - resnet18 | 29.91 | 30.13 | 29.92
ncnn: Vulkan GPU - vgg16 | 264.91 | 266.39 | 264.49
caffe: GoogleNet - CPU - 200 | 516263 | 517125 | 519953
caffe: AlexNet - CPU - 100 | 104649 | 104160 | 104862
caffe: GoogleNet - CPU - 100 | 258236 | 259823 | 258993
caffe: AlexNet - CPU - 200 | 209370 | 210394 | 210101
ncnn: Vulkan GPU - squeezenet | 113.17 | 113.23 | 112.73
ncnn: CPU - vgg16 | 112.25 | 112.74 | 112.26
ncnn: Vulkan GPU - yolov4-tiny | 152.72 | 152.19 | 152.35
ncnn: CPU-v3-v3 - mobilenet-v3 | 8.75 | 8.74 | 8.72
ncnn: CPU - mnasnet | 9.30 | 9.30 | 9.27
hint: FLOAT | 413697236.17006 | 413895480.89544 | 415027165.81048
ncnn: Vulkan GPU - blazeface | 7.24 | 7.26 | 7.26
byte: Dhrystone 2 | 41388503.2 | 41285586.6 | 41275405.5
ncnn: CPU - squeezenet | 33.77 | 33.68 | 33.70
ncnn: CPU - resnet50 | 58.20 | 58.07 | 58.22
ncnn: Vulkan GPU - mnasnet | 41.05 | 41.11 | 41.01
ncnn: Vulkan GPU - googlenet | 100.19 | 99.97 | 99.97
ncnn: CPU - googlenet | 31.98 | 32.05 | 32.04
ncnn: CPU - mobilenet | 39.88 | 39.88 | 39.96
ncnn: Vulkan GPU - resnet50 | 145.94 | 146.21 | 145.98
ncnn: Vulkan GPU - mobilenet | 109.79 | 109.95 | 109.75
ncnn: Vulkan GPU - resnet18 | 93.52 | 93.48 | 93.58
ncnn: CPU - yolov4-tiny | 52.38 | 52.34 | 52.39
tnn: CPU - SqueezeNet v1.1 | 321.297 | 321.592 | 321.335
ncnn: CPU-v2-v2 - mobilenet-v2 | 9.66 | 9.67 | 9.65
ffte: N=256, 3D Complex FFT Routine | 14955.245278133 | 13669.482756807 | 13988.362401492

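To read the overview figures above as relative changes, each later run can be expressed as a percentage of run 1, with the sign flipped for tests where fewer is better. This is a rough sketch using a handful of values copied from the overview. The `RESULTS` mapping and the `percent_vs_baseline` helper are illustrative names, not Phoronix Test Suite output, and the better/worse direction for each test is taken from the per-test sections of this report.

```python
# (run 1, run 2, run 3) values copied from the overview, plus a flag saying
# whether a higher number is better for that test.
RESULTS = {
    "lczero: Eigen": ((393, 359, 358), True),                  # nodes per second
    "lczero: BLAS": ((402, 375, 376), True),                   # nodes per second
    "couchdb: 100 - 1000 - 24": ((219.597, 232.583, 235.273), False),  # seconds
    "hmmer: Pfam Database Search": ((115.828, 119.817, 121.430), False),  # seconds
}


def percent_vs_baseline(values, more_is_better):
    """Percent change of each later run against the first run.

    A positive number always means "better than run 1", regardless of
    whether the underlying metric is higher-is-better or lower-is-better.
    """
    base = values[0]
    changes = []
    for value in values[1:]:
        change = (value - base) / base * 100.0
        changes.append(change if more_is_better else -change)
    return changes


if __name__ == "__main__":
    for test, (values, more_is_better) in RESULTS.items():
        run2, run3 = percent_vs_baseline(values, more_is_better)
        print(f"{test}: run 2 {run2:+.2f}%, run 3 {run3:+.2f}% vs run 1")
```

Normalising the sign this way keeps throughput and timing tests comparable in a single column.
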
LeelaChessZero 0.26 - Backend: Eigen
Nodes Per Second, More Is Better
  1: 393 (SE +/- 1.15, N = 3)
  2: 359 (SE +/- 4.98, N = 3)
  3: 358 (SE +/- 2.52, N = 3)
1. (CXX) g++ options: -flto -pthread

LeelaChessZero 0.26 - Backend: BLAS
Nodes Per Second, More Is Better
  1: 402 (SE +/- 4.40, N = 7)
  3: 376 (SE +/- 2.19, N = 3)
  2: 375 (SE +/- 4.55, N = 6)
1. (CXX) g++ options: -flto -pthread

Apache CouchDB 3.1.1 - Bulk Size: 100 - Inserts: 1000 - Rounds: 24
Seconds, Fewer Is Better
  1: 219.60 (SE +/- 1.79, N = 3)
  2: 232.58 (SE +/- 1.86, N = 3)
  3: 235.27 (SE +/- 1.54, N = 3)
1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD

Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA
Seconds, Fewer Is Better
  1: 14.23 (SE +/- 0.20, N = 4)
  3: 14.90 (SE +/- 0.18, N = 5)
  2: 15.01 (SE +/- 0.17, N = 6)
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Timed HMMer Search 3.3.1 - Pfam Database Search
Seconds, Fewer Is Better
  1: 115.83 (SE +/- 0.20, N = 3)
  2: 119.82 (SE +/- 0.33, N = 3)
  3: 121.43 (SE +/- 0.89, N = 3)
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

LeelaChessZero 0.26 - Backend: Random
Nodes Per Second, More Is Better
  1: 162558 (SE +/- 191.57, N = 3)
  2: 155574 (SE +/- 220.43, N = 3)
  3: 155116 (SE +/- 348.18, N = 3)
1. (CXX) g++ options: -flto -pthread

Dolfyn 0.527 - Computational Fluid Dynamics
Seconds, Fewer Is Better
  1: 19.86 (SE +/- 0.05, N = 3)
  2: 20.44 (SE +/- 0.29, N = 3)
  3: 20.76 (SE +/- 0.13, N = 3)

Mlpack Benchmark - Benchmark: scikit_linearridgeregression
Seconds, Fewer Is Better
  1: 4.91 (SE +/- 0.08, N = 3)
  3: 5.11 (SE +/- 0.07, N = 15)
  2: 5.12 (SE +/- 0.05, N = 15)

Mlpack Benchmark - Benchmark: scikit_ica
Seconds, Fewer Is Better
  2: 102.04 (SE +/- 0.52, N = 3)
  3: 102.85 (SE +/- 1.53, N = 3)
  1: 105.60 (SE +/- 0.48, N = 3)

NCNN 20200916 - Target: CPU - Model: alexnet
ms, Fewer Is Better
  3: 31.70 (SE +/- 0.15, N = 3; MIN: 24.92 / MAX: 66.78)
  1: 32.37 (SE +/- 0.40, N = 3; MIN: 29.45 / MAX: 40.61)
  2: 32.56 (SE +/- 0.63, N = 3; MIN: 28.88 / MAX: 49.52)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Mlpack Benchmark - Benchmark: scikit_qda
Seconds, Fewer Is Better
  1: 128.63 (SE +/- 0.32, N = 3)
  3: 129.82 (SE +/- 0.33, N = 3)
  2: 130.89 (SE +/- 1.23, N = 12)

NCNN 20200916 - Target: CPU - Model: blazeface
ms, Fewer Is Better
  3: 2.97 (SE +/- 0.01, N = 3; MIN: 2.77 / MAX: 6.62)
  1: 3.01 (SE +/- 0.02, N = 3; MIN: 2.77 / MAX: 6.88)
  2: 3.02 (SE +/- 0.03, N = 3; MIN: 2.79 / MAX: 7.55)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: efficientnet-b0
ms, Fewer Is Better
  1: 79.43 (SE +/- 1.16, N = 3; MIN: 25.06 / MAX: 82.26)
  3: 80.43 (SE +/- 0.15, N = 3; MIN: 32.99 / MAX: 82.09)
  2: 80.51 (SE +/- 0.04, N = 3; MIN: 63.31 / MAX: 82.32)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: shufflenet-v2
ms, Fewer Is Better
  3: 26.95 (SE +/- 0.30, N = 3; MIN: 8.15 / MAX: 27.92)
  2: 27.22 (SE +/- 0.01, N = 3; MIN: 25.25 / MAX: 27.65)
  1: 27.27 (SE +/- 0.02, N = 3; MIN: 26.82 / MAX: 27.67)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: alexnet
ms, Fewer Is Better
  2: 110.69 (SE +/- 0.68, N = 3; MIN: 46.21 / MAX: 124.73)
  3: 111.37 (SE +/- 0.15, N = 3; MIN: 83.8 / MAX: 119.26)
  1: 111.97 (SE +/- 0.31, N = 3; MIN: 49.79 / MAX: 119.56)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

GROMACS 2020.3 - Water Benchmark
Ns Per Day, More Is Better
  1: 0.350 (SE +/- 0.001, N = 3)
  2: 0.349 (SE +/- 0.001, N = 3)
  3: 0.346 (SE +/- 0.002, N = 3)
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

TNN 0.2.3 - Target: CPU - Model: MobileNet v2
ms, Fewer Is Better
  2: 354.24 (SE +/- 3.97, N = 7; MIN: 336.42 / MAX: 379.51)
  1: 354.31 (SE +/- 4.28, N = 6; MIN: 336.38 / MAX: 421.59)
  3: 358.29 (SE +/- 3.08, N = 11; MIN: 335.91 / MAX: 398.13)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

NCNN 20200916 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better
  3: 39.53 (SE +/- 0.18, N = 3; MIN: 37.9 / MAX: 41.14)
  2: 39.69 (SE +/- 0.07, N = 3; MIN: 37.31 / MAX: 41.35)
  1: 39.98 (SE +/- 0.40, N = 3; MIN: 37.94 / MAX: 41.57)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

RealSR-NCNN 20200818 - Scale: 4x - TAA: No
Seconds, Fewer Is Better
  1: 16.83 (SE +/- 0.16, N = 3)
  2: 16.86 (SE +/- 0.12, N = 3)
  3: 17.01 (SE +/- 0.10, N = 3)

NCNN 20200916 - Target: CPU - Model: efficientnet-b0
ms, Fewer Is Better
  3: 14.34 (SE +/- 0.03, N = 3; MIN: 14.12 / MAX: 32.22)
  1: 14.42 (SE +/- 0.07, N = 3; MIN: 13.92 / MAX: 19.14)
  2: 14.48 (SE +/- 0.06, N = 3; MIN: 13.9 / MAX: 32.08)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: shufflenet-v2
ms, Fewer Is Better
  1: 6.44 (SE +/- 0.01, N = 3; MIN: 6.35 / MAX: 11.01)
  2: 6.45 (SE +/- 0.01, N = 3; MIN: 6.35 / MAX: 11.05)
  3: 6.50 (SE +/- 0.02, N = 3; MIN: 6.35 / MAX: 20.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Mlpack Benchmark - Benchmark: scikit_svm
Seconds, Fewer Is Better
  3: 26.77 (SE +/- 0.03, N = 3)
  2: 26.85 (SE +/- 0.05, N = 3)
  1: 27.01 (SE +/- 0.08, N = 3)

NCNN 20200916 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better
  2: 43.77 (SE +/- 0.35, N = 3; MIN: 13.11 / MAX: 44.95)
  3: 44.12 (SE +/- 0.01, N = 3; MIN: 37.44 / MAX: 46.32)
  1: 44.16 (SE +/- 0.06, N = 3; MIN: 41.14 / MAX: 45.1)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: resnet18
ms, Fewer Is Better
  1: 29.91 (SE +/- 0.01, N = 3; MIN: 27.51 / MAX: 48.14)
  3: 29.92 (SE +/- 0.04, N = 3; MIN: 27.37 / MAX: 47.11)
  2: 30.13 (SE +/- 0.29, N = 3; MIN: 27.34 / MAX: 47.3)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: vgg16
ms, Fewer Is Better
  3: 264.49 (SE +/- 1.79, N = 3; MIN: 245.81 / MAX: 277.42)
  1: 264.91 (SE +/- 0.53, N = 3; MIN: 200.25 / MAX: 278.56)
  2: 266.39 (SE +/- 3.51, N = 3; MIN: 226.52 / MAX: 278)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Caffe 2020-02-13 - Model: GoogleNet - Acceleration: CPU - Iterations: 200
Milli-Seconds, Fewer Is Better
  1: 516263 (SE +/- 1254.00, N = 3)
  2: 517125 (SE +/- 122.56, N = 3)
  3: 519953 (SE +/- 1934.13, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe 2020-02-13 - Model: AlexNet - Acceleration: CPU - Iterations: 100
Milli-Seconds, Fewer Is Better
  2: 104160 (SE +/- 418.70, N = 3)
  1: 104649 (SE +/- 911.99, N = 3)
  3: 104862 (SE +/- 504.63, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe 2020-02-13 - Model: GoogleNet - Acceleration: CPU - Iterations: 100
Milli-Seconds, Fewer Is Better
  1: 258236 (SE +/- 723.58, N = 3)
  3: 258993 (SE +/- 401.70, N = 3)
  2: 259823 (SE +/- 1861.78, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe 2020-02-13 - Model: AlexNet - Acceleration: CPU - Iterations: 200
Milli-Seconds, Fewer Is Better
  1: 209370 (SE +/- 168.23, N = 3)
  3: 210101 (SE +/- 289.07, N = 3)
  2: 210394 (SE +/- 494.50, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

NCNN 20200916 - Target: Vulkan GPU - Model: squeezenet
ms, Fewer Is Better
  3: 112.73 (SE +/- 0.42, N = 3; MIN: 47.2 / MAX: 122.09)
  1: 113.17 (SE +/- 0.00, N = 3; MIN: 77.63 / MAX: 142.17)
  2: 113.23 (SE +/- 0.07, N = 3; MIN: 78.34 / MAX: 131.33)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: vgg16
ms, Fewer Is Better
  1: 112.25 (SE +/- 0.13, N = 3; MIN: 110.69 / MAX: 136.47)
  3: 112.26 (SE +/- 0.14, N = 3; MIN: 110.23 / MAX: 135.09)
  2: 112.74 (SE +/- 0.07, N = 3; MIN: 110.79 / MAX: 135.36)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: yolov4-tiny
ms, Fewer Is Better
  2: 152.19 (SE +/- 0.06, N = 3; MIN: 105.13 / MAX: 163.85)
  3: 152.35 (SE +/- 0.03, N = 3; MIN: 113.42 / MAX: 185.04)
  1: 152.72 (SE +/- 0.06, N = 3; MIN: 143.57 / MAX: 188.32)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better
  3: 8.72 (SE +/- 0.10, N = 3; MIN: 7.05 / MAX: 13.31)
  2: 8.74 (SE +/- 0.06, N = 3; MIN: 7.07 / MAX: 25.14)
  1: 8.75 (SE +/- 0.01, N = 3; MIN: 7.78 / MAX: 24.03)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: mnasnet
ms, Fewer Is Better
  3: 9.27 (SE +/- 0.01, N = 3; MIN: 8.85 / MAX: 13.94)
  1: 9.30 (SE +/- 0.02, N = 3; MIN: 8.92 / MAX: 13.87)
  2: 9.30 (SE +/- 0.03, N = 3; MIN: 8.84 / MAX: 14.35)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Hierarchical INTegration 1.0 - Test: FLOAT
QUIPs, More Is Better
  3: 415027165.81 (SE +/- 481014.66, N = 3)
  2: 413895480.90 (SE +/- 571379.17, N = 3)
  1: 413697236.17 (SE +/- 535857.19, N = 3)
1. (CC) gcc options: -O3 -march=native -lm

NCNN 20200916 - Target: Vulkan GPU - Model: blazeface
ms, Fewer Is Better
  1: 7.24 (SE +/- 0.01, N = 3; MIN: 6.92 / MAX: 7.74)
  2: 7.26 (SE +/- 0.01, N = 3; MIN: 6.96 / MAX: 7.68)
  3: 7.26 (SE +/- 0.02, N = 3; MIN: 6.93 / MAX: 7.54)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

BYTE Unix Benchmark 3.6 - Computational Test: Dhrystone 2
LPS, More Is Better
  1: 41388503.2 (SE +/- 144856.28, N = 3)
  2: 41285586.6 (SE +/- 120193.17, N = 3)
  3: 41275405.5 (SE +/- 110567.66, N = 3)

NCNN 20200916 - Target: CPU - Model: squeezenet
ms, Fewer Is Better
  2: 33.68 (SE +/- 0.08, N = 3; MIN: 32.44 / MAX: 38.42)
  3: 33.70 (SE +/- 0.20, N = 3; MIN: 32.91 / MAX: 50.35)
  1: 33.77 (SE +/- 0.09, N = 3; MIN: 32.55 / MAX: 49.1)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: resnet50
ms, Fewer Is Better
  2: 58.07 (SE +/- 0.02, N = 3; MIN: 50.39 / MAX: 80.94)
  1: 58.20 (SE +/- 0.08, N = 3; MIN: 50.41 / MAX: 73.95)
  3: 58.22 (SE +/- 0.02, N = 3; MIN: 49.31 / MAX: 84.52)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: mnasnet
ms, Fewer Is Better
  3: 41.01 (SE +/- 0.06, N = 3; MIN: 37.91 / MAX: 42.59)
  1: 41.05 (SE +/- 0.09, N = 3; MIN: 39.15 / MAX: 42.78)
  2: 41.11 (SE +/- 0.12, N = 3; MIN: 39.17 / MAX: 42.58)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: googlenet
ms, Fewer Is Better
  2: 99.97 (SE +/- 0.06, N = 3; MIN: 52.87 / MAX: 103.44)
  3: 99.97 (SE +/- 0.18, N = 3; MIN: 50.54 / MAX: 104.58)
  1: 100.19 (SE +/- 0.01, N = 3; MIN: 91.26 / MAX: 104.58)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: googlenet
ms, Fewer Is Better
  1: 31.98 (SE +/- 0.04, N = 3; MIN: 30.35 / MAX: 37.74)
  3: 32.04 (SE +/- 0.04, N = 3; MIN: 30.31 / MAX: 52.75)
  2: 32.05 (SE +/- 0.05, N = 3; MIN: 30.26 / MAX: 49.18)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: mobilenet
ms, Fewer Is Better
  1: 39.88 (SE +/- 0.07, N = 3; MIN: 38.48 / MAX: 56.21)
  2: 39.88 (SE +/- 0.09, N = 3; MIN: 38.54 / MAX: 60.99)
  3: 39.96 (SE +/- 0.22, N = 3; MIN: 38.62 / MAX: 57.58)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: resnet50
ms, Fewer Is Better
  1: 145.94 (SE +/- 0.19, N = 3; MIN: 71.98 / MAX: 150.92)
  3: 145.98 (SE +/- 0.09, N = 3; MIN: 102.95 / MAX: 150.92)
  2: 146.21 (SE +/- 0.01, N = 3; MIN: 134.59 / MAX: 150.77)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: mobilenet
ms, Fewer Is Better
  3: 109.75 (SE +/- 0.14, N = 3; MIN: 80.74 / MAX: 114.88)
  1: 109.79 (SE +/- 0.03, N = 3; MIN: 82.84 / MAX: 143.69)
  2: 109.95 (SE +/- 0.13, N = 3; MIN: 81.69 / MAX: 140.69)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: resnet18
ms, Fewer Is Better
  2: 93.48 (SE +/- 0.07, N = 3; MIN: 63.46 / MAX: 96.79)
  1: 93.52 (SE +/- 0.04, N = 3; MIN: 63.85 / MAX: 96.7)
  3: 93.58 (SE +/- 0.05, N = 3; MIN: 68.4 / MAX: 96.4)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: yolov4-tiny
ms, Fewer Is Better
  2: 52.34 (SE +/- 0.11, N = 3; MIN: 50.48 / MAX: 69.03)
  1: 52.38 (SE +/- 0.06, N = 3; MIN: 50.66 / MAX: 70.3)
  3: 52.39 (SE +/- 0.10, N = 3; MIN: 50.54 / MAX: 70.46)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1
ms, Fewer Is Better
  1: 321.30 (SE +/- 0.07, N = 3; MIN: 320.64 / MAX: 322.9)
  3: 321.34 (SE +/- 0.12, N = 3; MIN: 320.34 / MAX: 324.07)
  2: 321.59 (SE +/- 0.12, N = 3; MIN: 320.33 / MAX: 326.07)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

NCNN 20200916 - Target: CPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better
  3: 9.65 (SE +/- 0.96, N = 3; MIN: 7.53 / MAX: 29.9)
  1: 9.66 (SE +/- 0.92, N = 3; MIN: 7.51 / MAX: 15.29)
  2: 9.67 (SE +/- 0.98, N = 3; MIN: 7.44 / MAX: 15.12)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

FFTE 7.0 - N=256, 3D Complex FFT Routine
MFLOPS, More Is Better
  1: 14955.25 (SE +/- 374.98, N = 15)
  3: 13988.36 (SE +/- 389.74, N = 15)
  2: 13669.48 (SE +/- 416.61, N = 15)
1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

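The SE figures reported with each result give a rough way to judge whether two runs really differ. The sketch below divides the gap between two means by their combined standard error, assuming the runs are independent and the samples are roughly normally distributed. This is a common rule of thumb, not something the Phoronix Test Suite computes for this report, and the `separation` helper is a hypothetical name. The inputs are the run 1 and run 2 figures for FFTE and for TNN SqueezeNet v1.1 from the results above.

```python
import math


def separation(mean_a, se_a, mean_b, se_b):
    """Gap between two means, in units of their combined standard error.

    Values well above roughly 2 suggest the runs differ by more than
    run-to-run noise; values near or below 1 suggest they do not.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# FFTE, N=256 (MFLOPS): run 1 vs run 2.
ffte = separation(14955.25, 374.98, 13669.48, 416.61)

# TNN, CPU, SqueezeNet v1.1 (ms): run 1 vs run 2.
tnn = separation(321.30, 0.07, 321.59, 0.12)

if __name__ == "__main__":
    print(f"FFTE run 1 vs run 2: {ffte:.2f} combined SEs apart")
    print(f"TNN SqueezeNet run 1 vs run 2: {tnn:.2f} combined SEs apart")
```

With only 3 to 15 samples per run, the ratio is best treated as a coarse indicator rather than a formal significance test.
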
Phoronix Test Suite v10.8.4