3300X NN + More AMD Ryzen 7 1700 Eight-Core testing with a MSI B350 TOMAHAWK (MS-7A34) v1.0 (1.H0 BIOS) and AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 19.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009269-FI-3300XNNMO11&rdt&grt .
3300X NN + More Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 4 AMD Ryzen 7 1700 Eight-Core @ 3.00GHz (8 Cores / 16 Threads) MSI B350 TOMAHAWK (MS-7A34) v1.0 (1.H0 BIOS) AMD 17h 16GB 120GB Samsung SSD 840 AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1266/2000MHz) AMD Ellesmere HDMI Audio VA2431 Realtek RTL8111/8168/8411 Ubuntu 19.10 5.9.0-050900rc2-generic (x86_64) 20200823 GNOME Shell 3.34.1 X Server 1.20.5 amdgpu 19.0.1 4.5 Mesa 19.2.8 (LLVM 9.0.0) 1.1.107 GCC 9.2.1 20191008 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8001137 Graphics Details - GLAMOR Python Details - Python 2.7.17rc1 + Python 3.7.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3300X NN + More dcraw: RAW To PPM Image Conversion espeak: Text-To-Speech Synthesis glmark2: 1280 x 1024 glmark2: 1920 x 1080 gpaw: Carbon Nanotube incompact3d: Cylinder influxdb: 4 - 10000 - 2,5000,1 - 10000 influxdb: 64 - 10000 - 2,5000,1 - 10000 influxdb: 1024 - 10000 - 2,5000,1 - 10000 kripke: lammps: 20k Atoms lammps: Rhodopsin Protein libraw: Post-Processing Benchmark mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 mocassin: Dust 2D tau100.0 mpv: Big Buck Bunny Sunflower 4K - Software Only mpv: Big Buck Bunny Sunflower 1080p - Software Only ncnn: CPU - squeezenet ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny opencv: DNN - Deep Neural Network osbench: Create Files osbench: Create Threads osbench: Launch Programs osbench: Create Processes osbench: Memory Allocations realsr-ncnn: 4x - No realsr-ncnn: 4x - Yes tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 webp: Default webp: Quality 100 webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression 1 2 3 4 53.807 41.484 5886 4443 518.236 532.993001 571253.4 897546.8 963918.4 2274229 4.571 4.377 24.24 12.499 70.600 6.843 13.386 75.264 283 558.92 1496.59 30.75 37.46 10.77 9.34 7.43 9.91 14.12 2.86 31.08 111.92 24.09 29.08 56.22 50.54 18.86 17.86 8.12 9.76 5.68 8.99 19.33 1.78 20.77 66.60 11.55 12.02 28.59 28.52 21458 19.150593 16.569297 57.036877 35.312017 91.620366 58.452 443.805 333.936 304.720 1.976 2.947 23.735 9.456 49.640 53.810 43.635 5896 4438 515.265 532.589945 528243.2 848484.3 899150.3 4.557 4.381 24.29 12.276 69.864 6.873 13.428 73.795 283 559.45 1497.78 30.87 38.03 10.77 9.67 7.38 9.69 14.12 2.74 29.05 111.22 23.57 29.58 52.76 50.46 18.83 17.84 8.13 9.76 5.65 8.96 19.18 1.73 20.76 66.55 11.61 11.99 28.45 28.44 20024 19.078225 16.277631 56.003730 34.882704 91.819684 58.413 443.827 333.412 306.242 1.975 2.969 23.666 9.447 50.313 54.359 38.142 5875 4435 517.886 537.422607 507255.7 764198.1 4.510 4.490 24.19 12.452 69.864 6.940 13.365 73.968 283 558.11 1490.85 30.57 37.54 11.09 9.57 7.37 9.93 13.75 3.29 29.02 110.78 24.98 29.73 56.21 50.44 18.86 17.87 8.10 9.78 5.67 8.96 19.36 1.74 20.76 66.55 11.57 12.04 28.46 28.61 21553 18.783694 16.246637 60.716470 35.650253 91.783682 58.385 443.843 332.492 306.045 1.967 2.943 23.440 9.445 50.630 OpenBenchmarking.org
dcraw RAW To PPM Image Conversion OpenBenchmarking.org Seconds, Fewer Is Better dcraw RAW To PPM Image Conversion 1 2 4 12 24 36 48 60 SE +/- 0.02, N = 3 SE +/- 0.29, N = 3 SE +/- 0.12, N = 3 53.81 53.81 54.36 1. (CC) gcc options: -lm
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis 1 2 4 10 20 30 40 50 SE +/- 0.90, N = 16 SE +/- 1.05, N = 20 SE +/- 0.51, N = 5 41.48 43.64 38.14 1. (CC) gcc options: -O2 -std=c99
GLmark2 Resolution: 1280 x 1024 OpenBenchmarking.org Score, More Is Better GLmark2 2020.04 Resolution: 1280 x 1024 1 2 4 1300 2600 3900 5200 6500 5886 5896 5875
GLmark2 Resolution: 1920 x 1080 OpenBenchmarking.org Score, More Is Better GLmark2 2020.04 Resolution: 1920 x 1080 1 2 4 1000 2000 3000 4000 5000 4443 4438 4435
GPAW Input: Carbon Nanotube OpenBenchmarking.org Seconds, Fewer Is Better GPAW 20.1 Input: Carbon Nanotube 1 2 4 110 220 330 440 550 SE +/- 1.11, N = 3 SE +/- 1.72, N = 3 SE +/- 0.91, N = 3 518.24 515.27 517.89 1. (CC) gcc options: -pthread -shared -lxc -lblas -lmpi
Incompact3D Input: Cylinder OpenBenchmarking.org Seconds, Fewer Is Better Incompact3D 2020-09-17 Input: Cylinder 1 2 4 120 240 360 480 600 SE +/- 2.71, N = 3 SE +/- 3.03, N = 3 SE +/- 4.14, N = 3 532.99 532.59 537.42 1. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 4 120K 240K 360K 480K 600K SE +/- 18598.16, N = 12 SE +/- 21790.68, N = 12 SE +/- 18076.30, N = 12 571253.4 528243.2 507255.7
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 4 200K 400K 600K 800K 1000K SE +/- 7142.37, N = 3 SE +/- 14288.50, N = 12 SE +/- 13119.06, N = 12 897546.8 848484.3 764198.1
InfluxDB Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 200K 400K 600K 800K 1000K SE +/- 4179.09, N = 3 SE +/- 39090.53, N = 13 963918.4 899150.3
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 1 500K 1000K 1500K 2000K 2500K SE +/- 40613.50, N = 2 2274229 1. (CXX) g++ options: -O3 -fopenmp
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 24Aug2020 Model: 20k Atoms 1 2 4 1.0285 2.057 3.0855 4.114 5.1425 SE +/- 0.024, N = 3 SE +/- 0.015, N = 3 SE +/- 0.049, N = 9 4.571 4.557 4.510 1. (CXX) g++ options: -O3 -pthread -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 24Aug2020 Model: Rhodopsin Protein 1 2 4 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.048, N = 3 SE +/- 0.068, N = 15 SE +/- 0.036, N = 3 4.377 4.381 4.490 1. (CXX) g++ options: -O3 -pthread -lm
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark 1 2 4 6 12 18 24 30 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 24.24 24.29 24.19 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: SqueezeNetV1.0 1 2 4 3 6 9 12 15 SE +/- 0.09, N = 3 SE +/- 0.13, N = 3 SE +/- 0.05, N = 3 12.50 12.28 12.45 MIN: 11.62 / MAX: 26.02 MIN: 11.47 / MAX: 30.07 MIN: 11.56 / MAX: 29.12 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: resnet-v2-50 1 2 4 16 32 48 64 80 SE +/- 0.45, N = 3 SE +/- 0.16, N = 3 SE +/- 0.35, N = 3 70.60 69.86 69.86 MIN: 66.38 / MAX: 106.89 MIN: 67.17 / MAX: 89.11 MIN: 65.78 / MAX: 106.91 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: MobileNetV2_224 1 2 4 2 4 6 8 10 SE +/- 0.017, N = 3 SE +/- 0.064, N = 3 SE +/- 0.030, N = 3 6.843 6.873 6.940 MIN: 6.34 / MAX: 22.76 MIN: 6.37 / MAX: 8.93 MIN: 6.37 / MAX: 10.3 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: mobilenet-v1-1.0 1 2 4 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 13.39 13.43 13.37 MIN: 12.62 / MAX: 27.98 MIN: 13.04 / MAX: 29.42 MIN: 10.78 / MAX: 28.87 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: inception-v3 1 2 4 20 40 60 80 100 SE +/- 1.22, N = 3 SE +/- 0.11, N = 3 SE +/- 0.47, N = 3 75.26 73.80 73.97 MIN: 70.86 / MAX: 132.02 MIN: 70.29 / MAX: 128.12 MIN: 70.72 / MAX: 107.56 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Monte Carlo Simulations of Ionised Nebulae Input: Dust 2D tau100.0 OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 1 2 4 60 120 180 240 300 SE +/- 0.88, N = 3 283 283 283 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only 1 2 4 120 240 360 480 600 SE +/- 0.53, N = 3 SE +/- 0.83, N = 3 SE +/- 0.60, N = 3 558.92 559.45 558.11 MIN: 353.51 / MAX: 741.98 MIN: 355.56 / MAX: 751.21 MIN: 354.93 / MAX: 741.6 1. mpv 0.29.1
MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only 1 2 4 300 600 900 1200 1500 SE +/- 4.23, N = 3 SE +/- 3.04, N = 3 SE +/- 0.82, N = 3 1496.59 1497.78 1490.85 MIN: 890.33 / MAX: 2242.83 MIN: 916.25 / MAX: 2144.91 MIN: 901.12 / MAX: 2234.81 1. mpv 0.29.1
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet 1 2 4 7 14 21 28 35 SE +/- 0.13, N = 3 SE +/- 0.26, N = 3 SE +/- 0.30, N = 3 30.75 30.87 30.57 MIN: 27.03 / MAX: 82.21 MIN: 27.17 / MAX: 74.24 MIN: 26.49 / MAX: 75.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet 1 2 4 9 18 27 36 45 SE +/- 0.21, N = 3 SE +/- 0.47, N = 3 SE +/- 0.13, N = 3 37.46 38.03 37.54 MIN: 33.76 / MAX: 70.94 MIN: 34.21 / MAX: 102.74 MIN: 34.06 / MAX: 82.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 4 3 6 9 12 15 SE +/- 0.10, N = 3 SE +/- 0.54, N = 3 SE +/- 0.34, N = 3 10.77 10.77 11.09 MIN: 9.68 / MAX: 26.18 MIN: 9.36 / MAX: 64.52 MIN: 9.74 / MAX: 30.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 4 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.12, N = 3 SE +/- 0.37, N = 3 9.34 9.67 9.57 MIN: 8.58 / MAX: 23.21 MIN: 8.5 / MAX: 32.82 MIN: 8.51 / MAX: 64.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 1 2 4 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 7.43 7.38 7.37 MIN: 6.94 / MAX: 8.1 MIN: 6.85 / MAX: 8.04 MIN: 6.87 / MAX: 17.87 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet 1 2 4 3 6 9 12 15 SE +/- 0.18, N = 3 SE +/- 0.26, N = 3 SE +/- 0.24, N = 3 9.91 9.69 9.93 MIN: 9.06 / MAX: 23.96 MIN: 8.77 / MAX: 11.64 MIN: 8.86 / MAX: 45.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 1 2 4 4 8 12 16 20 SE +/- 0.38, N = 3 SE +/- 0.45, N = 3 SE +/- 0.14, N = 3 14.12 14.12 13.75 MIN: 12.85 / MAX: 66.8 MIN: 12.97 / MAX: 29.34 MIN: 12.89 / MAX: 86.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface 1 2 4 0.7403 1.4806 2.2209 2.9612 3.7015 SE +/- 0.14, N = 3 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 2.86 2.74 3.29 MIN: 2.54 / MAX: 18.63 MIN: 2.51 / MAX: 9.36 MIN: 2.55 / MAX: 92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet 1 2 4 7 14 21 28 35 SE +/- 0.32, N = 3 SE +/- 1.68, N = 3 SE +/- 1.45, N = 3 31.08 29.05 29.02 MIN: 27.19 / MAX: 97.75 MIN: 23.64 / MAX: 72.29 MIN: 23.86 / MAX: 69.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 1 2 4 30 60 90 120 150 SE +/- 0.91, N = 3 SE +/- 0.30, N = 3 SE +/- 0.98, N = 3 111.92 111.22 110.78 MIN: 87.88 / MAX: 165.26 MIN: 86.34 / MAX: 166.75 MIN: 84.59 / MAX: 174.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 1 2 4 6 12 18 24 30 SE +/- 0.60, N = 3 SE +/- 0.91, N = 3 SE +/- 0.66, N = 3 24.09 23.57 24.98 MIN: 19.49 / MAX: 77.09 MIN: 19.44 / MAX: 72.07 MIN: 21.61 / MAX: 75.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet 1 2 4 7 14 21 28 35 SE +/- 0.19, N = 3 SE +/- 0.80, N = 3 SE +/- 0.43, N = 3 29.08 29.58 29.73 MIN: 18.15 / MAX: 57.79 MIN: 18.17 / MAX: 59.76 MIN: 18.19 / MAX: 94.45 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 1 2 4 13 26 39 52 65 SE +/- 0.17, N = 3 SE +/- 1.78, N = 3 SE +/- 0.13, N = 3 56.22 52.76 56.21 MIN: 47.73 / MAX: 101.17 MIN: 47.28 / MAX: 120.42 MIN: 47.44 / MAX: 100.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny 1 2 4 11 22 33 44 55 SE +/- 0.15, N = 3 SE +/- 0.13, N = 3 SE +/- 0.16, N = 3 50.54 50.46 50.44 MIN: 42.79 / MAX: 71.94 MIN: 42.79 / MAX: 79.82 MIN: 43.22 / MAX: 70.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet 1 2 4 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 18.86 18.83 18.86 MIN: 17.74 / MAX: 23.12 MIN: 17.59 / MAX: 21.01 MIN: 17.79 / MAX: 22.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet 1 2 4 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 SE +/- 0.17, N = 3 17.86 17.84 17.87 MIN: 14.9 / MAX: 24.7 MIN: 14.88 / MAX: 23.2 MIN: 14.98 / MAX: 22.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 4 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.12 8.13 8.10 MIN: 7.45 / MAX: 11.88 MIN: 7.57 / MAX: 11.54 MIN: 7.51 / MAX: 10.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 4 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 9.76 9.76 9.78 MIN: 8.77 / MAX: 14 MIN: 8.8 / MAX: 13.68 MIN: 8.84 / MAX: 14.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 1 2 4 1.278 2.556 3.834 5.112 6.39 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 5.68 5.65 5.67 MIN: 4.95 / MAX: 7.6 MIN: 4.79 / MAX: 7.91 MIN: 4.91 / MAX: 7.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet 1 2 4 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 8.99 8.96 8.96 MIN: 8.26 / MAX: 12.37 MIN: 8.14 / MAX: 12.22 MIN: 8.26 / MAX: 11.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 1 2 4 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 19.33 19.18 19.36 MIN: 17.83 / MAX: 25.94 MIN: 17.58 / MAX: 25.38 MIN: 17.63 / MAX: 26.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface 1 2 4 0.4005 0.801 1.2015 1.602 2.0025 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1.78 1.73 1.74 MIN: 1.52 / MAX: 4.36 MIN: 1.51 / MAX: 3.65 MIN: 1.54 / MAX: 3.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 1 2 4 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 20.77 20.76 20.76 MIN: 19.45 / MAX: 25.92 MIN: 19.53 / MAX: 25.18 MIN: 19.47 / MAX: 25.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 1 2 4 15 30 45 60 75 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 66.60 66.55 66.55 MIN: 65.77 / MAX: 69.74 MIN: 65.83 / MAX: 69.88 MIN: 65.79 / MAX: 69.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 1 2 4 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 11.55 11.61 11.57 MIN: 10.44 / MAX: 17.06 MIN: 10.49 / MAX: 17.22 MIN: 10.49 / MAX: 16.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet 1 2 4 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 12.02 11.99 12.04 MIN: 11.21 / MAX: 15.88 MIN: 11.31 / MAX: 15.43 MIN: 11.3 / MAX: 16.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 1 2 4 7 14 21 28 35 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 28.59 28.45 28.46 MIN: 26.36 / MAX: 37.83 MIN: 26.33 / MAX: 35.78 MIN: 26.08 / MAX: 35.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 1 2 4 7 14 21 28 35 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 28.52 28.44 28.61 MIN: 24.35 / MAX: 33.26 MIN: 24.22 / MAX: 32.89 MIN: 24.14 / MAX: 32.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenCV Test: DNN - Deep Neural Network OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.4 Test: DNN - Deep Neural Network 1 2 4 5K 10K 15K 20K 25K SE +/- 431.59, N = 15 SE +/- 400.07, N = 15 SE +/- 324.81, N = 3 21458 20024 21553 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OSBench Test: Create Files OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Files 1 2 4 5 10 15 20 25 SE +/- 0.22, N = 3 SE +/- 0.15, N = 3 SE +/- 0.06, N = 3 19.15 19.08 18.78 1. (CC) gcc options: -lm
OSBench Test: Create Threads OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Threads 1 2 4 4 8 12 16 20 SE +/- 0.13, N = 3 SE +/- 0.16, N = 3 SE +/- 0.17, N = 3 16.57 16.28 16.25 1. (CC) gcc options: -lm
OSBench Test: Launch Programs OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Launch Programs 1 2 4 14 28 42 56 70 SE +/- 0.36, N = 3 SE +/- 0.70, N = 3 SE +/- 0.74, N = 3 57.04 56.00 60.72 1. (CC) gcc options: -lm
OSBench Test: Create Processes OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Processes 1 2 4 8 16 24 32 40 SE +/- 0.47, N = 15 SE +/- 0.39, N = 15 SE +/- 0.54, N = 15 35.31 34.88 35.65 1. (CC) gcc options: -lm
OSBench Test: Memory Allocations OpenBenchmarking.org Ns Per Event, Fewer Is Better OSBench Test: Memory Allocations 1 2 4 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.21, N = 3 SE +/- 0.38, N = 3 91.62 91.82 91.78 1. (CC) gcc options: -lm
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No 1 2 4 13 26 39 52 65 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 58.45 58.41 58.39
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes 1 2 4 100 200 300 400 500 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 443.81 443.83 443.84
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 4 70 140 210 280 350 SE +/- 0.97, N = 3 SE +/- 1.19, N = 3 SE +/- 0.92, N = 3 333.94 333.41 332.49 MIN: 322.28 / MAX: 361.07 MIN: 320.39 / MAX: 360.51 MIN: 320.63 / MAX: 353.14 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 4 70 140 210 280 350 SE +/- 0.98, N = 3 SE +/- 1.73, N = 3 SE +/- 0.40, N = 3 304.72 306.24 306.05 MIN: 294.26 / MAX: 317.02 MIN: 294.08 / MAX: 321.12 MIN: 293.97 / MAX: 314.93 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default 1 2 4 0.4446 0.8892 1.3338 1.7784 2.223 SE +/- 0.006, N = 3 SE +/- 0.014, N = 3 SE +/- 0.010, N = 3 1.976 1.975 1.967 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 1 2 4 0.668 1.336 2.004 2.672 3.34 SE +/- 0.016, N = 3 SE +/- 0.004, N = 3 SE +/- 0.012, N = 3 2.947 2.969 2.943 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless 1 2 4 6 12 18 24 30 SE +/- 0.19, N = 3 SE +/- 0.18, N = 3 SE +/- 0.05, N = 3 23.74 23.67 23.44 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression 1 2 4 3 6 9 12 15 SE +/- 0.030, N = 3 SE +/- 0.026, N = 3 SE +/- 0.016, N = 3 9.456 9.447 9.445 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression 1 2 4 11 22 33 44 55 SE +/- 0.30, N = 3 SE +/- 0.17, N = 3 SE +/- 0.35, N = 3 49.64 50.31 50.63 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
Phoronix Test Suite v10.8.5