3900X New Tests AMD Ryzen 9 3900X 12-Core testing with a ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) and MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009250-PTS-3900XNEW05&sro&grt .
3900X New Tests Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads) ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) AMD Starship/Matisse 16GB Samsung SSD 970 EVO Plus 250GB MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1366/2000MHz) AMD Ellesmere HDMI Audio G237HL Realtek RTL8111/8168/8411 + Intel-AC 9260 Ubuntu 20.04 5.9.0-050900rc6daily20200922-generic (x86_64) 20200921 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.2.0-devel (git-64cdc13 2020-07-02 focal-oibaf-ppa) (LLVM 10.0.0) 1.2.131 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701021 Graphics Details - GLAMOR Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3900X New Tests dcraw: RAW To PPM Image Conversion espeak: Text-To-Speech Synthesis influxdb: 4 - 10000 - 2,5000,1 - 10000 influxdb: 64 - 10000 - 2,5000,1 - 10000 influxdb: 1024 - 10000 - 2,5000,1 - 10000 libraw: Post-Processing Benchmark mpv: Big Buck Bunny Sunflower 4K - Software Only mpv: Big Buck Bunny Sunflower 1080p - Software Only ncnn: CPU - squeezenet ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny opencv: Features 2D opencv: Object Detection opencv: DNN - Deep Neural Network osbench: Create Files osbench: Create Threads osbench: Launch Programs osbench: Create Processes osbench: Memory Allocations realsr-ncnn: 4x - No realsr-ncnn: 4x - Yes tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 webp: Default webp: Quality 100 webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression 1 2 3 39.505 27.298 1310447.9 1444861.4 1472018.5 42.42 698.08 2345.26 16.32 16.68 5.45 4.78 4.90 4.85 6.65 1.95 17.16 67.17 16.40 16.24 81.33 28.70 5.71 6.11 2.90 4.07 2.49 3.06 11.03 0.93 6.32 15.15 2.73 5.58 7.71 8.61 143206 66325 4384 12.166849 14.062723 41.739146 33.159256 65.431913 15.580 111.642 253.966 233.480 1.455 2.240 15.390 6.944 32.017 39.085 27.316 1313053.3 1447435.3 1468089.0 42.32 700.67 2334.36 15.93 16.43 5.41 4.79 4.90 4.83 6.63 1.96 17.13 66.98 16.31 16.31 27.24 28.34 5.65 6.05 2.93 4.08 2.48 3.04 10.90 0.93 6.33 15.11 2.76 5.58 7.74 8.57 135808 65007 4354 12.420693 12.553533 40.913423 32.787323 68.543672 15.605 111.925 250.077 233.491 1.433 2.175 15.543 6.938 32.533 38.500 27.513 1313875.7 1445347.6 1471736.4 42.74 697.91 2327.51 16.10 16.54 5.43 4.78 4.88 4.84 6.63 1.96 17.25 67.37 16.44 16.21 27.69 28.55 5.61 6.06 2.88 4.09 2.49 3.06 11.06 0.94 6.32 15.06 2.73 5.58 7.85 8.58 142960 66757 4443 12.050966 12.506962 40.993690 32.373269 68.105936 15.616 111.696 252.154 230.640 1.445 2.223 15.204 6.780 33.093 OpenBenchmarking.org
dcraw RAW To PPM Image Conversion OpenBenchmarking.org Seconds, Fewer Is Better dcraw RAW To PPM Image Conversion 1 2 3 9 18 27 36 45 SE +/- 0.68, N = 3 SE +/- 0.54, N = 3 SE +/- 0.03, N = 3 39.51 39.09 38.50 1. (CC) gcc options: -lm
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis 1 2 3 6 12 18 24 30 SE +/- 0.34, N = 5 SE +/- 0.17, N = 4 SE +/- 0.24, N = 4 27.30 27.32 27.51 1. (CC) gcc options: -O2 -std=c99
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 3 300K 600K 900K 1200K 1500K SE +/- 1715.06, N = 3 SE +/- 1422.20, N = 3 SE +/- 2904.05, N = 3 1310447.9 1313053.3 1313875.7
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 3 300K 600K 900K 1200K 1500K SE +/- 3035.14, N = 3 SE +/- 1343.37, N = 3 SE +/- 2565.01, N = 3 1444861.4 1447435.3 1445347.6
InfluxDB Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 1 2 3 300K 600K 900K 1200K 1500K SE +/- 415.31, N = 3 SE +/- 1895.88, N = 3 SE +/- 2749.38, N = 3 1472018.5 1468089.0 1471736.4
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark 1 2 3 10 20 30 40 50 SE +/- 0.06, N = 3 SE +/- 0.15, N = 3 SE +/- 0.06, N = 3 42.42 42.32 42.74 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only 1 2 3 150 300 450 600 750 SE +/- 2.78, N = 3 SE +/- 0.52, N = 3 SE +/- 0.55, N = 3 698.08 700.67 697.91 MIN: 444.45 / MAX: 857.17 MIN: 444.45 / MAX: 857.17 MIN: 461.55 / MAX: 857.17 1. mpv 0.32.0
MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only 1 2 3 500 1000 1500 2000 2500 SE +/- 6.47, N = 3 SE +/- 11.23, N = 3 SE +/- 10.07, N = 3 2345.26 2334.36 2327.51 MIN: 1333.31 / MAX: 4000 MIN: 1333.35 / MAX: 4000.24 MIN: 1333.32 / MAX: 4000.16 1. mpv 0.32.0
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet 1 2 3 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 SE +/- 0.13, N = 3 16.32 15.93 16.10 MIN: 15.64 / MAX: 91.99 MIN: 15.47 / MAX: 19.58 MIN: 15.7 / MAX: 16.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet 1 2 3 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 16.68 16.43 16.54 MIN: 16.31 / MAX: 20.34 MIN: 16.12 / MAX: 16.83 MIN: 16.09 / MAX: 41.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 5.45 5.41 5.43 MIN: 5.28 / MAX: 6.75 MIN: 5.3 / MAX: 6.62 MIN: 5.26 / MAX: 7.02 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 1.0778 2.1556 3.2334 4.3112 5.389 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 4.78 4.79 4.78 MIN: 4.72 / MAX: 6.18 MIN: 4.72 / MAX: 5.99 MIN: 4.71 / MAX: 6.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 1 2 3 1.1025 2.205 3.3075 4.41 5.5125 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 4.90 4.90 4.88 MIN: 4.83 / MAX: 5.5 MIN: 4.82 / MAX: 5.78 MIN: 4.82 / MAX: 6.14 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet 1 2 3 1.0913 2.1826 3.2739 4.3652 5.4565 SE +/- 0.00, N = 3 SE +/- 0.01, N = 2 SE +/- 0.01, N = 3 4.85 4.83 4.84 MIN: 4.74 / MAX: 17.84 MIN: 4.77 / MAX: 5.94 MIN: 4.75 / MAX: 6.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 1 2 3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 6.65 6.63 6.63 MIN: 6.57 / MAX: 7.57 MIN: 6.51 / MAX: 7.64 MIN: 6.49 / MAX: 7.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface 1 2 3 0.441 0.882 1.323 1.764 2.205 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1.95 1.96 1.96 MIN: 1.91 / MAX: 2.38 MIN: 1.93 / MAX: 2.01 MIN: 1.91 / MAX: 2.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet 1 2 3 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.16, N = 3 17.16 17.13 17.25 MIN: 16.73 / MAX: 20.56 MIN: 16.63 / MAX: 43.37 MIN: 16.73 / MAX: 45.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 1 2 3 15 30 45 60 75 SE +/- 0.13, N = 3 SE +/- 0.36, N = 3 SE +/- 0.28, N = 3 67.17 66.98 67.37 MIN: 66.36 / MAX: 73.24 MIN: 65.64 / MAX: 126.21 MIN: 66.06 / MAX: 73.16 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.15, N = 3 SE +/- 0.11, N = 3 SE +/- 0.09, N = 3 16.40 16.31 16.44 MIN: 16.08 / MAX: 26.36 MIN: 16.09 / MAX: 19.82 MIN: 16.16 / MAX: 21.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 16.24 16.31 16.21 MIN: 16.13 / MAX: 16.37 MIN: 16.09 / MAX: 44.78 MIN: 15.99 / MAX: 17.45 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 1 2 3 20 40 60 80 100 SE +/- 53.67, N = 3 SE +/- 0.08, N = 3 SE +/- 0.22, N = 3 81.33 27.24 27.69 MIN: 27.13 / MAX: 4085.39 MIN: 26.91 / MAX: 29.33 MIN: 27.07 / MAX: 28.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny 1 2 3 7 14 21 28 35 SE +/- 0.08, N = 3 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 28.70 28.34 28.55 MIN: 28.44 / MAX: 29.82 MIN: 27.9 / MAX: 29.03 MIN: 28.23 / MAX: 29.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet 1 2 3 1.2848 2.5696 3.8544 5.1392 6.424 SE +/- 0.06, N = 7 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.71 5.65 5.61 MIN: 5.52 / MAX: 49.8 MIN: 5.52 / MAX: 13.59 MIN: 5.52 / MAX: 6.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 7 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 6.11 6.05 6.06 MIN: 6 / MAX: 44.29 MIN: 5.98 / MAX: 6.65 MIN: 5.99 / MAX: 6.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 0.6593 1.3186 1.9779 2.6372 3.2965 SE +/- 0.01, N = 7 SE +/- 0.00, N = 3 SE +/- 0.05, N = 3 2.90 2.93 2.88 MIN: 2.68 / MAX: 3.38 MIN: 2.68 / MAX: 4.09 MIN: 2.68 / MAX: 4.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 0.9203 1.8406 2.7609 3.6812 4.6015 SE +/- 0.00, N = 7 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 4.07 4.08 4.09 MIN: 3.81 / MAX: 6.15 MIN: 3.81 / MAX: 6.2 MIN: 3.81 / MAX: 6.11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 0.5603 1.1206 1.6809 2.2412 2.8015 SE +/- 0.01, N = 7 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 2.49 2.48 2.49 MIN: 2.33 / MAX: 3.42 MIN: 2.34 / MAX: 3.41 MIN: 2.33 / MAX: 3.42 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet 1 2 3 0.6885 1.377 2.0655 2.754 3.4425 SE +/- 0.01, N = 7 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 3.06 3.04 3.06 MIN: 2.83 / MAX: 4.89 MIN: 2.84 / MAX: 4.92 MIN: 2.83 / MAX: 4.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 3 6 9 12 15 SE +/- 0.04, N = 7 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 11.03 10.90 11.06 MIN: 9.73 / MAX: 24.03 MIN: 9.76 / MAX: 23.27 MIN: 9.72 / MAX: 24.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface 1 2 3 0.2115 0.423 0.6345 0.846 1.0575 SE +/- 0.01, N = 7 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 0.93 0.93 0.94 MIN: 0.87 / MAX: 1.52 MIN: 0.87 / MAX: 1.38 MIN: 0.87 / MAX: 1.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 1 2 3 2 4 6 8 10 SE +/- 0.04, N = 6 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 6.32 6.33 6.32 MIN: 6.09 / MAX: 18.95 MIN: 6.1 / MAX: 21.22 MIN: 6.11 / MAX: 20.01 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 1 2 3 4 8 12 16 20 SE +/- 0.08, N = 7 SE +/- 0.01, N = 3 SE +/- 0.14, N = 3 15.15 15.11 15.06 MIN: 14.65 / MAX: 32 MIN: 14.7 / MAX: 24.03 MIN: 14.67 / MAX: 24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 1 2 3 0.621 1.242 1.863 2.484 3.105 SE +/- 0.01, N = 7 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 2.73 2.76 2.73 MIN: 2.62 / MAX: 2.95 MIN: 2.65 / MAX: 18.85 MIN: 2.65 / MAX: 3.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet 1 2 3 1.2555 2.511 3.7665 5.022 6.2775 SE +/- 0.02, N = 7 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 5.58 5.58 5.58 MIN: 5.46 / MAX: 15.53 MIN: 5.48 / MAX: 9.56 MIN: 5.48 / MAX: 10.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 7 SE +/- 0.12, N = 3 SE +/- 0.01, N = 3 7.71 7.74 7.85 MIN: 7.36 / MAX: 20.01 MIN: 7.36 / MAX: 20.93 MIN: 7.37 / MAX: 20.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 7 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 8.61 8.57 8.58 MIN: 8.53 / MAX: 9.25 MIN: 8.51 / MAX: 8.81 MIN: 8.52 / MAX: 18.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenCV Test: Features 2D OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.4 Test: Features 2D 1 2 3 30K 60K 90K 120K 150K SE +/- 2677.01, N = 12 SE +/- 1818.45, N = 12 SE +/- 2214.44, N = 12 143206 135808 142960 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenCV Test: Object Detection OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.4 Test: Object Detection 1 2 3 14K 28K 42K 56K 70K SE +/- 962.41, N = 3 SE +/- 974.24, N = 3 SE +/- 941.51, N = 15 66325 65007 66757 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenCV Test: DNN - Deep Neural Network OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.4 Test: DNN - Deep Neural Network 1 2 3 1000 2000 3000 4000 5000 SE +/- 189.01, N = 15 SE +/- 147.14, N = 12 SE +/- 135.75, N = 15 4384 4354 4443 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OSBench Test: Create Files OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Files 1 2 3 3 6 9 12 15 SE +/- 0.19, N = 3 SE +/- 0.12, N = 3 SE +/- 0.10, N = 3 12.17 12.42 12.05 1. (CC) gcc options: -lm
OSBench Test: Create Threads OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Threads 1 2 3 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.37, N = 15 SE +/- 0.38, N = 15 14.06 12.55 12.51 1. (CC) gcc options: -lm
OSBench Test: Launch Programs OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Launch Programs 1 2 3 10 20 30 40 50 SE +/- 0.44, N = 3 SE +/- 0.35, N = 3 SE +/- 0.40, N = 3 41.74 40.91 40.99 1. (CC) gcc options: -lm
OSBench Test: Create Processes OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Processes 1 2 3 8 16 24 32 40 SE +/- 0.49, N = 3 SE +/- 0.20, N = 3 SE +/- 0.27, N = 3 33.16 32.79 32.37 1. (CC) gcc options: -lm
OSBench Test: Memory Allocations OpenBenchmarking.org Ns Per Event, Fewer Is Better OSBench Test: Memory Allocations 1 2 3 15 30 45 60 75 SE +/- 0.18, N = 3 SE +/- 0.10, N = 3 SE +/- 0.22, N = 3 65.43 68.54 68.11 1. (CC) gcc options: -lm
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No 1 2 3 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 15.58 15.61 15.62
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes 1 2 3 30 60 90 120 150 SE +/- 0.60, N = 3 SE +/- 0.80, N = 3 SE +/- 0.63, N = 3 111.64 111.93 111.70
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 60 120 180 240 300 SE +/- 0.28, N = 3 SE +/- 1.56, N = 3 SE +/- 1.60, N = 3 253.97 250.08 252.15 MIN: 245.27 / MAX: 273.35 MIN: 242.75 / MAX: 269.72 MIN: 242.11 / MAX: 284.91 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 50 100 150 200 250 SE +/- 0.44, N = 3 SE +/- 0.21, N = 3 SE +/- 0.82, N = 3 233.48 233.49 230.64 MIN: 231.3 / MAX: 241.85 MIN: 230.13 / MAX: 234.78 MIN: 227.29 / MAX: 233.37 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default 1 2 3 0.3274 0.6548 0.9822 1.3096 1.637 SE +/- 0.018, N = 4 SE +/- 0.023, N = 3 SE +/- 0.022, N = 3 1.455 1.433 1.445 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 1 2 3 0.504 1.008 1.512 2.016 2.52 SE +/- 0.019, N = 3 SE +/- 0.017, N = 3 SE +/- 0.013, N = 3 2.240 2.175 2.223 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless 1 2 3 4 8 12 16 20 SE +/- 0.18, N = 3 SE +/- 0.21, N = 3 SE +/- 0.08, N = 3 15.39 15.54 15.20 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression 1 2 3 2 4 6 8 10 SE +/- 0.076, N = 3 SE +/- 0.097, N = 4 SE +/- 0.046, N = 3 6.944 6.938 6.780 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression 1 2 3 8 16 24 32 40 SE +/- 0.16, N = 3 SE +/- 0.42, N = 3 SE +/- 0.29, N = 3 32.02 32.53 33.09 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
Phoronix Test Suite v10.8.5