3900X New Tests

AMD Ryzen 9 3900X 12-Core testing with an ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) and MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009250-PTS-3900XNEW05&grr&rdt .
3900X New Tests: System Details
Result identifiers: 1, 2, 3
Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)
Motherboard: ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16GB
Disk: Samsung SSD 970 EVO Plus 250GB
Graphics: MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1366/2000MHz)
Audio: AMD Ellesmere HDMI Audio
Monitor: G237HL
Network: Realtek RTL8111/8168/8411 + Intel-AC 9260
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc6daily20200922-generic (x86_64) 20200921
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
OpenGL: 4.6 Mesa 20.2.0-devel (git-64cdc13 2020-07-02 focal-oibaf-ppa) (LLVM 10.0.0)
Vulkan: 1.2.131
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq ondemand
- CPU Microcode: 0x8701021

Graphics Details
- GLAMOR

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected
3900X New Tests: Results Overview

Test | 1 | 2 | 3
opencv: Features 2D | 143206 | 135808 | 142960
opencv: Object Detection | 66325 | 65007 | 66757
realsr-ncnn: 4x - Yes | 111.642 | 111.925 | 111.696
influxdb: 4 - 10000 - 2,5000,1 - 10000 | 1310447.9 | 1313053.3 | 1313875.7
influxdb: 64 - 10000 - 2,5000,1 - 10000 | 1444861.4 | 1447435.3 | 1445347.6
influxdb: 1024 - 10000 - 2,5000,1 - 10000 | 1472018.5 | 1468089.0 | 1471736.4
ncnn: CPU - yolov4-tiny | 28.70 | 28.34 | 28.55
ncnn: CPU - resnet50 | 81.33 | 27.24 | 27.69
ncnn: CPU - alexnet | 16.24 | 16.31 | 16.21
ncnn: CPU - resnet18 | 16.40 | 16.31 | 16.44
ncnn: CPU - vgg16 | 67.17 | 66.98 | 67.37
ncnn: CPU - googlenet | 17.16 | 17.13 | 17.25
ncnn: CPU - blazeface | 1.95 | 1.96 | 1.96
ncnn: CPU - efficientnet-b0 | 6.65 | 6.63 | 6.63
ncnn: CPU - mnasnet | 4.85 | 4.83 | 4.84
ncnn: CPU - shufflenet-v2 | 4.90 | 4.90 | 4.88
ncnn: CPU-v3-v3 - mobilenet-v3 | 4.78 | 4.79 | 4.78
ncnn: CPU-v2-v2 - mobilenet-v2 | 5.45 | 5.41 | 5.43
ncnn: CPU - mobilenet | 16.68 | 16.43 | 16.54
ncnn: CPU - squeezenet | 16.32 | 15.93 | 16.10
espeak: Text-To-Speech Synthesis | 27.298 | 27.316 | 27.513
dcraw: RAW To PPM Image Conversion | 39.505 | 39.085 | 38.500
ncnn: Vulkan GPU - yolov4-tiny | 8.61 | 8.57 | 8.58
ncnn: Vulkan GPU - resnet50 | 7.71 | 7.74 | 7.85
ncnn: Vulkan GPU - alexnet | 5.58 | 5.58 | 5.58
ncnn: Vulkan GPU - resnet18 | 2.73 | 2.76 | 2.73
ncnn: Vulkan GPU - vgg16 | 15.15 | 15.11 | 15.06
ncnn: Vulkan GPU - googlenet | 6.32 | 6.33 | 6.32
ncnn: Vulkan GPU - blazeface | 0.93 | 0.93 | 0.94
ncnn: Vulkan GPU - efficientnet-b0 | 11.03 | 10.90 | 11.06
ncnn: Vulkan GPU - mnasnet | 3.06 | 3.04 | 3.06
ncnn: Vulkan GPU - shufflenet-v2 | 2.49 | 2.48 | 2.49
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 | 4.07 | 4.08 | 4.09
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 | 2.90 | 2.93 | 2.88
ncnn: Vulkan GPU - mobilenet | 6.11 | 6.05 | 6.06
ncnn: Vulkan GPU - squeezenet | 5.71 | 5.65 | 5.61
webp: Quality 100, Lossless, Highest Compression | 32.017 | 32.533 | 33.093
libraw: Post-Processing Benchmark | 42.42 | 42.32 | 42.74
mpv: Big Buck Bunny Sunflower 4K - Software Only | 698.08 | 700.67 | 697.91
opencv: DNN - Deep Neural Network | 4384 | 4354 | 4443
osbench: Create Threads | 14.062723 | 12.553533 | 12.506962
tnn: CPU - MobileNet v2 | 253.966 | 250.077 | 252.154
tnn: CPU - SqueezeNet v1.1 | 233.480 | 233.491 | 230.640
realsr-ncnn: 4x - No | 15.580 | 15.605 | 15.616
webp: Quality 100, Lossless | 15.390 | 15.543 | 15.204
mpv: Big Buck Bunny Sunflower 1080p - Software Only | 2345.26 | 2334.36 | 2327.51
webp: Quality 100, Highest Compression | 6.944 | 6.938 | 6.780
osbench: Create Files | 12.166849 | 12.420693 | 12.050966
osbench: Memory Allocations | 65.431913 | 68.543672 | 68.105936
osbench: Create Processes | 33.159256 | 32.787323 | 32.373269
osbench: Launch Programs | 41.739146 | 40.913423 | 40.993690
webp: Quality 100 | 2.240 | 2.175 | 2.223
webp: Default | 1.455 | 1.433 | 1.445
OpenCV 4.4 - Test: Features 2D
ms, Fewer Is Better
Run 1: 143206 (SE +/- 2677.01, N = 12)
Run 2: 135808 (SE +/- 1818.45, N = 12)
Run 3: 142960 (SE +/- 2214.44, N = 12)
(CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

OpenCV 4.4 - Test: Object Detection
ms, Fewer Is Better
Run 1: 66325 (SE +/- 962.41, N = 3)
Run 2: 65007 (SE +/- 974.24, N = 3)
Run 3: 66757 (SE +/- 941.51, N = 15)
(CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

RealSR-NCNN 20200818 - Scale: 4x - TAA: Yes
Seconds, Fewer Is Better
Run 1: 111.64 (SE +/- 0.60, N = 3)
Run 2: 111.93 (SE +/- 0.80, N = 3)
Run 3: 111.70 (SE +/- 0.63, N = 3)

InfluxDB 1.8.2 - Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000
val/sec, More Is Better
Run 1: 1310447.9 (SE +/- 1715.06, N = 3)
Run 2: 1313053.3 (SE +/- 1422.20, N = 3)
Run 3: 1313875.7 (SE +/- 2904.05, N = 3)

InfluxDB 1.8.2 - Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000
val/sec, More Is Better
Run 1: 1444861.4 (SE +/- 3035.14, N = 3)
Run 2: 1447435.3 (SE +/- 1343.37, N = 3)
Run 3: 1445347.6 (SE +/- 2565.01, N = 3)

InfluxDB 1.8.2 - Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000
val/sec, More Is Better
Run 1: 1472018.5 (SE +/- 415.31, N = 3)
Run 2: 1468089.0 (SE +/- 1895.88, N = 3)
Run 3: 1471736.4 (SE +/- 2749.38, N = 3)

NCNN 20200916 - Target: CPU - Model: yolov4-tiny
ms, Fewer Is Better
Run 1: 28.70 (SE +/- 0.08, N = 3; MIN: 28.44 / MAX: 29.82)
Run 2: 28.34 (SE +/- 0.14, N = 3; MIN: 27.9 / MAX: 29.03)
Run 3: 28.55 (SE +/- 0.03, N = 3; MIN: 28.23 / MAX: 29.25)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: resnet50
ms, Fewer Is Better
Run 1: 81.33 (SE +/- 53.67, N = 3; MIN: 27.13 / MAX: 4085.39)
Run 2: 27.24 (SE +/- 0.08, N = 3; MIN: 26.91 / MAX: 29.33)
Run 3: 27.69 (SE +/- 0.22, N = 3; MIN: 27.07 / MAX: 28.69)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

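Run 1 of the resnet50 CPU result reports a standard error that is a large fraction of its mean, alongside a MAX of 4085.39 ms. A minimal Python sketch of how such a result could be flagged follows. The means and SE values are copied from the result above; the 5% cutoff and the helper names `relative_se` and `noisy_runs` are assumptions made for illustration and are not part of the Phoronix Test Suite.

```python
# Means and standard errors (ms) copied from the NCNN CPU resnet50 result.
runs = {
    "1": {"mean": 81.33, "se": 53.67},
    "2": {"mean": 27.24, "se": 0.08},
    "3": {"mean": 27.69, "se": 0.22},
}

# Hypothetical cutoff: treat a run as noisy when SE exceeds 5% of its mean.
RSE_LIMIT = 0.05


def relative_se(mean, se):
    """Standard error expressed as a fraction of the mean."""
    return se / mean


def noisy_runs(results, limit=RSE_LIMIT):
    """Return the labels of runs whose relative SE is above the limit."""
    return [
        label
        for label, r in results.items()
        if relative_se(r["mean"], r["se"]) > limit
    ]


print(noisy_runs(runs))
```

With these inputs only run 1 exceeds the cutoff, which suggests its 81.33 ms mean is better read as a measurement disturbed by an outlier than as a real slowdown relative to runs 2 and 3.
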
NCNN 20200916 - Target: CPU - Model: alexnet
ms, Fewer Is Better
Run 1: 16.24 (SE +/- 0.01, N = 3; MIN: 16.13 / MAX: 16.37)
Run 2: 16.31 (SE +/- 0.09, N = 3; MIN: 16.09 / MAX: 44.78)
Run 3: 16.21 (SE +/- 0.05, N = 3; MIN: 15.99 / MAX: 17.45)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: resnet18
ms, Fewer Is Better
Run 1: 16.40 (SE +/- 0.15, N = 3; MIN: 16.08 / MAX: 26.36)
Run 2: 16.31 (SE +/- 0.11, N = 3; MIN: 16.09 / MAX: 19.82)
Run 3: 16.44 (SE +/- 0.09, N = 3; MIN: 16.16 / MAX: 21.66)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: vgg16
ms, Fewer Is Better
Run 1: 67.17 (SE +/- 0.13, N = 3; MIN: 66.36 / MAX: 73.24)
Run 2: 66.98 (SE +/- 0.36, N = 3; MIN: 65.64 / MAX: 126.21)
Run 3: 67.37 (SE +/- 0.28, N = 3; MIN: 66.06 / MAX: 73.16)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: googlenet
ms, Fewer Is Better
Run 1: 17.16 (SE +/- 0.08, N = 3; MIN: 16.73 / MAX: 20.56)
Run 2: 17.13 (SE +/- 0.11, N = 3; MIN: 16.63 / MAX: 43.37)
Run 3: 17.25 (SE +/- 0.16, N = 3; MIN: 16.73 / MAX: 45.28)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: blazeface
ms, Fewer Is Better
Run 1: 1.95 (SE +/- 0.01, N = 3; MIN: 1.91 / MAX: 2.38)
Run 2: 1.96 (SE +/- 0.01, N = 3; MIN: 1.93 / MAX: 2.01)
Run 3: 1.96 (SE +/- 0.01, N = 3; MIN: 1.91 / MAX: 2.03)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: efficientnet-b0
ms, Fewer Is Better
Run 1: 6.65 (SE +/- 0.02, N = 3; MIN: 6.57 / MAX: 7.57)
Run 2: 6.63 (SE +/- 0.05, N = 3; MIN: 6.51 / MAX: 7.64)
Run 3: 6.63 (SE +/- 0.04, N = 3; MIN: 6.49 / MAX: 7.39)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: mnasnet
ms, Fewer Is Better
Run 1: 4.85 (SE +/- 0.00, N = 3; MIN: 4.74 / MAX: 17.84)
Run 2: 4.83 (SE +/- 0.01, N = 2; MIN: 4.77 / MAX: 5.94)
Run 3: 4.84 (SE +/- 0.01, N = 3; MIN: 4.75 / MAX: 6.83)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: shufflenet-v2
ms, Fewer Is Better
Run 1: 4.90 (SE +/- 0.02, N = 3; MIN: 4.83 / MAX: 5.5)
Run 2: 4.90 (SE +/- 0.02, N = 3; MIN: 4.82 / MAX: 5.78)
Run 3: 4.88 (SE +/- 0.01, N = 3; MIN: 4.82 / MAX: 6.14)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better
Run 1: 4.78 (SE +/- 0.01, N = 3; MIN: 4.72 / MAX: 6.18)
Run 2: 4.79 (SE +/- 0.01, N = 3; MIN: 4.72 / MAX: 5.99)
Run 3: 4.78 (SE +/- 0.00, N = 3; MIN: 4.71 / MAX: 6.85)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better
Run 1: 5.45 (SE +/- 0.03, N = 3; MIN: 5.28 / MAX: 6.75)
Run 2: 5.41 (SE +/- 0.01, N = 3; MIN: 5.3 / MAX: 6.62)
Run 3: 5.43 (SE +/- 0.03, N = 3; MIN: 5.26 / MAX: 7.02)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: mobilenet
ms, Fewer Is Better
Run 1: 16.68 (SE +/- 0.09, N = 3; MIN: 16.31 / MAX: 20.34)
Run 2: 16.43 (SE +/- 0.08, N = 3; MIN: 16.12 / MAX: 16.83)
Run 3: 16.54 (SE +/- 0.11, N = 3; MIN: 16.09 / MAX: 41.76)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: CPU - Model: squeezenet
ms, Fewer Is Better
Run 1: 16.32 (SE +/- 0.03, N = 3; MIN: 15.64 / MAX: 91.99)
Run 2: 15.93 (SE +/- 0.14, N = 3; MIN: 15.47 / MAX: 19.58)
Run 3: 16.10 (SE +/- 0.13, N = 3; MIN: 15.7 / MAX: 16.75)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis
Seconds, Fewer Is Better
Run 1: 27.30 (SE +/- 0.34, N = 5)
Run 2: 27.32 (SE +/- 0.17, N = 4)
Run 3: 27.51 (SE +/- 0.24, N = 4)
(CC) gcc options: -O2 -std=c99

dcraw - RAW To PPM Image Conversion
Seconds, Fewer Is Better
Run 1: 39.51 (SE +/- 0.68, N = 3)
Run 2: 39.09 (SE +/- 0.54, N = 3)
Run 3: 38.50 (SE +/- 0.03, N = 3)
(CC) gcc options: -lm

NCNN 20200916 - Target: Vulkan GPU - Model: yolov4-tiny
ms, Fewer Is Better
Run 1: 8.61 (SE +/- 0.01, N = 7; MIN: 8.53 / MAX: 9.25)
Run 2: 8.57 (SE +/- 0.02, N = 3; MIN: 8.51 / MAX: 8.81)
Run 3: 8.58 (SE +/- 0.02, N = 3; MIN: 8.52 / MAX: 18.15)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: resnet50
ms, Fewer Is Better
Run 1: 7.71 (SE +/- 0.03, N = 7; MIN: 7.36 / MAX: 20.01)
Run 2: 7.74 (SE +/- 0.12, N = 3; MIN: 7.36 / MAX: 20.93)
Run 3: 7.85 (SE +/- 0.01, N = 3; MIN: 7.37 / MAX: 20.26)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: alexnet
ms, Fewer Is Better
Run 1: 5.58 (SE +/- 0.02, N = 7; MIN: 5.46 / MAX: 15.53)
Run 2: 5.58 (SE +/- 0.00, N = 3; MIN: 5.48 / MAX: 9.56)
Run 3: 5.58 (SE +/- 0.01, N = 3; MIN: 5.48 / MAX: 10.49)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: resnet18
ms, Fewer Is Better
Run 1: 2.73 (SE +/- 0.01, N = 7; MIN: 2.62 / MAX: 2.95)
Run 2: 2.76 (SE +/- 0.04, N = 3; MIN: 2.65 / MAX: 18.85)
Run 3: 2.73 (SE +/- 0.00, N = 3; MIN: 2.65 / MAX: 3.26)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: vgg16
ms, Fewer Is Better
Run 1: 15.15 (SE +/- 0.08, N = 7; MIN: 14.65 / MAX: 32)
Run 2: 15.11 (SE +/- 0.01, N = 3; MIN: 14.7 / MAX: 24.03)
Run 3: 15.06 (SE +/- 0.14, N = 3; MIN: 14.67 / MAX: 24)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: googlenet
ms, Fewer Is Better
Run 1: 6.32 (SE +/- 0.04, N = 6; MIN: 6.09 / MAX: 18.95)
Run 2: 6.33 (SE +/- 0.05, N = 3; MIN: 6.1 / MAX: 21.22)
Run 3: 6.32 (SE +/- 0.05, N = 3; MIN: 6.11 / MAX: 20.01)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: blazeface
ms, Fewer Is Better
Run 1: 0.93 (SE +/- 0.01, N = 7; MIN: 0.87 / MAX: 1.52)
Run 2: 0.93 (SE +/- 0.01, N = 3; MIN: 0.87 / MAX: 1.38)
Run 3: 0.94 (SE +/- 0.00, N = 3; MIN: 0.87 / MAX: 1.44)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: efficientnet-b0
ms, Fewer Is Better
Run 1: 11.03 (SE +/- 0.04, N = 7; MIN: 9.73 / MAX: 24.03)
Run 2: 10.90 (SE +/- 0.03, N = 3; MIN: 9.76 / MAX: 23.27)
Run 3: 11.06 (SE +/- 0.08, N = 3; MIN: 9.72 / MAX: 24.39)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: mnasnet
ms, Fewer Is Better
Run 1: 3.06 (SE +/- 0.01, N = 7; MIN: 2.83 / MAX: 4.89)
Run 2: 3.04 (SE +/- 0.00, N = 3; MIN: 2.84 / MAX: 4.92)
Run 3: 3.06 (SE +/- 0.02, N = 3; MIN: 2.83 / MAX: 4.92)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: shufflenet-v2
ms, Fewer Is Better
Run 1: 2.49 (SE +/- 0.01, N = 7; MIN: 2.33 / MAX: 3.42)
Run 2: 2.48 (SE +/- 0.00, N = 3; MIN: 2.34 / MAX: 3.41)
Run 3: 2.49 (SE +/- 0.01, N = 3; MIN: 2.33 / MAX: 3.42)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better
Run 1: 4.07 (SE +/- 0.00, N = 7; MIN: 3.81 / MAX: 6.15)
Run 2: 4.08 (SE +/- 0.01, N = 3; MIN: 3.81 / MAX: 6.2)
Run 3: 4.09 (SE +/- 0.01, N = 3; MIN: 3.81 / MAX: 6.11)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better
Run 1: 2.90 (SE +/- 0.01, N = 7; MIN: 2.68 / MAX: 3.38)
Run 2: 2.93 (SE +/- 0.00, N = 3; MIN: 2.68 / MAX: 4.09)
Run 3: 2.88 (SE +/- 0.05, N = 3; MIN: 2.68 / MAX: 4.69)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: mobilenet
ms, Fewer Is Better
Run 1: 6.11 (SE +/- 0.03, N = 7; MIN: 6 / MAX: 44.29)
Run 2: 6.05 (SE +/- 0.03, N = 3; MIN: 5.98 / MAX: 6.65)
Run 3: 6.06 (SE +/- 0.02, N = 3; MIN: 5.99 / MAX: 6.31)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: squeezenet
ms, Fewer Is Better
Run 1: 5.71 (SE +/- 0.06, N = 7; MIN: 5.52 / MAX: 49.8)
Run 2: 5.65 (SE +/- 0.01, N = 3; MIN: 5.52 / MAX: 13.59)
Run 3: 5.61 (SE +/- 0.01, N = 3; MIN: 5.52 / MAX: 6.15)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless, Highest Compression
Encode Time - Seconds, Fewer Is Better
Run 1: 32.02 (SE +/- 0.16, N = 3)
Run 2: 32.53 (SE +/- 0.42, N = 3)
Run 3: 33.09 (SE +/- 0.29, N = 3)
(CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

LibRaw 0.20 - Post-Processing Benchmark
Mpix/sec, More Is Better
Run 1: 42.42 (SE +/- 0.06, N = 3)
Run 2: 42.32 (SE +/- 0.15, N = 3)
Run 3: 42.74 (SE +/- 0.06, N = 3)
(CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

MPV - Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only
FPS, More Is Better
Run 1: 698.08 (SE +/- 2.78, N = 3; MIN: 444.45 / MAX: 857.17)
Run 2: 700.67 (SE +/- 0.52, N = 3; MIN: 444.45 / MAX: 857.17)
Run 3: 697.91 (SE +/- 0.55, N = 3; MIN: 461.55 / MAX: 857.17)
mpv 0.32.0

OpenCV 4.4 - Test: DNN - Deep Neural Network
ms, Fewer Is Better
Run 1: 4384 (SE +/- 189.01, N = 15)
Run 2: 4354 (SE +/- 147.14, N = 12)
Run 3: 4443 (SE +/- 135.75, N = 15)
(CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

OSBench - Test: Create Threads
us Per Event, Fewer Is Better
Run 1: 14.06 (SE +/- 0.09, N = 3)
Run 2: 12.55 (SE +/- 0.37, N = 15)
Run 3: 12.51 (SE +/- 0.38, N = 15)
(CC) gcc options: -lm

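The Create Threads means differ more between run 1 and the other two runs than in most tests here. As a rough yardstick only, the gap between two means can be expressed in units of their combined standard error. The Python sketch below uses the means and SE values reported above; the helper name `separation` is hypothetical, and the calculation ignores the differing sample counts (N = 3 versus N = 15), so it is not a formal significance test.

```python
import math


def separation(mean_a, se_a, mean_b, se_b):
    """Difference of two means in units of their combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# OSBench Create Threads (us per event): (mean, SE) as reported above.
run1 = (14.06, 0.09)
run2 = (12.55, 0.37)
run3 = (12.51, 0.38)

print(round(separation(*run1, *run2), 2))  # run 1 vs run 2
print(round(separation(*run2, *run3), 2))  # run 2 vs run 3
```

Run 1 versus run 2 comes out at several combined standard errors, while run 2 versus run 3 is well under one, which matches the impression that runs 2 and 3 agree and run 1 stands apart.
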
TNN 0.2.3 - Target: CPU - Model: MobileNet v2
ms, Fewer Is Better
Run 1: 253.97 (SE +/- 0.28, N = 3; MIN: 245.27 / MAX: 273.35)
Run 2: 250.08 (SE +/- 1.56, N = 3; MIN: 242.75 / MAX: 269.72)
Run 3: 252.15 (SE +/- 1.60, N = 3; MIN: 242.11 / MAX: 284.91)
(CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1
ms, Fewer Is Better
Run 1: 233.48 (SE +/- 0.44, N = 3; MIN: 231.3 / MAX: 241.85)
Run 2: 233.49 (SE +/- 0.21, N = 3; MIN: 230.13 / MAX: 234.78)
Run 3: 230.64 (SE +/- 0.82, N = 3; MIN: 227.29 / MAX: 233.37)
(CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

RealSR-NCNN 20200818 - Scale: 4x - TAA: No
Seconds, Fewer Is Better
Run 1: 15.58 (SE +/- 0.05, N = 3)
Run 2: 15.61 (SE +/- 0.05, N = 3)
Run 3: 15.62 (SE +/- 0.01, N = 3)

WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless
Encode Time - Seconds, Fewer Is Better
Run 1: 15.39 (SE +/- 0.18, N = 3)
Run 2: 15.54 (SE +/- 0.21, N = 3)
Run 3: 15.20 (SE +/- 0.08, N = 3)
(CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

MPV - Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only
FPS, More Is Better
Run 1: 2345.26 (SE +/- 6.47, N = 3; MIN: 1333.31 / MAX: 4000)
Run 2: 2334.36 (SE +/- 11.23, N = 3; MIN: 1333.35 / MAX: 4000.24)
Run 3: 2327.51 (SE +/- 10.07, N = 3; MIN: 1333.32 / MAX: 4000.16)
mpv 0.32.0

WebP Image Encode 1.1 - Encode Settings: Quality 100, Highest Compression
Encode Time - Seconds, Fewer Is Better
Run 1: 6.944 (SE +/- 0.076, N = 3)
Run 2: 6.938 (SE +/- 0.097, N = 4)
Run 3: 6.780 (SE +/- 0.046, N = 3)
(CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OSBench - Test: Create Files
us Per Event, Fewer Is Better
Run 1: 12.17 (SE +/- 0.19, N = 3)
Run 2: 12.42 (SE +/- 0.12, N = 3)
Run 3: 12.05 (SE +/- 0.10, N = 3)
(CC) gcc options: -lm

OSBench - Test: Memory Allocations
Ns Per Event, Fewer Is Better
Run 1: 65.43 (SE +/- 0.18, N = 3)
Run 2: 68.54 (SE +/- 0.10, N = 3)
Run 3: 68.11 (SE +/- 0.22, N = 3)
(CC) gcc options: -lm

OSBench - Test: Create Processes
us Per Event, Fewer Is Better
Run 1: 33.16 (SE +/- 0.49, N = 3)
Run 2: 32.79 (SE +/- 0.20, N = 3)
Run 3: 32.37 (SE +/- 0.27, N = 3)
(CC) gcc options: -lm

OSBench - Test: Launch Programs
us Per Event, Fewer Is Better
Run 1: 41.74 (SE +/- 0.44, N = 3)
Run 2: 40.91 (SE +/- 0.35, N = 3)
Run 3: 40.99 (SE +/- 0.40, N = 3)
(CC) gcc options: -lm

WebP Image Encode 1.1 - Encode Settings: Quality 100
Encode Time - Seconds, Fewer Is Better
Run 1: 2.240 (SE +/- 0.019, N = 3)
Run 2: 2.175 (SE +/- 0.017, N = 3)
Run 3: 2.223 (SE +/- 0.013, N = 3)
(CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

WebP Image Encode 1.1 - Encode Settings: Default
Encode Time - Seconds, Fewer Is Better
Run 1: 1.455 (SE +/- 0.018, N = 4)
Run 2: 1.433 (SE +/- 0.023, N = 3)
Run 3: 1.445 (SE +/- 0.022, N = 3)
(CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Phoronix Test Suite v10.8.5