NVIDIA TITAN RTX On Linux AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA TITAN RTX 24GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2010017-PTS-NVIDIATI83&grs .
NVIDIA TITAN RTX On Linux Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 1 2 3 4 5 AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 + 2000GB NVIDIA TITAN RTX 24GB (390/405MHz) NVIDIA TU102 HD Audio DELL P2415Q Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.4.0-48-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 NVIDIA 450.66 4.6.0 OpenCL 2.0 AMD-APP (3182.0) + OpenCL 1.2 CUDA 11.0.228 1.2.133 GCC 9.3.0 + CUDA 11.0 ext4 3840x2160 NVIDIA TITAN RTX 24GB (1350/7000MHz) NVIDIA TITAN RTX 24GB (390/405MHz) NVIDIA TITAN RTX 24GB (1350/7000MHz) NVIDIA TITAN RTX 24GB (390/405MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013 OpenCL Details - GPU Compute Cores: 4608 Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
NVIDIA TITAN RTX On Linux ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - yolov4-tiny realsr-ncnn: 4x - No ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 realsr-ncnn: 4x - Yes ncnn: Vulkan GPU - squeezenet caffe: AlexNet - NVIDIA CUDA - 1000 caffe: AlexNet - NVIDIA CUDA - 100 caffe: GoogleNet - NVIDIA CUDA - 100 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU - mnasnet caffe: GoogleNet - NVIDIA CUDA - 1000 caffe: GoogleNet - NVIDIA CUDA - 200 hashcat: 7-Zip ncnn: Vulkan GPU - vgg16 caffe: AlexNet - NVIDIA CUDA - 200 hashcat: SHA1 ncnn: Vulkan GPU - resnet50 hashcat: MD5 vkfft: hashcat: SHA-512 hashcat: TrueCrypt RIPEMD160 + XTS ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 redshift: 1 2 3 4 5 1.63 0.58 7.34 7.831 3.02 1.38 42.884 3.86 8921.31 922.123 2855.00 1.28 2.6 1.32 4.14 1.44 28171.0 5680.16 916533 5.08 1805.53 18307600000 3.06 57420166667 40138 2541833333 672533 1.64 235 1.57 0.57 7.17 7.976 2.97 1.39 43.085 3.86 8920.17 924.703 2875.90 1.29 2.59 1.32 4.13 1.45 28230.8 5678.60 917200 5.09 1806.40 18259766667 3.05 57241333333 40047 2534366667 670800 1.64 235 1.62 0.58 7.15 7.858 2.98 1.38 42.655 3.90 8900.18 923.238 2860.43 1.28 2.60 1.32 4.14 1.45 28178.6 5693.34 917000 5.08 1810.29 18322300000 3.05 57365533333 40019 2541733333 672600 1.64 235 1.58 0.59 7.17 7.849 2.97 1.38 42.816 3.90 8934.21 923.667 2861.31 1.28 2.60 1.31 4.13 1.45 28240.1 5657.51 915167 5.09 1812.07 18285433333 3.06 57329633333 40048 2540333333 671700 1.64 235 1.60 0.58 7.15 7.825 2.97 1.40 42.606 3.88 8986.35 931.024 2878.31 1.28 2.61 1.31 4.16 1.45 28365.7 5693.66 919933 5.07 1811.35 18324466667 3.06 57416166667 40061 2541766667 672200 1.64 235 OpenBenchmarking.org
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet 1 2 3 4 5 0.3668 0.7336 1.1004 1.4672 1.834 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 2 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 1.63 1.57 1.62 1.58 1.60 MIN: 1.54 / MAX: 4.84 MIN: 1.54 / MAX: 1.82 MIN: 1.55 / MAX: 1.86 MIN: 1.54 / MAX: 6.38 MIN: 1.55 / MAX: 2.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface 1 2 3 4 5 0.1328 0.2656 0.3984 0.5312 0.664 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 0.58 0.57 0.58 0.59 0.58 MIN: 0.57 / MAX: 0.71 MAX: 0.6 MIN: 0.56 / MAX: 0.62 MIN: 0.57 / MAX: 1.82 MIN: 0.57 / MAX: 1.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 4 5 2 4 6 8 10 SE +/- 0.12, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 7.34 7.17 7.15 7.17 7.15 MIN: 6.93 / MAX: 18.18 MIN: 6.94 / MAX: 13.24 MIN: 6.93 / MAX: 13.66 MIN: 6.93 / MAX: 8.75 MIN: 6.92 / MAX: 9.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No 1 2 3 4 5 2 4 6 8 10 SE +/- 0.063, N = 3 SE +/- 0.063, N = 3 SE +/- 0.068, N = 3 SE +/- 0.073, N = 3 SE +/- 0.087, N = 3 7.831 7.976 7.858 7.849 7.825
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 1 2 3 4 5 0.6795 1.359 2.0385 2.718 3.3975 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.02 2.97 2.98 2.97 2.97 MIN: 2.96 / MAX: 7.02 MIN: 2.95 / MAX: 3.79 MIN: 2.95 / MAX: 4.92 MIN: 2.95 / MAX: 4.06 MIN: 2.95 / MAX: 3.59 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 4 5 0.315 0.63 0.945 1.26 1.575 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 1.38 1.39 1.38 1.38 1.40 MIN: 1.36 / MAX: 1.43 MIN: 1.37 / MAX: 2.58 MIN: 1.37 / MAX: 1.77 MIN: 1.37 / MAX: 2.56 MIN: 1.37 / MAX: 9.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes 1 2 3 4 5 10 20 30 40 50 SE +/- 0.16, N = 3 SE +/- 0.18, N = 3 SE +/- 0.24, N = 3 SE +/- 0.18, N = 3 SE +/- 0.23, N = 3 42.88 43.09 42.66 42.82 42.61
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet 1 2 3 4 5 0.8775 1.755 2.6325 3.51 4.3875 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 3.86 3.86 3.90 3.90 3.88 MIN: 3.74 / MAX: 5.31 MIN: 3.71 / MAX: 5.2 MIN: 3.74 / MAX: 4.41 MIN: 3.77 / MAX: 7.98 MIN: 3.76 / MAX: 5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 1 2 3 4 5 2K 4K 6K 8K 10K SE +/- 3.89, N = 3 SE +/- 2.81, N = 3 SE +/- 10.24, N = 3 SE +/- 25.22, N = 3 SE +/- 66.63, N = 3 8921.31 8920.17 8900.18 8934.21 8986.35 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 1 2 3 4 5 200 400 600 800 1000 SE +/- 3.49, N = 3 SE +/- 6.92, N = 3 SE +/- 5.77, N = 3 SE +/- 6.30, N = 3 SE +/- 6.25, N = 3 922.12 924.70 923.24 923.67 931.02 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 1 2 3 4 5 600 1200 1800 2400 3000 SE +/- 2.35, N = 3 SE +/- 6.81, N = 3 SE +/- 13.01, N = 3 SE +/- 19.64, N = 3 SE +/- 16.43, N = 3 2855.00 2875.90 2860.43 2861.31 2878.31 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 4 5 0.2903 0.5806 0.8709 1.1612 1.4515 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.28 1.29 1.28 1.28 1.28 MIN: 1.27 / MAX: 1.34 MIN: 1.27 / MAX: 1.5 MIN: 1.27 / MAX: 1.35 MIN: 1.27 / MAX: 1.47 MIN: 1.27 / MAX: 1.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 4 5 0.5873 1.1746 1.7619 2.3492 2.9365 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 2.60 2.59 2.60 2.60 2.61 MIN: 2.59 / MAX: 2.89 MIN: 2.58 / MAX: 2.79 MIN: 2.59 / MAX: 2.87 MIN: 2.59 / MAX: 3.69 MIN: 2.59 / MAX: 3.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 1 2 3 4 5 0.297 0.594 0.891 1.188 1.485 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.32 1.32 1.32 1.31 1.31 MIN: 1.3 / MAX: 3.18 MIN: 1.3 / MAX: 1.45 MIN: 1.31 / MAX: 1.5 MIN: 1.3 / MAX: 1.97 MIN: 1.3 / MAX: 2.42 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet 1 2 3 4 5 0.936 1.872 2.808 3.744 4.68 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 4.14 4.13 4.14 4.13 4.16 MIN: 4.06 / MAX: 7.21 MIN: 4.03 / MAX: 8.16 MIN: 4.08 / MAX: 5.99 MIN: 4.04 / MAX: 7.11 MIN: 4.09 / MAX: 15.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet 1 2 3 4 5 0.3263 0.6526 0.9789 1.3052 1.6315 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 1.44 1.45 1.45 1.45 1.45 MIN: 1.43 / MAX: 1.49 MIN: 1.43 / MAX: 6.87 MIN: 1.43 / MAX: 2.12 MIN: 1.43 / MAX: 2.58 MIN: 1.43 / MAX: 2.54 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 1 2 3 4 5 6K 12K 18K 24K 30K SE +/- 93.92, N = 3 SE +/- 11.37, N = 3 SE +/- 39.43, N = 3 SE +/- 49.35, N = 3 SE +/- 60.61, N = 3 28171.0 28230.8 28178.6 28240.1 28365.7 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 1 2 3 4 5 1200 2400 3600 4800 6000 SE +/- 9.16, N = 3 SE +/- 5.02, N = 3 SE +/- 23.06, N = 3 SE +/- 15.61, N = 3 SE +/- 9.15, N = 3 5680.16 5678.60 5693.34 5657.51 5693.66 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: 7-Zip 1 2 3 4 5 200K 400K 600K 800K 1000K SE +/- 290.59, N = 3 SE +/- 4404.92, N = 3 SE +/- 2150.19, N = 3 SE +/- 218.58, N = 3 SE +/- 2945.24, N = 3 916533 917200 917000 915167 919933
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 1 2 3 4 5 1.1453 2.2906 3.4359 4.5812 5.7265 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 5.08 5.09 5.08 5.09 5.07 MIN: 4.92 / MAX: 18.61 MIN: 4.93 / MAX: 9.69 MIN: 4.92 / MAX: 18.32 MIN: 4.93 / MAX: 18.51 MIN: 4.92 / MAX: 18.36 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 4.34, N = 3 SE +/- 3.74, N = 3 SE +/- 7.02, N = 3 SE +/- 1.42, N = 3 SE +/- 4.80, N = 3 1805.53 1806.40 1810.29 1812.07 1811.35 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA1 1 2 3 4 5 4000M 8000M 12000M 16000M 20000M SE +/- 14435719.59, N = 3 SE +/- 7574592.03, N = 3 SE +/- 14301048.91, N = 3 SE +/- 14493025.14, N = 3 SE +/- 10927844.15, N = 3 18307600000 18259766667 18322300000 18285433333 18324466667
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 1 2 3 4 5 0.6885 1.377 2.0655 2.754 3.4425 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 3.06 3.05 3.05 3.06 3.06 MIN: 3.03 / MAX: 4.65 MIN: 3.03 / MAX: 3.55 MIN: 3.02 / MAX: 4.01 MIN: 3.04 / MAX: 4.53 MIN: 3.02 / MAX: 7.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: MD5 1 2 3 4 5 12000M 24000M 36000M 48000M 60000M SE +/- 31137883.32, N = 3 SE +/- 70861139.64, N = 3 SE +/- 47871262.55, N = 3 SE +/- 85923577.15, N = 3 SE +/- 76423651.08, N = 3 57420166667 57241333333 57365533333 57329633333 57416166667
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 2020-09-29 1 2 3 4 5 9K 18K 27K 36K 45K SE +/- 35.69, N = 3 SE +/- 17.33, N = 3 SE +/- 11.20, N = 3 SE +/- 17.68, N = 3 SE +/- 7.23, N = 3 40138 40047 40019 40048 40061
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA-512 1 2 3 4 5 500M 1000M 1500M 2000M 2500M SE +/- 1348249.89, N = 3 SE +/- 2493547.23, N = 3 SE +/- 2233333.33, N = 3 SE +/- 1920358.76, N = 3 SE +/- 2162046.36, N = 3 2541833333 2534366667 2541733333 2540333333 2541766667
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: TrueCrypt RIPEMD160 + XTS 1 2 3 4 5 140K 280K 420K 560K 700K SE +/- 66.67, N = 3 SE +/- 300.00, N = 3 SE +/- 57.74, N = 3 SE +/- 400.00, N = 3 SE +/- 702.38, N = 3 672533 670800 672600 671700 672200
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 4 5 0.369 0.738 1.107 1.476 1.845 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.64 1.64 1.64 1.64 1.64 MIN: 1.62 / MAX: 1.67 MIN: 1.63 / MAX: 1.71 MIN: 1.62 / MAX: 1.67 MIN: 1.61 / MAX: 2.74 MIN: 1.62 / MAX: 2.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
RedShift Demo OpenBenchmarking.org Seconds, Fewer Is Better RedShift Demo 3.0 1 2 3 4 5 50 100 150 200 250 SE +/- 0.33, N = 3 235 235 235 235 235
Phoronix Test Suite v10.8.4