vulkan-benchmarks

AMD Ryzen 9 7950X 16-Core testing with an ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) and NVIDIA GeForce RTX 4090 24GB on Ubuntu 23.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2308069-PTS-VULKANBE16&export=txt&grt&rdt&rro .
vulkan-benchmarks

Fields: Processor, Motherboard, Chipset, Memory, Disk, Graphics, Audio, Monitor, Network, OS, Kernel, Desktop, Display Server, OpenGL, Compiler, File-System, Screen Resolution, Display Driver

Configurations: a, b, c, d, e, f, g, h, i, 4080, 4080 rep, 4080 xxx, 4080 zzz, 3090, 3090 rep, 3070, RTX 3070 Ti, 4090, 4090 rep, nv 4090

System details for the first configuration listed:
  Processor: AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)
  Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)
  Chipset: AMD Device 14d8
  Memory: 32GB
  Disk: Western Digital WD_BLACK SN850X 1000GB + 4001GB
  Graphics: AMD Radeon RX 6700 XT (2855/1000MHz)
  Audio: AMD Navi 21/23
  Monitor: ASUS MG28U
  Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
  OS: Ubuntu 23.04
  Kernel: 6.4.6-060406-generic (x86_64)
  Desktop: GNOME Shell 44.2
  Display Server: X Server 1.21.1.7 + Wayland
  OpenGL: 4.6 Mesa 23.3~git2307260600.87109c~oibaf~l (git-87109c3 2023-07-26 lunar-oibaf-ppa) (LLVM 15.0.7 DRM 3.52)
  Compiler: GCC 12.2.0
  File-System: ext4
  Screen Resolution: 3840x2160

Values listed for the later configurations, in export order:
  MSI NVIDIA GeForce RTX 4060 8GB, NVIDIA Device 22be, X Server 1.21.1.7, NVIDIA 535.86.05, 4.6.0
  eVGA NVIDIA GeForce RTX 3060 12GB, NVIDIA GA106 HD Audio
  NVIDIA GeForce RTX 3060 Ti 8GB, NVIDIA GA104 HD Audio, 2560x1440
  NVIDIA GeForce RTX 4080 16GB, NVIDIA Device 22bb, 3840x2160
  NVIDIA GeForce RTX 3090 24GB, NVIDIA GA102 HD Audio
  NVIDIA GeForce RTX 3070 8GB, NVIDIA GA104 HD Audio, 2560x1440
  NVIDIA GeForce RTX 3070 Ti 8GB
  NVIDIA GeForce RTX 4090 24GB, NVIDIA AD102 HD Audio, 3840x2160

Kernel Details
  - Transparent Huge Pages: madvise

Compiler Details
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  - a: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
  - b: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
  - c: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
  - d: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - e: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - f: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - g: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - h: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - i: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 4080: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 4080 rep: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 4080 xxx: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 4080 zzz: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 3090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 3090 rep: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 3070: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - RTX 3070 Ti: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 4090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - 4090 rep: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
  - nv 4090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203

Graphics Details
  - a: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-D5121100-101
  - b: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-D5121100-101
  - c: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-D5121100-101
  - d: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3
  - e: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3
  - f: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46
  - g: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46
  - h: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46
  - i: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2c
  - 4080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04
  - 4080 rep: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04
  - 4080 xxx: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04
  - 4080 zzz: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04
  - 3090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02
  - 3090 rep: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02
  - 3070: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2b
  - RTX 3070 Ti: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.5b.00.02
  - 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01
  - 4090 rep: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01
  - nv 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01

Security Details
  - itlb_multihit: Not affected
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - mmio_stale_data: Not affected
  - retbleed: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl
  - spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  - spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected
  - srbds: Not affected
  - tsx_async_abort: Not affected
vulkan-benchmarks ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m ncnn: CPU - vision_transformer ncnn: CPU - FastestDet ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - FastestDet ncnn: CPU-v3-v3-v3 - mobilenet ncnn: CPU-v3-v3-v3-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3-v3-v3-v3 - mobilenet-v3 ncnn: CPU-v3-v3-v3 - shufflenet-v2 ncnn: CPU-v3-v3-v3 - mnasnet ncnn: CPU-v3-v3-v3 - efficientnet-b0 ncnn: CPU-v3-v3-v3 - blazeface ncnn: CPU-v3-v3-v3 - googlenet ncnn: CPU-v3-v3-v3 - vgg16 ncnn: CPU-v3-v3-v3 - resnet18 ncnn: CPU-v3-v3-v3 - alexnet ncnn: CPU-v3-v3-v3 - resnet50 ncnn: CPU-v3-v3-v3 - yolov4-tiny ncnn: CPU-v3-v3-v3 - squeezenet_ssd ncnn: CPU-v3-v3-v3 - regnety_400m ncnn: CPU-v3-v3-v3 - vision_transformer ncnn: CPU-v3-v3-v3 - FastestDet ncnn: Vulkan GPU-v3-v3-v3 - mobilenet ncnn: Vulkan GPU-v3-v3-v3-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3-v3-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v3-v3-v3 - shufflenet-v2 ncnn: Vulkan GPU-v3-v3-v3 - mnasnet ncnn: Vulkan GPU-v3-v3-v3 - efficientnet-b0 ncnn: Vulkan GPU-v3-v3-v3 - blazeface ncnn: Vulkan GPU-v3-v3-v3 - googlenet ncnn: Vulkan GPU-v3-v3-v3 - vgg16 ncnn: Vulkan GPU-v3-v3-v3 - resnet18 ncnn: Vulkan GPU-v3-v3-v3 - alexnet ncnn: Vulkan GPU-v3-v3-v3 - resnet50 
ncnn: Vulkan GPU-v3-v3-v3 - yolov4-tiny ncnn: Vulkan GPU-v3-v3-v3 - squeezenet_ssd ncnn: Vulkan GPU-v3-v3-v3 - regnety_400m ncnn: Vulkan GPU-v3-v3-v3 - vision_transformer ncnn: Vulkan GPU-v3-v3-v3 - FastestDet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - mobilenet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - shufflenet-v2 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - mnasnet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - efficientnet-b0 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - blazeface ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - googlenet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - vgg16 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - resnet18 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - alexnet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - resnet50 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - yolov4-tiny ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - squeezenet_ssd ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - regnety_400m ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - vision_transformer ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - FastestDet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - shufflenet-v2 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mnasnet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - efficientnet-b0 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - blazeface ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - googlenet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vgg16 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet18 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - alexnet ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet50 ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - yolov4-tiny ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - squeezenet_ssd ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - regnety_400m ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vision_transformer ncnn: Vulkan 
GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - FastestDet ncnn: CPU-v3-v3-v3-v3-v3-v3 - mobilenet ncnn: CPU-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3 ncnn: CPU-v3-v3-v3-v3-v3-v3 - shufflenet-v2 ncnn: CPU-v3-v3-v3-v3-v3-v3 - mnasnet ncnn: CPU-v3-v3-v3-v3-v3-v3 - efficientnet-b0 ncnn: CPU-v3-v3-v3-v3-v3-v3 - blazeface ncnn: CPU-v3-v3-v3-v3-v3-v3 - googlenet ncnn: CPU-v3-v3-v3-v3-v3-v3 - vgg16 ncnn: CPU-v3-v3-v3-v3-v3-v3 - resnet18 ncnn: CPU-v3-v3-v3-v3-v3-v3 - alexnet ncnn: CPU-v3-v3-v3-v3-v3-v3 - resnet50 ncnn: CPU-v3-v3-v3-v3-v3-v3 - yolov4-tiny ncnn: CPU-v3-v3-v3-v3-v3-v3 - squeezenet_ssd ncnn: CPU-v3-v3-v3-v3-v3-v3 - regnety_400m ncnn: CPU-v3-v3-v3-v3-v3-v3 - vision_transformer ncnn: CPU-v3-v3-v3-v3-v3-v3 - FastestDet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - shufflenet-v2 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mnasnet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - efficientnet-b0 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - blazeface ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - googlenet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vgg16 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet18 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - alexnet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet50 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - yolov4-tiny ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - squeezenet_ssd ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - regnety_400m ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vision_transformer ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - FastestDet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - shufflenet-v2 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mnasnet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - 
efficientnet-b0 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - blazeface ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - googlenet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vgg16 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet18 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - alexnet ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet50 ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - yolov4-tiny ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - squeezenet_ssd ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - regnety_400m ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vision_transformer ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - FastestDet vkfft: FFT + iFFT R2C / C2R vkfft: FFT + iFFT C2C 1D batched in half precision vkfft: FFT + iFFT C2C Bluestein in single precision vkfft: FFT + iFFT C2C 1D batched in double precision vkfft: FFT + iFFT C2C 1D batched in single precision vkfft: FFT + iFFT C2C multidimensional in single precision vkfft: FFT + iFFT C2C Bluestein benchmark in double precision vkfft: FFT + iFFT C2C 1D batched in single precision, no reshuffling vkpeak: fp32-scalar vkpeak: fp32-vec4 vkpeak: fp16-scalar vkpeak: fp16-vec4 vkpeak: fp64-scalar vkpeak: fp64-vec4 vkpeak: int32-scalar vkpeak: int32-vec4 vkpeak: int16-scalar vkpeak: int16-vec4 vkresample: 2x - Single vkresample: 2x - Double a b c d e f g h i 4080 4080 rep 4080 xxx 4080 zzz 3090 3090 rep 3070 RTX 3070 Ti 4090 4090 rep nv 4090 8.05 3.16 3.34 2.97 3.90 1.38 7.94 23.75 5.29 4.41 10.20 12.90 7.07 8.18 32.49 3.62 3.18 8.05 3.17 3.17 3.35 2.98 3.86 1.38 7.90 23.51 5.28 4.31 10.01 12.84 7.09 8.16 31.88 4.1 42105 91597 11340 20816 47887 33001 4717 50504 13190.09 12730.08 13154.15 23232.42 841.40 841.80 2272.62 2658.73 13102.75 23123.77 11.686 7.97 3.16 3.34 2.97 3.85 1.38 7.82 23.49 5.2 4.32 10.01 12.74 7.06 8.18 31.95 4.05 8.04 3.14 3.33 2.95 3.82 1.37 7.85 23.56 5.23 4.33 10 12.87 7.07 8.21 31.85 4.07 8.01 3.15 3.16 3.33 2.97 3.82 1.37 7.84 23.5 5.21 4.29 10.01 12.98 7.03 8.05 31.65 
4.07 8 3.15 3.16 3.33 2.96 3.85 1.37 7.97 23.42 5.42 4.42 9.87 12.77 7.14 8.27 31.71 4.06 42163 91812 11273 20822 47948 32751 4695 50643 12807.06 12808.59 13145.19 23390.44 839.2 836.55 2269.25 2640.08 13070.81 23396.59 11.69 8.02 3.18 3.35 2.99 3.88 1.39 7.93 23.45 5.24 4.31 10.11 12.81 7.1 8.27 31.77 4.11 3.17 8.03 3.13 3.2 3.32 2.96 3.83 1.37 7.8 23.54 5.21 4.33 10 12.81 7.06 8 31.79 4.09 8 3.14 3.16 3.34 2.97 3.89 1.38 7.88 23.99 5.26 4.28 10.33 12.89 7.04 8.14 31.78 4.08 7.95 3.15 3.17 3.33 2.96 3.82 1.36 7.83 23.54 5.23 4.3 10.03 12.86 7.07 7.98 31.66 3.69 43021 91744 11311 20847 47971 32812 4670 50596 12860.56 12822.01 13136.79 23387.26 839.01 836.16 2269.06 2638.69 13063.86 23385.44 11.688 8.10 3.17 3.35 2.98 3.85 1.38 7.85 23.51 5.23 4.30 10.00 12.95 7.09 8.23 32.43 4.08 3.17 8.02 3.16 3.17 3.35 2.97 3.87 1.38 7.85 23.56 5.23 4.31 10.10 12.85 7.08 8.17 32.12 4.11 35399 85181 10719 12143 42645 36328 2346 43365 8531.96 11251.17 8412.33 16864.47 267.43 267.74 8520.02 8465.82 5676.02 7352.85 32.855 500.014 8.04 3.14 3.18 3.33 2.96 3.84 1.38 7.85 23.60 5.22 4.31 10.10 12.87 7.05 8.10 31.93 4.08 35304 85191 10560 12168 42651 37090 2343 43365 8515.58 11231.72 8397.80 16865.29 267.41 267.25 8505.20 8465.71 5675.99 7336.25 32.850 500.016 8.45 3.15 3.55 2.97 3.87 1.37 7.92 24.19 5.69 4.36 11.05 13.32 7.23 8.08 33.56 4.22 8.27 3.13 3.14 3.4 2.97 3.86 1.38 8.15 24.55 5.48 4.64 10.26 13.17 7.08 8.34 32.92 4.24 8.56 3.16 3.15 3.4 3.12 4.04 1.43 8.07 24.45 6.13 4.83 11.05 13.07 6.97 8.34 33.47 3.85 8.65 3.16 3.15 3.33 2.96 3.85 1.37 7.94 24.12 5.3 4.35 10.25 14.34 7.09 8.5 33.36 4.2 26593 104146 7571 10561 56476 26238 1814 57110 6837.94 9006.57 6812.52 13440.97 214.17 214.23 6827.92 6800.17 4480.59 5959.75 26.738 500.01 22.74 3.16 3.59 3 3.91 1.38 7.98 23.78 5.55 4.32 10.72 17.23 7.13 8.3 32.73 2.57 3.14 8.17 3.18 3.35 2.98 3.86 1.41 8.96 24.2 6.22 4.87 10.34 13.64 7.1 8.36 32.42 4.07 8.98 3.15 3.16 3.38 3.05 4.14 1.37 9.15 24.92 5.48 4.71 11.25 13.08 7.26 8.07 33.39 
3.97 8.2 3.17 3.15 3.35 2.98 4.63 1.38 8.35 24.71 5.5 4.86 10.43 13.35 7.31 7.99 32.68 3.97 8.5 3.17 3.16 3.34 2.97 3.84 1.38 7.96 24.04 5.28 4.35 10.33 13.14 7.14 8.38 33.32 3.92 26638 104171 7574 10548 56455 26541 1818 57094 6812.99 9002.59 6811.35 13438.47 213.96 213.95 6824.29 6794.92 4479.22 5956.38 26.769 500.011 26524 104298 7622 10572 56431 6810.73 9036.17 6838.32 13490.24 213.37 210.96 6800.6 6772.98 4495.98 5978.38 8.37 3.52 5.03 2.74 4.05 1.4 10.3 30.96 5.6 5.3 12.09 14.65 8.96 9.94 37.8 2.66 3.26 10.4 3.29 4.87 3.49 3.2 5.88 1.4 8.75 27.83 5.82 6.53 12.96 15.16 7.46 9.88 36.42 5.14 10.08 3.3 3.29 3.52 3.39 4.68 1.28 10.17 29.12 5.86 5.01 14.05 15.11 8.16 8.21 36.55 5.69 9.05 3.28 3.26 3.36 2.99 4.21 1.25 10.19 29.07 5.85 4.99 11.15 15.43 8.33 7.99 38.33 4.43 10.02 3.29 3.26 3.43 3.07 4.19 1.41 10.47 27.43 5.88 5.1 13.1 13.77 7.21 8.46 38.01 3.83 33727 132270 10061 14780 69738 34686 2417 71163 20.93 500.006 8.73 3.28 3.41 3.05 3.99 1.4 8.42 25.37 5.69 4.75 11.16 13.85 7.73 8.24 35.07 4.42 3.24 8.44 3.26 3.43 3.07 4.01 1.41 8.42 25 5.67 4.62 10.81 13.79 7.58 8.39 35.56 4.2 8.43 3.31 3.28 3.46 3.08 4.02 1.44 8.45 25.1 5.7 4.66 11.11 13.81 7.64 8.33 34.2 4.2 8.43 3.29 3.27 3.44 3.09 4.05 1.42 8.49 25.04 5.65 4.69 10.95 13.79 7.66 8.45 34.13 4.2 8.84 3.28 3.46 3.06 4.06 1.43 8.4 25.48 5.61 4.61 11.4 13.86 7.66 8.61 34.91 4.28 8.43 3.29 3.26 3.48 3.1 4.04 1.44 8.79 25.67 5.92 4.98 11.48 13.93 7.71 8.67 35.6 4.19 66473 211076 17121 34974 104556 65869 5579 106210 13.136 288.201 8.41 3.28 3.43 3.06 4.09 1.42 8.52 26.11 5.68 4.67 11.76 14.03 7.86 8.67 35.28 4.34 3.26 8.57 3.29 3.44 3.08 4.05 1.41 8.4 25.04 5.61 4.65 10.84 13.67 7.64 8.56 35.07 4.21 8.48 3.27 3.43 3.09 4.02 1.42 8.52 24.91 5.63 4.65 10.79 13.55 7.59 8.57 34.27 4.18 8.46 3.3 3.27 3.44 3.07 4.04 1.45 8.58 25.04 5.69 4.69 10.84 13.68 7.63 8.44 34.29 4.09 8.4 3.27 3.24 3.39 3.03 3.98 1.41 8.49 25.05 5.61 4.72 10.8 13.55 7.55 8.24 34.1 4.14 8.38 3.27 3.31 3.44 3.06 4.01 1.42 8.42 25.56 5.67 4.64 11.07 
13.73 7.62 8.35 33.93 4.17 8.45 3.3 3.28 3.47 3.09 4.07 1.43 8.52 25.01 5.64 4.68 10.86 13.71 7.67 8.72 34.22 4.2 68279 211058 17287 35038 104491 70068 5583 106205 13.136 288.166 8.37 3.28 3.45 3.06 4.04 1.42 8.38 25.03 5.56 4.68 10.82 13.6 7.62 8.45 34.27 4.17 3.27 8.37 3.26 3.5 3.07 4.04 1.42 8.42 25.01 5.62 4.68 10.94 13.65 7.62 8.56 34.19 4.2 8.44 3.3 3.31 3.47 3.07 4.06 1.42 8.5 25 5.66 4.65 10.91 13.69 7.64 8.75 34.37 4.19 8.88 3.4 3.33 3.51 3.13 4.22 1.42 8.99 26.08 5.89 5.21 11.5 13.95 7.7 8.58 35.4 4.31 8.46 3.27 3.26 3.43 3.05 4.02 1.42 8.43 25.4 5.67 4.67 10.91 13.62 7.67 8.52 34.23 4.17 8.31 3.14 3.05 3.34 2.98 3.97 1.31 8.26 25.33 5.65 4.71 11.22 13.52 7.27 8.25 33.9 3.75 8.34 3.2 3.08 3.4 3 4.01 1.32 8.32 25.44 5.78 4.69 11.26 13.63 7.27 8.38 34.14 3.8 69068 210713 17343 35071 104528 67887 5587 106099 13.137 288.039 8.38 3.25 3.42 3.04 3.99 1.4 8.4 25.4 5.63 4.69 11.1 13.63 7.55 8.37 34.1 4.16 3.24 8.47 3.29 3.28 3.46 3.06 4.04 1.42 8.41 25.82 5.59 4.68 11.07 13.8 7.63 8.58 34.32 4.79 8.4 3.28 3.27 3.43 3.06 4.03 1.41 8.55 25.45 5.71 4.67 11.21 13.83 7.35 8.47 34.47 4.04 8.38 3.23 3.2 3.37 3.01 3.95 1.39 8.37 25.26 5.59 4.65 11.09 13.62 7.51 8.1 34.05 4.12 8.46 3.28 3.24 3.43 3.08 4.01 1.42 8.42 25.16 5.6 4.68 10.91 13.61 7.62 8.49 34.1 4.2 9.19 3.28 3.26 3.44 3.08 4.05 1.41 8.55 26.09 5.74 4.7 12.5 15.26 8.06 8.37 35.36 4.61 8.25 3.16 3.06 3.36 2.96 3.95 1.31 8.29 25.26 5.77 4.66 11.1 13.42 7.25 8.34 34.47 3.82 67689 210991 17185 35058 104543 70040 5584 105926 13.126 288.028 8.6 3.17 3.34 2.99 3.87 1.38 7.86 23.43 5.21 4.3 10.3 14.26 7.52 8.25 33.01 4.21 3.18 8.07 3.18 3.19 3.39 2.99 3.88 1.39 7.87 23.55 5.27 4.35 10.1 12.88 7.16 8.38 31.94 4.11 8.06 3.14 3.15 3.32 2.95 3.83 1.36 7.83 23.5 5.19 4.31 10.05 12.87 7.04 7.95 31.89 4.04 8 3.16 3.34 2.97 3.85 1.36 7.82 23.5 5.23 4.3 9.97 12.88 7.04 8.01 31.86 4.04 8.11 3.15 3.16 3.33 2.96 3.83 1.36 7.86 23.55 5.19 4.31 10.38 13.1 7.04 7.99 33.22 4.03 8.03 3.12 3.13 3.32 2.94 3.88 1.38 7.84 23.51 5.21 4.32 
10.07 12.97 7.05 8.33 31.94 3.83 8.07 3.16 3.36 2.98 3.88 1.39 7.9 23.58 5.2 4.33 10.03 12.82 7.12 8.22 32.1 4.1 8.01 3.15 3.16 3.36 2.97 3.86 1.39 7.83 23.5 5.2 4.3 10.03 12.86 7.05 8.2 32.16 4.08 55347 255207 14406 30945 141357 51005 4282 143969 21269.72 27797.8 20845.09 41149.1 653.13 653.15 20909.02 20820.09 13710.88 16886.66 10.399 371.699 8.01 3.17 3.35 2.97 3.86 1.38 7.9 23.43 5.29 4.31 9.95 12.77 7.12 8.24 31.8 4.08 3.19 8.03 3.17 3.16 3.36 2.97 3.86 1.37 7.82 23.48 5.2 4.31 10.06 12.83 7.06 8.02 31.91 4.08 8.03 3.17 3.15 3.33 2.96 3.85 1.37 7.85 23.54 5.2 4.3 10.07 12.82 7.09 8.09 31.93 4.11 8.01 3.17 3.15 3.36 2.97 3.85 1.38 7.85 23.43 5.24 4.3 10.01 12.84 7.08 8.25 32.11 4.1 8.04 3.19 3.37 2.99 3.87 1.39 7.89 23.52 5.22 4.31 10.04 12.86 7.09 8.34 32.09 4.08 8.05 3.17 3.18 3.34 2.97 3.85 1.37 7.86 23.38 5.2 4.3 9.98 12.9 7.08 8.07 31.97 4.07 8.06 3.16 3.17 3.33 2.97 3.85 1.38 7.91 23.4 5.3 4.31 10.06 12.81 7.09 8.03 31.85 4.07 8.05 3.17 3.36 2.98 3.85 1.38 7.82 23.47 5.2 4.3 10.04 12.89 7.07 8.19 32.13 4.1 8.03 3.15 3.19 3.32 2.96 3.84 1.38 7.86 23.72 5.27 4.31 10.27 12.92 7.07 8.06 31.94 4.07 54432 265171 14449 31122 141437 54814 4289 143956 20925.3 27807.58 20953.3 41188.02 653.63 20767.64 20517.68 13608.57 16881.47 10.428 371.422 17.81 9.67 6.82 6.88 9.23 2.98 18.25 56.64 14.03 9.62 21.5 27.66 13.2 18 75.34 8.65 7.52 21.11 7.81 6.6 7.07 5.09 8.99 3.98 19.49 48.29 12.68 10.88 23.48 28.59 15.82 16.22 70.76 8.41 17.82 9.19 5.38 8.13 6.87 9.01 3.03 17 49.75 11.14 11 24.07 29.34 17.75 19.66 81.77 9.18 16.34 7.24 8.06 4.89 6.02 7.81 3.18 20.72 55.42 13.38 9.86 23.11 29.49 16.15 17.23 73.51 8.63 18.39 8.35 6.56 8 4.59 8.41 1.77 18.6 55.48 12.14 10.08 23.59 29.8 15.4 17.88 71.08 6.93 17.09 5.46 5.99 5.59 6.06 9.81 2.99 16.97 49.7 11.3 11.89 23.44 28.73 18.83 17.61 70.29 6.71 16.52 7.22 6.43 7.81 6.07 9.19 2.53 19.2 50.32 12.64 10.59 22.19 28.41 14.27 18.25 65.41 7.12 18.54 5.49 5.97 6.3 8.15 9.53 3.57 18.66 51.28 13.34 10.69 23.54 26.33 15.46 18.24 69.48 4.48 
17.06 5.92 7.34 5.89 8.55 6.63 2.69 18.8 53.48 12.13 11.43 22.15 29.38 15.32 17.02 70.53 7.23 22.064 24.745 9.43 3.76 3.77 3.26 4.72 1.60 9.69 28.36 6.08 5.49 12.73 15.20 8.13 8.83 37.91 3.94 3.61 9.35 3.56 3.64 3.98 3.40 4.53 1.60 9.87 29.06 6.28 5.25 12.60 15.54 8.47 9.07 38.03 4.26 9.62 3.41 3.76 4.09 3.11 4.78 1.79 9.65 28.40 6.18 5.55 12.11 15.42 8.39 9.02 37.86 4.33 9.62 3.66 3.65 3.95 3.37 4.73 1.71 9.86 28.40 6.23 5.67 12.35 15.44 8.29 8.89 38.29 4.26 9.62 3.66 3.62 3.75 3.34 4.60 1.49 9.84 28.63 6.57 5.53 12.73 15.21 8.28 9.19 37.88 4.41 9.52 3.66 3.44 3.89 3.10 4.37 1.34 9.90 28.53 6.40 5.34 12.42 15.00 8.65 9.05 38.27 4.32 10.03 3.91 3.70 4.02 3.24 4.74 2.48 9.68 27.98 6.22 6.17 12.81 14.57 7.57 8.42 38.04 4.18 9.98 3.69 3.52 3.92 3.25 4.55 1.51 9.58 28.53 6.69 5.41 12.52 15.56 8.31 9.10 38.32 4.25 10.02 3.83 3.24 3.48 3.12 4.17 1.40 9.97 27.86 5.94 6.25 13.15 14.64 7.45 9.14 38.50 4.14 27.183 24.805 10.08 3.3 3.55 3.12 4.23 1.27 10.27 27.75 6 5.14 14.1 13.68 7.86 8.13 38.25 5.48 3.53 10.55 3.3 3.28 3.45 3.18 4.34 1.39 10.62 28.82 5.69 4.64 14.13 13.97 7.83 8.64 38.76 4.39 8.96 3.46 3.25 5.18 3.19 4.09 1.17 9.97 28.55 5.81 4.94 11.39 15.3 9.32 8.13 38.79 2.93 8.81 4.99 3.12 3.34 3 4.18 1.33 8.87 28.21 5.97 6.11 14.58 15.44 7.93 9.6 39.01 3.94 10.18 3.32 3.3 5.09 3.17 4.36 1.16 10.87 31.57 6.58 5.14 14.08 15.55 7.57 8.1 38.38 4.03 9.16 3.48 3.62 3.52 4.93 4.15 1.42 9.05 27.44 7.52 4.67 12.4 15.95 9.81 9.87 38.62 4.45 9.04 3.36 3.33 3.48 5.19 4.47 1.3 8.38 30.16 7.74 4.99 11.72 15.85 9.51 10.09 38.76 2.85 8.46 5.25 3.36 3.47 3.19 4.14 1.45 8.91 27.31 7.78 4.94 12.98 15.69 7.4 10.05 38.82 2.82 10.56 4.75 3.36 3.56 3.23 4.63 1.35 8.55 27.32 6.96 5.14 13 16.05 7.43 10.11 39.35 4.62 84351 290342 20373 55214 153896 81406 8039 152656 9.284 172.883 8.43 4.74 3.51 3.22 4.03 1.45 9.53 28.19 8.07 5.14 12.73 15.38 7.31 9.66 37.81 5.27 3.41 10.23 3.31 3.3 3.49 3.15 4.09 1.4 10.65 27.04 6.01 6.79 13.08 15.72 8.22 8.48 37.59 4.59 8.74 3.45 3.53 3.59 3.23 4.34 1.34 9.29 
27.25 7.75 5.27 13.82 15.34 9.3 17.15 39.12 4.16 9.54 3.31 4.9 3.4 3.1 6.28 1.41 8.9 27.59 5.81 6.58 13.57 16.39 7.81 10.34 38.73 4.16 8.37 3.34 3.33 5.27 3.13 4.04 1.46 10.38 29.12 6.05 5.33 12.17 15.45 9.16 8.64 38.17 3.96 9.02 3.36 3.34 3.48 4.99 4.41 1.41 10.39 29.17 5.9 5.25 11.51 15.4 9.46 10.69 39.03 4.11 10.61 3.44 3.3 3.42 5.11 4.35 1.42 8.97 30.74 8.14 5.45 12.47 13.88 9.38 10.23 38.79 3.12 8.83 3.6 3.44 5.18 3.28 4.44 1.38 10.18 29.35 5.84 5.16 11.24 16.6 9.34 8.45 38.69 3.91 8.22 3.38 3.35 5.23 3.12 4.1 1.42 10.47 29.85 5.87 5.34 10.96 15.41 9.44 8.7 38.65 4.59 81329 287651 20404 55383 153939 80999 8119 155936 8.962 173.043 8.15 3.6 3.45 4.77 4.37 1.33 8.93 29.54 7.82 5.18 11.41 15.4 9.11 10.17 38.9 4.51 3.47 8.45 3.43 3.36 3.51 4.7 4.1 1.4 8.7 29.29 7.44 5.2 12.45 15.26 9.11 9.81 39.04 3.93 8.91 3.39 3.35 3.46 4.61 4.04 1.16 9.02 27.04 7.61 4.67 13.68 15.62 9.37 9.55 38.99 4.06 10.54 4.45 2.61 3.17 2.54 5.26 1.07 10.01 27.77 5.84 6.11 13.13 16.61 7.72 7.73 38.58 5.92 9.41 5.1 3.26 3.51 3.16 4.12 1.18 8.61 27.89 8.16 5.14 13.63 17.3 9.21 10.09 39.18 2.81 8.93 3.42 3.17 3.5 3.07 4.1 1.26 8.35 28.14 7.38 4.69 13.13 15.55 7.02 10.03 38.46 2.64 12.12 3.29 4.97 3.32 3.12 5.94 1.42 10.75 27.61 6.07 6.62 13.29 17.67 7.48 8.25 38.95 3.93 10.15 3.27 4.81 3.37 3.1 5.88 2.91 8.85 29.4 5.97 6.54 13.46 15.67 7.72 8.37 38.58 4.01 10.64 3.29 4.96 3.43 3.1 5.82 1.4 10.14 27.25 5.58 6.32 13.25 16.3 8.26 8.34 37.13 5.86 84887 292768 20601 54950 152170 82875 8132 155148 8.967 172.887 OpenBenchmarking.org
NCNN 20230517
Target: CPU - Model: mobilenet
ms, Fewer Is Better

Configuration  Result  MIN     MAX
nv 4090        8.15    7.73    9.34
4090 rep       8.43    8.04    18.04
4090           10.08   8.1     118.32
RTX 3070 Ti    9.43    7.95    398.1
3070           17.81   8.05    159.41
3090 rep       8.01    7.96    9.85
3090           8.60    8.5     13.72
4080 zzz       8.38    7.94    10.16
4080 xxx       8.37    7.96    9.72
4080 rep       8.41    8.14    11.03
4080           8.73    8.15    10.96
i              8.37    8.15    9.75
g              22.74   8.24    1264.67
f              8.45    8.37    9.44
d              8.10    7.94    14.4
c              8.02    7.98    8.33
b              7.97    7.94    8.26
a              8.05    7.97    9.07

Standard errors as listed (configuration not stated): SE +/- 0.22, N = 15; SE +/- 0.06, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
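The "SE +/- x, N = n" entries in these results are read here as the standard error of the mean over N runs. That reading is an assumption, since the export does not define SE, and so is the use of the sample (n - 1) standard deviation. A minimal Python sketch of the arithmetic follows; the function name and the run times are hypothetical and are not taken from the result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run inference times in ms (illustrative only,
# NOT values from this benchmark export).
runs_ms = [8.0, 8.2, 8.4]

se = standard_error(runs_ms)
print(f"SE +/- {se:.2f}, N = {len(runs_ms)}")
```

A small SE next to a large MAX value suggests that an occasional slow iteration, and not run-to-run drift, is what stretches the range.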
NCNN 20230517
Target: CPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better

Configuration  Result  MIN     MAX
nv 4090        3.60    3.43    4.62
4090 rep       4.74    3.09    140.79
4090           3.30    3.11    4.81
RTX 3070 Ti    3.76    2.6     364.73
3070           9.67    3.19    225.84
3090 rep       3.17    3.11    4.94
3090           3.17    3.12    4.05
4080 zzz       3.25    3.09    4.51
4080 xxx       3.28    3.1     4.05
4080 rep       3.28    3.11    4
4080           3.28    3.11    3.88
i              3.52    3.29    19.18
g              3.16    3.11    3.83
f              3.15    3.1     3.65
d              3.17    3.1     8.86
c              3.18    3.13    3.84
b              3.16    3.11    3.61
a              3.16    3.1     3.8

Standard errors as listed (configuration not stated): SE +/- 0.17, N = 15; SE +/- 0.02, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517
Target: CPU - Model: shufflenet-v2
ms, Fewer Is Better

Configuration  Result  MIN     MAX
nv 4090        3.45    3.32    4.91
4090 rep       3.51    3.38    5.4
4090           3.55    3.39    5.48
RTX 3070 Ti    3.77    3.02    511.95
3070           6.82    3.16    64.72
3090 rep       3.35    3.31    3.68
3090           3.34    3.3     4.19
4080 zzz       3.42    3.28    4.19
4080 xxx       3.45    3.32    3.85
4080 rep       3.43    3.3     4.15
4080           3.41    3.28    4.87
i              5.03    3.07    228.55
g              3.59    3.3     25.28
f              3.55    3.27    22.86
d              3.35    3.3     3.82
c              3.35    3.31    3.8
b              3.34    3.31    3.77
a              3.34    3.3     3.85

Standard errors as listed (configuration not stated): SE +/- 0.19, N = 15; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517
Target: CPU - Model: mnasnet
ms, Fewer Is Better

Configuration  Result  MIN     MAX
nv 4090        4.77    3.07    97.57
4090 rep       3.22    3.11    3.71
4090           3.12    2.98    3.79
RTX 3070 Ti    3.26    2.46    277.54
3070           6.88    3.05    110.25
3090 rep       2.97    2.93    3.28
3090           2.99    2.95    3.88
4080 zzz       3.04    2.91    4.47
4080 xxx       3.06    2.94    4.45
4080 rep       3.06    2.94    4.51
4080           3.05    2.92    3.82
i              2.74    2.62    4.22
g              3.00    2.96    3.68
f              2.97    2.93    3.95
d              2.98    2.94    3.83
c              2.99    2.96    3.44
b              2.97    2.93    3.45
a              2.97    2.92    3.48

Standard errors as listed (configuration not stated): SE +/- 0.14, N = 15; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517
Target: CPU - Model: efficientnet-b0
ms, Fewer Is Better

Configuration  Result  MIN     MAX
nv 4090        4.37    4.15    5.96
4090 rep       4.03    3.86    4.82
4090           4.23    3.98    12.23
RTX 3070 Ti    4.72    3.37    486.93
3070           9.23    3.43    156.19
3090 rep       3.86    3.81    4.75
3090           3.87    3.83    4.69
4080 zzz       3.99    3.8     5.69
4080 xxx       4.04    3.83    5.71
4080 rep       4.09    3.86    5.59
4080           3.99    3.79    5.83
i              4.05    3.78    5.45
g              3.91    3.85    4.64
f              3.87    3.81    4.97
d              3.85    3.81    4.46
c              3.88    3.84    4.41
b              3.85    3.81    4.42
a              3.90    3.82    4.51

Standard errors as listed (configuration not stated): SE +/- 0.19, N = 15; SE +/- 0.00, N = 3; SE +/- 0.05, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517
Target: CPU - Model: blazeface
ms, Fewer Is Better

Configuration  Result  MIN     MAX
nv 4090        1.33    1.27    1.77
4090 rep       1.45    1.38    2.96
4090           1.27    1.21    1.95
RTX 3070 Ti    1.60    0.95    433.24
3070           2.98    1.29    144.96
3090 rep       1.38    1.36    1.71
3090           1.38    1.35    2.23
4080 zzz       1.40    1.34    2.1
4080 xxx       1.42    1.36    2.02
4080 rep       1.42    1.35    1.88
4080           1.40    1.34    2.15
i              1.40    1.34    2
g              1.38    1.36    1.62
f              1.37    1.34    2.11
d              1.38    1.35    2.05
c              1.39    1.36    1.53
b              1.38    1.35    1.67
a              1.38    1.35    2.06

Standard errors as listed (configuration not stated): SE +/- 0.14, N = 15; SE +/- 0.01, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517
Target: CPU - Model: googlenet
ms, Fewer Is Better

Configuration  Result  MIN     MAX
nv 4090        8.93    8.27    10.68
4090 rep       9.53    8.86    11.44
4090           10.27   7.95    115.68
RTX 3070 Ti    9.69    7.29    407.61
3070           18.25   7.5     267.89
3090 rep       7.90    7.79    8.74
3090           7.86    7.76    8.74
4080 zzz       8.40    7.72    10.5
4080 xxx       8.38    7.72    10.05
4080 rep       8.52    7.84    10.21
4080           8.42    7.75    9.96
i              10.30   8.19    349.57
g              7.98    7.86    8.78
f              7.92    7.8     8.96
d              7.85    7.71    8.83
c              7.93    7.82    8.91
b              7.82    7.73    8.65
a              7.94    7.71    8.73

Standard errors as listed (configuration not stated): SE +/- 0.22, N = 15; SE +/- 0.01, N = 3; SE +/- 0.11, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517
Target: CPU - Model: vgg16
ms, Fewer Is Better

Configuration  Result  MIN     MAX
nv 4090        29.54   24.77   364.86
4090 rep       28.19   24.69   205.72
4090           27.75   24.58   282.59
RTX 3070 Ti    28.36   24.13   449.57
3070           56.64   25.75   367.74
3090 rep       23.43   23.23   24.39
3090           23.43   23.2    24.1
4080 zzz       25.40   24.09   32.86
4080 xxx       25.03   23.85   28.9
4080 rep       26.11   24.54   30.29
4080           25.37   24.26   36.52
i              30.96   25.92   328.63
g              23.78   23.52   24.89
f              24.19   23.99   30.98
d              23.51   23.19   24.68
c              23.45   23.26   24.51
b              23.49   23.36   24.62
a              23.75   23.31   25.12

Standard errors as listed (configuration not stated): SE +/- 0.23, N = 15; SE +/- 0.05, N = 3; SE +/- 0.30, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, d, c, b, a
Results: 7.82, 8.07, 6.00, 6.08, 14.03, 5.29, 5.21, 5.63, 5.56, 5.68, 5.69, 5.60, 5.55, 5.69, 5.23, 5.24, 5.20, 5.29
MIN / MAX: 5.54 / 303.05; 5.86 / 121.03; 5.47 / 7.29; 4.97 / 245.95; 5 / 303.38; 5.18 / 6.19; 5.09 / 6.04; 5.08 / 7.55; 5.09 / 6.84; 5.17 / 7.45; 5.16 / 7.68; 5.13 / 6.83; 5.19 / 25.4; 5.22 / 92.59; 5.1 / 6.28; 5.15 / 6.09; 5.1 / 5.9; 5.09 / 6.29
SE: +/- 0.17, N = 15; +/- 0.02, N = 3; +/- 0.07, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, d, c, b, a
Results: 5.18, 5.14, 5.14, 5.49, 9.62, 4.31, 4.30, 4.69, 4.68, 4.67, 4.75, 5.30, 4.32, 4.36, 4.30, 4.31, 4.32, 4.41
MIN / MAX: 4.75 / 7.12; 4.76 / 6.26; 4.73 / 6.32; 4.26 / 363.39; 4.31 / 147.6; 4.26 / 5.18; 4.25 / 4.83; 4.29 / 5.78; 4.28 / 6.37; 4.27 / 5.88; 4.31 / 13.88; 4.92 / 7.18; 4.25 / 5.17; 4.29 / 5.7; 4.23 / 5.32; 4.26 / 4.98; 4.26 / 5.15; 4.24 / 5.16
SE: +/- 0.21, N = 14; +/- 0.01, N = 3; +/- 0.11, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: resnet50 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, d, c, b, a
Results: 11.41, 12.73, 14.10, 12.73, 21.50, 9.95, 10.30, 11.10, 10.82, 11.76, 11.16, 12.09, 10.72, 11.05, 10.00, 10.11, 10.01, 10.20
MIN / MAX: 10.57 / 12.22; 10.22 / 181.72; 10.27 / 287; 10.18 / 541.92; 10.24 / 116.85; 9.85 / 10.72; 9.82 / 17.56; 10.2 / 13.06; 9.9 / 12.26; 10.68 / 44.94; 10.29 / 15.03; 11.16 / 13.48; 10.1 / 108.3; 10.14 / 162.88; 9.86 / 11.02; 9.95 / 16.18; 9.85 / 11.06; 9.84 / 12.48
SE: +/- 0.26, N = 15; +/- 0.01, N = 3; +/- 0.23, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, d, c, b, a
Results: 15.40, 15.38, 13.68, 15.20, 27.66, 12.77, 14.26, 13.63, 13.60, 14.03, 13.85, 14.65, 17.23, 13.32, 12.95, 12.81, 12.74, 12.90
MIN / MAX: 12.35 / 321.43; 12.32 / 188.07; 12.83 / 14.63; 12.69 / 431.37; 12.74 / 294.9; 12.7 / 13.02; 14.17 / 14.53; 12.77 / 15.36; 12.8 / 16.23; 13.15 / 15.97; 12.84 / 16.75; 12.44 / 202.68; 12.99 / 196.66; 12.95 / 35.49; 12.75 / 18.88; 12.74 / 13.2; 12.66 / 13.28; 12.69 / 15.88
SE: +/- 0.18, N = 15; +/- 0.05, N = 3; +/- 0.11, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, d, c, b, a
Results: 9.11, 7.31, 7.86, 8.13, 13.20, 7.12, 7.52, 7.55, 7.62, 7.86, 7.73, 8.96, 7.13, 7.23, 7.09, 7.10, 7.06, 7.07
MIN / MAX: 6.77 / 101.58; 6.71 / 9.3; 7.25 / 8.98; 6.37 / 399.11; 6.9 / 68.61; 7.05 / 7.63; 7.45 / 7.74; 7 / 8.72; 7.01 / 8.84; 7.22 / 10.84; 7.13 / 9.7; 6.92 / 244.02; 7.04 / 8.43; 7.15 / 8.02; 6.99 / 9.39; 7.05 / 7.65; 7.01 / 7.55; 7.01 / 8.07
SE: +/- 0.24, N = 15; +/- 0.01, N = 3; +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, d, c, b, a
Results: 10.17, 9.66, 8.13, 8.83, 18.00, 8.24, 8.25, 8.37, 8.45, 8.67, 8.24, 9.94, 8.30, 8.08, 8.23, 8.27, 8.18, 8.18
MIN / MAX: 8.12 / 209.53; 7.78 / 95.3; 7.78 / 9.98; 7.65 / 351.08; 7.91 / 176.28; 8.17 / 8.84; 8.17 / 8.9; 8.05 / 10.19; 8.12 / 9.68; 8.22 / 15.29; 7.89 / 9.52; 7.43 / 166.02; 8.22 / 9.1; 7.98 / 10.87; 8.03 / 8.9; 8.22 / 9.18; 8.12 / 8.86; 8.07 / 9.68
SE: +/- 0.19, N = 15; +/- 0.06, N = 3; +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: vision_transformer (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, d, c, b, a
Results: 38.90, 37.81, 38.25, 37.91, 75.34, 31.80, 33.01, 34.10, 34.27, 35.28, 35.07, 37.80, 32.73, 33.56, 32.43, 31.77, 31.95, 32.49
MIN / MAX: 34.2 / 300.84; 32.66 / 453.44; 33.04 / 447.7; 32.08 / 541.11; 38.72 / 418.01; 31.66 / 32.23; 32.88 / 33.42; 32.65 / 37.64; 32.82 / 39.79; 33.9 / 38.67; 33.14 / 43.26; 33.74 / 321.51; 31.44 / 81.32; 32.98 / 51.93; 31.56 / 37.69; 31.61 / 35.68; 31.79 / 32.33; 31.67 / 40.11
SE: +/- 0.12, N = 15; +/- 0.39, N = 3; +/- 0.29, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: FastestDet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, d, c, b, a
Results: 4.51, 5.27, 5.48, 3.94, 8.65, 4.08, 4.21, 4.16, 4.17, 4.34, 4.42, 2.66, 2.57, 4.22, 4.08, 4.11, 4.05, 3.62
MIN / MAX: 4.34 / 5.96; 4.05 / 247.02; 2.67 / 259.34; 2.43 / 267.02; 3.94 / 185.21; 4.05 / 4.84; 4.19 / 4.41; 4 / 4.69; 4.05 / 4.74; 4.19 / 5.77; 4.25 / 6.71; 2.54 / 3.41; 2.53 / 3.21; 4.18 / 4.97; 4.02 / 4.28; 4.08 / 4.4; 4.02 / 4.35; 2.7 / 4.54
SE: +/- 0.23, N = 15; +/- 0.01, N = 3; +/- 0.45, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, d, c, a
Results: 3.47, 3.41, 3.53, 3.61, 7.52, 3.19, 3.18, 3.24, 3.27, 3.26, 3.24, 3.26, 3.14, 3.17, 3.17, 3.18
MIN / MAX: 3.32 / 4.91; 3.27 / 5.24; 3.39 / 4.31; 2.51 / 502.85; 2.94 / 215; 3.15 / 3.72; 3.14 / 4.14; 3.11 / 4.47; 3.13 / 3.85; 3.09 / 3.96; 3.09 / 4.73; 3.14 / 3.9; 3.1 / 3.81; 3.12 / 3.96; 3.15 / 3.74; 3.14 / 3.82
SE: +/- 0.21, N = 14; +/- 0.00, N = 3; +/- 0.00, N = 2
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 8.45, 10.23, 10.55, 9.35, 21.11, 8.03, 8.07, 8.47, 8.37, 8.57, 8.44, 10.40, 8.17, 8.27, 8.04, 8.02, 8.03, 8.04, 8.05
MIN / MAX: 8.03 / 12.61; 8.13 / 386.42; 8.22 / 303.1; 7.49 / 474.12; 7.98 / 322.43; 7.96 / 8.77; 7.99 / 8.8; 8.04 / 10.17; 7.97 / 16.09; 7.98 / 10; 7.98 / 10.55; 7.97 / 455.46; 8.08 / 9.37; 8.17 / 9.04; 7.95 / 9.09; 7.95 / 9.81; 7.98 / 8.84; 7.95 / 14.33; 7.95 / 8.89
SE: +/- 0.25, N = 15; +/- 0.02, N = 3; +/- 0.00, N = 3; +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 3.43, 3.31, 3.30, 3.56, 7.81, 3.17, 3.18, 3.29, 3.26, 3.29, 3.26, 3.29, 3.18, 3.13, 3.14, 3.16, 3.13, 3.14, 3.17
MIN / MAX: 3.25 / 4.81; 3.14 / 4.92; 3.12 / 4.82; 3.09 / 345.01; 3.07 / 154.75; 3.12 / 3.64; 3.14 / 3.63; 3.11 / 3.98; 3.1 / 3.87; 3.12 / 4.14; 3.1 / 4.12; 3.12 / 3.93; 3.13 / 3.9; 3.07 / 3.82; 3.08 / 4.06; 3.09 / 3.92; 3.08 / 3.85; 3.1 / 3.73; 3.09 / 3.78
SE: +/- 0.14, N = 15; +/- 0.01, N = 3; +/- 0.01, N = 3; +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, i, f, e, d, c, a
Results: 3.36, 3.30, 3.28, 3.64, 6.60, 3.16, 3.19, 3.28, 4.87, 3.14, 3.18, 3.17, 3.20, 3.17
MIN / MAX: 3.21 / 4.3; 3.15 / 3.92; 3.15 / 3.9; 2.87 / 429.02; 2.98 / 166.19; 3.11 / 3.62; 3.14 / 3.48; 3.13 / 4.65; 3.14 / 278.98; 3.09 / 3.54; 3.11 / 3.78; 3.1 / 3.83; 3.16 / 3.68; 3.11 / 3.73
SE: +/- 0.20, N = 14; +/- 0.02, N = 3; +/- 0.00, N = 2; +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 3.51, 3.49, 3.45, 3.98, 7.07, 3.36, 3.39, 3.46, 3.50, 3.44, 3.43, 3.49, 3.35, 3.40, 3.33, 3.35, 3.32, 3.33, 3.35
MIN / MAX: 3.37 / 4; 3.36 / 4.33; 3.32 / 3.99; 3.14 / 529.82; 3.25 / 243.32; 3.32 / 4.06; 3.35 / 3.69; 3.32 / 5.24; 3.37 / 4.85; 3.3 / 5.36; 3.3 / 4.22; 3.35 / 4.24; 3.3 / 4.02; 3.35 / 5.89; 3.28 / 4.14; 3.3 / 3.82; 3.29 / 4.19; 3.3 / 3.59; 3.29 / 3.85
SE: +/- 0.20, N = 15; +/- 0.01, N = 3; +/- 0.01, N = 3; +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 4.70, 3.15, 3.18, 3.40, 5.09, 2.97, 2.99, 3.06, 3.07, 3.08, 3.07, 3.20, 2.98, 2.97, 2.96, 2.97, 2.96, 2.95, 2.98
MIN / MAX: 3 / 188.08; 3 / 4.54; 3.05 / 4.64; 2.72 / 432.18; 2.86 / 53.75; 2.94 / 3.28; 2.96 / 3.14; 2.92 / 3.73; 2.95 / 4.19; 2.94 / 3.67; 2.93 / 4.63; 3.07 / 3.86; 2.94 / 3.65; 2.93 / 3.66; 2.91 / 5.9; 2.92 / 3.34; 2.93 / 3.41; 2.92 / 3.42; 2.92 / 4.03
SE: +/- 0.16, N = 15; +/- 0.01, N = 3; +/- 0.01, N = 3; +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 4.10, 4.09, 4.34, 4.53, 8.99, 3.86, 3.88, 4.04, 4.04, 4.05, 4.01, 5.88, 3.86, 3.86, 3.84, 3.87, 3.83, 3.82, 3.86
MIN / MAX: 3.86 / 5.46; 3.87 / 5.46; 4.14 / 5.84; 3.75 / 396.62; 3.71 / 129.99; 3.82 / 4.34; 3.84 / 4.39; 3.8 / 5.31; 3.82 / 5.33; 3.83 / 6.11; 3.78 / 5.34; 4.04 / 364.21; 3.82 / 4.22; 3.78 / 10.45; 3.79 / 4.76; 3.77 / 9.91; 3.79 / 4.61; 3.78 / 4.39; 3.8 / 4.6
SE: +/- 0.18, N = 15; +/- 0.01, N = 3; +/- 0.02, N = 3; +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 1.40, 1.40, 1.39, 1.60, 3.98, 1.37, 1.39, 1.42, 1.42, 1.41, 1.41, 1.40, 1.41, 1.38, 1.38, 1.38, 1.37, 1.37, 1.38
MIN / MAX: 1.34 / 1.87; 1.33 / 1.93; 1.33 / 1.94; 1.11 / 436.01; 1.31 / 228.4; 1.36 / 1.46; 1.37 / 1.82; 1.36 / 2.01; 1.36 / 1.93; 1.35 / 1.9; 1.35 / 2.01; 1.33 / 2; 1.38 / 2.09; 1.35 / 2.08; 1.34 / 1.88; 1.34 / 2.25; 1.35 / 1.82; 1.35 / 1.75; 1.34 / 1.85
SE: +/- 0.16, N = 15; +/- 0.01, N = 3; +/- 0.01, N = 3; +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 8.70, 10.65, 10.62, 9.87, 19.49, 7.82, 7.87, 8.41, 8.42, 8.40, 8.42, 8.75, 8.96, 8.15, 7.85, 7.85, 7.80, 7.85, 7.90
MIN / MAX: 7.96 / 10.01; 8.29 / 236.11; 7.83 / 323.31; 7.33 / 399.24; 7.4 / 200.01; 7.69 / 8.61; 7.76 / 10.36; 7.72 / 9.9; 7.73 / 10.06; 7.77 / 9.78; 7.79 / 10.01; 8.08 / 16.01; 8.82 / 9.87; 8.02 / 9.02; 7.71 / 8.76; 7.71 / 8.85; 7.72 / 8.74; 7.76 / 8.76; 7.74 / 9.54
SE: +/- 0.22, N = 15; +/- 0.01, N = 3; +/- 0.04, N = 3; +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 29.29, 27.04, 28.82, 29.06, 48.29, 23.48, 23.55, 25.82, 25.01, 25.04, 25.00, 27.83, 24.20, 24.55, 23.60, 23.56, 23.54, 23.56, 23.51
MIN / MAX: 24.63 / 296.95; 24.22 / 296.13; 24.35 / 214.1; 24.11 / 541.55; 24.97 / 183.12; 23.24 / 29.21; 23.3 / 24.45; 24.35 / 62.94; 23.8 / 26.41; 24.06 / 27.35; 23.93 / 26.69; 24.98 / 262.23; 23.56 / 58.31; 23.62 / 97.69; 23.17 / 24.71; 23.24 / 24.78; 23.33 / 24.61; 23.34 / 24.72; 23.29 / 24.68
SE: +/- 0.24, N = 15; +/- 0.14, N = 3; +/- 0.10, N = 3; +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 7.44, 6.01, 5.69, 6.28, 12.68, 5.20, 5.27, 5.59, 5.62, 5.61, 5.67, 5.82, 6.22, 5.48, 5.22, 5.23, 5.21, 5.23, 5.28
MIN / MAX: 5.29 / 320.54; 5.44 / 8.18; 5.16 / 8.22; 4.94 / 298.06; 5.39 / 262.62; 5.09 / 5.98; 5.15 / 6.19; 5.06 / 6.95; 5.1 / 7.65; 5.11 / 7.44; 5.18 / 7.22; 5.28 / 7.02; 6.11 / 7; 5.33 / 6.16; 5.09 / 11.15; 5.08 / 6.28; 5.11 / 6.04; 5.13 / 6.18; 5.17 / 6.16
SE: +/- 0.20, N = 15; +/- 0.01, N = 3; +/- 0.03, N = 3; +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 5.20, 6.79, 4.64, 5.25, 10.88, 4.31, 4.35, 4.68, 4.68, 4.65, 4.62, 6.53, 4.87, 4.64, 4.31, 4.31, 4.33, 4.33, 4.31
MIN / MAX: 4.82 / 7.07; 4.23 / 262.43; 4.26 / 5.98; 4.23 / 375.94; 4.38 / 52.99; 4.26 / 5.26; 4.28 / 7.49; 4.26 / 6.23; 4.26 / 6.61; 4.26 / 6.53; 4.26 / 6.15; 4.57 / 242.16; 4.8 / 5.62; 4.57 / 5.49; 4.23 / 11.03; 4.25 / 5.28; 4.26 / 10.59; 4.28 / 5.16; 4.24 / 5.2
SE: +/- 0.18, N = 15; +/- 0.00, N = 3; +/- 0.01, N = 3; +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 12.45, 13.08, 14.13, 12.60, 23.48, 10.06, 10.10, 11.07, 10.94, 10.84, 10.81, 12.96, 10.34, 10.26, 10.10, 10.10, 10.00, 10.00, 10.01
MIN / MAX: 11.55 / 14.48; 10.11 / 444.45; 10.63 / 167.28; 9.82 / 418.4; 10.06 / 112.91; 9.95 / 11.04; 9.97 / 11.42; 10.1 / 13.23; 9.95 / 12.7; 9.93 / 12.81; 9.95 / 12.78; 10.23 / 424.46; 10.14 / 11.37; 10.09 / 11.22; 9.84 / 11.72; 9.86 / 11.08; 9.91 / 11.15; 9.92 / 12.35; 9.88 / 11.4
SE: +/- 0.26, N = 15; +/- 0.12, N = 3; +/- 0.09, N = 3; +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 15.26, 15.72, 13.97, 15.54, 28.59, 12.83, 12.88, 13.80, 13.65, 13.67, 13.79, 15.16, 13.64, 13.17, 12.87, 12.85, 12.81, 12.87, 12.84
MIN / MAX: 12.87 / 132.82; 13.2 / 301.81; 13.11 / 16.15; 12.15 / 492.01; 12.87 / 325.37; 12.74 / 13.59; 12.76 / 13.67; 12.76 / 15.76; 12.71 / 14.99; 12.71 / 14.88; 12.75 / 19.63; 12.86 / 248.64; 13.04 / 76.32; 13.03 / 14.1; 12.68 / 13.84; 12.72 / 13.93; 12.73 / 13.08; 12.76 / 13.73; 12.69 / 15.33
SE: +/- 0.18, N = 15; +/- 0.04, N = 3; +/- 0.01, N = 3; +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: squeezenet_ssd (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 9.11, 8.22, 7.83, 8.47, 15.82, 7.06, 7.16, 7.63, 7.62, 7.64, 7.58, 7.46, 7.10, 7.08, 7.05, 7.08, 7.06, 7.07, 7.09
MIN / MAX: 6.35 / 130.38; 7.56 / 9.8; 7.21 / 9.32; 6.29 / 533.92; 6.99 / 82.57; 7 / 7.82; 7.05 / 13.55; 7 / 9.17; 7.01 / 9.28; 7.05 / 9.12; 6.98 / 9.05; 6.9 / 8.9; 6.99 / 8.59; 6.98 / 8.07; 6.95 / 8; 6.97 / 7.99; 7 / 8.03; 7 / 8.07; 6.98 / 7.95
SE: +/- 0.26, N = 15; +/- 0.02, N = 3; +/- 0.02, N = 3; +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: regnety_400m (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 9.81, 8.48, 8.64, 9.07, 16.22, 8.02, 8.38, 8.58, 8.56, 8.56, 8.39, 9.88, 8.36, 8.34, 8.10, 8.17, 8.00, 8.21, 8.16
MIN / MAX: 7.82 / 241.19; 8.09 / 9.64; 8.28 / 10.42; 7.61 / 402.49; 7.74 / 314.84; 7.95 / 8.63; 8.31 / 8.86; 8.13 / 9.78; 8.15 / 9.8; 8.17 / 10.28; 8 / 10.29; 8.14 / 251.77; 8.27 / 9.08; 7.99 / 26.72; 7.98 / 8.84; 7.99 / 8.97; 7.94 / 8.88; 8.14 / 8.84; 7.9 / 8.99
SE: +/- 0.21, N = 15; +/- 0.02, N = 3; +/- 0.06, N = 3; +/- 0.11, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: vision_transformer (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 39.04, 37.59, 38.76, 38.03, 70.76, 31.91, 31.94, 34.32, 34.19, 35.07, 35.56, 36.42, 32.42, 32.92, 31.93, 32.12, 31.79, 31.85, 31.88
MIN / MAX: 33.83 / 463.88; 34.45 / 457.98; 33.12 / 539.58; 32.66 / 467.28; 38.81 / 250.01; 31.74 / 34.28; 31.73 / 34.21; 32.58 / 41.88; 32.72 / 36.79; 33.66 / 39.36; 33.19 / 40.43; 33.49 / 224.86; 31.89 / 65.47; 32.67 / 36.93; 31.62 / 35.85; 31.66 / 46.9; 31.63 / 35.57; 31.69 / 33.06; 31.55 / 37.47
SE: +/- 0.16, N = 15; +/- 0.07, N = 3; +/- 0.21, N = 3; +/- 0.09, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: FastestDet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, e, d, c, b, a
Results: 3.93, 4.59, 4.39, 4.26, 8.41, 4.08, 4.11, 4.79, 4.20, 4.21, 4.20, 5.14, 4.07, 4.24, 4.08, 4.11, 4.09, 4.07, 4.10
MIN / MAX: 3.8 / 5.4; 2.62 / 232.18; 4.25 / 5.86; 2.5 / 396.93; 2.89 / 487.78; 4.04 / 4.35; 4.07 / 4.29; 4.64 / 6.21; 4.03 / 6.49; 4.04 / 4.97; 4.02 / 4.97; 3.7 / 81.79; 4.02 / 4.82; 3.88 / 24.21; 4.03 / 5.29; 4.01 / 9.72; 4.05 / 5.5; 4.04 / 4.53; 4.06 / 4.81
SE: +/- 0.29, N = 15; +/- 0.00, N = 3; +/- 0.02, N = 3; +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: mobilenet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 8.91, 8.74, 8.96, 9.62, 17.82, 8.03, 8.06, 8.40, 8.44, 8.48, 8.43, 10.08, 8.98, 8.56, 8.00, 8.01
MIN / MAX: 8.33 / 10.07; 8.25 / 10.5; 8.39 / 10.77; 7.76 / 454.91; 7.57 / 211.62; 7.98 / 8.77; 7.94 / 13.92; 8.12 / 10.11; 7.97 / 10.71; 7.96 / 10.32; 7.99 / 10.44; 8.08 / 286.28; 8.1 / 124.43; 8.04 / 75.44; 7.96 / 8.63; 7.95 / 8.95
SE: +/- 0.23, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 3.39, 3.45, 3.46, 3.41, 9.19, 3.17, 3.14, 3.28, 3.30, 3.27, 3.31, 3.30, 3.15, 3.16, 3.14, 3.15
MIN / MAX: 3.21 / 4.24; 3.23 / 4.55; 3.29 / 4.38; 2.99 / 184.91; 3.04 / 232.12; 3.11 / 4.5; 3.08 / 3.7; 3.1 / 4; 3.12 / 4.03; 3.1 / 4.34; 3.12 / 4.76; 3.14 / 4.82; 3.1 / 3.63; 3.09 / 3.89; 3.1 / 3.67; 3.1 / 3.68
SE: +/- 0.10, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080, i, g, f, c, b
Results: 3.35, 3.53, 3.25, 3.76, 5.38, 3.15, 3.15, 3.27, 3.31, 3.28, 3.29, 3.16, 3.15, 3.16, 3.16
MIN / MAX: 3.21 / 5.23; 3.2 / 40.81; 3.11 / 4.74; 2.89 / 366.04; 2.74 / 121.29; 3.11 / 3.6; 3.11 / 3.71; 3.14 / 4.63; 3.16 / 5.3; 3.14 / 3.89; 3.15 / 4.32; 3.11 / 3.93; 3.11 / 3.48; 3.12 / 3.7; 3.12 / 3.69
SE: +/- 0.22, N = 14
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: shufflenet-v2 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 3.46, 3.59, 5.18, 4.09, 8.13, 3.33, 3.32, 3.43, 3.47, 3.43, 3.46, 3.52, 3.38, 3.40, 3.34, 3.33
MIN / MAX: 3.32 / 5.2; 3.46 / 4.09; 3.34 / 283.54; 3.12 / 435.28; 3.09 / 147.21; 3.3 / 3.67; 3.28 / 3.66; 3.31 / 3.94; 3.33 / 5.01; 3.3 / 4.03; 3.34 / 3.93; 3.39 / 4.05; 3.34 / 4.15; 3.35 / 4.17; 3.32 / 3.79; 3.3 / 3.79
SE: +/- 0.21, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: mnasnet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 4.61, 3.23, 3.19, 3.11, 6.87, 2.96, 2.95, 3.06, 3.07, 3.09, 3.08, 3.39, 3.05, 3.12, 2.97, 2.97
MIN / MAX: 2.78 / 222.99; 3.1 / 3.75; 3.06 / 3.75; 2.8 / 4.98; 2.93 / 216.41; 2.94 / 3.38; 2.92 / 3.29; 2.93 / 3.64; 2.94 / 3.6; 2.95 / 4.52; 2.94 / 4.52; 3.26 / 4.86; 3.01 / 3.88; 3.08 / 3.86; 2.94 / 3.43; 2.94 / 3.43
SE: +/- 0.04, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: efficientnet-b0 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 4.04, 4.34, 4.09, 4.78, 9.01, 3.85, 3.83, 4.03, 4.06, 4.02, 4.02, 4.68, 4.14, 4.04, 3.89, 3.82
MIN / MAX: 3.78 / 4.9; 4.16 / 5.28; 3.86 / 4.83; 3.82 / 411.19; 3.98 / 188.57; 3.81 / 4.53; 3.78 / 4.41; 3.82 / 5.43; 3.83 / 5.55; 3.82 / 5.39; 3.82 / 5.66; 4.48 / 6.02; 4.09 / 5.13; 3.99 / 4.82; 3.83 / 9.72; 3.79 / 4.34
SE: +/- 0.22, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: blazeface (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 1.16, 1.34, 1.17, 1.79, 3.03, 1.37, 1.36, 1.41, 1.42, 1.42, 1.44, 1.28, 1.37, 1.43, 1.38, 1.37
MIN / MAX: 1.11 / 1.67; 1.27 / 1.95; 1.11 / 1.9; 1.13 / 312.12; 1.28 / 96.94; 1.35 / 1.46; 1.34 / 1.46; 1.34 / 1.91; 1.36 / 1.92; 1.36 / 2.2; 1.37 / 3.45; 1.23 / 1.73; 1.34 / 2.07; 1.4 / 1.77; 1.36 / 1.58; 1.35 / 1.52
SE: +/- 0.19, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: googlenet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 9.02, 9.29, 9.97, 9.65, 17.00, 7.85, 7.83, 8.55, 8.50, 8.52, 8.45, 10.17, 9.15, 8.07, 7.88, 7.84
MIN / MAX: 8.41 / 11.08; 7.98 / 83.03; 7.67 / 258.52; 7.59 / 472.81; 7.35 / 277.79; 7.75 / 8.69; 7.71 / 8.8; 7.85 / 10.35; 7.79 / 9.94; 7.81 / 10.78; 7.79 / 10.32; 7.94 / 150.01; 7.84 / 198.46; 7.92 / 8.86; 7.79 / 8.78; 7.74 / 8.7
SE: +/- 0.24, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: vgg16 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 27.04, 27.25, 28.55, 28.40, 49.75, 23.54, 23.50, 25.45, 25.00, 24.91, 25.10, 29.12, 24.92, 24.45, 23.99, 23.50
MIN / MAX: 24.33 / 215.56; 24.14 / 379.93; 24.05 / 201.8; 24.12 / 509.06; 25.45 / 273.86; 23.33 / 24.41; 23.17 / 24.44; 24.22 / 27.73; 23.91 / 27.99; 23.8 / 26.87; 24.12 / 27.57; 26.33 / 310.23; 24.58 / 31.89; 24.26 / 25.26; 23.72 / 24.98; 23.3 / 24.41
SE: +/- 0.24, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: resnet18 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 7.61, 7.75, 5.81, 6.18, 11.14, 5.20, 5.19, 5.71, 5.66, 5.63, 5.70, 5.86, 5.48, 6.13, 5.26, 5.21
MIN / MAX: 5.23 / 90.18; 5.57 / 125.43; 5.27 / 7.16; 5.17 / 262.79; 4.79 / 65.12; 5.1 / 5.97; 5.09 / 6.13; 5.12 / 8.19; 5.14 / 7.49; 5.09 / 7.75; 5.15 / 7.9; 5.35 / 7.79; 5.37 / 6.51; 5.41 / 151.51; 5.18 / 6.27; 5.12 / 6.22
SE: +/- 0.19, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: alexnet (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 4.67, 5.27, 4.94, 5.55, 11.00, 4.30, 4.31, 4.67, 4.65, 4.65, 4.66, 5.01, 4.71, 4.83, 4.28, 4.29
MIN / MAX: 4.28 / 5.7; 4.78 / 7.7; 4.51 / 6.64; 4.2 / 281.58; 4.33 / 199.92; 4.24 / 4.99; 4.25 / 5.13; 4.28 / 6.29; 4.28 / 6.42; 4.26 / 6.13; 4.29 / 6.1; 4.6 / 6.68; 4.65 / 5.57; 4.76 / 5.74; 4.24 / 5.12; 4.24 / 5.64
SE: +/- 0.23, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: resnet50 (ms, Fewer Is Better)
Runs: nv 4090, 4090 rep, 4090, RTX 3070 Ti, 3070, 3090 rep, 3090, 4080 zzz, 4080 xxx, 4080 rep, 4080, i, g, f, c, b
Results: 13.68, 13.82, 11.39, 12.11, 24.07, 10.07, 10.05, 11.21, 10.91, 10.79, 11.11, 14.05, 11.25, 11.05, 10.33, 10.01
MIN / MAX: 10.25 / 566.67; 10.34 / 245.6; 10.48 / 13.29; 10.16 / 382.56; 10.02 / 218.35; 9.94 / 11.06; 9.85 / 12.64; 10.3 / 13.25; 9.91 / 13.1; 9.91 / 12.75; 10.19 / 13.03; 11.69 / 252.21; 10.55 / 118.12; 10.46 / 112.6; 10.16 / 13.97; 9.89 / 10.86
SE: +/- 0.22, N = 15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN Target: CPU-v3-v3-v3 - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3-v3 - Model: yolov4-tiny nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 7 14 21 28 35 SE +/- 0.14, N = 15 15.62 15.34 15.30 15.42 29.34 12.82 12.87 13.83 13.69 13.55 13.81 15.11 13.08 13.07 12.89 12.98 MIN: 12.99 / MAX: 184 MIN: 12.94 / MAX: 157.95 MIN: 12.87 / MAX: 144.73 MIN: 12.21 / MAX: 414.81 MIN: 12.17 / MAX: 245.34 MIN: 12.72 / MAX: 13.48 MIN: 12.75 / MAX: 13.58 MIN: 12.89 / MAX: 15.4 MIN: 12.73 / MAX: 15.68 MIN: 12.75 / MAX: 14.74 MIN: 12.84 / MAX: 15.1 MIN: 12.93 / MAX: 151.45 MIN: 12.96 / MAX: 13.83 MIN: 12.95 / MAX: 14.55 MIN: 12.84 / MAX: 13.19 MIN: 12.73 / MAX: 35.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3-v3 - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3-v3 - Model: squeezenet_ssd nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 4 8 12 16 20 SE +/- 0.24, N = 15 9.37 9.30 9.32 8.39 17.75 7.09 7.04 7.35 7.64 7.59 7.64 8.16 7.26 6.97 7.04 7.03 MIN: 7.07 / MAX: 281.92 MIN: 6.92 / MAX: 310.91 MIN: 7.1 / MAX: 172.56 MIN: 6.53 / MAX: 436.05 MIN: 6.47 / MAX: 272.11 MIN: 7.02 / MAX: 7.99 MIN: 6.96 / MAX: 7.74 MIN: 6.79 / MAX: 9.82 MIN: 7.03 / MAX: 9.19 MIN: 7.02 / MAX: 8.87 MIN: 7.05 / MAX: 9.9 MIN: 7.51 / MAX: 9.94 MIN: 7.14 / MAX: 8.59 MIN: 6.83 / MAX: 13.87 MIN: 6.96 / MAX: 7.83 MIN: 6.97 / MAX: 7.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3-v3 - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3-v3 - Model: regnety_400m nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 5 10 15 20 25 SE +/- 0.21, N = 15 9.55 17.15 8.13 9.02 19.66 8.09 7.95 8.47 8.75 8.57 8.33 8.21 8.07 8.34 8.14 8.05 MIN: 7.5 / MAX: 193.79 MIN: 8.02 / MAX: 773.45 MIN: 7.75 / MAX: 10.05 MIN: 7.69 / MAX: 501.76 MIN: 7.5 / MAX: 235.36 MIN: 7.99 / MAX: 14.25 MIN: 7.88 / MAX: 8.67 MIN: 8.13 / MAX: 10.27 MIN: 8.35 / MAX: 10.08 MIN: 8.21 / MAX: 10.39 MIN: 8.02 / MAX: 9.64 MIN: 7.9 / MAX: 9.99 MIN: 7.97 / MAX: 8.81 MIN: 8.26 / MAX: 9.3 MIN: 8.08 / MAX: 8.69 MIN: 8 / MAX: 8.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3-v3 - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3-v3 - Model: vision_transformer nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 20 40 60 80 100 SE +/- 0.18, N = 15 38.99 39.12 38.79 37.86 81.77 31.93 31.89 34.47 34.37 34.27 34.20 36.55 33.39 33.47 31.78 31.65 MIN: 34.17 / MAX: 473.06 MIN: 33.92 / MAX: 465.83 MIN: 33.95 / MAX: 457.41 MIN: 32.9 / MAX: 463.9 MIN: 44.4 / MAX: 460.28 MIN: 31.76 / MAX: 33.09 MIN: 31.66 / MAX: 39.97 MIN: 33.32 / MAX: 37.42 MIN: 33.01 / MAX: 38.7 MIN: 33.07 / MAX: 37.01 MIN: 32.92 / MAX: 36.19 MIN: 33 / MAX: 209.38 MIN: 32.73 / MAX: 88.83 MIN: 32.89 / MAX: 74.09 MIN: 31.64 / MAX: 34.51 MIN: 31.53 / MAX: 32.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3-v3 - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3-v3 - Model: FastestDet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 3 6 9 12 15 SE +/- 0.27, N = 14 4.06 4.16 2.93 4.33 9.18 4.11 4.04 4.04 4.19 4.18 4.20 5.69 3.97 3.85 4.08 4.07 MIN: 3.91 / MAX: 5.78 MIN: 4 / MAX: 5.58 MIN: 2.84 / MAX: 3.38 MIN: 2.59 / MAX: 433.58 MIN: 3.64 / MAX: 122.65 MIN: 4.07 / MAX: 4.21 MIN: 4.01 / MAX: 4.15 MIN: 3.89 / MAX: 5.01 MIN: 4.04 / MAX: 5.47 MIN: 4.03 / MAX: 5.07 MIN: 4.06 / MAX: 4.86 MIN: 3.69 / MAX: 261.71 MIN: 3.92 / MAX: 4.75 MIN: 3.8 / MAX: 4.65 MIN: 4.05 / MAX: 4.36 MIN: 4.03 / MAX: 5.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: mobilenet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 4 8 12 16 20 SE +/- 0.26, N = 15 10.54 9.54 8.81 9.62 16.34 8.01 8.00 8.38 8.88 8.46 8.43 9.05 8.20 8.65 7.95 8.00 MIN: 8.41 / MAX: 134.08 MIN: 8.94 / MAX: 10.54 MIN: 8.32 / MAX: 10.7 MIN: 7.76 / MAX: 502.83 MIN: 8.13 / MAX: 80.69 MIN: 7.95 / MAX: 8.35 MIN: 7.94 / MAX: 8.78 MIN: 7.95 / MAX: 10.41 MIN: 8.31 / MAX: 10.01 MIN: 7.99 / MAX: 10.62 MIN: 7.99 / MAX: 10.66 MIN: 8.48 / MAX: 11.28 MIN: 8.12 / MAX: 9.4 MIN: 8.55 / MAX: 9.53 MIN: 7.89 / MAX: 8.79 MIN: 7.95 / MAX: 8.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 2 4 6 8 10 SE +/- 0.18, N = 15 4.45 3.31 4.99 3.66 7.24 3.17 3.16 3.23 3.40 3.30 3.29 3.28 3.17 3.16 3.15 3.15 MIN: 2.65 / MAX: 216.76 MIN: 3.12 / MAX: 4.6 MIN: 3.1 / MAX: 201.8 MIN: 3.01 / MAX: 437.59 MIN: 3.04 / MAX: 261.68 MIN: 3.12 / MAX: 4.03 MIN: 3.11 / MAX: 3.51 MIN: 3.06 / MAX: 4.66 MIN: 3.23 / MAX: 4.8 MIN: 3.12 / MAX: 4.7 MIN: 3.12 / MAX: 4.64 MIN: 3.09 / MAX: 5.28 MIN: 3.13 / MAX: 3.58 MIN: 3.1 / MAX: 3.71 MIN: 3.11 / MAX: 3.85 MIN: 3.11 / MAX: 3.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 2 4 6 8 10 SE +/- 0.18, N = 15 2.61 4.90 3.12 3.65 8.06 3.15 3.20 3.33 3.27 3.27 3.26 3.15 3.15 3.17 3.16 MIN: 2.5 / MAX: 3.12 MIN: 3.17 / MAX: 120.84 MIN: 2.99 / MAX: 5.09 MIN: 2.87 / MAX: 347.75 MIN: 2.96 / MAX: 219.87 MIN: 3.11 / MAX: 3.83 MIN: 3.06 / MAX: 3.84 MIN: 3.19 / MAX: 4.2 MIN: 3.14 / MAX: 3.99 MIN: 3.12 / MAX: 5.24 MIN: 3.12 / MAX: 4.19 MIN: 3.1 / MAX: 3.87 MIN: 3.1 / MAX: 3.8 MIN: 3.11 / MAX: 8.89 MIN: 3.11 / MAX: 3.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: shufflenet-v2 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 1.1003 2.2006 3.3009 4.4012 5.5015 SE +/- 0.22, N = 15 3.17 3.40 3.34 3.95 4.89 3.36 3.34 3.37 3.51 3.44 3.44 3.36 3.35 3.33 3.33 3.33 MIN: 3.04 / MAX: 3.78 MIN: 3.26 / MAX: 4.84 MIN: 3.23 / MAX: 4.78 MIN: 3.19 / MAX: 410.41 MIN: 3.04 / MAX: 18.32 MIN: 3.32 / MAX: 3.7 MIN: 3.31 / MAX: 3.6 MIN: 3.25 / MAX: 3.95 MIN: 3.37 / MAX: 4.26 MIN: 3.32 / MAX: 4.16 MIN: 3.31 / MAX: 4.88 MIN: 3.25 / MAX: 4.02 MIN: 3.31 / MAX: 4.01 MIN: 3.29 / MAX: 3.99 MIN: 3.31 / MAX: 3.81 MIN: 3.3 / MAX: 3.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: mnasnet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 2 4 6 8 10 SE +/- 0.16, N = 14 2.54 3.10 3.00 3.37 6.02 2.97 2.97 3.01 3.13 3.07 3.09 2.99 2.98 2.96 2.96 2.96 MIN: 2.44 / MAX: 3.58 MIN: 2.97 / MAX: 3.72 MIN: 2.89 / MAX: 3.46 MIN: 2.86 / MAX: 278.87 MIN: 2.79 / MAX: 50.49 MIN: 2.94 / MAX: 3.39 MIN: 2.92 / MAX: 3.28 MIN: 2.91 / MAX: 3.6 MIN: 3 / MAX: 5.1 MIN: 2.94 / MAX: 3.72 MIN: 2.94 / MAX: 3.79 MIN: 2.86 / MAX: 4.38 MIN: 2.95 / MAX: 3.63 MIN: 2.92 / MAX: 3.81 MIN: 2.93 / MAX: 3.41 MIN: 2.93 / MAX: 3.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: efficientnet-b0 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 2 4 6 8 10 SE +/- 0.18, N = 15 5.26 6.28 4.18 4.73 7.81 3.85 3.85 3.95 4.22 4.04 4.05 4.21 4.63 3.85 3.82 3.85 MIN: 3.48 / MAX: 250.88 MIN: 3.91 / MAX: 337.73 MIN: 4 / MAX: 5.25 MIN: 3.79 / MAX: 418.72 MIN: 3.73 / MAX: 159.47 MIN: 3.81 / MAX: 4.62 MIN: 3.8 / MAX: 4.43 MIN: 3.76 / MAX: 4.84 MIN: 4 / MAX: 5.58 MIN: 3.81 / MAX: 5.08 MIN: 3.83 / MAX: 5 MIN: 3.96 / MAX: 4.94 MIN: 3.8 / MAX: 159.43 MIN: 3.8 / MAX: 4.6 MIN: 3.78 / MAX: 4.53 MIN: 3.82 / MAX: 4.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: blazeface nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 0.7155 1.431 2.1465 2.862 3.5775 SE +/- 0.18, N = 15 1.07 1.41 1.33 1.71 3.18 1.38 1.36 1.39 1.42 1.45 1.42 1.25 1.38 1.37 1.36 1.37 MIN: 1.02 / MAX: 1.52 MIN: 1.35 / MAX: 1.89 MIN: 1.27 / MAX: 1.98 MIN: 1.09 / MAX: 448.17 MIN: 1.31 / MAX: 185.03 MIN: 1.36 / MAX: 1.9 MIN: 1.34 / MAX: 1.61 MIN: 1.34 / MAX: 1.89 MIN: 1.36 / MAX: 1.92 MIN: 1.36 / MAX: 8.73 MIN: 1.35 / MAX: 2.15 MIN: 1.19 / MAX: 2.61 MIN: 1.36 / MAX: 1.76 MIN: 1.35 / MAX: 1.62 MIN: 1.34 / MAX: 1.44 MIN: 1.35 / MAX: 1.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: googlenet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 5 10 15 20 25 SE +/- 0.21, N = 15 10.01 8.90 8.87 9.86 20.72 7.85 7.82 8.37 8.99 8.58 8.49 10.19 8.35 7.94 7.83 7.97 MIN: 7.29 / MAX: 259.11 MIN: 8.22 / MAX: 11.07 MIN: 8.18 / MAX: 11.09 MIN: 7.54 / MAX: 396.21 MIN: 7.49 / MAX: 355.33 MIN: 7.75 / MAX: 8.64 MIN: 7.69 / MAX: 8.6 MIN: 7.76 / MAX: 10.31 MIN: 8.25 / MAX: 10.27 MIN: 7.79 / MAX: 10.48 MIN: 7.82 / MAX: 11.98 MIN: 7.73 / MAX: 212.36 MIN: 8.2 / MAX: 9.39 MIN: 7.8 / MAX: 8.78 MIN: 7.74 / MAX: 8.61 MIN: 7.89 / MAX: 8.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: vgg16 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 12 24 36 48 60 SE +/- 0.27, N = 15 27.77 27.59 28.21 28.40 55.42 23.43 23.50 25.26 26.08 25.04 25.04 29.07 24.71 24.12 23.54 23.42 MIN: 24.82 / MAX: 264.66 MIN: 24.34 / MAX: 396.09 MIN: 24.57 / MAX: 270.76 MIN: 23.98 / MAX: 456 MIN: 25.32 / MAX: 281.46 MIN: 23.26 / MAX: 24.3 MIN: 23.23 / MAX: 24.26 MIN: 24.14 / MAX: 27.73 MIN: 24.52 / MAX: 27.73 MIN: 23.81 / MAX: 27.15 MIN: 23.87 / MAX: 28.04 MIN: 24.45 / MAX: 263.33 MIN: 23.88 / MAX: 119.23 MIN: 23.57 / MAX: 46.44 MIN: 23.32 / MAX: 24.54 MIN: 23.27 / MAX: 24.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: resnet18 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 3 6 9 12 15 SE +/- 0.16, N = 15 5.84 5.81 5.97 6.23 13.38 5.24 5.23 5.59 5.89 5.69 5.65 5.85 5.50 5.30 5.23 5.42 MIN: 5.35 / MAX: 7.72 MIN: 5.3 / MAX: 6.82 MIN: 5.46 / MAX: 7.02 MIN: 4.99 / MAX: 309.18 MIN: 5.43 / MAX: 208.42 MIN: 5.14 / MAX: 5.99 MIN: 5.1 / MAX: 6.07 MIN: 5.09 / MAX: 7.7 MIN: 5.36 / MAX: 7.53 MIN: 5.11 / MAX: 6.94 MIN: 5.14 / MAX: 6.93 MIN: 5.3 / MAX: 8.27 MIN: 5.4 / MAX: 6.38 MIN: 5.17 / MAX: 5.93 MIN: 5.11 / MAX: 6.03 MIN: 5.36 / MAX: 6.27 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: alexnet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 3 6 9 12 15 SE +/- 0.23, N = 15 6.11 6.58 6.11 5.67 9.86 4.30 4.30 4.65 5.21 4.69 4.69 4.99 4.86 4.35 4.30 4.42 MIN: 4.83 / MAX: 124.76 MIN: 4.61 / MAX: 91.07 MIN: 4.73 / MAX: 81.72 MIN: 4.21 / MAX: 365.75 MIN: 4.25 / MAX: 157.02 MIN: 4.25 / MAX: 4.7 MIN: 4.25 / MAX: 5.08 MIN: 4.26 / MAX: 5.97 MIN: 4.79 / MAX: 6.66 MIN: 4.26 / MAX: 7.17 MIN: 4.26 / MAX: 6.15 MIN: 4.59 / MAX: 6.56 MIN: 4.8 / MAX: 6.37 MIN: 4.27 / MAX: 5.16 MIN: 4.26 / MAX: 5.16 MIN: 4.32 / MAX: 5.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: resnet50 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 6 12 18 24 30 SE +/- 0.27, N = 15 13.13 13.57 14.58 12.35 23.11 10.01 9.97 11.09 11.50 10.84 10.95 11.15 10.43 10.25 10.03 9.87 MIN: 10.56 / MAX: 323.44 MIN: 10.45 / MAX: 199.55 MIN: 10.67 / MAX: 324.82 MIN: 9.83 / MAX: 424.28 MIN: 10.22 / MAX: 140.41 MIN: 9.91 / MAX: 10.74 MIN: 9.86 / MAX: 10.84 MIN: 10.18 / MAX: 13.12 MIN: 10.5 / MAX: 13.47 MIN: 9.93 / MAX: 12.83 MIN: 9.91 / MAX: 17.11 MIN: 10.31 / MAX: 12.97 MIN: 10.19 / MAX: 11.32 MIN: 10.05 / MAX: 11.08 MIN: 9.93 / MAX: 10.96 MIN: 9.79 / MAX: 10.73 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: yolov4-tiny nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 7 14 21 28 35 SE +/- 0.23, N = 15 16.61 16.39 15.44 15.44 29.49 12.84 12.88 13.62 13.95 13.68 13.79 15.43 13.35 14.34 12.86 12.77 MIN: 12.32 / MAX: 375.99 MIN: 12.97 / MAX: 369.64 MIN: 12.92 / MAX: 211.43 MIN: 12.61 / MAX: 387.62 MIN: 13.03 / MAX: 182.99 MIN: 12.76 / MAX: 13.7 MIN: 12.75 / MAX: 13.79 MIN: 12.75 / MAX: 15.79 MIN: 13.03 / MAX: 15.9 MIN: 12.77 / MAX: 15.57 MIN: 12.79 / MAX: 15.92 MIN: 13.1 / MAX: 210.2 MIN: 12.87 / MAX: 58.52 MIN: 14.23 / MAX: 15.12 MIN: 12.76 / MAX: 13.98 MIN: 12.69 / MAX: 13.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: squeezenet_ssd nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 4 8 12 16 20 SE +/- 0.19, N = 15 7.72 7.81 7.93 8.29 16.15 7.08 7.04 7.51 7.70 7.63 7.66 8.33 7.31 7.09 7.07 7.14 MIN: 7.12 / MAX: 23.25 MIN: 7.24 / MAX: 9.04 MIN: 7.31 / MAX: 9.45 MIN: 6.37 / MAX: 448.22 MIN: 7.25 / MAX: 210.69 MIN: 7.01 / MAX: 7.93 MIN: 6.97 / MAX: 7.76 MIN: 6.94 / MAX: 9.51 MIN: 7.11 / MAX: 9.19 MIN: 7.02 / MAX: 9.71 MIN: 7.02 / MAX: 9.08 MIN: 6.32 / MAX: 222.03 MIN: 6.96 / MAX: 30.1 MIN: 6.98 / MAX: 8.01 MIN: 7.01 / MAX: 7.75 MIN: 7.06 / MAX: 7.95 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: regnety_400m nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 4 8 12 16 20 SE +/- 0.25, N = 14 7.73 10.34 9.60 8.89 17.23 8.25 8.01 8.10 8.58 8.44 8.45 7.99 7.99 8.50 7.98 8.27 MIN: 7.43 / MAX: 9.41 MIN: 8.21 / MAX: 214.16 MIN: 7.66 / MAX: 210.23 MIN: 7.74 / MAX: 476.28 MIN: 7.8 / MAX: 193.14 MIN: 8.12 / MAX: 14 MIN: 7.93 / MAX: 8.35 MIN: 7.77 / MAX: 15.42 MIN: 8.23 / MAX: 10.39 MIN: 8.04 / MAX: 10.17 MIN: 8.05 / MAX: 10.3 MIN: 7.62 / MAX: 9.27 MIN: 7.91 / MAX: 8.8 MIN: 8.04 / MAX: 30.12 MIN: 7.93 / MAX: 8.65 MIN: 8.22 / MAX: 9.01 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: vision_transformer nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 16 32 48 64 80 SE +/- 0.13, N = 15 38.58 38.73 39.01 38.29 73.51 32.11 31.86 34.05 35.40 34.29 34.13 38.33 32.68 33.36 31.66 31.71 MIN: 33.77 / MAX: 476.18 MIN: 33.81 / MAX: 362.17 MIN: 33.91 / MAX: 411.66 MIN: 32.31 / MAX: 557.38 MIN: 39.27 / MAX: 288.2 MIN: 31.94 / MAX: 33.01 MIN: 31.58 / MAX: 35.84 MIN: 32.83 / MAX: 38.57 MIN: 33.93 / MAX: 39.3 MIN: 33.11 / MAX: 40.12 MIN: 32.98 / MAX: 36.11 MIN: 34.14 / MAX: 246.43 MIN: 32.02 / MAX: 87.72 MIN: 32.83 / MAX: 76.21 MIN: 31.52 / MAX: 32.14 MIN: 31.56 / MAX: 33.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3 - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3 - Model: FastestDet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g f c b 2 4 6 8 10 SE +/- 0.15, N = 15 5.92 4.16 3.94 4.26 8.63 4.10 4.04 4.12 4.31 4.09 4.20 4.43 3.97 4.20 3.69 4.06 MIN: 4.25 / MAX: 103.26 MIN: 4.03 / MAX: 4.73 MIN: 3.8 / MAX: 5.41 MIN: 2.71 / MAX: 347.03 MIN: 4.27 / MAX: 144.3 MIN: 4.06 / MAX: 4.21 MIN: 4 / MAX: 4.15 MIN: 3.97 / MAX: 6.99 MIN: 4.14 / MAX: 6.11 MIN: 3.92 / MAX: 5.5 MIN: 4.04 / MAX: 5.82 MIN: 4.28 / MAX: 5.01 MIN: 3.93 / MAX: 4.73 MIN: 4.15 / MAX: 4.92 MIN: 3.66 / MAX: 3.92 MIN: 4.03 / MAX: 4.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: mobilenet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 5 10 15 20 25 SE +/- 0.27, N = 15 9.41 8.37 10.18 9.62 18.39 8.04 8.11 8.46 8.46 8.40 8.84 10.02 8.50 MIN: 8.98 / MAX: 11.38 MIN: 7.98 / MAX: 10.71 MIN: 8.18 / MAX: 235.56 MIN: 7.71 / MAX: 449.11 MIN: 7.92 / MAX: 173.39 MIN: 7.96 / MAX: 9.01 MIN: 8.02 / MAX: 14.2 MIN: 7.97 / MAX: 10.56 MIN: 7.95 / MAX: 10.34 MIN: 7.93 / MAX: 15.25 MIN: 8.31 / MAX: 10.98 MIN: 8.07 / MAX: 266.25 MIN: 8.42 / MAX: 9.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 2 4 6 8 10 SE +/- 0.15, N = 15 5.10 3.34 3.32 3.66 8.35 3.19 3.15 3.28 3.27 3.27 3.28 3.29 3.17 MIN: 3.14 / MAX: 138.88 MIN: 3.14 / MAX: 4.45 MIN: 3.12 / MAX: 4.24 MIN: 3.01 / MAX: 311.25 MIN: 3.08 / MAX: 103.38 MIN: 3.13 / MAX: 4 MIN: 3.1 / MAX: 3.68 MIN: 3.09 / MAX: 4.98 MIN: 3.11 / MAX: 4.73 MIN: 3.08 / MAX: 5.18 MIN: 3.11 / MAX: 4.16 MIN: 3.1 / MAX: 3.96 MIN: 3.1 / MAX: 5.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 4080 zzz 4080 xxx 4080 rep i g 2 4 6 8 10 SE +/- 0.18, N = 15 3.26 3.33 3.30 3.62 6.56 3.16 3.24 3.26 3.24 3.26 3.16 MIN: 3.13 / MAX: 3.96 MIN: 3.19 / MAX: 4.79 MIN: 3.14 / MAX: 4.82 MIN: 3 / MAX: 469.9 MIN: 3.07 / MAX: 110.87 MIN: 3.11 / MAX: 3.77 MIN: 3.1 / MAX: 3.88 MIN: 3.13 / MAX: 4.08 MIN: 3.11 / MAX: 4.37 MIN: 3.11 / MAX: 4.7 MIN: 3.12 / MAX: 3.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 2 4 6 8 10 SE +/- 0.16, N = 15 3.51 5.27 5.09 3.75 8.00 3.37 3.33 3.43 3.43 3.39 3.46 3.43 3.34 MIN: 3.38 / MAX: 4.05 MIN: 3.27 / MAX: 191.55 MIN: 3.33 / MAX: 161.5 MIN: 3.2 / MAX: 361.52 MIN: 3.16 / MAX: 190.15 MIN: 3.33 / MAX: 3.8 MIN: 3.29 / MAX: 3.67 MIN: 3.29 / MAX: 3.87 MIN: 3.31 / MAX: 3.95 MIN: 3.26 / MAX: 3.91 MIN: 3.3 / MAX: 5.74 MIN: 3.3 / MAX: 4.89 MIN: 3.31 / MAX: 4.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: mnasnet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 1.0328 2.0656 3.0984 4.1312 5.164 SE +/- 0.14, N = 15 3.16 3.13 3.17 3.34 4.59 2.99 2.96 3.08 3.05 3.03 3.06 3.07 2.97 MIN: 3.02 / MAX: 4.6 MIN: 3.01 / MAX: 3.62 MIN: 3.03 / MAX: 3.66 MIN: 2.68 / MAX: 393.6 MIN: 2.88 / MAX: 20.12 MIN: 2.96 / MAX: 3.32 MIN: 2.93 / MAX: 3.31 MIN: 2.93 / MAX: 4.42 MIN: 2.91 / MAX: 3.67 MIN: 2.91 / MAX: 4.45 MIN: 2.94 / MAX: 3.67 MIN: 2.93 / MAX: 3.84 MIN: 2.93 / MAX: 3.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 2 4 6 8 10 SE +/- 0.19, N = 15 4.12 4.04 4.36 4.60 8.41 3.87 3.83 4.01 4.02 3.98 4.06 4.19 3.84 MIN: 3.86 / MAX: 5.39 MIN: 3.85 / MAX: 4.9 MIN: 4.14 / MAX: 5.24 MIN: 3.79 / MAX: 336.2 MIN: 3.76 / MAX: 67.73 MIN: 3.81 / MAX: 4.62 MIN: 3.78 / MAX: 4.4 MIN: 3.79 / MAX: 5.39 MIN: 3.8 / MAX: 5.14 MIN: 3.77 / MAX: 5.44 MIN: 3.85 / MAX: 4.97 MIN: 4.01 / MAX: 5.09 MIN: 3.78 / MAX: 4.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: blazeface nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 0.3983 0.7966 1.1949 1.5932 1.9915 SE +/- 0.12, N = 14 1.18 1.46 1.16 1.49 1.77 1.39 1.36 1.42 1.42 1.41 1.43 1.41 1.38 MIN: 1.11 / MAX: 1.85 MIN: 1.39 / MAX: 2.91 MIN: 1.1 / MAX: 2 MIN: 1.05 / MAX: 379.08 MIN: 1.08 / MAX: 12.53 MIN: 1.37 / MAX: 1.52 MIN: 1.34 / MAX: 1.46 MIN: 1.34 / MAX: 2.84 MIN: 1.35 / MAX: 2 MIN: 1.34 / MAX: 2.1 MIN: 1.36 / MAX: 2.06 MIN: 1.35 / MAX: 2.02 MIN: 1.35 / MAX: 2.09 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: googlenet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 5 10 15 20 25 SE +/- 0.24, N = 15 8.61 10.38 10.87 9.84 18.60 7.89 7.86 8.42 8.43 8.49 8.40 10.47 7.96 MIN: 7.95 / MAX: 10.07 MIN: 7.96 / MAX: 255.68 MIN: 8.37 / MAX: 194.11 MIN: 7.3 / MAX: 438.04 MIN: 8.02 / MAX: 292.16 MIN: 7.79 / MAX: 8.84 MIN: 7.74 / MAX: 8.62 MIN: 7.78 / MAX: 10.7 MIN: 7.77 / MAX: 10.4 MIN: 7.74 / MAX: 10.76 MIN: 7.71 / MAX: 10.64 MIN: 8.21 / MAX: 350.07 MIN: 7.81 / MAX: 9.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: vgg16 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 12 24 36 48 60 SE +/- 0.30, N = 15 27.89 29.12 31.57 28.63 55.48 23.52 23.55 25.16 25.40 25.05 25.48 27.43 24.04 MIN: 24.5 / MAX: 463.23 MIN: 24.62 / MAX: 266.39 MIN: 26.09 / MAX: 318.58 MIN: 24.13 / MAX: 500.18 MIN: 25.94 / MAX: 298.67 MIN: 23.33 / MAX: 25.08 MIN: 23.31 / MAX: 24.48 MIN: 23.97 / MAX: 27.81 MIN: 24.05 / MAX: 27.09 MIN: 23.78 / MAX: 26.95 MIN: 23.88 / MAX: 51.68 MIN: 24.65 / MAX: 251.37 MIN: 23.48 / MAX: 73.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: resnet18 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 3 6 9 12 15 SE +/- 0.23, N = 15 8.16 6.05 6.58 6.57 12.14 5.22 5.19 5.60 5.67 5.61 5.61 5.88 5.28 MIN: 5.39 / MAX: 397.44 MIN: 5.53 / MAX: 7.66 MIN: 6.04 / MAX: 7.81 MIN: 4.91 / MAX: 391.33 MIN: 5.28 / MAX: 151.53 MIN: 5.13 / MAX: 6.1 MIN: 5.09 / MAX: 6 MIN: 5.09 / MAX: 7.51 MIN: 5.1 / MAX: 8.06 MIN: 5.07 / MAX: 7.08 MIN: 5.09 / MAX: 7.91 MIN: 5.36 / MAX: 8.2 MIN: 5.16 / MAX: 6.09 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: alexnet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 3 6 9 12 15 SE +/- 0.22, N = 15 5.14 5.33 5.14 5.53 10.08 4.31 4.31 4.68 4.67 4.72 4.61 5.10 4.35 MIN: 4.65 / MAX: 6.81 MIN: 4.83 / MAX: 6.6 MIN: 4.76 / MAX: 6.16 MIN: 4.22 / MAX: 362.62 MIN: 4.36 / MAX: 225.66 MIN: 4.26 / MAX: 5.07 MIN: 4.25 / MAX: 4.94 MIN: 4.26 / MAX: 6.8 MIN: 4.27 / MAX: 6.36 MIN: 4.25 / MAX: 7.3 MIN: 4.24 / MAX: 7.25 MIN: 4.75 / MAX: 6.12 MIN: 4.28 / MAX: 5.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: resnet50 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 6 12 18 24 30 SE +/- 0.23, N = 15 13.63 12.17 14.08 12.73 23.59 10.04 10.38 10.91 10.91 10.80 11.40 13.10 10.33 MIN: 10.52 / MAX: 488.94 MIN: 11.25 / MAX: 13.79 MIN: 10.29 / MAX: 247.29 MIN: 9.84 / MAX: 518.97 MIN: 9.96 / MAX: 177.63 MIN: 9.94 / MAX: 10.89 MIN: 9.88 / MAX: 18.75 MIN: 9.94 / MAX: 14.83 MIN: 9.91 / MAX: 13.07 MIN: 9.89 / MAX: 12.54 MIN: 10.5 / MAX: 13.51 MIN: 10.59 / MAX: 267.95 MIN: 10.2 / MAX: 11.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 7 14 21 28 35 SE +/- 0.28, N = 15 17.30 15.45 15.55 15.21 29.80 12.86 13.10 13.61 13.62 13.55 13.86 13.77 13.14 MIN: 14.66 / MAX: 441.3 MIN: 12.65 / MAX: 445.76 MIN: 13.11 / MAX: 307.2 MIN: 12.34 / MAX: 380.51 MIN: 12.85 / MAX: 216.34 MIN: 12.76 / MAX: 13.73 MIN: 13.01 / MAX: 14.17 MIN: 12.67 / MAX: 19.72 MIN: 12.71 / MAX: 15.65 MIN: 12.72 / MAX: 15.51 MIN: 13.04 / MAX: 15.04 MIN: 12.96 / MAX: 14.66 MIN: 13 / MAX: 14.02 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 4 8 12 16 20 SE +/- 0.25, N = 14 9.21 9.16 7.57 8.28 15.40 7.09 7.04 7.62 7.67 7.55 7.66 7.21 7.14 MIN: 6.83 / MAX: 203.62 MIN: 6.73 / MAX: 423.75 MIN: 7.02 / MAX: 9 MIN: 6.38 / MAX: 381.81 MIN: 6.64 / MAX: 132.68 MIN: 7.01 / MAX: 7.97 MIN: 6.96 / MAX: 7.7 MIN: 7 / MAX: 9.93 MIN: 7.04 / MAX: 9.1 MIN: 6.99 / MAX: 9.08 MIN: 7.09 / MAX: 8.97 MIN: 6.73 / MAX: 8.82 MIN: 7.03 / MAX: 7.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: regnety_400m nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 4 8 12 16 20 SE +/- 0.21, N = 15 10.09 8.64 8.10 9.19 17.88 8.34 7.99 8.49 8.52 8.24 8.61 8.46 8.38 MIN: 7.84 / MAX: 366.66 MIN: 8.3 / MAX: 10.51 MIN: 7.65 / MAX: 10.05 MIN: 7.44 / MAX: 524.66 MIN: 7.38 / MAX: 190.77 MIN: 8.26 / MAX: 9.09 MIN: 7.92 / MAX: 8.78 MIN: 8.08 / MAX: 9.72 MIN: 8.13 / MAX: 9.73 MIN: 7.91 / MAX: 9.53 MIN: 8.21 / MAX: 10.07 MIN: 8.08 / MAX: 10.33 MIN: 8.05 / MAX: 27.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: vision_transformer nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 16 32 48 64 80 SE +/- 0.20, N = 15 39.18 38.17 38.38 37.88 71.08 32.09 33.22 34.10 34.23 34.10 34.91 38.01 33.32 MIN: 33.74 / MAX: 520.24 MIN: 32.97 / MAX: 462.63 MIN: 33.53 / MAX: 477.38 MIN: 32.46 / MAX: 518.57 MIN: 38.84 / MAX: 374.68 MIN: 31.84 / MAX: 32.77 MIN: 33.04 / MAX: 36.99 MIN: 32.32 / MAX: 38.54 MIN: 33.08 / MAX: 37.43 MIN: 32.43 / MAX: 38.75 MIN: 33.72 / MAX: 36.82 MIN: 32.96 / MAX: 388.09 MIN: 31.83 / MAX: 104.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: FastestDet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 i g 2 4 6 8 10 SE +/- 0.20, N = 15 2.81 3.96 4.03 4.41 6.93 4.08 4.03 4.20 4.17 4.14 4.28 3.83 3.92 MIN: 2.68 / MAX: 4.38 MIN: 3.79 / MAX: 11.36 MIN: 3.89 / MAX: 4.63 MIN: 2.06 / MAX: 295.24 MIN: 2.57 / MAX: 163.84 MIN: 4.04 / MAX: 4.29 MIN: 3.99 / MAX: 4.22 MIN: 4.01 / MAX: 11.47 MIN: 4.03 / MAX: 5.63 MIN: 4 / MAX: 5.6 MIN: 4.13 / MAX: 4.85 MIN: 3.7 / MAX: 4.57 MIN: 3.88 / MAX: 4.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 4 8 12 16 20 SE +/- 0.25, N = 15 8.93 9.02 9.16 9.52 17.09 8.05 8.03 9.19 8.31 8.38 8.43 MIN: 8.33 / MAX: 11.07 MIN: 8.42 / MAX: 11.17 MIN: 8.5 / MAX: 10.51 MIN: 7.97 / MAX: 420.29 MIN: 7.89 / MAX: 121.53 MIN: 7.96 / MAX: 9.04 MIN: 7.96 / MAX: 8.83 MIN: 8.51 / MAX: 11.04 MIN: 7.85 / MAX: 10.21 MIN: 7.94 / MAX: 10.07 MIN: 8.03 / MAX: 9.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 1.2285 2.457 3.6855 4.914 6.1425 SE +/- 0.18, N = 15 3.42 3.36 3.48 3.66 5.46 3.17 3.12 3.28 3.14 3.27 3.29 MIN: 3.15 / MAX: 25.1 MIN: 3.17 / MAX: 4.8 MIN: 3.32 / MAX: 4.99 MIN: 2.73 / MAX: 398.42 MIN: 3.27 / MAX: 38.65 MIN: 3.12 / MAX: 3.89 MIN: 3.07 / MAX: 3.62 MIN: 3.11 / MAX: 4.26 MIN: 3 / MAX: 3.85 MIN: 3.08 / MAX: 4.68 MIN: 3.12 / MAX: 3.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 1.3478 2.6956 4.0434 5.3912 6.739 SE +/- 0.13, N = 13 3.17 3.34 3.62 3.44 5.99 3.18 3.13 3.26 3.05 3.31 3.26 MIN: 3.04 / MAX: 4.3 MIN: 3.19 / MAX: 3.99 MIN: 3.47 / MAX: 4.24 MIN: 2.65 / MAX: 361.91 MIN: 3.05 / MAX: 26.81 MIN: 3.13 / MAX: 3.61 MIN: 3.09 / MAX: 3.68 MIN: 3.12 / MAX: 4.74 MIN: 2.94 / MAX: 3.56 MIN: 3.16 / MAX: 3.93 MIN: 3.13 / MAX: 4.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2 nv 4090 4090 rep 4090 RTX 3070 Ti 3070 3090 rep 3090 4080 zzz 4080 xxx 4080 rep 4080 1.2578 2.5156 3.7734 5.0312 6.289 SE +/- 0.20, N = 15 3.50 3.48 3.52 3.89 5.59 3.34 3.32 3.44 3.34 3.44 3.48 MIN: 3.37 / MAX: 4.2 MIN: 3.34 / MAX: 4.1 MIN: 3.38 / MAX: 4.23 MIN: 3.08 / MAX: 345.39 MIN: 3.32 / MAX: 42.33 MIN: 3.3 / MAX: 3.79 MIN: 3.29 / MAX: 3.79 MIN: 3.31 / MAX: 4.85 MIN: 3.22 / MAX: 3.97 MIN: 3.31 / MAX: 4.32 MIN: 3.34 / MAX: 4.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.04, N = 15
nv 4090: 3.07 (MIN: 2.93 / MAX: 4.52)
4090 rep: 4.99 (MIN: 3.02 / MAX: 235.56)
4090: 4.93 (MIN: 2.97 / MAX: 124.96)
RTX 3070 Ti: 3.10 (MIN: 2.61 / MAX: 4.75)
3070: 6.06 (MIN: 2.96 / MAX: 42.7)
3090 rep: 2.97 (MIN: 2.94 / MAX: 3.45)
3090: 2.94 (MIN: 2.9 / MAX: 3.34)
4080 zzz: 3.08 (MIN: 2.95 / MAX: 3.88)
4080 xxx: 2.98 (MIN: 2.86 / MAX: 4.47)
4080 rep: 3.06 (MIN: 2.93 / MAX: 5.02)
4080: 3.10 (MIN: 2.95 / MAX: 4.05)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.13, N = 15
nv 4090: 4.10 (MIN: 3.87 / MAX: 6.14)
4090 rep: 4.41 (MIN: 4.21 / MAX: 5.82)
4090: 4.15 (MIN: 3.93 / MAX: 5.94)
RTX 3070 Ti: 4.37 (MIN: 3.85 / MAX: 366.28)
3070: 9.81 (MIN: 3.87 / MAX: 165.38)
3090 rep: 3.85 (MIN: 3.78 / MAX: 4.83)
3090: 3.88 (MIN: 3.83 / MAX: 4.72)
4080 zzz: 4.05 (MIN: 3.83 / MAX: 5.42)
4080 xxx: 3.97 (MIN: 3.79 / MAX: 5.93)
4080 rep: 4.01 (MIN: 3.81 / MAX: 6.04)
4080: 4.04 (MIN: 3.84 / MAX: 4.83)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazeface
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.03, N = 15
nv 4090: 1.26 (MIN: 1.2 / MAX: 1.76)
4090 rep: 1.41 (MIN: 1.35 / MAX: 1.91)
4090: 1.42 (MIN: 1.36 / MAX: 1.92)
RTX 3070 Ti: 1.34 (MIN: 1.06 / MAX: 2.66)
3070: 2.99 (MIN: 1.22 / MAX: 149.55)
3090 rep: 1.37 (MIN: 1.35 / MAX: 1.48)
3090: 1.38 (MIN: 1.36 / MAX: 1.53)
4080 zzz: 1.41 (MIN: 1.34 / MAX: 1.88)
4080 xxx: 1.31 (MIN: 1.25 / MAX: 3.14)
4080 rep: 1.42 (MIN: 1.35 / MAX: 2.89)
4080: 1.44 (MIN: 1.37 / MAX: 2.07)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.19, N = 15
nv 4090: 8.35 (MIN: 7.7 / MAX: 10.46)
4090 rep: 10.39 (MIN: 7.87 / MAX: 391.66)
4090: 9.05 (MIN: 8.26 / MAX: 13.34)
RTX 3070 Ti: 9.90 (MIN: 7.76 / MAX: 396.66)
3070: 16.97 (MIN: 7.44 / MAX: 229.93)
3090 rep: 7.86 (MIN: 7.75 / MAX: 8.71)
3090: 7.84 (MIN: 7.74 / MAX: 8.72)
4080 zzz: 8.55 (MIN: 7.86 / MAX: 10.08)
4080 xxx: 8.26 (MIN: 7.62 / MAX: 10.47)
4080 rep: 8.42 (MIN: 7.77 / MAX: 10.52)
4080: 8.79 (MIN: 8.08 / MAX: 10.27)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.28, N = 15
nv 4090: 28.14 (MIN: 24.24 / MAX: 221.5)
4090 rep: 29.17 (MIN: 24.61 / MAX: 264.85)
4090: 27.44 (MIN: 24.06 / MAX: 264.59)
RTX 3070 Ti: 28.53 (MIN: 23.95 / MAX: 473.83)
3070: 49.70 (MIN: 25.55 / MAX: 421.44)
3090 rep: 23.38 (MIN: 23.19 / MAX: 24.27)
3090: 23.51 (MIN: 23.27 / MAX: 24.38)
4080 zzz: 26.09 (MIN: 24.58 / MAX: 30.18)
4080 xxx: 25.33 (MIN: 24.26 / MAX: 34.98)
4080 rep: 25.56 (MIN: 24.24 / MAX: 27.92)
4080: 25.67 (MIN: 24.46 / MAX: 27.34)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.20, N = 15
nv 4090: 7.38 (MIN: 5.15 / MAX: 138.85)
4090 rep: 5.90 (MIN: 5.43 / MAX: 7.49)
4090: 7.52 (MIN: 5.45 / MAX: 290.49)
RTX 3070 Ti: 6.40 (MIN: 5.1 / MAX: 457.07)
3070: 11.30 (MIN: 5.3 / MAX: 181.7)
3090 rep: 5.20 (MIN: 5.1 / MAX: 6.09)
3090: 5.21 (MIN: 5.09 / MAX: 6.13)
4080 zzz: 5.74 (MIN: 5.18 / MAX: 8.08)
4080 xxx: 5.65 (MIN: 5.18 / MAX: 6.76)
4080 rep: 5.67 (MIN: 5.19 / MAX: 7.38)
4080: 5.92 (MIN: 5.37 / MAX: 8.24)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.16, N = 15
nv 4090: 4.69 (MIN: 4.28 / MAX: 6.33)
4090 rep: 5.25 (MIN: 4.86 / MAX: 6.33)
4090: 4.67 (MIN: 4.28 / MAX: 6)
RTX 3070 Ti: 5.34 (MIN: 4.25 / MAX: 221.78)
3070: 11.89 (MIN: 4.34 / MAX: 229.18)
3090 rep: 4.30 (MIN: 4.24 / MAX: 5.11)
3090: 4.32 (MIN: 4.25 / MAX: 5.33)
4080 zzz: 4.70 (MIN: 4.28 / MAX: 5.92)
4080 xxx: 4.71 (MIN: 4.26 / MAX: 7.21)
4080 rep: 4.64 (MIN: 4.24 / MAX: 6)
4080: 4.98 (MIN: 4.59 / MAX: 7.15)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.25, N = 15
nv 4090: 13.13 (MIN: 10.18 / MAX: 247.5)
4090 rep: 11.51 (MIN: 10.56 / MAX: 13.22)
4090: 12.40 (MIN: 11.44 / MAX: 14.43)
RTX 3070 Ti: 12.42 (MIN: 10.23 / MAX: 444.76)
3070: 23.44 (MIN: 10.17 / MAX: 219.36)
3090 rep: 9.98 (MIN: 9.85 / MAX: 11.35)
3090: 10.07 (MIN: 9.95 / MAX: 10.88)
4080 zzz: 12.50 (MIN: 11.47 / MAX: 14.56)
4080 xxx: 11.22 (MIN: 10.33 / MAX: 12.81)
4080 rep: 11.07 (MIN: 10.16 / MAX: 13.16)
4080: 11.48 (MIN: 10.56 / MAX: 12.93)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.19, N = 15
nv 4090: 15.55 (MIN: 12.87 / MAX: 342.3)
4090 rep: 15.40 (MIN: 13 / MAX: 245.79)
4090: 15.95 (MIN: 13.38 / MAX: 245.18)
RTX 3070 Ti: 15.00 (MIN: 12.75 / MAX: 401.37)
3070: 28.73 (MIN: 12.83 / MAX: 264.49)
3090 rep: 12.90 (MIN: 12.77 / MAX: 13.92)
3090: 12.97 (MIN: 12.83 / MAX: 13.8)
4080 zzz: 15.26 (MIN: 14.19 / MAX: 17.06)
4080 xxx: 13.52 (MIN: 12.72 / MAX: 21.19)
4080 rep: 13.73 (MIN: 12.78 / MAX: 20.99)
4080: 13.93 (MIN: 13.08 / MAX: 15.68)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.24, N = 15
nv 4090: 7.02 (MIN: 6.38 / MAX: 9.36)
4090 rep: 9.46 (MIN: 7.03 / MAX: 160.39)
4090: 9.81 (MIN: 7.16 / MAX: 389.1)
RTX 3070 Ti: 8.65 (MIN: 6.64 / MAX: 544.17)
3070: 18.83 (MIN: 6.71 / MAX: 206.11)
3090 rep: 7.08 (MIN: 7 / MAX: 7.94)
3090: 7.05 (MIN: 6.97 / MAX: 7.95)
4080 zzz: 8.06 (MIN: 7.42 / MAX: 9.25)
4080 xxx: 7.27 (MIN: 6.74 / MAX: 8.84)
4080 rep: 7.62 (MIN: 7.01 / MAX: 14.37)
4080: 7.71 (MIN: 7.15 / MAX: 9.1)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400m
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.24, N = 15
nv 4090: 10.03 (MIN: 7.81 / MAX: 171.2)
4090 rep: 10.69 (MIN: 8.17 / MAX: 339.6)
4090: 9.87 (MIN: 7.81 / MAX: 243.06)
RTX 3070 Ti: 9.05 (MIN: 7.52 / MAX: 417.33)
3070: 17.61 (MIN: 7.85 / MAX: 165.34)
3090 rep: 8.07 (MIN: 7.99 / MAX: 8.88)
3090: 8.33 (MIN: 8.25 / MAX: 9.32)
4080 zzz: 8.37 (MIN: 8.04 / MAX: 10.13)
4080 xxx: 8.25 (MIN: 7.93 / MAX: 9.88)
4080 rep: 8.35 (MIN: 8.05 / MAX: 9.76)
4080: 8.67 (MIN: 8.3 / MAX: 14.66)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformer
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.11, N = 15
nv 4090: 38.46 (MIN: 32.39 / MAX: 435.46)
4090 rep: 39.03 (MIN: 33.61 / MAX: 343.67)
4090: 38.62 (MIN: 33.33 / MAX: 465)
RTX 3070 Ti: 38.27 (MIN: 32.29 / MAX: 507.7)
3070: 70.29 (MIN: 39.39 / MAX: 250.19)
3090 rep: 31.97 (MIN: 31.71 / MAX: 33.78)
3090: 31.94 (MIN: 31.72 / MAX: 34.34)
4080 zzz: 35.36 (MIN: 33.87 / MAX: 42.41)
4080 xxx: 33.90 (MIN: 32.72 / MAX: 37.77)
4080 rep: 33.93 (MIN: 32.77 / MAX: 36.2)
4080: 35.60 (MIN: 34.13 / MAX: 38.49)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.27, N = 15
nv 4090: 2.64 (MIN: 2.52 / MAX: 4.14)
4090 rep: 4.11 (MIN: 3.98 / MAX: 4.73)
4090: 4.45 (MIN: 4.29 / MAX: 5.05)
RTX 3070 Ti: 4.32 (MIN: 2.51 / MAX: 398.91)
3070: 6.71 (MIN: 2.73 / MAX: 109.52)
3090 rep: 4.07 (MIN: 4.03 / MAX: 4.18)
3090: 3.83 (MIN: 3.79 / MAX: 4.09)
4080 zzz: 4.61 (MIN: 4.45 / MAX: 5.92)
4080 xxx: 3.75 (MIN: 3.63 / MAX: 5.24)
4080 rep: 4.17 (MIN: 4.02 / MAX: 4.75)
4080: 4.19 (MIN: 4.06 / MAX: 7.41)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: mobilenet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.14, N = 3
nv 4090: 12.12 (MIN: 9.16 / MAX: 505.01)
4090 rep: 10.61 (MIN: 8.34 / MAX: 225.97)
4090: 9.04 (MIN: 8.49 / MAX: 10.96)
RTX 3070 Ti: 10.03 (MIN: 7.86 / MAX: 346.64)
3070: 16.52 (MIN: 7.9 / MAX: 82.53)
3090 rep: 8.06 (MIN: 8 / MAX: 8.96)
3090: 8.07 (MIN: 8.01 / MAX: 8.62)
4080 zzz: 8.25 (MIN: 7.78 / MAX: 9.61)
4080 xxx: 8.34 (MIN: 7.89 / MAX: 9.42)
4080 rep: 8.45 (MIN: 8.01 / MAX: 10.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.53, N = 3
nv 4090: 3.29 (MIN: 3.13 / MAX: 4.29)
4090 rep: 3.44 (MIN: 3.27 / MAX: 4.93)
4090: 3.36 (MIN: 3.21 / MAX: 4.78)
RTX 3070 Ti: 3.91 (MIN: 3.04 / MAX: 394.66)
3070: 7.22 (MIN: 3.17 / MAX: 69.66)
3090 rep: 3.16 (MIN: 3.09 / MAX: 4.06)
3090: 3.16 (MIN: 3.11 / MAX: 3.95)
4080 zzz: 3.16 (MIN: 3.01 / MAX: 5.17)
4080 xxx: 3.20 (MIN: 3.05 / MAX: 4.67)
4080 rep: 3.30 (MIN: 3.11 / MAX: 4.01)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.53, N = 3
nv 4090: 4.97 (MIN: 3.15 / MAX: 291.01)
4090 rep: 3.30 (MIN: 3.15 / MAX: 3.91)
4090: 3.33 (MIN: 3.2 / MAX: 4.4)
RTX 3070 Ti: 3.70 (MIN: 2.98 / MAX: 261.6)
3070: 6.43 (MIN: 2.85 / MAX: 164.91)
3090 rep: 3.17 (MIN: 3.12 / MAX: 3.75)
4080 zzz: 3.06 (MIN: 2.94 / MAX: 3.94)
4080 xxx: 3.08 (MIN: 2.97 / MAX: 3.67)
4080 rep: 3.28 (MIN: 3.13 / MAX: 4.78)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.60, N = 3
nv 4090: 3.32 (MIN: 3.19 / MAX: 4.76)
4090 rep: 3.42 (MIN: 3.29 / MAX: 3.94)
4090: 3.48 (MIN: 3.35 / MAX: 4.05)
RTX 3070 Ti: 4.02 (MIN: 3.27 / MAX: 328.59)
3070: 7.81 (MIN: 3.3 / MAX: 131.26)
3090 rep: 3.33 (MIN: 3.3 / MAX: 3.78)
3090: 3.36 (MIN: 3.32 / MAX: 3.66)
4080 zzz: 3.36 (MIN: 3.23 / MAX: 3.99)
4080 xxx: 3.40 (MIN: 3.28 / MAX: 3.87)
4080 rep: 3.47 (MIN: 3.33 / MAX: 5.39)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: mnasnet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.13, N = 3
nv 4090: 3.12 (MIN: 2.98 / MAX: 3.71)
4090 rep: 5.11 (MIN: 2.96 / MAX: 247.47)
4090: 5.19 (MIN: 3.04 / MAX: 436.91)
RTX 3070 Ti: 3.24 (MIN: 2.9 / MAX: 5.34)
3070: 6.07 (MIN: 2.94 / MAX: 129.1)
3090 rep: 2.97 (MIN: 2.93 / MAX: 3.3)
3090: 2.98 (MIN: 2.95 / MAX: 3.9)
4080 zzz: 2.96 (MIN: 2.85 / MAX: 3.82)
4080 xxx: 3.00 (MIN: 2.88 / MAX: 4.37)
4080 rep: 3.09 (MIN: 2.96 / MAX: 4.98)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.46, N = 3
nv 4090: 5.94 (MIN: 3.97 / MAX: 208.59)
4090 rep: 4.35 (MIN: 4.08 / MAX: 5.62)
4090: 4.47 (MIN: 4.23 / MAX: 5.82)
RTX 3070 Ti: 4.74 (MIN: 3.68 / MAX: 295.7)
3070: 9.19 (MIN: 3.85 / MAX: 131.42)
3090 rep: 3.85 (MIN: 3.81 / MAX: 4.75)
3090: 3.88 (MIN: 3.83 / MAX: 4.61)
4080 zzz: 3.95 (MIN: 3.79 / MAX: 4.59)
4080 xxx: 4.01 (MIN: 3.83 / MAX: 5.28)
4080 rep: 4.07 (MIN: 3.85 / MAX: 4.79)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: blazeface
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.48, N = 3
nv 4090: 1.42 (MIN: 1.34 / MAX: 1.99)
4090 rep: 1.42 (MIN: 1.34 / MAX: 2.37)
4090: 1.30 (MIN: 1.24 / MAX: 1.92)
RTX 3070 Ti: 2.48 (MIN: 1.17 / MAX: 344.52)
3070: 2.53 (MIN: 1.08 / MAX: 118.73)
3090 rep: 1.38 (MIN: 1.35 / MAX: 1.64)
3090: 1.39 (MIN: 1.37 / MAX: 1.48)
4080 zzz: 1.31 (MIN: 1.25 / MAX: 1.76)
4080 xxx: 1.32 (MIN: 1.26 / MAX: 2.03)
4080 rep: 1.43 (MIN: 1.36 / MAX: 2.02)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: googlenet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.80, N = 3
nv 4090: 10.75 (MIN: 7.92 / MAX: 447.83)
4090 rep: 8.97 (MIN: 8.22 / MAX: 10.51)
4090: 8.38 (MIN: 7.78 / MAX: 10.43)
RTX 3070 Ti: 9.68 (MIN: 8.16 / MAX: 382.41)
3070: 19.20 (MIN: 7.84 / MAX: 193.36)
3090 rep: 7.91 (MIN: 7.81 / MAX: 8.62)
3090: 7.90 (MIN: 7.8 / MAX: 8.73)
4080 zzz: 8.29 (MIN: 7.63 / MAX: 9.87)
4080 xxx: 8.32 (MIN: 7.71 / MAX: 10.39)
4080 rep: 8.52 (MIN: 7.85 / MAX: 10.56)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: vgg16
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.54, N = 3
nv 4090: 27.61 (MIN: 24.67 / MAX: 401.29)
4090 rep: 30.74 (MIN: 25.36 / MAX: 428.68)
4090: 30.16 (MIN: 24.66 / MAX: 332.49)
RTX 3070 Ti: 27.98 (MIN: 24.35 / MAX: 423.63)
3070: 50.32 (MIN: 25.92 / MAX: 281.06)
3090 rep: 23.40 (MIN: 23.2 / MAX: 24.07)
3090: 23.58 (MIN: 23.35 / MAX: 24.43)
4080 zzz: 25.26 (MIN: 24.29 / MAX: 27.75)
4080 xxx: 25.44 (MIN: 24.27 / MAX: 27.68)
4080 rep: 25.01 (MIN: 23.88 / MAX: 26.66)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: resnet18
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.30, N = 3
nv 4090: 6.07 (MIN: 5.49 / MAX: 15.12)
4090 rep: 8.14 (MIN: 5.39 / MAX: 122.47)
4090: 7.74 (MIN: 5.25 / MAX: 312.09)
RTX 3070 Ti: 6.22 (MIN: 5.3 / MAX: 8.22)
3070: 12.64 (MIN: 5.3 / MAX: 53.81)
3090 rep: 5.30 (MIN: 5.21 / MAX: 6.24)
3090: 5.20 (MIN: 5.1 / MAX: 6.16)
4080 zzz: 5.77 (MIN: 5.22 / MAX: 7.06)
4080 xxx: 5.78 (MIN: 5.21 / MAX: 6.97)
4080 rep: 5.64 (MIN: 5.11 / MAX: 7.51)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: alexnet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.43, N = 3
nv 4090: 6.62 (MIN: 4.28 / MAX: 339.62)
4090 rep: 5.45 (MIN: 4.93 / MAX: 7.98)
4090: 4.99 (MIN: 4.56 / MAX: 6.91)
RTX 3070 Ti: 6.17 (MIN: 4.5 / MAX: 261.75)
3070: 10.59 (MIN: 4.3 / MAX: 177.68)
3090 rep: 4.31 (MIN: 4.26 / MAX: 5.07)
3090: 4.33 (MIN: 4.26 / MAX: 5.19)
4080 zzz: 4.66 (MIN: 4.24 / MAX: 5.97)
4080 xxx: 4.69 (MIN: 4.26 / MAX: 6.07)
4080 rep: 4.68 (MIN: 4.27 / MAX: 6.08)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: resnet50
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.04, N = 3
nv 4090: 13.29 (MIN: 10.54 / MAX: 456.82)
4090 rep: 12.47 (MIN: 11.5 / MAX: 14.68)
4090: 11.72 (MIN: 10.8 / MAX: 12.8)
RTX 3070 Ti: 12.81 (MIN: 10.06 / MAX: 349.03)
3070: 22.19 (MIN: 10.16 / MAX: 181.74)
3090 rep: 10.06 (MIN: 9.86 / MAX: 11.9)
3090: 10.03 (MIN: 9.93 / MAX: 10.87)
4080 zzz: 11.10 (MIN: 10.19 / MAX: 18.3)
4080 xxx: 11.26 (MIN: 10.32 / MAX: 13.29)
4080 rep: 10.86 (MIN: 9.98 / MAX: 12.46)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.81, N = 3
nv 4090: 17.67 (MIN: 14.92 / MAX: 343.93)
4090 rep: 13.88 (MIN: 13.09 / MAX: 14.77)
4090: 15.85 (MIN: 13.26 / MAX: 253.23)
RTX 3070 Ti: 14.57 (MIN: 12.33 / MAX: 312.42)
3070: 28.41 (MIN: 12.49 / MAX: 151.04)
3090 rep: 12.81 (MIN: 12.7 / MAX: 13.69)
3090: 12.82 (MIN: 12.72 / MAX: 13.66)
4080 zzz: 13.42 (MIN: 12.65 / MAX: 16.19)
4080 xxx: 13.63 (MIN: 12.77 / MAX: 16.93)
4080 rep: 13.71 (MIN: 12.78 / MAX: 15.62)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.23, N = 3
nv 4090: 7.48 (MIN: 6.85 / MAX: 9.67)
4090 rep: 9.38 (MIN: 6.77 / MAX: 224.11)
4090: 9.51 (MIN: 7.11 / MAX: 307.17)
RTX 3070 Ti: 7.57 (MIN: 6.69 / MAX: 10)
3070: 14.27 (MIN: 7.01 / MAX: 51.13)
3090 rep: 7.09 (MIN: 7.02 / MAX: 7.86)
3090: 7.12 (MIN: 7.04 / MAX: 7.97)
4080 zzz: 7.25 (MIN: 6.72 / MAX: 8.05)
4080 xxx: 7.27 (MIN: 6.73 / MAX: 8.77)
4080 rep: 7.67 (MIN: 7.06 / MAX: 9.96)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: regnety_400m
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.29, N = 3
nv 4090: 8.25 (MIN: 7.87 / MAX: 10.07)
4090 rep: 10.23 (MIN: 8.22 / MAX: 197.1)
4090: 10.09 (MIN: 8.01 / MAX: 418.58)
RTX 3070 Ti: 8.42 (MIN: 7.66 / MAX: 10.74)
3070: 18.25 (MIN: 7.8 / MAX: 238.29)
3090 rep: 8.03 (MIN: 7.97 / MAX: 8.65)
3090: 8.22 (MIN: 8.14 / MAX: 8.67)
4080 zzz: 8.34 (MIN: 8.03 / MAX: 10.23)
4080 xxx: 8.38 (MIN: 8.04 / MAX: 9.63)
4080 rep: 8.72 (MIN: 8.32 / MAX: 10.48)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: vision_transformer
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.11, N = 3
nv 4090: 38.95 (MIN: 34.04 / MAX: 486.96)
4090 rep: 38.79 (MIN: 34.02 / MAX: 460.15)
4090: 38.76 (MIN: 33.38 / MAX: 423.24)
RTX 3070 Ti: 38.04 (MIN: 33.11 / MAX: 346.94)
3070: 65.41 (MIN: 39.08 / MAX: 230.59)
3090 rep: 31.85 (MIN: 31.67 / MAX: 35.74)
3090: 32.10 (MIN: 31.9 / MAX: 33.03)
4080 zzz: 34.47 (MIN: 33.05 / MAX: 39.69)
4080 xxx: 34.14 (MIN: 32.5 / MAX: 37.13)
4080 rep: 34.22 (MIN: 33.01 / MAX: 37.09)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3 - Model: FastestDet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.87, N = 3
nv 4090: 3.93 (MIN: 3.76 / MAX: 11.77)
4090 rep: 3.12 (MIN: 2.97 / MAX: 4.42)
4090: 2.85 (MIN: 2.74 / MAX: 4.36)
RTX 3070 Ti: 4.18 (MIN: 2.53 / MAX: 295.11)
3070: 7.12 (MIN: 3.72 / MAX: 188.7)
3090 rep: 4.07 (MIN: 4.03 / MAX: 4.2)
3090: 4.10 (MIN: 4.07 / MAX: 4.34)
4080 zzz: 3.82 (MIN: 3.65 / MAX: 9.77)
4080 xxx: 3.80 (MIN: 3.65 / MAX: 6.08)
4080 rep: 4.20 (MIN: 4.04 / MAX: 5.63)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.24, N = 15
nv 4090: 10.15 (MIN: 8.08 / MAX: 193.04)
4090 rep: 8.83 (MIN: 8.29 / MAX: 10.15)
4090: 8.46 (MIN: 8.12 / MAX: 10.14)
RTX 3070 Ti: 9.98 (MIN: 7.79 / MAX: 434.9)
3070: 18.54 (MIN: 8.01 / MAX: 164.45)
3090 rep: 8.05 (MIN: 7.98 / MAX: 8.94)
3090: 8.01 (MIN: 7.96 / MAX: 8.47)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.20, N = 15
nv 4090: 3.27 (MIN: 3.11 / MAX: 4.1)
4090 rep: 3.60 (MIN: 3.44 / MAX: 4.27)
4090: 5.25 (MIN: 3.11 / MAX: 367.53)
RTX 3070 Ti: 3.69 (MIN: 3.07 / MAX: 544.13)
3070: 5.49 (MIN: 2.97 / MAX: 152.08)
3090 rep: 3.17 (MIN: 3.12 / MAX: 3.78)
3090: 3.15 (MIN: 3.11 / MAX: 3.78)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.17, N = 15
nv 4090: 4.81 (MIN: 3.13 / MAX: 149.75)
4090 rep: 3.44 (MIN: 3.3 / MAX: 4.34)
4090: 3.36 (MIN: 3.21 / MAX: 4.83)
RTX 3070 Ti: 3.52 (MIN: 2.95 / MAX: 536.1)
3070: 5.97 (MIN: 2.84 / MAX: 111.8)
3090: 3.16 (MIN: 3.12 / MAX: 3.67)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.20, N = 15
nv 4090: 3.37 (MIN: 3.25 / MAX: 5.26)
4090 rep: 5.18 (MIN: 3.45 / MAX: 200.36)
4090: 3.47 (MIN: 3.33 / MAX: 5.01)
RTX 3070 Ti: 3.92 (MIN: 3.12 / MAX: 496.78)
3070: 6.30 (MIN: 3.28 / MAX: 147.57)
3090 rep: 3.36 (MIN: 3.33 / MAX: 3.83)
3090: 3.36 (MIN: 3.32 / MAX: 3.82)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.11, N = 15
nv 4090: 3.10 (MIN: 2.97 / MAX: 3.92)
4090 rep: 3.28 (MIN: 3.15 / MAX: 4.32)
4090: 3.19 (MIN: 3.04 / MAX: 3.98)
RTX 3070 Ti: 3.25 (MIN: 2.68 / MAX: 277.21)
3070: 8.15 (MIN: 2.67 / MAX: 317.68)
3090 rep: 2.98 (MIN: 2.94 / MAX: 3.36)
3090: 2.97 (MIN: 2.93 / MAX: 3.45)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.16, N = 15
nv 4090: 5.88 (MIN: 3.96 / MAX: 194.08)
4090 rep: 4.44 (MIN: 4.24 / MAX: 5.18)
4090: 4.14 (MIN: 3.93 / MAX: 5.94)
RTX 3070 Ti: 4.55 (MIN: 3.84 / MAX: 379.07)
3070: 9.53 (MIN: 3.77 / MAX: 182.53)
3090 rep: 3.85 (MIN: 3.81 / MAX: 4.6)
3090: 3.86 (MIN: 3.82 / MAX: 4.82)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazeface
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.14, N = 15
nv 4090: 2.91 (MIN: 1.29 / MAX: 113.97)
4090 rep: 1.38 (MIN: 1.33 / MAX: 1.98)
4090: 1.45 (MIN: 1.38 / MAX: 2.98)
RTX 3070 Ti: 1.51 (MIN: 1.11 / MAX: 380.46)
3070: 3.57 (MIN: 1.08 / MAX: 141.04)
3090 rep: 1.38 (MIN: 1.35 / MAX: 1.88)
3090: 1.39 (MIN: 1.36 / MAX: 3.12)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.22, N = 15
nv 4090: 8.85 (MIN: 8.16 / MAX: 10.25)
4090 rep: 10.18 (MIN: 7.81 / MAX: 204.67)
4090: 8.91 (MIN: 8.3 / MAX: 10.96)
RTX 3070 Ti: 9.58 (MIN: 7.62 / MAX: 396.9)
3070: 18.66 (MIN: 7.42 / MAX: 326.73)
3090 rep: 7.82 (MIN: 7.72 / MAX: 8.6)
3090: 7.83 (MIN: 7.73 / MAX: 8.6)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.26, N = 15
nv 4090: 29.40 (MIN: 26.17 / MAX: 411.51)
4090 rep: 29.35 (MIN: 24.55 / MAX: 485.35)
4090: 27.31 (MIN: 24.27 / MAX: 230.86)
RTX 3070 Ti: 28.53 (MIN: 24.21 / MAX: 515.3)
3070: 51.28 (MIN: 24.83 / MAX: 242.12)
3090 rep: 23.47 (MIN: 23.25 / MAX: 24.24)
3090: 23.50 (MIN: 23.26 / MAX: 24.34)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.24, N = 15
nv 4090: 5.97 (MIN: 5.4 / MAX: 8.25)
4090 rep: 5.84 (MIN: 5.35 / MAX: 8.28)
4090: 7.78 (MIN: 5.4 / MAX: 168.29)
RTX 3070 Ti: 6.69 (MIN: 5.06 / MAX: 462.37)
3070: 13.34 (MIN: 5.43 / MAX: 279.86)
3090 rep: 5.20 (MIN: 5.08 / MAX: 6.05)
3090: 5.20 (MIN: 5.1 / MAX: 6.05)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.21, N = 15
nv 4090: 6.54 (MIN: 4.56 / MAX: 110.58)
4090 rep: 5.16 (MIN: 4.73 / MAX: 6.38)
4090: 4.94 (MIN: 4.52 / MAX: 6.23)
RTX 3070 Ti: 5.41 (MIN: 4.23 / MAX: 364.66)
3070: 10.69 (MIN: 4.32 / MAX: 148.92)
3090 rep: 4.30 (MIN: 4.24 / MAX: 4.85)
3090: 4.30 (MIN: 4.25 / MAX: 4.63)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.24, N = 15
nv 4090: 13.46 (MIN: 10.6 / MAX: 340.67)
4090 rep: 11.24 (MIN: 10.22 / MAX: 29.96)
4090: 12.98 (MIN: 10.26 / MAX: 145.62)
RTX 3070 Ti: 12.52 (MIN: 9.95 / MAX: 459.05)
3070: 23.54 (MIN: 10.3 / MAX: 149.49)
3090 rep: 10.04 (MIN: 9.94 / MAX: 10.91)
3090: 10.03 (MIN: 9.88 / MAX: 10.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.25, N = 15
nv 4090: 15.67 (MIN: 12.91 / MAX: 334.44)
4090 rep: 16.60 (MIN: 12.98 / MAX: 103.04)
4090: 15.69 (MIN: 13.13 / MAX: 187.93)
RTX 3070 Ti: 15.56 (MIN: 12.24 / MAX: 459.8)
3070: 26.33 (MIN: 12.62 / MAX: 127.32)
3090 rep: 12.89 (MIN: 12.79 / MAX: 13.77)
3090: 12.86 (MIN: 12.74 / MAX: 13.68)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.22, N = 15
nv 4090: 7.72 (MIN: 7.13 / MAX: 8.97)
4090 rep: 9.34 (MIN: 6.88 / MAX: 268.7)
4090: 7.40 (MIN: 6.81 / MAX: 8.46)
RTX 3070 Ti: 8.31 (MIN: 6.35 / MAX: 364.95)
3070: 15.46 (MIN: 7.08 / MAX: 147.31)
3090 rep: 7.07 (MIN: 6.99 / MAX: 7.81)
3090: 7.05 (MIN: 6.98 / MAX: 7.81)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400m
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.20, N = 15
nv 4090: 8.37 (MIN: 8.08 / MAX: 10.1)
4090 rep: 8.45 (MIN: 8.05 / MAX: 12.64)
4090: 10.05 (MIN: 8.13 / MAX: 173.18)
RTX 3070 Ti: 9.10 (MIN: 7.61 / MAX: 454.62)
3070: 18.24 (MIN: 7.5 / MAX: 201.09)
3090 rep: 8.19 (MIN: 8.12 / MAX: 8.98)
3090: 8.20 (MIN: 8.14 / MAX: 8.74)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformer
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.12, N = 15
nv 4090: 38.58 (MIN: 33.06 / MAX: 464.16)
4090 rep: 38.69 (MIN: 33.32 / MAX: 390.07)
4090: 38.82 (MIN: 33.83 / MAX: 435.6)
RTX 3070 Ti: 38.32 (MIN: 32.26 / MAX: 477.15)
3070: 69.48 (MIN: 39.08 / MAX: 374.31)
3090 rep: 32.13 (MIN: 31.95 / MAX: 32.87)
3090: 32.16 (MIN: 31.94 / MAX: 33.7)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.29, N = 15
nv 4090: 4.01 (MIN: 3.87 / MAX: 5.47)
4090 rep: 3.91 (MIN: 3.77 / MAX: 5.87)
4090: 2.82 (MIN: 2.69 / MAX: 3.5)
RTX 3070 Ti: 4.25 (MIN: 2.46 / MAX: 526.3)
3070: 4.48 (MIN: 2.2 / MAX: 27.6)
3090 rep: 4.10 (MIN: 4.06 / MAX: 4.2)
3090: 4.08 (MIN: 4.04 / MAX: 4.2)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.13, N = 3
nv 4090: 10.64 (MIN: 8.4 / MAX: 127.99)
4090 rep: 8.22 (MIN: 7.75 / MAX: 9.41)
4090: 10.56 (MIN: 8.32 / MAX: 239.95)
RTX 3070 Ti: 10.02 (MIN: 7.8 / MAX: 372.36)
3070: 17.06 (MIN: 8 / MAX: 101.45)
3090 rep: 8.03 (MIN: 7.97 / MAX: 8.91)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.53, N = 3
nv 4090: 3.29 (MIN: 3.12 / MAX: 4.27)
4090 rep: 3.38 (MIN: 3.2 / MAX: 4)
4090: 4.75 (MIN: 2.93 / MAX: 147.66)
RTX 3070 Ti: 3.83 (MIN: 3.11 / MAX: 343.21)
3070: 5.92 (MIN: 3.16 / MAX: 103.24)
3090 rep: 3.15 (MIN: 3.1 / MAX: 3.75)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.04, N = 3
nv 4090: 4.96 (MIN: 3.14 / MAX: 189.43)
4090 rep: 3.35 (MIN: 3.22 / MAX: 3.99)
4090: 3.36 (MIN: 3.22 / MAX: 4.62)
RTX 3070 Ti: 3.24 (MIN: 3.05 / MAX: 5.14)
3070: 7.34 (MIN: 3.09 / MAX: 155.33)
3090 rep: 3.19 (MIN: 3.13 / MAX: 3.61)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.02, N = 3
nv 4090: 3.43 (MIN: 3.29 / MAX: 5.31)
4090 rep: 5.23 (MIN: 3.34 / MAX: 185.57)
4090: 3.56 (MIN: 3.43 / MAX: 4.24)
RTX 3070 Ti: 3.48 (MIN: 3.33 / MAX: 5.22)
3070: 5.89 (MIN: 3.19 / MAX: 97.88)
3090 rep: 3.32 (MIN: 3.29 / MAX: 3.62)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.02, N = 3
nv 4090: 3.10 (MIN: 2.97 / MAX: 3.73)
4090 rep: 3.12 (MIN: 3 / MAX: 4.1)
4090: 3.23 (MIN: 3.08 / MAX: 4.73)
RTX 3070 Ti: 3.12 (MIN: 2.97 / MAX: 4.65)
3070: 8.55 (MIN: 2.99 / MAX: 185.5)
3090 rep: 2.96 (MIN: 2.92 / MAX: 3.27)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.08, N = 3
nv 4090: 5.82 (MIN: 3.98 / MAX: 197.79)
4090 rep: 4.10 (MIN: 3.88 / MAX: 5.04)
4090: 4.63 (MIN: 4.38 / MAX: 6.01)
RTX 3070 Ti: 4.17 (MIN: 3.86 / MAX: 5.52)
3070: 6.63 (MIN: 3.75 / MAX: 22.34)
3090 rep: 3.84 (MIN: 3.8 / MAX: 4.67)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazeface
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.04, N = 3
nv 4090: 1.40 (MIN: 1.34 / MAX: 1.86)
4090 rep: 1.42 (MIN: 1.36 / MAX: 2.03)
4090: 1.35 (MIN: 1.28 / MAX: 1.84)
RTX 3070 Ti: 1.40 (MIN: 1.28 / MAX: 1.91)
3070: 2.69 (MIN: 1.35 / MAX: 48.81)
3090 rep: 1.38 (MIN: 1.36 / MAX: 1.73)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenet
OpenBenchmarking.org - ms, Fewer Is Better - SE +/- 0.55, N = 3
nv 4090: 10.14 (MIN: 7.85 / MAX: 257.61)
4090 rep: 10.47 (MIN: 7.86 / MAX: 191.94)
4090: 8.55 (MIN: 7.85 / MAX: 11.39)
RTX 3070 Ti: 9.97 (MIN: 8.16 / MAX: 381.49)
3070: 18.80 (MIN: 7.78 / MAX: 141.46)
3090 rep: 7.86 (MIN: 7.75 / MAX: 8.57)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN     MAX
nv 4090       27.25    24.12   252.53
4090 rep      29.85    24.25   400.86
4090          27.32    24.36   262.38
RTX 3070 Ti   27.86    24.17   416.36
3070          53.48    25.52   296.52
3090 rep      23.72    23.56   24.59
SE +/- 0.28, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN    MAX
nv 4090       5.58     5.09   6.98
4090 rep      5.87     5.41   7.58
4090          6.96     5.3    242.18
RTX 3070 Ti   5.94     5.32   8.32
3070          12.13    5.32   123.4
3090 rep      5.27     5.15   6.11
SE +/- 0.05, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnet
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN    MAX
nv 4090       6.32     4.26   195.95
4090 rep      5.34     4.87   6.57
4090          5.14     4.75   7.34
RTX 3070 Ti   6.25     4.27   334.55
3070          11.43    4.24   178.83
3090 rep      4.31     4.26   4.83
SE +/- 0.57, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN     MAX
nv 4090       13.25    10.61   154.12
4090 rep      10.96    10.09   12.99
4090          13.00    10.34   397.57
RTX 3070 Ti   13.15    10.26   349.93
3070          22.15    10.11   123.04
3090 rep      10.27    10.12   11.19
SE +/- 0.30, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN     MAX
nv 4090       16.30    14.11   184.46
4090 rep      15.41    12.75   226.87
4090          16.05    12.93   474.03
RTX 3070 Ti   14.64    12.77   383.28
3070          29.38    12.95   201.31
3090 rep      12.92    12.79   18.5
SE +/- 0.94, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN    MAX
nv 4090       8.26     7.64   11.08
4090 rep      9.44     7.17   94.63
4090          7.43     6.84   8.82
RTX 3070 Ti   7.45     6.59   9.11
3070          15.32    6.66   139.17
3090 rep      7.07     6.98   9.71
SE +/- 0.14, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400m
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN    MAX
nv 4090       8.34     8.01   12.36
4090 rep      8.70     8.29   12.6
4090          10.11    8.03   259.38
RTX 3070 Ti   9.14     8.14   400.02
3070          17.02    7.65   216.63
3090 rep      8.06     7.98   8.6
SE +/- 0.54, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformer
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN     MAX
nv 4090       37.13    33.97   443.1
4090 rep      38.65    33.07   476.08
4090          39.35    34.22   466.65
RTX 3070 Ti   38.50    33.7    418.06
3070          70.53    39.2    276.33
3090 rep      31.94    31.73   32.75
SE +/- 0.10, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDet
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result   MIN    MAX
nv 4090       5.86     3.9    190.17
4090 rep      4.59     4.44   5.2
4090          4.62     4.48   5.16
RTX 3070 Ti   4.14     3.73   5.07
3070          7.23     3.75   121.71
3090 rep      4.07     4.04   4.25
SE +/- 0.15, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

VkFFT 1.2.31 - Test: FFT + iFFT R2C / C2R
OpenBenchmarking.org - Benchmark Score, More Is Better
Run        Score
nv 4090    84887
4090 rep   81329
4090       84351
3090 rep   54432
3090       55347
4080 zzz   67689
4080 xxx   69068
4080 rep   68279
4080       66473
i          33727
h          26524
g          26638
f          26593
e          35304
d          35399
c          43021
b          42163
a          42105
SE +/- 796.66, N = 3; SE +/- 3.71, N = 3; SE +/- 118.74, N = 3; SE +/- 200.55, N = 3
1. (CXX) g++ options: -O3

VkFFT 1.2.31 - Test: FFT + iFFT C2C 1D batched in half precision
OpenBenchmarking.org - Benchmark Score, More Is Better
Run        Score
nv 4090    292768
4090 rep   287651
4090       290342
3090 rep   265171
3090       255207
4080 zzz   210991
4080 xxx   210713
4080 rep   211058
4080       211076
i          132270
h          104298
g          104171
f          104146
e          85191
d          85181
c          91744
b          91812
a          91597
SE +/- 133.47, N = 3; SE +/- 26.03, N = 3; SE +/- 18.50, N = 3; SE +/- 83.55, N = 3
1. (CXX) g++ options: -O3

VkFFT 1.2.31 - Test: FFT + iFFT C2C Bluestein in single precision
OpenBenchmarking.org - Benchmark Score, More Is Better
Run        Score
nv 4090    20601
4090 rep   20404
4090       20373
3090 rep   14449
3090       14406
4080 zzz   17185
4080 xxx   17343
4080 rep   17287
4080       17121
i          10061
h          7622
g          7574
f          7571
e          10560
d          10719
c          11311
b          11273
a          11340
SE +/- 83.38, N = 3; SE +/- 75.16, N = 15; SE +/- 72.34, N = 3; SE +/- 62.67, N = 3
1. (CXX) g++ options: -O3

VkFFT 1.2.31 - Test: FFT + iFFT C2C 1D batched in double precision
OpenBenchmarking.org - Benchmark Score, More Is Better
Run        Score
nv 4090    54950
4090 rep   55383
4090       55214
3090 rep   31122
3090       30945
4080 zzz   35058
4080 xxx   35071
4080 rep   35038
4080       34974
i          14780
h          10572
g          10548
f          10561
e          12168
d          12143
c          20847
b          20822
a          20816
SE +/- 14.62, N = 3; SE +/- 12.42, N = 3; SE +/- 10.58, N = 3; SE +/- 11.67, N = 3
1. (CXX) g++ options: -O3

VkFFT 1.2.31 - Test: FFT + iFFT C2C 1D batched in single precision
OpenBenchmarking.org - Benchmark Score, More Is Better
Run        Score
nv 4090    152170
4090 rep   153939
4090       153896
3090 rep   141437
3090       141357
4080 zzz   104543
4080 xxx   104528
4080 rep   104491
4080       104556
i          69738
h          56431
g          56455
f          56476
e          42651
d          42645
c          47971
b          47948
a          47887
SE +/- 25.50, N = 3; SE +/- 1.67, N = 3; SE +/- 2.73, N = 3; SE +/- 9.54, N = 3
1. (CXX) g++ options: -O3

VkFFT 1.2.31 - Test: FFT + iFFT C2C multidimensional in single precision
OpenBenchmarking.org - Benchmark Score, More Is Better
Run        Score
nv 4090    82875
4090 rep   80999
4090       81406
3090 rep   54814
3090       51005
4080 zzz   70040
4080 xxx   67887
4080 rep   70068
4080       65869
i          34686
g          26541
f          26238
e          37090
d          36328
c          32812
b          32751
a          33001
SE +/- 555.86, N = 3; SE +/- 437.33, N = 3; SE +/- 116.12, N = 3; SE +/- 57.83, N = 3
1. (CXX) g++ options: -O3

VkFFT 1.2.31 - Test: FFT + iFFT C2C Bluestein benchmark in double precision
OpenBenchmarking.org - Benchmark Score, More Is Better
Run        Score
nv 4090    8132
4090 rep   8119
4090       8039
3090 rep   4289
3090       4282
4080 zzz   5584
4080 xxx   5587
4080 rep   5583
4080       5579
i          2417
g          1818
f          1814
e          2343
d          2346
c          4670
b          4695
a          4717
SE +/- 11.20, N = 3; SE +/- 4.37, N = 3; SE +/- 0.33, N = 3
1. (CXX) g++ options: -O3

VkFFT 1.2.31 - Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling
OpenBenchmarking.org - Benchmark Score, More Is Better
Run        Score
nv 4090    155148
4090 rep   155936
4090       152656
3090 rep   143956
3090       143969
4080 zzz   105926
4080 xxx   106099
4080 rep   106205
4080       106210
i          71163
g          57094
f          57110
e          43365
d          43365
c          50596
b          50643
a          50504
SE +/- 2.08, N = 3; SE +/- 2.33, N = 3; SE +/- 8.89, N = 3
1. (CXX) g++ options: -O3

vkpeak 20230730 - fp32-scalar
OpenBenchmarking.org - GFLOPS, More Is Better
Run        GFLOPS
3090 rep   20925.30
3090       21269.72
h          6810.73
g          6812.99
f          6837.94
e          8515.58
d          8531.96
c          12860.56
b          12807.06
a          13190.09
SE +/- 0.30, N = 3; SE +/- 16.18, N = 3; SE +/- 4.18, N = 3

vkpeak 20230730 - fp32-vec4
OpenBenchmarking.org - GFLOPS, More Is Better
Run        GFLOPS
3090 rep   27807.58
3090       27797.80
h          9036.17
g          9002.59
f          9006.57
e          11231.72
d          11251.17
c          12822.01
b          12808.59
a          12730.08
SE +/- 2.57, N = 3; SE +/- 19.37, N = 3; SE +/- 1.81, N = 3

vkpeak 20230730 - fp16-scalar
OpenBenchmarking.org - GFLOPS, More Is Better
Run        GFLOPS
3090 rep   20953.30
3090       20845.09
h          6838.32
g          6811.35
f          6812.52
e          8397.80
d          8412.33
c          13136.79
b          13145.19
a          13154.15
SE +/- 5.09, N = 3; SE +/- 13.46, N = 3; SE +/- 4.01, N = 3

vkpeak 20230730 - fp16-vec4
OpenBenchmarking.org - GFLOPS, More Is Better
Run        GFLOPS
3090 rep   41188.02
3090       41149.10
h          13490.24
g          13438.47
f          13440.97
e          16865.29
d          16864.47
c          23387.26
b          23390.44
a          23232.42
SE +/- 0.36, N = 3; SE +/- 0.37, N = 3; SE +/- 5.96, N = 3

vkpeak 20230730 - fp64-scalar
OpenBenchmarking.org - GFLOPS, More Is Better
Run        GFLOPS
3090 rep   653.63
3090       653.13
h          213.37
g          213.96
f          214.17
e          267.41
d          267.43
c          839.01
b          839.20
a          841.40
SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.22, N = 3

vkpeak 20230730 - fp64-vec4
OpenBenchmarking.org - GFLOPS, More Is Better
Run    GFLOPS
3090   653.15
h      210.96
g      213.95
f      214.23
e      267.25
d      267.74
c      836.16
b      836.55
a      841.80
SE +/- 0.00, N = 3; SE +/- 0.48, N = 3; SE +/- 0.32, N = 3

vkpeak 20230730 - int32-scalar
OpenBenchmarking.org - GIOPS, More Is Better
Run        GIOPS
3090 rep   20767.64
3090       20909.02
h          6800.60
g          6824.29
f          6827.92
e          8505.20
d          8520.02
c          2269.06
b          2269.25
a          2272.62
SE +/- 0.03, N = 3; SE +/- 15.02, N = 3; SE +/- 0.34, N = 3

vkpeak 20230730 - int32-vec4
OpenBenchmarking.org - GIOPS, More Is Better
Run        GIOPS
3090 rep   20517.68
3090       20820.09
h          6772.98
g          6794.92
f          6800.17
e          8465.71
d          8465.82
c          2638.69
b          2640.08
a          2658.73
SE +/- 0.05, N = 3; SE +/- 0.19, N = 3; SE +/- 0.26, N = 3

vkpeak 20230730 - int16-scalar
OpenBenchmarking.org - GIOPS, More Is Better
Run        GIOPS
3090 rep   13608.57
3090       13710.88
h          4495.98
g          4479.22
f          4480.59
e          5675.99
d          5676.02
c          13063.86
b          13070.81
a          13102.75
SE +/- 0.02, N = 3; SE +/- 0.09, N = 3; SE +/- 1.30, N = 3

vkpeak 20230730 - int16-vec4
OpenBenchmarking.org - GIOPS, More Is Better
Run        GIOPS
3090 rep   16881.47
3090       16886.66
h          5978.38
g          5956.38
f          5959.75
e          7336.25
d          7352.85
c          23385.44
b          23396.59
a          23123.77
SE +/- 0.31, N = 3; SE +/- 17.33, N = 3; SE +/- 21.55, N = 3

VkResample 1.0 - Upscale: 2x - Precision: Single
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result
nv 4090       8.967
4090 rep      8.962
4090          9.284
RTX 3070 Ti   27.183
3070          22.064
3090 rep      10.428
3090          10.399
4080 zzz      13.126
4080 xxx      13.137
4080 rep      13.136
4080          13.136
i             20.930
g             26.769
f             26.738
e             32.850
d             32.855
c             11.688
b             11.690
a             11.686
SE +/- 0.029, N = 3; SE +/- 0.000, N = 3; SE +/- 0.004, N = 3; SE +/- 0.001, N = 3
1. (CXX) g++ options: -O3

VkResample 1.0 - Upscale: 2x - Precision: Double
OpenBenchmarking.org - ms, Fewer Is Better
Run           Result
nv 4090       172.89
4090 rep      173.04
4090          172.88
RTX 3070 Ti   24.81
3070          24.75
3090 rep      371.42
3090          371.70
4080 zzz      288.03
4080 xxx      288.04
4080 rep      288.17
4080          288.20
i             500.01
g             500.01
f             500.01
e             500.02
d             500.01
SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3

Phoronix Test Suite v10.8.5