vulkan-benchmarks

AMD Ryzen 9 7950X 16-Core testing with a ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) and NVIDIA GeForce RTX 4090 24GB on Ubuntu 23.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2308069-PTS-VULKANBE16&export=txt&grt&sro&rro.

vulkan-benchmarks ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionDisplay Driverabcdefghi40804080 rep4080 xxx4080 zzz30903090 rep3070RTX 3070 Ti40904090 repnv 4090AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)AMD Device 14d832GBWestern Digital WD_BLACK SN850X 1000GB + 4001GBAMD Radeon RX 6700 XT (2855/1000MHz)AMD Navi 21/23ASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 23.046.4.6-060406-generic (x86_64)GNOME Shell 44.2X Server 1.21.1.7 + Wayland4.6 Mesa 23.3~git2307260600.87109c~oibaf~l (git-87109c3 2023-07-26 lunar-oibaf-ppa) (LLVM 15.0.7 DRM 3.52)GCC 12.2.0ext43840x2160MSI NVIDIA GeForce RTX 4060 8GBNVIDIA Device 22beX Server 1.21.1.7NVIDIA 535.86.054.6.0eVGA NVIDIA GeForce RTX 3060 12GBNVIDIA GA106 HD AudioNVIDIA GeForce RTX 3060 Ti 8GBNVIDIA GA104 HD Audio2560x1440NVIDIA GeForce RTX 4080 16GBNVIDIA Device 22bb3840x2160NVIDIA GeForce RTX 3090 24GBNVIDIA GA102 HD AudioNVIDIA GeForce RTX 3070 8GBNVIDIA GA104 HD Audio2560x1440NVIDIA GeForce RTX 3070 Ti 8GBNVIDIA GeForce RTX 4090 24GBNVIDIA AD102 HD Audio3840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- a: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203- b: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203- c: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203- d: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- e: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- f: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- g: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- h: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- i: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 4080: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 4080 rep: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 4080 xxx: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 4080 zzz: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 3090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 3090 rep: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 3070: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- RTX 3070 Ti: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 4090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- 4090 rep: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- nv 4090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203Graphics Details- a: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-D5121100-101- b: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-D5121100-101- c: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-D5121100-101- d: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3- e: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3- f: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46- g: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46- h: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46- i: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2c- 4080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04- 4080 rep: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04- 4080 xxx: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04- 4080 zzz: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04- 3090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02- 3090 rep: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02- 3070: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2b- RTX 3070 Ti: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.5b.00.02- 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01- 4090 rep: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01- nv 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

vulkan-benchmarks ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - vision_transformerncnn: CPU - FastestDetncnn: CPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - FastestDetncnn: CPU-v3-v3-v3 - mobilenetncnn: CPU-v3-v3-v3-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3-v3-v3-v3 - mobilenet-v3ncnn: CPU-v3-v3-v3 - shufflenet-v2ncnn: CPU-v3-v3-v3 - mnasnetncnn: CPU-v3-v3-v3 - efficientnet-b0ncnn: CPU-v3-v3-v3 - blazefacencnn: CPU-v3-v3-v3 - googlenetncnn: CPU-v3-v3-v3 - vgg16ncnn: CPU-v3-v3-v3 - resnet18ncnn: CPU-v3-v3-v3 - alexnetncnn: CPU-v3-v3-v3 - resnet50ncnn: CPU-v3-v3-v3 - yolov4-tinyncnn: CPU-v3-v3-v3 - squeezenet_ssdncnn: CPU-v3-v3-v3 - regnety_400mncnn: CPU-v3-v3-v3 - vision_transformerncnn: CPU-v3-v3-v3 - FastestDetncnn: Vulkan GPU-v3-v3-v3 - mobilenetncnn: Vulkan GPU-v3-v3-v3-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3-v3-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v3-v3-v3 - shufflenet-v2ncnn: Vulkan GPU-v3-v3-v3 - mnasnetncnn: Vulkan GPU-v3-v3-v3 - efficientnet-b0ncnn: Vulkan GPU-v3-v3-v3 - blazefacencnn: Vulkan GPU-v3-v3-v3 - googlenetncnn: Vulkan GPU-v3-v3-v3 - vgg16ncnn: Vulkan GPU-v3-v3-v3 - resnet18ncnn: Vulkan GPU-v3-v3-v3 - alexnetncnn: Vulkan GPU-v3-v3-v3 - resnet50ncnn: Vulkan GPU-v3-v3-v3 - yolov4-tinyncnn: Vulkan GPU-v3-v3-v3 - squeezenet_ssdncnn: Vulkan GPU-v3-v3-v3 - regnety_400mncnn: Vulkan GPU-v3-v3-v3 - vision_transformerncnn: Vulkan GPU-v3-v3-v3 - FastestDetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - mobilenetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - shufflenet-v2ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - mnasnetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - efficientnet-b0ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - blazefacencnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - googlenetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - vgg16ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - resnet18ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - alexnetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - resnet50ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - yolov4-tinyncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - squeezenet_ssdncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - regnety_400mncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - vision_transformerncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3 - FastestDetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - shufflenet-v2ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mnasnetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - efficientnet-b0ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - blazefacencnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - googlenetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vgg16ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet18ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - alexnetncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet50ncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - yolov4-tinyncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - squeezenet_ssdncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - regnety_400mncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vision_transformerncnn: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - FastestDetncnn: CPU-v3-v3-v3-v3-v3-v3 - mobilenetncnn: CPU-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3ncnn: CPU-v3-v3-v3-v3-v3-v3 - shufflenet-v2ncnn: CPU-v3-v3-v3-v3-v3-v3 - mnasnetncnn: CPU-v3-v3-v3-v3-v3-v3 - efficientnet-b0ncnn: CPU-v3-v3-v3-v3-v3-v3 - blazefacencnn: CPU-v3-v3-v3-v3-v3-v3 - googlenetncnn: CPU-v3-v3-v3-v3-v3-v3 - vgg16ncnn: CPU-v3-v3-v3-v3-v3-v3 - resnet18ncnn: CPU-v3-v3-v3-v3-v3-v3 - alexnetncnn: CPU-v3-v3-v3-v3-v3-v3 - resnet50ncnn: CPU-v3-v3-v3-v3-v3-v3 - yolov4-tinyncnn: CPU-v3-v3-v3-v3-v3-v3 - squeezenet_ssdncnn: CPU-v3-v3-v3-v3-v3-v3 - regnety_400mncnn: CPU-v3-v3-v3-v3-v3-v3 - vision_transformerncnn: CPU-v3-v3-v3-v3-v3-v3 - FastestDetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - shufflenet-v2ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mnasnetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - efficientnet-b0ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - blazefacencnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - googlenetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vgg16ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet18ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - alexnetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet50ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - yolov4-tinyncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - squeezenet_ssdncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - regnety_400mncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vision_transformerncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - FastestDetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mobilenet-v3ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - shufflenet-v2ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - mnasnetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - efficientnet-b0ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - blazefacencnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - googlenetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vgg16ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet18ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - alexnetncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - resnet50ncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - yolov4-tinyncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - squeezenet_ssdncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - regnety_400mncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - vision_transformerncnn: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - FastestDetvkfft: FFT + iFFT R2C / C2Rvkfft: FFT + iFFT C2C 1D batched in half precisionvkfft: FFT + iFFT C2C Bluestein in single precisionvkfft: FFT + iFFT C2C 1D batched in double precisionvkfft: FFT + iFFT C2C 1D batched in single precisionvkfft: FFT + iFFT C2C multidimensional in single precisionvkfft: FFT + iFFT C2C Bluestein benchmark in double precisionvkfft: FFT + iFFT C2C 1D batched in single precision, no reshufflingvkpeak: fp32-scalarvkpeak: fp32-vec4vkpeak: fp16-scalarvkpeak: fp16-vec4vkpeak: fp64-scalarvkpeak: fp64-vec4vkpeak: int32-scalarvkpeak: int32-vec4vkpeak: int16-scalarvkpeak: int16-vec4vkresample: 2x - Singlevkresample: 2x - Doubleabcdefghi40804080 rep4080 xxx4080 zzz30903090 rep3070RTX 3070 Ti40904090 repnv 40908.053.163.342.973.901.387.9423.755.294.4110.2012.907.078.1832.493.623.188.053.173.173.352.983.861.387.9023.515.284.3110.0112.847.098.1631.884.142105915971134020816478873300147175050413190.0912730.0813154.1523232.42841.40841.802272.622658.7313102.7523123.7711.6867.973.163.342.973.851.387.8223.495.24.3210.0112.747.068.1831.954.058.043.143.332.953.821.377.8523.565.234.331012.877.078.2131.854.078.013.153.163.332.973.821.377.8423.55.214.2910.0112.987.038.0531.654.0783.153.163.332.963.851.377.9723.425.424.429.8712.777.148.2731.714.0642163918121127320822479483275146955064312807.0612808.5913145.1923390.44839.2836.552269.252640.0813070.8123396.5911.698.023.183.352.993.881.397.9323.455.244.3110.1112.817.18.2731.774.113.178.033.133.23.322.963.831.377.823.545.214.331012.817.06831.794.0983.143.163.342.973.891.387.8823.995.264.2810.3312.897.048.1431.784.087.953.153.173.332.963.821.367.8323.545.234.310.0312.867.077.9831.663.6943021917441131120847479713281246705059612860.5612822.0113136.7923387.26839.01836.162269.062638.6913063.8623385.4411.6888.103.173.352.983.851.387.8523.515.234.3010.0012.957.098.2332.434.083.178.023.163.173.352.973.871.387.8523.565.234.3110.1012.857.088.1732.124.113539985181107191214342645363282346433658531.9611251.178412.3316864.47267.43267.748520.028465.825676.027352.8532.855500.0148.043.143.183.332.963.841.387.8523.605.224.3110.1012.877.058.1031.934.083530485191105601216842651370902343433658515.5811231.728397.8016865.29267.41267.258505.208465.715675.997336.2532.850500.0168.453.153.552.973.871.377.9224.195.694.3611.0513.327.238.0833.564.228.273.133.143.42.973.861.388.1524.555.484.6410.2613.177.088.3432.924.248.563.163.153.43.124.041.438.0724.456.134.8311.0513.076.978.3433.473.858.653.163.153.332.963.851.377.9424.125.34.3510.2514.347.098.533.364.22659310414675711056156476262381814571106837.949006.576812.5213440.97214.17214.236827.926800.174480.595959.7526.738500.0122.743.163.5933.911.387.9823.785.554.3210.7217.237.138.332.732.573.148.173.183.352.983.861.418.9624.26.224.8710.3413.647.18.3632.424.078.983.153.163.383.054.141.379.1524.925.484.7111.2513.087.268.0733.393.978.043.143.153.332.953.851.377.9423.825.34.3510.1812.897.197.9932.384.068.53.173.163.342.973.841.387.9624.045.284.3510.3313.147.148.3833.323.922663810417175741054856455265411818570946832.749003.126810.5513438.4213.96213.956824.216795.394478.415956.2426.769500.01126524104298762210572564316810.739036.176838.3213490.24213.37210.966800.66772.984495.985978.388.373.525.032.744.051.410.330.965.65.312.0914.658.969.9437.82.663.2610.43.294.873.493.25.881.48.7527.835.826.5312.9615.167.469.8836.425.1410.083.33.293.523.394.681.2810.1729.125.865.0114.0515.118.168.2136.555.699.053.283.263.362.994.211.2510.1929.075.854.9911.1515.438.337.9938.334.4310.023.293.263.433.074.191.4110.4727.435.885.113.113.777.218.4638.013.83337271322701006114780697383468624177116320.93500.0068.733.283.413.053.991.48.4225.375.694.7511.1613.857.738.2435.074.423.248.443.263.433.074.011.418.42255.674.6210.8113.797.588.3935.564.28.433.313.283.463.084.021.448.4525.15.74.6611.1113.817.648.3334.24.28.433.293.273.443.094.051.428.4925.045.654.6910.9513.797.668.4534.134.28.843.283.463.064.061.438.425.485.614.6111.413.867.668.6134.914.288.433.293.263.483.14.041.448.7925.675.924.9811.4813.937.718.6735.64.1966473211076171213497410455665869557910621013.136288.2018.413.283.433.064.091.428.5226.115.684.6711.7614.037.868.6735.284.343.268.573.293.443.084.051.418.425.045.614.6510.8413.677.648.5635.074.218.483.273.433.094.021.428.5224.915.634.6510.7913.557.598.5734.274.188.463.33.273.443.074.041.458.5825.045.694.6910.8413.687.638.4434.294.098.43.273.243.393.033.981.418.4925.055.614.7210.813.557.558.2434.14.148.383.273.313.443.064.011.428.4225.565.674.6411.0713.737.628.3533.934.178.453.33.283.473.094.071.438.5225.015.644.6810.8613.717.678.7234.224.268279211058172873503810449170068558310620513.136288.1668.373.283.453.064.041.428.3825.035.564.6810.8213.67.628.4534.274.173.278.373.263.53.074.041.428.4225.015.624.6810.9413.657.628.5634.194.28.443.33.313.473.074.061.428.5255.664.6510.9113.697.648.7534.374.198.883.43.333.513.134.221.428.9926.085.895.2111.513.957.78.5835.44.318.463.273.263.433.054.021.428.4325.45.674.6710.9113.627.678.5234.234.178.313.143.053.342.983.971.318.2625.335.654.7111.2213.527.278.2533.93.758.343.23.083.434.011.328.3225.445.784.6911.2613.637.278.3834.143.869068210713173433507110452867887558710609913.137288.0398.383.253.423.043.991.48.425.45.634.6911.113.637.558.3734.14.163.248.473.293.283.463.064.041.428.4125.825.594.6811.0713.87.638.5834.324.798.43.283.273.433.064.031.418.5525.455.714.6711.2113.837.358.4734.474.048.383.233.23.373.013.951.398.3725.265.594.6511.0913.627.518.134.054.128.463.283.243.433.084.011.428.4225.165.64.6810.9113.617.628.4934.14.29.193.283.263.443.084.051.418.5526.095.744.712.515.268.068.3735.364.618.253.163.063.362.963.951.318.2925.265.774.6611.113.427.258.3434.473.8267689210991171853505810454370040558410592613.126288.0288.63.173.342.993.871.387.8623.435.214.310.314.267.528.2533.014.213.188.073.183.193.392.993.881.397.8723.555.274.3510.112.887.168.3831.944.118.063.143.153.322.953.831.367.8323.55.194.3110.0512.877.047.9531.894.0483.163.342.973.851.367.8223.55.234.39.9712.887.048.0131.864.048.113.153.163.332.963.831.367.8623.555.194.3110.3813.17.047.9933.224.038.033.123.133.322.943.881.387.8423.515.214.3210.0712.977.058.3331.943.838.073.163.362.983.881.397.923.585.24.3310.0312.827.128.2232.14.18.013.153.163.362.973.861.397.8323.55.24.310.0312.867.058.232.164.0855347255207144063094514135751005428214396921269.7227797.820845.0941149.1653.13653.1520909.0220820.0913710.8816886.6610.399371.6998.013.173.352.973.861.387.923.435.294.319.9512.777.128.2431.84.083.198.033.173.163.362.973.861.377.8223.485.24.3110.0612.837.068.0231.914.088.033.173.153.332.963.851.377.8523.545.24.310.0712.827.098.0931.934.118.013.173.153.362.973.851.387.8523.435.244.310.0112.847.088.2532.114.18.043.193.372.993.871.397.8923.525.224.3110.0412.867.098.3432.094.088.053.173.183.342.973.851.377.8623.385.24.39.9812.97.088.0731.974.078.063.163.173.332.973.851.387.9123.45.34.3110.0612.817.098.0331.854.078.053.173.362.983.851.387.8223.475.24.310.0412.897.078.1932.134.18.033.153.193.322.963.841.387.8623.725.274.3110.2712.927.078.0631.944.0754432265171144493112214143754814428914395620708.8427393.220640.6740876.12648.7120613.4120517.4513606.7916878.210.428371.42217.819.676.826.889.232.9818.2556.6414.039.6221.527.6613.21875.348.657.5221.117.816.67.075.098.993.9819.4948.2912.6810.8823.4828.5915.8216.2270.768.4117.829.195.388.136.879.013.031749.7511.141124.0729.3417.7519.6681.779.1816.347.248.064.896.027.813.1820.7255.4213.389.8623.1129.4916.1517.2373.518.6318.398.356.5684.598.411.7718.655.4812.1410.0823.5929.815.417.8871.086.9317.095.465.995.596.069.812.9916.9749.711.311.8923.4428.7318.8317.6170.296.7116.527.226.437.816.079.192.5319.250.3212.6410.5922.1928.4114.2718.2565.417.1218.545.495.976.38.159.533.5718.6651.2813.3410.6923.5426.3315.4618.2469.484.4817.065.927.345.898.556.632.6918.853.4812.1311.4322.1529.3815.3217.0270.537.2322.06424.7459.433.763.773.264.721.609.6928.366.085.4912.7315.208.138.8337.913.943.619.353.563.643.983.404.531.609.8729.066.285.2512.6015.548.479.0738.034.269.623.413.764.093.114.781.799.6528.406.185.5512.1115.428.399.0237.864.339.623.663.653.953.374.731.719.8628.406.235.6712.3515.448.298.8938.294.269.623.663.623.753.344.601.499.8428.636.575.5312.7315.218.289.1937.884.419.523.663.443.893.104.371.349.9028.536.405.3412.4215.008.659.0538.274.3210.033.913.704.023.244.742.489.6827.986.226.1712.8114.577.578.4238.044.189.983.693.523.923.254.551.519.5828.536.695.4112.5215.568.319.1038.324.2510.023.833.243.483.124.171.409.9727.865.946.2513.1514.647.459.1438.504.1427.18324.80510.083.33.553.124.231.2710.2727.7565.1414.113.687.868.1338.255.483.5310.553.33.283.453.184.341.3910.6228.825.694.6414.1313.977.838.6438.764.398.963.463.255.183.194.091.179.9728.555.814.9411.3915.39.328.1338.792.938.814.993.123.3434.181.338.8728.215.976.1114.5815.447.939.639.013.948.963.413.345.173.14.241.4311.329.055.784.7211.5315.559.2810.139.064.139.163.483.623.524.934.151.429.0527.447.524.6712.415.959.819.8738.624.459.043.363.333.485.194.471.38.3830.167.744.9911.7215.859.5110.0938.762.858.465.253.363.473.194.141.458.9127.317.784.9412.9815.697.410.0538.822.8210.564.753.363.563.234.631.358.5527.326.965.141316.057.4310.1139.354.628435129034220373552141538968140680391526569.284172.88310.753.33.473.184.31.4510.8629.246.775.2313.7316.797.988.7837.055.273.3110.233.313.33.493.154.091.410.6527.046.016.7913.0815.728.228.4837.594.598.743.453.533.593.234.341.349.2927.257.755.2713.8215.349.317.1539.124.169.543.314.93.43.16.281.418.927.595.816.5813.5716.397.8110.3438.734.168.373.343.335.273.134.041.4610.3829.126.055.3312.1715.459.168.6438.173.969.023.363.343.484.994.411.4110.3929.175.95.2511.5115.49.4610.6939.034.1110.613.443.33.425.114.351.428.9730.748.145.4512.4713.889.3810.2338.793.128.833.63.445.183.284.441.3810.1829.355.845.1611.2416.69.348.4538.693.918.223.383.355.233.124.11.4210.4729.855.875.3410.9615.419.448.738.654.598132928765120404553831539398099981191559368.962173.0438.153.63.454.774.371.338.9329.547.825.1811.4115.49.1110.1738.94.513.478.453.433.363.514.74.11.48.729.297.445.212.4515.269.119.8139.043.938.913.393.353.464.614.041.169.0227.047.614.6713.6815.629.379.5538.994.0610.544.452.613.172.545.261.0710.0127.775.846.1113.1316.617.727.7338.585.929.415.13.263.513.164.121.188.6127.898.165.1413.6317.39.2110.0939.182.818.933.423.173.53.074.11.268.3528.147.384.6913.1315.557.0210.0338.462.6412.123.294.973.323.125.941.4210.7527.616.076.6213.2917.677.488.2538.953.9310.153.274.813.373.15.882.918.8529.45.976.5413.4615.677.728.3738.584.0110.643.294.963.433.15.821.410.1427.255.586.3213.2516.38.268.3437.135.868488729276820601549501521708287581321551488.967172.887OpenBenchmarking.org

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mobilenetnv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.22, N = 158.158.3722.748.458.108.027.978.059.4310.7510.088.388.378.418.738.018.6017.81MIN: 7.73 / MAX: 9.34MIN: 8.15 / MAX: 9.75MIN: 8.24 / MAX: 1264.67MIN: 8.37 / MAX: 9.44MIN: 7.94 / MAX: 14.4MIN: 7.98 / MAX: 8.33MIN: 7.94 / MAX: 8.26MIN: 7.97 / MAX: 9.07MIN: 7.95 / MAX: 398.1MIN: 8.24 / MAX: 287.14MIN: 8.1 / MAX: 118.32MIN: 7.94 / MAX: 10.16MIN: 7.96 / MAX: 9.72MIN: 8.14 / MAX: 11.03MIN: 8.15 / MAX: 10.96MIN: 7.96 / MAX: 9.85MIN: 8.5 / MAX: 13.72MIN: 8.05 / MAX: 159.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v2-v2 - Model: mobilenet-v2nv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.17, N = 153.603.523.163.153.173.183.163.163.764.743.303.253.283.283.283.173.179.67MIN: 3.43 / MAX: 4.62MIN: 3.29 / MAX: 19.18MIN: 3.11 / MAX: 3.83MIN: 3.1 / MAX: 3.65MIN: 3.1 / MAX: 8.86MIN: 3.13 / MAX: 3.84MIN: 3.11 / MAX: 3.61MIN: 3.1 / MAX: 3.8MIN: 2.6 / MAX: 364.73MIN: 3.09 / MAX: 140.79MIN: 3.11 / MAX: 4.81MIN: 3.09 / MAX: 4.51MIN: 3.1 / MAX: 4.05MIN: 3.11 / MAX: 4MIN: 3.11 / MAX: 3.88MIN: 3.11 / MAX: 4.94MIN: 3.12 / MAX: 4.05MIN: 3.19 / MAX: 225.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: shufflenet-v2nv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.19, N = 153.455.033.593.553.353.353.343.343.773.513.553.423.453.433.413.353.346.82MIN: 3.32 / MAX: 4.91MIN: 3.07 / MAX: 228.55MIN: 3.3 / MAX: 25.28MIN: 3.27 / MAX: 22.86MIN: 3.3 / MAX: 3.82MIN: 3.31 / MAX: 3.8MIN: 3.31 / MAX: 3.77MIN: 3.3 / MAX: 3.85MIN: 3.02 / MAX: 511.95MIN: 3.38 / MAX: 5.4MIN: 3.39 / MAX: 5.48MIN: 3.28 / MAX: 4.19MIN: 3.32 / MAX: 3.85MIN: 3.3 / MAX: 4.15MIN: 3.28 / MAX: 4.87MIN: 3.31 / MAX: 3.68MIN: 3.3 / MAX: 4.19MIN: 3.16 / MAX: 64.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mnasnetnv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 154.772.743.002.972.982.992.972.973.263.223.123.043.063.063.052.972.996.88MIN: 3.07 / MAX: 97.57MIN: 2.62 / MAX: 4.22MIN: 2.96 / MAX: 3.68MIN: 2.93 / MAX: 3.95MIN: 2.94 / MAX: 3.83MIN: 2.96 / MAX: 3.44MIN: 2.93 / MAX: 3.45MIN: 2.92 / MAX: 3.48MIN: 2.46 / MAX: 277.54MIN: 3.11 / MAX: 3.71MIN: 2.98 / MAX: 3.79MIN: 2.91 / MAX: 4.47MIN: 2.94 / MAX: 4.45MIN: 2.94 / MAX: 4.51MIN: 2.92 / MAX: 3.82MIN: 2.93 / MAX: 3.28MIN: 2.95 / MAX: 3.88MIN: 3.05 / MAX: 110.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: efficientnet-b0nv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.19, N = 154.374.053.913.873.853.883.853.904.724.304.233.994.044.093.993.863.879.23MIN: 4.15 / MAX: 5.96MIN: 3.78 / MAX: 5.45MIN: 3.85 / MAX: 4.64MIN: 3.81 / MAX: 4.97MIN: 3.81 / MAX: 4.46MIN: 3.84 / MAX: 4.41MIN: 3.81 / MAX: 4.42MIN: 3.82 / MAX: 4.51MIN: 3.37 / MAX: 486.93MIN: 4.08 / MAX: 5.07MIN: 3.98 / MAX: 12.23MIN: 3.8 / MAX: 5.69MIN: 3.83 / MAX: 5.71MIN: 3.86 / MAX: 5.59MIN: 3.79 / MAX: 5.83MIN: 3.81 / MAX: 4.75MIN: 3.83 / MAX: 4.69MIN: 3.43 / MAX: 156.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: blazefacenv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030700.67051.3412.01152.6823.3525SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.14, N = 151.331.401.381.371.381.391.381.381.601.451.271.401.421.421.401.381.382.98MIN: 1.27 / MAX: 1.77MIN: 1.34 / MAX: 2MIN: 1.36 / MAX: 1.62MIN: 1.34 / MAX: 2.11MIN: 1.35 / MAX: 2.05MIN: 1.36 / MAX: 1.53MIN: 1.35 / MAX: 1.67MIN: 1.35 / MAX: 2.06MIN: 0.95 / MAX: 433.24MIN: 1.38 / MAX: 2.96MIN: 1.21 / MAX: 1.95MIN: 1.34 / MAX: 2.1MIN: 1.36 / MAX: 2.02MIN: 1.35 / MAX: 1.88MIN: 1.34 / MAX: 2.15MIN: 1.36 / MAX: 1.71MIN: 1.35 / MAX: 2.23MIN: 1.29 / MAX: 144.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: googlenetnv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.22, N = 158.9310.307.987.927.857.937.827.949.6910.8610.278.408.388.528.427.907.8618.25MIN: 8.27 / MAX: 10.68MIN: 8.19 / MAX: 349.57MIN: 7.86 / MAX: 8.78MIN: 7.8 / MAX: 8.96MIN: 7.71 / MAX: 8.83MIN: 7.82 / MAX: 8.91MIN: 7.73 / MAX: 8.65MIN: 7.71 / MAX: 8.73MIN: 7.29 / MAX: 407.61MIN: 8.12 / MAX: 189.87MIN: 7.95 / MAX: 115.68MIN: 7.72 / MAX: 10.5MIN: 7.72 / MAX: 10.05MIN: 7.84 / MAX: 10.21MIN: 7.75 / MAX: 9.96MIN: 7.79 / MAX: 8.74MIN: 7.76 / MAX: 8.74MIN: 7.5 / MAX: 267.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vgg16nv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701326395265SE +/- 0.05, N = 3SE +/- 0.30, N = 3SE +/- 0.23, N = 1529.5430.9623.7824.1923.5123.4523.4923.7528.3629.2427.7525.4025.0326.1125.3723.4323.4356.64MIN: 24.77 / MAX: 364.86MIN: 25.92 / MAX: 328.63MIN: 23.52 / MAX: 24.89MIN: 23.99 / MAX: 30.98MIN: 23.19 / MAX: 24.68MIN: 23.26 / MAX: 24.51MIN: 23.36 / MAX: 24.62MIN: 23.31 / MAX: 25.12MIN: 24.13 / MAX: 449.57MIN: 26.51 / MAX: 270.71MIN: 24.58 / MAX: 282.59MIN: 24.09 / MAX: 32.86MIN: 23.85 / MAX: 28.9MIN: 24.54 / MAX: 30.29MIN: 24.26 / MAX: 36.52MIN: 23.23 / MAX: 24.39MIN: 23.2 / MAX: 24.1MIN: 25.75 / MAX: 367.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet18nv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.17, N = 157.825.605.555.695.235.245.205.296.088.076.005.635.565.685.695.295.2114.03MIN: 5.54 / MAX: 303.05MIN: 5.13 / MAX: 6.83MIN: 5.19 / MAX: 25.4MIN: 5.22 / MAX: 92.59MIN: 5.1 / MAX: 6.28MIN: 5.15 / MAX: 6.09MIN: 5.1 / MAX: 5.9MIN: 5.09 / MAX: 6.29MIN: 4.97 / MAX: 245.95MIN: 5.86 / MAX: 121.03MIN: 5.47 / MAX: 7.29MIN: 5.08 / MAX: 7.55MIN: 5.09 / MAX: 6.84MIN: 5.17 / MAX: 7.45MIN: 5.16 / MAX: 7.68MIN: 5.18 / MAX: 6.19MIN: 5.09 / MAX: 6.04MIN: 5 / MAX: 303.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: alexnetnv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.21, N = 145.185.304.324.364.304.314.324.415.495.235.144.694.684.674.754.314.309.62MIN: 4.75 / MAX: 7.12MIN: 4.92 / MAX: 7.18MIN: 4.25 / MAX: 5.17MIN: 4.29 / MAX: 5.7MIN: 4.23 / MAX: 5.32MIN: 4.26 / MAX: 4.98MIN: 4.26 / MAX: 5.15MIN: 4.24 / MAX: 5.16MIN: 4.26 / MAX: 363.39MIN: 4.78 / MAX: 7.33MIN: 4.73 / MAX: 6.32MIN: 4.29 / MAX: 5.78MIN: 4.28 / MAX: 6.37MIN: 4.27 / MAX: 5.88MIN: 4.31 / MAX: 13.88MIN: 4.26 / MAX: 5.18MIN: 4.25 / MAX: 4.83MIN: 4.31 / MAX: 147.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet50nv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.01, N = 3SE +/- 0.23, N = 3SE +/- 0.26, N = 1511.4112.0910.7211.0510.0010.1110.0110.2012.7313.7314.1011.1010.8211.7611.169.9510.3021.50MIN: 10.57 / MAX: 12.22MIN: 11.16 / MAX: 13.48MIN: 10.1 / MAX: 108.3MIN: 10.14 / MAX: 162.88MIN: 9.86 / MAX: 11.02MIN: 9.95 / MAX: 16.18MIN: 9.85 / MAX: 11.06MIN: 9.84 / MAX: 12.48MIN: 10.18 / MAX: 541.92MIN: 10.4 / MAX: 137.78MIN: 10.27 / MAX: 287MIN: 10.2 / MAX: 13.06MIN: 9.9 / MAX: 12.26MIN: 10.68 / MAX: 44.94MIN: 10.29 / MAX: 15.03MIN: 9.85 / MAX: 10.72MIN: 9.82 / MAX: 17.56MIN: 10.24 / MAX: 116.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: yolov4-tinynv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070714212835SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.18, N = 1515.4014.6517.2313.3212.9512.8112.7412.9015.2016.7913.6813.6313.6014.0313.8512.7714.2627.66MIN: 12.35 / MAX: 321.43MIN: 12.44 / MAX: 202.68MIN: 12.99 / MAX: 196.66MIN: 12.95 / MAX: 35.49MIN: 12.75 / MAX: 18.88MIN: 12.74 / MAX: 13.2MIN: 12.66 / MAX: 13.28MIN: 12.69 / MAX: 15.88MIN: 12.69 / MAX: 431.37MIN: 14.1 / MAX: 273.41MIN: 12.83 / MAX: 14.63MIN: 12.77 / MAX: 15.36MIN: 12.8 / MAX: 16.23MIN: 13.15 / MAX: 15.97MIN: 12.84 / MAX: 16.75MIN: 12.7 / MAX: 13.02MIN: 14.17 / MAX: 14.53MIN: 12.74 / MAX: 294.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: squeezenet_ssdnv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.24, N = 159.118.967.137.237.097.107.067.078.137.987.867.557.627.867.737.127.5213.20MIN: 6.77 / MAX: 101.58MIN: 6.92 / MAX: 244.02MIN: 7.04 / MAX: 8.43MIN: 7.15 / MAX: 8.02MIN: 6.99 / MAX: 9.39MIN: 7.05 / MAX: 7.65MIN: 7.01 / MAX: 7.55MIN: 7.01 / MAX: 8.07MIN: 6.37 / MAX: 399.11MIN: 7.32 / MAX: 16.07MIN: 7.25 / MAX: 8.98MIN: 7 / MAX: 8.72MIN: 7.01 / MAX: 8.84MIN: 7.22 / MAX: 10.84MIN: 7.13 / MAX: 9.7MIN: 7.05 / MAX: 7.63MIN: 7.45 / MAX: 7.74MIN: 6.9 / MAX: 68.611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: regnety_400mnv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.19, N = 1510.179.948.308.088.238.278.188.188.839.668.138.378.458.678.248.248.2518.00MIN: 8.12 / MAX: 209.53MIN: 7.43 / MAX: 166.02MIN: 8.22 / MAX: 9.1MIN: 7.98 / MAX: 10.87MIN: 8.03 / MAX: 8.9MIN: 8.22 / MAX: 9.18MIN: 8.12 / MAX: 8.86MIN: 8.07 / MAX: 9.68MIN: 7.65 / MAX: 351.08MIN: 7.78 / MAX: 95.3MIN: 7.78 / MAX: 9.98MIN: 8.05 / MAX: 10.19MIN: 8.12 / MAX: 9.68MIN: 8.22 / MAX: 15.29MIN: 7.89 / MAX: 9.52MIN: 8.17 / MAX: 8.84MIN: 8.17 / MAX: 8.9MIN: 7.91 / MAX: 176.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vision_transformernv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307020406080100SE +/- 0.39, N = 3SE +/- 0.29, N = 3SE +/- 0.12, N = 1538.9037.8032.7333.5632.4331.7731.9532.4937.9137.8138.2534.1034.2735.2835.0731.8033.0175.34MIN: 34.2 / MAX: 300.84MIN: 33.74 / MAX: 321.51MIN: 31.44 / MAX: 81.32MIN: 32.98 / MAX: 51.93MIN: 31.56 / MAX: 37.69MIN: 31.61 / MAX: 35.68MIN: 31.79 / MAX: 32.33MIN: 31.67 / MAX: 40.11MIN: 32.08 / MAX: 541.11MIN: 32.66 / MAX: 453.44MIN: 33.04 / MAX: 447.7MIN: 32.65 / MAX: 37.64MIN: 32.82 / MAX: 39.79MIN: 33.9 / MAX: 38.67MIN: 33.14 / MAX: 43.26MIN: 31.66 / MAX: 32.23MIN: 32.88 / MAX: 33.42MIN: 38.72 / MAX: 418.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: FastestDetnv 4090igfdcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.01, N = 3SE +/- 0.45, N = 3SE +/- 0.23, N = 154.512.662.574.224.084.114.053.623.945.275.484.164.174.344.424.084.218.65MIN: 4.34 / MAX: 5.96MIN: 2.54 / MAX: 3.41MIN: 2.53 / MAX: 3.21MIN: 4.18 / MAX: 4.97MIN: 4.02 / MAX: 4.28MIN: 4.08 / MAX: 4.4MIN: 4.02 / MAX: 4.35MIN: 2.7 / MAX: 4.54MIN: 2.43 / MAX: 267.02MIN: 4.05 / MAX: 247.02MIN: 2.67 / MAX: 259.34MIN: 4 / MAX: 4.69MIN: 4.05 / MAX: 4.74MIN: 4.19 / MAX: 5.77MIN: 4.25 / MAX: 6.71MIN: 4.05 / MAX: 4.84MIN: 4.19 / MAX: 4.41MIN: 3.94 / MAX: 185.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3 - Model: mobilenet-v3nv 4090igdcaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.00, N = 3SE +/- 0.00, N = 2SE +/- 0.21, N = 143.473.263.143.173.173.183.613.413.533.243.273.263.243.193.187.52MIN: 3.32 / MAX: 4.91MIN: 3.14 / MAX: 3.9MIN: 3.1 / MAX: 3.81MIN: 3.12 / MAX: 3.96MIN: 3.15 / MAX: 3.74MIN: 3.14 / MAX: 3.82MIN: 2.51 / MAX: 502.85MIN: 3.27 / MAX: 5.24MIN: 3.39 / MAX: 4.31MIN: 3.11 / MAX: 4.47MIN: 3.13 / MAX: 3.85MIN: 3.09 / MAX: 3.96MIN: 3.09 / MAX: 4.73MIN: 3.15 / MAX: 3.72MIN: 3.14 / MAX: 4.14MIN: 2.94 / MAX: 2151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mobilenetnv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.25, N = 158.4510.408.178.278.048.028.038.048.059.3510.2310.558.478.378.578.448.038.0721.11MIN: 8.03 / MAX: 12.61MIN: 7.97 / MAX: 455.46MIN: 8.08 / MAX: 9.37MIN: 8.17 / MAX: 9.04MIN: 7.95 / MAX: 9.09MIN: 7.95 / MAX: 9.81MIN: 7.98 / MAX: 8.84MIN: 7.95 / MAX: 14.33MIN: 7.95 / MAX: 8.89MIN: 7.49 / MAX: 474.12MIN: 8.13 / MAX: 386.42MIN: 8.22 / MAX: 303.1MIN: 8.04 / MAX: 10.17MIN: 7.97 / MAX: 16.09MIN: 7.98 / MAX: 10MIN: 7.98 / MAX: 10.55MIN: 7.96 / MAX: 8.77MIN: 7.99 / MAX: 8.8MIN: 7.98 / MAX: 322.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2nv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 153.433.293.183.133.143.163.133.143.173.563.313.303.293.263.293.263.173.187.81MIN: 3.25 / MAX: 4.81MIN: 3.12 / MAX: 3.93MIN: 3.13 / MAX: 3.9MIN: 3.07 / MAX: 3.82MIN: 3.08 / MAX: 4.06MIN: 3.09 / MAX: 3.92MIN: 3.08 / MAX: 3.85MIN: 3.1 / MAX: 3.73MIN: 3.09 / MAX: 3.78MIN: 3.09 / MAX: 345.01MIN: 3.14 / MAX: 4.92MIN: 3.12 / MAX: 4.82MIN: 3.11 / MAX: 3.98MIN: 3.1 / MAX: 3.87MIN: 3.12 / MAX: 4.14MIN: 3.1 / MAX: 4.12MIN: 3.12 / MAX: 3.64MIN: 3.14 / MAX: 3.63MIN: 3.07 / MAX: 154.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3nv 4090ifedcaRTX 3070 Ti4090 rep40904080 zzz3090 rep30903070246810SE +/- 0.02, N = 3SE +/- 0.00, N = 2SE +/- 0.02, N = 3SE +/- 0.20, N = 143.364.873.143.183.173.203.173.643.303.283.283.163.196.60MIN: 3.21 / MAX: 4.3MIN: 3.14 / MAX: 278.98MIN: 3.09 / MAX: 3.54MIN: 3.11 / MAX: 3.78MIN: 3.1 / MAX: 3.83MIN: 3.16 / MAX: 3.68MIN: 3.11 / MAX: 3.73MIN: 2.87 / MAX: 429.02MIN: 3.15 / MAX: 3.92MIN: 3.15 / MAX: 3.9MIN: 3.13 / MAX: 4.65MIN: 3.11 / MAX: 3.62MIN: 3.14 / MAX: 3.48MIN: 2.98 / MAX: 166.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: shufflenet-v2nv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.20, N = 153.513.493.353.403.333.353.323.333.353.983.493.453.463.503.443.433.363.397.07MIN: 3.37 / MAX: 4MIN: 3.35 / MAX: 4.24MIN: 3.3 / MAX: 4.02MIN: 3.35 / MAX: 5.89MIN: 3.28 / MAX: 4.14MIN: 3.3 / MAX: 3.82MIN: 3.29 / MAX: 4.19MIN: 3.3 / MAX: 3.59MIN: 3.29 / MAX: 3.85MIN: 3.14 / MAX: 529.82MIN: 3.36 / MAX: 4.33MIN: 3.32 / MAX: 3.99MIN: 3.32 / MAX: 5.24MIN: 3.37 / MAX: 4.85MIN: 3.3 / MAX: 5.36MIN: 3.3 / MAX: 4.22MIN: 3.32 / MAX: 4.06MIN: 3.35 / MAX: 3.69MIN: 3.25 / MAX: 243.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mnasnetnv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701.14532.29063.43594.58125.7265SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 154.703.202.982.972.962.972.962.952.983.403.153.183.063.073.083.072.972.995.09MIN: 3 / MAX: 188.08MIN: 3.07 / MAX: 3.86MIN: 2.94 / MAX: 3.65MIN: 2.93 / MAX: 3.66MIN: 2.91 / MAX: 5.9MIN: 2.92 / MAX: 3.34MIN: 2.93 / MAX: 3.41MIN: 2.92 / MAX: 3.42MIN: 2.92 / MAX: 4.03MIN: 2.72 / MAX: 432.18MIN: 3 / MAX: 4.54MIN: 3.05 / MAX: 4.64MIN: 2.92 / MAX: 3.73MIN: 2.95 / MAX: 4.19MIN: 2.94 / MAX: 3.67MIN: 2.93 / MAX: 4.63MIN: 2.94 / MAX: 3.28MIN: 2.96 / MAX: 3.14MIN: 2.86 / MAX: 53.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: efficientnet-b0nv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.18, N = 154.105.883.863.863.843.873.833.823.864.534.094.344.044.044.054.013.863.888.99MIN: 3.86 / MAX: 5.46MIN: 4.04 / MAX: 364.21MIN: 3.82 / MAX: 4.22MIN: 3.78 / MAX: 10.45MIN: 3.79 / MAX: 4.76MIN: 3.77 / MAX: 9.91MIN: 3.79 / MAX: 4.61MIN: 3.78 / MAX: 4.39MIN: 3.8 / MAX: 4.6MIN: 3.75 / MAX: 396.62MIN: 3.87 / MAX: 5.46MIN: 4.14 / MAX: 5.84MIN: 3.8 / MAX: 5.31MIN: 3.82 / MAX: 5.33MIN: 3.83 / MAX: 6.11MIN: 3.78 / MAX: 5.34MIN: 3.82 / MAX: 4.34MIN: 3.84 / MAX: 4.39MIN: 3.71 / MAX: 129.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: blazefacenv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030700.89551.7912.68653.5824.4775SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 151.401.401.411.381.381.381.371.371.381.601.401.391.421.421.411.411.371.393.98MIN: 1.34 / MAX: 1.87MIN: 1.33 / MAX: 2MIN: 1.38 / MAX: 2.09MIN: 1.35 / MAX: 2.08MIN: 1.34 / MAX: 1.88MIN: 1.34 / MAX: 2.25MIN: 1.35 / MAX: 1.82MIN: 1.35 / MAX: 1.75MIN: 1.34 / MAX: 1.85MIN: 1.11 / MAX: 436.01MIN: 1.33 / MAX: 1.93MIN: 1.33 / MAX: 1.94MIN: 1.36 / MAX: 2.01MIN: 1.36 / MAX: 1.93MIN: 1.35 / MAX: 1.9MIN: 1.35 / MAX: 2.01MIN: 1.36 / MAX: 1.46MIN: 1.37 / MAX: 1.82MIN: 1.31 / MAX: 228.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: googlenetnv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.22, N = 158.708.758.968.157.857.857.807.857.909.8710.6510.628.418.428.408.427.827.8719.49MIN: 7.96 / MAX: 10.01MIN: 8.08 / MAX: 16.01MIN: 8.82 / MAX: 9.87MIN: 8.02 / MAX: 9.02MIN: 7.71 / MAX: 8.76MIN: 7.71 / MAX: 8.85MIN: 7.72 / MAX: 8.74MIN: 7.76 / MAX: 8.76MIN: 7.74 / MAX: 9.54MIN: 7.33 / MAX: 399.24MIN: 8.29 / MAX: 236.11MIN: 7.83 / MAX: 323.31MIN: 7.72 / MAX: 9.9MIN: 7.73 / MAX: 10.06MIN: 7.77 / MAX: 9.78MIN: 7.79 / MAX: 10.01MIN: 7.69 / MAX: 8.61MIN: 7.76 / MAX: 10.36MIN: 7.4 / MAX: 200.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vgg16nv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701122334455SE +/- 0.14, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.24, N = 1529.2927.8324.2024.5523.6023.5623.5423.5623.5129.0627.0428.8225.8225.0125.0425.0023.4823.5548.29MIN: 24.63 / MAX: 296.95MIN: 24.98 / MAX: 262.23MIN: 23.56 / MAX: 58.31MIN: 23.62 / MAX: 97.69MIN: 23.17 / MAX: 24.71MIN: 23.24 / MAX: 24.78MIN: 23.33 / MAX: 24.61MIN: 23.34 / MAX: 24.72MIN: 23.29 / MAX: 24.68MIN: 24.11 / MAX: 541.55MIN: 24.22 / MAX: 296.13MIN: 24.35 / MAX: 214.1MIN: 24.35 / MAX: 62.94MIN: 23.8 / MAX: 26.41MIN: 24.06 / MAX: 27.35MIN: 23.93 / MAX: 26.69MIN: 23.24 / MAX: 29.21MIN: 23.3 / MAX: 24.45MIN: 24.97 / MAX: 183.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet18nv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.20, N = 157.445.826.225.485.225.235.215.235.286.286.015.695.595.625.615.675.205.2712.68MIN: 5.29 / MAX: 320.54MIN: 5.28 / MAX: 7.02MIN: 6.11 / MAX: 7MIN: 5.33 / MAX: 6.16MIN: 5.09 / MAX: 11.15MIN: 5.08 / MAX: 6.28MIN: 5.11 / MAX: 6.04MIN: 5.13 / MAX: 6.18MIN: 5.17 / MAX: 6.16MIN: 4.94 / MAX: 298.06MIN: 5.44 / MAX: 8.18MIN: 5.16 / MAX: 8.22MIN: 5.06 / MAX: 6.95MIN: 5.1 / MAX: 7.65MIN: 5.11 / MAX: 7.44MIN: 5.18 / MAX: 7.22MIN: 5.09 / MAX: 5.98MIN: 5.15 / MAX: 6.19MIN: 5.39 / MAX: 262.621. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: alexnetnv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.18, N = 155.206.534.874.644.314.314.334.334.315.256.794.644.684.684.654.624.314.3510.88MIN: 4.82 / MAX: 7.07MIN: 4.57 / MAX: 242.16MIN: 4.8 / MAX: 5.62MIN: 4.57 / MAX: 5.49MIN: 4.23 / MAX: 11.03MIN: 4.25 / MAX: 5.28MIN: 4.26 / MAX: 10.59MIN: 4.28 / MAX: 5.16MIN: 4.24 / MAX: 5.2MIN: 4.23 / MAX: 375.94MIN: 4.23 / MAX: 262.43MIN: 4.26 / MAX: 5.98MIN: 4.26 / MAX: 6.23MIN: 4.26 / MAX: 6.61MIN: 4.26 / MAX: 6.53MIN: 4.26 / MAX: 6.15MIN: 4.26 / MAX: 5.26MIN: 4.28 / MAX: 7.49MIN: 4.38 / MAX: 52.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet50nv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070612182430SE +/- 0.12, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.26, N = 1512.4512.9610.3410.2610.1010.1010.0010.0010.0112.6013.0814.1311.0710.9410.8410.8110.0610.1023.48MIN: 11.55 / MAX: 14.48MIN: 10.23 / MAX: 424.46MIN: 10.14 / MAX: 11.37MIN: 10.09 / MAX: 11.22MIN: 9.84 / MAX: 11.72MIN: 9.86 / MAX: 11.08MIN: 9.91 / MAX: 11.15MIN: 9.92 / MAX: 12.35MIN: 9.88 / MAX: 11.4MIN: 9.82 / MAX: 418.4MIN: 10.11 / MAX: 444.45MIN: 10.63 / MAX: 167.28MIN: 10.1 / MAX: 13.23MIN: 9.95 / MAX: 12.7MIN: 9.93 / MAX: 12.81MIN: 9.95 / MAX: 12.78MIN: 9.95 / MAX: 11.04MIN: 9.97 / MAX: 11.42MIN: 10.06 / MAX: 112.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: yolov4-tinynv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070714212835SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.18, N = 1515.2615.1613.6413.1712.8712.8512.8112.8712.8415.5415.7213.9713.8013.6513.6713.7912.8312.8828.59MIN: 12.87 / MAX: 132.82MIN: 12.86 / MAX: 248.64MIN: 13.04 / MAX: 76.32MIN: 13.03 / MAX: 14.1MIN: 12.68 / MAX: 13.84MIN: 12.72 / MAX: 13.93MIN: 12.73 / MAX: 13.08MIN: 12.76 / MAX: 13.73MIN: 12.69 / MAX: 15.33MIN: 12.15 / MAX: 492.01MIN: 13.2 / MAX: 301.81MIN: 13.11 / MAX: 16.15MIN: 12.76 / MAX: 15.76MIN: 12.71 / MAX: 14.99MIN: 12.71 / MAX: 14.88MIN: 12.75 / MAX: 19.63MIN: 12.74 / MAX: 13.59MIN: 12.76 / MAX: 13.67MIN: 12.87 / MAX: 325.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: squeezenet_ssdnv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.26, N = 159.117.467.107.087.057.087.067.077.098.478.227.837.637.627.647.587.067.1615.82MIN: 6.35 / MAX: 130.38MIN: 6.9 / MAX: 8.9MIN: 6.99 / MAX: 8.59MIN: 6.98 / MAX: 8.07MIN: 6.95 / MAX: 8MIN: 6.97 / MAX: 7.99MIN: 7 / MAX: 8.03MIN: 7 / MAX: 8.07MIN: 6.98 / MAX: 7.95MIN: 6.29 / MAX: 533.92MIN: 7.56 / MAX: 9.8MIN: 7.21 / MAX: 9.32MIN: 7 / MAX: 9.17MIN: 7.01 / MAX: 9.28MIN: 7.05 / MAX: 9.12MIN: 6.98 / MAX: 9.05MIN: 7 / MAX: 7.82MIN: 7.05 / MAX: 13.55MIN: 6.99 / MAX: 82.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: regnety_400mnv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.21, N = 159.819.888.368.348.108.178.008.218.169.078.488.648.588.568.568.398.028.3816.22MIN: 7.82 / MAX: 241.19MIN: 8.14 / MAX: 251.77MIN: 8.27 / MAX: 9.08MIN: 7.99 / MAX: 26.72MIN: 7.98 / MAX: 8.84MIN: 7.99 / MAX: 8.97MIN: 7.94 / MAX: 8.88MIN: 8.14 / MAX: 8.84MIN: 7.9 / MAX: 8.99MIN: 7.61 / MAX: 402.49MIN: 8.09 / MAX: 9.64MIN: 8.28 / MAX: 10.42MIN: 8.13 / MAX: 9.78MIN: 8.15 / MAX: 9.8MIN: 8.17 / MAX: 10.28MIN: 8 / MAX: 10.29MIN: 7.95 / MAX: 8.63MIN: 8.31 / MAX: 8.86MIN: 7.74 / MAX: 314.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vision_transformernv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701632486480SE +/- 0.07, N = 3SE +/- 0.21, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 1539.0436.4232.4232.9231.9332.1231.7931.8531.8838.0337.5938.7634.3234.1935.0735.5631.9131.9470.76MIN: 33.83 / MAX: 463.88MIN: 33.49 / MAX: 224.86MIN: 31.89 / MAX: 65.47MIN: 32.67 / MAX: 36.93MIN: 31.62 / MAX: 35.85MIN: 31.66 / MAX: 46.9MIN: 31.63 / MAX: 35.57MIN: 31.69 / MAX: 33.06MIN: 31.55 / MAX: 37.47MIN: 32.66 / MAX: 467.28MIN: 34.45 / MAX: 457.98MIN: 33.12 / MAX: 539.58MIN: 32.58 / MAX: 41.88MIN: 32.72 / MAX: 36.79MIN: 33.66 / MAX: 39.36MIN: 33.19 / MAX: 40.43MIN: 31.74 / MAX: 34.28MIN: 31.73 / MAX: 34.21MIN: 38.81 / MAX: 250.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: FastestDetnv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.29, N = 153.935.144.074.244.084.114.094.074.104.264.594.394.794.204.214.204.084.118.41MIN: 3.8 / MAX: 5.4MIN: 3.7 / MAX: 81.79MIN: 4.02 / MAX: 4.82MIN: 3.88 / MAX: 24.21MIN: 4.03 / MAX: 5.29MIN: 4.01 / MAX: 9.72MIN: 4.05 / MAX: 5.5MIN: 4.04 / MAX: 4.53MIN: 4.06 / MAX: 4.81MIN: 2.5 / MAX: 396.93MIN: 2.62 / MAX: 232.18MIN: 4.25 / MAX: 5.86MIN: 4.64 / MAX: 6.21MIN: 4.03 / MAX: 6.49MIN: 4.04 / MAX: 4.97MIN: 4.02 / MAX: 4.97MIN: 4.04 / MAX: 4.35MIN: 4.07 / MAX: 4.29MIN: 2.89 / MAX: 487.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: mobilenetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.23, N = 158.9110.088.988.568.008.019.628.748.968.408.448.488.438.038.0617.82MIN: 8.33 / MAX: 10.07MIN: 8.08 / MAX: 286.28MIN: 8.1 / MAX: 124.43MIN: 8.04 / MAX: 75.44MIN: 7.96 / MAX: 8.63MIN: 7.95 / MAX: 8.95MIN: 7.76 / MAX: 454.91MIN: 8.25 / MAX: 10.5MIN: 8.39 / MAX: 10.77MIN: 8.12 / MAX: 10.11MIN: 7.97 / MAX: 10.71MIN: 7.96 / MAX: 10.32MIN: 7.99 / MAX: 10.44MIN: 7.98 / MAX: 8.77MIN: 7.94 / MAX: 13.92MIN: 7.57 / MAX: 211.621. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.10, N = 153.393.303.153.163.143.153.413.453.463.283.303.273.313.173.149.19MIN: 3.21 / MAX: 4.24MIN: 3.14 / MAX: 4.82MIN: 3.1 / MAX: 3.63MIN: 3.09 / MAX: 3.89MIN: 3.1 / MAX: 3.67MIN: 3.1 / MAX: 3.68MIN: 2.99 / MAX: 184.91MIN: 3.23 / MAX: 4.55MIN: 3.29 / MAX: 4.38MIN: 3.1 / MAX: 4MIN: 3.12 / MAX: 4.03MIN: 3.1 / MAX: 4.34MIN: 3.12 / MAX: 4.76MIN: 3.11 / MAX: 4.5MIN: 3.08 / MAX: 3.7MIN: 3.04 / MAX: 232.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx40803090 rep309030701.21052.4213.63154.8426.0525SE +/- 0.22, N = 143.353.293.163.153.163.163.763.533.253.273.313.283.153.155.38MIN: 3.21 / MAX: 5.23MIN: 3.15 / MAX: 4.32MIN: 3.11 / MAX: 3.93MIN: 3.11 / MAX: 3.48MIN: 3.12 / MAX: 3.7MIN: 3.12 / MAX: 3.69MIN: 2.89 / MAX: 366.04MIN: 3.2 / MAX: 40.81MIN: 3.11 / MAX: 4.74MIN: 3.14 / MAX: 4.63MIN: 3.16 / MAX: 5.3MIN: 3.14 / MAX: 3.89MIN: 3.11 / MAX: 3.6MIN: 3.11 / MAX: 3.71MIN: 2.74 / MAX: 121.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: shufflenet-v2nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.21, N = 153.463.523.383.403.343.334.093.595.183.433.473.433.463.333.328.13MIN: 3.32 / MAX: 5.2MIN: 3.39 / MAX: 4.05MIN: 3.34 / MAX: 4.15MIN: 3.35 / MAX: 4.17MIN: 3.32 / MAX: 3.79MIN: 3.3 / MAX: 3.79MIN: 3.12 / MAX: 435.28MIN: 3.46 / MAX: 4.09MIN: 3.34 / MAX: 283.54MIN: 3.31 / MAX: 3.94MIN: 3.33 / MAX: 5.01MIN: 3.3 / MAX: 4.03MIN: 3.34 / MAX: 3.93MIN: 3.3 / MAX: 3.67MIN: 3.28 / MAX: 3.66MIN: 3.09 / MAX: 147.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: mnasnetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.04, N = 154.613.393.053.122.972.973.113.233.193.063.073.093.082.962.956.87MIN: 2.78 / MAX: 222.99MIN: 3.26 / MAX: 4.86MIN: 3.01 / MAX: 3.88MIN: 3.08 / MAX: 3.86MIN: 2.94 / MAX: 3.43MIN: 2.94 / MAX: 3.43MIN: 2.8 / MAX: 4.98MIN: 3.1 / MAX: 3.75MIN: 3.06 / MAX: 3.75MIN: 2.93 / MAX: 3.64MIN: 2.94 / MAX: 3.6MIN: 2.95 / MAX: 4.52MIN: 2.94 / MAX: 4.52MIN: 2.94 / MAX: 3.38MIN: 2.92 / MAX: 3.29MIN: 2.93 / MAX: 216.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: efficientnet-b0nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.22, N = 154.044.684.144.043.893.824.784.344.094.034.064.024.023.853.839.01MIN: 3.78 / MAX: 4.9MIN: 4.48 / MAX: 6.02MIN: 4.09 / MAX: 5.13MIN: 3.99 / MAX: 4.82MIN: 3.83 / MAX: 9.72MIN: 3.79 / MAX: 4.34MIN: 3.82 / MAX: 411.19MIN: 4.16 / MAX: 5.28MIN: 3.86 / MAX: 4.83MIN: 3.82 / MAX: 5.43MIN: 3.83 / MAX: 5.55MIN: 3.82 / MAX: 5.39MIN: 3.82 / MAX: 5.66MIN: 3.81 / MAX: 4.53MIN: 3.78 / MAX: 4.41MIN: 3.98 / MAX: 188.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: blazefacenv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030700.68181.36362.04542.72723.409SE +/- 0.19, N = 151.161.281.371.431.381.371.791.341.171.411.421.421.441.371.363.03MIN: 1.11 / MAX: 1.67MIN: 1.23 / MAX: 1.73MIN: 1.34 / MAX: 2.07MIN: 1.4 / MAX: 1.77MIN: 1.36 / MAX: 1.58MIN: 1.35 / MAX: 1.52MIN: 1.13 / MAX: 312.12MIN: 1.27 / MAX: 1.95MIN: 1.11 / MAX: 1.9MIN: 1.34 / MAX: 1.91MIN: 1.36 / MAX: 1.92MIN: 1.36 / MAX: 2.2MIN: 1.37 / MAX: 3.45MIN: 1.35 / MAX: 1.46MIN: 1.34 / MAX: 1.46MIN: 1.28 / MAX: 96.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: googlenetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.24, N = 159.0210.179.158.077.887.849.659.299.978.558.508.528.457.857.8317.00MIN: 8.41 / MAX: 11.08MIN: 7.94 / MAX: 150.01MIN: 7.84 / MAX: 198.46MIN: 7.92 / MAX: 8.86MIN: 7.79 / MAX: 8.78MIN: 7.74 / MAX: 8.7MIN: 7.59 / MAX: 472.81MIN: 7.98 / MAX: 83.03MIN: 7.67 / MAX: 258.52MIN: 7.85 / MAX: 10.35MIN: 7.79 / MAX: 9.94MIN: 7.81 / MAX: 10.78MIN: 7.79 / MAX: 10.32MIN: 7.75 / MAX: 8.69MIN: 7.71 / MAX: 8.8MIN: 7.35 / MAX: 277.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: vgg16nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701122334455SE +/- 0.24, N = 1527.0429.1224.9224.4523.9923.5028.4027.2528.5525.4525.0024.9125.1023.5423.5049.75MIN: 24.33 / MAX: 215.56MIN: 26.33 / MAX: 310.23MIN: 24.58 / MAX: 31.89MIN: 24.26 / MAX: 25.26MIN: 23.72 / MAX: 24.98MIN: 23.3 / MAX: 24.41MIN: 24.12 / MAX: 509.06MIN: 24.14 / MAX: 379.93MIN: 24.05 / MAX: 201.8MIN: 24.22 / MAX: 27.73MIN: 23.91 / MAX: 27.99MIN: 23.8 / MAX: 26.87MIN: 24.12 / MAX: 27.57MIN: 23.33 / MAX: 24.41MIN: 23.17 / MAX: 24.44MIN: 25.45 / MAX: 273.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: resnet18nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.19, N = 157.615.865.486.135.265.216.187.755.815.715.665.635.705.205.1911.14MIN: 5.23 / MAX: 90.18MIN: 5.35 / MAX: 7.79MIN: 5.37 / MAX: 6.51MIN: 5.41 / MAX: 151.51MIN: 5.18 / MAX: 6.27MIN: 5.12 / MAX: 6.22MIN: 5.17 / MAX: 262.79MIN: 5.57 / MAX: 125.43MIN: 5.27 / MAX: 7.16MIN: 5.12 / MAX: 8.19MIN: 5.14 / MAX: 7.49MIN: 5.09 / MAX: 7.75MIN: 5.15 / MAX: 7.9MIN: 5.1 / MAX: 5.97MIN: 5.09 / MAX: 6.13MIN: 4.79 / MAX: 65.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: alexnetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.23, N = 154.675.014.714.834.284.295.555.274.944.674.654.654.664.304.3111.00MIN: 4.28 / MAX: 5.7MIN: 4.6 / MAX: 6.68MIN: 4.65 / MAX: 5.57MIN: 4.76 / MAX: 5.74MIN: 4.24 / MAX: 5.12MIN: 4.24 / MAX: 5.64MIN: 4.2 / MAX: 281.58MIN: 4.78 / MAX: 7.7MIN: 4.51 / MAX: 6.64MIN: 4.28 / MAX: 6.29MIN: 4.28 / MAX: 6.42MIN: 4.26 / MAX: 6.13MIN: 4.29 / MAX: 6.1MIN: 4.24 / MAX: 4.99MIN: 4.25 / MAX: 5.13MIN: 4.33 / MAX: 199.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: resnet50nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070612182430SE +/- 0.22, N = 1513.6814.0511.2511.0510.3310.0112.1113.8211.3911.2110.9110.7911.1110.0710.0524.07MIN: 10.25 / MAX: 566.67MIN: 11.69 / MAX: 252.21MIN: 10.55 / MAX: 118.12MIN: 10.46 / MAX: 112.6MIN: 10.16 / MAX: 13.97MIN: 9.89 / MAX: 10.86MIN: 10.16 / MAX: 382.56MIN: 10.34 / MAX: 245.6MIN: 10.48 / MAX: 13.29MIN: 10.3 / MAX: 13.25MIN: 9.91 / MAX: 13.1MIN: 9.91 / MAX: 12.75MIN: 10.19 / MAX: 13.03MIN: 9.94 / MAX: 11.06MIN: 9.85 / MAX: 12.64MIN: 10.02 / MAX: 218.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: yolov4-tinynv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070714212835SE +/- 0.14, N = 1515.6215.1113.0813.0712.8912.9815.4215.3415.3013.8313.6913.5513.8112.8212.8729.34MIN: 12.99 / MAX: 184MIN: 12.93 / MAX: 151.45MIN: 12.96 / MAX: 13.83MIN: 12.95 / MAX: 14.55MIN: 12.84 / MAX: 13.19MIN: 12.73 / MAX: 35.55MIN: 12.21 / MAX: 414.81MIN: 12.94 / MAX: 157.95MIN: 12.87 / MAX: 144.73MIN: 12.89 / MAX: 15.4MIN: 12.73 / MAX: 15.68MIN: 12.75 / MAX: 14.74MIN: 12.84 / MAX: 15.1MIN: 12.72 / MAX: 13.48MIN: 12.75 / MAX: 13.58MIN: 12.17 / MAX: 245.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: squeezenet_ssdnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.24, N = 159.378.167.266.977.047.038.399.309.327.357.647.597.647.097.0417.75MIN: 7.07 / MAX: 281.92MIN: 7.51 / MAX: 9.94MIN: 7.14 / MAX: 8.59MIN: 6.83 / MAX: 13.87MIN: 6.96 / MAX: 7.83MIN: 6.97 / MAX: 7.88MIN: 6.53 / MAX: 436.05MIN: 6.92 / MAX: 310.91MIN: 7.1 / MAX: 172.56MIN: 6.79 / MAX: 9.82MIN: 7.03 / MAX: 9.19MIN: 7.02 / MAX: 8.87MIN: 7.05 / MAX: 9.9MIN: 7.02 / MAX: 7.99MIN: 6.96 / MAX: 7.74MIN: 6.47 / MAX: 272.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: regnety_400mnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.21, N = 159.558.218.078.348.148.059.0217.158.138.478.758.578.338.097.9519.66MIN: 7.5 / MAX: 193.79MIN: 7.9 / MAX: 9.99MIN: 7.97 / MAX: 8.81MIN: 8.26 / MAX: 9.3MIN: 8.08 / MAX: 8.69MIN: 8 / MAX: 8.58MIN: 7.69 / MAX: 501.76MIN: 8.02 / MAX: 773.45MIN: 7.75 / MAX: 10.05MIN: 8.13 / MAX: 10.27MIN: 8.35 / MAX: 10.08MIN: 8.21 / MAX: 10.39MIN: 8.02 / MAX: 9.64MIN: 7.99 / MAX: 14.25MIN: 7.88 / MAX: 8.67MIN: 7.5 / MAX: 235.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: vision_transformernv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307020406080100SE +/- 0.18, N = 1538.9936.5533.3933.4731.7831.6537.8639.1238.7934.4734.3734.2734.2031.9331.8981.77MIN: 34.17 / MAX: 473.06MIN: 33 / MAX: 209.38MIN: 32.73 / MAX: 88.83MIN: 32.89 / MAX: 74.09MIN: 31.64 / MAX: 34.51MIN: 31.53 / MAX: 32.23MIN: 32.9 / MAX: 463.9MIN: 33.92 / MAX: 465.83MIN: 33.95 / MAX: 457.41MIN: 33.32 / MAX: 37.42MIN: 33.01 / MAX: 38.7MIN: 33.07 / MAX: 37.01MIN: 32.92 / MAX: 36.19MIN: 31.76 / MAX: 33.09MIN: 31.66 / MAX: 39.97MIN: 44.4 / MAX: 460.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3 - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3 - Model: FastestDetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.27, N = 144.065.693.973.854.084.074.334.162.934.044.194.184.204.114.049.18MIN: 3.91 / MAX: 5.78MIN: 3.69 / MAX: 261.71MIN: 3.92 / MAX: 4.75MIN: 3.8 / MAX: 4.65MIN: 4.05 / MAX: 4.36MIN: 4.03 / MAX: 5.83MIN: 2.59 / MAX: 433.58MIN: 4 / MAX: 5.58MIN: 2.84 / MAX: 3.38MIN: 3.89 / MAX: 5.01MIN: 4.04 / MAX: 5.47MIN: 4.03 / MAX: 5.07MIN: 4.06 / MAX: 4.86MIN: 4.07 / MAX: 4.21MIN: 4.01 / MAX: 4.15MIN: 3.64 / MAX: 122.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: mobilenetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.26, N = 1510.549.058.208.657.958.009.629.548.818.388.888.468.438.018.0016.34MIN: 8.41 / MAX: 134.08MIN: 8.48 / MAX: 11.28MIN: 8.12 / MAX: 9.4MIN: 8.55 / MAX: 9.53MIN: 7.89 / MAX: 8.79MIN: 7.95 / MAX: 8.99MIN: 7.76 / MAX: 502.83MIN: 8.94 / MAX: 10.54MIN: 8.32 / MAX: 10.7MIN: 7.95 / MAX: 10.41MIN: 8.31 / MAX: 10.01MIN: 7.99 / MAX: 10.62MIN: 7.99 / MAX: 10.66MIN: 7.95 / MAX: 8.35MIN: 7.94 / MAX: 8.78MIN: 8.13 / MAX: 80.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.18, N = 154.453.283.173.163.153.153.663.314.993.233.403.303.293.173.167.24MIN: 2.65 / MAX: 216.76MIN: 3.09 / MAX: 5.28MIN: 3.13 / MAX: 3.58MIN: 3.1 / MAX: 3.71MIN: 3.11 / MAX: 3.85MIN: 3.11 / MAX: 3.88MIN: 3.01 / MAX: 437.59MIN: 3.12 / MAX: 4.6MIN: 3.1 / MAX: 201.8MIN: 3.06 / MAX: 4.66MIN: 3.23 / MAX: 4.8MIN: 3.12 / MAX: 4.7MIN: 3.12 / MAX: 4.64MIN: 3.12 / MAX: 4.03MIN: 3.11 / MAX: 3.51MIN: 3.04 / MAX: 261.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3070246810SE +/- 0.18, N = 152.613.263.153.153.173.163.654.903.123.203.333.273.273.158.06MIN: 2.5 / MAX: 3.12MIN: 3.12 / MAX: 4.19MIN: 3.1 / MAX: 3.87MIN: 3.1 / MAX: 3.8MIN: 3.11 / MAX: 8.89MIN: 3.11 / MAX: 3.75MIN: 2.87 / MAX: 347.75MIN: 3.17 / MAX: 120.84MIN: 2.99 / MAX: 5.09MIN: 3.06 / MAX: 3.84MIN: 3.19 / MAX: 4.2MIN: 3.14 / MAX: 3.99MIN: 3.12 / MAX: 5.24MIN: 3.11 / MAX: 3.83MIN: 2.96 / MAX: 219.871. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: shufflenet-v2nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701.10032.20063.30094.40125.5015SE +/- 0.22, N = 153.173.363.353.333.333.333.953.403.343.373.513.443.443.363.344.89MIN: 3.04 / MAX: 3.78MIN: 3.25 / MAX: 4.02MIN: 3.31 / MAX: 4.01MIN: 3.29 / MAX: 3.99MIN: 3.31 / MAX: 3.81MIN: 3.3 / MAX: 3.77MIN: 3.19 / MAX: 410.41MIN: 3.26 / MAX: 4.84MIN: 3.23 / MAX: 4.78MIN: 3.25 / MAX: 3.95MIN: 3.37 / MAX: 4.26MIN: 3.32 / MAX: 4.16MIN: 3.31 / MAX: 4.88MIN: 3.32 / MAX: 3.7MIN: 3.31 / MAX: 3.6MIN: 3.04 / MAX: 18.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: mnasnetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.16, N = 142.542.992.982.962.962.963.373.103.003.013.133.073.092.972.976.02MIN: 2.44 / MAX: 3.58MIN: 2.86 / MAX: 4.38MIN: 2.95 / MAX: 3.63MIN: 2.92 / MAX: 3.81MIN: 2.93 / MAX: 3.41MIN: 2.93 / MAX: 3.4MIN: 2.86 / MAX: 278.87MIN: 2.97 / MAX: 3.72MIN: 2.89 / MAX: 3.46MIN: 2.91 / MAX: 3.6MIN: 3 / MAX: 5.1MIN: 2.94 / MAX: 3.72MIN: 2.94 / MAX: 3.79MIN: 2.94 / MAX: 3.39MIN: 2.92 / MAX: 3.28MIN: 2.79 / MAX: 50.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: efficientnet-b0nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.18, N = 155.264.214.633.853.823.854.736.284.183.954.224.044.053.853.857.81MIN: 3.48 / MAX: 250.88MIN: 3.96 / MAX: 4.94MIN: 3.8 / MAX: 159.43MIN: 3.8 / MAX: 4.6MIN: 3.78 / MAX: 4.53MIN: 3.82 / MAX: 4.48MIN: 3.79 / MAX: 418.72MIN: 3.91 / MAX: 337.73MIN: 4 / MAX: 5.25MIN: 3.76 / MAX: 4.84MIN: 4 / MAX: 5.58MIN: 3.81 / MAX: 5.08MIN: 3.83 / MAX: 5MIN: 3.81 / MAX: 4.62MIN: 3.8 / MAX: 4.43MIN: 3.73 / MAX: 159.471. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: blazefacenv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030700.71551.4312.14652.8623.5775SE +/- 0.18, N = 151.071.251.381.371.361.371.711.411.331.391.421.451.421.381.363.18MIN: 1.02 / MAX: 1.52MIN: 1.19 / MAX: 2.61MIN: 1.36 / MAX: 1.76MIN: 1.35 / MAX: 1.62MIN: 1.34 / MAX: 1.44MIN: 1.35 / MAX: 1.39MIN: 1.09 / MAX: 448.17MIN: 1.35 / MAX: 1.89MIN: 1.27 / MAX: 1.98MIN: 1.34 / MAX: 1.89MIN: 1.36 / MAX: 1.92MIN: 1.36 / MAX: 8.73MIN: 1.35 / MAX: 2.15MIN: 1.36 / MAX: 1.9MIN: 1.34 / MAX: 1.61MIN: 1.31 / MAX: 185.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: googlenetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.21, N = 1510.0110.198.357.947.837.979.868.908.878.378.998.588.497.857.8220.72MIN: 7.29 / MAX: 259.11MIN: 7.73 / MAX: 212.36MIN: 8.2 / MAX: 9.39MIN: 7.8 / MAX: 8.78MIN: 7.74 / MAX: 8.61MIN: 7.89 / MAX: 8.7MIN: 7.54 / MAX: 396.21MIN: 8.22 / MAX: 11.07MIN: 8.18 / MAX: 11.09MIN: 7.76 / MAX: 10.31MIN: 8.25 / MAX: 10.27MIN: 7.79 / MAX: 10.48MIN: 7.82 / MAX: 11.98MIN: 7.75 / MAX: 8.64MIN: 7.69 / MAX: 8.6MIN: 7.49 / MAX: 355.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: vgg16nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701224364860SE +/- 0.27, N = 1527.7729.0724.7124.1223.5423.4228.4027.5928.2125.2626.0825.0425.0423.4323.5055.42MIN: 24.82 / MAX: 264.66MIN: 24.45 / MAX: 263.33MIN: 23.88 / MAX: 119.23MIN: 23.57 / MAX: 46.44MIN: 23.32 / MAX: 24.54MIN: 23.27 / MAX: 24.32MIN: 23.98 / MAX: 456MIN: 24.34 / MAX: 396.09MIN: 24.57 / MAX: 270.76MIN: 24.14 / MAX: 27.73MIN: 24.52 / MAX: 27.73MIN: 23.81 / MAX: 27.15MIN: 23.87 / MAX: 28.04MIN: 23.26 / MAX: 24.3MIN: 23.23 / MAX: 24.26MIN: 25.32 / MAX: 281.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: resnet18nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.16, N = 155.845.855.505.305.235.426.235.815.975.595.895.695.655.245.2313.38MIN: 5.35 / MAX: 7.72MIN: 5.3 / MAX: 8.27MIN: 5.4 / MAX: 6.38MIN: 5.17 / MAX: 5.93MIN: 5.11 / MAX: 6.03MIN: 5.36 / MAX: 6.27MIN: 4.99 / MAX: 309.18MIN: 5.3 / MAX: 6.82MIN: 5.46 / MAX: 7.02MIN: 5.09 / MAX: 7.7MIN: 5.36 / MAX: 7.53MIN: 5.11 / MAX: 6.94MIN: 5.14 / MAX: 6.93MIN: 5.14 / MAX: 5.99MIN: 5.1 / MAX: 6.07MIN: 5.43 / MAX: 208.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: alexnetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.23, N = 156.114.994.864.354.304.425.676.586.114.655.214.694.694.304.309.86MIN: 4.83 / MAX: 124.76MIN: 4.59 / MAX: 6.56MIN: 4.8 / MAX: 6.37MIN: 4.27 / MAX: 5.16MIN: 4.26 / MAX: 5.16MIN: 4.32 / MAX: 5.1MIN: 4.21 / MAX: 365.75MIN: 4.61 / MAX: 91.07MIN: 4.73 / MAX: 81.72MIN: 4.26 / MAX: 5.97MIN: 4.79 / MAX: 6.66MIN: 4.26 / MAX: 7.17MIN: 4.26 / MAX: 6.15MIN: 4.25 / MAX: 4.7MIN: 4.25 / MAX: 5.08MIN: 4.25 / MAX: 157.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: resnet50nv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070612182430SE +/- 0.27, N = 1513.1311.1510.4310.2510.039.8712.3513.5714.5811.0911.5010.8410.9510.019.9723.11MIN: 10.56 / MAX: 323.44MIN: 10.31 / MAX: 12.97MIN: 10.19 / MAX: 11.32MIN: 10.05 / MAX: 11.08MIN: 9.93 / MAX: 10.96MIN: 9.79 / MAX: 10.73MIN: 9.83 / MAX: 424.28MIN: 10.45 / MAX: 199.55MIN: 10.67 / MAX: 324.82MIN: 10.18 / MAX: 13.12MIN: 10.5 / MAX: 13.47MIN: 9.93 / MAX: 12.83MIN: 9.91 / MAX: 17.11MIN: 9.91 / MAX: 10.74MIN: 9.86 / MAX: 10.84MIN: 10.22 / MAX: 140.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: yolov4-tinynv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070714212835SE +/- 0.23, N = 1516.6115.4313.3514.3412.8612.7715.4416.3915.4413.6213.9513.6813.7912.8412.8829.49MIN: 12.32 / MAX: 375.99MIN: 13.1 / MAX: 210.2MIN: 12.87 / MAX: 58.52MIN: 14.23 / MAX: 15.12MIN: 12.76 / MAX: 13.98MIN: 12.69 / MAX: 13.71MIN: 12.61 / MAX: 387.62MIN: 12.97 / MAX: 369.64MIN: 12.92 / MAX: 211.43MIN: 12.75 / MAX: 15.79MIN: 13.03 / MAX: 15.9MIN: 12.77 / MAX: 15.57MIN: 12.79 / MAX: 15.92MIN: 12.76 / MAX: 13.7MIN: 12.75 / MAX: 13.79MIN: 13.03 / MAX: 182.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: squeezenet_ssdnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.19, N = 157.728.337.317.097.077.148.297.817.937.517.707.637.667.087.0416.15MIN: 7.12 / MAX: 23.25MIN: 6.32 / MAX: 222.03MIN: 6.96 / MAX: 30.1MIN: 6.98 / MAX: 8.01MIN: 7.01 / MAX: 7.75MIN: 7.06 / MAX: 7.95MIN: 6.37 / MAX: 448.22MIN: 7.24 / MAX: 9.04MIN: 7.31 / MAX: 9.45MIN: 6.94 / MAX: 9.51MIN: 7.11 / MAX: 9.19MIN: 7.02 / MAX: 9.71MIN: 7.02 / MAX: 9.08MIN: 7.01 / MAX: 7.93MIN: 6.97 / MAX: 7.76MIN: 7.25 / MAX: 210.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: regnety_400mnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.25, N = 147.737.997.998.507.988.278.8910.349.608.108.588.448.458.258.0117.23MIN: 7.43 / MAX: 9.41MIN: 7.62 / MAX: 9.27MIN: 7.91 / MAX: 8.8MIN: 8.04 / MAX: 30.12MIN: 7.93 / MAX: 8.65MIN: 8.22 / MAX: 9.01MIN: 7.74 / MAX: 476.28MIN: 8.21 / MAX: 214.16MIN: 7.66 / MAX: 210.23MIN: 7.77 / MAX: 15.42MIN: 8.23 / MAX: 10.39MIN: 8.04 / MAX: 10.17MIN: 8.05 / MAX: 10.3MIN: 8.12 / MAX: 14MIN: 7.93 / MAX: 8.35MIN: 7.8 / MAX: 193.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: vision_transformernv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701632486480SE +/- 0.13, N = 1538.5838.3332.6833.3631.6631.7138.2938.7339.0134.0535.4034.2934.1332.1131.8673.51MIN: 33.77 / MAX: 476.18MIN: 34.14 / MAX: 246.43MIN: 32.02 / MAX: 87.72MIN: 32.83 / MAX: 76.21MIN: 31.52 / MAX: 32.14MIN: 31.56 / MAX: 33.03MIN: 32.31 / MAX: 557.38MIN: 33.81 / MAX: 362.17MIN: 33.91 / MAX: 411.66MIN: 32.83 / MAX: 38.57MIN: 33.93 / MAX: 39.3MIN: 33.11 / MAX: 40.12MIN: 32.98 / MAX: 36.11MIN: 31.94 / MAX: 33.01MIN: 31.58 / MAX: 35.84MIN: 39.27 / MAX: 288.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3 - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3 - Model: FastestDetnv 4090igfcbRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.15, N = 155.924.434.064.203.694.064.264.163.944.124.314.094.204.104.048.63MIN: 4.25 / MAX: 103.26MIN: 4.28 / MAX: 5.01MIN: 4.01 / MAX: 4.78MIN: 4.15 / MAX: 4.92MIN: 3.66 / MAX: 3.92MIN: 4.03 / MAX: 4.3MIN: 2.71 / MAX: 347.03MIN: 4.03 / MAX: 4.73MIN: 3.8 / MAX: 5.41MIN: 3.97 / MAX: 6.99MIN: 4.14 / MAX: 6.11MIN: 3.92 / MAX: 5.5MIN: 4.04 / MAX: 5.82MIN: 4.06 / MAX: 4.21MIN: 4 / MAX: 4.15MIN: 4.27 / MAX: 144.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: mobilenetnv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.27, N = 159.4110.028.509.628.3710.188.468.468.408.848.048.1118.39MIN: 8.98 / MAX: 11.38MIN: 8.07 / MAX: 266.25MIN: 8.42 / MAX: 9.29MIN: 7.71 / MAX: 449.11MIN: 7.98 / MAX: 10.71MIN: 8.18 / MAX: 235.56MIN: 7.97 / MAX: 10.56MIN: 7.95 / MAX: 10.34MIN: 7.93 / MAX: 15.25MIN: 8.31 / MAX: 10.98MIN: 7.96 / MAX: 9.01MIN: 8.02 / MAX: 14.2MIN: 7.92 / MAX: 173.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2nv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.15, N = 155.103.293.173.663.343.413.283.273.273.283.193.158.35MIN: 3.14 / MAX: 138.88MIN: 3.1 / MAX: 3.96MIN: 3.1 / MAX: 5.03MIN: 3.01 / MAX: 311.25MIN: 3.14 / MAX: 4.45MIN: 3.24 / MAX: 5.42MIN: 3.09 / MAX: 4.98MIN: 3.11 / MAX: 4.73MIN: 3.08 / MAX: 5.18MIN: 3.11 / MAX: 4.16MIN: 3.13 / MAX: 4MIN: 3.1 / MAX: 3.68MIN: 3.08 / MAX: 103.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3nv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep30903070246810SE +/- 0.18, N = 153.263.263.163.623.333.343.243.263.243.166.56MIN: 3.13 / MAX: 3.96MIN: 3.11 / MAX: 4.7MIN: 3.12 / MAX: 3.58MIN: 3 / MAX: 469.9MIN: 3.19 / MAX: 4.79MIN: 3.21 / MAX: 4.31MIN: 3.1 / MAX: 3.88MIN: 3.13 / MAX: 4.08MIN: 3.11 / MAX: 4.37MIN: 3.11 / MAX: 3.77MIN: 3.07 / MAX: 110.871. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2nv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.16, N = 153.513.433.343.755.275.173.433.433.393.463.373.338.00MIN: 3.38 / MAX: 4.05MIN: 3.3 / MAX: 4.89MIN: 3.31 / MAX: 4.05MIN: 3.2 / MAX: 361.52MIN: 3.27 / MAX: 191.55MIN: 3.22 / MAX: 208.13MIN: 3.29 / MAX: 3.87MIN: 3.31 / MAX: 3.95MIN: 3.26 / MAX: 3.91MIN: 3.3 / MAX: 5.74MIN: 3.33 / MAX: 3.8MIN: 3.29 / MAX: 3.67MIN: 3.16 / MAX: 190.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: mnasnetnv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701.03282.06563.09844.13125.164SE +/- 0.14, N = 153.163.072.973.343.133.173.083.053.033.062.992.964.59MIN: 3.02 / MAX: 4.6MIN: 2.93 / MAX: 3.84MIN: 2.93 / MAX: 3.88MIN: 2.68 / MAX: 393.6MIN: 3.01 / MAX: 3.62MIN: 3.03 / MAX: 3.66MIN: 2.93 / MAX: 4.42MIN: 2.91 / MAX: 3.67MIN: 2.91 / MAX: 4.45MIN: 2.94 / MAX: 3.67MIN: 2.96 / MAX: 3.32MIN: 2.93 / MAX: 3.31MIN: 2.88 / MAX: 20.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0nv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.19, N = 154.124.193.844.604.044.364.014.023.984.063.873.838.41MIN: 3.86 / MAX: 5.39MIN: 4.01 / MAX: 5.09MIN: 3.78 / MAX: 4.57MIN: 3.79 / MAX: 336.2MIN: 3.85 / MAX: 4.9MIN: 4.14 / MAX: 5.24MIN: 3.79 / MAX: 5.39MIN: 3.8 / MAX: 5.14MIN: 3.77 / MAX: 5.44MIN: 3.85 / MAX: 4.97MIN: 3.81 / MAX: 4.62MIN: 3.78 / MAX: 4.4MIN: 3.76 / MAX: 67.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: blazefacenv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030700.39830.79661.19491.59321.9915SE +/- 0.12, N = 141.181.411.381.491.461.431.421.421.411.431.391.361.77MIN: 1.11 / MAX: 1.85MIN: 1.35 / MAX: 2.02MIN: 1.35 / MAX: 2.09MIN: 1.05 / MAX: 379.08MIN: 1.39 / MAX: 2.91MIN: 1.36 / MAX: 2.04MIN: 1.34 / MAX: 2.84MIN: 1.35 / MAX: 2MIN: 1.34 / MAX: 2.1MIN: 1.36 / MAX: 2.06MIN: 1.37 / MAX: 1.52MIN: 1.34 / MAX: 1.46MIN: 1.08 / MAX: 12.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: googlenetnv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.24, N = 158.6110.477.969.8410.3811.308.428.438.498.407.897.8618.60MIN: 7.95 / MAX: 10.07MIN: 8.21 / MAX: 350.07MIN: 7.81 / MAX: 9.05MIN: 7.3 / MAX: 438.04MIN: 7.96 / MAX: 255.68MIN: 7.95 / MAX: 477.54MIN: 7.78 / MAX: 10.7MIN: 7.77 / MAX: 10.4MIN: 7.74 / MAX: 10.76MIN: 7.71 / MAX: 10.64MIN: 7.79 / MAX: 8.84MIN: 7.74 / MAX: 8.62MIN: 8.02 / MAX: 292.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: vgg16nv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701224364860SE +/- 0.30, N = 1527.8927.4324.0428.6329.1231.5725.1625.4025.0525.4823.5223.5555.48MIN: 24.5 / MAX: 463.23MIN: 24.65 / MAX: 251.37MIN: 23.48 / MAX: 73.3MIN: 24.13 / MAX: 500.18MIN: 24.62 / MAX: 266.39MIN: 26.09 / MAX: 318.58MIN: 23.97 / MAX: 27.81MIN: 24.05 / MAX: 27.09MIN: 23.78 / MAX: 26.95MIN: 23.88 / MAX: 51.68MIN: 23.33 / MAX: 25.08MIN: 23.31 / MAX: 24.48MIN: 25.94 / MAX: 298.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: resnet18nv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.23, N = 158.165.885.286.576.056.585.605.675.615.615.225.1912.14MIN: 5.39 / MAX: 397.44MIN: 5.36 / MAX: 8.2MIN: 5.16 / MAX: 6.09MIN: 4.91 / MAX: 391.33MIN: 5.53 / MAX: 7.66MIN: 6.04 / MAX: 7.81MIN: 5.09 / MAX: 7.51MIN: 5.1 / MAX: 8.06MIN: 5.07 / MAX: 7.08MIN: 5.09 / MAX: 7.91MIN: 5.13 / MAX: 6.1MIN: 5.09 / MAX: 6MIN: 5.28 / MAX: 151.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: alexnetnv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.22, N = 155.145.104.355.535.335.144.684.674.724.614.314.3110.08MIN: 4.65 / MAX: 6.81MIN: 4.75 / MAX: 6.12MIN: 4.28 / MAX: 5.1MIN: 4.22 / MAX: 362.62MIN: 4.83 / MAX: 6.6MIN: 4.76 / MAX: 6.16MIN: 4.26 / MAX: 6.8MIN: 4.27 / MAX: 6.36MIN: 4.25 / MAX: 7.3MIN: 4.24 / MAX: 7.25MIN: 4.26 / MAX: 5.07MIN: 4.25 / MAX: 4.94MIN: 4.36 / MAX: 225.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: resnet50nv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070612182430SE +/- 0.23, N = 1513.6313.1010.3312.7312.1714.0810.9110.9110.8011.4010.0410.3823.59MIN: 10.52 / MAX: 488.94MIN: 10.59 / MAX: 267.95MIN: 10.2 / MAX: 11.18MIN: 9.84 / MAX: 518.97MIN: 11.25 / MAX: 13.79MIN: 10.29 / MAX: 247.29MIN: 9.94 / MAX: 14.83MIN: 9.91 / MAX: 13.07MIN: 9.89 / MAX: 12.54MIN: 10.5 / MAX: 13.51MIN: 9.94 / MAX: 10.89MIN: 9.88 / MAX: 18.75MIN: 9.96 / MAX: 177.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: yolov4-tinynv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070714212835SE +/- 0.28, N = 1517.3013.7713.1415.2115.4515.5513.6113.6213.5513.8612.8613.1029.80MIN: 14.66 / MAX: 441.3MIN: 12.96 / MAX: 14.66MIN: 13 / MAX: 14.02MIN: 12.34 / MAX: 380.51MIN: 12.65 / MAX: 445.76MIN: 13.11 / MAX: 307.2MIN: 12.67 / MAX: 19.72MIN: 12.71 / MAX: 15.65MIN: 12.72 / MAX: 15.51MIN: 13.04 / MAX: 15.04MIN: 12.76 / MAX: 13.73MIN: 13.01 / MAX: 14.17MIN: 12.85 / MAX: 216.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssdnv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.25, N = 149.217.217.148.289.169.287.627.677.557.667.097.0415.40MIN: 6.83 / MAX: 203.62MIN: 6.73 / MAX: 8.82MIN: 7.03 / MAX: 7.99MIN: 6.38 / MAX: 381.81MIN: 6.73 / MAX: 423.75MIN: 6.92 / MAX: 355.6MIN: 7 / MAX: 9.93MIN: 7.04 / MAX: 9.1MIN: 6.99 / MAX: 9.08MIN: 7.09 / MAX: 8.97MIN: 7.01 / MAX: 7.97MIN: 6.96 / MAX: 7.7MIN: 6.64 / MAX: 132.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: regnety_400mnv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.21, N = 1510.098.468.389.198.6410.108.498.528.248.618.347.9917.88MIN: 7.84 / MAX: 366.66MIN: 8.08 / MAX: 10.33MIN: 8.05 / MAX: 27.34MIN: 7.44 / MAX: 524.66MIN: 8.3 / MAX: 10.51MIN: 7.93 / MAX: 156.75MIN: 8.08 / MAX: 9.72MIN: 8.13 / MAX: 9.73MIN: 7.91 / MAX: 9.53MIN: 8.21 / MAX: 10.07MIN: 8.26 / MAX: 9.09MIN: 7.92 / MAX: 8.78MIN: 7.38 / MAX: 190.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: vision_transformernv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701632486480SE +/- 0.20, N = 1539.1838.0133.3237.8838.1739.0634.1034.2334.1034.9132.0933.2271.08MIN: 33.74 / MAX: 520.24MIN: 32.96 / MAX: 388.09MIN: 31.83 / MAX: 104.12MIN: 32.46 / MAX: 518.57MIN: 32.97 / MAX: 462.63MIN: 34.16 / MAX: 481.28MIN: 32.32 / MAX: 38.54MIN: 33.08 / MAX: 37.43MIN: 32.43 / MAX: 38.75MIN: 33.72 / MAX: 36.82MIN: 31.84 / MAX: 32.77MIN: 33.04 / MAX: 36.99MIN: 38.84 / MAX: 374.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3 - Model: FastestDetnv 4090igRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.20, N = 152.813.833.924.413.964.134.204.174.144.284.084.036.93MIN: 2.68 / MAX: 4.38MIN: 3.7 / MAX: 4.57MIN: 3.88 / MAX: 4.72MIN: 2.06 / MAX: 295.24MIN: 3.79 / MAX: 11.36MIN: 3.99 / MAX: 4.67MIN: 4.01 / MAX: 11.47MIN: 4.03 / MAX: 5.63MIN: 4 / MAX: 5.6MIN: 4.13 / MAX: 4.85MIN: 4.04 / MAX: 4.29MIN: 3.99 / MAX: 4.22MIN: 2.57 / MAX: 163.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.25, N = 158.939.529.029.169.198.318.388.438.058.0317.09MIN: 8.33 / MAX: 11.07MIN: 7.97 / MAX: 420.29MIN: 8.42 / MAX: 11.17MIN: 8.5 / MAX: 10.51MIN: 8.51 / MAX: 11.04MIN: 7.85 / MAX: 10.21MIN: 7.94 / MAX: 10.07MIN: 8.03 / MAX: 9.64MIN: 7.96 / MAX: 9.04MIN: 7.96 / MAX: 8.83MIN: 7.89 / MAX: 121.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701.22852.4573.68554.9146.1425SE +/- 0.18, N = 153.423.663.363.483.283.143.273.293.173.125.46MIN: 3.15 / MAX: 25.1MIN: 2.73 / MAX: 398.42MIN: 3.17 / MAX: 4.8MIN: 3.32 / MAX: 4.99MIN: 3.11 / MAX: 4.26MIN: 3 / MAX: 3.85MIN: 3.08 / MAX: 4.68MIN: 3.12 / MAX: 3.99MIN: 3.12 / MAX: 3.89MIN: 3.07 / MAX: 3.62MIN: 3.27 / MAX: 38.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701.34782.69564.04345.39126.739SE +/- 0.13, N = 133.173.443.343.623.263.053.313.263.183.135.99MIN: 3.04 / MAX: 4.3MIN: 2.65 / MAX: 361.91MIN: 3.19 / MAX: 3.99MIN: 3.47 / MAX: 4.24MIN: 3.12 / MAX: 4.74MIN: 2.94 / MAX: 3.56MIN: 3.16 / MAX: 3.93MIN: 3.13 / MAX: 4.7MIN: 3.13 / MAX: 3.61MIN: 3.09 / MAX: 3.68MIN: 3.05 / MAX: 26.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701.25782.51563.77345.03126.289SE +/- 0.20, N = 153.503.893.483.523.443.343.443.483.343.325.59MIN: 3.37 / MAX: 4.2MIN: 3.08 / MAX: 345.39MIN: 3.34 / MAX: 4.1MIN: 3.38 / MAX: 4.23MIN: 3.31 / MAX: 4.85MIN: 3.22 / MAX: 3.97MIN: 3.31 / MAX: 4.32MIN: 3.34 / MAX: 4.88MIN: 3.3 / MAX: 3.79MIN: 3.29 / MAX: 3.79MIN: 3.32 / MAX: 42.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.04, N = 153.073.104.994.933.082.983.063.102.972.946.06MIN: 2.93 / MAX: 4.52MIN: 2.61 / MAX: 4.75MIN: 3.02 / MAX: 235.56MIN: 2.97 / MAX: 124.96MIN: 2.95 / MAX: 3.88MIN: 2.86 / MAX: 4.47MIN: 2.93 / MAX: 5.02MIN: 2.95 / MAX: 4.05MIN: 2.94 / MAX: 3.45MIN: 2.9 / MAX: 3.34MIN: 2.96 / MAX: 42.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.13, N = 154.104.374.414.154.053.974.014.043.853.889.81MIN: 3.87 / MAX: 6.14MIN: 3.85 / MAX: 366.28MIN: 4.21 / MAX: 5.82MIN: 3.93 / MAX: 5.94MIN: 3.83 / MAX: 5.42MIN: 3.79 / MAX: 5.93MIN: 3.81 / MAX: 6.04MIN: 3.84 / MAX: 4.83MIN: 3.78 / MAX: 4.83MIN: 3.83 / MAX: 4.72MIN: 3.87 / MAX: 165.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazefacenv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030700.67281.34562.01842.69123.364SE +/- 0.03, N = 151.261.341.411.421.411.311.421.441.371.382.99MIN: 1.2 / MAX: 1.76MIN: 1.06 / MAX: 2.66MIN: 1.35 / MAX: 1.91MIN: 1.36 / MAX: 1.92MIN: 1.34 / MAX: 1.88MIN: 1.25 / MAX: 3.14MIN: 1.35 / MAX: 2.89MIN: 1.37 / MAX: 2.07MIN: 1.35 / MAX: 1.48MIN: 1.36 / MAX: 1.53MIN: 1.22 / MAX: 149.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.19, N = 158.359.9010.399.058.558.268.428.797.867.8416.97MIN: 7.7 / MAX: 10.46MIN: 7.76 / MAX: 396.66MIN: 7.87 / MAX: 391.66MIN: 8.26 / MAX: 13.34MIN: 7.86 / MAX: 10.08MIN: 7.62 / MAX: 10.47MIN: 7.77 / MAX: 10.52MIN: 8.08 / MAX: 10.27MIN: 7.75 / MAX: 8.71MIN: 7.74 / MAX: 8.72MIN: 7.44 / MAX: 229.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701122334455SE +/- 0.28, N = 1528.1428.5329.1727.4426.0925.3325.5625.6723.3823.5149.70MIN: 24.24 / MAX: 221.5MIN: 23.95 / MAX: 473.83MIN: 24.61 / MAX: 264.85MIN: 24.06 / MAX: 264.59MIN: 24.58 / MAX: 30.18MIN: 24.26 / MAX: 34.98MIN: 24.24 / MAX: 27.92MIN: 24.46 / MAX: 27.34MIN: 23.19 / MAX: 24.27MIN: 23.27 / MAX: 24.38MIN: 25.55 / MAX: 421.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.20, N = 157.386.405.907.525.745.655.675.925.205.2111.30MIN: 5.15 / MAX: 138.85MIN: 5.1 / MAX: 457.07MIN: 5.43 / MAX: 7.49MIN: 5.45 / MAX: 290.49MIN: 5.18 / MAX: 8.08MIN: 5.18 / MAX: 6.76MIN: 5.19 / MAX: 7.38MIN: 5.37 / MAX: 8.24MIN: 5.1 / MAX: 6.09MIN: 5.09 / MAX: 6.13MIN: 5.3 / MAX: 181.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030703691215SE +/- 0.16, N = 154.695.345.254.674.704.714.644.984.304.3211.89MIN: 4.28 / MAX: 6.33MIN: 4.25 / MAX: 221.78MIN: 4.86 / MAX: 6.33MIN: 4.28 / MAX: 6MIN: 4.28 / MAX: 5.92MIN: 4.26 / MAX: 7.21MIN: 4.24 / MAX: 6MIN: 4.59 / MAX: 7.15MIN: 4.24 / MAX: 5.11MIN: 4.25 / MAX: 5.33MIN: 4.34 / MAX: 229.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070612182430SE +/- 0.25, N = 1513.1312.4211.5112.4012.5011.2211.0711.489.9810.0723.44MIN: 10.18 / MAX: 247.5MIN: 10.23 / MAX: 444.76MIN: 10.56 / MAX: 13.22MIN: 11.44 / MAX: 14.43MIN: 11.47 / MAX: 14.56MIN: 10.33 / MAX: 12.81MIN: 10.16 / MAX: 13.16MIN: 10.56 / MAX: 12.93MIN: 9.85 / MAX: 11.35MIN: 9.95 / MAX: 10.88MIN: 10.17 / MAX: 219.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tinynv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070714212835SE +/- 0.19, N = 1515.5515.0015.4015.9515.2613.5213.7313.9312.9012.9728.73MIN: 12.87 / MAX: 342.3MIN: 12.75 / MAX: 401.37MIN: 13 / MAX: 245.79MIN: 13.38 / MAX: 245.18MIN: 14.19 / MAX: 17.06MIN: 12.72 / MAX: 21.19MIN: 12.78 / MAX: 20.99MIN: 13.08 / MAX: 15.68MIN: 12.77 / MAX: 13.92MIN: 12.83 / MAX: 13.8MIN: 12.83 / MAX: 264.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssdnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070510152025SE +/- 0.24, N = 157.028.659.469.818.067.277.627.717.087.0518.83MIN: 6.38 / MAX: 9.36MIN: 6.64 / MAX: 544.17MIN: 7.03 / MAX: 160.39MIN: 7.16 / MAX: 389.1MIN: 7.42 / MAX: 9.25MIN: 6.74 / MAX: 8.84MIN: 7.01 / MAX: 14.37MIN: 7.15 / MAX: 9.1MIN: 7 / MAX: 7.94MIN: 6.97 / MAX: 7.95MIN: 6.71 / MAX: 206.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400mnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep3090307048121620SE +/- 0.24, N = 1510.039.0510.699.878.378.258.358.678.078.3317.61MIN: 7.81 / MAX: 171.2MIN: 7.52 / MAX: 417.33MIN: 8.17 / MAX: 339.6MIN: 7.81 / MAX: 243.06MIN: 8.04 / MAX: 10.13MIN: 7.93 / MAX: 9.88MIN: 8.05 / MAX: 9.76MIN: 8.3 / MAX: 14.66MIN: 7.99 / MAX: 8.88MIN: 8.25 / MAX: 9.32MIN: 7.85 / MAX: 165.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformernv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030701632486480SE +/- 0.11, N = 1538.4638.2739.0338.6235.3633.9033.9335.6031.9731.9470.29MIN: 32.39 / MAX: 435.46MIN: 32.29 / MAX: 507.7MIN: 33.61 / MAX: 343.67MIN: 33.33 / MAX: 465MIN: 33.87 / MAX: 42.41MIN: 32.72 / MAX: 37.77MIN: 32.77 / MAX: 36.2MIN: 34.13 / MAX: 38.49MIN: 31.71 / MAX: 33.78MIN: 31.72 / MAX: 34.34MIN: 39.39 / MAX: 250.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070246810SE +/- 0.27, N = 152.644.324.114.454.613.754.174.194.073.836.71MIN: 2.52 / MAX: 4.14MIN: 2.51 / MAX: 398.91MIN: 3.98 / MAX: 4.73MIN: 4.29 / MAX: 5.05MIN: 4.45 / MAX: 5.92MIN: 3.63 / MAX: 5.24MIN: 4.02 / MAX: 4.75MIN: 4.06 / MAX: 7.41MIN: 4.03 / MAX: 4.18MIN: 3.79 / MAX: 4.09MIN: 2.73 / MAX: 109.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: mobilenetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep3090307048121620SE +/- 0.14, N = 312.1210.0310.619.048.258.348.458.068.0716.52MIN: 9.16 / MAX: 505.01MIN: 7.86 / MAX: 346.64MIN: 8.34 / MAX: 225.97MIN: 8.49 / MAX: 10.96MIN: 7.78 / MAX: 9.61MIN: 7.89 / MAX: 9.42MIN: 8.01 / MAX: 10.86MIN: 8 / MAX: 8.96MIN: 8.01 / MAX: 8.62MIN: 7.9 / MAX: 82.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep30903070246810SE +/- 0.53, N = 33.293.913.443.363.163.203.303.163.167.22MIN: 3.13 / MAX: 4.29MIN: 3.04 / MAX: 394.66MIN: 3.27 / MAX: 4.93MIN: 3.21 / MAX: 4.78MIN: 3.01 / MAX: 5.17MIN: 3.05 / MAX: 4.67MIN: 3.11 / MAX: 4.01MIN: 3.09 / MAX: 4.06MIN: 3.11 / MAX: 3.95MIN: 3.17 / MAX: 69.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep3070246810SE +/- 0.53, N = 34.973.703.303.333.063.083.283.176.43MIN: 3.15 / MAX: 291.01MIN: 2.98 / MAX: 261.6MIN: 3.15 / MAX: 3.91MIN: 3.2 / MAX: 4.4MIN: 2.94 / MAX: 3.94MIN: 2.97 / MAX: 3.67MIN: 3.13 / MAX: 4.78MIN: 3.12 / MAX: 3.75MIN: 2.85 / MAX: 164.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep30903070246810SE +/- 0.60, N = 33.324.023.423.483.363.403.473.333.367.81MIN: 3.19 / MAX: 4.76MIN: 3.27 / MAX: 328.59MIN: 3.29 / MAX: 3.94MIN: 3.35 / MAX: 4.05MIN: 3.23 / MAX: 3.99MIN: 3.28 / MAX: 3.87MIN: 3.33 / MAX: 5.39MIN: 3.3 / MAX: 3.78MIN: 3.32 / MAX: 3.66MIN: 3.3 / MAX: 131.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: mnasnetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep30903070246810SE +/- 0.13, N = 33.123.245.115.192.963.003.092.972.986.07MIN: 2.98 / MAX: 3.71MIN: 2.9 / MAX: 5.34MIN: 2.96 / MAX: 247.47MIN: 3.04 / MAX: 436.91MIN: 2.85 / MAX: 3.82MIN: 2.88 / MAX: 4.37MIN: 2.96 / MAX: 4.98MIN: 2.93 / MAX: 3.3MIN: 2.95 / MAX: 3.9MIN: 2.94 / MAX: 129.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep309030703691215SE +/- 0.46, N = 35.944.744.354.473.954.014.073.853.889.19MIN: 3.97 / MAX: 208.59MIN: 3.68 / MAX: 295.7MIN: 4.08 / MAX: 5.62MIN: 4.23 / MAX: 5.82MIN: 3.79 / MAX: 4.59MIN: 3.83 / MAX: 5.28MIN: 3.85 / MAX: 4.79MIN: 3.81 / MAX: 4.75MIN: 3.83 / MAX: 4.61MIN: 3.85 / MAX: 131.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: blazefacenv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep309030700.56931.13861.70792.27722.8465SE +/- 0.48, N = 31.422.481.421.301.311.321.431.381.392.53MIN: 1.34 / MAX: 1.99MIN: 1.17 / MAX: 344.52MIN: 1.34 / MAX: 2.37MIN: 1.24 / MAX: 1.92MIN: 1.25 / MAX: 1.76MIN: 1.26 / MAX: 2.03MIN: 1.36 / MAX: 2.02MIN: 1.35 / MAX: 1.64MIN: 1.37 / MAX: 1.48MIN: 1.08 / MAX: 118.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: googlenetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep30903070510152025SE +/- 0.80, N = 310.759.688.978.388.298.328.527.917.9019.20MIN: 7.92 / MAX: 447.83MIN: 8.16 / MAX: 382.41MIN: 8.22 / MAX: 10.51MIN: 7.78 / MAX: 10.43MIN: 7.63 / MAX: 9.87MIN: 7.71 / MAX: 10.39MIN: 7.85 / MAX: 10.56MIN: 7.81 / MAX: 8.62MIN: 7.8 / MAX: 8.73MIN: 7.84 / MAX: 193.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: vgg16nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep309030701122334455SE +/- 0.54, N = 327.6127.9830.7430.1625.2625.4425.0123.4023.5850.32MIN: 24.67 / MAX: 401.29MIN: 24.35 / MAX: 423.63MIN: 25.36 / MAX: 428.68MIN: 24.66 / MAX: 332.49MIN: 24.29 / MAX: 27.75MIN: 24.27 / MAX: 27.68MIN: 23.88 / MAX: 26.66MIN: 23.2 / MAX: 24.07MIN: 23.35 / MAX: 24.43MIN: 25.92 / MAX: 281.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: resnet18nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep309030703691215SE +/- 0.30, N = 36.076.228.147.745.775.785.645.305.2012.64MIN: 5.49 / MAX: 15.12MIN: 5.3 / MAX: 8.22MIN: 5.39 / MAX: 122.47MIN: 5.25 / MAX: 312.09MIN: 5.22 / MAX: 7.06MIN: 5.21 / MAX: 6.97MIN: 5.11 / MAX: 7.51MIN: 5.21 / MAX: 6.24MIN: 5.1 / MAX: 6.16MIN: 5.3 / MAX: 53.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: alexnetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep309030703691215SE +/- 0.43, N = 36.626.175.454.994.664.694.684.314.3310.59MIN: 4.28 / MAX: 339.62MIN: 4.5 / MAX: 261.75MIN: 4.93 / MAX: 7.98MIN: 4.56 / MAX: 6.91MIN: 4.24 / MAX: 5.97MIN: 4.26 / MAX: 6.07MIN: 4.27 / MAX: 6.08MIN: 4.26 / MAX: 5.07MIN: 4.26 / MAX: 5.19MIN: 4.3 / MAX: 177.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: resnet50nv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep30903070510152025SE +/- 0.04, N = 313.2912.8112.4711.7211.1011.2610.8610.0610.0322.19MIN: 10.54 / MAX: 456.82MIN: 10.06 / MAX: 349.03MIN: 11.5 / MAX: 14.68MIN: 10.8 / MAX: 12.8MIN: 10.19 / MAX: 18.3MIN: 10.32 / MAX: 13.29MIN: 9.98 / MAX: 12.46MIN: 9.86 / MAX: 11.9MIN: 9.93 / MAX: 10.87MIN: 10.16 / MAX: 181.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: yolov4-tinynv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep30903070714212835SE +/- 0.81, N = 317.6714.5713.8815.8513.4213.6313.7112.8112.8228.41MIN: 14.92 / MAX: 343.93MIN: 12.33 / MAX: 312.42MIN: 13.09 / MAX: 14.77MIN: 13.26 / MAX: 253.23MIN: 12.65 / MAX: 16.19MIN: 12.77 / MAX: 16.93MIN: 12.78 / MAX: 15.62MIN: 12.7 / MAX: 13.69MIN: 12.72 / MAX: 13.66MIN: 12.49 / MAX: 151.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssdnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep3090307048121620SE +/- 0.23, N = 37.487.579.389.517.257.277.677.097.1214.27MIN: 6.85 / MAX: 9.67MIN: 6.69 / MAX: 10MIN: 6.77 / MAX: 224.11MIN: 7.11 / MAX: 307.17MIN: 6.72 / MAX: 8.05MIN: 6.73 / MAX: 8.77MIN: 7.06 / MAX: 9.96MIN: 7.02 / MAX: 7.86MIN: 7.04 / MAX: 7.97MIN: 7.01 / MAX: 51.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: regnety_400mnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep3090307048121620SE +/- 0.29, N = 38.258.4210.2310.098.348.388.728.038.2218.25MIN: 7.87 / MAX: 10.07MIN: 7.66 / MAX: 10.74MIN: 8.22 / MAX: 197.1MIN: 8.01 / MAX: 418.58MIN: 8.03 / MAX: 10.23MIN: 8.04 / MAX: 9.63MIN: 8.32 / MAX: 10.48MIN: 7.97 / MAX: 8.65MIN: 8.14 / MAX: 8.67MIN: 7.8 / MAX: 238.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: vision_transformernv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep309030701530456075SE +/- 0.11, N = 338.9538.0438.7938.7634.4734.1434.2231.8532.1065.41MIN: 34.04 / MAX: 486.96MIN: 33.11 / MAX: 346.94MIN: 34.02 / MAX: 460.15MIN: 33.38 / MAX: 423.24MIN: 33.05 / MAX: 39.69MIN: 32.5 / MAX: 37.13MIN: 33.01 / MAX: 37.09MIN: 31.67 / MAX: 35.74MIN: 31.9 / MAX: 33.03MIN: 39.08 / MAX: 230.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3 - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3 - Model: FastestDetnv 4090RTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep3090 rep30903070246810SE +/- 0.87, N = 33.934.183.122.853.823.804.204.074.107.12MIN: 3.76 / MAX: 11.77MIN: 2.53 / MAX: 295.11MIN: 2.97 / MAX: 4.42MIN: 2.74 / MAX: 4.36MIN: 3.65 / MAX: 9.77MIN: 3.65 / MAX: 6.08MIN: 4.04 / MAX: 5.63MIN: 4.03 / MAX: 4.2MIN: 4.07 / MAX: 4.34MIN: 3.72 / MAX: 188.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenetnv 4090RTX 3070 Ti4090 rep40903090 rep30903070510152025SE +/- 0.24, N = 1510.159.988.838.468.058.0118.54MIN: 8.08 / MAX: 193.04MIN: 7.79 / MAX: 434.9MIN: 8.29 / MAX: 10.15MIN: 8.12 / MAX: 10.14MIN: 7.98 / MAX: 8.94MIN: 7.96 / MAX: 8.47MIN: 8.01 / MAX: 164.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2nv 4090RTX 3070 Ti4090 rep40903090 rep309030701.23532.47063.70594.94126.1765SE +/- 0.20, N = 153.273.693.605.253.173.155.49MIN: 3.11 / MAX: 4.1MIN: 3.07 / MAX: 544.13MIN: 3.44 / MAX: 4.27MIN: 3.11 / MAX: 367.53MIN: 3.12 / MAX: 3.78MIN: 3.11 / MAX: 3.78MIN: 2.97 / MAX: 152.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3nv 4090RTX 3070 Ti4090 rep4090309030701.34332.68664.02995.37326.7165SE +/- 0.17, N = 154.813.523.443.363.165.97MIN: 3.13 / MAX: 149.75MIN: 2.95 / MAX: 536.1MIN: 3.3 / MAX: 4.34MIN: 3.21 / MAX: 4.83MIN: 3.12 / MAX: 3.67MIN: 2.84 / MAX: 111.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2nv 4090RTX 3070 Ti4090 rep40903090 rep30903070246810SE +/- 0.20, N = 153.373.925.183.473.363.366.30MIN: 3.25 / MAX: 5.26MIN: 3.12 / MAX: 496.78MIN: 3.45 / MAX: 200.36MIN: 3.33 / MAX: 5.01MIN: 3.33 / MAX: 3.83MIN: 3.32 / MAX: 3.82MIN: 3.28 / MAX: 147.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnetnv 4090RTX 3070 Ti4090 rep40903090 rep30903070246810SE +/- 0.11, N = 153.103.253.283.192.982.978.15MIN: 2.97 / MAX: 3.92MIN: 2.68 / MAX: 277.21MIN: 3.15 / MAX: 4.32MIN: 3.04 / MAX: 3.98MIN: 2.94 / MAX: 3.36MIN: 2.93 / MAX: 3.45MIN: 2.67 / MAX: 317.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0nv 4090RTX 3070 Ti4090 rep40903090 rep309030703691215SE +/- 0.16, N = 155.884.554.444.143.853.869.53MIN: 3.96 / MAX: 194.08MIN: 3.84 / MAX: 379.07MIN: 4.24 / MAX: 5.18MIN: 3.93 / MAX: 5.94MIN: 3.81 / MAX: 4.6MIN: 3.82 / MAX: 4.82MIN: 3.77 / MAX: 182.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazefacenv 4090RTX 3070 Ti4090 rep40903090 rep309030700.80331.60662.40993.21324.0165SE +/- 0.14, N = 152.911.511.381.451.381.393.57MIN: 1.29 / MAX: 113.97MIN: 1.11 / MAX: 380.46MIN: 1.33 / MAX: 1.98MIN: 1.38 / MAX: 2.98MIN: 1.35 / MAX: 1.88MIN: 1.36 / MAX: 3.12MIN: 1.08 / MAX: 141.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenetnv 4090RTX 3070 Ti4090 rep40903090 rep30903070510152025SE +/- 0.22, N = 158.859.5810.188.917.827.8318.66MIN: 8.16 / MAX: 10.25MIN: 7.62 / MAX: 396.9MIN: 7.81 / MAX: 204.67MIN: 8.3 / MAX: 10.96MIN: 7.72 / MAX: 8.6MIN: 7.73 / MAX: 8.6MIN: 7.42 / MAX: 326.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16nv 4090RTX 3070 Ti4090 rep40903090 rep309030701224364860SE +/- 0.26, N = 1529.4028.5329.3527.3123.4723.5051.28MIN: 26.17 / MAX: 411.51MIN: 24.21 / MAX: 515.3MIN: 24.55 / MAX: 485.35MIN: 24.27 / MAX: 230.86MIN: 23.25 / MAX: 24.24MIN: 23.26 / MAX: 24.34MIN: 24.83 / MAX: 242.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18nv 4090RTX 3070 Ti4090 rep40903090 rep309030703691215SE +/- 0.24, N = 155.976.695.847.785.205.2013.34MIN: 5.4 / MAX: 8.25MIN: 5.06 / MAX: 462.37MIN: 5.35 / MAX: 8.28MIN: 5.4 / MAX: 168.29MIN: 5.08 / MAX: 6.05MIN: 5.1 / MAX: 6.05MIN: 5.43 / MAX: 279.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnetnv 4090RTX 3070 Ti4090 rep40903090 rep309030703691215SE +/- 0.21, N = 156.545.415.164.944.304.3010.69MIN: 4.56 / MAX: 110.58MIN: 4.23 / MAX: 364.66MIN: 4.73 / MAX: 6.38MIN: 4.52 / MAX: 6.23MIN: 4.24 / MAX: 4.85MIN: 4.25 / MAX: 4.63MIN: 4.32 / MAX: 148.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50nv 4090RTX 3070 Ti4090 rep40903090 rep30903070612182430SE +/- 0.24, N = 1513.4612.5211.2412.9810.0410.0323.54MIN: 10.6 / MAX: 340.67MIN: 9.95 / MAX: 459.05MIN: 10.22 / MAX: 29.96MIN: 10.26 / MAX: 145.62MIN: 9.94 / MAX: 10.91MIN: 9.88 / MAX: 10.86MIN: 10.3 / MAX: 149.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tinynv 4090RTX 3070 Ti4090 rep40903090 rep30903070612182430SE +/- 0.25, N = 1515.6715.5616.6015.6912.8912.8626.33MIN: 12.91 / MAX: 334.44MIN: 12.24 / MAX: 459.8MIN: 12.98 / MAX: 103.04MIN: 13.13 / MAX: 187.93MIN: 12.79 / MAX: 13.77MIN: 12.74 / MAX: 13.68MIN: 12.62 / MAX: 127.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssdnv 4090RTX 3070 Ti4090 rep40903090 rep3090307048121620SE +/- 0.22, N = 157.728.319.347.407.077.0515.46MIN: 7.13 / MAX: 8.97MIN: 6.35 / MAX: 364.95MIN: 6.88 / MAX: 268.7MIN: 6.81 / MAX: 8.46MIN: 6.99 / MAX: 7.81MIN: 6.98 / MAX: 7.81MIN: 7.08 / MAX: 147.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400mnv 4090RTX 3070 Ti4090 rep40903090 rep3090307048121620SE +/- 0.20, N = 158.379.108.4510.058.198.2018.24MIN: 8.08 / MAX: 10.1MIN: 7.61 / MAX: 454.62MIN: 8.05 / MAX: 12.64MIN: 8.13 / MAX: 173.18MIN: 8.12 / MAX: 8.98MIN: 8.14 / MAX: 8.74MIN: 7.5 / MAX: 201.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformernv 4090RTX 3070 Ti4090 rep40903090 rep309030701530456075SE +/- 0.12, N = 1538.5838.3238.6938.8232.1332.1669.48MIN: 33.06 / MAX: 464.16MIN: 32.26 / MAX: 477.15MIN: 33.32 / MAX: 390.07MIN: 33.83 / MAX: 435.6MIN: 31.95 / MAX: 32.87MIN: 31.94 / MAX: 33.7MIN: 39.08 / MAX: 374.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDetnv 4090RTX 3070 Ti4090 rep40903090 rep309030701.0082.0163.0244.0325.04SE +/- 0.29, N = 154.014.253.912.824.104.084.48MIN: 3.87 / MAX: 5.47MIN: 2.46 / MAX: 526.3MIN: 3.77 / MAX: 5.87MIN: 2.69 / MAX: 3.5MIN: 4.06 / MAX: 4.2MIN: 4.04 / MAX: 4.2MIN: 2.2 / MAX: 27.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenetnv 4090RTX 3070 Ti4090 rep40903090 rep307048121620SE +/- 0.13, N = 310.6410.028.2210.568.0317.06MIN: 8.4 / MAX: 127.99MIN: 7.8 / MAX: 372.36MIN: 7.75 / MAX: 9.41MIN: 8.32 / MAX: 239.95MIN: 7.97 / MAX: 8.91MIN: 8 / MAX: 101.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v2-v2 - Model: mobilenet-v2nv 4090RTX 3070 Ti4090 rep40903090 rep30701.3322.6643.9965.3286.66SE +/- 0.53, N = 33.293.833.384.753.155.92MIN: 3.12 / MAX: 4.27MIN: 3.11 / MAX: 343.21MIN: 3.2 / MAX: 4MIN: 2.93 / MAX: 147.66MIN: 3.1 / MAX: 3.75MIN: 3.16 / MAX: 103.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mobilenet-v3nv 4090RTX 3070 Ti4090 rep40903090 rep3070246810SE +/- 0.04, N = 34.963.243.353.363.197.34MIN: 3.14 / MAX: 189.43MIN: 3.05 / MAX: 5.14MIN: 3.22 / MAX: 3.99MIN: 3.22 / MAX: 4.62MIN: 3.13 / MAX: 3.61MIN: 3.09 / MAX: 155.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: shufflenet-v2nv 4090RTX 3070 Ti4090 rep40903090 rep30701.32532.65063.97595.30126.6265SE +/- 0.02, N = 33.433.485.233.563.325.89MIN: 3.29 / MAX: 5.31MIN: 3.33 / MAX: 5.22MIN: 3.34 / MAX: 185.57MIN: 3.43 / MAX: 4.24MIN: 3.29 / MAX: 3.62MIN: 3.19 / MAX: 97.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: mnasnetnv 4090RTX 3070 Ti4090 rep40903090 rep3070246810SE +/- 0.02, N = 33.103.123.123.232.968.55MIN: 2.97 / MAX: 3.73MIN: 2.97 / MAX: 4.65MIN: 3 / MAX: 4.1MIN: 3.08 / MAX: 4.73MIN: 2.92 / MAX: 3.27MIN: 2.99 / MAX: 185.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: efficientnet-b0nv 4090RTX 3070 Ti4090 rep40903090 rep3070246810SE +/- 0.08, N = 35.824.174.104.633.846.63MIN: 3.98 / MAX: 197.79MIN: 3.86 / MAX: 5.52MIN: 3.88 / MAX: 5.04MIN: 4.38 / MAX: 6.01MIN: 3.8 / MAX: 4.67MIN: 3.75 / MAX: 22.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: blazefacenv 4090RTX 3070 Ti4090 rep40903090 rep30700.60531.21061.81592.42123.0265SE +/- 0.04, N = 31.401.401.421.351.382.69MIN: 1.34 / MAX: 1.86MIN: 1.28 / MAX: 1.91MIN: 1.36 / MAX: 2.03MIN: 1.28 / MAX: 1.84MIN: 1.36 / MAX: 1.73MIN: 1.35 / MAX: 48.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: googlenetnv 4090RTX 3070 Ti4090 rep40903090 rep3070510152025SE +/- 0.55, N = 310.149.9710.478.557.8618.80MIN: 7.85 / MAX: 257.61MIN: 8.16 / MAX: 381.49MIN: 7.86 / MAX: 191.94MIN: 7.85 / MAX: 11.39MIN: 7.75 / MAX: 8.57MIN: 7.78 / MAX: 141.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vgg16nv 4090RTX 3070 Ti4090 rep40903090 rep30701224364860SE +/- 0.28, N = 327.2527.8629.8527.3223.7253.48MIN: 24.12 / MAX: 252.53MIN: 24.17 / MAX: 416.36MIN: 24.25 / MAX: 400.86MIN: 24.36 / MAX: 262.38MIN: 23.56 / MAX: 24.59MIN: 25.52 / MAX: 296.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet18nv 4090RTX 3070 Ti4090 rep40903090 rep30703691215SE +/- 0.05, N = 35.585.945.876.965.2712.13MIN: 5.09 / MAX: 6.98MIN: 5.32 / MAX: 8.32MIN: 5.41 / MAX: 7.58MIN: 5.3 / MAX: 242.18MIN: 5.15 / MAX: 6.11MIN: 5.32 / MAX: 123.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: alexnetnv 4090RTX 3070 Ti4090 rep40903090 rep30703691215SE +/- 0.57, N = 36.326.255.345.144.3111.43MIN: 4.26 / MAX: 195.95MIN: 4.27 / MAX: 334.55MIN: 4.87 / MAX: 6.57MIN: 4.75 / MAX: 7.34MIN: 4.26 / MAX: 4.83MIN: 4.24 / MAX: 178.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: resnet50nv 4090RTX 3070 Ti4090 rep40903090 rep3070510152025SE +/- 0.30, N = 313.2513.1510.9613.0010.2722.15MIN: 10.61 / MAX: 154.12MIN: 10.26 / MAX: 349.93MIN: 10.09 / MAX: 12.99MIN: 10.34 / MAX: 397.57MIN: 10.12 / MAX: 11.19MIN: 10.11 / MAX: 123.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: yolov4-tinynv 4090RTX 3070 Ti4090 rep40903090 rep3070714212835SE +/- 0.94, N = 316.3014.6415.4116.0512.9229.38MIN: 14.11 / MAX: 184.46MIN: 12.77 / MAX: 383.28MIN: 12.75 / MAX: 226.87MIN: 12.93 / MAX: 474.03MIN: 12.79 / MAX: 18.5MIN: 12.95 / MAX: 201.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: squeezenet_ssdnv 4090RTX 3070 Ti4090 rep40903090 rep307048121620SE +/- 0.14, N = 38.267.459.447.437.0715.32MIN: 7.64 / MAX: 11.08MIN: 6.59 / MAX: 9.11MIN: 7.17 / MAX: 94.63MIN: 6.84 / MAX: 8.82MIN: 6.98 / MAX: 9.71MIN: 6.66 / MAX: 139.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: regnety_400mnv 4090RTX 3070 Ti4090 rep40903090 rep307048121620SE +/- 0.54, N = 38.349.148.7010.118.0617.02MIN: 8.01 / MAX: 12.36MIN: 8.14 / MAX: 400.02MIN: 8.29 / MAX: 12.6MIN: 8.03 / MAX: 259.38MIN: 7.98 / MAX: 8.6MIN: 7.65 / MAX: 216.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: vision_transformernv 4090RTX 3070 Ti4090 rep40903090 rep30701632486480SE +/- 0.10, N = 337.1338.5038.6539.3531.9470.53MIN: 33.97 / MAX: 443.1MIN: 33.7 / MAX: 418.06MIN: 33.07 / MAX: 476.08MIN: 34.22 / MAX: 466.65MIN: 31.73 / MAX: 32.75MIN: 39.2 / MAX: 276.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3-v3 - Model: FastestDetnv 4090RTX 3070 Ti4090 rep40903090 rep3070246810SE +/- 0.15, N = 35.864.144.594.624.077.23MIN: 3.9 / MAX: 190.17MIN: 3.73 / MAX: 5.07MIN: 4.44 / MAX: 5.2MIN: 4.48 / MAX: 5.16MIN: 4.04 / MAX: 4.25MIN: 3.75 / MAX: 121.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

VkFFT

Test: FFT + iFFT R2C / C2R

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT R2C / C2Rnv 4090ihgfedcba4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309020K40K60K80K100KSE +/- 3.71, N = 3SE +/- 118.74, N = 3SE +/- 200.55, N = 3SE +/- 796.66, N = 38488733727265242663826593353043539943021421634210581329843516768969068682796647354432553471. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C 1D batched in half precisionnv 4090ihgfedcba4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309060K120K180K240K300KSE +/- 26.03, N = 3SE +/- 18.50, N = 3SE +/- 83.55, N = 3SE +/- 133.47, N = 329276813227010429810417110414685191851819174491812915972876512903422109912107132110582110762651712552071. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C Bluestein in single precisionnv 4090ihgfedcba4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30904K8K12K16K20KSE +/- 75.16, N = 15SE +/- 72.34, N = 3SE +/- 62.67, N = 3SE +/- 83.38, N = 32060110061762275747571105601071911311112731134020404203731718517343172871712114449144061. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C 1D batched in double precisionnv 4090ihgfedcba4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309012K24K36K48K60KSE +/- 12.42, N = 3SE +/- 10.58, N = 3SE +/- 11.67, N = 3SE +/- 14.62, N = 35495014780105721054810561121681214320847208222081655383552143505835071350383497431122309451. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C 1D batched in single precisionnv 4090ihgfedcba4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030K60K90K120K150KSE +/- 1.67, N = 3SE +/- 2.73, N = 3SE +/- 9.54, N = 3SE +/- 25.50, N = 31521706973856431564555647642651426454797147948478871539391538961045431045281044911045561414371413571. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C multidimensional in single precisionnv 4090igfedcba4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309020K40K60K80K100KSE +/- 437.33, N = 3SE +/- 116.12, N = 3SE +/- 57.83, N = 3SE +/- 555.86, N = 382875346862654126238370903632832812327513300180999814067004067887700686586954814510051. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C Bluestein benchmark in double precisionnv 4090igfedcba4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30902K4K6K8K10KSE +/- 11.20, N = 3SE +/- 4.37, N = 3SE +/- 0.33, N = 3813224171818181423432346467046954717811980395584558755835579428942821. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.2.31Test: FFT + iFFT C2C 1D batched in single precision, no reshufflingnv 4090igfedcba4090 rep40904080 zzz4080 xxx4080 rep40803090 rep309030K60K90K120K150KSE +/- 2.08, N = 3SE +/- 2.33, N = 3SE +/- 8.89, N = 315514871163570945711043365433655059650643505041559361526561059261060991062051062101439561439691. (CXX) g++ options: -O3

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-scalarhgfedcba3090 rep30905K10K15K20K25KSE +/- 0.30, N = 3SE +/- 16.18, N = 3SE +/- 4.18, N = 36810.736832.746837.948515.588531.9612860.5612807.0613190.0920925.3021269.72

vkpeak

fp32-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-vec4hgfedcba3090 rep30906K12K18K24K30KSE +/- 2.57, N = 3SE +/- 19.37, N = 3SE +/- 1.81, N = 39036.179003.129006.5711231.7211251.1712822.0112808.5912730.0827807.5827797.80

vkpeak

fp16-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-scalarhgfedcba3090 rep30904K8K12K16K20KSE +/- 5.09, N = 3SE +/- 13.46, N = 3SE +/- 4.01, N = 36838.326811.356812.528397.808412.3313136.7913145.1913154.1520953.3020845.09

vkpeak

fp16-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-vec4hgfedcba3090 rep30909K18K27K36K45KSE +/- 0.36, N = 3SE +/- 0.37, N = 3SE +/- 5.96, N = 313490.2413438.4713440.9716865.2916864.4723387.2623390.4423232.4241188.0241149.10

vkpeak

fp64-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp64-scalarhgfedcba3090 rep30902004006008001000SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.22, N = 3213.37213.96214.17267.41267.43839.01839.20841.40653.63653.13

vkpeak

fp64-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp64-vec4hgfedcba30902004006008001000SE +/- 0.00, N = 3SE +/- 0.48, N = 3SE +/- 0.32, N = 3210.96213.95214.23267.25267.74836.16836.55841.80653.15

vkpeak

int32-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-scalarhgfedcba3090 rep30904K8K12K16K20KSE +/- 0.03, N = 3SE +/- 15.02, N = 3SE +/- 0.34, N = 36800.606824.296827.928505.208520.022269.062269.252272.6220767.6420909.02

vkpeak

int32-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-vec4hgfedcba3090 rep30904K8K12K16K20KSE +/- 0.05, N = 3SE +/- 0.19, N = 3SE +/- 0.26, N = 36772.986795.396800.178465.718465.822638.692640.082658.7320517.6820820.09

vkpeak

int16-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-scalarhgfedcba3090 rep30903K6K9K12K15KSE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 1.30, N = 34495.984479.224480.595675.995676.0213063.8613070.8113102.7513608.5713710.88

vkpeak

int16-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-vec4hgfedcba3090 rep30905K10K15K20K25KSE +/- 0.31, N = 3SE +/- 17.33, N = 3SE +/- 21.55, N = 35978.385956.385959.757336.257352.8523385.4423396.5923123.7716881.4716886.66

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Singlenv 4090igfedcbaRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070816243240SE +/- 0.000, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 3SE +/- 0.029, N = 38.96720.93026.76926.73832.85032.85511.68811.69011.68627.1838.9629.28413.12613.13713.13613.13610.42810.39922.0641. (CXX) g++ options: -O3

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Doublenv 4090igfedRTX 3070 Ti4090 rep40904080 zzz4080 xxx4080 rep40803090 rep30903070110220330440550SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3172.89500.01500.01500.01500.02500.0124.81173.04172.88288.03288.04288.17288.20371.42371.7024.751. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5