vulkan-benchmarks

AMD Ryzen 9 7950X 16-Core testing with an ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) and NVIDIA GeForce RTX 4090 24GB on Ubuntu 23.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2308069-PTS-VULKANBE16
Run Management

Result Identifier   Date Run    Test Duration
a                   August 01   3 Hours, 11 Minutes
b                   August 01   1 Hour, 30 Minutes
c                   August 01   1 Hour, 32 Minutes
d                   August 01   3 Hours, 45 Minutes
e                   August 01   3 Hours, 16 Minutes
f                   August 02   1 Hour, 53 Minutes
g                   August 02   2 Hours, 9 Minutes
h                   August 02   47 Minutes
i                   August 02   1 Hour, 50 Minutes
4080                August 02   2 Hours, 4 Minutes
4080 rep            August 02   2 Hours, 7 Minutes
4080 xxx            August 02   2 Hours, 8 Minutes
4080 zzz            August 02   2 Hours, 9 Minutes
3090                August 03   2 Hours, 44 Minutes
3090 rep            August 03   2 Hours, 54 Minutes
3070                August 03   4 Hours, 56 Minutes
RTX 3070 Ti         August 04   1 Day, 7 Hours, 27 Minutes
4090                August 06   2 Hours, 52 Minutes
4090 rep            August 06   2 Hours, 54 Minutes
nv 4090             August 06   2 Hours, 52 Minutes
Average                         3 Hours, 57 Minutes
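
The closing 3 Hours, 57 Minutes figure is consistent with the arithmetic mean of the 20 per-run durations listed above. A minimal Python sketch of that check follows; the helper names `to_minutes` and `mean_minutes` are illustrative and not part of the Phoronix Test Suite.

```python
import re

# Per-run test durations, copied from the run list above (runs a through nv 4090).
DURATIONS = [
    "3 Hours, 11 Minutes", "1 Hour, 30 Minutes", "1 Hour, 32 Minutes",
    "3 Hours, 45 Minutes", "3 Hours, 16 Minutes", "1 Hour, 53 Minutes",
    "2 Hours, 9 Minutes", "47 Minutes", "1 Hour, 50 Minutes",
    "2 Hours, 4 Minutes", "2 Hours, 7 Minutes", "2 Hours, 8 Minutes",
    "2 Hours, 9 Minutes", "2 Hours, 44 Minutes", "2 Hours, 54 Minutes",
    "4 Hours, 56 Minutes", "1 Day, 7 Hours, 27 Minutes",
    "2 Hours, 52 Minutes", "2 Hours, 54 Minutes", "2 Hours, 52 Minutes",
]

# Minutes per unit for the three units that appear in the run list.
UNIT_MINUTES = {"Day": 24 * 60, "Hour": 60, "Minute": 1}


def to_minutes(text):
    """Convert a duration such as '1 Day, 7 Hours, 27 Minutes' to minutes."""
    total = 0
    for amount, unit in re.findall(r"(\d+) (Day|Hour|Minute)s?", text):
        total += int(amount) * UNIT_MINUTES[unit]
    return total


def mean_minutes(durations):
    """Arithmetic mean of the given durations, in minutes."""
    return sum(to_minutes(d) for d in durations) / len(durations)


if __name__ == "__main__":
    hours, minutes = divmod(round(mean_minutes(DURATIONS)), 60)
    print(f"{hours} Hours, {minutes} Minutes")
```

The RTX 3070 Ti run (over a day) dominates this mean; the other 19 runs all finished in under five hours.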



vulkan-benchmarks: System Details

Common to all runs:
  Processor: AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)
  Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)
  Chipset: AMD Device 14d8
  Memory: 32GB
  Disk: Western Digital WD_BLACK SN850X 1000GB + 4001GB
  Monitor: ASUS MG28U
  Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
  OS: Ubuntu 23.04
  Kernel: 6.4.6-060406-generic (x86_64)
  Desktop: GNOME Shell 44.2
  Compiler: GCC 12.2.0
  File-System: ext4

Runs a, b, c:
  Graphics: AMD Radeon RX 6700 XT (2855/1000MHz)
  Audio: AMD Navi 21/23
  Display Server: X Server 1.21.1.7 + Wayland
  OpenGL: 4.6 Mesa 23.3~git2307260600.87109c~oibaf~l (git-87109c3 2023-07-26 lunar-oibaf-ppa) (LLVM 15.0.7 DRM 3.52)
  Screen Resolution: 3840x2160
  BAR1 / Visible vRAM Size: 12272 MB
  vBIOS Version: 113-D5121100-101

Runs d through nv 4090 (all NVIDIA runs):
  Display Server: X Server 1.21.1.7
  Display Driver: NVIDIA 535.86.05
  OpenGL: 4.6.0

Runs d, e:
  Graphics: MSI NVIDIA GeForce RTX 4060 8GB
  Audio: NVIDIA Device 22be
  Screen Resolution: 3840x2160
  BAR1 / Visible vRAM Size: 8192 MiB
  vBIOS Version: 95.07.31.00.e3

Runs f, g, h:
  Graphics: eVGA NVIDIA GeForce RTX 3060 12GB
  Audio: NVIDIA GA106 HD Audio
  Screen Resolution: 3840x2160
  BAR1 / Visible vRAM Size: 16384 MiB
  vBIOS Version: 94.06.14.40.46

Run i:
  Graphics: NVIDIA GeForce RTX 3060 Ti 8GB
  Audio: NVIDIA GA104 HD Audio
  Screen Resolution: 2560x1440
  BAR1 / Visible vRAM Size: 8192 MiB
  vBIOS Version: 94.04.25.00.2c

Runs 4080, 4080 rep, 4080 xxx, 4080 zzz:
  Graphics: NVIDIA GeForce RTX 4080 16GB
  Audio: NVIDIA Device 22bb
  Screen Resolution: 3840x2160
  BAR1 / Visible vRAM Size: 16384 MiB
  vBIOS Version: 95.03.0e.00.04

Runs 3090, 3090 rep:
  Graphics: NVIDIA GeForce RTX 3090 24GB
  Audio: NVIDIA GA102 HD Audio
  Screen Resolution: 3840x2160
  BAR1 / Visible vRAM Size: 32768 MiB
  vBIOS Version: 94.02.27.00.02

Run 3070:
  Graphics: NVIDIA GeForce RTX 3070 8GB
  Audio: NVIDIA GA104 HD Audio
  Screen Resolution: 2560x1440
  BAR1 / Visible vRAM Size: 8192 MiB
  vBIOS Version: 94.04.25.00.2b

Run RTX 3070 Ti:
  Graphics: NVIDIA GeForce RTX 3070 Ti 8GB
  Audio: NVIDIA GA104 HD Audio
  Screen Resolution: 2560x1440
  BAR1 / Visible vRAM Size: 8192 MiB
  vBIOS Version: 94.04.5b.00.02

Runs 4090, 4090 rep, nv 4090:
  Graphics: NVIDIA GeForce RTX 4090 24GB
  Audio: NVIDIA AD102 HD Audio
  Screen Resolution: 3840x2160
  BAR1 / Visible vRAM Size: 32768 MiB
  vBIOS Version: 95.02.20.00.01

Kernel Details
  - Transparent Huge Pages: madvise

Compiler Details
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  - a, b, c: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
  - d, e, f, g, h, i, 4080, 4080 rep, 4080 xxx, 4080 zzz, 3090, 3090 rep, 3070, RTX 3070 Ti, 4090, 4090 rep, nv 4090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203

Security Details
  - itlb_multihit: Not affected
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - mmio_stale_data: Not affected
  - retbleed: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl
  - spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  - spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected
  - srbds: Not affected
  - tsx_async_abort: Not affected

vulkan-benchmarks: Results Overview

Runs: a, b, c, d, e, f, g, h, i, 4080, 4080 rep, 4080 xxx, 4080 zzz, 3090, 3090 rep, 3070, RTX 3070 Ti, 4090, 4090 rep, nv 4090

Tests in the result table:
  vkpeak: fp32-scalar, fp32-vec4, fp16-scalar, fp16-vec4, fp64-scalar, fp64-vec4, int32-scalar, int32-vec4, int16-scalar, int16-vec4
  vkfft: FFT + iFFT R2C / C2R; FFT + iFFT C2C 1D batched in half precision; FFT + iFFT C2C Bluestein in single precision; FFT + iFFT C2C 1D batched in double precision; FFT + iFFT C2C 1D batched in single precision; FFT + iFFT C2C multidimensional in single precision; FFT + iFFT C2C Bluestein benchmark in double precision; FFT + iFFT C2C 1D batched in single precision, no reshuffling
  vkresample: 2x - Single; 2x - Double
  ncnn (CPU and Vulkan GPU targets, each repeated across several passes): mobilenet, mobilenet-v2, mobilenet-v3, shufflenet-v2, mnasnet, efficientnet-b0, blazeface, googlenet, vgg16, resnet18, alexnet, resnet50, yolov4-tiny, squeezenet_ssd, regnety_400m, vision_transformer, FastestDet

vkpeak

vkpeak 20230730: fp32-scalar (GFLOPS, More Is Better)

Run        GFLOPS     SE +/-   N   Min        Max
a          13190.09   4.18     3   13182.88   13197.36
b          12807.06
c          12860.56
d          8531.96    16.18    3   8515.52    8564.32
e          8515.58    0.30     3   8515.03    8516.06
f          6837.94
g          6812.99
h          6810.73
3090       21269.72
3090 rep   20925.30
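
The SE +/- figures can be cross-checked against the reported minimum, average, and maximum. The sketch below assumes that SE is the sample standard deviation divided by the square root of N; the result page does not state the formula, so this is an assumption. With N = 3, the one unreported sample follows from the average. The function names are illustrative only.

```python
import math
import statistics


def third_sample(minimum, average, maximum):
    """With N = 3, recover the middle sample from min, avg, and max."""
    return 3 * average - minimum - maximum


def standard_error(samples):
    """Assumed definition: sample standard deviation / sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


def se_from_summary(minimum, average, maximum):
    """Standard error reconstructed from a three-sample min/avg/max summary."""
    samples = [minimum, third_sample(minimum, average, maximum), maximum]
    return standard_error(samples)


if __name__ == "__main__":
    # fp32-scalar summary with average 13190.09 GFLOPS (reported SE +/- 4.18)
    print(round(se_from_summary(13182.88, 13190.09, 13197.36), 2))
    # fp32-scalar summary with average 8531.96 GFLOPS (reported SE +/- 16.18)
    print(round(se_from_summary(8515.52, 8531.96, 8564.32), 2))
```

Because the averages are rounded to two decimals, the reconstructed middle sample is approximate, so agreement with the reported SE is only expected to within rounding.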

fp32-scalar

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730: fp32-vec4 (GFLOPS, More Is Better)

Run        GFLOPS     SE +/-   N   Min        Max
a          12730.08   1.81     3   12728.22   12733.7
b          12808.59
c          12822.01
d          11251.17   19.37    3   11226.91   11289.45
e          11231.72   2.57     3   11226.91   11235.71
f          9006.57
g          9002.59
h          9036.17
3090       27797.80
3090 rep   27807.58

fp32-vec4

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730: fp16-scalar (GFLOPS, More Is Better)

Run        GFLOPS     SE +/-   N   Min        Max
a          13154.15   4.01     3   13147.3    13161.18
b          13145.19
c          13136.79
d          8412.33    13.46    3   8397.02    8439.17
e          8397.80    5.09     3   8387.64    8403.41
f          6812.52
g          6811.35
h          6838.32
3090       20845.09
3090 rep   20953.30

fp16-scalar

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730: fp16-vec4 (GFLOPS, More Is Better)

Run        GFLOPS     SE +/-   N   Min        Max
a          23232.42   5.96     3   23221.66   23242.25
b          23390.44
c          23387.26
d          16864.47   0.37     3   16863.77   16865.01
e          16865.29   0.36     3   16864.58   16865.77
f          13440.97
g          13438.47
h          13490.24
3090       41149.10
3090 rep   41188.02

fp16-vec4

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730: fp64-scalar (GFLOPS, More Is Better)

Run        GFLOPS   SE +/-   N   Min      Max
a          841.40   0.22     3   841.09   841.83
b          839.20
c          839.01
d          267.43   0.01     3   267.42   267.45
e          267.41   0.00     3   267.41   267.41
f          214.17
g          213.96
h          213.37
3090       653.13
3090 rep   653.63

fp64-scalar

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730: fp64-vec4 (GFLOPS, More Is Better)

Run    GFLOPS   SE +/-   N   Min      Max
a      841.80   0.32     3   841.29   842.39
b      836.55
c      836.16
d      267.74   0.48     3   267.25   268.7
e      267.25   0.00     3   267.25   267.26
f      214.23
g      213.95
h      210.96
3090   653.15

fp64-vec4

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730: int32-scalar (GIOPS, More Is Better)

Run        GIOPS      SE +/-   N   Min       Max
a          2272.62    0.34     3   2272.07   2273.25
b          2269.25
c          2269.06
d          8520.02    15.02    3   8504.83   8550.06
e          8505.20    0.03     3   8505.14   8505.25
f          6827.92
g          6824.29
h          6800.60
3090       20909.02
3090 rep   20767.64

int32-scalar

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730 - int32-vec4 (GIOPS, More Is Better)
a: 2658.73 (SE +/- 0.26, N = 3; Min: 2658.24 / Avg: 2658.73 / Max: 2659.12)
b: 2640.08
c: 2638.69
d: 8465.82 (SE +/- 0.19, N = 3; Min: 8465.46 / Avg: 8465.82 / Max: 8466.09)
e: 8465.71 (SE +/- 0.05, N = 3; Min: 8465.65 / Avg: 8465.71 / Max: 8465.81)
f: 6800.17
g: 6794.92
h: 6772.98
3090: 20820.09
3090 rep: 20517.68

int32-vec4

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730 - int16-scalar (GIOPS, More Is Better)
a: 13102.75 (SE +/- 1.30, N = 3; Min: 13101.22 / Avg: 13102.75 / Max: 13105.34)
b: 13070.81
c: 13063.86
d: 5676.02 (SE +/- 0.09, N = 3; Min: 5675.86 / Avg: 5676.02 / Max: 5676.17)
e: 5675.99 (SE +/- 0.02, N = 3; Min: 5675.95 / Avg: 5675.99 / Max: 5676.02)
f: 4480.59
g: 4479.22
h: 4495.98
3090: 13710.88
3090 rep: 13608.57

int16-scalar

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

vkpeak 20230730 - int16-vec4 (GIOPS, More Is Better)
a: 23123.77 (SE +/- 21.55, N = 3; Min: 23080.77 / Avg: 23123.77 / Max: 23147.93)
b: 23396.59
c: 23385.44
d: 7352.85 (SE +/- 17.33, N = 3; Min: 7332.02 / Avg: 7352.85 / Max: 7387.26)
e: 7336.25 (SE +/- 0.31, N = 3; Min: 7335.63 / Avg: 7336.25 / Max: 7336.63)
f: 5959.75
g: 5956.38
h: 5978.38
3090: 16886.66
3090 rep: 16881.47

int16-vec4

i: The test quit with a non-zero exit status.

4080: The test quit with a non-zero exit status.

4080 rep: The test quit with a non-zero exit status.

4080 xxx: The test quit with a non-zero exit status.

4080 zzz: The test quit with a non-zero exit status.

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

4090: The test quit with a non-zero exit status.

4090 rep: The test quit with a non-zero exit status.

nv 4090: The test quit with a non-zero exit status.

VkFFT

VkFFT 1.2.31 - Test: FFT + iFFT R2C / C2R (Benchmark Score, More Is Better)
a: 42105 (SE +/- 200.55, N = 3; Min: 41846 / Avg: 42105.33 / Max: 42500)
b: 42163
c: 43021
d: 35399 (SE +/- 118.74, N = 3; Min: 35162 / Avg: 35399.33 / Max: 35525)
e: 35304 (SE +/- 3.71, N = 3; Min: 35297 / Avg: 35304.33 / Max: 35309)
f: 26593
g: 26638
h: 26524
i: 33727
4080: 66473 (SE +/- 796.66, N = 3; Min: 65416 / Avg: 66473 / Max: 68034)
4080 rep: 68279
4080 xxx: 69068
4080 zzz: 67689
3090: 55347
3090 rep: 54432
4090: 84351
4090 rep: 81329
nv 4090: 84887
1. (CXX) g++ options: -O3

Test: FFT + iFFT R2C / C2R

3070: The test quit with a non-zero exit status. E: VkFFT System: 512x512x128 Buffer: 128 MB avg_time_per_step: 4.833 ms std_error: 0.038 num_iter: 31 benchmark: 27226 bandwidth: 311.6

RTX 3070 Ti: The test quit with a non-zero exit status. E: VkFFT System: 512x512x128 Buffer: 128 MB avg_time_per_step: 3.494 ms std_error: 0.002 num_iter: 31 benchmark: 37664 bandwidth: 431.0
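
Although the 3070 and RTX 3070 Ti runs exited with a non-zero status, the text after "E:" still carries the VkFFT figures for those cards. As a rough sketch, the fields can be pulled out of such a line with a regular expression. The helper name and dictionary keys below are illustrative assumptions, the pattern is written against the field order seen on this page only, and the source does not state the unit of the bandwidth field.

```python
import re

# Pattern written against the field order seen in the failure notices on this page.
VKFFT_LINE = re.compile(
    r"VkFFT System: (?P<system>\S+) "
    r"Buffer: (?P<buffer_mb>\d+) MB "
    r"avg_time_per_step: (?P<avg_ms>[\d.]+) ms "
    r"std_error: (?P<std_error>[\d.]+) "
    r"num_iter: (?P<num_iter>\d+) "
    r"benchmark: (?P<benchmark>\d+) "
    r"bandwidth: (?P<bandwidth>[\d.]+)"
)


def parse_vkfft_notice(line):
    """Return the VkFFT fields from a failure notice, or None if absent."""
    match = VKFFT_LINE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    return {
        "system": fields["system"],
        "buffer_mb": int(fields["buffer_mb"]),
        "avg_ms": float(fields["avg_ms"]),
        "std_error": float(fields["std_error"]),
        "num_iter": int(fields["num_iter"]),
        "benchmark": int(fields["benchmark"]),
        "bandwidth": float(fields["bandwidth"]),  # unit not given in the source
    }


notice = (
    "3070: The test quit with a non-zero exit status. "
    "E: VkFFT System: 512x512x128 Buffer: 128 MB avg_time_per_step: 4.833 ms "
    "std_error: 0.038 num_iter: 31 benchmark: 27226 bandwidth: 311.6"
)
print(parse_vkfft_notice(notice)["benchmark"])  # prints 27226
```

Notices that carry no "E:" payload make the helper return None, so they can be skipped when collecting scores.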

VkFFT 1.2.31 - Test: FFT + iFFT C2C 1D batched in half precision (Benchmark Score, More Is Better)
a: 91597 (SE +/- 83.55, N = 3; Min: 91448 / Avg: 91597 / Max: 91737)
b: 91812
c: 91744
d: 85181 (SE +/- 18.50, N = 3; Min: 85148 / Avg: 85181 / Max: 85212)
e: 85191 (SE +/- 26.03, N = 3; Min: 85148 / Avg: 85191.33 / Max: 85238)
f: 104146
g: 104171
h: 104298
i: 132270
4080: 211076 (SE +/- 133.47, N = 3; Min: 210885 / Avg: 211076 / Max: 211333)
4080 rep: 211058
4080 xxx: 210713
4080 zzz: 210991
3090: 255207
3090 rep: 265171
4090: 290342
4090 rep: 287651
nv 4090: 292768
1. (CXX) g++ options: -O3

Test: FFT + iFFT C2C 1D batched in half precision

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

VkFFT 1.2.31 - Test: FFT + iFFT C2C Bluestein in single precision (Benchmark Score, More Is Better)
a: 11340 (SE +/- 62.67, N = 3; Min: 11277 / Avg: 11339.67 / Max: 11465)
b: 11273
c: 11311
d: 10719 (SE +/- 72.34, N = 3; Min: 10624 / Avg: 10719 / Max: 10861)
e: 10560 (SE +/- 75.16, N = 15; Min: 9874 / Avg: 10560.27 / Max: 10899)
f: 7571
g: 7574
h: 7622
i: 10061
4080: 17121 (SE +/- 83.38, N = 3; Min: 16955 / Avg: 17120.67 / Max: 17220)
4080 rep: 17287
4080 xxx: 17343
4080 zzz: 17185
3090: 14406
3090 rep: 14449
4090: 20373
4090 rep: 20404
nv 4090: 20601
1. (CXX) g++ options: -O3

Test: FFT + iFFT C2C Bluestein in single precision

3070: The test quit with a non-zero exit status. E: VkFFT System: 4241x4241x1 Buffer: 137 MB avg_time_per_step: 16.022 ms std_error: 0.041 num_iter: 29 benchmark: 8770 bandwidth: 133.8

RTX 3070 Ti: The test quit with a non-zero exit status. E: VkFFT System: 4241x4241x1 Buffer: 137 MB avg_time_per_step: 10.602 ms std_error: 0.177 num_iter: 29 benchmark: 13253 bandwidth: 202.2

VkFFT 1.2.31 - Test: FFT + iFFT C2C 1D batched in double precision (Benchmark Score, More Is Better)
a: 20816 (SE +/- 11.67, N = 3; Min: 20794 / Avg: 20815.67 / Max: 20834)
b: 20822
c: 20847
d: 12143 (SE +/- 10.58, N = 3; Min: 12123 / Avg: 12143 / Max: 12159)
e: 12168 (SE +/- 12.42, N = 3; Min: 12146 / Avg: 12168 / Max: 12189)
f: 10561
g: 10548
h: 10572
i: 14780
4080: 34974 (SE +/- 14.62, N = 3; Min: 34951 / Avg: 34973.67 / Max: 35001)
4080 rep: 35038
4080 xxx: 35071
4080 zzz: 35058
3090: 30945
3090 rep: 31122
4090: 55214
4090 rep: 55383
nv 4090: 54950
1. (CXX) g++ options: -O3

Test: FFT + iFFT C2C 1D batched in double precision

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

VkFFT 1.2.31 - Test: FFT + iFFT C2C 1D batched in single precision (Benchmark Score, More Is Better)
a: 47887 (SE +/- 9.54, N = 3; Min: 47871 / Avg: 47887 / Max: 47904)
b: 47948
c: 47971
d: 42645 (SE +/- 2.73, N = 3; Min: 42641 / Avg: 42644.67 / Max: 42650)
e: 42651 (SE +/- 1.67, N = 3; Min: 42648 / Avg: 42651.33 / Max: 42653)
f: 56476
g: 56455
h: 56431
i: 69738
4080: 104556 (SE +/- 25.50, N = 3; Min: 104505 / Avg: 104556 / Max: 104582)
4080 rep: 104491
4080 xxx: 104528
4080 zzz: 104543
3090: 141357
3090 rep: 141437
4090: 153896
4090 rep: 153939
nv 4090: 152170
1. (CXX) g++ options: -O3

Test: FFT + iFFT C2C 1D batched in single precision

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

VkFFT 1.2.31 - Test: FFT + iFFT C2C multidimensional in single precision (Benchmark Score, More Is Better)
a: 33001 (SE +/- 57.83, N = 3; Min: 32887 / Avg: 33000.67 / Max: 33076)
b: 32751
c: 32812
d: 36328 (SE +/- 116.12, N = 3; Min: 36118 / Avg: 36327.67 / Max: 36519)
e: 37090 (SE +/- 437.33, N = 3; Min: 36227 / Avg: 37089.67 / Max: 37646)
f: 26238
g: 26541
i: 34686
4080: 65869 (SE +/- 555.86, N = 3; Min: 65213 / Avg: 65868.67 / Max: 66974)
4080 rep: 70068
4080 xxx: 67887
4080 zzz: 70040
3090: 51005
3090 rep: 54814
4090: 81406
4090 rep: 80999
nv 4090: 82875
1. (CXX) g++ options: -O3

Test: FFT + iFFT C2C multidimensional in single precision

3070: The test quit with a non-zero exit status. E: VkFFT System: 3840x2160x1 Buffer: 63 MB avg_time_per_step: 2.236 ms std_error: 0.035 num_iter: 64 benchmark: 28982 bandwidth: 331.7

RTX 3070 Ti: The test quit with a non-zero exit status. E: VkFFT System: 3840x2160x1 Buffer: 63 MB avg_time_per_step: 1.462 ms std_error: 0.004 num_iter: 64 benchmark: 44332 bandwidth: 507.3

VkFFT 1.2.31 - Test: FFT + iFFT C2C Bluestein benchmark in double precision (Benchmark Score, More Is Better)
a: 4717 (SE +/- 0.33, N = 3; Min: 4717 / Avg: 4717.33 / Max: 4718)
b: 4695
c: 4670
d: 2346 (SE +/- 4.37, N = 3; Min: 2341 / Avg: 2346.33 / Max: 2355)
e: 2343 (SE +/- 11.20, N = 3; Min: 2330 / Avg: 2342.67 / Max: 2365)
f: 1814
g: 1818
i: 2417
4080: 5579
4080 rep: 5583
4080 xxx: 5587
4080 zzz: 5584
3090: 4282
3090 rep: 4289
4090: 8039
4090 rep: 8119
nv 4090: 8132
1. (CXX) g++ options: -O3

Test: FFT + iFFT C2C Bluestein benchmark in double precision

3070: The test quit with a non-zero exit status. E: VkFFT System: 2909x2909x1 Buffer: 129 MB avg_time_per_step: 72.294 ms std_error: 0.046 num_iter: 31 benchmark: 1828 bandwidth: 27.9

RTX 3070 Ti: The test quit with a non-zero exit status. E: VkFFT System: 2909x2909x1 Buffer: 129 MB avg_time_per_step: 66.798 ms std_error: 0.291 num_iter: 31 benchmark: 1979 bandwidth: 30.2

VkFFT 1.2.31 - Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling (Benchmark Score, More Is Better)
a: 50504 (SE +/- 8.89, N = 3; Min: 50491 / Avg: 50504 / Max: 50521)
b: 50643
c: 50596
d: 43365 (SE +/- 2.33, N = 3; Min: 43361 / Avg: 43364.67 / Max: 43369)
e: 43365 (SE +/- 2.08, N = 3; Min: 43361 / Avg: 43365 / Max: 43368)
f: 57110
g: 57094
i: 71163
4080: 106210
4080 rep: 106205
4080 xxx: 106099
4080 zzz: 105926
3090: 143969
3090 rep: 143956
4090: 152656
4090 rep: 155936
nv 4090: 155148
1. (CXX) g++ options: -O3

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

3070: The test quit with a non-zero exit status.

RTX 3070 Ti: The test quit with a non-zero exit status.

VkResample

VkResample 1.0 - Upscale: 2x - Precision: Single (ms, Fewer Is Better)
a: 11.686 (SE +/- 0.001, N = 3; Min: 11.68 / Avg: 11.69 / Max: 11.69)
b: 11.690
c: 11.688
d: 32.855 (SE +/- 0.004, N = 3; Min: 32.85 / Avg: 32.86 / Max: 32.86)
e: 32.850 (SE +/- 0.000, N = 3; Min: 32.85 / Avg: 32.85 / Max: 32.85)
f: 26.738
g: 26.769
i: 20.930
4080: 13.136
4080 rep: 13.136
4080 xxx: 13.137
4080 zzz: 13.126
3090: 10.399
3090 rep: 10.428
3070: 22.064
RTX 3070 Ti: 27.183 (SE +/- 0.029, N = 3; Min: 27.13 / Avg: 27.18 / Max: 27.23)
4090: 9.284
4090 rep: 8.962
nv 4090: 8.967
1. (CXX) g++ options: -O3

NCNN

NCNN 20230517 - Target: CPU - Model: mobilenet (ms, Fewer Is Better)
a: 8.05 (SE +/- 0.03, N = 3; Min: 8.02 / Avg: 8.05 / Max: 8.1); MIN: 7.97 / MAX: 9.07
b: 7.97; MIN: 7.94 / MAX: 8.26
c: 8.02; MIN: 7.98 / MAX: 8.33
d: 8.10 (SE +/- 0.06, N = 3; Min: 8.01 / Avg: 8.1 / Max: 8.2); MIN: 7.94 / MAX: 14.4
f: 8.45; MIN: 8.37 / MAX: 9.44
g: 22.74; MIN: 8.24 / MAX: 1264.67
i: 8.37; MIN: 8.15 / MAX: 9.75
4080: 8.73; MIN: 8.15 / MAX: 10.96
4080 rep: 8.41; MIN: 8.14 / MAX: 11.03
4080 xxx: 8.37; MIN: 7.96 / MAX: 9.72
4080 zzz: 8.38; MIN: 7.94 / MAX: 10.16
3090: 8.60; MIN: 8.5 / MAX: 13.72
3090 rep: 8.01; MIN: 7.96 / MAX: 9.85
3070: 17.81; MIN: 8.05 / MAX: 159.41
RTX 3070 Ti: 9.43 (SE +/- 0.22, N = 15; Min: 8.33 / Avg: 9.43 / Max: 10.63); MIN: 7.95 / MAX: 398.1
4090: 10.08; MIN: 8.1 / MAX: 118.32
4090 rep: 8.43; MIN: 8.04 / MAX: 18.04
nv 4090: 8.15; MIN: 7.73 / MAX: 9.34
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
a: 3.16 (SE +/- 0.00, N = 3; Min: 3.15 / Avg: 3.16 / Max: 3.16); MIN: 3.1 / MAX: 3.8
b: 3.16; MIN: 3.11 / MAX: 3.61
c: 3.18; MIN: 3.13 / MAX: 3.84
d: 3.17 (SE +/- 0.02, N = 3; Min: 3.15 / Avg: 3.17 / Max: 3.2); MIN: 3.1 / MAX: 8.86
f: 3.15; MIN: 3.1 / MAX: 3.65
g: 3.16; MIN: 3.11 / MAX: 3.83
i: 3.52; MIN: 3.29 / MAX: 19.18
4080: 3.28; MIN: 3.11 / MAX: 3.88
4080 rep: 3.28; MIN: 3.11 / MAX: 4
4080 xxx: 3.28; MIN: 3.1 / MAX: 4.05
4080 zzz: 3.25; MIN: 3.09 / MAX: 4.51
3090: 3.17; MIN: 3.12 / MAX: 4.05
3090 rep: 3.17; MIN: 3.11 / MAX: 4.94
3070: 9.67; MIN: 3.19 / MAX: 225.84
RTX 3070 Ti: 3.76 (SE +/- 0.17, N = 15; Min: 3.28 / Avg: 3.76 / Max: 4.95); MIN: 2.6 / MAX: 364.73
4090: 3.30; MIN: 3.11 / MAX: 4.81
4090 rep: 4.74; MIN: 3.09 / MAX: 140.79
nv 4090: 3.60; MIN: 3.43 / MAX: 4.62
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
a: 3.34 (SE +/- 0.01, N = 3; Min: 3.32 / Avg: 3.34 / Max: 3.36); MIN: 3.3 / MAX: 3.85
b: 3.34; MIN: 3.31 / MAX: 3.77
c: 3.35; MIN: 3.31 / MAX: 3.8
d: 3.35 (SE +/- 0.01, N = 3; Min: 3.34 / Avg: 3.35 / Max: 3.36); MIN: 3.3 / MAX: 3.82
f: 3.55; MIN: 3.27 / MAX: 22.86
g: 3.59; MIN: 3.3 / MAX: 25.28
i: 5.03; MIN: 3.07 / MAX: 228.55
4080: 3.41; MIN: 3.28 / MAX: 4.87
4080 rep: 3.43; MIN: 3.3 / MAX: 4.15
4080 xxx: 3.45; MIN: 3.32 / MAX: 3.85
4080 zzz: 3.42; MIN: 3.28 / MAX: 4.19
3090: 3.34; MIN: 3.3 / MAX: 4.19
3090 rep: 3.35; MIN: 3.31 / MAX: 3.68
3070: 6.82; MIN: 3.16 / MAX: 64.72
RTX 3070 Ti: 3.77 (SE +/- 0.19, N = 15; Min: 3.16 / Avg: 3.77 / Max: 5.55); MIN: 3.02 / MAX: 511.95
4090: 3.55; MIN: 3.39 / MAX: 5.48
4090 rep: 3.51; MIN: 3.38 / MAX: 5.4
nv 4090: 3.45; MIN: 3.32 / MAX: 4.91
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: mnasnet (ms, Fewer Is Better)
a: 2.97 (SE +/- 0.01, N = 3; Min: 2.95 / Avg: 2.97 / Max: 2.98); MIN: 2.92 / MAX: 3.48
b: 2.97; MIN: 2.93 / MAX: 3.45
c: 2.99; MIN: 2.96 / MAX: 3.44
d: 2.98 (SE +/- 0.01, N = 3; Min: 2.97 / Avg: 2.98 / Max: 2.99); MIN: 2.94 / MAX: 3.83
f: 2.97; MIN: 2.93 / MAX: 3.95
g: 3.00; MIN: 2.96 / MAX: 3.68
i: 2.74; MIN: 2.62 / MAX: 4.22
4080: 3.05; MIN: 2.92 / MAX: 3.82
4080 rep: 3.06; MIN: 2.94 / MAX: 4.51
4080 xxx: 3.06; MIN: 2.94 / MAX: 4.45
4080 zzz: 3.04; MIN: 2.91 / MAX: 4.47
3090: 2.99; MIN: 2.95 / MAX: 3.88
3090 rep: 2.97; MIN: 2.93 / MAX: 3.28
3070: 6.88; MIN: 3.05 / MAX: 110.25
RTX 3070 Ti: 3.26 (SE +/- 0.14, N = 15; Min: 2.57 / Avg: 3.26 / Max: 4.74); MIN: 2.46 / MAX: 277.54
4090: 3.12; MIN: 2.98 / MAX: 3.79
4090 rep: 3.22; MIN: 3.11 / MAX: 3.71
nv 4090: 4.77; MIN: 3.07 / MAX: 97.57
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
a: 3.90 (SE +/- 0.05, N = 3; Min: 3.85 / Avg: 3.9 / Max: 3.99); MIN: 3.82 / MAX: 4.51
b: 3.85; MIN: 3.81 / MAX: 4.42
c: 3.88; MIN: 3.84 / MAX: 4.41
d: 3.85 (SE +/- 0.00, N = 3; Min: 3.85 / Avg: 3.85 / Max: 3.85); MIN: 3.81 / MAX: 4.46
f: 3.87; MIN: 3.81 / MAX: 4.97
g: 3.91; MIN: 3.85 / MAX: 4.64
i: 4.05; MIN: 3.78 / MAX: 5.45
4080: 3.99; MIN: 3.79 / MAX: 5.83
4080 rep: 4.09; MIN: 3.86 / MAX: 5.59
4080 xxx: 4.04; MIN: 3.83 / MAX: 5.71
4080 zzz: 3.99; MIN: 3.8 / MAX: 5.69
3090: 3.87; MIN: 3.83 / MAX: 4.69
3090 rep: 3.86; MIN: 3.81 / MAX: 4.75
3070: 9.23; MIN: 3.43 / MAX: 156.19
RTX 3070 Ti: 4.72 (SE +/- 0.19, N = 15; Min: 4.06 / Avg: 4.72 / Max: 6.2); MIN: 3.37 / MAX: 486.93
4090: 4.23; MIN: 3.98 / MAX: 12.23
4090 rep: 4.03; MIN: 3.86 / MAX: 4.82
nv 4090: 4.37; MIN: 4.15 / MAX: 5.96
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: blazeface (ms, Fewer Is Better)
a: 1.38 (SE +/- 0.00, N = 3; Min: 1.37 / Avg: 1.38 / Max: 1.38); MIN: 1.35 / MAX: 2.06
b: 1.38; MIN: 1.35 / MAX: 1.67
c: 1.39; MIN: 1.36 / MAX: 1.53
d: 1.38 (SE +/- 0.01, N = 3; Min: 1.37 / Avg: 1.38 / Max: 1.39); MIN: 1.35 / MAX: 2.05
f: 1.37; MIN: 1.34 / MAX: 2.11
g: 1.38; MIN: 1.36 / MAX: 1.62
i: 1.40; MIN: 1.34 / MAX: 2
4080: 1.40; MIN: 1.34 / MAX: 2.15
4080 rep: 1.42; MIN: 1.35 / MAX: 1.88
4080 xxx: 1.42; MIN: 1.36 / MAX: 2.02
4080 zzz: 1.40; MIN: 1.34 / MAX: 2.1
3090: 1.38; MIN: 1.35 / MAX: 2.23
3090 rep: 1.38; MIN: 1.36 / MAX: 1.71
3070: 2.98; MIN: 1.29 / MAX: 144.96
RTX 3070 Ti: 1.60 (SE +/- 0.14, N = 15; Min: 1.19 / Avg: 1.6 / Max: 3.18); MIN: 0.95 / MAX: 433.24
4090: 1.27; MIN: 1.21 / MAX: 1.95
4090 rep: 1.45; MIN: 1.38 / MAX: 2.96
nv 4090: 1.33; MIN: 1.27 / MAX: 1.77
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: googlenet (ms, Fewer Is Better)
a: 7.94 (SE +/- 0.11, N = 3; Min: 7.8 / Avg: 7.94 / Max: 8.15); MIN: 7.71 / MAX: 8.73
b: 7.82; MIN: 7.73 / MAX: 8.65
c: 7.93; MIN: 7.82 / MAX: 8.91
d: 7.85 (SE +/- 0.01, N = 3; Min: 7.84 / Avg: 7.85 / Max: 7.86); MIN: 7.71 / MAX: 8.83
f: 7.92; MIN: 7.8 / MAX: 8.96
g: 7.98; MIN: 7.86 / MAX: 8.78
i: 10.30; MIN: 8.19 / MAX: 349.57
4080: 8.42; MIN: 7.75 / MAX: 9.96
4080 rep: 8.52; MIN: 7.84 / MAX: 10.21
4080 xxx: 8.38; MIN: 7.72 / MAX: 10.05
4080 zzz: 8.40; MIN: 7.72 / MAX: 10.5
3090: 7.86; MIN: 7.76 / MAX: 8.74
3090 rep: 7.90; MIN: 7.79 / MAX: 8.74
3070: 18.25; MIN: 7.5 / MAX: 267.89
RTX 3070 Ti: 9.69 (SE +/- 0.22, N = 15; Min: 8.51 / Avg: 9.69 / Max: 10.75); MIN: 7.29 / MAX: 407.61
4090: 10.27; MIN: 7.95 / MAX: 115.68
4090 rep: 9.53; MIN: 8.86 / MAX: 11.44
nv 4090: 8.93; MIN: 8.27 / MAX: 10.68
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: vgg16 (ms, Fewer Is Better)
a: 23.75 (SE +/- 0.30, N = 3; Min: 23.44 / Avg: 23.75 / Max: 24.35); MIN: 23.31 / MAX: 25.12
b: 23.49; MIN: 23.36 / MAX: 24.62
c: 23.45; MIN: 23.26 / MAX: 24.51
d: 23.51 (SE +/- 0.05, N = 3; Min: 23.46 / Avg: 23.51 / Max: 23.6); MIN: 23.19 / MAX: 24.68
f: 24.19; MIN: 23.99 / MAX: 30.98
g: 23.78; MIN: 23.52 / MAX: 24.89
i: 30.96; MIN: 25.92 / MAX: 328.63
4080: 25.37; MIN: 24.26 / MAX: 36.52
4080 rep: 26.11; MIN: 24.54 / MAX: 30.29
4080 xxx: 25.03; MIN: 23.85 / MAX: 28.9
4080 zzz: 25.40; MIN: 24.09 / MAX: 32.86
3090: 23.43; MIN: 23.2 / MAX: 24.1
3090 rep: 23.43; MIN: 23.23 / MAX: 24.39
3070: 56.64; MIN: 25.75 / MAX: 367.74
RTX 3070 Ti: 28.36 (SE +/- 0.23, N = 15; Min: 26.77 / Avg: 28.36 / Max: 29.57); MIN: 24.13 / MAX: 449.57
4090: 27.75; MIN: 24.58 / MAX: 282.59
4090 rep: 28.19; MIN: 24.69 / MAX: 205.72
nv 4090: 29.54; MIN: 24.77 / MAX: 364.86
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
a: 5.29 (SE +/- 0.07, N = 3; Min: 5.21 / Avg: 5.29 / Max: 5.42); MIN: 5.09 / MAX: 6.29
b: 5.20; MIN: 5.1 / MAX: 5.9
c: 5.24; MIN: 5.15 / MAX: 6.09
d: 5.23 (SE +/- 0.02, N = 3; Min: 5.21 / Avg: 5.23 / Max: 5.26); MIN: 5.1 / MAX: 6.28
f: 5.69; MIN: 5.22 / MAX: 92.59
g: 5.55; MIN: 5.19 / MAX: 25.4
i: 5.60; MIN: 5.13 / MAX: 6.83
4080: 5.69; MIN: 5.16 / MAX: 7.68
4080 rep: 5.68; MIN: 5.17 / MAX: 7.45
4080 xxx: 5.56; MIN: 5.09 / MAX: 6.84
4080 zzz: 5.63; MIN: 5.08 / MAX: 7.55
3090: 5.21; MIN: 5.09 / MAX: 6.04
3090 rep: 5.29; MIN: 5.18 / MAX: 6.19
3070: 14.03; MIN: 5 / MAX: 303.38
RTX 3070 Ti: 6.08 (SE +/- 0.17, N = 15; Min: 5.41 / Avg: 6.08 / Max: 7.41); MIN: 4.97 / MAX: 245.95
4090: 6.00; MIN: 5.47 / MAX: 7.29
4090 rep: 8.07; MIN: 5.86 / MAX: 121.03
nv 4090: 7.82; MIN: 5.54 / MAX: 303.05
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
a: 4.41 (SE +/- 0.11, N = 3; Min: 4.3 / Avg: 4.41 / Max: 4.63); MIN: 4.24 / MAX: 5.16
b: 4.32; MIN: 4.26 / MAX: 5.15
c: 4.31; MIN: 4.26 / MAX: 4.98
d: 4.30 (SE +/- 0.01, N = 3; Min: 4.28 / Avg: 4.3 / Max: 4.31); MIN: 4.23 / MAX: 5.32
f: 4.36; MIN: 4.29 / MAX: 5.7
g: 4.32; MIN: 4.25 / MAX: 5.17
i: 5.30; MIN: 4.92 / MAX: 7.18
4080: 4.75; MIN: 4.31 / MAX: 13.88
4080 rep: 4.67; MIN: 4.27 / MAX: 5.88
4080 xxx: 4.68; MIN: 4.28 / MAX: 6.37
4080 zzz: 4.69; MIN: 4.29 / MAX: 5.78
3090: 4.30; MIN: 4.25 / MAX: 4.83
3090 rep: 4.31; MIN: 4.26 / MAX: 5.18
3070: 9.62; MIN: 4.31 / MAX: 147.6
RTX 3070 Ti: 5.49 (SE +/- 0.21, N = 14; Min: 4.63 / Avg: 5.49 / Max: 6.62); MIN: 4.26 / MAX: 363.39
4090: 5.14; MIN: 4.73 / MAX: 6.32
4090 rep: 5.14; MIN: 4.76 / MAX: 6.26
nv 4090: 5.18; MIN: 4.75 / MAX: 7.12
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: resnet50 (ms, Fewer Is Better)
a: 10.20 (SE +/- 0.23, N = 3; Min: 9.94 / Avg: 10.2 / Max: 10.65); MIN: 9.84 / MAX: 12.48
b: 10.01; MIN: 9.85 / MAX: 11.06
c: 10.11; MIN: 9.95 / MAX: 16.18
d: 10.00 (SE +/- 0.01, N = 3; Min: 9.99 / Avg: 10 / Max: 10.02); MIN: 9.86 / MAX: 11.02
f: 11.05; MIN: 10.14 / MAX: 162.88
g: 10.72; MIN: 10.1 / MAX: 108.3
i: 12.09; MIN: 11.16 / MAX: 13.48
4080: 11.16; MIN: 10.29 / MAX: 15.03
4080 rep: 11.76; MIN: 10.68 / MAX: 44.94
4080 xxx: 10.82; MIN: 9.9 / MAX: 12.26
4080 zzz: 11.10; MIN: 10.2 / MAX: 13.06
3090: 10.30; MIN: 9.82 / MAX: 17.56
3090 rep: 9.95; MIN: 9.85 / MAX: 10.72
3070: 21.50; MIN: 10.24 / MAX: 116.85
RTX 3070 Ti: 12.73 (SE +/- 0.26, N = 15; Min: 11.38 / Avg: 12.73 / Max: 14.26); MIN: 10.18 / MAX: 541.92
4090: 14.10; MIN: 10.27 / MAX: 287
4090 rep: 12.73; MIN: 10.22 / MAX: 181.72
nv 4090: 11.41; MIN: 10.57 / MAX: 12.22
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
a: 12.90 (SE +/- 0.11, N = 3; Min: 12.79 / Avg: 12.9 / Max: 13.12); MIN: 12.69 / MAX: 15.88
b: 12.74; MIN: 12.66 / MAX: 13.28
c: 12.81; MIN: 12.74 / MAX: 13.2
d: 12.95 (SE +/- 0.05, N = 3; Min: 12.87 / Avg: 12.95 / Max: 13.03); MIN: 12.75 / MAX: 18.88
f: 13.32; MIN: 12.95 / MAX: 35.49
g: 17.23; MIN: 12.99 / MAX: 196.66
i: 14.65; MIN: 12.44 / MAX: 202.68
4080: 13.85; MIN: 12.84 / MAX: 16.75
4080 rep: 14.03; MIN: 13.15 / MAX: 15.97
4080 xxx: 13.60; MIN: 12.8 / MAX: 16.23
4080 zzz: 13.63; MIN: 12.77 / MAX: 15.36
3090: 14.26; MIN: 14.17 / MAX: 14.53
3090 rep: 12.77; MIN: 12.7 / MAX: 13.02
3070: 27.66; MIN: 12.74 / MAX: 294.9
RTX 3070 Ti: 15.20 (SE +/- 0.18, N = 15; Min: 13.73 / Avg: 15.2 / Max: 16.17); MIN: 12.69 / MAX: 431.37
4090: 13.68; MIN: 12.83 / MAX: 14.63
4090 rep: 15.38; MIN: 12.32 / MAX: 188.07
nv 4090: 15.40; MIN: 12.35 / MAX: 321.43
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
a: 7.07 (SE +/- 0.01, N = 3; Min: 7.05 / Avg: 7.07 / Max: 7.09); MIN: 7.01 / MAX: 8.07
b: 7.06; MIN: 7.01 / MAX: 7.55
c: 7.10; MIN: 7.05 / MAX: 7.65
d: 7.09 (SE +/- 0.01, N = 3; Min: 7.07 / Avg: 7.09 / Max: 7.11); MIN: 6.99 / MAX: 9.39
f: 7.23; MIN: 7.15 / MAX: 8.02
g: 7.13; MIN: 7.04 / MAX: 8.43
i: 8.96; MIN: 6.92 / MAX: 244.02
4080: 7.73; MIN: 7.13 / MAX: 9.7
4080 rep: 7.86; MIN: 7.22 / MAX: 10.84
4080 xxx: 7.62; MIN: 7.01 / MAX: 8.84
4080 zzz: 7.55; MIN: 7 / MAX: 8.72
3090: 7.52; MIN: 7.45 / MAX: 7.74
3090 rep: 7.12; MIN: 7.05 / MAX: 7.63
3070: 13.20; MIN: 6.9 / MAX: 68.61
RTX 3070 Ti: 8.13 (SE +/- 0.24, N = 15; Min: 6.83 / Avg: 8.13 / Max: 9.52); MIN: 6.37 / MAX: 399.11
4090: 7.86; MIN: 7.25 / MAX: 8.98
4090 rep: 7.31; MIN: 6.71 / MAX: 9.3
nv 4090: 9.11; MIN: 6.77 / MAX: 101.58
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
a: 8.18 (SE +/- 0.04, N = 3; Min: 8.12 / Avg: 8.18 / Max: 8.27); MIN: 8.07 / MAX: 9.68
b: 8.18; MIN: 8.12 / MAX: 8.86
c: 8.27; MIN: 8.22 / MAX: 9.18
d: 8.23 (SE +/- 0.06, N = 3; Min: 8.12 / Avg: 8.23 / Max: 8.31); MIN: 8.03 / MAX: 8.9
f: 8.08; MIN: 7.98 / MAX: 10.87
g: 8.30; MIN: 8.22 / MAX: 9.1
i: 9.94; MIN: 7.43 / MAX: 166.02
4080: 8.24; MIN: 7.89 / MAX: 9.52
4080 rep: 8.67; MIN: 8.22 / MAX: 15.29
4080 xxx: 8.45; MIN: 8.12 / MAX: 9.68
4080 zzz: 8.37; MIN: 8.05 / MAX: 10.19
3090: 8.25; MIN: 8.17 / MAX: 8.9
3090 rep: 8.24; MIN: 8.17 / MAX: 8.84
3070: 18.00; MIN: 7.91 / MAX: 176.28
RTX 3070 Ti: 8.83 (SE +/- 0.19, N = 15; Min: 8.06 / Avg: 8.83 / Max: 10.3); MIN: 7.65 / MAX: 351.08
4090: 8.13; MIN: 7.78 / MAX: 9.98
4090 rep: 9.66; MIN: 7.78 / MAX: 95.3
nv 4090: 10.17; MIN: 8.12 / MAX: 209.53
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: vision_transformer (ms, Fewer Is Better)
a: 32.49 (SE +/- 0.29, N = 3; Min: 31.91 / Avg: 32.49 / Max: 32.83); MIN: 31.67 / MAX: 40.11
b: 31.95; MIN: 31.79 / MAX: 32.33
c: 31.77; MIN: 31.61 / MAX: 35.68
d: 32.43 (SE +/- 0.39, N = 3; Min: 31.95 / Avg: 32.43 / Max: 33.2); MIN: 31.56 / MAX: 37.69
f: 33.56; MIN: 32.98 / MAX: 51.93
g: 32.73; MIN: 31.44 / MAX: 81.32
i: 37.80; MIN: 33.74 / MAX: 321.51
4080: 35.07; MIN: 33.14 / MAX: 43.26
4080 rep: 35.28; MIN: 33.9 / MAX: 38.67
4080 xxx: 34.27; MIN: 32.82 / MAX: 39.79
4080 zzz: 34.10; MIN: 32.65 / MAX: 37.64
3090: 33.01; MIN: 32.88 / MAX: 33.42
3090 rep: 31.80; MIN: 31.66 / MAX: 32.23
3070: 75.34; MIN: 38.72 / MAX: 418.01
RTX 3070 Ti: 37.91 (SE +/- 0.12, N = 15; Min: 36.65 / Avg: 37.91 / Max: 38.59); MIN: 32.08 / MAX: 541.11
4090: 38.25; MIN: 33.04 / MAX: 447.7
4090 rep: 37.81; MIN: 32.66 / MAX: 453.44
nv 4090: 38.90; MIN: 34.2 / MAX: 300.84
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU - Model: FastestDet (ms, Fewer Is Better)
a: 3.62 (SE +/- 0.45, N = 3; Min: 2.72 / Avg: 3.62 / Max: 4.08); MIN: 2.7 / MAX: 4.54
b: 4.05; MIN: 4.02 / MAX: 4.35
c: 4.11; MIN: 4.08 / MAX: 4.4
d: 4.08 (SE +/- 0.01, N = 3; Min: 4.07 / Avg: 4.08 / Max: 4.09); MIN: 4.02 / MAX: 4.28
f: 4.22; MIN: 4.18 / MAX: 4.97
g: 2.57; MIN: 2.53 / MAX: 3.21
i: 2.66; MIN: 2.54 / MAX: 3.41
4080: 4.42; MIN: 4.25 / MAX: 6.71
4080 rep: 4.34; MIN: 4.19 / MAX: 5.77
4080 xxx: 4.17; MIN: 4.05 / MAX: 4.74
4080 zzz: 4.16; MIN: 4 / MAX: 4.69
3090: 4.21; MIN: 4.19 / MAX: 4.41
3090 rep: 4.08; MIN: 4.05 / MAX: 4.84
3070: 8.65; MIN: 3.94 / MAX: 185.21
RTX 3070 Ti: 3.94 (SE +/- 0.23, N = 15; Min: 2.53 / Avg: 3.94 / Max: 5.6); MIN: 2.43 / MAX: 267.02
4090: 5.48; MIN: 2.67 / MAX: 259.34
4090 rep: 5.27; MIN: 4.05 / MAX: 247.02
nv 4090: 4.51; MIN: 4.34 / MAX: 5.96
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
a: 3.18 (SE +/- 0.00, N = 2; Min: 3.18 / Avg: 3.18 / Max: 3.18); MIN: 3.14 / MAX: 3.82
c: 3.17; MIN: 3.15 / MAX: 3.74
d: 3.17 (SE +/- 0.00, N = 3; Min: 3.17 / Avg: 3.17 / Max: 3.18); MIN: 3.12 / MAX: 3.96
g: 3.14; MIN: 3.1 / MAX: 3.81
i: 3.26; MIN: 3.14 / MAX: 3.9
4080: 3.24; MIN: 3.09 / MAX: 4.73
4080 rep: 3.26; MIN: 3.09 / MAX: 3.96
4080 xxx: 3.27; MIN: 3.13 / MAX: 3.85
4080 zzz: 3.24; MIN: 3.11 / MAX: 4.47
3090: 3.18; MIN: 3.14 / MAX: 4.14
3090 rep: 3.19; MIN: 3.15 / MAX: 3.72
3070: 7.52; MIN: 2.94 / MAX: 215
RTX 3070 Ti: 3.61 (SE +/- 0.21, N = 14; Min: 2.64 / Avg: 3.61 / Max: 5.27); MIN: 2.51 / MAX: 502.85
4090: 3.53; MIN: 3.39 / MAX: 4.31
4090 rep: 3.41; MIN: 3.27 / MAX: 5.24
nv 4090: 3.47; MIN: 3.32 / MAX: 4.91
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
a: 8.05 (SE +/- 0.02, N = 3; Min: 8 / Avg: 8.05 / Max: 8.08); MIN: 7.95 / MAX: 8.89
b: 8.04; MIN: 7.95 / MAX: 14.33
c: 8.03; MIN: 7.98 / MAX: 8.84
d: 8.02 (SE +/- 0.00, N = 3; Min: 8.02 / Avg: 8.02 / Max: 8.03); MIN: 7.95 / MAX: 9.81
e: 8.04 (SE +/- 0.02, N = 3; Min: 8.02 / Avg: 8.04 / Max: 8.07); MIN: 7.95 / MAX: 9.09
f: 8.27; MIN: 8.17 / MAX: 9.04
g: 8.17; MIN: 8.08 / MAX: 9.37
i: 10.40; MIN: 7.97 / MAX: 455.46
4080: 8.44; MIN: 7.98 / MAX: 10.55
4080 rep: 8.57; MIN: 7.98 / MAX: 10
4080 xxx: 8.37; MIN: 7.97 / MAX: 16.09
4080 zzz: 8.47; MIN: 8.04 / MAX: 10.17
3090: 8.07; MIN: 7.99 / MAX: 8.8
3090 rep: 8.03; MIN: 7.96 / MAX: 8.77
3070: 21.11; MIN: 7.98 / MAX: 322.43
RTX 3070 Ti: 9.35 (SE +/- 0.25, N = 15; Min: 7.83 / Avg: 9.35 / Max: 10.88); MIN: 7.49 / MAX: 474.12
4090: 10.55; MIN: 8.22 / MAX: 303.1
4090 rep: 10.23; MIN: 8.13 / MAX: 386.42
nv 4090: 8.45; MIN: 8.03 / MAX: 12.61
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
a: 3.17 (SE +/- 0.01, N = 3; Min: 3.14 / Avg: 3.17 / Max: 3.18); MIN: 3.09 / MAX: 3.78
b: 3.14; MIN: 3.1 / MAX: 3.73
c: 3.13; MIN: 3.08 / MAX: 3.85
d: 3.16 (SE +/- 0.01, N = 3; Min: 3.14 / Avg: 3.16 / Max: 3.18); MIN: 3.09 / MAX: 3.92
e: 3.14 (SE +/- 0.01, N = 3; Min: 3.13 / Avg: 3.14 / Max: 3.16); MIN: 3.08 / MAX: 4.06
f: 3.13; MIN: 3.07 / MAX: 3.82
g: 3.18; MIN: 3.13 / MAX: 3.9
i: 3.29; MIN: 3.12 / MAX: 3.93
4080: 3.26; MIN: 3.1 / MAX: 4.12
4080 rep: 3.29; MIN: 3.12 / MAX: 4.14
4080 xxx: 3.26; MIN: 3.1 / MAX: 3.87
4080 zzz: 3.29; MIN: 3.11 / MAX: 3.98
3090: 3.18; MIN: 3.14 / MAX: 3.63
3090 rep: 3.17; MIN: 3.12 / MAX: 3.64
3070: 7.81; MIN: 3.07 / MAX: 154.75
RTX 3070 Ti: 3.56 (SE +/- 0.14, N = 15; Min: 3.28 / Avg: 3.56 / Max: 4.97); MIN: 3.09 / MAX: 345.01
4090: 3.30; MIN: 3.12 / MAX: 4.82
4090 rep: 3.31; MIN: 3.14 / MAX: 4.92
nv 4090: 3.43; MIN: 3.25 / MAX: 4.81
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
OpenBenchmarking.org, ms, Fewer Is Better
a: 3.17 (SE +/- 0.02, N = 3; MIN: 3.11 / MAX: 3.73; Min: 3.15 / Avg: 3.17 / Max: 3.2)
c: 3.20 (MIN: 3.16 / MAX: 3.68)
d: 3.17 (SE +/- 0.00, N = 2; MIN: 3.1 / MAX: 3.83; Min: 3.16 / Avg: 3.17 / Max: 3.17)
e: 3.18 (SE +/- 0.02, N = 3; MIN: 3.11 / MAX: 3.78; Min: 3.15 / Avg: 3.18 / Max: 3.21)
f: 3.14 (MIN: 3.09 / MAX: 3.54)
i: 4.87 (MIN: 3.14 / MAX: 278.98)
4080 zzz: 3.28 (MIN: 3.13 / MAX: 4.65)
3090: 3.19 (MIN: 3.14 / MAX: 3.48)
3090 rep: 3.16 (MIN: 3.11 / MAX: 3.62)
3070: 6.60 (MIN: 2.98 / MAX: 166.19)
RTX 3070 Ti: 3.64 (SE +/- 0.20, N = 14; MIN: 2.87 / MAX: 429.02; Min: 3.05 / Avg: 3.64 / Max: 5.32)
4090: 3.28 (MIN: 3.15 / MAX: 3.9)
4090 rep: 3.30 (MIN: 3.15 / MAX: 3.92)
nv 4090: 3.36 (MIN: 3.21 / MAX: 4.3)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: shufflenet-v2
OpenBenchmarking.org, ms, Fewer Is Better
a: 3.35 (SE +/- 0.01, N = 3; MIN: 3.29 / MAX: 3.85; Min: 3.32 / Avg: 3.35 / Max: 3.36)
b: 3.33 (MIN: 3.3 / MAX: 3.59)
c: 3.32 (MIN: 3.29 / MAX: 4.19)
d: 3.35 (SE +/- 0.01, N = 3; MIN: 3.3 / MAX: 3.82; Min: 3.34 / Avg: 3.35 / Max: 3.37)
e: 3.33 (SE +/- 0.01, N = 3; MIN: 3.28 / MAX: 4.14; Min: 3.32 / Avg: 3.33 / Max: 3.34)
f: 3.40 (MIN: 3.35 / MAX: 5.89)
g: 3.35 (MIN: 3.3 / MAX: 4.02)
i: 3.49 (MIN: 3.35 / MAX: 4.24)
4080: 3.43 (MIN: 3.3 / MAX: 4.22)
4080 rep: 3.44 (MIN: 3.3 / MAX: 5.36)
4080 xxx: 3.50 (MIN: 3.37 / MAX: 4.85)
4080 zzz: 3.46 (MIN: 3.32 / MAX: 5.24)
3090: 3.39 (MIN: 3.35 / MAX: 3.69)
3090 rep: 3.36 (MIN: 3.32 / MAX: 4.06)
3070: 7.07 (MIN: 3.25 / MAX: 243.32)
RTX 3070 Ti: 3.98 (SE +/- 0.20, N = 15; MIN: 3.14 / MAX: 529.82; Min: 3.33 / Avg: 3.98 / Max: 5.62)
4090: 3.45 (MIN: 3.32 / MAX: 3.99)
4090 rep: 3.49 (MIN: 3.36 / MAX: 4.33)
nv 4090: 3.51 (MIN: 3.37 / MAX: 4)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: mnasnet
OpenBenchmarking.org, ms, Fewer Is Better
a: 2.98 (SE +/- 0.01, N = 3; MIN: 2.92 / MAX: 4.03; Min: 2.96 / Avg: 2.98 / Max: 3)
b: 2.95 (MIN: 2.92 / MAX: 3.42)
c: 2.96 (MIN: 2.93 / MAX: 3.41)
d: 2.97 (SE +/- 0.01, N = 3; MIN: 2.92 / MAX: 3.34; Min: 2.96 / Avg: 2.97 / Max: 2.98)
e: 2.96 (SE +/- 0.01, N = 3; MIN: 2.91 / MAX: 5.9; Min: 2.94 / Avg: 2.96 / Max: 2.97)
f: 2.97 (MIN: 2.93 / MAX: 3.66)
g: 2.98 (MIN: 2.94 / MAX: 3.65)
i: 3.20 (MIN: 3.07 / MAX: 3.86)
4080: 3.07 (MIN: 2.93 / MAX: 4.63)
4080 rep: 3.08 (MIN: 2.94 / MAX: 3.67)
4080 xxx: 3.07 (MIN: 2.95 / MAX: 4.19)
4080 zzz: 3.06 (MIN: 2.92 / MAX: 3.73)
3090: 2.99 (MIN: 2.96 / MAX: 3.14)
3090 rep: 2.97 (MIN: 2.94 / MAX: 3.28)
3070: 5.09 (MIN: 2.86 / MAX: 53.75)
RTX 3070 Ti: 3.40 (SE +/- 0.16, N = 15; MIN: 2.72 / MAX: 432.18; Min: 2.87 / Avg: 3.4 / Max: 4.78)
4090: 3.18 (MIN: 3.05 / MAX: 4.64)
4090 rep: 3.15 (MIN: 3 / MAX: 4.54)
nv 4090: 4.70 (MIN: 3 / MAX: 188.08)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: efficientnet-b0
OpenBenchmarking.org, ms, Fewer Is Better
a: 3.86 (SE +/- 0.01, N = 3; MIN: 3.8 / MAX: 4.6; Min: 3.84 / Avg: 3.86 / Max: 3.88)
b: 3.82 (MIN: 3.78 / MAX: 4.39)
c: 3.83 (MIN: 3.79 / MAX: 4.61)
d: 3.87 (SE +/- 0.02, N = 3; MIN: 3.77 / MAX: 9.91; Min: 3.82 / Avg: 3.87 / Max: 3.9)
e: 3.84 (SE +/- 0.01, N = 3; MIN: 3.79 / MAX: 4.76; Min: 3.83 / Avg: 3.84 / Max: 3.85)
f: 3.86 (MIN: 3.78 / MAX: 10.45)
g: 3.86 (MIN: 3.82 / MAX: 4.22)
i: 5.88 (MIN: 4.04 / MAX: 364.21)
4080: 4.01 (MIN: 3.78 / MAX: 5.34)
4080 rep: 4.05 (MIN: 3.83 / MAX: 6.11)
4080 xxx: 4.04 (MIN: 3.82 / MAX: 5.33)
4080 zzz: 4.04 (MIN: 3.8 / MAX: 5.31)
3090: 3.88 (MIN: 3.84 / MAX: 4.39)
3090 rep: 3.86 (MIN: 3.82 / MAX: 4.34)
3070: 8.99 (MIN: 3.71 / MAX: 129.99)
RTX 3070 Ti: 4.53 (SE +/- 0.18, N = 15; MIN: 3.75 / MAX: 396.62; Min: 3.99 / Avg: 4.53 / Max: 5.91)
4090: 4.34 (MIN: 4.14 / MAX: 5.84)
4090 rep: 4.09 (MIN: 3.87 / MAX: 5.46)
nv 4090: 4.10 (MIN: 3.86 / MAX: 5.46)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: blazeface
OpenBenchmarking.org, ms, Fewer Is Better
a: 1.38 (SE +/- 0.01, N = 3; MIN: 1.34 / MAX: 1.85; Min: 1.36 / Avg: 1.38 / Max: 1.39)
b: 1.37 (MIN: 1.35 / MAX: 1.75)
c: 1.37 (MIN: 1.35 / MAX: 1.82)
d: 1.38 (SE +/- 0.01, N = 3; MIN: 1.34 / MAX: 2.25; Min: 1.37 / Avg: 1.38 / Max: 1.39)
e: 1.38 (SE +/- 0.01, N = 3; MIN: 1.34 / MAX: 1.88; Min: 1.37 / Avg: 1.38 / Max: 1.39)
f: 1.38 (MIN: 1.35 / MAX: 2.08)
g: 1.41 (MIN: 1.38 / MAX: 2.09)
i: 1.40 (MIN: 1.33 / MAX: 2)
4080: 1.41 (MIN: 1.35 / MAX: 2.01)
4080 rep: 1.41 (MIN: 1.35 / MAX: 1.9)
4080 xxx: 1.42 (MIN: 1.36 / MAX: 1.93)
4080 zzz: 1.42 (MIN: 1.36 / MAX: 2.01)
3090: 1.39 (MIN: 1.37 / MAX: 1.82)
3090 rep: 1.37 (MIN: 1.36 / MAX: 1.46)
3070: 3.98 (MIN: 1.31 / MAX: 228.4)
RTX 3070 Ti: 1.60 (SE +/- 0.16, N = 15; MIN: 1.11 / MAX: 436.01; Min: 1.17 / Avg: 1.6 / Max: 3.12)
4090: 1.39 (MIN: 1.33 / MAX: 1.94)
4090 rep: 1.40 (MIN: 1.33 / MAX: 1.93)
nv 4090: 1.40 (MIN: 1.34 / MAX: 1.87)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: googlenet
OpenBenchmarking.org, ms, Fewer Is Better
a: 7.90 (SE +/- 0.02, N = 3; MIN: 7.74 / MAX: 9.54; Min: 7.85 / Avg: 7.9 / Max: 7.92)
b: 7.85 (MIN: 7.76 / MAX: 8.76)
c: 7.80 (MIN: 7.72 / MAX: 8.74)
d: 7.85 (SE +/- 0.04, N = 3; MIN: 7.71 / MAX: 8.85; Min: 7.81 / Avg: 7.85 / Max: 7.92)
e: 7.85 (SE +/- 0.01, N = 3; MIN: 7.71 / MAX: 8.76; Min: 7.82 / Avg: 7.85 / Max: 7.87)
f: 8.15 (MIN: 8.02 / MAX: 9.02)
g: 8.96 (MIN: 8.82 / MAX: 9.87)
i: 8.75 (MIN: 8.08 / MAX: 16.01)
4080: 8.42 (MIN: 7.79 / MAX: 10.01)
4080 rep: 8.40 (MIN: 7.77 / MAX: 9.78)
4080 xxx: 8.42 (MIN: 7.73 / MAX: 10.06)
4080 zzz: 8.41 (MIN: 7.72 / MAX: 9.9)
3090: 7.87 (MIN: 7.76 / MAX: 10.36)
3090 rep: 7.82 (MIN: 7.69 / MAX: 8.61)
3070: 19.49 (MIN: 7.4 / MAX: 200.01)
RTX 3070 Ti: 9.87 (SE +/- 0.22, N = 15; MIN: 7.33 / MAX: 399.24; Min: 8.43 / Avg: 9.87 / Max: 11.2)
4090: 10.62 (MIN: 7.83 / MAX: 323.31)
4090 rep: 10.65 (MIN: 8.29 / MAX: 236.11)
nv 4090: 8.70 (MIN: 7.96 / MAX: 10.01)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: vgg16
OpenBenchmarking.org, ms, Fewer Is Better
a: 23.51 (SE +/- 0.04, N = 3; MIN: 23.29 / MAX: 24.68; Min: 23.44 / Avg: 23.51 / Max: 23.59)
b: 23.56 (MIN: 23.34 / MAX: 24.72)
c: 23.54 (MIN: 23.33 / MAX: 24.61)
d: 23.56 (SE +/- 0.10, N = 3; MIN: 23.24 / MAX: 24.78; Min: 23.43 / Avg: 23.56 / Max: 23.76)
e: 23.60 (SE +/- 0.14, N = 3; MIN: 23.17 / MAX: 24.71; Min: 23.38 / Avg: 23.6 / Max: 23.86)
f: 24.55 (MIN: 23.62 / MAX: 97.69)
g: 24.20 (MIN: 23.56 / MAX: 58.31)
i: 27.83 (MIN: 24.98 / MAX: 262.23)
4080: 25.00 (MIN: 23.93 / MAX: 26.69)
4080 rep: 25.04 (MIN: 24.06 / MAX: 27.35)
4080 xxx: 25.01 (MIN: 23.8 / MAX: 26.41)
4080 zzz: 25.82 (MIN: 24.35 / MAX: 62.94)
3090: 23.55 (MIN: 23.3 / MAX: 24.45)
3090 rep: 23.48 (MIN: 23.24 / MAX: 29.21)
3070: 48.29 (MIN: 24.97 / MAX: 183.12)
RTX 3070 Ti: 29.06 (SE +/- 0.24, N = 15; MIN: 24.11 / MAX: 541.55; Min: 27.77 / Avg: 29.06 / Max: 30.57)
4090: 28.82 (MIN: 24.35 / MAX: 214.1)
4090 rep: 27.04 (MIN: 24.22 / MAX: 296.13)
nv 4090: 29.29 (MIN: 24.63 / MAX: 296.95)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: resnet18
OpenBenchmarking.org, ms, Fewer Is Better
a: 5.28 (SE +/- 0.01, N = 3; MIN: 5.17 / MAX: 6.16; Min: 5.26 / Avg: 5.28 / Max: 5.29)
b: 5.23 (MIN: 5.13 / MAX: 6.18)
c: 5.21 (MIN: 5.11 / MAX: 6.04)
d: 5.23 (SE +/- 0.03, N = 3; MIN: 5.08 / MAX: 6.28; Min: 5.2 / Avg: 5.23 / Max: 5.29)
e: 5.22 (SE +/- 0.01, N = 3; MIN: 5.09 / MAX: 11.15; Min: 5.2 / Avg: 5.22 / Max: 5.24)
f: 5.48 (MIN: 5.33 / MAX: 6.16)
g: 6.22 (MIN: 6.11 / MAX: 7)
i: 5.82 (MIN: 5.28 / MAX: 7.02)
4080: 5.67 (MIN: 5.18 / MAX: 7.22)
4080 rep: 5.61 (MIN: 5.11 / MAX: 7.44)
4080 xxx: 5.62 (MIN: 5.1 / MAX: 7.65)
4080 zzz: 5.59 (MIN: 5.06 / MAX: 6.95)
3090: 5.27 (MIN: 5.15 / MAX: 6.19)
3090 rep: 5.20 (MIN: 5.09 / MAX: 5.98)
3070: 12.68 (MIN: 5.39 / MAX: 262.62)
RTX 3070 Ti: 6.28 (SE +/- 0.20, N = 15; MIN: 4.94 / MAX: 298.06; Min: 5.38 / Avg: 6.28 / Max: 8.08)
4090: 5.69 (MIN: 5.16 / MAX: 8.22)
4090 rep: 6.01 (MIN: 5.44 / MAX: 8.18)
nv 4090: 7.44 (MIN: 5.29 / MAX: 320.54)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: alexnet
OpenBenchmarking.org, ms, Fewer Is Better
a: 4.31 (SE +/- 0.01, N = 3; MIN: 4.24 / MAX: 5.2; Min: 4.29 / Avg: 4.31 / Max: 4.32)
b: 4.33 (MIN: 4.28 / MAX: 5.16)
c: 4.33 (MIN: 4.26 / MAX: 10.59)
d: 4.31 (SE +/- 0.01, N = 3; MIN: 4.25 / MAX: 5.28; Min: 4.3 / Avg: 4.31 / Max: 4.33)
e: 4.31 (SE +/- 0.00, N = 3; MIN: 4.23 / MAX: 11.03; Min: 4.31 / Avg: 4.31 / Max: 4.31)
f: 4.64 (MIN: 4.57 / MAX: 5.49)
g: 4.87 (MIN: 4.8 / MAX: 5.62)
i: 6.53 (MIN: 4.57 / MAX: 242.16)
4080: 4.62 (MIN: 4.26 / MAX: 6.15)
4080 rep: 4.65 (MIN: 4.26 / MAX: 6.53)
4080 xxx: 4.68 (MIN: 4.26 / MAX: 6.61)
4080 zzz: 4.68 (MIN: 4.26 / MAX: 6.23)
3090: 4.35 (MIN: 4.28 / MAX: 7.49)
3090 rep: 4.31 (MIN: 4.26 / MAX: 5.26)
3070: 10.88 (MIN: 4.38 / MAX: 52.99)
RTX 3070 Ti: 5.25 (SE +/- 0.18, N = 15; MIN: 4.23 / MAX: 375.94; Min: 4.63 / Avg: 5.25 / Max: 6.61)
4090: 4.64 (MIN: 4.26 / MAX: 5.98)
4090 rep: 6.79 (MIN: 4.23 / MAX: 262.43)
nv 4090: 5.20 (MIN: 4.82 / MAX: 7.07)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: resnet50
OpenBenchmarking.org, ms, Fewer Is Better
a: 10.01 (SE +/- 0.02, N = 3; MIN: 9.88 / MAX: 11.4; Min: 9.98 / Avg: 10.01 / Max: 10.06)
b: 10.00 (MIN: 9.92 / MAX: 12.35)
c: 10.00 (MIN: 9.91 / MAX: 11.15)
d: 10.10 (SE +/- 0.09, N = 3; MIN: 9.86 / MAX: 11.08; Min: 9.98 / Avg: 10.1 / Max: 10.27)
e: 10.10 (SE +/- 0.12, N = 3; MIN: 9.84 / MAX: 11.72; Min: 9.98 / Avg: 10.1 / Max: 10.33)
f: 10.26 (MIN: 10.09 / MAX: 11.22)
g: 10.34 (MIN: 10.14 / MAX: 11.37)
i: 12.96 (MIN: 10.23 / MAX: 424.46)
4080: 10.81 (MIN: 9.95 / MAX: 12.78)
4080 rep: 10.84 (MIN: 9.93 / MAX: 12.81)
4080 xxx: 10.94 (MIN: 9.95 / MAX: 12.7)
4080 zzz: 11.07 (MIN: 10.1 / MAX: 13.23)
3090: 10.10 (MIN: 9.97 / MAX: 11.42)
3090 rep: 10.06 (MIN: 9.95 / MAX: 11.04)
3070: 23.48 (MIN: 10.06 / MAX: 112.91)
RTX 3070 Ti: 12.60 (SE +/- 0.26, N = 15; MIN: 9.82 / MAX: 418.4; Min: 11.1 / Avg: 12.6 / Max: 14.19)
4090: 14.13 (MIN: 10.63 / MAX: 167.28)
4090 rep: 13.08 (MIN: 10.11 / MAX: 444.45)
nv 4090: 12.45 (MIN: 11.55 / MAX: 14.48)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: yolov4-tiny
OpenBenchmarking.org, ms, Fewer Is Better
a: 12.84 (SE +/- 0.02, N = 3; MIN: 12.69 / MAX: 15.33; Min: 12.81 / Avg: 12.84 / Max: 12.88)
b: 12.87 (MIN: 12.76 / MAX: 13.73)
c: 12.81 (MIN: 12.73 / MAX: 13.08)
d: 12.85 (SE +/- 0.01, N = 3; MIN: 12.72 / MAX: 13.93; Min: 12.83 / Avg: 12.85 / Max: 12.87)
e: 12.87 (SE +/- 0.04, N = 3; MIN: 12.68 / MAX: 13.84; Min: 12.79 / Avg: 12.87 / Max: 12.92)
f: 13.17 (MIN: 13.03 / MAX: 14.1)
g: 13.64 (MIN: 13.04 / MAX: 76.32)
i: 15.16 (MIN: 12.86 / MAX: 248.64)
4080: 13.79 (MIN: 12.75 / MAX: 19.63)
4080 rep: 13.67 (MIN: 12.71 / MAX: 14.88)
4080 xxx: 13.65 (MIN: 12.71 / MAX: 14.99)
4080 zzz: 13.80 (MIN: 12.76 / MAX: 15.76)
3090: 12.88 (MIN: 12.76 / MAX: 13.67)
3090 rep: 12.83 (MIN: 12.74 / MAX: 13.59)
3070: 28.59 (MIN: 12.87 / MAX: 325.37)
RTX 3070 Ti: 15.54 (SE +/- 0.18, N = 15; MIN: 12.15 / MAX: 492.01; Min: 14.26 / Avg: 15.54 / Max: 17.01)
4090: 13.97 (MIN: 13.11 / MAX: 16.15)
4090 rep: 15.72 (MIN: 13.2 / MAX: 301.81)
nv 4090: 15.26 (MIN: 12.87 / MAX: 132.82)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: squeezenet_ssd
OpenBenchmarking.org, ms, Fewer Is Better
a: 7.09 (SE +/- 0.02, N = 3; MIN: 6.98 / MAX: 7.95; Min: 7.04 / Avg: 7.09 / Max: 7.12)
b: 7.07 (MIN: 7 / MAX: 8.07)
c: 7.06 (MIN: 7 / MAX: 8.03)
d: 7.08 (SE +/- 0.02, N = 3; MIN: 6.97 / MAX: 7.99; Min: 7.04 / Avg: 7.08 / Max: 7.12)
e: 7.05 (SE +/- 0.02, N = 3; MIN: 6.95 / MAX: 8; Min: 7.02 / Avg: 7.05 / Max: 7.07)
f: 7.08 (MIN: 6.98 / MAX: 8.07)
g: 7.10 (MIN: 6.99 / MAX: 8.59)
i: 7.46 (MIN: 6.9 / MAX: 8.9)
4080: 7.58 (MIN: 6.98 / MAX: 9.05)
4080 rep: 7.64 (MIN: 7.05 / MAX: 9.12)
4080 xxx: 7.62 (MIN: 7.01 / MAX: 9.28)
4080 zzz: 7.63 (MIN: 7 / MAX: 9.17)
3090: 7.16 (MIN: 7.05 / MAX: 13.55)
3090 rep: 7.06 (MIN: 7 / MAX: 7.82)
3070: 15.82 (MIN: 6.99 / MAX: 82.57)
RTX 3070 Ti: 8.47 (SE +/- 0.26, N = 15; MIN: 6.29 / MAX: 533.92; Min: 6.97 / Avg: 8.47 / Max: 9.78)
4090: 7.83 (MIN: 7.21 / MAX: 9.32)
4090 rep: 8.22 (MIN: 7.56 / MAX: 9.8)
nv 4090: 9.11 (MIN: 6.35 / MAX: 130.38)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: regnety_400m
OpenBenchmarking.org, ms, Fewer Is Better
a: 8.16 (SE +/- 0.11, N = 3; MIN: 7.9 / MAX: 8.99; Min: 7.96 / Avg: 8.16 / Max: 8.33)
b: 8.21 (MIN: 8.14 / MAX: 8.84)
c: 8.00 (MIN: 7.94 / MAX: 8.88)
d: 8.17 (SE +/- 0.06, N = 3; MIN: 7.99 / MAX: 8.97; Min: 8.06 / Avg: 8.17 / Max: 8.25)
e: 8.10 (SE +/- 0.02, N = 3; MIN: 7.98 / MAX: 8.84; Min: 8.05 / Avg: 8.1 / Max: 8.13)
f: 8.34 (MIN: 7.99 / MAX: 26.72)
g: 8.36 (MIN: 8.27 / MAX: 9.08)
i: 9.88 (MIN: 8.14 / MAX: 251.77)
4080: 8.39 (MIN: 8 / MAX: 10.29)
4080 rep: 8.56 (MIN: 8.17 / MAX: 10.28)
4080 xxx: 8.56 (MIN: 8.15 / MAX: 9.8)
4080 zzz: 8.58 (MIN: 8.13 / MAX: 9.78)
3090: 8.38 (MIN: 8.31 / MAX: 8.86)
3090 rep: 8.02 (MIN: 7.95 / MAX: 8.63)
3070: 16.22 (MIN: 7.74 / MAX: 314.84)
RTX 3070 Ti: 9.07 (SE +/- 0.21, N = 15; MIN: 7.61 / MAX: 402.49; Min: 8.01 / Avg: 9.07 / Max: 10.26)
4090: 8.64 (MIN: 8.28 / MAX: 10.42)
4090 rep: 8.48 (MIN: 8.09 / MAX: 9.64)
nv 4090: 9.81 (MIN: 7.82 / MAX: 241.19)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: vision_transformer
OpenBenchmarking.org, ms, Fewer Is Better
a: 31.88 (SE +/- 0.09, N = 3; MIN: 31.55 / MAX: 37.47; Min: 31.78 / Avg: 31.88 / Max: 32.05)
b: 31.85 (MIN: 31.69 / MAX: 33.06)
c: 31.79 (MIN: 31.63 / MAX: 35.57)
d: 32.12 (SE +/- 0.21, N = 3; MIN: 31.66 / MAX: 46.9; Min: 31.9 / Avg: 32.12 / Max: 32.54)
e: 31.93 (SE +/- 0.07, N = 3; MIN: 31.62 / MAX: 35.85; Min: 31.85 / Avg: 31.93 / Max: 32.08)
f: 32.92 (MIN: 32.67 / MAX: 36.93)
g: 32.42 (MIN: 31.89 / MAX: 65.47)
i: 36.42 (MIN: 33.49 / MAX: 224.86)
4080: 35.56 (MIN: 33.19 / MAX: 40.43)
4080 rep: 35.07 (MIN: 33.66 / MAX: 39.36)
4080 xxx: 34.19 (MIN: 32.72 / MAX: 36.79)
4080 zzz: 34.32 (MIN: 32.58 / MAX: 41.88)
3090: 31.94 (MIN: 31.73 / MAX: 34.21)
3090 rep: 31.91 (MIN: 31.74 / MAX: 34.28)
3070: 70.76 (MIN: 38.81 / MAX: 250.01)
RTX 3070 Ti: 38.03 (SE +/- 0.16, N = 15; MIN: 32.66 / MAX: 467.28; Min: 36.85 / Avg: 38.03 / Max: 39.1)
4090: 38.76 (MIN: 33.12 / MAX: 539.58)
4090 rep: 37.59 (MIN: 34.45 / MAX: 457.98)
nv 4090: 39.04 (MIN: 33.83 / MAX: 463.88)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU - Model: FastestDet
OpenBenchmarking.org, ms, Fewer Is Better
a: 4.10 (SE +/- 0.00, N = 3; MIN: 4.06 / MAX: 4.81; Min: 4.1 / Avg: 4.1 / Max: 4.1)
b: 4.07 (MIN: 4.04 / MAX: 4.53)
c: 4.09 (MIN: 4.05 / MAX: 5.5)
d: 4.11 (SE +/- 0.02, N = 3; MIN: 4.01 / MAX: 9.72; Min: 4.08 / Avg: 4.11 / Max: 4.15)
e: 4.08 (SE +/- 0.00, N = 3; MIN: 4.03 / MAX: 5.29; Min: 4.07 / Avg: 4.08 / Max: 4.08)
f: 4.24 (MIN: 3.88 / MAX: 24.21)
g: 4.07 (MIN: 4.02 / MAX: 4.82)
i: 5.14 (MIN: 3.7 / MAX: 81.79)
4080: 4.20 (MIN: 4.02 / MAX: 4.97)
4080 rep: 4.21 (MIN: 4.04 / MAX: 4.97)
4080 xxx: 4.20 (MIN: 4.03 / MAX: 6.49)
4080 zzz: 4.79 (MIN: 4.64 / MAX: 6.21)
3090: 4.11 (MIN: 4.07 / MAX: 4.29)
3090 rep: 4.08 (MIN: 4.04 / MAX: 4.35)
3070: 8.41 (MIN: 2.89 / MAX: 487.78)
RTX 3070 Ti: 4.26 (SE +/- 0.29, N = 15; MIN: 2.5 / MAX: 396.93; Min: 2.6 / Avg: 4.26 / Max: 6.27)
4090: 4.39 (MIN: 4.25 / MAX: 5.86)
4090 rep: 4.59 (MIN: 2.62 / MAX: 232.18)
nv 4090: 3.93 (MIN: 3.8 / MAX: 5.4)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: mobilenet
OpenBenchmarking.org, ms, Fewer Is Better
b: 8.01 (MIN: 7.95 / MAX: 8.95)
c: 8.00 (MIN: 7.96 / MAX: 8.63)
f: 8.56 (MIN: 8.04 / MAX: 75.44)
g: 8.98 (MIN: 8.1 / MAX: 124.43)
i: 10.08 (MIN: 8.08 / MAX: 286.28)
4080: 8.43 (MIN: 7.99 / MAX: 10.44)
4080 rep: 8.48 (MIN: 7.96 / MAX: 10.32)
4080 xxx: 8.44 (MIN: 7.97 / MAX: 10.71)
4080 zzz: 8.40 (MIN: 8.12 / MAX: 10.11)
3090: 8.06 (MIN: 7.94 / MAX: 13.92)
3090 rep: 8.03 (MIN: 7.98 / MAX: 8.77)
3070: 17.82 (MIN: 7.57 / MAX: 211.62)
RTX 3070 Ti: 9.62 (SE +/- 0.23, N = 15; MIN: 7.76 / MAX: 454.91; Min: 8.34 / Avg: 9.62 / Max: 11.08)
4090: 8.96 (MIN: 8.39 / MAX: 10.77)
4090 rep: 8.74 (MIN: 8.25 / MAX: 10.5)
nv 4090: 8.91 (MIN: 8.33 / MAX: 10.07)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2
OpenBenchmarking.org, ms, Fewer Is Better
b: 3.15 (MIN: 3.1 / MAX: 3.68)
c: 3.14 (MIN: 3.1 / MAX: 3.67)
f: 3.16 (MIN: 3.09 / MAX: 3.89)
g: 3.15 (MIN: 3.1 / MAX: 3.63)
i: 3.30 (MIN: 3.14 / MAX: 4.82)
4080: 3.31 (MIN: 3.12 / MAX: 4.76)
4080 rep: 3.27 (MIN: 3.1 / MAX: 4.34)
4080 xxx: 3.30 (MIN: 3.12 / MAX: 4.03)
4080 zzz: 3.28 (MIN: 3.1 / MAX: 4)
3090: 3.14 (MIN: 3.08 / MAX: 3.7)
3090 rep: 3.17 (MIN: 3.11 / MAX: 4.5)
3070: 9.19 (MIN: 3.04 / MAX: 232.12)
RTX 3070 Ti: 3.41 (SE +/- 0.10, N = 15; MIN: 2.99 / MAX: 184.91; Min: 3.18 / Avg: 3.41 / Max: 4.83)
4090: 3.46 (MIN: 3.29 / MAX: 4.38)
4090 rep: 3.45 (MIN: 3.23 / MAX: 4.55)
nv 4090: 3.39 (MIN: 3.21 / MAX: 4.24)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3
OpenBenchmarking.org, ms, Fewer Is Better
b: 3.16 (MIN: 3.12 / MAX: 3.69)
c: 3.16 (MIN: 3.12 / MAX: 3.7)
f: 3.15 (MIN: 3.11 / MAX: 3.48)
g: 3.16 (MIN: 3.11 / MAX: 3.93)
i: 3.29 (MIN: 3.15 / MAX: 4.32)
4080: 3.28 (MIN: 3.14 / MAX: 3.89)
4080 xxx: 3.31 (MIN: 3.16 / MAX: 5.3)
4080 zzz: 3.27 (MIN: 3.14 / MAX: 4.63)
3090: 3.15 (MIN: 3.11 / MAX: 3.71)
3090 rep: 3.15 (MIN: 3.11 / MAX: 3.6)
3070: 5.38 (MIN: 2.74 / MAX: 121.29)
RTX 3070 Ti: 3.76 (SE +/- 0.22, N = 14; MIN: 2.89 / MAX: 366.04; Min: 3.03 / Avg: 3.76 / Max: 5.49)
4090: 3.25 (MIN: 3.11 / MAX: 4.74)
4090 rep: 3.53 (MIN: 3.2 / MAX: 40.81)
nv 4090: 3.35 (MIN: 3.21 / MAX: 5.23)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: shufflenet-v2
OpenBenchmarking.org, ms, Fewer Is Better
b: 3.33 (MIN: 3.3 / MAX: 3.79)
c: 3.34 (MIN: 3.32 / MAX: 3.79)
f: 3.40 (MIN: 3.35 / MAX: 4.17)
g: 3.38 (MIN: 3.34 / MAX: 4.15)
i: 3.52 (MIN: 3.39 / MAX: 4.05)
4080: 3.46 (MIN: 3.34 / MAX: 3.93)
4080 rep: 3.43 (MIN: 3.3 / MAX: 4.03)
4080 xxx: 3.47 (MIN: 3.33 / MAX: 5.01)
4080 zzz: 3.43 (MIN: 3.31 / MAX: 3.94)
3090: 3.32 (MIN: 3.28 / MAX: 3.66)
3090 rep: 3.33 (MIN: 3.3 / MAX: 3.67)
3070: 8.13 (MIN: 3.09 / MAX: 147.21)
RTX 3070 Ti: 4.09 (SE +/- 0.21, N = 15; MIN: 3.12 / MAX: 435.28; Min: 3.34 / Avg: 4.09 / Max: 5.22)
4090: 5.18 (MIN: 3.34 / MAX: 283.54)
4090 rep: 3.59 (MIN: 3.46 / MAX: 4.09)
nv 4090: 3.46 (MIN: 3.32 / MAX: 5.2)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: mnasnet
OpenBenchmarking.org, ms, Fewer Is Better
b: 2.97 (MIN: 2.94 / MAX: 3.43)
c: 2.97 (MIN: 2.94 / MAX: 3.43)
f: 3.12 (MIN: 3.08 / MAX: 3.86)
g: 3.05 (MIN: 3.01 / MAX: 3.88)
i: 3.39 (MIN: 3.26 / MAX: 4.86)
4080: 3.08 (MIN: 2.94 / MAX: 4.52)
4080 rep: 3.09 (MIN: 2.95 / MAX: 4.52)
4080 xxx: 3.07 (MIN: 2.94 / MAX: 3.6)
4080 zzz: 3.06 (MIN: 2.93 / MAX: 3.64)
3090: 2.95 (MIN: 2.92 / MAX: 3.29)
3090 rep: 2.96 (MIN: 2.94 / MAX: 3.38)
3070: 6.87 (MIN: 2.93 / MAX: 216.41)
RTX 3070 Ti: 3.11 (SE +/- 0.04, N = 15; MIN: 2.8 / MAX: 4.98; Min: 2.92 / Avg: 3.11 / Max: 3.57)
4090: 3.19 (MIN: 3.06 / MAX: 3.75)
4090 rep: 3.23 (MIN: 3.1 / MAX: 3.75)
nv 4090: 4.61 (MIN: 2.78 / MAX: 222.99)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: efficientnet-b0
OpenBenchmarking.org, ms, Fewer Is Better
b: 3.82 (MIN: 3.79 / MAX: 4.34)
c: 3.89 (MIN: 3.83 / MAX: 9.72)
f: 4.04 (MIN: 3.99 / MAX: 4.82)
g: 4.14 (MIN: 4.09 / MAX: 5.13)
i: 4.68 (MIN: 4.48 / MAX: 6.02)
4080: 4.02 (MIN: 3.82 / MAX: 5.66)
4080 rep: 4.02 (MIN: 3.82 / MAX: 5.39)
4080 xxx: 4.06 (MIN: 3.83 / MAX: 5.55)
4080 zzz: 4.03 (MIN: 3.82 / MAX: 5.43)
3090: 3.83 (MIN: 3.78 / MAX: 4.41)
3090 rep: 3.85 (MIN: 3.81 / MAX: 4.53)
3070: 9.01 (MIN: 3.98 / MAX: 188.57)
RTX 3070 Ti: 4.78 (SE +/- 0.22, N = 15; MIN: 3.82 / MAX: 411.19; Min: 4.03 / Avg: 4.78 / Max: 6.18)
4090: 4.09 (MIN: 3.86 / MAX: 4.83)
4090 rep: 4.34 (MIN: 4.16 / MAX: 5.28)
nv 4090: 4.04 (MIN: 3.78 / MAX: 4.9)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: blazeface
OpenBenchmarking.org, ms, Fewer Is Better
b: 1.37 (MIN: 1.35 / MAX: 1.52)
c: 1.38 (MIN: 1.36 / MAX: 1.58)
f: 1.43 (MIN: 1.4 / MAX: 1.77)
g: 1.37 (MIN: 1.34 / MAX: 2.07)
i: 1.28 (MIN: 1.23 / MAX: 1.73)
4080: 1.44 (MIN: 1.37 / MAX: 3.45)
4080 rep: 1.42 (MIN: 1.36 / MAX: 2.2)
4080 xxx: 1.42 (MIN: 1.36 / MAX: 1.92)
4080 zzz: 1.41 (MIN: 1.34 / MAX: 1.91)
3090: 1.36 (MIN: 1.34 / MAX: 1.46)
3090 rep: 1.37 (MIN: 1.35 / MAX: 1.46)
3070: 3.03 (MIN: 1.28 / MAX: 96.94)
RTX 3070 Ti: 1.79 (SE +/- 0.19, N = 15; MIN: 1.13 / MAX: 312.12; Min: 1.18 / Avg: 1.79 / Max: 3.11)
4090: 1.17 (MIN: 1.11 / MAX: 1.9)
4090 rep: 1.34 (MIN: 1.27 / MAX: 1.95)
nv 4090: 1.16 (MIN: 1.11 / MAX: 1.67)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: googlenet
OpenBenchmarking.org, ms, Fewer Is Better
b: 7.84 (MIN: 7.74 / MAX: 8.7)
c: 7.88 (MIN: 7.79 / MAX: 8.78)
f: 8.07 (MIN: 7.92 / MAX: 8.86)
g: 9.15 (MIN: 7.84 / MAX: 198.46)
i: 10.17 (MIN: 7.94 / MAX: 150.01)
4080: 8.45 (MIN: 7.79 / MAX: 10.32)
4080 rep: 8.52 (MIN: 7.81 / MAX: 10.78)
4080 xxx: 8.50 (MIN: 7.79 / MAX: 9.94)
4080 zzz: 8.55 (MIN: 7.85 / MAX: 10.35)
3090: 7.83 (MIN: 7.71 / MAX: 8.8)
3090 rep: 7.85 (MIN: 7.75 / MAX: 8.69)
3070: 17.00 (MIN: 7.35 / MAX: 277.79)
RTX 3070 Ti: 9.65 (SE +/- 0.24, N = 15; MIN: 7.59 / MAX: 472.81; Min: 8.42 / Avg: 9.65 / Max: 11.19)
4090: 9.97 (MIN: 7.67 / MAX: 258.52)
4090 rep: 9.29 (MIN: 7.98 / MAX: 83.03)
nv 4090: 9.02 (MIN: 8.41 / MAX: 11.08)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: vgg16
OpenBenchmarking.org, ms, Fewer Is Better
b: 23.50 (MIN: 23.3 / MAX: 24.41)
c: 23.99 (MIN: 23.72 / MAX: 24.98)
f: 24.45 (MIN: 24.26 / MAX: 25.26)
g: 24.92 (MIN: 24.58 / MAX: 31.89)
i: 29.12 (MIN: 26.33 / MAX: 310.23)
4080: 25.10 (MIN: 24.12 / MAX: 27.57)
4080 rep: 24.91 (MIN: 23.8 / MAX: 26.87)
4080 xxx: 25.00 (MIN: 23.91 / MAX: 27.99)
4080 zzz: 25.45 (MIN: 24.22 / MAX: 27.73)
3090: 23.50 (MIN: 23.17 / MAX: 24.44)
3090 rep: 23.54 (MIN: 23.33 / MAX: 24.41)
3070: 49.75 (MIN: 25.45 / MAX: 273.86)
RTX 3070 Ti: 28.40 (SE +/- 0.24, N = 15; MIN: 24.12 / MAX: 509.06; Min: 27.41 / Avg: 28.4 / Max: 30.13)
4090: 28.55 (MIN: 24.05 / MAX: 201.8)
4090 rep: 27.25 (MIN: 24.14 / MAX: 379.93)
nv 4090: 27.04 (MIN: 24.33 / MAX: 215.56)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: resnet18
OpenBenchmarking.org, ms, Fewer Is Better
b: 5.21 (MIN: 5.12 / MAX: 6.22)
c: 5.26 (MIN: 5.18 / MAX: 6.27)
f: 6.13 (MIN: 5.41 / MAX: 151.51)
g: 5.48 (MIN: 5.37 / MAX: 6.51)
i: 5.86 (MIN: 5.35 / MAX: 7.79)
4080: 5.70 (MIN: 5.15 / MAX: 7.9)
4080 rep: 5.63 (MIN: 5.09 / MAX: 7.75)
4080 xxx: 5.66 (MIN: 5.14 / MAX: 7.49)
4080 zzz: 5.71 (MIN: 5.12 / MAX: 8.19)
3090: 5.19 (MIN: 5.09 / MAX: 6.13)
3090 rep: 5.20 (MIN: 5.1 / MAX: 5.97)
3070: 11.14 (MIN: 4.79 / MAX: 65.12)
RTX 3070 Ti: 6.18 (SE +/- 0.19, N = 15; MIN: 5.17 / MAX: 262.79; Min: 5.67 / Avg: 6.18 / Max: 8.08)
4090: 5.81 (MIN: 5.27 / MAX: 7.16)
4090 rep: 7.75 (MIN: 5.57 / MAX: 125.43)
nv 4090: 7.61 (MIN: 5.23 / MAX: 90.18)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: alexnet
OpenBenchmarking.org, ms, Fewer Is Better
b: 4.29 (MIN: 4.24 / MAX: 5.64)
c: 4.28 (MIN: 4.24 / MAX: 5.12)
f: 4.83 (MIN: 4.76 / MAX: 5.74)
g: 4.71 (MIN: 4.65 / MAX: 5.57)
i: 5.01 (MIN: 4.6 / MAX: 6.68)
4080: 4.66 (MIN: 4.29 / MAX: 6.1)
4080 rep: 4.65 (MIN: 4.26 / MAX: 6.13)
4080 xxx: 4.65 (MIN: 4.28 / MAX: 6.42)
4080 zzz: 4.67 (MIN: 4.28 / MAX: 6.29)
3090: 4.31 (MIN: 4.25 / MAX: 5.13)
3090 rep: 4.30 (MIN: 4.24 / MAX: 4.99)
3070: 11.00 (MIN: 4.33 / MAX: 199.92)
RTX 3070 Ti: 5.55 (SE +/- 0.23, N = 15; MIN: 4.2 / MAX: 281.58; Min: 4.58 / Avg: 5.55 / Max: 6.94)
4090: 4.94 (MIN: 4.51 / MAX: 6.64)
4090 rep: 5.27 (MIN: 4.78 / MAX: 7.7)
nv 4090: 4.67 (MIN: 4.28 / MAX: 5.7)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: resnet50
OpenBenchmarking.org, ms, Fewer Is Better
b: 10.01 (MIN: 9.89 / MAX: 10.86)
c: 10.33 (MIN: 10.16 / MAX: 13.97)
f: 11.05 (MIN: 10.46 / MAX: 112.6)
g: 11.25 (MIN: 10.55 / MAX: 118.12)
i: 14.05 (MIN: 11.69 / MAX: 252.21)
4080: 11.11 (MIN: 10.19 / MAX: 13.03)
4080 rep: 10.79 (MIN: 9.91 / MAX: 12.75)
4080 xxx: 10.91 (MIN: 9.91 / MAX: 13.1)
4080 zzz: 11.21 (MIN: 10.3 / MAX: 13.25)
3090: 10.05 (MIN: 9.85 / MAX: 12.64)
3090 rep: 10.07 (MIN: 9.94 / MAX: 11.06)
3070: 24.07 (MIN: 10.02 / MAX: 218.35)
RTX 3070 Ti: 12.11 (SE +/- 0.22, N = 15; MIN: 10.16 / MAX: 382.56; Min: 11.02 / Avg: 12.11 / Max: 13.26)
4090: 11.39 (MIN: 10.48 / MAX: 13.29)
4090 rep: 13.82 (MIN: 10.34 / MAX: 245.6)
nv 4090: 13.68 (MIN: 10.25 / MAX: 566.67)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: yolov4-tiny
OpenBenchmarking.org, ms, Fewer Is Better
b: 12.98 (MIN: 12.73 / MAX: 35.55)
c: 12.89 (MIN: 12.84 / MAX: 13.19)
f: 13.07 (MIN: 12.95 / MAX: 14.55)
g: 13.08 (MIN: 12.96 / MAX: 13.83)
i: 15.11 (MIN: 12.93 / MAX: 151.45)
4080: 13.81 (MIN: 12.84 / MAX: 15.1)
4080 rep: 13.55 (MIN: 12.75 / MAX: 14.74)
4080 xxx: 13.69 (MIN: 12.73 / MAX: 15.68)
4080 zzz: 13.83 (MIN: 12.89 / MAX: 15.4)
3090: 12.87 (MIN: 12.75 / MAX: 13.58)
3090 rep: 12.82 (MIN: 12.72 / MAX: 13.48)
3070: 29.34 (MIN: 12.17 / MAX: 245.34)
RTX 3070 Ti: 15.42 (SE +/- 0.14, N = 15; MIN: 12.21 / MAX: 414.81; Min: 13.95 / Avg: 15.42 / Max: 16.11)
4090: 15.30 (MIN: 12.87 / MAX: 144.73)
4090 rep: 15.34 (MIN: 12.94 / MAX: 157.95)
nv 4090: 15.62 (MIN: 12.99 / MAX: 184)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: squeezenet_ssd
OpenBenchmarking.org, ms, Fewer Is Better
b: 7.03 (MIN: 6.97 / MAX: 7.88)
c: 7.04 (MIN: 6.96 / MAX: 7.83)
f: 6.97 (MIN: 6.83 / MAX: 13.87)
g: 7.26 (MIN: 7.14 / MAX: 8.59)
i: 8.16 (MIN: 7.51 / MAX: 9.94)
4080: 7.64 (MIN: 7.05 / MAX: 9.9)
4080 rep: 7.59 (MIN: 7.02 / MAX: 8.87)
4080 xxx: 7.64 (MIN: 7.03 / MAX: 9.19)
4080 zzz: 7.35 (MIN: 6.79 / MAX: 9.82)
3090: 7.04 (MIN: 6.96 / MAX: 7.74)
3090 rep: 7.09 (MIN: 7.02 / MAX: 7.99)
3070: 17.75 (MIN: 6.47 / MAX: 272.11)
RTX 3070 Ti: 8.39 (SE +/- 0.24, N = 15; MIN: 6.53 / MAX: 436.05; Min: 6.93 / Avg: 8.39 / Max: 9.59)
4090: 9.32 (MIN: 7.1 / MAX: 172.56)
4090 rep: 9.30 (MIN: 6.92 / MAX: 310.91)
nv 4090: 9.37 (MIN: 7.07 / MAX: 281.92)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: regnety_400m
OpenBenchmarking.org, ms, Fewer Is Better
b: 8.05 (MIN: 8 / MAX: 8.58)
c: 8.14 (MIN: 8.08 / MAX: 8.69)
f: 8.34 (MIN: 8.26 / MAX: 9.3)
g: 8.07 (MIN: 7.97 / MAX: 8.81)
i: 8.21 (MIN: 7.9 / MAX: 9.99)
4080: 8.33 (MIN: 8.02 / MAX: 9.64)
4080 rep: 8.57 (MIN: 8.21 / MAX: 10.39)
4080 xxx: 8.75 (MIN: 8.35 / MAX: 10.08)
4080 zzz: 8.47 (MIN: 8.13 / MAX: 10.27)
3090: 7.95 (MIN: 7.88 / MAX: 8.67)
3090 rep: 8.09 (MIN: 7.99 / MAX: 14.25)
3070: 19.66 (MIN: 7.5 / MAX: 235.36)
RTX 3070 Ti: 9.02 (SE +/- 0.21, N = 15; MIN: 7.69 / MAX: 501.76; Min: 8.17 / Avg: 9.02 / Max: 10.7)
4090: 8.13 (MIN: 7.75 / MAX: 10.05)
4090 rep: 17.15 (MIN: 8.02 / MAX: 773.45)
nv 4090: 9.55 (MIN: 7.5 / MAX: 193.79)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: vision_transformer
OpenBenchmarking.org, ms, Fewer Is Better
b: 31.65 (MIN: 31.53 / MAX: 32.23)
c: 31.78 (MIN: 31.64 / MAX: 34.51)
f: 33.47 (MIN: 32.89 / MAX: 74.09)
g: 33.39 (MIN: 32.73 / MAX: 88.83)
i: 36.55 (MIN: 33 / MAX: 209.38)
4080: 34.20 (MIN: 32.92 / MAX: 36.19)
4080 rep: 34.27 (MIN: 33.07 / MAX: 37.01)
4080 xxx: 34.37 (MIN: 33.01 / MAX: 38.7)
4080 zzz: 34.47 (MIN: 33.32 / MAX: 37.42)
3090: 31.89 (MIN: 31.66 / MAX: 39.97)
3090 rep: 31.93 (MIN: 31.76 / MAX: 33.09)
3070: 81.77 (MIN: 44.4 / MAX: 460.28)
RTX 3070 Ti: 37.86 (SE +/- 0.18, N = 15; MIN: 32.9 / MAX: 463.9; Min: 36.43 / Avg: 37.86 / Max: 38.6)
4090: 38.79 (MIN: 33.95 / MAX: 457.41)
4090 rep: 39.12 (MIN: 33.92 / MAX: 465.83)
nv 4090: 38.99 (MIN: 34.17 / MAX: 473.06)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: CPU-v3-v3-v3 - Model: FastestDet (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              4.07     4.03     5.83
c              4.08     4.05     4.36
f              3.85     3.8      4.65
g              3.97     3.92     4.75
i              5.69     3.69     261.71
4080           4.20     4.06     4.86
4080 rep       4.18     4.03     5.07
4080 xxx       4.19     4.04     5.47
4080 zzz       4.04     3.89     5.01
3090           4.04     4.01     4.15
3090 rep       4.11     4.07     4.21
3070           9.18     3.64     122.65
RTX 3070 Ti    4.33     2.59     433.58
4090           2.93     2.84     3.38
4090 rep       4.16     4        5.58
nv 4090        4.06     3.91     5.78

SE +/- 0.27, N = 14
RTX 3070 Ti: Min: 2.69 / Avg: 4.33 / Max: 6.21
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: mobilenet (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              8.00     7.95     8.99
c              7.95     7.89     8.79
f              8.65     8.55     9.53
g              8.20     8.12     9.4
i              9.05     8.48     11.28
4080           8.43     7.99     10.66
4080 rep       8.46     7.99     10.62
4080 xxx       8.88     8.31     10.01
4080 zzz       8.38     7.95     10.41
3090           8.00     7.94     8.78
3090 rep       8.01     7.95     8.35
3070           16.34    8.13     80.69
RTX 3070 Ti    9.62     7.76     502.83
4090           8.81     8.32     10.7
4090 rep       9.54     8.94     10.54
nv 4090        10.54    8.41     134.08

SE +/- 0.26, N = 15
RTX 3070 Ti: Min: 8.22 / Avg: 9.62 / Max: 10.96
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              3.15     3.11     3.88
c              3.15     3.11     3.85
f              3.16     3.1      3.71
g              3.17     3.13     3.58
i              3.28     3.09     5.28
4080           3.29     3.12     4.64
4080 rep       3.30     3.12     4.7
4080 xxx       3.40     3.23     4.8
4080 zzz       3.23     3.06     4.66
3090           3.16     3.11     3.51
3090 rep       3.17     3.12     4.03
3070           7.24     3.04     261.68
RTX 3070 Ti    3.66     3.01     437.59
4090           4.99     3.1      201.8
4090 rep       3.31     3.12     4.6
nv 4090        4.45     2.65     216.76

SE +/- 0.18, N = 15
RTX 3070 Ti: Min: 3.21 / Avg: 3.66 / Max: 5.04
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              3.16     3.11     3.75
c              3.17     3.11     8.89
f              3.15     3.1      3.8
g              3.15     3.1      3.87
i              3.26     3.12     4.19
4080           3.27     3.12     5.24
4080 rep       3.27     3.14     3.99
4080 xxx       3.33     3.19     4.2
4080 zzz       3.20     3.06     3.84
3090 rep       3.15     3.11     3.83
3070           8.06     2.96     219.87
RTX 3070 Ti    3.65     2.87     347.75
4090           3.12     2.99     5.09
4090 rep       4.90     3.17     120.84
nv 4090        2.61     2.5      3.12

SE +/- 0.18, N = 15
RTX 3070 Ti: Min: 3.08 / Avg: 3.65 / Max: 4.86
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: shufflenet-v2 (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              3.33     3.3      3.77
c              3.33     3.31     3.81
f              3.33     3.29     3.99
g              3.35     3.31     4.01
i              3.36     3.25     4.02
4080           3.44     3.31     4.88
4080 rep       3.44     3.32     4.16
4080 xxx       3.51     3.37     4.26
4080 zzz       3.37     3.25     3.95
3090           3.34     3.31     3.6
3090 rep       3.36     3.32     3.7
3070           4.89     3.04     18.32
RTX 3070 Ti    3.95     3.19     410.41
4090           3.34     3.23     4.78
4090 rep       3.40     3.26     4.84
nv 4090        3.17     3.04     3.78

SE +/- 0.22, N = 15
RTX 3070 Ti: Min: 3.33 / Avg: 3.95 / Max: 5.56
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: mnasnet (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              2.96     2.93     3.4
c              2.96     2.93     3.41
f              2.96     2.92     3.81
g              2.98     2.95     3.63
i              2.99     2.86     4.38
4080           3.09     2.94     3.79
4080 rep       3.07     2.94     3.72
4080 xxx       3.13     3        5.1
4080 zzz       3.01     2.91     3.6
3090           2.97     2.92     3.28
3090 rep       2.97     2.94     3.39
3070           6.02     2.79     50.49
RTX 3070 Ti    3.37     2.86     278.87
4090           3.00     2.89     3.46
4090 rep       3.10     2.97     3.72
nv 4090        2.54     2.44     3.58

SE +/- 0.16, N = 14
RTX 3070 Ti: Min: 2.99 / Avg: 3.37 / Max: 4.8
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: efficientnet-b0 (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              3.85     3.82     4.48
c              3.82     3.78     4.53
f              3.85     3.8      4.6
g              4.63     3.8      159.43
i              4.21     3.96     4.94
4080           4.05     3.83     5
4080 rep       4.04     3.81     5.08
4080 xxx       4.22     4        5.58
4080 zzz       3.95     3.76     4.84
3090           3.85     3.8      4.43
3090 rep       3.85     3.81     4.62
3070           7.81     3.73     159.47
RTX 3070 Ti    4.73     3.79     418.72
4090           4.18     4        5.25
4090 rep       6.28     3.91     337.73
nv 4090        5.26     3.48     250.88

SE +/- 0.18, N = 15
RTX 3070 Ti: Min: 4.05 / Avg: 4.73 / Max: 5.77
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: blazeface (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              1.37     1.35     1.39
c              1.36     1.34     1.44
f              1.37     1.35     1.62
g              1.38     1.36     1.76
i              1.25     1.19     2.61
4080           1.42     1.35     2.15
4080 rep       1.45     1.36     8.73
4080 xxx       1.42     1.36     1.92
4080 zzz       1.39     1.34     1.89
3090           1.36     1.34     1.61
3090 rep       1.38     1.36     1.9
3070           3.18     1.31     185.03
RTX 3070 Ti    1.71     1.09     448.17
4090           1.33     1.27     1.98
4090 rep       1.41     1.35     1.89
nv 4090        1.07     1.02     1.52

SE +/- 0.18, N = 15
RTX 3070 Ti: Min: 1.15 / Avg: 1.71 / Max: 3.15
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: googlenet (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              7.97     7.89     8.7
c              7.83     7.74     8.61
f              7.94     7.8      8.78
g              8.35     8.2      9.39
i              10.19    7.73     212.36
4080           8.49     7.82     11.98
4080 rep       8.58     7.79     10.48
4080 xxx       8.99     8.25     10.27
4080 zzz       8.37     7.76     10.31
3090           7.82     7.69     8.6
3090 rep       7.85     7.75     8.64
3070           20.72    7.49     355.33
RTX 3070 Ti    9.86     7.54     396.21
4090           8.87     8.18     11.09
4090 rep       8.90     8.22     11.07
nv 4090        10.01    7.29     259.11

SE +/- 0.21, N = 15
RTX 3070 Ti: Min: 8.58 / Avg: 9.86 / Max: 10.96
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: vgg16 (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              23.42    23.27    24.32
c              23.54    23.32    24.54
f              24.12    23.57    46.44
g              24.71    23.88    119.23
i              29.07    24.45    263.33
4080           25.04    23.87    28.04
4080 rep       25.04    23.81    27.15
4080 xxx       26.08    24.52    27.73
4080 zzz       25.26    24.14    27.73
3090           23.50    23.23    24.26
3090 rep       23.43    23.26    24.3
3070           55.42    25.32    281.46
RTX 3070 Ti    28.40    23.98    456
4090           28.21    24.57    270.76
4090 rep       27.59    24.34    396.09
nv 4090        27.77    24.82    264.66

SE +/- 0.27, N = 15
RTX 3070 Ti: Min: 27.16 / Avg: 28.4 / Max: 30.53
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: resnet18 (ms, Fewer Is Better)

Result         ms       MIN      MAX
b              5.42     5.36     6.27
c              5.23     5.11     6.03
f              5.30     5.17     5.93
g              5.50     5.4      6.38
i              5.85     5.3      8.27
4080           5.65     5.14     6.93
4080 rep       5.69     5.11     6.94
4080 xxx       5.89     5.36     7.53
4080 zzz       5.59     5.09     7.7
3090           5.23     5.1      6.07
3090 rep       5.24     5.14     5.99
3070           13.38    5.43     208.42
RTX 3070 Ti    6.23     4.99     309.18
4090           5.97     5.46     7.02
4090 rep       5.81     5.3      6.82
nv 4090        5.84     5.35     7.72

SE +/- 0.16, N = 15
RTX 3070 Ti: Min: 5.49 / Avg: 6.23 / Max: 7.59
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20230517 - Target: Vulkan GPU-v3-v3-v3 - Model: alexnet (ms, Fewer Is Better)

Results: b, c, f, g, i, 4080, 4080 rep, 4080 xxx, 4080 zzz, 3090, 3090 rep, 3070, RTX 3070 Ti, 4090, 4090 rep, nv 4090
SE +/- 0.23, N = 15