Benchmarks by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2012250-HA-NVIDIAGPU61 NVIDIA Linux GPU Performance Comptue RTX 30 RTX 20 - Phoronix Test Suite NVIDIA Linux GPU Performance Comptue RTX 30 RTX 20 Benchmarks by Michael Larabel for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2012250-HA-NVIDIAGPU61&export=pdf&sro&grw .
NVIDIA Linux GPU Performance Comptue RTX 30 RTX 20 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti TITAN RTX RTX 3060 TI RTX 3080 AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 NVIDIA GeForce RTX 2060 6GB (1365/7000MHz) NVIDIA TU106 HD Audio ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.4.0-58-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 NVIDIA 460.27.04 4.6.0 OpenCL 1.2 CUDA 11.2.66 1.2.155 GCC 9.3.0 ext4 3840x2160 NVIDIA GeForce RTX 2060 SUPER 8GB (1470/7000MHz) ASUS NVIDIA GeForce RTX 2070 8GB (435/405MHz) NVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz) NVIDIA TU104 HD Audio Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz) NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz) NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz) NVIDIA TU102 HD Audio NVIDIA TITAN RTX 24GB (1350/7000MHz) NVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz) NVIDIA Device 228b NVIDIA GeForce RTX 3080 10GB (1710/9501MHz) NVIDIA Device 1aef OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa201009 OpenCL Details - RTX 2060: GPU Compute Cores: 1920 - RTX 2060 SUPER: GPU Compute Cores: 2176 - RTX 2070: GPU Compute Cores: 2304 - RTX 2070 SUPER: GPU Compute Cores: 2560 - RTX 2080: GPU Compute Cores: 2944 - RTX 2080 SUPER: GPU Compute Cores: 3072 - RTX 2080 Ti: GPU Compute Cores: 4352 - TITAN RTX: GPU Compute Cores: 4608 - RTX 3060 TI: GPU Compute Cores: 4864 - RTX 3080: GPU Compute Cores: 8704 Python Details - Python 2.7.18 + Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
NVIDIA Linux GPU Performance Comptue RTX 30 RTX 20 betsy: ETC1 - Highest betsy: ETC2 RGB - Highest plaidml: No - Training - VGG16 - OpenCL luxcorerender-cl: DLSC plaidml: No - Training - VGG19 - OpenCL plaidml: No - Inference - VGG16 - OpenCL luxcorerender-cl: Food luxcorerender-cl: Rainbow Colors and Prism plaidml: No - Inference - VGG19 - OpenCL plaidml: No - Inference - IMDB LSTM - OpenCL plaidml: No - Inference - Mobilenet - OpenCL plaidml: No - Inference - ResNet 50 - OpenCL plaidml: Yes - Inference - Mobilenet - OpenCL plaidml: No - Inference - DenseNet 201 - OpenCL plaidml: No - Inference - Inception V3 - OpenCL plaidml: No - Inference - NASNer Large - OpenCL lczero: OpenCL ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 luxcorerender-cl: LuxCore Benchmark ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m rodinia: OpenCL Particle Filter arrayfire: Conjugate Gradient OpenCL blender: BMW27 - NVIDIA OptiX blender: Classroom - NVIDIA OptiX blender: Fishy Cat - NVIDIA OptiX blender: Barbershop - NVIDIA OptiX blender: Pabellon Barcelona - NVIDIA OptiX fahbench: hashcat: MD5 hashcat: SHA1 hashcat: 7-Zip hashcat: SHA-512 hashcat: TrueCrypt RIPEMD160 + XTS octanebench: Total Score redshift: financebench: Black-Scholes OpenCL cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth mandelgpu: GPU viennacl: OpenCL LU Factorization realsr-ncnn: 4x - No realsr-ncnn: 4x - Yes vkfft: vkresample: 2x - Double vkresample: 2x - Single waifu2x-ncnn: 2x - 3 - Yes RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti TITAN RTX RTX 3060 TI RTX 3080 6.189 8.748 27.12 3.30 22.04 110.96 1.21 7.98 88.38 392.02 1282.27 320.95 1720.24 125.45 171.78 31.81 9752 12.42 4.44 4.15 4.42 2.80 4.01 5.48 1.8 13.43 56.98 14.47 11.61 24.58 21.41 14.32 18.29 8.960 2.650 36.27 147.65 67.13 1779.46 206.86 183.2913 25930633333 8248933333 439000 1049366667 311800 195.009806 469 12.565 239.5 297.5 251.6 5224.68 5369.81 231.27 277.40 253858256.7 74.5062 12.707 85.948 22903 350.961 27.195 5.779 5.451 7.733 30.00 4.19 24.58 126.76 1.32 11.41 100.61 440.36 1581.85 376.59 2011.99 147.13 203.78 38.42 11399 13.04 4.67 4.34 4.47 3.63 4.33 5.70 1.93 14.18 57.22 14.77 11.53 25.71 22.80 14.63 18.49 7.981 2.073 30.90 144.40 58.94 1758.17 194.41 207.0770 29272000000 9314900000 490533 1183433333 349600 244.224553 378 10.454 288.3 397.1 340.1 6956.55 6985.02 261.53 369.42 278640038.1 74.0853 11.540 75.892 29585 309.619 22.282 5.313 5.309 7.646 30.27 4.14 129.65 1.30 10.94 103.03 447.47 1566.36 379.70 2004.51 148.04 202.05 39.07 11495 12.66 4.60 4.19 4.45 3.56 4.15 5.59 1.87 13.60 57.13 14.55 11.74 25.05 21.41 14.51 18.68 7.858 2.086 31.34 148.17 58.34 1826.30 198.63 205.2322 29955566667 9535466667 500233 1213266667 356700 243.261218 375 10.225 285.7 397.1 323.9 7094.24 7190.53 268.18 369.14 283803060.1 74.1817 11.187 74.654 29546 302.845 22.269 5.314 4.824 6.653 33.75 4.32 27.78 152.58 1.86 10.04 121.35 499.43 1701.99 422.86 2300.01 149.68 220.43 42.32 12841 12.77 4.64 4.30 4.46 3.67 4.22 5.60 1.88 13.48 57.13 14.53 11.36 25.25 21.79 14.55 18.58 6.900 2.060 25.44 89.12 48.03 996.82 132.73 229.7600 35466033333 11183366667 593700 1426733333 423733 264.534943 351 8.224 292.3 397.1 320.6 8551.57 8591.77 309.27 370.17 321637774.3 75.7984 9.972 63.178 29427 261.969 19.523 4.761 4.375 6.153 34.13 4.23 28.32 161.88 1.88 9.27 128.62 545.48 1726.88 442.78 2333.79 152.28 242.54 45.39 13576 12.69 4.50 4.25 4.48 3.55 4.10 5.56 1.86 13.56 57.25 14.58 11.37 24.62 21.81 14.45 18.77 6.288 2.072 26.24 86.13 45.51 971.62 132.90 244.4171 38493233333 12203533333 637367 1559500000 461367 261.329504 341 7.620 290.5 397.1 331.9 9330.83 8860.55 344.68 369.65 343080891.1 77.1727 9.473 59.403 28745 235.512 19.024 4.605 4.004 5.507 38.7 4.32 32.29 180.23 1.97 9.19 143.39 588.04 1816.87 468.60 2489.11 160.48 266.39 49.16 14634 12.72 4.65 4.30 4.50 3.60 4.21 5.68 1.88 13.12 57.08 14.36 11.22 24.73 22.13 14.47 18.65 5.809 1.893 25.22 81.14 42.60 911.32 126.93 260.8957 43136966667 13653000000 711967 1734100000 527287 268.114368 325 6.749 302.0 437.8 350.8 10244.08 10347.92 373.96 405.68 368037096.6 76.6869 8.842 54.037 30475 216.539 17.507 4.358 3.177 44.02 5.53 36.9 231.73 2.18 12.14 185.09 754.69 2414.96 636.84 3316.31 213.54 351.18 63.53 16952 12.64 4.74 4.43 4.48 4.58 4.56 6.02 1.95 13.96 56.69 14.65 11.44 24.69 21.45 14.36 18.32 4.507 1.670 21.67 73.99 32.80 899.14 103.97 304.0362 55488333333 17751300000 885300 2241700000 657967 354.856769 246 6.014 324.0 545.6 446.6 13432.89 12677.18 519.03 508.00 447272306.0 78.3244 7.605 44.186 34110 152.853 14.767 3.856 3.043 4.116 44.51 5.95 37.56 244.00 2.23 13.56 194.50 782.29 2551.38 654.76 3392.96 228.78 359.13 66.50 17009 12.77 4.49 4.20 4.43 5.03 4.02 6.03 1.83 13.32 56.52 14.36 11.25 24.95 21.88 14.35 18.21 4.296 1.637 18.64 73.40 31.72 905.22 102.51 307.0348 58104300000 18576400000 937233 2350366667 688167 383.463739 235 5.691 320.8 568.2 495.4 13791.91 14109.68 545.22 530.48 460475602.6 81.3167 7.360 41.731 36216 149.204 13.561 3.827 4.358 6.040 34.98 7.13 29.38 167.73 2.97 17.11 133.11 689.61 1875.37 454.66 2428.74 177.41 236.40 48.04 16799 12.54 4.54 4.25 4.47 5.91 4.11 5.55 1.86 13.41 57.18 14.51 11.41 24.69 21.35 15.11 18.57 7.040 2.092 20.15 56.33 36.74 565.99 84.95 235.7167 32927600000 11103733333 581300 1406900000 427100 383.416583 239 8.328 294.1 392.8 384.3 8365.85 16033.64 306.12 389.18 280941400.2 75.7896 8.784 54.144 32556 264.871 17.676 4.371 2.701 3.664 47.50 9.60 40.88 280.31 4.00 21.91 223.64 1019.79 3062.00 730.76 3623.82 261.32 398.64 79.23 18668 12.78 4.56 4.34 4.52 7.84 4.20 5.61 1.91 13.95 56.96 14.74 11.71 25.34 22.15 14.50 18.45 4.290 1.557 11.48 38.85 21.48 421.51 55.63 320.6571 56429133333 19151100000 981300 2426033333 723200 565.099512 165 4.869 354.7 674.2 645.3 15586.03 29490.81 545.91 662.24 421918457.5 79.3217 6.345 34.206 40997 148.284 11.220 3.512 OpenBenchmarking.org
Betsy GPU Compressor Codec: ETC1 - Quality: Highest OpenBenchmarking.org Seconds, Fewer Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 2 4 6 8 10 SE +/- 0.054, N = 13 SE +/- 0.052, N = 13 SE +/- 0.052, N = 13 SE +/- 0.054, N = 13 SE +/- 0.048, N = 14 SE +/- 0.060, N = 14 SE +/- 0.049, N = 14 SE +/- 0.051, N = 14 SE +/- 0.052, N = 14 SE +/- 0.051, N = 14 6.189 5.451 5.309 4.824 4.375 4.004 3.177 4.358 2.701 3.043 1. (CXX) g++ options: -O3 -O2 -lpthread -ldl
Betsy GPU Compressor Codec: ETC2 RGB - Quality: Highest OpenBenchmarking.org Seconds, Fewer Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 3060 TI RTX 3080 TITAN RTX 2 4 6 8 10 SE +/- 0.084, N = 9 SE +/- 0.060, N = 13 SE +/- 0.064, N = 12 SE +/- 0.053, N = 15 SE +/- 0.064, N = 12 SE +/- 0.057, N = 14 SE +/- 0.066, N = 13 SE +/- 0.063, N = 13 SE +/- 0.058, N = 13 8.748 7.733 7.646 6.653 6.153 5.507 6.040 3.664 4.116 1. (CXX) g++ options: -O3 -O2 -lpthread -ldl
PlaidML FP16: No - Mode: Training - Network: VGG16 - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Training - Network: VGG16 - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 11 22 33 44 55 SE +/- 0.20, N = 2 SE +/- 0.33, N = 2 SE +/- 0.18, N = 2 SE +/- 0.36, N = 2 27.12 30.00 30.27 33.75 34.13 38.70 44.02 34.98 47.50 44.51
LuxCoreRender OpenCL Scene: DLSC OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender OpenCL 2.3 Scene: DLSC RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 12 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.08, N = 12 SE +/- 0.10, N = 12 SE +/- 0.14, N = 12 SE +/- 0.01, N = 3 3.30 4.19 4.14 4.32 4.23 4.32 5.53 7.13 9.60 5.95 MIN: 3.16 / MAX: 3.4 MIN: 4.11 / MAX: 4.29 MIN: 3.79 / MAX: 4.28 MIN: 1.6 / MAX: 4.51 MIN: 4.15 / MAX: 4.3 MIN: 4.13 / MAX: 4.4 MIN: 2.02 / MAX: 5.76 MIN: 2.58 / MAX: 7.38 MIN: 3.45 / MAX: 9.87 MIN: 5.62 / MAX: 6.04
PlaidML FP16: No - Mode: Training - Network: VGG19 - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Training - Network: VGG19 - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 9 18 27 36 45 SE +/- 0.04, N = 2 SE +/- 0.05, N = 2 SE +/- 0.15, N = 2 SE +/- 0.08, N = 2 22.04 24.58 27.78 28.32 32.29 36.90 29.38 40.88 37.56
PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 60 120 180 240 300 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.18, N = 3 SE +/- 0.00, N = 3 SE +/- 0.10, N = 3 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 110.96 126.76 129.65 152.58 161.88 180.23 231.73 167.73 280.31 244.00
LuxCoreRender OpenCL Scene: Food OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender OpenCL 2.3 Scene: Food RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 0.9 1.8 2.7 3.6 4.5 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 12 SE +/- 0.03, N = 4 SE +/- 0.01, N = 3 SE +/- 0.04, N = 12 SE +/- 0.05, N = 12 SE +/- 0.07, N = 14 SE +/- 0.03, N = 5 1.21 1.32 1.30 1.86 1.88 1.97 2.18 2.97 4.00 2.23 MIN: 0.24 / MAX: 1.44 MIN: 0.23 / MAX: 1.59 MIN: 0.23 / MAX: 1.55 MIN: 0.18 / MAX: 2.3 MIN: 0.23 / MAX: 2.29 MIN: 0.27 / MAX: 2.37 MIN: 0.15 / MAX: 2.71 MIN: 0.19 / MAX: 3.74 MIN: 0.17 / MAX: 5.07 MIN: 0.23 / MAX: 2.76
LuxCoreRender OpenCL Scene: Rainbow Colors and Prism OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender OpenCL 2.3 Scene: Rainbow Colors and Prism RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.20, N = 12 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.27, N = 12 SE +/- 0.48, N = 12 SE +/- 0.65, N = 12 SE +/- 0.02, N = 3 7.98 11.41 10.94 10.04 9.27 9.19 12.14 17.11 21.91 13.56 MIN: 7.02 / MAX: 8.25 MIN: 9.87 / MAX: 11.85 MIN: 9.85 / MAX: 11.46 MIN: 5.24 / MAX: 10.68 MIN: 8.34 / MAX: 9.64 MIN: 7.68 / MAX: 9.63 MIN: 6.39 / MAX: 12.87 MIN: 8.56 / MAX: 18.35 MIN: 12.02 / MAX: 23.73 MIN: 12.87 / MAX: 14.06
PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 50 100 150 200 250 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.27, N = 3 SE +/- 0.20, N = 3 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 SE +/- 0.28, N = 3 SE +/- 0.14, N = 3 88.38 100.61 103.03 121.35 128.62 143.39 185.09 133.11 223.64 194.50
PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 200 400 600 800 1000 SE +/- 0.58, N = 3 SE +/- 0.53, N = 3 SE +/- 1.13, N = 3 SE +/- 0.23, N = 3 SE +/- 1.14, N = 3 SE +/- 0.10, N = 3 SE +/- 1.36, N = 3 SE +/- 0.42, N = 3 SE +/- 1.56, N = 3 SE +/- 1.34, N = 3 392.02 440.36 447.47 499.43 545.48 588.04 754.69 689.61 1019.79 782.29
PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 700 1400 2100 2800 3500 SE +/- 1.88, N = 3 SE +/- 2.50, N = 3 SE +/- 3.57, N = 3 SE +/- 3.23, N = 3 SE +/- 0.70, N = 3 SE +/- 1.68, N = 3 SE +/- 4.12, N = 3 SE +/- 4.77, N = 3 SE +/- 8.25, N = 3 SE +/- 2.13, N = 3 1282.27 1581.85 1566.36 1701.99 1726.88 1816.87 2414.96 1875.37 3062.00 2551.38
PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 160 320 480 640 800 SE +/- 0.61, N = 3 SE +/- 0.68, N = 3 SE +/- 0.70, N = 3 SE +/- 0.62, N = 3 SE +/- 0.34, N = 3 SE +/- 0.40, N = 3 SE +/- 0.39, N = 3 SE +/- 0.51, N = 3 SE +/- 1.40, N = 3 SE +/- 2.07, N = 3 320.95 376.59 379.70 422.86 442.78 468.60 636.84 454.66 730.76 654.76
PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 800 1600 2400 3200 4000 SE +/- 1.92, N = 3 SE +/- 2.83, N = 3 SE +/- 1.56, N = 3 SE +/- 0.32, N = 3 SE +/- 3.97, N = 3 SE +/- 1.35, N = 3 SE +/- 16.01, N = 3 SE +/- 1.99, N = 3 SE +/- 12.23, N = 3 SE +/- 16.11, N = 3 1720.24 2011.99 2004.51 2300.01 2333.79 2489.11 3316.31 2428.74 3623.82 3392.96
PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 60 120 180 240 300 SE +/- 0.07, N = 3 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 SE +/- 0.26, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 125.45 147.13 148.04 149.68 152.28 160.48 213.54 177.41 261.32 228.78
PlaidML FP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 90 180 270 360 450 SE +/- 0.18, N = 3 SE +/- 0.18, N = 3 SE +/- 0.28, N = 3 SE +/- 0.01, N = 3 SE +/- 0.46, N = 3 SE +/- 0.03, N = 3 SE +/- 0.39, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.19, N = 3 171.78 203.78 202.05 220.43 242.54 266.39 351.18 236.40 398.64 359.13
PlaidML FP16: No - Mode: Inference - Network: NASNer Large - Device: OpenCL OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: NASNer Large - Device: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 SE +/- 0.05, N = 3 SE +/- 0.12, N = 3 SE +/- 0.18, N = 3 31.81 38.42 39.07 42.32 45.39 49.16 63.53 48.04 79.23 66.50
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 4K 8K 12K 16K 20K SE +/- 66.84, N = 3 SE +/- 69.95, N = 3 SE +/- 54.87, N = 3 SE +/- 49.97, N = 3 SE +/- 51.64, N = 3 SE +/- 48.26, N = 3 SE +/- 54.85, N = 3 SE +/- 26.30, N = 3 SE +/- 122.65, N = 3 SE +/- 110.73, N = 3 9752 11399 11495 12841 13576 14634 16952 16799 18668 17009 1. (CXX) g++ options: -flto -pthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mobilenet RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 3 6 9 12 15 SE +/- 0.14, N = 3 SE +/- 0.17, N = 4 SE +/- 0.17, N = 3 SE +/- 0.14, N = 15 SE +/- 0.16, N = 5 SE +/- 0.11, N = 3 SE +/- 0.18, N = 3 SE +/- 0.17, N = 4 SE +/- 0.12, N = 3 SE +/- 0.16, N = 3 12.42 13.04 12.66 12.77 12.69 12.72 12.64 12.54 12.78 12.77 MIN: 11.98 / MAX: 24.63 MIN: 12.34 / MAX: 24 MIN: 12.23 / MAX: 24.28 MIN: 11.98 / MAX: 34.45 MIN: 11.97 / MAX: 24.76 MIN: 12.26 / MAX: 13.5 MIN: 12.22 / MAX: 14.36 MIN: 11.94 / MAX: 15.69 MIN: 12.35 / MAX: 20.47 MIN: 12.01 / MAX: 24.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 1.0665 2.133 3.1995 4.266 5.3325 SE +/- 0.03, N = 3 SE +/- 0.07, N = 4 SE +/- 0.02, N = 3 SE +/- 0.07, N = 15 SE +/- 0.04, N = 5 SE +/- 0.23, N = 3 SE +/- 0.21, N = 3 SE +/- 0.05, N = 4 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 4.44 4.67 4.60 4.64 4.50 4.65 4.74 4.54 4.56 4.49 MIN: 4.21 / MAX: 4.75 MIN: 4.31 / MAX: 5.1 MIN: 4.35 / MAX: 13.48 MIN: 4.23 / MAX: 5.42 MIN: 4.19 / MAX: 13.41 MIN: 4.19 / MAX: 5.42 MIN: 4.31 / MAX: 5.6 MIN: 4.25 / MAX: 5.73 MIN: 4.3 / MAX: 4.85 MIN: 4.26 / MAX: 4.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 0.9968 1.9936 2.9904 3.9872 4.984 SE +/- 0.03, N = 3 SE +/- 0.08, N = 4 SE +/- 0.01, N = 3 SE +/- 0.04, N = 15 SE +/- 0.05, N = 5 SE +/- 0.10, N = 3 SE +/- 0.11, N = 3 SE +/- 0.06, N = 4 SE +/- 0.09, N = 3 SE +/- 0.00, N = 3 4.15 4.34 4.19 4.30 4.25 4.30 4.43 4.25 4.34 4.20 MIN: 4.06 / MAX: 4.44 MIN: 4.12 / MAX: 4.79 MIN: 4.14 / MAX: 4.4 MIN: 4.06 / MAX: 19.54 MIN: 4.07 / MAX: 4.62 MIN: 4.12 / MAX: 4.69 MIN: 4.13 / MAX: 11.06 MIN: 4.02 / MAX: 5.79 MIN: 4.17 / MAX: 4.73 MIN: 4.14 / MAX: 4.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: shufflenet-v2 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 1.017 2.034 3.051 4.068 5.085 SE +/- 0.02, N = 3 SE +/- 0.01, N = 4 SE +/- 0.03, N = 3 SE +/- 0.01, N = 13 SE +/- 0.01, N = 5 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 4.42 4.47 4.45 4.46 4.48 4.50 4.48 4.47 4.52 4.43 MIN: 4.31 / MAX: 4.8 MIN: 4.37 / MAX: 4.78 MIN: 4.34 / MAX: 4.91 MIN: 4.31 / MAX: 4.82 MIN: 4.36 / MAX: 14.35 MIN: 4.35 / MAX: 5.04 MIN: 4.38 / MAX: 4.9 MIN: 4.34 / MAX: 5.99 MIN: 4.42 / MAX: 13.63 MIN: 4.32 / MAX: 5.06 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
LuxCoreRender OpenCL Scene: LuxCore Benchmark OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender OpenCL 2.3 Scene: LuxCore Benchmark RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 12 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 13 SE +/- 0.07, N = 12 SE +/- 0.10, N = 12 SE +/- 0.02, N = 3 2.80 3.63 3.56 3.67 3.55 3.60 4.58 5.91 7.84 5.03 MIN: 0.23 / MAX: 3.2 MIN: 0.23 / MAX: 4.15 MIN: 0.23 / MAX: 4.1 MIN: 0.2 / MAX: 4.25 MIN: 0.23 / MAX: 4.07 MIN: 0.27 / MAX: 4.14 MIN: 0.19 / MAX: 5.4 MIN: 0.25 / MAX: 6.86 MIN: 0.15 / MAX: 9.17 MIN: 0.32 / MAX: 5.72
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mnasnet RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 1.026 2.052 3.078 4.104 5.13 SE +/- 0.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 15 SE +/- 0.06, N = 5 SE +/- 0.21, N = 3 SE +/- 0.26, N = 3 SE +/- 0.06, N = 4 SE +/- 0.13, N = 3 SE +/- 0.02, N = 3 4.01 4.33 4.15 4.22 4.10 4.21 4.56 4.11 4.20 4.02 MIN: 3.83 / MAX: 4.42 MIN: 4.04 / MAX: 4.77 MIN: 4 / MAX: 16.56 MIN: 3.86 / MAX: 5.14 MIN: 3.82 / MAX: 4.59 MIN: 3.86 / MAX: 4.92 MIN: 3.93 / MAX: 5.1 MIN: 3.87 / MAX: 5.49 MIN: 3.91 / MAX: 10.98 MIN: 3.88 / MAX: 4.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: efficientnet-b0 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.06, N = 4 SE +/- 0.02, N = 3 SE +/- 0.06, N = 15 SE +/- 0.04, N = 5 SE +/- 0.21, N = 3 SE +/- 0.26, N = 3 SE +/- 0.05, N = 4 SE +/- 0.06, N = 3 SE +/- 0.57, N = 3 5.48 5.70 5.59 5.60 5.56 5.68 6.02 5.55 5.61 6.03 MIN: 5.32 / MAX: 5.98 MIN: 5.43 / MAX: 17.45 MIN: 5.42 / MAX: 5.85 MIN: 5.28 / MAX: 16.37 MIN: 5.37 / MAX: 6.08 MIN: 5.29 / MAX: 17.94 MIN: 5.37 / MAX: 11.96 MIN: 5.34 / MAX: 7.32 MIN: 5.45 / MAX: 5.87 MIN: 5.28 / MAX: 339.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: blazeface RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 0.4388 0.8776 1.3164 1.7552 2.194 SE +/- 0.03, N = 3 SE +/- 0.03, N = 4 SE +/- 0.05, N = 3 SE +/- 0.02, N = 15 SE +/- 0.02, N = 5 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 4 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 1.80 1.93 1.87 1.88 1.86 1.88 1.95 1.86 1.91 1.83 MIN: 1.79 / MAX: 1.96 MIN: 1.85 / MAX: 2.11 MIN: 1.8 / MAX: 2.15 MIN: 1.78 / MAX: 2.2 MIN: 1.81 / MAX: 2.28 MIN: 1.77 / MAX: 2.21 MIN: 1.85 / MAX: 2.39 MIN: 1.78 / MAX: 2.03 MIN: 1.82 / MAX: 2.14 MIN: 1.78 / MAX: 2.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: googlenet RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 4 8 12 16 20 SE +/- 0.55, N = 3 SE +/- 0.43, N = 4 SE +/- 0.27, N = 3 SE +/- 0.21, N = 14 SE +/- 0.35, N = 5 SE +/- 0.39, N = 3 SE +/- 0.49, N = 3 SE +/- 0.33, N = 4 SE +/- 0.37, N = 3 SE +/- 0.68, N = 2 13.43 14.18 13.60 13.48 13.56 13.12 13.96 13.41 13.95 13.32 MIN: 12.61 / MAX: 15.67 MIN: 12.62 / MAX: 15.92 MIN: 12.94 / MAX: 24.05 MIN: 12.43 / MAX: 26.89 MIN: 12.71 / MAX: 20.62 MIN: 12.41 / MAX: 14.53 MIN: 12.74 / MAX: 15.74 MIN: 12.46 / MAX: 28.14 MIN: 12.96 / MAX: 24.07 MIN: 12.38 / MAX: 25.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: vgg16 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 13 26 39 52 65 SE +/- 0.31, N = 3 SE +/- 0.14, N = 4 SE +/- 0.13, N = 3 SE +/- 0.13, N = 15 SE +/- 0.17, N = 5 SE +/- 0.37, N = 3 SE +/- 0.27, N = 3 SE +/- 0.43, N = 4 SE +/- 0.09, N = 3 SE +/- 0.17, N = 3 56.98 57.22 57.13 57.13 57.25 57.08 56.69 57.18 56.96 56.52 MIN: 55.41 / MAX: 58.77 MIN: 55.71 / MAX: 64.21 MIN: 56.01 / MAX: 66.17 MIN: 54.93 / MAX: 68.8 MIN: 55.82 / MAX: 67.69 MIN: 55.71 / MAX: 70.92 MIN: 55.51 / MAX: 58.35 MIN: 54.93 / MAX: 68.94 MIN: 56.04 / MAX: 59.71 MIN: 55.22 / MAX: 72.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet18 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 4 8 12 16 20 SE +/- 0.26, N = 3 SE +/- 0.19, N = 4 SE +/- 0.19, N = 3 SE +/- 0.07, N = 15 SE +/- 0.12, N = 5 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 SE +/- 0.21, N = 4 SE +/- 0.25, N = 3 SE +/- 0.25, N = 3 14.47 14.77 14.55 14.53 14.58 14.36 14.65 14.51 14.74 14.36 MIN: 14.01 / MAX: 23.82 MIN: 14.08 / MAX: 16.79 MIN: 14.18 / MAX: 15.25 MIN: 14.06 / MAX: 27.16 MIN: 14.21 / MAX: 16.27 MIN: 14.01 / MAX: 15.12 MIN: 14.13 / MAX: 15.56 MIN: 13.97 / MAX: 25.49 MIN: 14.13 / MAX: 15.8 MIN: 13.96 / MAX: 17.74 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: alexnet RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 3 6 9 12 15 SE +/- 0.22, N = 3 SE +/- 0.04, N = 4 SE +/- 0.08, N = 3 SE +/- 0.06, N = 15 SE +/- 0.06, N = 5 SE +/- 0.16, N = 3 SE +/- 0.04, N = 3 SE +/- 0.15, N = 4 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 11.61 11.53 11.74 11.36 11.37 11.22 11.44 11.41 11.71 11.25 MIN: 11.08 / MAX: 18.93 MIN: 11.27 / MAX: 18.29 MIN: 11.39 / MAX: 12.18 MIN: 10.75 / MAX: 15.63 MIN: 11.04 / MAX: 17.87 MIN: 10.85 / MAX: 11.85 MIN: 11.09 / MAX: 11.74 MIN: 10.77 / MAX: 22.45 MIN: 11.28 / MAX: 12.12 MIN: 11 / MAX: 11.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet50 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 6 12 18 24 30 SE +/- 0.15, N = 3 SE +/- 0.09, N = 4 SE +/- 0.54, N = 3 SE +/- 0.18, N = 15 SE +/- 0.23, N = 5 SE +/- 0.40, N = 3 SE +/- 0.33, N = 3 SE +/- 0.35, N = 4 SE +/- 0.56, N = 3 SE +/- 0.37, N = 3 24.58 25.71 25.05 25.25 24.62 24.73 24.69 24.69 25.34 24.95 MIN: 23.83 / MAX: 37.73 MIN: 24.88 / MAX: 31.53 MIN: 24.29 / MAX: 41.55 MIN: 23.93 / MAX: 36.67 MIN: 23.99 / MAX: 26.48 MIN: 23.79 / MAX: 35.29 MIN: 23.98 / MAX: 37.45 MIN: 23.66 / MAX: 26.48 MIN: 24.02 / MAX: 26.96 MIN: 23.91 / MAX: 36.02 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: yolov4-tiny RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 5 10 15 20 25 SE +/- 0.56, N = 3 SE +/- 0.07, N = 4 SE +/- 0.58, N = 3 SE +/- 0.24, N = 15 SE +/- 0.41, N = 5 SE +/- 0.60, N = 3 SE +/- 0.60, N = 3 SE +/- 0.64, N = 4 SE +/- 0.70, N = 3 SE +/- 0.66, N = 3 21.41 22.80 21.41 21.79 21.81 22.13 21.45 21.35 22.15 21.88 MIN: 20.54 / MAX: 32.23 MIN: 22.31 / MAX: 25.04 MIN: 20.58 / MAX: 23.53 MIN: 20.42 / MAX: 29.15 MIN: 20.55 / MAX: 24.56 MIN: 20.69 / MAX: 28.53 MIN: 20.52 / MAX: 33.08 MIN: 20.24 / MAX: 29.48 MIN: 20.53 / MAX: 30.78 MIN: 20.33 / MAX: 23.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: squeezenet_ssd RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.12, N = 4 SE +/- 0.14, N = 2 SE +/- 0.06, N = 15 SE +/- 0.24, N = 4 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.73, N = 4 SE +/- 0.09, N = 3 SE +/- 0.19, N = 3 14.32 14.63 14.51 14.55 14.45 14.47 14.36 15.11 14.50 14.35 MIN: 14.04 / MAX: 14.98 MIN: 14.15 / MAX: 16.53 MIN: 14.12 / MAX: 15.9 MIN: 13.96 / MAX: 25.49 MIN: 13.6 / MAX: 27.41 MIN: 14.11 / MAX: 24.43 MIN: 13.91 / MAX: 27.4 MIN: 13.72 / MAX: 369.11 MIN: 14.11 / MAX: 14.97 MIN: 13.84 / MAX: 22.5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: regnety_400m RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 5 10 15 20 25 SE +/- 0.08, N = 3 SE +/- 0.15, N = 4 SE +/- 0.06, N = 3 SE +/- 0.07, N = 15 SE +/- 0.09, N = 5 SE +/- 0.19, N = 3 SE +/- 0.20, N = 3 SE +/- 0.28, N = 4 SE +/- 0.35, N = 3 SE +/- 0.09, N = 3 18.29 18.49 18.68 18.58 18.77 18.65 18.32 18.57 18.45 18.21 MIN: 17.99 / MAX: 28.28 MIN: 18.01 / MAX: 28.46 MIN: 18.38 / MAX: 19.17 MIN: 17.91 / MAX: 28.99 MIN: 18.01 / MAX: 30.34 MIN: 18.24 / MAX: 19.54 MIN: 17.72 / MAX: 19.14 MIN: 17.78 / MAX: 20.78 MIN: 17.4 / MAX: 28.56 MIN: 17.88 / MAX: 20.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 3 6 9 12 15 SE +/- 0.009, N = 3 SE +/- 0.027, N = 3 SE +/- 0.017, N = 3 SE +/- 0.014, N = 3 SE +/- 0.006, N = 3 SE +/- 0.010, N = 3 SE +/- 0.076, N = 3 SE +/- 0.067, N = 3 SE +/- 0.018, N = 3 SE +/- 0.016, N = 3 8.960 7.981 7.858 6.900 6.288 5.809 4.507 7.040 4.290 4.296 1. (CXX) g++ options: -O2 -lOpenCL
ArrayFire Test: Conjugate Gradient OpenCL OpenBenchmarking.org ms, Fewer Is Better ArrayFire 3.7 Test: Conjugate Gradient OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 0.5963 1.1926 1.7889 2.3852 2.9815 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.001, N = 3 SE +/- 0.010, N = 3 SE +/- 0.002, N = 3 SE +/- 0.004, N = 3 SE +/- 0.010, N = 3 2.650 2.073 2.086 2.060 2.072 1.893 1.670 2.092 1.557 1.637 1. (CXX) g++ options: -rdynamic
Blender Blend File: BMW27 - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: NVIDIA OptiX RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 2.49, N = 15 SE +/- 2.49, N = 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 36.27 30.90 31.34 25.44 26.24 25.22 21.67 20.15 11.48 18.64
Blender Blend File: Classroom - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Classroom - Compute: NVIDIA OptiX RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 30 60 90 120 150 SE +/- 0.39, N = 3 SE +/- 0.38, N = 3 SE +/- 0.95, N = 3 SE +/- 0.32, N = 3 SE +/- 0.14, N = 3 SE +/- 0.11, N = 3 SE +/- 0.50, N = 3 SE +/- 0.23, N = 3 SE +/- 0.16, N = 3 SE +/- 0.17, N = 3 147.65 144.40 148.17 89.12 86.13 81.14 73.99 56.33 38.85 73.40
Blender Blend File: Fishy Cat - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: NVIDIA OptiX RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 15 30 45 60 75 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 67.13 58.94 58.34 48.03 45.51 42.60 32.80 36.74 21.48 31.72
Blender Blend File: Barbershop - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Barbershop - Compute: NVIDIA OptiX RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 400 800 1200 1600 2000 SE +/- 2.01, N = 3 SE +/- 0.52, N = 3 SE +/- 2.84, N = 3 SE +/- 1.90, N = 3 SE +/- 0.81, N = 3 SE +/- 0.14, N = 3 SE +/- 1.58, N = 3 SE +/- 0.84, N = 3 SE +/- 1.49, N = 3 SE +/- 1.56, N = 3 1779.46 1758.17 1826.30 996.82 971.62 911.32 899.14 565.99 421.51 905.22
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 50 100 150 200 250 SE +/- 0.04, N = 3 SE +/- 0.26, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.10, N = 3 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 206.86 194.41 198.63 132.73 132.90 126.93 103.97 84.95 55.63 102.51
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 70 140 210 280 350 SE +/- 0.18, N = 3 SE +/- 0.17, N = 3 SE +/- 0.10, N = 3 SE +/- 0.16, N = 3 SE +/- 0.18, N = 3 SE +/- 0.20, N = 3 SE +/- 0.05, N = 3 SE +/- 0.14, N = 3 SE +/- 0.30, N = 3 SE +/- 0.07, N = 3 183.29 207.08 205.23 229.76 244.42 260.90 304.04 235.72 320.66 307.03
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: MD5 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 12000M 24000M 36000M 48000M 60000M SE +/- 11770207.21, N = 3 SE +/- 8911415.90, N = 3 SE +/- 9342079.24, N = 3 SE +/- 41910871.04, N = 3 SE +/- 30944592.06, N = 3 SE +/- 13808974.54, N = 3 SE +/- 63030953.60, N = 3 SE +/- 23055223.56, N = 3 SE +/- 39942305.61, N = 3 SE +/- 33241139.17, N = 3 25930633333 29272000000 29955566667 35466033333 38493233333 43136966667 55488333333 32927600000 56429133333 58104300000
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA1 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 4000M 8000M 12000M 16000M 20000M SE +/- 2302414.19, N = 3 SE +/- 7522189.40, N = 3 SE +/- 2514844.82, N = 3 SE +/- 1017076.42, N = 3 SE +/- 14178073.84, N = 3 SE +/- 3601851.38, N = 3 SE +/- 20384389.45, N = 3 SE +/- 16045802.50, N = 3 SE +/- 13325289.24, N = 3 SE +/- 24080351.60, N = 3 8248933333 9314900000 9535466667 11183366667 12203533333 13653000000 17751300000 11103733333 19151100000 18576400000
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: 7-Zip RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 200K 400K 600K 800K 1000K SE +/- 1101.51, N = 3 SE +/- 1192.10, N = 3 SE +/- 726.48, N = 3 SE +/- 435.89, N = 3 SE +/- 1713.99, N = 3 SE +/- 1963.27, N = 3 SE +/- 3113.41, N = 3 SE +/- 3659.23, N = 3 SE +/- 503.32, N = 3 SE +/- 520.68, N = 3 439000 490533 500233 593700 637367 711967 885300 581300 981300 937233
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA-512 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 500M 1000M 1500M 2000M 2500M SE +/- 635959.47, N = 3 SE +/- 1281058.59, N = 3 SE +/- 845248.16, N = 3 SE +/- 933333.33, N = 3 SE +/- 814452.78, N = 3 SE +/- 2042873.79, N = 3 SE +/- 1258305.74, N = 3 SE +/- 750555.35, N = 3 SE +/- 2669165.50, N = 3 SE +/- 3263093.28, N = 3 1049366667 1183433333 1213266667 1426733333 1559500000 1734100000 2241700000 1406900000 2426033333 2350366667
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: TrueCrypt RIPEMD160 + XTS RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 150K 300K 450K 600K 750K SE +/- 57.74, N = 3 SE +/- 57.74, N = 3 SE +/- 556.78, N = 3 SE +/- 66.67, N = 3 SE +/- 133.33, N = 3 SE +/- 4580.29, N = 15 SE +/- 1591.99, N = 3 SE +/- 550.76, N = 3 SE +/- 750.56, N = 3 SE +/- 1120.02, N = 3 311800 349600 356700 423733 461367 527287 657967 427100 723200 688167
OctaneBench Total Score OpenBenchmarking.org Score, More Is Better OctaneBench 2020.1 Total Score RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 120 240 360 480 600 195.01 244.22 243.26 264.53 261.33 268.11 354.86 383.42 565.10 383.46
RedShift Demo OpenBenchmarking.org Seconds, Fewer Is Better RedShift Demo 3.0 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 100 200 300 400 500 SE +/- 0.88, N = 3 SE +/- 1.00, N = 3 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 SE +/- 1.33, N = 3 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 469 378 375 351 341 325 246 239 165 235
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-06-06 Benchmark: Black-Scholes OpenCL RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 3 6 9 12 15 SE +/- 0.001, N = 3 SE +/- 0.010, N = 3 SE +/- 0.017, N = 3 SE +/- 0.003, N = 3 SE +/- 0.019, N = 3 SE +/- 0.022, N = 3 SE +/- 0.007, N = 3 SE +/- 0.000, N = 3 SE +/- 0.020, N = 3 SE +/- 0.005, N = 3 12.565 10.454 10.225 8.224 7.620 6.749 6.014 8.328 4.869 5.691 1. (CXX) g++ options: -O3 -lOpenCL
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 80 160 240 320 400 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.15, N = 3 SE +/- 0.21, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 239.5 288.3 285.7 292.3 290.5 302.0 324.0 294.1 354.7 320.8 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 150 300 450 600 750 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 SE +/- 0.44, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 297.5 397.1 397.1 397.1 397.1 437.8 545.6 392.8 674.2 568.2 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 140 280 420 560 700 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 SE +/- 0.13, N = 3 SE +/- 0.23, N = 3 SE +/- 0.79, N = 3 SE +/- 0.17, N = 3 SE +/- 0.38, N = 3 SE +/- 0.19, N = 3 SE +/- 0.03, N = 3 SE +/- 1.30, N = 3 251.6 340.1 323.9 320.6 331.9 350.8 446.6 384.3 645.3 495.4 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 3K 6K 9K 12K 15K SE +/- 4.93, N = 3 SE +/- 78.58, N = 15 SE +/- 85.50, N = 15 SE +/- 72.96, N = 15 SE +/- 43.62, N = 3 SE +/- 165.30, N = 3 SE +/- 138.31, N = 15 SE +/- 74.13, N = 15 SE +/- 232.90, N = 3 SE +/- 135.03, N = 15 5224.68 6956.55 7094.24 8551.57 9330.83 10244.08 13432.89 8365.85 15586.03 13791.91 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 6K 12K 18K 24K 30K SE +/- 56.79, N = 15 SE +/- 75.69, N = 15 SE +/- 90.04, N = 15 SE +/- 84.78, N = 15 SE +/- 6.84, N = 3 SE +/- 174.06, N = 3 SE +/- 138.31, N = 3 SE +/- 0.44, N = 3 SE +/- 0.32, N = 3 SE +/- 193.76, N = 15 5369.81 6985.02 7190.53 8591.77 8860.55 10347.92 12677.18 16033.64 29490.81 14109.68 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 120 240 360 480 600 SE +/- 0.61, N = 3 SE +/- 0.69, N = 3 SE +/- 0.77, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 1.36, N = 3 SE +/- 0.89, N = 3 SE +/- 0.00, N = 3 SE +/- 1.44, N = 3 231.27 261.53 268.18 309.27 344.68 373.96 519.03 306.12 545.91 545.22 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 140 280 420 560 700 SE +/- 0.14, N = 3 SE +/- 0.41, N = 3 SE +/- 0.01, N = 3 SE +/- 0.21, N = 3 SE +/- 0.04, N = 3 SE +/- 0.18, N = 3 SE +/- 0.66, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 277.40 369.42 369.14 370.17 369.65 405.68 508.00 389.18 662.24 530.48 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
MandelGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: GPU RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 100M 200M 300M 400M 500M SE +/- 1171800.01, N = 3 SE +/- 1645708.79, N = 3 SE +/- 776602.98, N = 3 SE +/- 516930.54, N = 3 SE +/- 1494897.42, N = 3 SE +/- 774665.97, N = 3 SE +/- 590928.34, N = 3 SE +/- 805777.35, N = 3 SE +/- 2886689.51, N = 3 SE +/- 2487384.16, N = 3 253858256.7 278640038.1 283803060.1 321637774.3 343080891.1 368037096.6 447272306.0 280941400.2 421918457.5 460475602.6 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
ViennaCL OpenCL LU Factorization OpenBenchmarking.org GFLOPS, More Is Better ViennaCL 1.4.2 OpenCL LU Factorization RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 20 40 60 80 100 SE +/- 0.20, N = 3 SE +/- 0.40, N = 3 SE +/- 0.18, N = 3 SE +/- 0.40, N = 3 SE +/- 0.16, N = 3 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 SE +/- 0.19, N = 3 74.51 74.09 74.18 75.80 77.17 76.69 78.32 75.79 79.32 81.32 1. (CXX) g++ options: -rdynamic -lOpenCL
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 3 6 9 12 15 SE +/- 0.091, N = 3 SE +/- 0.111, N = 3 SE +/- 0.096, N = 3 SE +/- 0.110, N = 3 SE +/- 0.131, N = 3 SE +/- 0.101, N = 3 SE +/- 0.108, N = 3 SE +/- 0.056, N = 3 SE +/- 0.093, N = 3 SE +/- 0.105, N = 3 12.707 11.540 11.187 9.972 9.473 8.842 7.605 8.784 6.345 7.360
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 20 40 60 80 100 SE +/- 0.35, N = 3 SE +/- 0.37, N = 3 SE +/- 0.45, N = 3 SE +/- 0.29, N = 3 SE +/- 0.38, N = 3 SE +/- 0.27, N = 3 SE +/- 0.21, N = 3 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.22, N = 3 85.95 75.89 74.65 63.18 59.40 54.04 44.19 54.14 34.21 41.73
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.1.1 RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 9K 18K 27K 36K 45K SE +/- 44.06, N = 3 SE +/- 257.75, N = 3 SE +/- 149.21, N = 3 SE +/- 213.44, N = 3 SE +/- 95.96, N = 3 SE +/- 51.40, N = 3 SE +/- 78.30, N = 3 SE +/- 86.88, N = 3 SE +/- 284.10, N = 3 SE +/- 413.34, N = 3 22903 29585 29546 29427 28745 30475 34110 32556 40997 36216 1. (CXX) g++ options: -O3 -pthread
VkResample Upscale: 2x - Precision: Double OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Double RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 80 160 240 320 400 SE +/- 0.05, N = 3 SE +/- 0.28, N = 3 SE +/- 0.78, N = 3 SE +/- 0.26, N = 3 SE +/- 0.63, N = 3 SE +/- 0.01, N = 3 SE +/- 0.12, N = 3 SE +/- 0.88, N = 3 SE +/- 0.30, N = 3 SE +/- 0.10, N = 3 350.96 309.62 302.85 261.97 235.51 216.54 152.85 264.87 148.28 149.20 1. (CXX) g++ options: -O3 -pthread
VkResample Upscale: 2x - Precision: Single OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Single RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 27.20 22.28 22.27 19.52 19.02 17.51 14.77 17.68 11.22 13.56 1. (CXX) g++ options: -O3 -pthread
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes RTX 2060 RTX 2060 SUPER RTX 2070 RTX 2070 SUPER RTX 2080 RTX 2080 SUPER RTX 2080 Ti RTX 3060 TI RTX 3080 TITAN RTX 1.3003 2.6006 3.9009 5.2012 6.5015 SE +/- 0.064, N = 3 SE +/- 0.055, N = 3 SE +/- 0.049, N = 3 SE +/- 0.067, N = 4 SE +/- 0.053, N = 3 SE +/- 0.060, N = 3 SE +/- 0.057, N = 3 SE +/- 0.008, N = 3 SE +/- 0.047, N = 5 SE +/- 0.056, N = 3 5.779 5.313 5.314 4.761 4.605 4.358 3.856 4.371 3.512 3.827
Phoronix Test Suite v10.8.4