NVIDIA Linux GPU Performance Compute RTX 30 RTX 20

Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2012250-HA-NVIDIAGPU61&grs&sro&rro.

NVIDIA Linux GPU Performance Compute RTX 30 RTX 20: System Details

Common to all test systems:
Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16GB
Disk: 2000GB Corsair Force MP600
Monitor: ASUS MG28U
Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.04
Kernel: 5.4.0-58-generic (x86_64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: NVIDIA 460.27.04
OpenGL: 4.6.0
OpenCL: OpenCL 1.2 CUDA 11.2.66
Vulkan: 1.2.155
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 3840x2160

Graphics and audio device per test system:
RTX 2060: NVIDIA GeForce RTX 2060 6GB (1365/7000MHz); Audio: NVIDIA TU106 HD Audio
RTX 2060 SUPER: NVIDIA GeForce RTX 2060 SUPER 8GB (1470/7000MHz); Audio: NVIDIA TU106 HD Audio
RTX 2070: ASUS NVIDIA GeForce RTX 2070 8GB (435/405MHz); Audio: NVIDIA TU106 HD Audio
RTX 2070 SUPER: NVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz); Audio: NVIDIA TU104 HD Audio
RTX 2080: Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz); Audio: NVIDIA TU104 HD Audio
RTX 2080 SUPER: NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz); Audio: NVIDIA TU104 HD Audio
RTX 2080 Ti: NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz); Audio: NVIDIA TU102 HD Audio
TITAN RTX: NVIDIA TITAN RTX 24GB (1350/7000MHz); Audio: NVIDIA TU102 HD Audio
RTX 3060 TI: NVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz); Audio: NVIDIA Device 228b
RTX 3080: NVIDIA GeForce RTX 3080 10GB (1710/9501MHz); Audio: NVIDIA Device 1aef

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa201009

OpenCL Details (GPU Compute Cores):
RTX 2060: 1920
RTX 2060 SUPER: 2176
RTX 2070: 2304
RTX 2070 SUPER: 2560
RTX 2080: 2944
RTX 2080 SUPER: 3072
RTX 2080 Ti: 4352
TITAN RTX: 4608
RTX 3060 TI: 4864
RTX 3080: 8704

Python Details: Python 2.7.18 + Python 3.8.5

Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

NVIDIA Linux GPU Performance Compute RTX 30 RTX 20: Result Overview

Test | RTX 2060 | RTX 2060 SUPER | RTX 2070 | RTX 2070 SUPER | RTX 2080 | RTX 2080 SUPER | RTX 2080 Ti | TITAN RTX | RTX 3060 TI | RTX 3080
clpeak: Single-Precision Float | 5369.81 | 6985.02 | 7190.53 | 8591.77 | 8860.55 | 10347.92 | 12677.18 | 14109.68 | 16033.64 | 29490.81
blender: Barbershop - NVIDIA OptiX | 1779.46 | 1758.17 | 1826.30 | 996.82 | 971.62 | 911.32 | 899.14 | 905.22 | 565.99 | 421.51
blender: Classroom - NVIDIA OptiX | 147.65 | 144.40 | 148.17 | 89.12 | 86.13 | 81.14 | 73.99 | 73.40 | 56.33 | 38.85
blender: Pabellon Barcelona - NVIDIA OptiX | 206.86 | 194.41 | 198.63 | 132.73 | 132.90 | 126.93 | 103.97 | 102.51 | 84.95 | 55.63
luxcorerender-cl: Food | 1.21 | 1.32 | 1.30 | 1.86 | 1.88 | 1.97 | 2.18 | 2.23 | 2.97 | 4.00
blender: BMW27 - NVIDIA OptiX | 36.27 | 30.90 | 31.34 | 25.44 | 26.24 | 25.22 | 21.67 | 18.64 | 20.15 | 11.48
blender: Fishy Cat - NVIDIA OptiX | 67.13 | 58.94 | 58.34 | 48.03 | 45.51 | 42.60 | 32.80 | 31.72 | 36.74 | 21.48
clpeak: Integer Compute INT | 5224.68 | 6956.55 | 7094.24 | 8551.57 | 9330.83 | 10244.08 | 13432.89 | 13791.91 | 8365.85 | 15586.03
luxcorerender-cl: DLSC | 3.30 | 4.19 | 4.14 | 4.32 | 4.23 | 4.32 | 5.53 | 5.95 | 7.13 | 9.60
octanebench: Total Score | 195.009806 | 244.224553 | 243.261218 | 264.534943 | 261.329504 | 268.114368 | 354.856769 | 383.463739 | 383.416583 | 565.099512
redshift | 469 | 378 | 375 | 351 | 341 | 325 | 246 | 235 | 239 | 165
luxcorerender-cl: LuxCore Benchmark | 2.80 | 3.63 | 3.56 | 3.67 | 3.55 | 3.60 | 4.58 | 5.03 | 5.91 | 7.84
plaidml: No - Inference - IMDB LSTM - OpenCL | 392.02 | 440.36 | 447.47 | 499.43 | 545.48 | 588.04 | 754.69 | 782.29 | 689.61 | 1019.79
financebench: Black-Scholes OpenCL | 12.565 | 10.454 | 10.225 | 8.224 | 7.620 | 6.749 | 6.014 | 5.691 | 8.328 | 4.869
cl-mem: Write | 251.6 | 340.1 | 323.9 | 320.6 | 331.9 | 350.8 | 446.6 | 495.4 | 384.3 | 645.3
plaidml: No - Inference - VGG19 - OpenCL | 88.38 | 100.61 | 103.03 | 121.35 | 128.62 | 143.39 | 185.09 | 194.50 | 133.11 | 223.64
plaidml: No - Inference - VGG16 - OpenCL | 110.96 | 126.76 | 129.65 | 152.58 | 161.88 | 180.23 | 231.73 | 244.00 | 167.73 | 280.31
realsr-ncnn: 4x - Yes | 85.948 | 75.892 | 74.654 | 63.178 | 59.403 | 54.037 | 44.186 | 41.731 | 54.144 | 34.206
plaidml: No - Inference - NASNer Large - OpenCL | 31.81 | 38.42 | 39.07 | 42.32 | 45.39 | 49.16 | 63.53 | 66.50 | 48.04 | 79.23
vkresample: 2x - Single | 27.195 | 22.282 | 22.269 | 19.523 | 19.024 | 17.507 | 14.767 | 13.561 | 17.676 | 11.220
plaidml: No - Inference - Mobilenet - OpenCL | 1282.27 | 1581.85 | 1566.36 | 1701.99 | 1726.88 | 1816.87 | 2414.96 | 2551.38 | 1875.37 | 3062.00
betsy: ETC2 RGB - Highest | 8.748 | 7.733 | 7.646 | 6.653 | 6.153 | 5.507 | - | 4.116 | 6.040 | 3.664
clpeak: Global Memory Bandwidth | 277.40 | 369.42 | 369.14 | 370.17 | 369.65 | 405.68 | 508.00 | 530.48 | 389.18 | 662.24
vkresample: 2x - Double | 350.961 | 309.619 | 302.845 | 261.969 | 235.512 | 216.539 | 152.853 | 149.204 | 264.871 | 148.284
clpeak: Double-Precision Double | 231.27 | 261.53 | 268.18 | 309.27 | 344.68 | 373.96 | 519.03 | 545.22 | 306.12 | 545.91
hashcat: SHA1 | 8248933333 | 9314900000 | 9535466667 | 11183366667 | 12203533333 | 13653000000 | 17751300000 | 18576400000 | 11103733333 | 19151100000
plaidml: No - Inference - Inception V3 - OpenCL | 171.78 | 203.78 | 202.05 | 220.43 | 242.54 | 266.39 | 351.18 | 359.13 | 236.40 | 398.64
hashcat: TrueCrypt RIPEMD160 + XTS | 311800 | 349600 | 356700 | 423733 | 461367 | 527287 | 657967 | 688167 | 427100 | 723200
hashcat: SHA-512 | 1049366667 | 1183433333 | 1213266667 | 1426733333 | 1559500000 | 1734100000 | 2241700000 | 2350366667 | 1406900000 | 2426033333
betsy: ETC1 - Highest | 6.189 | 5.451 | 5.309 | 4.824 | 4.375 | 4.004 | 3.177 | 3.043 | 4.358 | 2.701
plaidml: No - Inference - ResNet 50 - OpenCL | 320.95 | 376.59 | 379.70 | 422.86 | 442.78 | 468.60 | 636.84 | 654.76 | 454.66 | 730.76
cl-mem: Read | 297.5 | 397.1 | 397.1 | 397.1 | 397.1 | 437.8 | 545.6 | 568.2 | 392.8 | 674.2
hashcat: MD5 | 25930633333 | 29272000000 | 29955566667 | 35466033333 | 38493233333 | 43136966667 | 55488333333 | 58104300000 | 32927600000 | 56429133333
hashcat: 7-Zip | 439000 | 490533 | 500233 | 593700 | 637367 | 711967 | 885300 | 937233 | 581300 | 981300
plaidml: Yes - Inference - Mobilenet - OpenCL | 1720.24 | 2011.99 | 2004.51 | 2300.01 | 2333.79 | 2489.11 | 3316.31 | 3392.96 | 2428.74 | 3623.82
rodinia: OpenCL Particle Filter | 8.960 | 7.981 | 7.858 | 6.900 | 6.288 | 5.809 | 4.507 | 4.296 | 7.040 | 4.290
plaidml: No - Inference - DenseNet 201 - OpenCL | 125.45 | 147.13 | 148.04 | 149.68 | 152.28 | 160.48 | 213.54 | 228.78 | 177.41 | 261.32
realsr-ncnn: 4x - No | 12.707 | 11.540 | 11.187 | 9.972 | 9.473 | 8.842 | 7.605 | 7.360 | 8.784 | 6.345
lczero: OpenCL | 9752 | 11399 | 11495 | 12841 | 13576 | 14634 | 16952 | 17009 | 16799 | 18668
plaidml: No - Training - VGG19 - OpenCL | 22.04 | 24.58 | - | 27.78 | 28.32 | 32.29 | 36.9 | 37.56 | 29.38 | 40.88
mandelgpu: GPU | 253858256.7 | 278640038.1 | 283803060.1 | 321637774.3 | 343080891.1 | 368037096.6 | 447272306.0 | 460475602.6 | 280941400.2 | 421918457.5
vkfft | 22903 | 29585 | 29546 | 29427 | 28745 | 30475 | 34110 | 36216 | 32556 | 40997
plaidml: No - Training - VGG16 - OpenCL | 27.12 | 30.00 | 30.27 | 33.75 | 34.13 | 38.7 | 44.02 | 44.51 | 34.98 | 47.50
fahbench | 183.2913 | 207.0770 | 205.2322 | 229.7600 | 244.4171 | 260.8957 | 304.0362 | 307.0348 | 235.7167 | 320.6571
arrayfire: Conjugate Gradient OpenCL | 2.650 | 2.073 | 2.086 | 2.060 | 2.072 | 1.893 | 1.670 | 1.637 | 2.092 | 1.557
waifu2x-ncnn: 2x - 3 - Yes | 5.779 | 5.313 | 5.314 | 4.761 | 4.605 | 4.358 | 3.856 | 3.827 | 4.371 | 3.512
cl-mem: Copy | 239.5 | 288.3 | 285.7 | 292.3 | 290.5 | 302.0 | 324.0 | 320.8 | 294.1 | 354.7
viennacl: OpenCL LU Factorization | 74.5062 | 74.0853 | 74.1817 | 75.7984 | 77.1727 | 76.6869 | 78.3244 | 81.3167 | 75.7896 | 79.3217
ncnn: Vulkan GPU - blazeface | 1.82 | 1.93 | 1.87 | 1.88 | 1.86 | 1.88 | 1.95 | 1.83 | 1.86 | 1.91
ncnn: Vulkan GPU - yolov4-tiny | 21.41 | 22.80 | 21.41 | 21.79 | 21.81 | 22.13 | 21.45 | 21.88 | 21.35 | 22.15
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 | 4.44 | 4.67 | 4.60 | 4.64 | 4.50 | 4.65 | 4.74 | 4.49 | 4.54 | 4.56
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 | 4.15 | 4.34 | 4.19 | 4.30 | 4.25 | 4.30 | 4.43 | 4.20 | 4.25 | 4.34
ncnn: Vulkan GPU - squeezenet_ssd | 14.32 | 14.63 | 14.51 | 14.55 | 14.45 | 14.47 | 14.36 | 14.35 | 15.11 | 14.50
ncnn: Vulkan GPU - mobilenet | 12.42 | 13.04 | 12.66 | 12.77 | 12.69 | 12.72 | 12.64 | 12.77 | 12.54 | 12.78
ncnn: Vulkan GPU - alexnet | 11.61 | 11.53 | 11.74 | 11.36 | 11.37 | 11.22 | 11.44 | 11.25 | 11.41 | 11.71
ncnn: Vulkan GPU - resnet50 | 24.58 | 25.71 | 25.05 | 25.25 | 24.62 | 24.73 | 24.69 | 24.95 | 24.69 | 25.34
ncnn: Vulkan GPU - regnety_400m | 18.29 | 18.49 | 18.68 | 18.58 | 18.77 | 18.65 | 18.32 | 18.21 | 18.57 | 18.45
ncnn: Vulkan GPU - resnet18 | 14.47 | 14.77 | 14.55 | 14.53 | 14.58 | 14.36 | 14.65 | 14.36 | 14.51 | 14.74
ncnn: Vulkan GPU - shufflenet-v2 | 4.42 | 4.47 | 4.45 | 4.46 | 4.48 | 4.50 | 4.48 | 4.43 | 4.47 | 4.52
ncnn: Vulkan GPU - vgg16 | 56.98 | 57.22 | 57.13 | 57.13 | 57.25 | 57.08 | 56.69 | 56.52 | 57.18 | 56.96
ncnn: Vulkan GPU - googlenet | 13.43 | 14.18 | 13.60 | 13.48 | 13.56 | 13.12 | 13.96 | 13.32 | 13.41 | 13.95
ncnn: Vulkan GPU - efficientnet-b0 | 5.48 | 5.70 | 5.59 | 5.60 | 5.56 | 5.68 | 6.02 | 6.03 | 5.55 | 5.61
ncnn: Vulkan GPU - mnasnet | 4.01 | 4.33 | 4.15 | 4.22 | 4.10 | 4.21 | 4.56 | 4.02 | 4.11 | 4.20
luxcorerender-cl: Rainbow Colors and Prism | 7.98 | 11.41 | 10.94 | 10.04 | 9.27 | 9.19 | 12.14 | 13.56 | 17.11 | 21.91

clpeak

OpenCL Test: Single-Precision Float

clpeak, OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
TITAN RTX: 14109.68 (SE +/- 193.76, N = 15)
RTX 3080: 29490.81 (SE +/- 0.32, N = 3)
RTX 3060 TI: 16033.64 (SE +/- 0.44, N = 3)
RTX 2080 Ti: 12677.18 (SE +/- 138.31, N = 3)
RTX 2080 SUPER: 10347.92 (SE +/- 174.06, N = 3)
RTX 2080: 8860.55 (SE +/- 6.84, N = 3)
RTX 2070 SUPER: 8591.77 (SE +/- 84.78, N = 15)
RTX 2070: 7190.53 (SE +/- 90.04, N = 15)
RTX 2060 SUPER: 6985.02 (SE +/- 75.69, N = 15)
RTX 2060: 5369.81 (SE +/- 56.79, N = 15)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
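Each result in this export carries an "SE +/- x, N = y" annotation. The export does not state how SE is computed; the sketch below assumes the usual standard error of the mean (sample standard deviation divided by the square root of the run count N). The function name `standard_error` and the sample run values are hypothetical and are not taken from the results.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumption: this is the conventional definition; the export itself
    does not document the formula behind its "SE +/-" figures.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run measurements (illustrative only).
runs = [10.0, 12.0, 14.0]
print(
    f"mean = {statistics.mean(runs):.2f}, "
    f"SE +/- {standard_error(runs):.2f}, N = {len(runs)}"
)
```

Under this definition, a larger N combined with a large SE suggests run-to-run variance that did not settle even after extra runs.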

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Blender 2.90, Blend File: Barbershop - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
TITAN RTX: 905.22 (SE +/- 1.56, N = 3)
RTX 3080: 421.51 (SE +/- 1.49, N = 3)
RTX 3060 TI: 565.99 (SE +/- 0.84, N = 3)
RTX 2080 Ti: 899.14 (SE +/- 1.58, N = 3)
RTX 2080 SUPER: 911.32 (SE +/- 0.14, N = 3)
RTX 2080: 971.62 (SE +/- 0.81, N = 3)
RTX 2070 SUPER: 996.82 (SE +/- 1.90, N = 3)
RTX 2070: 1826.30 (SE +/- 2.84, N = 3)
RTX 2060 SUPER: 1758.17 (SE +/- 0.52, N = 3)
RTX 2060: 1779.46 (SE +/- 2.01, N = 3)
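For "Fewer Is Better" results such as render times, a relative speedup can be read off as baseline time divided by candidate time. The sketch below uses three of the Barbershop times reported above; the helper name `speedup` and the choice of the RTX 2060 as baseline are illustrative assumptions, not part of the export.

```python
def speedup(baseline_seconds, candidate_seconds):
    """Return how many times faster the candidate is than the baseline.

    Valid for time-based ("Fewer Is Better") results only.
    """
    if baseline_seconds <= 0 or candidate_seconds <= 0:
        raise ValueError("times must be positive")
    return baseline_seconds / candidate_seconds


# Barbershop render times in seconds, copied from the result above.
barbershop = {
    "RTX 2060": 1779.46,
    "RTX 2080 Ti": 899.14,
    "RTX 3080": 421.51,
}

baseline = barbershop["RTX 2060"]  # assumed baseline for illustration
for gpu, seconds in barbershop.items():
    print(f"{gpu}: {speedup(baseline, seconds):.2f}x vs RTX 2060")
```

For "More Is Better" metrics (GFLOPS, FPS, H/s) the ratio would be inverted: candidate score divided by baseline score.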

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 2.90, Blend File: Classroom - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
TITAN RTX: 73.40 (SE +/- 0.17, N = 3)
RTX 3080: 38.85 (SE +/- 0.16, N = 3)
RTX 3060 TI: 56.33 (SE +/- 0.23, N = 3)
RTX 2080 Ti: 73.99 (SE +/- 0.50, N = 3)
RTX 2080 SUPER: 81.14 (SE +/- 0.11, N = 3)
RTX 2080: 86.13 (SE +/- 0.14, N = 3)
RTX 2070 SUPER: 89.12 (SE +/- 0.32, N = 3)
RTX 2070: 148.17 (SE +/- 0.95, N = 3)
RTX 2060 SUPER: 144.40 (SE +/- 0.38, N = 3)
RTX 2060: 147.65 (SE +/- 0.39, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 2.90, Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
TITAN RTX: 102.51 (SE +/- 0.08, N = 3)
RTX 3080: 55.63 (SE +/- 0.04, N = 3)
RTX 3060 TI: 84.95 (SE +/- 0.08, N = 3)
RTX 2080 Ti: 103.97 (SE +/- 0.10, N = 3)
RTX 2080 SUPER: 126.93 (SE +/- 0.00, N = 3)
RTX 2080: 132.90 (SE +/- 0.01, N = 3)
RTX 2070 SUPER: 132.73 (SE +/- 0.07, N = 3)
RTX 2070: 198.63 (SE +/- 0.06, N = 3)
RTX 2060 SUPER: 194.41 (SE +/- 0.26, N = 3)
RTX 2060: 206.86 (SE +/- 0.04, N = 3)

LuxCoreRender OpenCL

Scene: Food

LuxCoreRender OpenCL 2.3, Scene: Food (M samples/sec, More Is Better)
TITAN RTX: 2.23 (SE +/- 0.03, N = 5; MIN: 0.23 / MAX: 2.76)
RTX 3080: 4.00 (SE +/- 0.07, N = 14; MIN: 0.17 / MAX: 5.07)
RTX 3060 TI: 2.97 (SE +/- 0.05, N = 12; MIN: 0.19 / MAX: 3.74)
RTX 2080 Ti: 2.18 (SE +/- 0.04, N = 12; MIN: 0.15 / MAX: 2.71)
RTX 2080 SUPER: 1.97 (SE +/- 0.01, N = 3; MIN: 0.27 / MAX: 2.37)
RTX 2080: 1.88 (SE +/- 0.03, N = 4; MIN: 0.23 / MAX: 2.29)
RTX 2070 SUPER: 1.86 (SE +/- 0.04, N = 12; MIN: 0.18 / MAX: 2.3)
RTX 2070: 1.30 (SE +/- 0.01, N = 3; MIN: 0.23 / MAX: 1.55)
RTX 2060 SUPER: 1.32 (SE +/- 0.01, N = 3; MIN: 0.23 / MAX: 1.59)
RTX 2060: 1.21 (SE +/- 0.02, N = 3; MIN: 0.24 / MAX: 1.44)

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 2.90, Blend File: BMW27 - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
TITAN RTX: 18.64 (SE +/- 0.00, N = 3)
RTX 3080: 11.48 (SE +/- 0.01, N = 3)
RTX 3060 TI: 20.15 (SE +/- 2.49, N = 15)
RTX 2080 Ti: 21.67 (SE +/- 2.49, N = 15)
RTX 2080 SUPER: 25.22 (SE +/- 0.01, N = 3)
RTX 2080: 26.24 (SE +/- 0.01, N = 3)
RTX 2070 SUPER: 25.44 (SE +/- 0.00, N = 3)
RTX 2070: 31.34 (SE +/- 0.03, N = 3)
RTX 2060 SUPER: 30.90 (SE +/- 0.03, N = 3)
RTX 2060: 36.27 (SE +/- 0.01, N = 3)

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 2.90, Blend File: Fishy Cat - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
TITAN RTX: 31.72 (SE +/- 0.02, N = 3)
RTX 3080: 21.48 (SE +/- 0.02, N = 3)
RTX 3060 TI: 36.74 (SE +/- 0.00, N = 3)
RTX 2080 Ti: 32.80 (SE +/- 0.02, N = 3)
RTX 2080 SUPER: 42.60 (SE +/- 0.02, N = 3)
RTX 2080: 45.51 (SE +/- 0.03, N = 3)
RTX 2070 SUPER: 48.03 (SE +/- 0.02, N = 3)
RTX 2070: 58.34 (SE +/- 0.03, N = 3)
RTX 2060 SUPER: 58.94 (SE +/- 0.02, N = 3)
RTX 2060: 67.13 (SE +/- 0.04, N = 3)

clpeak

OpenCL Test: Integer Compute INT

clpeak, OpenCL Test: Integer Compute INT (GIOPS, More Is Better)
TITAN RTX: 13791.91 (SE +/- 135.03, N = 15)
RTX 3080: 15586.03 (SE +/- 232.90, N = 3)
RTX 3060 TI: 8365.85 (SE +/- 74.13, N = 15)
RTX 2080 Ti: 13432.89 (SE +/- 138.31, N = 15)
RTX 2080 SUPER: 10244.08 (SE +/- 165.30, N = 3)
RTX 2080: 9330.83 (SE +/- 43.62, N = 3)
RTX 2070 SUPER: 8551.57 (SE +/- 72.96, N = 15)
RTX 2070: 7094.24 (SE +/- 85.50, N = 15)
RTX 2060 SUPER: 6956.55 (SE +/- 78.58, N = 15)
RTX 2060: 5224.68 (SE +/- 4.93, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

LuxCoreRender OpenCL

Scene: DLSC

LuxCoreRender OpenCL 2.3, Scene: DLSC (M samples/sec, More Is Better)
TITAN RTX: 5.95 (SE +/- 0.01, N = 3; MIN: 5.62 / MAX: 6.04)
RTX 3080: 9.60 (SE +/- 0.14, N = 12; MIN: 3.45 / MAX: 9.87)
RTX 3060 TI: 7.13 (SE +/- 0.10, N = 12; MIN: 2.58 / MAX: 7.38)
RTX 2080 Ti: 5.53 (SE +/- 0.08, N = 12; MIN: 2.02 / MAX: 5.76)
RTX 2080 SUPER: 4.32 (SE +/- 0.00, N = 3; MIN: 4.13 / MAX: 4.4)
RTX 2080: 4.23 (SE +/- 0.01, N = 3; MIN: 4.15 / MAX: 4.3)
RTX 2070 SUPER: 4.32 (SE +/- 0.06, N = 12; MIN: 1.6 / MAX: 4.51)
RTX 2070: 4.14 (SE +/- 0.01, N = 3; MIN: 3.79 / MAX: 4.28)
RTX 2060 SUPER: 4.19 (SE +/- 0.00, N = 3; MIN: 4.11 / MAX: 4.29)
RTX 2060: 3.30 (SE +/- 0.00, N = 3; MIN: 3.16 / MAX: 3.4)

OctaneBench

Total Score

OctaneBench 2020.1, Total Score (Score, More Is Better)
TITAN RTX: 383.46
RTX 3080: 565.10
RTX 3060 TI: 383.42
RTX 2080 Ti: 354.86
RTX 2080 SUPER: 268.11
RTX 2080: 261.33
RTX 2070 SUPER: 264.53
RTX 2070: 243.26
RTX 2060 SUPER: 244.22
RTX 2060: 195.01

RedShift Demo

RedShift Demo 3.0 (Seconds, Fewer Is Better)
TITAN RTX: 235 (SE +/- 0.67, N = 3)
RTX 3080: 165 (SE +/- 0.67, N = 3)
RTX 3060 TI: 239 (SE +/- 0.67, N = 3)
RTX 2080 Ti: 246 (SE +/- 0.58, N = 3)
RTX 2080 SUPER: 325 (SE +/- 0.67, N = 3)
RTX 2080: 341 (SE +/- 1.33, N = 3)
RTX 2070 SUPER: 351 (SE +/- 0.33, N = 3)
RTX 2070: 375 (SE +/- 0.88, N = 3)
RTX 2060 SUPER: 378 (SE +/- 1.00, N = 3)
RTX 2060: 469 (SE +/- 0.88, N = 3)

LuxCoreRender OpenCL

Scene: LuxCore Benchmark

LuxCoreRender OpenCL 2.3, Scene: LuxCore Benchmark (M samples/sec, More Is Better)
TITAN RTX: 5.03 (SE +/- 0.02, N = 3; MIN: 0.32 / MAX: 5.72)
RTX 3080: 7.84 (SE +/- 0.10, N = 12; MIN: 0.15 / MAX: 9.17)
RTX 3060 TI: 5.91 (SE +/- 0.07, N = 12; MIN: 0.25 / MAX: 6.86)
RTX 2080 Ti: 4.58 (SE +/- 0.04, N = 13; MIN: 0.19 / MAX: 5.4)
RTX 2080 SUPER: 3.60 (SE +/- 0.03, N = 3; MIN: 0.27 / MAX: 4.14)
RTX 2080: 3.55 (SE +/- 0.02, N = 3; MIN: 0.23 / MAX: 4.07)
RTX 2070 SUPER: 3.67 (SE +/- 0.05, N = 12; MIN: 0.2 / MAX: 4.25)
RTX 2070: 3.56 (SE +/- 0.01, N = 3; MIN: 0.23 / MAX: 4.1)
RTX 2060 SUPER: 3.63 (SE +/- 0.00, N = 3; MIN: 0.23 / MAX: 4.15)
RTX 2060: 2.80 (SE +/- 0.01, N = 3; MIN: 0.23 / MAX: 3.2)

PlaidML

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

PlaidML, FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 782.29 (SE +/- 1.34, N = 3)
RTX 3080: 1019.79 (SE +/- 1.56, N = 3)
RTX 3060 TI: 689.61 (SE +/- 0.42, N = 3)
RTX 2080 Ti: 754.69 (SE +/- 1.36, N = 3)
RTX 2080 SUPER: 588.04 (SE +/- 0.10, N = 3)
RTX 2080: 545.48 (SE +/- 1.14, N = 3)
RTX 2070 SUPER: 499.43 (SE +/- 0.23, N = 3)
RTX 2070: 447.47 (SE +/- 1.13, N = 3)
RTX 2060 SUPER: 440.36 (SE +/- 0.53, N = 3)
RTX 2060: 392.02 (SE +/- 0.58, N = 3)

FinanceBench

Benchmark: Black-Scholes OpenCL

FinanceBench 2016-06-06, Benchmark: Black-Scholes OpenCL (ms, Fewer Is Better)
TITAN RTX: 5.691 (SE +/- 0.005, N = 3)
RTX 3080: 4.869 (SE +/- 0.020, N = 3)
RTX 3060 TI: 8.328 (SE +/- 0.000, N = 3)
RTX 2080 Ti: 6.014 (SE +/- 0.007, N = 3)
RTX 2080 SUPER: 6.749 (SE +/- 0.022, N = 3)
RTX 2080: 7.620 (SE +/- 0.019, N = 3)
RTX 2070 SUPER: 8.224 (SE +/- 0.003, N = 3)
RTX 2070: 10.225 (SE +/- 0.017, N = 3)
RTX 2060 SUPER: 10.454 (SE +/- 0.010, N = 3)
RTX 2060: 12.565 (SE +/- 0.001, N = 3)
1. (CXX) g++ options: -O3 -lOpenCL

cl-mem

Benchmark: Write

cl-mem 2017-01-13, Benchmark: Write (GB/s, More Is Better)
TITAN RTX: 495.4 (SE +/- 1.30, N = 3)
RTX 3080: 645.3 (SE +/- 0.03, N = 3)
RTX 3060 TI: 384.3 (SE +/- 0.19, N = 3)
RTX 2080 Ti: 446.6 (SE +/- 0.38, N = 3)
RTX 2080 SUPER: 350.8 (SE +/- 0.17, N = 3)
RTX 2080: 331.9 (SE +/- 0.79, N = 3)
RTX 2070 SUPER: 320.6 (SE +/- 0.23, N = 3)
RTX 2070: 323.9 (SE +/- 0.13, N = 3)
RTX 2060 SUPER: 340.1 (SE +/- 0.20, N = 3)
RTX 2060: 251.6 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL

PlaidML, FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 194.50 (SE +/- 0.14, N = 3)
RTX 3080: 223.64 (SE +/- 0.28, N = 3)
RTX 3060 TI: 133.11 (SE +/- 0.08, N = 3)
RTX 2080 Ti: 185.09 (SE +/- 0.15, N = 3)
RTX 2080 SUPER: 143.39 (SE +/- 0.20, N = 3)
RTX 2080: 128.62 (SE +/- 0.27, N = 3)
RTX 2070 SUPER: 121.35 (SE +/- 0.05, N = 3)
RTX 2070: 103.03 (SE +/- 0.10, N = 3)
RTX 2060 SUPER: 100.61 (SE +/- 0.08, N = 3)
RTX 2060: 88.38 (SE +/- 0.08, N = 3)

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL

PlaidML, FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 244.00 (SE +/- 0.10, N = 3)
RTX 3080: 280.31 (SE +/- 0.06, N = 3)
RTX 3060 TI: 167.73 (SE +/- 0.11, N = 3)
RTX 2080 Ti: 231.73 (SE +/- 0.10, N = 3)
RTX 2080 SUPER: 180.23 (SE +/- 0.00, N = 3)
RTX 2080: 161.88 (SE +/- 0.18, N = 3)
RTX 2070 SUPER: 152.58 (SE +/- 0.08, N = 3)
RTX 2070: 129.65 (SE +/- 0.05, N = 3)
RTX 2060 SUPER: 126.76 (SE +/- 0.06, N = 3)
RTX 2060: 110.96 (SE +/- 0.02, N = 3)

RealSR-NCNN

Scale: 4x - TAA: Yes

RealSR-NCNN 20200818, Scale: 4x - TAA: Yes (Seconds, Fewer Is Better)
TITAN RTX: 41.73 (SE +/- 0.22, N = 3)
RTX 3080: 34.21 (SE +/- 0.07, N = 3)
RTX 3060 TI: 54.14 (SE +/- 0.09, N = 3)
RTX 2080 Ti: 44.19 (SE +/- 0.21, N = 3)
RTX 2080 SUPER: 54.04 (SE +/- 0.27, N = 3)
RTX 2080: 59.40 (SE +/- 0.38, N = 3)
RTX 2070 SUPER: 63.18 (SE +/- 0.29, N = 3)
RTX 2070: 74.65 (SE +/- 0.45, N = 3)
RTX 2060 SUPER: 75.89 (SE +/- 0.37, N = 3)
RTX 2060: 85.95 (SE +/- 0.35, N = 3)

PlaidML

FP16: No - Mode: Inference - Network: NASNer Large - Device: OpenCL

PlaidML, FP16: No - Mode: Inference - Network: NASNer Large - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 66.50 (SE +/- 0.18, N = 3)
RTX 3080: 79.23 (SE +/- 0.12, N = 3)
RTX 3060 TI: 48.04 (SE +/- 0.05, N = 3)
RTX 2080 Ti: 63.53 (SE +/- 0.14, N = 3)
RTX 2080 SUPER: 49.16 (SE +/- 0.04, N = 3)
RTX 2080: 45.39 (SE +/- 0.09, N = 3)
RTX 2070 SUPER: 42.32 (SE +/- 0.04, N = 3)
RTX 2070: 39.07 (SE +/- 0.10, N = 3)
RTX 2060 SUPER: 38.42 (SE +/- 0.08, N = 3)
RTX 2060: 31.81 (SE +/- 0.03, N = 3)

VkResample

Upscale: 2x - Precision: Single

VkResample 1.0, Upscale: 2x - Precision: Single (ms, Fewer Is Better)
TITAN RTX: 13.56 (SE +/- 0.03, N = 3)
RTX 3080: 11.22 (SE +/- 0.01, N = 3)
RTX 3060 TI: 17.68 (SE +/- 0.01, N = 3)
RTX 2080 Ti: 14.77 (SE +/- 0.04, N = 3)
RTX 2080 SUPER: 17.51 (SE +/- 0.02, N = 3)
RTX 2080: 19.02 (SE +/- 0.01, N = 3)
RTX 2070 SUPER: 19.52 (SE +/- 0.00, N = 3)
RTX 2070: 22.27 (SE +/- 0.03, N = 3)
RTX 2060 SUPER: 22.28 (SE +/- 0.03, N = 3)
RTX 2060: 27.20 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -pthread

PlaidML

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

PlaidML, FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 2551.38 (SE +/- 2.13, N = 3)
RTX 3080: 3062.00 (SE +/- 8.25, N = 3)
RTX 3060 TI: 1875.37 (SE +/- 4.77, N = 3)
RTX 2080 Ti: 2414.96 (SE +/- 4.12, N = 3)
RTX 2080 SUPER: 1816.87 (SE +/- 1.68, N = 3)
RTX 2080: 1726.88 (SE +/- 0.70, N = 3)
RTX 2070 SUPER: 1701.99 (SE +/- 3.23, N = 3)
RTX 2070: 1566.36 (SE +/- 3.57, N = 3)
RTX 2060 SUPER: 1581.85 (SE +/- 2.50, N = 3)
RTX 2060: 1282.27 (SE +/- 1.88, N = 3)

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

Betsy GPU Compressor 1.1 Beta, Codec: ETC2 RGB - Quality: Highest (Seconds, Fewer Is Better; no RTX 2080 Ti result is listed)
TITAN RTX: 4.116 (SE +/- 0.058, N = 13)
RTX 3080: 3.664 (SE +/- 0.063, N = 13)
RTX 3060 TI: 6.040 (SE +/- 0.066, N = 13)
RTX 2080 SUPER: 5.507 (SE +/- 0.057, N = 14)
RTX 2080: 6.153 (SE +/- 0.064, N = 12)
RTX 2070 SUPER: 6.653 (SE +/- 0.053, N = 15)
RTX 2070: 7.646 (SE +/- 0.064, N = 12)
RTX 2060 SUPER: 7.733 (SE +/- 0.060, N = 13)
RTX 2060: 8.748 (SE +/- 0.084, N = 9)
1. (CXX) g++ options: -O3 -O2 -lpthread -ldl

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak, OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
TITAN RTX: 530.48 (SE +/- 0.01, N = 3)
RTX 3080: 662.24 (SE +/- 0.02, N = 3)
RTX 3060 TI: 389.18 (SE +/- 0.04, N = 3)
RTX 2080 Ti: 508.00 (SE +/- 0.66, N = 3)
RTX 2080 SUPER: 405.68 (SE +/- 0.18, N = 3)
RTX 2080: 369.65 (SE +/- 0.04, N = 3)
RTX 2070 SUPER: 370.17 (SE +/- 0.21, N = 3)
RTX 2070: 369.14 (SE +/- 0.01, N = 3)
RTX 2060 SUPER: 369.42 (SE +/- 0.41, N = 3)
RTX 2060: 277.40 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

VkResample

Upscale: 2x - Precision: Double

VkResample 1.0, Upscale: 2x - Precision: Double (ms, Fewer Is Better)
TITAN RTX: 149.20 (SE +/- 0.10, N = 3)
RTX 3080: 148.28 (SE +/- 0.30, N = 3)
RTX 3060 TI: 264.87 (SE +/- 0.88, N = 3)
RTX 2080 Ti: 152.85 (SE +/- 0.12, N = 3)
RTX 2080 SUPER: 216.54 (SE +/- 0.01, N = 3)
RTX 2080: 235.51 (SE +/- 0.63, N = 3)
RTX 2070 SUPER: 261.97 (SE +/- 0.26, N = 3)
RTX 2070: 302.85 (SE +/- 0.78, N = 3)
RTX 2060 SUPER: 309.62 (SE +/- 0.28, N = 3)
RTX 2060: 350.96 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -pthread

clpeak

OpenCL Test: Double-Precision Double

clpeak, OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
TITAN RTX: 545.22 (SE +/- 1.44, N = 3)
RTX 3080: 545.91 (SE +/- 0.00, N = 3)
RTX 3060 TI: 306.12 (SE +/- 0.89, N = 3)
RTX 2080 Ti: 519.03 (SE +/- 1.36, N = 3)
RTX 2080 SUPER: 373.96 (SE +/- 0.00, N = 3)
RTX 2080: 344.68 (SE +/- 0.00, N = 3)
RTX 2070 SUPER: 309.27 (SE +/- 0.01, N = 3)
RTX 2070: 268.18 (SE +/- 0.77, N = 3)
RTX 2060 SUPER: 261.53 (SE +/- 0.69, N = 3)
RTX 2060: 231.27 (SE +/- 0.61, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Hashcat

Benchmark: SHA1

Hashcat 6.1.1, Benchmark: SHA1 (H/s, More Is Better)
TITAN RTX: 18576400000 (SE +/- 24080351.60, N = 3)
RTX 3080: 19151100000 (SE +/- 13325289.24, N = 3)
RTX 3060 TI: 11103733333 (SE +/- 16045802.50, N = 3)
RTX 2080 Ti: 17751300000 (SE +/- 20384389.45, N = 3)
RTX 2080 SUPER: 13653000000 (SE +/- 3601851.38, N = 3)
RTX 2080: 12203533333 (SE +/- 14178073.84, N = 3)
RTX 2070 SUPER: 11183366667 (SE +/- 1017076.42, N = 3)
RTX 2070: 9535466667 (SE +/- 2514844.82, N = 3)
RTX 2060 SUPER: 9314900000 (SE +/- 7522189.40, N = 3)
RTX 2060: 8248933333 (SE +/- 2302414.19, N = 3)

PlaidML

FP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCL

PlaidML, FP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 359.13 (SE +/- 0.19, N = 3)
RTX 3080: 398.64 (SE +/- 0.67, N = 3)
RTX 3060 TI: 236.40 (SE +/- 0.67, N = 3)
RTX 2080 Ti: 351.18 (SE +/- 0.39, N = 3)
RTX 2080 SUPER: 266.39 (SE +/- 0.03, N = 3)
RTX 2080: 242.54 (SE +/- 0.46, N = 3)
RTX 2070 SUPER: 220.43 (SE +/- 0.01, N = 3)
RTX 2070: 202.05 (SE +/- 0.28, N = 3)
RTX 2060 SUPER: 203.78 (SE +/- 0.18, N = 3)
RTX 2060: 171.78 (SE +/- 0.18, N = 3)

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

Hashcat 6.1.1, Benchmark: TrueCrypt RIPEMD160 + XTS (H/s, More Is Better)
TITAN RTX: 688167 (SE +/- 1120.02, N = 3)
RTX 3080: 723200 (SE +/- 750.56, N = 3)
RTX 3060 TI: 427100 (SE +/- 550.76, N = 3)
RTX 2080 Ti: 657967 (SE +/- 1591.99, N = 3)
RTX 2080 SUPER: 527287 (SE +/- 4580.29, N = 15)
RTX 2080: 461367 (SE +/- 133.33, N = 3)
RTX 2070 SUPER: 423733 (SE +/- 66.67, N = 3)
RTX 2070: 356700 (SE +/- 556.78, N = 3)
RTX 2060 SUPER: 349600 (SE +/- 57.74, N = 3)
RTX 2060: 311800 (SE +/- 57.74, N = 3)

Hashcat

Benchmark: SHA-512

Hashcat 6.1.1, Benchmark: SHA-512 (H/s, More Is Better)
TITAN RTX: 2350366667 (SE +/- 3263093.28, N = 3)
RTX 3080: 2426033333 (SE +/- 2669165.50, N = 3)
RTX 3060 TI: 1406900000 (SE +/- 750555.35, N = 3)
RTX 2080 Ti: 2241700000 (SE +/- 1258305.74, N = 3)
RTX 2080 SUPER: 1734100000 (SE +/- 2042873.79, N = 3)
RTX 2080: 1559500000 (SE +/- 814452.78, N = 3)
RTX 2070 SUPER: 1426733333 (SE +/- 933333.33, N = 3)
RTX 2070: 1213266667 (SE +/- 845248.16, N = 3)
RTX 2060 SUPER: 1183433333 (SE +/- 1281058.59, N = 3)
RTX 2060: 1049366667 (SE +/- 635959.47, N = 3)

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

Betsy GPU Compressor 1.1 Beta, Codec: ETC1 - Quality: Highest (Seconds, Fewer Is Better)
TITAN RTX: 3.043 (SE +/- 0.051, N = 14)
RTX 3080: 2.701 (SE +/- 0.052, N = 14)
RTX 3060 TI: 4.358 (SE +/- 0.051, N = 14)
RTX 2080 Ti: 3.177 (SE +/- 0.049, N = 14)
RTX 2080 SUPER: 4.004 (SE +/- 0.060, N = 14)
RTX 2080: 4.375 (SE +/- 0.048, N = 14)
RTX 2070 SUPER: 4.824 (SE +/- 0.054, N = 13)
RTX 2070: 5.309 (SE +/- 0.052, N = 13)
RTX 2060 SUPER: 5.451 (SE +/- 0.052, N = 13)
RTX 2060: 6.189 (SE +/- 0.054, N = 13)
1. (CXX) g++ options: -O3 -O2 -lpthread -ldl

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL

PlaidML, FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 654.76 (SE +/- 2.07, N = 3)
RTX 3080: 730.76 (SE +/- 1.40, N = 3)
RTX 3060 TI: 454.66 (SE +/- 0.51, N = 3)
RTX 2080 Ti: 636.84 (SE +/- 0.39, N = 3)
RTX 2080 SUPER: 468.60 (SE +/- 0.40, N = 3)
RTX 2080: 442.78 (SE +/- 0.34, N = 3)
RTX 2070 SUPER: 422.86 (SE +/- 0.62, N = 3)
RTX 2070: 379.70 (SE +/- 0.70, N = 3)
RTX 2060 SUPER: 376.59 (SE +/- 0.68, N = 3)
RTX 2060: 320.95 (SE +/- 0.61, N = 3)

cl-mem

Benchmark: Read

cl-mem 2017-01-13, Benchmark: Read (GB/s, More Is Better)
TITAN RTX: 568.2 (SE +/- 0.06, N = 3)
RTX 3080: 674.2 (SE +/- 0.06, N = 3)
RTX 3060 TI: 392.8 (SE +/- 0.44, N = 3)
RTX 2080 Ti: 545.6 (SE +/- 0.17, N = 3)
RTX 2080 SUPER: 437.8 (SE +/- 0.00, N = 3)
RTX 2080: 397.1 (SE +/- 0.00, N = 3)
RTX 2070 SUPER: 397.1 (SE +/- 0.03, N = 3)
RTX 2070: 397.1 (SE +/- 0.03, N = 3)
RTX 2060 SUPER: 397.1 (SE +/- 0.00, N = 3)
RTX 2060: 297.5 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

Hashcat

Benchmark: MD5

Hashcat 6.1.1, Benchmark: MD5 (H/s, More Is Better)
TITAN RTX: 58104300000 (SE +/- 33241139.17, N = 3)
RTX 3080: 56429133333 (SE +/- 39942305.61, N = 3)
RTX 3060 TI: 32927600000 (SE +/- 23055223.56, N = 3)
RTX 2080 Ti: 55488333333 (SE +/- 63030953.60, N = 3)
RTX 2080 SUPER: 43136966667 (SE +/- 13808974.54, N = 3)
RTX 2080: 38493233333 (SE +/- 30944592.06, N = 3)
RTX 2070 SUPER: 35466033333 (SE +/- 41910871.04, N = 3)
RTX 2070: 29955566667 (SE +/- 9342079.24, N = 3)
RTX 2060 SUPER: 29272000000 (SE +/- 8911415.90, N = 3)
RTX 2060: 25930633333 (SE +/- 11770207.21, N = 3)

Hashcat

Benchmark: 7-Zip

Hashcat 6.1.1, Benchmark: 7-Zip (H/s, More Is Better)
TITAN RTX: 937233 (SE +/- 520.68, N = 3)
RTX 3080: 981300 (SE +/- 503.32, N = 3)
RTX 3060 TI: 581300 (SE +/- 3659.23, N = 3)
RTX 2080 Ti: 885300 (SE +/- 3113.41, N = 3)
RTX 2080 SUPER: 711967 (SE +/- 1963.27, N = 3)
RTX 2080: 637367 (SE +/- 1713.99, N = 3)
RTX 2070 SUPER: 593700 (SE +/- 435.89, N = 3)
RTX 2070: 500233 (SE +/- 726.48, N = 3)
RTX 2060 SUPER: 490533 (SE +/- 1192.10, N = 3)
RTX 2060: 439000 (SE +/- 1101.51, N = 3)

PlaidML

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

PlaidML, FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 3392.96 (SE +/- 16.11, N = 3)
RTX 3080: 3623.82 (SE +/- 12.23, N = 3)
RTX 3060 TI: 2428.74 (SE +/- 1.99, N = 3)
RTX 2080 Ti: 3316.31 (SE +/- 16.01, N = 3)
RTX 2080 SUPER: 2489.11 (SE +/- 1.35, N = 3)
RTX 2080: 2333.79 (SE +/- 3.97, N = 3)
RTX 2070 SUPER: 2300.01 (SE +/- 0.32, N = 3)
RTX 2070: 2004.51 (SE +/- 1.56, N = 3)
RTX 2060 SUPER: 2011.99 (SE +/- 2.83, N = 3)
RTX 2060: 1720.24 (SE +/- 1.92, N = 3)

Rodinia

Test: OpenCL Particle Filter

Rodinia 3.1, Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
TITAN RTX: 4.296 (SE +/- 0.016, N = 3)
RTX 3080: 4.290 (SE +/- 0.018, N = 3)
RTX 3060 TI: 7.040 (SE +/- 0.067, N = 3)
RTX 2080 Ti: 4.507 (SE +/- 0.076, N = 3)
RTX 2080 SUPER: 5.809 (SE +/- 0.010, N = 3)
RTX 2080: 6.288 (SE +/- 0.006, N = 3)
RTX 2070 SUPER: 6.900 (SE +/- 0.014, N = 3)
RTX 2070: 7.858 (SE +/- 0.017, N = 3)
RTX 2060 SUPER: 7.981 (SE +/- 0.027, N = 3)
RTX 2060: 8.960 (SE +/- 0.009, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

PlaidML

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

PlaidML, FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 228.78 (SE +/- 0.06, N = 3)
RTX 3080: 261.32 (SE +/- 0.01, N = 3)
RTX 3060 TI: 177.41 (SE +/- 0.26, N = 3)
RTX 2080 Ti: 213.54 (SE +/- 0.07, N = 3)
RTX 2080 SUPER: 160.48 (SE +/- 0.11, N = 3)
RTX 2080: 152.28 (SE +/- 0.14, N = 3)
RTX 2070 SUPER: 149.68 (SE +/- 0.07, N = 3)
RTX 2070: 148.04 (SE +/- 0.08, N = 3)
RTX 2060 SUPER: 147.13 (SE +/- 0.15, N = 3)
RTX 2060: 125.45 (SE +/- 0.07, N = 3)

RealSR-NCNN

Scale: 4x - TAA: No

RealSR-NCNN 20200818, Scale: 4x - TAA: No (Seconds, Fewer Is Better)
TITAN RTX: 7.360 (SE +/- 0.105, N = 3)
RTX 3080: 6.345 (SE +/- 0.093, N = 3)
RTX 3060 TI: 8.784 (SE +/- 0.056, N = 3)
RTX 2080 Ti: 7.605 (SE +/- 0.108, N = 3)
RTX 2080 SUPER: 8.842 (SE +/- 0.101, N = 3)
RTX 2080: 9.473 (SE +/- 0.131, N = 3)
RTX 2070 SUPER: 9.972 (SE +/- 0.110, N = 3)
RTX 2070: 11.187 (SE +/- 0.096, N = 3)
RTX 2060 SUPER: 11.540 (SE +/- 0.111, N = 3)
RTX 2060: 12.707 (SE +/- 0.091, N = 3)

LeelaChessZero

Backend: OpenCL

LeelaChessZero 0.26, Backend: OpenCL (Nodes Per Second, More Is Better)
TITAN RTX: 17009 (SE +/- 110.73, N = 3)
RTX 3080: 18668 (SE +/- 122.65, N = 3)
RTX 3060 TI: 16799 (SE +/- 26.30, N = 3)
RTX 2080 Ti: 16952 (SE +/- 54.85, N = 3)
RTX 2080 SUPER: 14634 (SE +/- 48.26, N = 3)
RTX 2080: 13576 (SE +/- 51.64, N = 3)
RTX 2070 SUPER: 12841 (SE +/- 49.97, N = 3)
RTX 2070: 11495 (SE +/- 54.87, N = 3)
RTX 2060 SUPER: 11399 (SE +/- 69.95, N = 3)
RTX 2060: 9752 (SE +/- 66.84, N = 3)
1. (CXX) g++ options: -flto -pthread

PlaidML

FP16: No - Mode: Training - Network: VGG19 - Device: OpenCL

PlaidML, FP16: No - Mode: Training - Network: VGG19 - Device: OpenCL (FPS, More Is Better; no RTX 2070 result is listed)
TITAN RTX: 37.56
RTX 3080: 40.88
RTX 3060 TI: 29.38
RTX 2080 Ti: 36.90
RTX 2080 SUPER: 32.29
RTX 2080: 28.32
RTX 2070 SUPER: 27.78
RTX 2060 SUPER: 24.58
RTX 2060: 22.04
Standard errors are listed for only four of the nine results, and the export does not indicate which GPUs they belong to: SE +/- 0.08, N = 2; SE +/- 0.15, N = 2; SE +/- 0.05, N = 2; SE +/- 0.04, N = 2

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1, OpenCL Device: GPU (Samples/sec, More Is Better)
TITAN RTX: 460475602.6 (SE +/- 2487384.16, N = 3)
RTX 3080: 421918457.5 (SE +/- 2886689.51, N = 3)
RTX 3060 TI: 280941400.2 (SE +/- 805777.35, N = 3)
RTX 2080 Ti: 447272306.0 (SE +/- 590928.34, N = 3)
RTX 2080 SUPER: 368037096.6 (SE +/- 774665.97, N = 3)
RTX 2080: 343080891.1 (SE +/- 1494897.42, N = 3)
RTX 2070 SUPER: 321637774.3 (SE +/- 516930.54, N = 3)
RTX 2070: 283803060.1 (SE +/- 776602.98, N = 3)
RTX 2060 SUPER: 278640038.1 (SE +/- 1645708.79, N = 3)
RTX 2060: 253858256.7 (SE +/- 1171800.01, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

VkFFT

VkFFT 1.1.1 (Benchmark Score, More Is Better)
TITAN RTX: 36216 (SE +/- 413.34, N = 3)
RTX 3080: 40997 (SE +/- 284.10, N = 3)
RTX 3060 TI: 32556 (SE +/- 86.88, N = 3)
RTX 2080 Ti: 34110 (SE +/- 78.30, N = 3)
RTX 2080 SUPER: 30475 (SE +/- 51.40, N = 3)
RTX 2080: 28745 (SE +/- 95.96, N = 3)
RTX 2070 SUPER: 29427 (SE +/- 213.44, N = 3)
RTX 2070: 29546 (SE +/- 149.21, N = 3)
RTX 2060 SUPER: 29585 (SE +/- 257.75, N = 3)
RTX 2060: 22903 (SE +/- 44.06, N = 3)
1. (CXX) g++ options: -O3 -pthread

PlaidML

FP16: No - Mode: Training - Network: VGG16 - Device: OpenCL

PlaidML, FP16: No - Mode: Training - Network: VGG16 - Device: OpenCL (FPS, More Is Better)
TITAN RTX: 44.51
RTX 3080: 47.50
RTX 3060 TI: 34.98
RTX 2080 Ti: 44.02
RTX 2080 SUPER: 38.70
RTX 2080: 34.13
RTX 2070 SUPER: 33.75
RTX 2070: 30.27
RTX 2060 SUPER: 30.00
RTX 2060: 27.12
Standard errors are listed for only four of the ten results, and the export does not indicate which GPUs they belong to: SE +/- 0.36, N = 2; SE +/- 0.18, N = 2; SE +/- 0.33, N = 2; SE +/- 0.20, N = 2

FAHBench

FAHBench 2.3.2 (Ns Per Day, More Is Better)
TITAN RTX: 307.03 (SE +/- 0.07, N = 3)
RTX 3080: 320.66 (SE +/- 0.30, N = 3)
RTX 3060 TI: 235.72 (SE +/- 0.14, N = 3)
RTX 2080 Ti: 304.04 (SE +/- 0.05, N = 3)
RTX 2080 SUPER: 260.90 (SE +/- 0.20, N = 3)
RTX 2080: 244.42 (SE +/- 0.18, N = 3)
RTX 2070 SUPER: 229.76 (SE +/- 0.16, N = 3)
RTX 2070: 205.23 (SE +/- 0.10, N = 3)
RTX 2060 SUPER: 207.08 (SE +/- 0.17, N = 3)
RTX 2060: 183.29 (SE +/- 0.18, N = 3)

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.org - ArrayFire 3.7 - Test: Conjugate Gradient OpenCL
ms, Fewer Is Better

GPU               ms       SE +/-    N
TITAN RTX         1.637    0.010     3
RTX 3080          1.557    0.004     3
RTX 3060 TI       2.092    0.002     3
RTX 2080 Ti       1.670    0.010     3
RTX 2080 SUPER    1.893    0.001     3
RTX 2080          2.072    0.003     3
RTX 2070 SUPER    2.060    0.001     3
RTX 2070          2.086    0.002     3
RTX 2060 SUPER    2.073    0.002     3
RTX 2060          2.650    0.003     3

1. (CXX) g++ options: -rdynamic

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.org - Waifu2x-NCNN Vulkan 20200818 - Scale: 2x - Denoise: 3 - TAA: Yes
Seconds, Fewer Is Better

GPU               Seconds    SE +/-    N
TITAN RTX         3.827      0.056     3
RTX 3080          3.512      0.047     5
RTX 3060 TI       4.371      0.008     3
RTX 2080 Ti       3.856      0.057     3
RTX 2080 SUPER    4.358      0.060     3
RTX 2080          4.605      0.053     3
RTX 2070 SUPER    4.761      0.067     4
RTX 2070          5.314      0.049     3
RTX 2060 SUPER    5.313      0.055     3
RTX 2060          5.779      0.064     3

cl-mem

Benchmark: Copy

OpenBenchmarking.org - cl-mem 2017-01-13 - Benchmark: Copy
GB/s, More Is Better

GPU               GB/s     SE +/-    N
TITAN RTX         320.8    0.03      3
RTX 3080          354.7    0.03      3
RTX 3060 TI       294.1    0.21      3
RTX 2080 Ti       324.0    0.15      3
RTX 2080 SUPER    302.0    0.09      3
RTX 2080          290.5    0.03      3
RTX 2070 SUPER    292.3    0.03      3
RTX 2070          285.7    0.06      3
RTX 2060 SUPER    288.3    0.06      3
RTX 2060          239.5    0.03      3

1. (CC) gcc options: -O2 -flto -lOpenCL

ViennaCL

OpenCL LU Factorization

OpenBenchmarking.org - ViennaCL 1.4.2 - OpenCL LU Factorization
GFLOPS, More Is Better

GPU               GFLOPS    SE +/-    N
TITAN RTX         81.32     0.19      3
RTX 3080          79.32     0.10      3
RTX 3060 TI       75.79     0.09      3
RTX 2080 Ti       78.32     0.20      3
RTX 2080 SUPER    76.69     0.12      3
RTX 2080          77.17     0.16      3
RTX 2070 SUPER    75.80     0.40      3
RTX 2070          74.18     0.18      3
RTX 2060 SUPER    74.09     0.40      3
RTX 2060          74.51     0.20      3

1. (CXX) g++ options: -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: blazeface
ms, Fewer Is Better

GPU               ms      SE +/-    N     MIN     MAX
TITAN RTX         1.83    0.02      3     1.78    2.03
RTX 3080          1.91    0.04      3     1.82    2.14
RTX 3060 TI       1.86    0.02      4     1.78    2.03
RTX 2080 Ti       1.95    0.04      3     1.85    2.39
RTX 2080 SUPER    1.88    0.07      3     1.77    2.21
RTX 2080          1.86    0.02      5     1.81    2.28
RTX 2070 SUPER    1.88    0.02      15    1.78    2.2
RTX 2070          1.87    0.05      3     1.8     2.15
RTX 2060 SUPER    1.93    0.03      4     1.85    2.11
RTX 2060          1.80    0.03      3     1.79    1.96

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: yolov4-tiny
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         21.88    0.66      3     20.33    23.41
RTX 3080          22.15    0.70      3     20.53    30.78
RTX 3060 TI       21.35    0.64      4     20.24    29.48
RTX 2080 Ti       21.45    0.60      3     20.52    33.08
RTX 2080 SUPER    22.13    0.60      3     20.69    28.53
RTX 2080          21.81    0.41      5     20.55    24.56
RTX 2070 SUPER    21.79    0.24      15    20.42    29.15
RTX 2070          21.41    0.58      3     20.58    23.53
RTX 2060 SUPER    22.80    0.07      4     22.31    25.04
RTX 2060          21.41    0.56      3     20.54    32.23

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better

GPU               ms      SE +/-    N     MIN     MAX
TITAN RTX         4.49    0.02      3     4.26    4.85
RTX 3080          4.56    0.05      3     4.3     4.85
RTX 3060 TI       4.54    0.05      4     4.25    5.73
RTX 2080 Ti       4.74    0.21      3     4.31    5.6
RTX 2080 SUPER    4.65    0.23      3     4.19    5.42
RTX 2080          4.50    0.04      5     4.19    13.41
RTX 2070 SUPER    4.64    0.07      15    4.23    5.42
RTX 2070          4.60    0.02      3     4.35    13.48
RTX 2060 SUPER    4.67    0.07      4     4.31    5.1
RTX 2060          4.44    0.03      3     4.21    4.75

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better

GPU               ms      SE +/-    N     MIN     MAX
TITAN RTX         4.20    0.00      3     4.14    4.93
RTX 3080          4.34    0.09      3     4.17    4.73
RTX 3060 TI       4.25    0.06      4     4.02    5.79
RTX 2080 Ti       4.43    0.11      3     4.13    11.06
RTX 2080 SUPER    4.30    0.10      3     4.12    4.69
RTX 2080          4.25    0.05      5     4.07    4.62
RTX 2070 SUPER    4.30    0.04      15    4.06    19.54
RTX 2070          4.19    0.01      3     4.14    4.4
RTX 2060 SUPER    4.34    0.08      4     4.12    4.79
RTX 2060          4.15    0.03      3     4.06    4.44

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: squeezenet_ssd
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         14.35    0.19      3     13.84    22.5
RTX 3080          14.50    0.09      3     14.11    14.97
RTX 3060 TI       15.11    0.73      4     13.72    369.11
RTX 2080 Ti       14.36    0.03      3     13.91    27.4
RTX 2080 SUPER    14.47    0.02      3     14.11    24.43
RTX 2080          14.45    0.24      4     13.6     27.41
RTX 2070 SUPER    14.55    0.06      15    13.96    25.49
RTX 2070          14.51    0.14      2     14.12    15.9
RTX 2060 SUPER    14.63    0.12      4     14.15    16.53
RTX 2060          14.32    0.05      3     14.04    14.98

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: mobilenet
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         12.77    0.16      3     12.01    24.52
RTX 3080          12.78    0.12      3     12.35    20.47
RTX 3060 TI       12.54    0.17      4     11.94    15.69
RTX 2080 Ti       12.64    0.18      3     12.22    14.36
RTX 2080 SUPER    12.72    0.11      3     12.26    13.5
RTX 2080          12.69    0.16      5     11.97    24.76
RTX 2070 SUPER    12.77    0.14      15    11.98    34.45
RTX 2070          12.66    0.17      3     12.23    24.28
RTX 2060 SUPER    13.04    0.17      4     12.34    24
RTX 2060          12.42    0.14      3     11.98    24.63

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: alexnet
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         11.25    0.05      3     11       11.61
RTX 3080          11.71    0.12      3     11.28    12.12
RTX 3060 TI       11.41    0.15      4     10.77    22.45
RTX 2080 Ti       11.44    0.04      3     11.09    11.74
RTX 2080 SUPER    11.22    0.16      3     10.85    11.85
RTX 2080          11.37    0.06      5     11.04    17.87
RTX 2070 SUPER    11.36    0.06      15    10.75    15.63
RTX 2070          11.74    0.08      3     11.39    12.18
RTX 2060 SUPER    11.53    0.04      4     11.27    18.29
RTX 2060          11.61    0.22      3     11.08    18.93

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: resnet50
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         24.95    0.37      3     23.91    36.02
RTX 3080          25.34    0.56      3     24.02    26.96
RTX 3060 TI       24.69    0.35      4     23.66    26.48
RTX 2080 Ti       24.69    0.33      3     23.98    37.45
RTX 2080 SUPER    24.73    0.40      3     23.79    35.29
RTX 2080          24.62    0.23      5     23.99    26.48
RTX 2070 SUPER    25.25    0.18      15    23.93    36.67
RTX 2070          25.05    0.54      3     24.29    41.55
RTX 2060 SUPER    25.71    0.09      4     24.88    31.53
RTX 2060          24.58    0.15      3     23.83    37.73

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: regnety_400m
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         18.21    0.09      3     17.88    20.03
RTX 3080          18.45    0.35      3     17.4     28.56
RTX 3060 TI       18.57    0.28      4     17.78    20.78
RTX 2080 Ti       18.32    0.20      3     17.72    19.14
RTX 2080 SUPER    18.65    0.19      3     18.24    19.54
RTX 2080          18.77    0.09      5     18.01    30.34
RTX 2070 SUPER    18.58    0.07      15    17.91    28.99
RTX 2070          18.68    0.06      3     18.38    19.17
RTX 2060 SUPER    18.49    0.15      4     18.01    28.46
RTX 2060          18.29    0.08      3     17.99    28.28

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: resnet18
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         14.36    0.25      3     13.96    17.74
RTX 3080          14.74    0.25      3     14.13    15.8
RTX 3060 TI       14.51    0.21      4     13.97    25.49
RTX 2080 Ti       14.65    0.19      3     14.13    15.56
RTX 2080 SUPER    14.36    0.14      3     14.01    15.12
RTX 2080          14.58    0.12      5     14.21    16.27
RTX 2070 SUPER    14.53    0.07      15    14.06    27.16
RTX 2070          14.55    0.19      3     14.18    15.25
RTX 2060 SUPER    14.77    0.19      4     14.08    16.79
RTX 2060          14.47    0.26      3     14.01    23.82

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: shufflenet-v2
ms, Fewer Is Better

GPU               ms      SE +/-    N     MIN     MAX
TITAN RTX         4.43    0.03      3     4.32    5.06
RTX 3080          4.52    0.02      3     4.42    13.63
RTX 3060 TI       4.47    0.03      3     4.34    5.99
RTX 2080 Ti       4.48    0.02      3     4.38    4.9
RTX 2080 SUPER    4.50    0.07      3     4.35    5.04
RTX 2080          4.48    0.01      5     4.36    14.35
RTX 2070 SUPER    4.46    0.01      13    4.31    4.82
RTX 2070          4.45    0.03      3     4.34    4.91
RTX 2060 SUPER    4.47    0.01      4     4.37    4.78
RTX 2060          4.42    0.02      3     4.31    4.8

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: vgg16
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         56.52    0.17      3     55.22    72.77
RTX 3080          56.96    0.09      3     56.04    59.71
RTX 3060 TI       57.18    0.43      4     54.93    68.94
RTX 2080 Ti       56.69    0.27      3     55.51    58.35
RTX 2080 SUPER    57.08    0.37      3     55.71    70.92
RTX 2080          57.25    0.17      5     55.82    67.69
RTX 2070 SUPER    57.13    0.13      15    54.93    68.8
RTX 2070          57.13    0.13      3     56.01    66.17
RTX 2060 SUPER    57.22    0.14      4     55.71    64.21
RTX 2060          56.98    0.31      3     55.41    58.77

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: googlenet
ms, Fewer Is Better

GPU               ms       SE +/-    N     MIN      MAX
TITAN RTX         13.32    0.68      2     12.38    25.82
RTX 3080          13.95    0.37      3     12.96    24.07
RTX 3060 TI       13.41    0.33      4     12.46    28.14
RTX 2080 Ti       13.96    0.49      3     12.74    15.74
RTX 2080 SUPER    13.12    0.39      3     12.41    14.53
RTX 2080          13.56    0.35      5     12.71    20.62
RTX 2070 SUPER    13.48    0.21      14    12.43    26.89
RTX 2070          13.60    0.27      3     12.94    24.05
RTX 2060 SUPER    14.18    0.43      4     12.62    15.92
RTX 2060          13.43    0.55      3     12.61    15.67

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: efficientnet-b0
ms, Fewer Is Better

GPU               ms      SE +/-    N     MIN     MAX
TITAN RTX         6.03    0.57      3     5.28    339.41
RTX 3080          5.61    0.06      3     5.45    5.87
RTX 3060 TI       5.55    0.05      4     5.34    7.32
RTX 2080 Ti       6.02    0.26      3     5.37    11.96
RTX 2080 SUPER    5.68    0.21      3     5.29    17.94
RTX 2080          5.56    0.04      5     5.37    6.08
RTX 2070 SUPER    5.60    0.06      15    5.28    16.37
RTX 2070          5.59    0.02      3     5.42    5.85
RTX 2060 SUPER    5.70    0.06      4     5.43    17.45
RTX 2060          5.48    0.03      3     5.32    5.98

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.org - NCNN 20201218 - Target: Vulkan GPU - Model: mnasnet
ms, Fewer Is Better

GPU               ms      SE +/-    N     MIN     MAX
TITAN RTX         4.02    0.02      3     3.88    4.28
RTX 3080          4.20    0.13      3     3.91    10.98
RTX 3060 TI       4.11    0.06      4     3.87    5.49
RTX 2080 Ti       4.56    0.26      3     3.93    5.1
RTX 2080 SUPER    4.21    0.21      3     3.86    4.92
RTX 2080          4.10    0.06      5     3.82    4.59
RTX 2070 SUPER    4.22    0.07      15    3.86    5.14
RTX 2070          4.15    0.05      3     4       16.56
RTX 2060 SUPER    4.33    0.10      3     4.04    4.77
RTX 2060          4.01    0.04      3     3.83    4.42

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LuxCoreRender OpenCL

Scene: Rainbow Colors and Prism

OpenBenchmarking.org - LuxCoreRender OpenCL 2.3 - Scene: Rainbow Colors and Prism
M samples/sec, More Is Better

GPU               M samples/sec    SE +/-    N     MIN      MAX
TITAN RTX         13.56            0.02      3     12.87    14.06
RTX 3080          21.91            0.65      12    12.02    23.73
RTX 3060 TI       17.11            0.48      12    8.56     18.35
RTX 2080 Ti       12.14            0.27      12    6.39     12.87
RTX 2080 SUPER    9.19             0.02      3     7.68     9.63
RTX 2080          9.27             0.01      3     8.34     9.64
RTX 2070 SUPER    10.04            0.20      12    5.24     10.68
RTX 2070          10.94            0.02      3     9.85     11.46
RTX 2060 SUPER    11.41            0.01      3     9.87     11.85
RTX 2060          7.98             0.00      3     7.02     8.25


Phoronix Test Suite v10.8.5