NVIDIA TITAN RTX On Linux

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA TITAN RTX 24GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2010017-PTS-NVIDIATI83.

NVIDIA TITAN RTX On LinuxProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution12345AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBNVIDIA TITAN RTX 24GB (390/405MHz)NVIDIA TU102 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-48-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 2.0 AMD-APP (3182.0) + OpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160NVIDIA TITAN RTX 24GB (1350/7000MHz)NVIDIA TITAN RTX 24GB (390/405MHz)NVIDIA TITAN RTX 24GB (1350/7000MHz)NVIDIA TITAN RTX 24GB (390/405MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013OpenCL Details- GPU Compute Cores: 4608Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

NVIDIA TITAN RTX On Linuxrealsr-ncnn: 4x - Norealsr-ncnn: 4x - Yesvkfft: hashcat: MD5hashcat: SHA1hashcat: 7-Ziphashcat: SHA-512hashcat: TrueCrypt RIPEMD160 + XTSredshift: caffe: AlexNet - NVIDIA CUDA - 100caffe: AlexNet - NVIDIA CUDA - 200caffe: AlexNet - NVIDIA CUDA - 1000caffe: GoogleNet - NVIDIA CUDA - 100caffe: GoogleNet - NVIDIA CUDA - 200caffe: GoogleNet - NVIDIA CUDA - 1000ncnn: Vulkan GPU - squeezenetncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tiny123457.83142.8844013857420166667183076000009165332541833333672533235922.1231805.538921.312855.005680.1628171.03.864.141.381.641.281.442.60.583.025.081.321.633.067.347.97643.0854004757241333333182597666679172002534366667670800235924.7031806.408920.172875.905678.6028230.83.864.131.391.641.291.452.590.572.975.091.321.573.057.177.85842.6554001957365533333183223000009170002541733333672600235923.2381810.298900.182860.435693.3428178.63.904.141.381.641.281.452.600.582.985.081.321.623.057.157.84942.8164004857329633333182854333339151672540333333671700235923.6671812.078934.212861.315657.5128240.13.904.131.381.641.281.452.600.592.975.091.311.583.067.177.82542.6064006157416166667183244666679199332541766667672200235931.0241811.358986.352878.315693.6628365.73.884.161.401.641.281.452.610.582.975.071.311.603.067.15OpenBenchmarking.org

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: No12345246810SE +/- 0.063, N = 3SE +/- 0.063, N = 3SE +/- 0.068, N = 3SE +/- 0.073, N = 3SE +/- 0.087, N = 37.8317.9767.8587.8497.825

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: Yes123451020304050SE +/- 0.16, N = 3SE +/- 0.18, N = 3SE +/- 0.24, N = 3SE +/- 0.18, N = 3SE +/- 0.23, N = 342.8843.0942.6642.8242.61

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 2020-09-29123459K18K27K36K45KSE +/- 35.69, N = 3SE +/- 17.33, N = 3SE +/- 11.20, N = 3SE +/- 17.68, N = 3SE +/- 7.23, N = 34013840047400194004840061

Hashcat

Benchmark: MD5

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: MD51234512000M24000M36000M48000M60000MSE +/- 31137883.32, N = 3SE +/- 70861139.64, N = 3SE +/- 47871262.55, N = 3SE +/- 85923577.15, N = 3SE +/- 76423651.08, N = 35742016666757241333333573655333335732963333357416166667

Hashcat

Benchmark: SHA1

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA1123454000M8000M12000M16000M20000MSE +/- 14435719.59, N = 3SE +/- 7574592.03, N = 3SE +/- 14301048.91, N = 3SE +/- 14493025.14, N = 3SE +/- 10927844.15, N = 31830760000018259766667183223000001828543333318324466667

Hashcat

Benchmark: 7-Zip

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: 7-Zip12345200K400K600K800K1000KSE +/- 290.59, N = 3SE +/- 4404.92, N = 3SE +/- 2150.19, N = 3SE +/- 218.58, N = 3SE +/- 2945.24, N = 3916533917200917000915167919933

Hashcat

Benchmark: SHA-512

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA-51212345500M1000M1500M2000M2500MSE +/- 1348249.89, N = 3SE +/- 2493547.23, N = 3SE +/- 2233333.33, N = 3SE +/- 1920358.76, N = 3SE +/- 2162046.36, N = 325418333332534366667254173333325403333332541766667

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: TrueCrypt RIPEMD160 + XTS12345140K280K420K560K700KSE +/- 66.67, N = 3SE +/- 300.00, N = 3SE +/- 57.74, N = 3SE +/- 400.00, N = 3SE +/- 702.38, N = 3672533670800672600671700672200

RedShift Demo

OpenBenchmarking.orgSeconds, Fewer Is BetterRedShift Demo 3.01234550100150200250SE +/- 0.33, N = 3235235235235235

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100123452004006008001000SE +/- 3.49, N = 3SE +/- 6.92, N = 3SE +/- 5.77, N = 3SE +/- 6.30, N = 3SE +/- 6.25, N = 3922.12924.70923.24923.67931.021. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 20012345400800120016002000SE +/- 4.34, N = 3SE +/- 3.74, N = 3SE +/- 7.02, N = 3SE +/- 1.42, N = 3SE +/- 4.80, N = 31805.531806.401810.291812.071811.351. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000123452K4K6K8K10KSE +/- 3.89, N = 3SE +/- 2.81, N = 3SE +/- 10.24, N = 3SE +/- 25.22, N = 3SE +/- 66.63, N = 38921.318920.178900.188934.218986.351. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100123456001200180024003000SE +/- 2.35, N = 3SE +/- 6.81, N = 3SE +/- 13.01, N = 3SE +/- 19.64, N = 3SE +/- 16.43, N = 32855.002875.902860.432861.312878.311. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 2001234512002400360048006000SE +/- 9.16, N = 3SE +/- 5.02, N = 3SE +/- 23.06, N = 3SE +/- 15.61, N = 3SE +/- 9.15, N = 35680.165678.605693.345657.515693.661. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000123456K12K18K24K30KSE +/- 93.92, N = 3SE +/- 11.37, N = 3SE +/- 39.43, N = 3SE +/- 49.35, N = 3SE +/- 60.61, N = 328171.028230.828178.628240.128365.71. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

NCNN

Target: Vulkan GPU - Model: squeezenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: squeezenet123450.87751.7552.63253.514.3875SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 33.863.863.903.903.88MIN: 3.74 / MAX: 5.31MIN: 3.71 / MAX: 5.2MIN: 3.74 / MAX: 4.41MIN: 3.77 / MAX: 7.98MIN: 3.76 / MAX: 51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mobilenet123450.9361.8722.8083.7444.68SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 34.144.134.144.134.16MIN: 4.06 / MAX: 7.21MIN: 4.03 / MAX: 8.16MIN: 4.08 / MAX: 5.99MIN: 4.04 / MAX: 7.11MIN: 4.09 / MAX: 15.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2123450.3150.630.9451.261.575SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 31.381.391.381.381.40MIN: 1.36 / MAX: 1.43MIN: 1.37 / MAX: 2.58MIN: 1.37 / MAX: 1.77MIN: 1.37 / MAX: 2.56MIN: 1.37 / MAX: 9.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3123450.3690.7381.1071.4761.845SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.641.641.641.641.64MIN: 1.62 / MAX: 1.67MIN: 1.63 / MAX: 1.71MIN: 1.62 / MAX: 1.67MIN: 1.61 / MAX: 2.74MIN: 1.62 / MAX: 2.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: shufflenet-v2123450.29030.58060.87091.16121.4515SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.281.291.281.281.28MIN: 1.27 / MAX: 1.34MIN: 1.27 / MAX: 1.5MIN: 1.27 / MAX: 1.35MIN: 1.27 / MAX: 1.47MIN: 1.27 / MAX: 1.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mnasnet123450.32630.65260.97891.30521.6315SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 31.441.451.451.451.45MIN: 1.43 / MAX: 1.49MIN: 1.43 / MAX: 6.87MIN: 1.43 / MAX: 2.12MIN: 1.43 / MAX: 2.58MIN: 1.43 / MAX: 2.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: efficientnet-b0123450.58731.17461.76192.34922.9365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.602.592.602.602.61MIN: 2.59 / MAX: 2.89MIN: 2.58 / MAX: 2.79MIN: 2.59 / MAX: 2.87MIN: 2.59 / MAX: 3.69MIN: 2.59 / MAX: 3.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: blazeface123450.13280.26560.39840.53120.664SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 30.580.570.580.590.58MIN: 0.57 / MAX: 0.71MAX: 0.6MIN: 0.56 / MAX: 0.62MIN: 0.57 / MAX: 1.82MIN: 0.57 / MAX: 1.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: googlenet123450.67951.3592.03852.7183.3975SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33.022.972.982.972.97MIN: 2.96 / MAX: 7.02MIN: 2.95 / MAX: 3.79MIN: 2.95 / MAX: 4.92MIN: 2.95 / MAX: 4.06MIN: 2.95 / MAX: 3.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: vgg16123451.14532.29063.43594.58125.7265SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 35.085.095.085.095.07MIN: 4.92 / MAX: 18.61MIN: 4.93 / MAX: 9.69MIN: 4.92 / MAX: 18.32MIN: 4.93 / MAX: 18.51MIN: 4.92 / MAX: 18.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet18123450.2970.5940.8911.1881.485SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.321.321.321.311.31MIN: 1.3 / MAX: 3.18MIN: 1.3 / MAX: 1.45MIN: 1.31 / MAX: 1.5MIN: 1.3 / MAX: 1.97MIN: 1.3 / MAX: 2.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: alexnet123450.36680.73361.10041.46721.834SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 2SE +/- 0.01, N = 3SE +/- 0.03, N = 31.631.571.621.581.60MIN: 1.54 / MAX: 4.84MIN: 1.54 / MAX: 1.82MIN: 1.55 / MAX: 1.86MIN: 1.54 / MAX: 6.38MIN: 1.55 / MAX: 2.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet50123450.68851.3772.06552.7543.4425SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 33.063.053.053.063.06MIN: 3.03 / MAX: 4.65MIN: 3.03 / MAX: 3.55MIN: 3.02 / MAX: 4.01MIN: 3.04 / MAX: 4.53MIN: 3.02 / MAX: 7.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: yolov4-tiny12345246810SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 37.347.177.157.177.15MIN: 6.93 / MAX: 18.18MIN: 6.94 / MAX: 13.24MIN: 6.93 / MAX: 13.66MIN: 6.93 / MAX: 8.75MIN: 6.92 / MAX: 9.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread


Phoronix Test Suite v10.8.4