NVIDIA TITAN RTX On Linux AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA TITAN RTX 24GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2010017-PTS-NVIDIATI83&grr&sor .
NVIDIA TITAN RTX On Linux Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 1 2 3 4 5 AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 + 2000GB NVIDIA TITAN RTX 24GB (390/405MHz) NVIDIA TU102 HD Audio DELL P2415Q Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.4.0-48-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 NVIDIA 450.66 4.6.0 OpenCL 2.0 AMD-APP (3182.0) + OpenCL 1.2 CUDA 11.0.228 1.2.133 GCC 9.3.0 + CUDA 11.0 ext4 3840x2160 NVIDIA TITAN RTX 24GB (1350/7000MHz) NVIDIA TITAN RTX 24GB (390/405MHz) NVIDIA TITAN RTX 24GB (1350/7000MHz) NVIDIA TITAN RTX 24GB (390/405MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013 OpenCL Details - GPU Compute Cores: 4608 Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
NVIDIA TITAN RTX On Linux redshift: realsr-ncnn: 4x - Yes caffe: GoogleNet - NVIDIA CUDA - 1000 vkfft: ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU - squeezenet caffe: AlexNet - NVIDIA CUDA - 1000 realsr-ncnn: 4x - No hashcat: SHA-512 hashcat: SHA1 caffe: GoogleNet - NVIDIA CUDA - 200 hashcat: MD5 hashcat: TrueCrypt RIPEMD160 + XTS caffe: GoogleNet - NVIDIA CUDA - 100 hashcat: 7-Zip caffe: AlexNet - NVIDIA CUDA - 200 caffe: AlexNet - NVIDIA CUDA - 100 1 2 3 4 5 235 42.884 28171.0 40138 7.34 3.06 1.63 1.32 5.08 3.02 0.58 2.6 1.44 1.28 1.64 1.38 4.14 3.86 8921.31 7.831 2541833333 18307600000 5680.16 57420166667 672533 2855.00 916533 1805.53 922.123 235 43.085 28230.8 40047 7.17 3.05 1.57 1.32 5.09 2.97 0.57 2.59 1.45 1.29 1.64 1.39 4.13 3.86 8920.17 7.976 2534366667 18259766667 5678.60 57241333333 670800 2875.90 917200 1806.40 924.703 235 42.655 28178.6 40019 7.15 3.05 1.62 1.32 5.08 2.98 0.58 2.60 1.45 1.28 1.64 1.38 4.14 3.90 8900.18 7.858 2541733333 18322300000 5693.34 57365533333 672600 2860.43 917000 1810.29 923.238 235 42.816 28240.1 40048 7.17 3.06 1.58 1.31 5.09 2.97 0.59 2.60 1.45 1.28 1.64 1.38 4.13 3.90 8934.21 7.849 2540333333 18285433333 5657.51 57329633333 671700 2861.31 915167 1812.07 923.667 235 42.606 28365.7 40061 7.15 3.06 1.60 1.31 5.07 2.97 0.58 2.61 1.45 1.28 1.64 1.40 4.16 3.88 8986.35 7.825 2541766667 18324466667 5693.66 57416166667 672200 2878.31 919933 1811.35 931.024 OpenBenchmarking.org
RedShift Demo OpenBenchmarking.org Seconds, Fewer Is Better RedShift Demo 3.0 1 2 3 4 5 50 100 150 200 250 SE +/- 0.33, N = 3 235 235 235 235 235
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes 5 3 4 1 2 10 20 30 40 50 SE +/- 0.23, N = 3 SE +/- 0.24, N = 3 SE +/- 0.18, N = 3 SE +/- 0.16, N = 3 SE +/- 0.18, N = 3 42.61 42.66 42.82 42.88 43.09
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 1 3 2 4 5 6K 12K 18K 24K 30K SE +/- 93.92, N = 3 SE +/- 39.43, N = 3 SE +/- 11.37, N = 3 SE +/- 49.35, N = 3 SE +/- 60.61, N = 3 28171.0 28178.6 28230.8 28240.1 28365.7 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 2020-09-29 1 5 4 2 3 9K 18K 27K 36K 45K SE +/- 35.69, N = 3 SE +/- 7.23, N = 3 SE +/- 17.68, N = 3 SE +/- 17.33, N = 3 SE +/- 11.20, N = 3 40138 40061 40048 40047 40019
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 3 5 2 4 1 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.12, N = 3 7.15 7.15 7.17 7.17 7.34 MIN: 6.93 / MAX: 13.66 MIN: 6.92 / MAX: 9.07 MIN: 6.94 / MAX: 13.24 MIN: 6.93 / MAX: 8.75 MIN: 6.93 / MAX: 18.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 2 3 1 4 5 0.6885 1.377 2.0655 2.754 3.4425 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 3.05 3.05 3.06 3.06 3.06 MIN: 3.03 / MAX: 3.55 MIN: 3.02 / MAX: 4.01 MIN: 3.03 / MAX: 4.65 MIN: 3.04 / MAX: 4.53 MIN: 3.02 / MAX: 7.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet 2 4 5 3 1 0.3668 0.7336 1.1004 1.4672 1.834 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 2 SE +/- 0.02, N = 3 1.57 1.58 1.60 1.62 1.63 MIN: 1.54 / MAX: 1.82 MIN: 1.54 / MAX: 6.38 MIN: 1.55 / MAX: 2.1 MIN: 1.55 / MAX: 1.86 MIN: 1.54 / MAX: 4.84 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 4 5 1 2 3 0.297 0.594 0.891 1.188 1.485 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.31 1.31 1.32 1.32 1.32 MIN: 1.3 / MAX: 1.97 MIN: 1.3 / MAX: 2.42 MIN: 1.3 / MAX: 3.18 MIN: 1.3 / MAX: 1.45 MIN: 1.31 / MAX: 1.5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 5 1 3 2 4 1.1453 2.2906 3.4359 4.5812 5.7265 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 5.07 5.08 5.08 5.09 5.09 MIN: 4.92 / MAX: 18.36 MIN: 4.92 / MAX: 18.61 MIN: 4.92 / MAX: 18.32 MIN: 4.93 / MAX: 9.69 MIN: 4.93 / MAX: 18.51 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 2 4 5 3 1 0.6795 1.359 2.0385 2.718 3.3975 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 2.97 2.97 2.97 2.98 3.02 MIN: 2.95 / MAX: 3.79 MIN: 2.95 / MAX: 4.06 MIN: 2.95 / MAX: 3.59 MIN: 2.95 / MAX: 4.92 MIN: 2.96 / MAX: 7.02 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface 2 1 3 5 4 0.1328 0.2656 0.3984 0.5312 0.664 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 0.57 0.58 0.58 0.58 0.59 MAX: 0.6 MIN: 0.57 / MAX: 0.71 MIN: 0.56 / MAX: 0.62 MIN: 0.57 / MAX: 1.44 MIN: 0.57 / MAX: 1.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 2 1 3 4 5 0.5873 1.1746 1.7619 2.3492 2.9365 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 2.59 2.60 2.60 2.60 2.61 MIN: 2.58 / MAX: 2.79 MIN: 2.59 / MAX: 2.89 MIN: 2.59 / MAX: 2.87 MIN: 2.59 / MAX: 3.69 MIN: 2.59 / MAX: 3.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet 1 2 3 4 5 0.3263 0.6526 0.9789 1.3052 1.6315 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 1.44 1.45 1.45 1.45 1.45 MIN: 1.43 / MAX: 1.49 MIN: 1.43 / MAX: 6.87 MIN: 1.43 / MAX: 2.12 MIN: 1.43 / MAX: 2.58 MIN: 1.43 / MAX: 2.54 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 1 3 4 5 2 0.2903 0.5806 0.8709 1.1612 1.4515 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.28 1.28 1.28 1.28 1.29 MIN: 1.27 / MAX: 1.34 MIN: 1.27 / MAX: 1.35 MIN: 1.27 / MAX: 1.47 MIN: 1.27 / MAX: 1.31 MIN: 1.27 / MAX: 1.5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 4 5 0.369 0.738 1.107 1.476 1.845 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.64 1.64 1.64 1.64 1.64 MIN: 1.62 / MAX: 1.67 MIN: 1.63 / MAX: 1.71 MIN: 1.62 / MAX: 1.67 MIN: 1.61 / MAX: 2.74 MIN: 1.62 / MAX: 2.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 3 4 2 5 0.315 0.63 0.945 1.26 1.575 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 1.38 1.38 1.38 1.39 1.40 MIN: 1.36 / MAX: 1.43 MIN: 1.37 / MAX: 1.77 MIN: 1.37 / MAX: 2.56 MIN: 1.37 / MAX: 2.58 MIN: 1.37 / MAX: 9.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet 2 4 1 3 5 0.936 1.872 2.808 3.744 4.68 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 4.13 4.13 4.14 4.14 4.16 MIN: 4.03 / MAX: 8.16 MIN: 4.04 / MAX: 7.11 MIN: 4.06 / MAX: 7.21 MIN: 4.08 / MAX: 5.99 MIN: 4.09 / MAX: 15.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet 1 2 5 3 4 0.8775 1.755 2.6325 3.51 4.3875 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 3.86 3.86 3.88 3.90 3.90 MIN: 3.74 / MAX: 5.31 MIN: 3.71 / MAX: 5.2 MIN: 3.76 / MAX: 5 MIN: 3.74 / MAX: 4.41 MIN: 3.77 / MAX: 7.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 3 2 1 4 5 2K 4K 6K 8K 10K SE +/- 10.24, N = 3 SE +/- 2.81, N = 3 SE +/- 3.89, N = 3 SE +/- 25.22, N = 3 SE +/- 66.63, N = 3 8900.18 8920.17 8921.31 8934.21 8986.35 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No 5 1 4 3 2 2 4 6 8 10 SE +/- 0.087, N = 3 SE +/- 0.063, N = 3 SE +/- 0.073, N = 3 SE +/- 0.068, N = 3 SE +/- 0.063, N = 3 7.825 7.831 7.849 7.858 7.976
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA-512 1 5 3 4 2 500M 1000M 1500M 2000M 2500M SE +/- 1348249.89, N = 3 SE +/- 2162046.36, N = 3 SE +/- 2233333.33, N = 3 SE +/- 1920358.76, N = 3 SE +/- 2493547.23, N = 3 2541833333 2541766667 2541733333 2540333333 2534366667
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA1 5 3 1 4 2 4000M 8000M 12000M 16000M 20000M SE +/- 10927844.15, N = 3 SE +/- 14301048.91, N = 3 SE +/- 14435719.59, N = 3 SE +/- 14493025.14, N = 3 SE +/- 7574592.03, N = 3 18324466667 18322300000 18307600000 18285433333 18259766667
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 4 2 1 3 5 1200 2400 3600 4800 6000 SE +/- 15.61, N = 3 SE +/- 5.02, N = 3 SE +/- 9.16, N = 3 SE +/- 23.06, N = 3 SE +/- 9.15, N = 3 5657.51 5678.60 5680.16 5693.34 5693.66 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: MD5 1 5 3 4 2 12000M 24000M 36000M 48000M 60000M SE +/- 31137883.32, N = 3 SE +/- 76423651.08, N = 3 SE +/- 47871262.55, N = 3 SE +/- 85923577.15, N = 3 SE +/- 70861139.64, N = 3 57420166667 57416166667 57365533333 57329633333 57241333333
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: TrueCrypt RIPEMD160 + XTS 3 1 5 4 2 140K 280K 420K 560K 700K SE +/- 57.74, N = 3 SE +/- 66.67, N = 3 SE +/- 702.38, N = 3 SE +/- 400.00, N = 3 SE +/- 300.00, N = 3 672600 672533 672200 671700 670800
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 1 3 4 2 5 600 1200 1800 2400 3000 SE +/- 2.35, N = 3 SE +/- 13.01, N = 3 SE +/- 19.64, N = 3 SE +/- 6.81, N = 3 SE +/- 16.43, N = 3 2855.00 2860.43 2861.31 2875.90 2878.31 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: 7-Zip 5 2 3 1 4 200K 400K 600K 800K 1000K SE +/- 2945.24, N = 3 SE +/- 4404.92, N = 3 SE +/- 2150.19, N = 3 SE +/- 290.59, N = 3 SE +/- 218.58, N = 3 919933 917200 917000 916533 915167
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 1 2 3 5 4 400 800 1200 1600 2000 SE +/- 4.34, N = 3 SE +/- 3.74, N = 3 SE +/- 7.02, N = 3 SE +/- 4.80, N = 3 SE +/- 1.42, N = 3 1805.53 1806.40 1810.29 1811.35 1812.07 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 1 3 4 2 5 200 400 600 800 1000 SE +/- 3.49, N = 3 SE +/- 5.77, N = 3 SE +/- 6.30, N = 3 SE +/- 6.92, N = 3 SE +/- 6.25, N = 3 922.12 923.24 923.67 924.70 931.02 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Phoronix Test Suite v10.8.4