TR 3970X + TITAN RTX AMD Ryzen Threadripper 3970X 32-Core testing with a ASUS ROG ZENITH II EXTREME (0702 BIOS) and NVIDIA TITAN RTX 24GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009247-PTS-TR3970XT07&grw .
TR 3970X + TITAN RTX Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 1 2 3 AMD Ryzen Threadripper 3970X 32-Core @ 3.70GHz (32 Cores / 64 Threads) ASUS ROG ZENITH II EXTREME (0702 BIOS) AMD Starship/Matisse 64GB 1000GB Corsair Force MP600 NVIDIA TITAN RTX 24GB (1350/7000MHz) NVIDIA TU102 HD Audio ASUS MG28U Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.4.0-47-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 NVIDIA 450.36.06 4.6.0 OpenCL 1.2 CUDA 11.0.185 1.2.133 GCC 9.3.0 + CUDA 11.0 ext4 3840x2160 NVIDIA TITAN RTX 24GB (960/810MHz) NVIDIA TITAN RTX 24GB (675/810MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
TR 3970X + TITAN RTX ncnn: CPU - squeezenet ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny 1 2 3 36.81 18.85 8.41 7.92 8.06 7.62 10.08 3.31 44.53 60.04 20.65 10.36 57.98 28.01 37.35 18.59 8.29 7.77 7.99 7.45 10.16 3.34 45.37 60.57 21.09 10.80 57.95 28.27 37.21 18.88 8.33 8.00 8.13 7.64 10.17 3.26 44.38 61.32 20.59 10.91 57.54 28.65 OpenBenchmarking.org
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet 1 2 3 9 18 27 36 45 SE +/- 0.63, N = 3 SE +/- 0.21, N = 3 SE +/- 0.44, N = 3 36.81 37.35 37.21 MIN: 30.68 / MAX: 60.78 MIN: 32.23 / MAX: 56.41 MIN: 31.72 / MAX: 60.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet 1 2 3 5 10 15 20 25 SE +/- 0.22, N = 3 SE +/- 0.07, N = 3 SE +/- 0.18, N = 3 18.85 18.59 18.88 MIN: 16.5 / MAX: 72.14 MIN: 16.61 / MAX: 38.44 MIN: 16.96 / MAX: 39.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 8.41 8.29 8.33 MIN: 7.32 / MAX: 25.63 MIN: 7.24 / MAX: 23.87 MIN: 7.34 / MAX: 24.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 7.92 7.77 8.00 MIN: 7.02 / MAX: 23.44 MIN: 7.01 / MAX: 26.28 MIN: 7.04 / MAX: 23.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.13, N = 3 SE +/- 0.06, N = 3 8.06 7.99 8.13 MIN: 7.08 / MAX: 24.86 MIN: 7.13 / MAX: 22.58 MIN: 7.26 / MAX: 24.27 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet 1 2 3 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 7.62 7.45 7.64 MIN: 6.65 / MAX: 23.75 MIN: 6.55 / MAX: 24.74 MIN: 6.59 / MAX: 44.94 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 1 2 3 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 10.08 10.16 10.17 MIN: 9.07 / MAX: 28.85 MIN: 8.88 / MAX: 24.71 MIN: 9.02 / MAX: 27.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface 1 2 3 0.7515 1.503 2.2545 3.006 3.7575 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 3.31 3.34 3.26 MIN: 2.91 / MAX: 18.72 MIN: 2.93 / MAX: 40.87 MIN: 2.85 / MAX: 18.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet 1 2 3 10 20 30 40 50 SE +/- 0.23, N = 3 SE +/- 1.55, N = 3 SE +/- 0.27, N = 3 44.53 45.37 44.38 MIN: 38.93 / MAX: 76.85 MIN: 39.02 / MAX: 800.22 MIN: 39.6 / MAX: 62.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 1 2 3 14 28 42 56 70 SE +/- 0.77, N = 3 SE +/- 0.54, N = 3 SE +/- 0.46, N = 3 60.04 60.57 61.32 MIN: 52.68 / MAX: 187.57 MIN: 51.07 / MAX: 121.5 MIN: 53.84 / MAX: 79.73 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 1 2 3 5 10 15 20 25 SE +/- 0.20, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 20.65 21.09 20.59 MIN: 18.85 / MAX: 38.12 MIN: 18.86 / MAX: 41.61 MIN: 18.17 / MAX: 37.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet 1 2 3 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 SE +/- 0.11, N = 3 10.36 10.80 10.91 MIN: 9.27 / MAX: 38.82 MIN: 9.58 / MAX: 28.99 MIN: 9.81 / MAX: 28.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 1 2 3 13 26 39 52 65 SE +/- 0.24, N = 3 SE +/- 0.38, N = 3 SE +/- 0.39, N = 3 57.98 57.95 57.54 MIN: 51.34 / MAX: 81.36 MIN: 51.14 / MAX: 79.36 MIN: 51.01 / MAX: 80.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny 1 2 3 7 14 21 28 35 SE +/- 0.25, N = 3 SE +/- 0.17, N = 3 SE +/- 0.30, N = 3 28.01 28.27 28.65 MIN: 25.72 / MAX: 45.44 MIN: 25.61 / MAX: 53.15 MIN: 26.19 / MAX: 46.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Phoronix Test Suite v10.8.4