3900xt pre xmas AMD Ryzen 9 3900XT 12-Core testing with a MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.B3 BIOS) and AMD Radeon RX 56/64 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012199-HA-3900XTPRE13&sro .
3900xt pre xmas Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 AMD Ryzen 9 3900XT 12-Core @ 3.80GHz (12 Cores / 24 Threads) MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.B3 BIOS) AMD Starship/Matisse 16GB 500GB Seagate FireCuda 520 SSD ZP500GM30002 AMD Radeon RX 56/64 8GB (1630/945MHz) AMD Vega 10 HDMI Audio ASUS MG28U Realtek Device 2600 + Realtek Device 3000 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.8.0-31-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 amdgpu 19.1.0 4.6 Mesa 20.2.1 (LLVM 11.0.0) 1.2.131 GCC 10.2.0 ext4 3840x2160 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3900xt pre xmas clomp: Static OMP Speedup build-eigen: Time To Compile encode-ape: WAV To APE ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m encode-wavpack: WAV To WavPack 1 2 3 33.1 69.139 10.730 16.34 5.30 4.61 6.13 4.76 6.96 2.27 16.05 59.08 15.97 12.30 27.30 25.95 18.25 21.05 16.54 5.34 4.64 6.15 4.79 7.02 2.28 16.02 59.29 15.80 12.24 27.60 26.36 18.38 21.12 11.824 33.2 71.717 10.988 16.25 5.32 4.61 6.15 4.77 6.98 2.28 16.11 59.20 15.91 12.19 27.31 26.57 18.21 21.06 16.52 5.33 4.62 6.14 4.78 6.97 2.27 16.05 59.28 15.90 12.18 27.77 26.76 18.56 21.09 11.773 32.4 69.869 10.698 16.63 5.39 4.69 6.25 4.85 7.09 2.34 16.41 59.21 16.12 12.42 28.09 27.23 18.52 21.23 16.55 5.32 4.64 6.15 4.80 7.03 2.28 16.35 58.94 16.11 12.25 27.71 27.48 18.46 21.07 11.795 OpenBenchmarking.org
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup 1 2 3 8 16 24 32 40 SE +/- 0.20, N = 3 SE +/- 0.27, N = 3 SE +/- 0.15, N = 3 33.1 33.2 32.4 1. (CC) gcc options: -fopenmp -O3 -lm
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile 1 2 3 16 32 48 64 80 SE +/- 0.50, N = 3 SE +/- 1.06, N = 4 SE +/- 0.48, N = 3 69.14 71.72 69.87
Monkey Audio Encoding WAV To APE OpenBenchmarking.org Seconds, Fewer Is Better Monkey Audio Encoding 3.99.6 WAV To APE 1 2 3 3 6 9 12 15 SE +/- 0.10, N = 5 SE +/- 0.09, N = 5 SE +/- 0.01, N = 5 10.73 10.99 10.70 1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.14, N = 3 SE +/- 0.11, N = 3 16.34 16.25 16.63 MIN: 16.14 / MAX: 16.77 MIN: 15.9 / MAX: 19.58 MIN: 16.33 / MAX: 20.27 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 1.2128 2.4256 3.6384 4.8512 6.064 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 5.30 5.32 5.39 MIN: 5.19 / MAX: 6.79 MIN: 5.2 / MAX: 6.43 MIN: 5.23 / MAX: 6.53 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 1.0553 2.1106 3.1659 4.2212 5.2765 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 4.61 4.61 4.69 MIN: 4.55 / MAX: 6.19 MIN: 4.55 / MAX: 5.68 MIN: 4.59 / MAX: 9.43 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 6.13 6.15 6.25 MIN: 6.05 / MAX: 10.9 MIN: 6.07 / MAX: 7.05 MIN: 6.06 / MAX: 7.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet 1 2 3 1.0913 2.1826 3.2739 4.3652 5.4565 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 4.76 4.77 4.85 MIN: 4.67 / MAX: 9.53 MIN: 4.7 / MAX: 6.03 MIN: 4.7 / MAX: 5.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 1 2 3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 6.96 6.98 7.09 MIN: 6.88 / MAX: 7.22 MIN: 6.91 / MAX: 7.52 MIN: 6.95 / MAX: 10.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface 1 2 3 0.5265 1.053 1.5795 2.106 2.6325 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.06, N = 3 2.27 2.28 2.34 MIN: 2.23 / MAX: 2.78 MIN: 2.24 / MAX: 2.32 MIN: 2.24 / MAX: 2.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet 1 2 3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.33, N = 3 16.05 16.11 16.41 MIN: 15.67 / MAX: 17.45 MIN: 15.78 / MAX: 19.96 MIN: 15.67 / MAX: 17.46 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 1 2 3 13 26 39 52 65 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 SE +/- 0.20, N = 3 59.08 59.20 59.21 MIN: 58.29 / MAX: 69.84 MIN: 58.24 / MAX: 81.79 MIN: 58.26 / MAX: 68.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.15, N = 3 SE +/- 0.01, N = 3 SE +/- 0.29, N = 3 15.97 15.91 16.12 MIN: 15.64 / MAX: 51.26 MIN: 15.78 / MAX: 16.5 MIN: 15.67 / MAX: 17.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet 1 2 3 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 SE +/- 0.13, N = 3 12.30 12.19 12.42 MIN: 12.02 / MAX: 12.59 MIN: 12.06 / MAX: 12.49 MIN: 12.06 / MAX: 12.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 1 2 3 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.38, N = 3 27.30 27.31 28.09 MIN: 27.13 / MAX: 28 MIN: 27.03 / MAX: 35.65 MIN: 27.21 / MAX: 48.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny 1 2 3 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.46, N = 3 SE +/- 0.53, N = 3 25.95 26.57 27.23 MIN: 25.68 / MAX: 28 MIN: 25.9 / MAX: 27.97 MIN: 26 / MAX: 44.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd 1 2 3 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.18, N = 3 SE +/- 0.06, N = 3 18.25 18.21 18.52 MIN: 17.92 / MAX: 18.83 MIN: 17.75 / MAX: 18.82 MIN: 18.13 / MAX: 30.09 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 21.05 21.06 21.23 MIN: 20.89 / MAX: 28.5 MIN: 20.85 / MAX: 21.54 MIN: 21 / MAX: 21.95 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mobilenet 1 2 3 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 16.54 16.52 16.55 MIN: 16.24 / MAX: 20.35 MIN: 16.14 / MAX: 20.91 MIN: 16.3 / MAX: 20.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 1.2015 2.403 3.6045 4.806 6.0075 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.34 5.33 5.32 MIN: 5.22 / MAX: 6.55 MIN: 5.21 / MAX: 6.87 MIN: 5.21 / MAX: 6.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 1.044 2.088 3.132 4.176 5.22 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 4.64 4.62 4.64 MIN: 4.57 / MAX: 6.17 MIN: 4.56 / MAX: 6.11 MIN: 4.55 / MAX: 6.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 6.15 6.14 6.15 MIN: 6.09 / MAX: 7.03 MIN: 6.05 / MAX: 7.3 MIN: 6.07 / MAX: 7.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mnasnet 1 2 3 1.08 2.16 3.24 4.32 5.4 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 4.79 4.78 4.80 MIN: 4.7 / MAX: 6.05 MIN: 4.7 / MAX: 6.09 MIN: 4.7 / MAX: 6.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 7.02 6.97 7.03 MIN: 6.95 / MAX: 7.6 MIN: 6.89 / MAX: 8.05 MIN: 6.91 / MAX: 8.36 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: blazeface 1 2 3 0.513 1.026 1.539 2.052 2.565 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.28 2.27 2.28 MIN: 2.24 / MAX: 2.36 MIN: 2.23 / MAX: 2.33 MIN: 2.22 / MAX: 2.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: googlenet 1 2 3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.39, N = 3 16.02 16.05 16.35 MIN: 15.67 / MAX: 18.5 MIN: 15.75 / MAX: 16.71 MIN: 15.63 / MAX: 20.84 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: vgg16 1 2 3 13 26 39 52 65 SE +/- 0.04, N = 3 SE +/- 0.13, N = 3 SE +/- 0.19, N = 3 59.29 59.28 58.94 MIN: 58.5 / MAX: 64.34 MIN: 58.52 / MAX: 75.04 MIN: 58.11 / MAX: 62.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.29, N = 3 15.80 15.90 16.11 MIN: 15.68 / MAX: 16.43 MIN: 15.77 / MAX: 17.54 MIN: 15.69 / MAX: 20.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: alexnet 1 2 3 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 12.24 12.18 12.25 MIN: 12.03 / MAX: 12.57 MIN: 12 / MAX: 12.75 MIN: 12.02 / MAX: 12.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet50 1 2 3 7 14 21 28 35 SE +/- 0.31, N = 3 SE +/- 0.34, N = 3 SE +/- 0.50, N = 3 27.60 27.77 27.71 MIN: 27.03 / MAX: 37.48 MIN: 27.16 / MAX: 32.06 MIN: 27.01 / MAX: 76.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 6 12 18 24 30 SE +/- 0.14, N = 3 SE +/- 0.47, N = 3 SE +/- 0.11, N = 2 26.36 26.76 27.48 MIN: 25.67 / MAX: 82.75 MIN: 26.09 / MAX: 28.17 MIN: 27.16 / MAX: 28.59 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: squeezenet_ssd 1 2 3 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.12, N = 3 18.38 18.56 18.46 MIN: 18 / MAX: 27.75 MIN: 17.98 / MAX: 77.54 MIN: 18.01 / MAX: 72.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 21.12 21.09 21.07 MIN: 20.93 / MAX: 21.74 MIN: 20.86 / MAX: 21.63 MIN: 20.89 / MAX: 22.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 11.82 11.77 11.80 1. (CXX) g++ options: -rdynamic
Phoronix Test Suite v10.8.4