3990x-cassandra-yafa-ncnn AMD Ryzen Threadripper 3990X 64-Core testing with a Gigabyte TRX40 AORUS PRO WIFI (F4p BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 on Pop 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2107292-IB-3990XCASS89&grw&sro .
3990x-cassandra-yafa-ncnn Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution 1 2 2a 3 4 AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads) Gigabyte TRX40 AORUS PRO WIFI (F4p BIOS) AMD Starship/Matisse 126GB Samsung SSD 970 EVO Plus 500GB AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 (1750/875MHz) AMD Navi 10 HDMI Audio DELL P2415Q Intel I211 + Intel Wi-Fi 6 AX200 Pop 21.04 5.11.0-7620-generic (x86_64) GNOME Shell 3.38.4 X Server 1.20.9 1.2.145 GCC 10.3.0 ext4 3840x2160 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301039 Java Details - 1, 2, 3, 4: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2) Python Details - 1, 2, 3, 4: Python 3.9.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3990x-cassandra-yafa-ncnn ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m qe: AUSURF112 yafaray: Total Time For Sample Scene cassandra: Reads cassandra: Writes cassandra: Mixed 1:1 cassandra: Mixed 1:3 1 2 2a 3 4 31.16 17.95 17.02 16.11 17.59 22.33 7.34 31.03 56.28 20.90 14.15 41.63 39.77 30.81 51.08 341.21 52.611 371344 262391 286781 266381 31.80 15.52 14.47 15.30 14.66 19.29 6.70 29.11 56.17 21.06 14.16 42.04 40.11 30.91 47.96 53.639 344802 261119 288416 342.76 30.64 15.24 14.29 15.02 14.18 18.66 6.78 28.87 55.29 20.38 13.93 40.83 38.59 30.56 48.19 340.44 49.906 31.10 15.32 14.36 14.83 14.19 18.49 6.64 28.83 55.47 20.81 14.17 40.66 39.35 30.85 47.53 340.82 54.888 OpenBenchmarking.org
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet 1 2 3 4 7 14 21 28 35 SE +/- 0.20, N = 3 SE +/- 0.29, N = 3 SE +/- 0.18, N = 3 31.16 31.80 30.64 31.10 MIN: 29.07 / MAX: 36.76 MIN: 29.44 / MAX: 37.64 MIN: 29.04 / MAX: 34.86 MIN: 29.33 / MAX: 35.59 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 4 4 8 12 16 20 SE +/- 2.21, N = 3 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 17.95 15.52 15.24 15.32 MIN: 14.79 / MAX: 31.53 MIN: 14.68 / MAX: 17.46 MIN: 14.81 / MAX: 19.83 MIN: 14.7 / MAX: 19.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 4 4 8 12 16 20 SE +/- 2.02, N = 3 SE +/- 0.21, N = 3 SE +/- 0.08, N = 3 17.02 14.47 14.29 14.36 MIN: 14.23 / MAX: 24.11 MIN: 13.74 / MAX: 18.85 MIN: 13.87 / MAX: 15.87 MIN: 13.8 / MAX: 126.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 1 2 3 4 4 8 12 16 20 SE +/- 0.74, N = 3 SE +/- 0.25, N = 3 SE +/- 0.12, N = 3 16.11 15.30 15.02 14.83 MIN: 14.35 / MAX: 19.69 MIN: 14.07 / MAX: 18.67 MIN: 14.5 / MAX: 15.74 MIN: 13.97 / MAX: 19.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet 1 2 3 4 4 8 12 16 20 SE +/- 2.78, N = 3 SE +/- 0.25, N = 3 SE +/- 0.07, N = 3 17.59 14.66 14.18 14.19 MIN: 13.88 / MAX: 27.71 MIN: 13.77 / MAX: 20.5 MIN: 13.89 / MAX: 18.5 MIN: 13.81 / MAX: 18.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 1 2 3 4 5 10 15 20 25 SE +/- 1.84, N = 3 SE +/- 0.38, N = 3 SE +/- 0.01, N = 3 22.33 19.29 18.66 18.49 MIN: 18.1 / MAX: 589.99 MIN: 18.02 / MAX: 24.54 MIN: 18.19 / MAX: 19.51 MIN: 18.09 / MAX: 22.62 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface 1 2 3 4 2 4 6 8 10 SE +/- 0.58, N = 3 SE +/- 0.12, N = 3 SE +/- 0.02, N = 3 7.34 6.70 6.78 6.64 MIN: 6.41 / MAX: 11.18 MIN: 6.28 / MAX: 8.05 MIN: 6.56 / MAX: 10.71 MIN: 6.36 / MAX: 10.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet 1 2 3 4 7 14 21 28 35 SE +/- 1.56, N = 3 SE +/- 0.23, N = 3 SE +/- 0.08, N = 3 31.03 29.11 28.87 28.83 MIN: 27.62 / MAX: 38.39 MIN: 27.79 / MAX: 35.59 MIN: 27.89 / MAX: 30.5 MIN: 27.75 / MAX: 33.78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 1 2 3 4 13 26 39 52 65 SE +/- 1.25, N = 3 SE +/- 0.14, N = 3 SE +/- 0.15, N = 3 56.28 56.17 55.29 55.47 MIN: 51.74 / MAX: 545.42 MIN: 51.58 / MAX: 400.42 MIN: 52.53 / MAX: 62.55 MIN: 52.76 / MAX: 73.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 1 2 3 4 5 10 15 20 25 SE +/- 0.28, N = 3 SE +/- 0.19, N = 3 SE +/- 0.28, N = 3 20.90 21.06 20.38 20.81 MIN: 19.75 / MAX: 25.44 MIN: 20.2 / MAX: 22.36 MIN: 19.86 / MAX: 25.1 MIN: 19.86 / MAX: 25.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet 1 2 3 4 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 14.15 14.16 13.93 14.17 MIN: 13.61 / MAX: 15.9 MIN: 13.72 / MAX: 17.38 MIN: 13.56 / MAX: 14.62 MIN: 13.71 / MAX: 19.01 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 1 2 3 4 10 20 30 40 50 SE +/- 0.55, N = 3 SE +/- 0.91, N = 3 SE +/- 0.58, N = 3 41.63 42.04 40.83 40.66 MIN: 38.62 / MAX: 47.35 MIN: 38.63 / MAX: 49.92 MIN: 39.35 / MAX: 143.81 MIN: 38.27 / MAX: 45.6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny 1 2 3 4 9 18 27 36 45 SE +/- 0.33, N = 3 SE +/- 0.38, N = 3 SE +/- 0.17, N = 3 39.77 40.11 38.59 39.35 MIN: 38.73 / MAX: 44.47 MIN: 38.73 / MAX: 48.12 MIN: 37.9 / MAX: 50.82 MIN: 38.24 / MAX: 44.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd 1 2 3 4 7 14 21 28 35 SE +/- 0.18, N = 3 SE +/- 0.08, N = 3 SE +/- 0.33, N = 3 30.81 30.91 30.56 30.85 MIN: 29.82 / MAX: 38.46 MIN: 30.13 / MAX: 35.05 MIN: 29.93 / MAX: 37.53 MIN: 29.9 / MAX: 35.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m 1 2 3 4 12 24 36 48 60 SE +/- 1.44, N = 3 SE +/- 0.22, N = 3 SE +/- 0.20, N = 3 51.08 47.96 48.19 47.53 MIN: 46.73 / MAX: 58.38 MIN: 46.45 / MAX: 52.91 MIN: 46.97 / MAX: 51.84 MIN: 46.33 / MAX: 52.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.8 Input: AUSURF112 1 2a 3 4 70 140 210 280 350 341.21 342.76 340.44 340.82 1. (F9X) gfortran options: -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.5.1 Total Time For Sample Scene 1 2 3 4 12 24 36 48 60 SE +/- 0.60, N = 15 SE +/- 0.50, N = 15 SE +/- 0.49, N = 3 52.61 53.64 49.91 54.89 1. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype
Apache Cassandra Test: Reads OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Reads 1 2 80K 160K 240K 320K 400K SE +/- 19762.08, N = 6 SE +/- 19382.76, N = 7 371344 344802
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Writes 1 2 60K 120K 180K 240K 300K SE +/- 831.31, N = 3 SE +/- 2124.44, N = 3 262391 261119
Apache Cassandra Test: Mixed 1:1 OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Mixed 1:1 1 2 60K 120K 180K 240K 300K SE +/- 5040.07, N = 9 SE +/- 2418.26, N = 9 286781 288416
Apache Cassandra Test: Mixed 1:3 OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Mixed 1:3 1 60K 120K 180K 240K 300K SE +/- 5019.24, N = 9 266381
Phoronix Test Suite v10.8.4