TR 3960X WK AMD Ryzen Threadripper 3960X 24-Core testing with a MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) and Sapphire AMD Radeon RX 5500/5500M / Pro 5500M 4GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009286-PTS-TR3960XW65&rdt .
TR 3960X WK Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads) MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) AMD Starship/Matisse 32GB 1000GB Sabrent Rocket 4.0 1TB Sapphire AMD Radeon RX 5500/5500M / Pro 5500M 4GB (1900/875MHz) AMD Navi 10 HDMI Audio ASUS MG28U Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.9.0-rc5-14sep-patch (x86_64) 20200914 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.8 (LLVM 10.0.0) 1.2.128 GCC 9.3.0 ext4 3840x2160 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025 Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
TR 3960X WK dolfyn: Computational Fluid Dynamics ffte: N=256, 3D Complex FFT Routine hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA byte: Dhrystone 2 couchdb: 100 - 1000 - 24 gromacs: Water Benchmark caffe: AlexNet - CPU - 100 caffe: AlexNet - CPU - 200 caffe: AlexNet - CPU - 1000 caffe: GoogleNet - CPU - 100 caffe: GoogleNet - CPU - 200 caffe: GoogleNet - CPU - 1000 ncnn: CPU - squeezenet ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny hint: FLOAT mlpack: scikit_ica mlpack: scikit_qda mlpack: scikit_svm mlpack: scikit_linearridgeregression 1 2 3 15.815 83303.755210131 131.076 8.119 46013391.3 108.154 2.529 63797 127209 634925 158702 318121 1585367 16.40 17.46 7.28 6.82 7.31 6.58 8.79 2.97 17.76 41.10 13.89 11.49 23.37 27.88 6.22 9.60 4.35 8.35 3.20 4.44 11.93 0.90 8.63 80.33 3.83 28.12 11.79 21.70 387229070.97652 53.53 44.98 19.75 1.44 15.860 83979.875954596 131.428 8.243 45773092.9 107.556 2.528 63525 127488 636843 158891 319129 1589890 16.39 17.24 7.24 6.78 7.28 6.53 8.64 2.93 17.75 42.30 13.94 11.56 23.55 28.20 6.27 9.79 4.35 8.23 3.20 4.44 12.20 0.89 8.25 80.68 3.82 28.66 11.85 21.32 387185459.42906 52.58 45.91 19.69 1.43 15.705 83465.842136051 131.508 8.187 46091339.8 108.106 2.527 63700 127527 638848 159254 317413 1586757 16.43 17.22 7.33 6.86 7.34 6.57 8.69 2.94 17.78 41.33 13.96 11.53 23.53 28.07 6.13 10.22 4.35 8.46 3.20 4.44 12.25 0.89 8.41 80.10 3.81 28.36 11.92 21.05 388597973.06793 54.33 45.89 19.67 1.43 OpenBenchmarking.org
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 15.82 15.86 15.71
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 1 2 3 20K 40K 60K 80K 100K SE +/- 411.46, N = 3 SE +/- 111.80, N = 3 SE +/- 308.33, N = 3 83303.76 83979.88 83465.84 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 30 60 90 120 150 SE +/- 0.15, N = 3 SE +/- 0.06, N = 3 SE +/- 0.20, N = 3 131.08 131.43 131.51 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 2 4 6 8 10 SE +/- 0.058, N = 3 SE +/- 0.032, N = 3 SE +/- 0.029, N = 3 8.119 8.243 8.187 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 1 2 3 10M 20M 30M 40M 50M SE +/- 536306.21, N = 6 SE +/- 211256.10, N = 3 SE +/- 715922.77, N = 3 46013391.3 45773092.9 46091339.8
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 1 2 3 20 40 60 80 100 SE +/- 0.55, N = 3 SE +/- 0.44, N = 3 SE +/- 0.15, N = 3 108.15 107.56 108.11 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark 1 2 3 0.569 1.138 1.707 2.276 2.845 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 2.529 2.528 2.527 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 1 2 3 14K 28K 42K 56K 70K SE +/- 143.57, N = 3 SE +/- 97.00, N = 3 SE +/- 154.26, N = 3 63797 63525 63700 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 1 2 3 30K 60K 90K 120K 150K SE +/- 174.78, N = 3 SE +/- 239.20, N = 3 SE +/- 241.08, N = 3 127209 127488 127527 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 1000 1 2 3 140K 280K 420K 560K 700K SE +/- 490.79, N = 3 SE +/- 1432.41, N = 3 SE +/- 563.79, N = 3 634925 636843 638848 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 2 3 30K 60K 90K 120K 150K SE +/- 235.22, N = 3 SE +/- 34.07, N = 3 SE +/- 63.49, N = 3 158702 158891 159254 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 1 2 3 70K 140K 210K 280K 350K SE +/- 136.00, N = 3 SE +/- 472.70, N = 3 SE +/- 800.55, N = 3 318121 319129 317413 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 1000 1 2 3 300K 600K 900K 1200K 1500K SE +/- 339.92, N = 3 SE +/- 2111.28, N = 3 SE +/- 2136.95, N = 3 1585367 1589890 1586757 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet 1 2 3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.13, N = 3 SE +/- 0.13, N = 3 16.40 16.39 16.43 MIN: 16.11 / MAX: 17.71 MIN: 15.99 / MAX: 17.08 MIN: 16.01 / MAX: 17.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet 1 2 3 4 8 12 16 20 SE +/- 0.15, N = 3 SE +/- 0.19, N = 3 SE +/- 0.08, N = 3 17.46 17.24 17.22 MIN: 16.88 / MAX: 98.56 MIN: 16.79 / MAX: 18.18 MIN: 16.94 / MAX: 18.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 7.28 7.24 7.33 MIN: 7.03 / MAX: 9.71 MIN: 6.86 / MAX: 8.46 MIN: 7.03 / MAX: 12.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 6.82 6.78 6.86 MIN: 6.6 / MAX: 8.32 MIN: 6.64 / MAX: 12.14 MIN: 6.67 / MAX: 9.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 7.31 7.28 7.34 MIN: 7.1 / MAX: 9.34 MIN: 7.02 / MAX: 8.28 MIN: 7.06 / MAX: 8.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 6.58 6.53 6.57 MIN: 6.37 / MAX: 7.69 MIN: 6.37 / MAX: 7.64 MIN: 6.42 / MAX: 7.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 1 2 3 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 8.79 8.64 8.69 MIN: 8.52 / MAX: 9.56 MIN: 8.42 / MAX: 13.4 MIN: 8.49 / MAX: 9.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface 1 2 3 0.6683 1.3366 2.0049 2.6732 3.3415 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.97 2.93 2.94 MIN: 2.8 / MAX: 3.43 MIN: 2.78 / MAX: 4.11 MIN: 2.79 / MAX: 3.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet 1 2 3 4 8 12 16 20 SE +/- 0.22, N = 3 SE +/- 0.33, N = 3 SE +/- 0.34, N = 3 17.76 17.75 17.78 MIN: 17.14 / MAX: 19.41 MIN: 16.84 / MAX: 18.95 MIN: 17.02 / MAX: 54.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 1 2 3 10 20 30 40 50 SE +/- 0.19, N = 3 SE +/- 0.52, N = 3 SE +/- 0.34, N = 3 41.10 42.30 41.33 MIN: 40.58 / MAX: 42.83 MIN: 40.29 / MAX: 44.44 MIN: 40.29 / MAX: 125.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.15, N = 3 SE +/- 0.16, N = 3 13.89 13.94 13.96 MIN: 13.68 / MAX: 15.18 MIN: 13.53 / MAX: 14.99 MIN: 13.5 / MAX: 15.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet 1 2 3 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 11.49 11.56 11.53 MIN: 11.33 / MAX: 12 MIN: 11.26 / MAX: 12.58 MIN: 11.25 / MAX: 15.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 1 2 3 6 12 18 24 30 SE +/- 0.11, N = 3 SE +/- 0.02, N = 3 SE +/- 0.12, N = 3 23.37 23.55 23.53 MIN: 23.04 / MAX: 24.13 MIN: 23.36 / MAX: 28.08 MIN: 23.13 / MAX: 25.63 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny 1 2 3 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 SE +/- 0.10, N = 3 27.88 28.20 28.07 MIN: 27.63 / MAX: 32.81 MIN: 27.77 / MAX: 40.69 MIN: 27.72 / MAX: 29.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet 1 2 3 2 4 6 8 10 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 6.22 6.27 6.13 MIN: 5.94 / MAX: 30.39 MIN: 5.92 / MAX: 16.22 MIN: 5.93 / MAX: 9.53 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet 1 2 3 3 6 9 12 15 SE +/- 0.66, N = 3 SE +/- 0.43, N = 3 SE +/- 0.43, N = 3 9.60 9.79 10.22 MIN: 7.7 / MAX: 35.6 MIN: 7.28 / MAX: 27.36 MIN: 6.48 / MAX: 44.78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 0.9788 1.9576 2.9364 3.9152 4.894 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 4.35 4.35 4.35 MIN: 4.18 / MAX: 4.71 MIN: 4.16 / MAX: 4.71 MIN: 4.18 / MAX: 4.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 2 4 6 8 10 SE +/- 0.06, N = 3 SE +/- 0.24, N = 3 SE +/- 0.18, N = 3 8.35 8.23 8.46 MIN: 7 / MAX: 36 MIN: 7 / MAX: 39.95 MIN: 7 / MAX: 32.09 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 0.72 1.44 2.16 2.88 3.6 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.20 3.20 3.20 MIN: 3.14 / MAX: 3.51 MIN: 3.14 / MAX: 4.02 MIN: 3.15 / MAX: 4.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet 1 2 3 0.999 1.998 2.997 3.996 4.995 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 4.44 4.44 4.44 MIN: 4.29 / MAX: 5.12 MIN: 4.29 / MAX: 4.8 MIN: 4.29 / MAX: 9.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 3 6 9 12 15 SE +/- 0.25, N = 3 SE +/- 0.27, N = 3 SE +/- 0.07, N = 3 11.93 12.20 12.25 MIN: 10.02 / MAX: 43.66 MIN: 10.04 / MAX: 36.91 MIN: 9.98 / MAX: 38.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface 1 2 3 0.2025 0.405 0.6075 0.81 1.0125 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 0.90 0.89 0.89 MIN: 0.88 / MAX: 1.78 MIN: 0.88 / MAX: 1.08 MIN: 0.87 / MAX: 1.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 1 2 3 2 4 6 8 10 SE +/- 0.10, N = 3 SE +/- 0.38, N = 3 SE +/- 0.41, N = 3 8.63 8.25 8.41 MIN: 6.94 / MAX: 36.76 MIN: 6.93 / MAX: 44.59 MIN: 6.92 / MAX: 34.2 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 1 2 3 20 40 60 80 100 SE +/- 0.26, N = 3 SE +/- 0.21, N = 3 SE +/- 0.31, N = 3 80.33 80.68 80.10 MIN: 70.05 / MAX: 121.13 MIN: 70.8 / MAX: 121.57 MIN: 70.02 / MAX: 120.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 1 2 3 0.8618 1.7236 2.5854 3.4472 4.309 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.83 3.82 3.81 MIN: 3.75 / MAX: 4.35 MIN: 3.75 / MAX: 4.28 MIN: 3.74 / MAX: 4.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet 1 2 3 7 14 21 28 35 SE +/- 0.27, N = 3 SE +/- 0.29, N = 3 SE +/- 0.13, N = 3 28.12 28.66 28.36 MIN: 25.02 / MAX: 62.51 MIN: 24.96 / MAX: 55.15 MIN: 24.67 / MAX: 55.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 1 2 3 3 6 9 12 15 SE +/- 0.24, N = 3 SE +/- 0.39, N = 3 SE +/- 0.32, N = 3 11.79 11.85 11.92 MIN: 10.07 / MAX: 37.47 MIN: 10.05 / MAX: 35.67 MIN: 10.07 / MAX: 40.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 SE +/- 0.29, N = 3 21.70 21.32 21.05 MIN: 12.05 / MAX: 42.59 MIN: 13.93 / MAX: 46.1 MIN: 11.08 / MAX: 40.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT 1 2 3 80M 160M 240M 320M 400M SE +/- 224641.82, N = 3 SE +/- 145428.42, N = 3 SE +/- 333985.70, N = 3 387229070.98 387185459.43 388597973.07 1. (CC) gcc options: -O3 -march=native -lm
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica 1 2 3 12 24 36 48 60 SE +/- 0.37, N = 3 SE +/- 0.64, N = 3 SE +/- 0.31, N = 3 53.53 52.58 54.33
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda 1 2 3 10 20 30 40 50 SE +/- 0.32, N = 3 SE +/- 0.12, N = 3 SE +/- 0.04, N = 3 44.98 45.91 45.89
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm 1 2 3 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 19.75 19.69 19.67
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression 1 2 3 0.324 0.648 0.972 1.296 1.62 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1.44 1.43 1.43
Phoronix Test Suite v10.8.5