TR 3960X WK AMD Ryzen Threadripper 3960X 24-Core testing with a MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) and Sapphire AMD Radeon RX 5500/5500M / Pro 5500M 4GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009286-PTS-TR3960XW65&sro&grs .
TR 3960X WK Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads) MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) AMD Starship/Matisse 32GB 1000GB Sabrent Rocket 4.0 1TB Sapphire AMD Radeon RX 5500/5500M / Pro 5500M 4GB (1900/875MHz) AMD Navi 10 HDMI Audio ASUS MG28U Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.9.0-rc5-14sep-patch (x86_64) 20200914 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.8 (LLVM 10.0.0) 1.2.128 GCC 9.3.0 ext4 3840x2160 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025 Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
TR 3960X WK mlpack: scikit_ica ncnn: Vulkan GPU - yolov4-tiny ncnn: CPU - vgg16 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - squeezenet mlpack: scikit_qda ncnn: Vulkan GPU - alexnet ncnn: CPU - efficientnet-b0 mafft: Multiple Sequence Alignment - LSU RNA ncnn: CPU - mobilenet ncnn: CPU - blazeface ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - yolov4-tiny ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - resnet50 dolfyn: Computational Fluid Dynamics ncnn: CPU - shufflenet-v2 ffte: N=256, 3D Complex FFT Routine ncnn: CPU - resnet50 ncnn: CPU - mnasnet ncnn: Vulkan GPU - vgg16 mlpack: scikit_linearridgeregression byte: Dhrystone 2 caffe: AlexNet - CPU - 1000 ncnn: CPU - alexnet couchdb: 100 - 1000 - 24 caffe: GoogleNet - CPU - 200 ncnn: Vulkan GPU - resnet18 ncnn: CPU - resnet18 caffe: AlexNet - CPU - 100 mlpack: scikit_svm hint: FLOAT caffe: GoogleNet - CPU - 100 hmmer: Pfam Database Search caffe: GoogleNet - CPU - 1000 caffe: AlexNet - CPU - 200 ncnn: CPU - squeezenet ncnn: CPU - googlenet gromacs: Water Benchmark ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - mobilenet 1 2 3 53.53 21.70 41.10 8.35 11.93 6.22 44.98 28.12 8.79 8.119 17.46 2.97 7.28 6.82 27.88 0.90 11.79 15.815 7.31 83303.755210131 23.37 6.58 80.33 1.44 46013391.3 634925 11.49 108.154 318121 3.83 13.89 63797 19.75 387229070.97652 158702 131.076 1585367 127209 16.40 17.76 2.529 4.44 3.20 4.35 8.63 9.60 52.58 21.32 42.30 8.23 12.20 6.27 45.91 28.66 8.64 8.243 17.24 2.93 7.24 6.78 28.20 0.89 11.85 15.860 7.28 83979.875954596 23.55 6.53 80.68 1.43 45773092.9 636843 11.56 107.556 319129 3.82 13.94 63525 19.69 387185459.42906 158891 131.428 1589890 127488 16.39 17.75 2.528 4.44 3.20 4.35 8.25 9.79 54.33 21.05 41.33 8.46 12.25 6.13 45.89 28.36 8.69 8.187 17.22 2.94 7.33 6.86 28.07 0.89 11.92 15.705 7.34 83465.842136051 23.53 6.57 80.10 1.43 46091339.8 638848 11.53 108.106 317413 3.81 13.96 63700 19.67 388597973.06793 159254 131.508 1586757 127527 16.43 17.78 2.527 4.44 3.20 4.35 8.41 10.22 OpenBenchmarking.org
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica 1 2 3 12 24 36 48 60 SE +/- 0.37, N = 3 SE +/- 0.64, N = 3 SE +/- 0.31, N = 3 53.53 52.58 54.33
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 1 2 3 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 SE +/- 0.29, N = 3 21.70 21.32 21.05 MIN: 12.05 / MAX: 42.59 MIN: 13.93 / MAX: 46.1 MIN: 11.08 / MAX: 40.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 1 2 3 10 20 30 40 50 SE +/- 0.19, N = 3 SE +/- 0.52, N = 3 SE +/- 0.34, N = 3 41.10 42.30 41.33 MIN: 40.58 / MAX: 42.83 MIN: 40.29 / MAX: 44.44 MIN: 40.29 / MAX: 125.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 1 2 3 2 4 6 8 10 SE +/- 0.06, N = 3 SE +/- 0.24, N = 3 SE +/- 0.18, N = 3 8.35 8.23 8.46 MIN: 7 / MAX: 36 MIN: 7 / MAX: 39.95 MIN: 7 / MAX: 32.09 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 1 2 3 3 6 9 12 15 SE +/- 0.25, N = 3 SE +/- 0.27, N = 3 SE +/- 0.07, N = 3 11.93 12.20 12.25 MIN: 10.02 / MAX: 43.66 MIN: 10.04 / MAX: 36.91 MIN: 9.98 / MAX: 38.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet 1 2 3 2 4 6 8 10 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 6.22 6.27 6.13 MIN: 5.94 / MAX: 30.39 MIN: 5.92 / MAX: 16.22 MIN: 5.93 / MAX: 9.53 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda 1 2 3 10 20 30 40 50 SE +/- 0.32, N = 3 SE +/- 0.12, N = 3 SE +/- 0.04, N = 3 44.98 45.91 45.89
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet 1 2 3 7 14 21 28 35 SE +/- 0.27, N = 3 SE +/- 0.29, N = 3 SE +/- 0.13, N = 3 28.12 28.66 28.36 MIN: 25.02 / MAX: 62.51 MIN: 24.96 / MAX: 55.15 MIN: 24.67 / MAX: 55.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 1 2 3 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 8.79 8.64 8.69 MIN: 8.52 / MAX: 9.56 MIN: 8.42 / MAX: 13.4 MIN: 8.49 / MAX: 9.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 2 4 6 8 10 SE +/- 0.058, N = 3 SE +/- 0.032, N = 3 SE +/- 0.029, N = 3 8.119 8.243 8.187 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet 1 2 3 4 8 12 16 20 SE +/- 0.15, N = 3 SE +/- 0.19, N = 3 SE +/- 0.08, N = 3 17.46 17.24 17.22 MIN: 16.88 / MAX: 98.56 MIN: 16.79 / MAX: 18.18 MIN: 16.94 / MAX: 18.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface 1 2 3 0.6683 1.3366 2.0049 2.6732 3.3415 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.97 2.93 2.94 MIN: 2.8 / MAX: 3.43 MIN: 2.78 / MAX: 4.11 MIN: 2.79 / MAX: 3.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 7.28 7.24 7.33 MIN: 7.03 / MAX: 9.71 MIN: 6.86 / MAX: 8.46 MIN: 7.03 / MAX: 12.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 6.82 6.78 6.86 MIN: 6.6 / MAX: 8.32 MIN: 6.64 / MAX: 12.14 MIN: 6.67 / MAX: 9.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny 1 2 3 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 SE +/- 0.10, N = 3 27.88 28.20 28.07 MIN: 27.63 / MAX: 32.81 MIN: 27.77 / MAX: 40.69 MIN: 27.72 / MAX: 29.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface 1 2 3 0.2025 0.405 0.6075 0.81 1.0125 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 0.90 0.89 0.89 MIN: 0.88 / MAX: 1.78 MIN: 0.88 / MAX: 1.08 MIN: 0.87 / MAX: 1.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 1 2 3 3 6 9 12 15 SE +/- 0.24, N = 3 SE +/- 0.39, N = 3 SE +/- 0.32, N = 3 11.79 11.85 11.92 MIN: 10.07 / MAX: 37.47 MIN: 10.05 / MAX: 35.67 MIN: 10.07 / MAX: 40.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 15.82 15.86 15.71
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 7.31 7.28 7.34 MIN: 7.1 / MAX: 9.34 MIN: 7.02 / MAX: 8.28 MIN: 7.06 / MAX: 8.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 1 2 3 20K 40K 60K 80K 100K SE +/- 411.46, N = 3 SE +/- 111.80, N = 3 SE +/- 308.33, N = 3 83303.76 83979.88 83465.84 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 1 2 3 6 12 18 24 30 SE +/- 0.11, N = 3 SE +/- 0.02, N = 3 SE +/- 0.12, N = 3 23.37 23.55 23.53 MIN: 23.04 / MAX: 24.13 MIN: 23.36 / MAX: 28.08 MIN: 23.13 / MAX: 25.63 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 6.58 6.53 6.57 MIN: 6.37 / MAX: 7.69 MIN: 6.37 / MAX: 7.64 MIN: 6.42 / MAX: 7.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 1 2 3 20 40 60 80 100 SE +/- 0.26, N = 3 SE +/- 0.21, N = 3 SE +/- 0.31, N = 3 80.33 80.68 80.10 MIN: 70.05 / MAX: 121.13 MIN: 70.8 / MAX: 121.57 MIN: 70.02 / MAX: 120.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression 1 2 3 0.324 0.648 0.972 1.296 1.62 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1.44 1.43 1.43
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 1 2 3 10M 20M 30M 40M 50M SE +/- 536306.21, N = 6 SE +/- 211256.10, N = 3 SE +/- 715922.77, N = 3 46013391.3 45773092.9 46091339.8
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 1000 1 2 3 140K 280K 420K 560K 700K SE +/- 490.79, N = 3 SE +/- 1432.41, N = 3 SE +/- 563.79, N = 3 634925 636843 638848 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet 1 2 3 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 11.49 11.56 11.53 MIN: 11.33 / MAX: 12 MIN: 11.26 / MAX: 12.58 MIN: 11.25 / MAX: 15.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 1 2 3 20 40 60 80 100 SE +/- 0.55, N = 3 SE +/- 0.44, N = 3 SE +/- 0.15, N = 3 108.15 107.56 108.11 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 1 2 3 70K 140K 210K 280K 350K SE +/- 136.00, N = 3 SE +/- 472.70, N = 3 SE +/- 800.55, N = 3 318121 319129 317413 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 1 2 3 0.8618 1.7236 2.5854 3.4472 4.309 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.83 3.82 3.81 MIN: 3.75 / MAX: 4.35 MIN: 3.75 / MAX: 4.28 MIN: 3.74 / MAX: 4.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.15, N = 3 SE +/- 0.16, N = 3 13.89 13.94 13.96 MIN: 13.68 / MAX: 15.18 MIN: 13.53 / MAX: 14.99 MIN: 13.5 / MAX: 15.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 1 2 3 14K 28K 42K 56K 70K SE +/- 143.57, N = 3 SE +/- 97.00, N = 3 SE +/- 154.26, N = 3 63797 63525 63700 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm 1 2 3 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 19.75 19.69 19.67
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT 1 2 3 80M 160M 240M 320M 400M SE +/- 224641.82, N = 3 SE +/- 145428.42, N = 3 SE +/- 333985.70, N = 3 387229070.98 387185459.43 388597973.07 1. (CC) gcc options: -O3 -march=native -lm
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 2 3 30K 60K 90K 120K 150K SE +/- 235.22, N = 3 SE +/- 34.07, N = 3 SE +/- 63.49, N = 3 158702 158891 159254 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 30 60 90 120 150 SE +/- 0.15, N = 3 SE +/- 0.06, N = 3 SE +/- 0.20, N = 3 131.08 131.43 131.51 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 1000 1 2 3 300K 600K 900K 1200K 1500K SE +/- 339.92, N = 3 SE +/- 2111.28, N = 3 SE +/- 2136.95, N = 3 1585367 1589890 1586757 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 1 2 3 30K 60K 90K 120K 150K SE +/- 174.78, N = 3 SE +/- 239.20, N = 3 SE +/- 241.08, N = 3 127209 127488 127527 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet 1 2 3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.13, N = 3 SE +/- 0.13, N = 3 16.40 16.39 16.43 MIN: 16.11 / MAX: 17.71 MIN: 15.99 / MAX: 17.08 MIN: 16.01 / MAX: 17.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet 1 2 3 4 8 12 16 20 SE +/- 0.22, N = 3 SE +/- 0.33, N = 3 SE +/- 0.34, N = 3 17.76 17.75 17.78 MIN: 17.14 / MAX: 19.41 MIN: 16.84 / MAX: 18.95 MIN: 17.02 / MAX: 54.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark 1 2 3 0.569 1.138 1.707 2.276 2.845 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 2.529 2.528 2.527 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet 1 2 3 0.999 1.998 2.997 3.996 4.995 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 4.44 4.44 4.44 MIN: 4.29 / MAX: 5.12 MIN: 4.29 / MAX: 4.8 MIN: 4.29 / MAX: 9.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 1 2 3 0.72 1.44 2.16 2.88 3.6 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.20 3.20 3.20 MIN: 3.14 / MAX: 3.51 MIN: 3.14 / MAX: 4.02 MIN: 3.15 / MAX: 4.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 1 2 3 0.9788 1.9576 2.9364 3.9152 4.894 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 4.35 4.35 4.35 MIN: 4.18 / MAX: 4.71 MIN: 4.16 / MAX: 4.71 MIN: 4.18 / MAX: 4.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 1 2 3 2 4 6 8 10 SE +/- 0.10, N = 3 SE +/- 0.38, N = 3 SE +/- 0.41, N = 3 8.63 8.25 8.41 MIN: 6.94 / MAX: 36.76 MIN: 6.93 / MAX: 44.59 MIN: 6.92 / MAX: 34.2 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet 1 2 3 3 6 9 12 15 SE +/- 0.66, N = 3 SE +/- 0.43, N = 3 SE +/- 0.43, N = 3 9.60 9.79 10.22 MIN: 7.7 / MAX: 35.6 MIN: 7.28 / MAX: 27.36 MIN: 6.48 / MAX: 44.78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Phoronix Test Suite v10.8.5