TR 3990X Okt

AMD Ryzen Threadripper 3990X 64-Core testing with a System76 Thelio Major (F4c Z5 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2010117-FI-TR3990XOK30&grw&sro .
TR 3990X Okt: system details (runs: Linux 5.4, 2, 3)
Processor: AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads)
Motherboard: System76 Thelio Major (F4c Z5 BIOS)
Chipset: AMD Starship/Matisse
Memory: 126GB
Disk: Samsung SSD 970 EVO Plus 500GB
Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (1750/875MHz)
Audio: AMD Navi 10 HDMI Audio
Monitor: G237HL
Network: Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.04
Kernel: 5.4.0-47-generic (x86_64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: amdgpu 19.1.0
OpenGL: 4.6 Mesa 20.0.8 (LLVM 10.0.0)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: acpi-cpufreq ondemand; CPU Microcode: 0x8301025
Graphics Details: GLAMOR
Python Details: Python 2.7.18rc1 + Python 3.8.2
Security Details:
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling
  srbds: Not affected
  tsx_async_abort: Not affected
TR 3990X Okt: result overview
Test | Linux 5.4 | 2 | 3
system-decompress-gzip: | 2.864 | 2.849 | 2.856
sockperf: Throughput | 563933 | 563495 | 572421
sockperf: Latency Ping Pong | 3.755 | 3.798 | 3.802
sockperf: Latency Under Load | 14.711 | 14.608 | 14.593
hint: FLOAT | 376705975.84097 | 375641035.45247 | 374097932.78222
libraw: Post-Processing Benchmark | 41.87 | 41.55 | 41.40
webp: Default | 1.503 | 1.501 | 1.494
webp: Quality 100 | 2.294 | 2.298 | 2.305
webp: Quality 100, Lossless | 16.220 | 16.310 | 16.379
webp: Quality 100, Highest Compression | 7.141 | 7.123 | 7.172
webp: Quality 100, Lossless, Highest Compression | 33.528 | 33.770 | 34.126
espeak: Text-To-Speech Synthesis | 28.581 | 28.463 | 28.747
glmark2: 1920 x 1080 | 8656 | 8656 | 8649
hmmer: Pfam Database Search | 165.003 | 166.113 | 165.748
mafft: Multiple Sequence Alignment - LSU RNA | 9.002 | 9.056 | 9.135
lczero: BLAS | 1516 | 1505 | 1474
lczero: Eigen | 1472 | 1446 | 1483
lczero: Rand | 194138 | 193664 | 192306
dolfyn: Computational Fluid Dynamics | 16.965 | 17.044 | 16.884
rnnoise: | 18.639 | 18.772 | 18.737
mnn: SqueezeNetV1.0 | 8.451 | 8.519 | 8.664
mnn: resnet-v2-50 | 32.798 | 32.629 | 32.291
mnn: MobileNetV2_224 | 5.465 | 5.442 | 5.422
mnn: mobilenet-v1-1.0 | 5.496 | 5.480 | 5.480
mnn: inception-v3 | 31.884 | 31.723 | 31.607
tnn: CPU - MobileNet v2 | 270.100 | 268.809 | 268.086
tnn: CPU - SqueezeNet v1.1 | 245.844 | 244.172 | 246.780
caffe: AlexNet - CPU - 100 | 54629 | 56118 | 56032
caffe: AlexNet - CPU - 200 | 111761 | 112360 | 112011
caffe: GoogleNet - CPU - 100 | 150216 | 150709 | 150522
caffe: GoogleNet - CPU - 200 | 301954 | 301781 | 303072
ncnn: CPU - squeezenet | 25.07 | 25.35 | 25.63
ncnn: CPU - mobilenet | 27.40 | 27.48 | 28.21
ncnn: CPU-v2-v2 - mobilenet-v2 | 13.12 | 13.47 | 13.39
mpv: Big Buck Bunny Sunflower 1080p - Software Only | 2574.81 | 2563.28 | 2568.83
mpv: Big Buck Bunny Sunflower 4K - Software Only | 1046.24 | 1037.84 | 1050.73
ncnn: CPU-v3-v3 - mobilenet-v3 | 12.96 | 14.34 | 13.85
ncnn: CPU - shufflenet-v2 | 13.11 | 14.34 | 14.12
ncnn: CPU - mnasnet | 12.37 | 14.58 | 12.93
ncnn: CPU - efficientnet-b0 | 16.49 | 18.21 | 17.31
ncnn: CPU - blazeface | 5.51 | 6.10 | 5.88
ncnn: CPU - googlenet | 25.59 | 26.94 | 26.31
ncnn: CPU - vgg16 | 52.69 | 52.14 | 52.12
ncnn: CPU - resnet18 | 18.63 | 18.23 | 18.47
ncnn: CPU - alexnet | 12.16 | 11.75 | 12.17
ncnn: CPU - resnet50 | 36.78 | 36.39 | 37.34
ncnn: CPU - yolov4-tiny | 36.40 | 35.05 | 35.66
ncnn: Vulkan GPU - squeezenet | 5.69 | 5.70 | 5.74
ncnn: Vulkan GPU - mobilenet | 9.68 | 9.53 | 9.71
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 | 4.54 | 4.53 | 4.53
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 | 8.72 | 8.71 | 8.71
ncnn: Vulkan GPU - shufflenet-v2 | 3.43 | 3.55 | 3.43
ncnn: Vulkan GPU - mnasnet | 4.61 | 4.60 | 4.61
ncnn: Vulkan GPU - efficientnet-b0 | 11.53 | 11.46 | 11.50
ncnn: Vulkan GPU - blazeface | 1.09 | 1.06 | 1.08
ncnn: Vulkan GPU - googlenet | 8.45 | 8.50 | 8.46
ncnn: Vulkan GPU - vgg16 | 83.73 | 84.12 | 83.47
ncnn: Vulkan GPU - resnet18 | 3.30 | 3.31 | 3.30
ncnn: Vulkan GPU - alexnet | 30.49 | 30.27 | 30.02
ncnn: Vulkan GPU - resnet50 | 9.92 | 9.82 | 9.58
ncnn: Vulkan GPU - yolov4-tiny | 14.46 | 14.37 | 14.39
mlpack: scikit_ica | 51.50 | 51.92 | 52.39
mlpack: scikit_qda | 42.09 | 42.43 | 42.79
mlpack: scikit_svm | 20.90 | 20.91 | 20.91
mlpack: scikit_linearridgeregression | 1.61 | 1.62 | 1.65
gromacs: Water Benchmark | 3.761 | 3.754 | 3.753
lammps: 20k Atoms | 26.555 | 26.287 | 26.178
lammps: Rhodopsin Protein | 23.682 | 23.338 | 23.888
namd: ATPase Simulation - 327,506 Atoms | 0.43908 | 0.43837 | 0.44425
openvino: Face Detection 0106 FP16 - CPU | 9.65 | 9.64 | 9.65
openvino: Face Detection 0106 FP16 - CPU | 3294.43 | 3286.54 | 3284.11
openvino: Face Detection 0106 FP32 - CPU | 9.58 | 9.56 | 9.60
openvino: Face Detection 0106 FP32 - CPU | 3293.71 | 3316.90 | 3307.26
openvino: Person Detection 0106 FP16 - CPU | 6.68 | 6.64 | 6.60
openvino: Person Detection 0106 FP16 - CPU | 4662.58 | 4683.32 | 4721.76
openvino: Person Detection 0106 FP32 - CPU | 6.50 | 6.52 | 6.51
openvino: Person Detection 0106 FP32 - CPU | 4799.77 | 4752.26 | 4782.63
openvino: Age Gender Recognition Retail 0013 FP16 - CPU | 33390.81 | 33277.84 | 33230.50
openvino: Age Gender Recognition Retail 0013 FP16 - CPU | 0.93 | 0.93 | 0.93
openvino: Age Gender Recognition Retail 0013 FP32 - CPU | 33014.26 | 32931.77 | 32913.00
openvino: Age Gender Recognition Retail 0013 FP32 - CPU | 0.94 | 0.94 | 0.94
ffte: N=256, 3D Complex FFT Routine | 129902.32261876 | 127987.82451596 | 128693.18322745
kripke: | 44930063 | 44873723 | 44476553
mocassin: Dust 2D tau100.0 | 228 | 229 | 230
incompact3d: Cylinder | 183.000783 | 183.694458 | 184.298365
gpaw: Carbon Nanotube | 110.688 | 110.650 | 110.575
build-llvm: Time To Compile | 203.220 | 203.638 | 203.351
aom-av1: Speed 0 Two-Pass | 0.33 | 0.33 | 0.33
aom-av1: Speed 4 Two-Pass | 2.53 | 2.51 | 2.53
aom-av1: Speed 6 Realtime | 18.09 | 18.12 | 17.86
aom-av1: Speed 6 Two-Pass | 3.90 | 3.90 | 3.91
aom-av1: Speed 8 Realtime | 34.83 | 34.49 | 34.69
realsr-ncnn: 4x - No | 16.882 | 16.543 | 16.583
vkfft: | 20490 | 20538 | 20532
couchdb: 100 - 1000 - 24 | 118.091 | 118.569 | 117.927
influxdb: 4 - 10000 - 2,5000,1 - 10000 | 1148311.0 | 1146038.5 | 1144571.8
influxdb: 64 - 10000 - 2,5000,1 - 10000 | 1498935.7 | 1501667.1 | 1495506.9
influxdb: 1024 - 10000 - 2,5000,1 - 10000 | 1547196.9 | 1539068.1 | 1534264.5
keydb: | 450800.42 | 440299.41 | 447704.19
byte: Dhrystone 2 | 42607098.5 | 43503603.1 | 44247620.1
System GZIP Decompression (Seconds, Fewer Is Better)
  2: 2.849 (SE +/- 0.002, N = 3)
  3: 2.856 (SE +/- 0.001, N = 3)
  Linux 5.4: 2.864 (SE +/- 0.018, N = 3)
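Each result in this export is a mean with its standard error (SE) over N runs. One rough way to judge whether two runs really differ is to express the gap between their means in units of the combined standard error. The sketch below does this for the System GZIP Decompression figures of runs "2" and "Linux 5.4". The helper name `gap_in_se_units` and the idea of treating a gap of less than about two combined SEs as noise are assumptions for illustration; the Phoronix Test Suite does not report this figure.

```python
import math


def gap_in_se_units(mean_a, se_a, mean_b, se_b):
    """Return |mean_a - mean_b| divided by the combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# System GZIP Decompression, seconds (fewer is better), taken from the result above.
run_2 = (2.849, 0.002)      # (mean, SE) for run "2"
linux_54 = (2.864, 0.018)   # (mean, SE) for run "Linux 5.4"

ratio = gap_in_se_units(*run_2, *linux_54)
print(f"gap = {ratio:.2f} combined SEs")
```

With these inputs the gap is smaller than one combined SE, so under the stated assumption the two runs would not be treated as measurably different on this test.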
Sockperf 3.4 - Test: Throughput (Messages Per Second, More Is Better)
  2: 563495 (SE +/- 7244.39, N = 5)
  3: 572421 (SE +/- 5468.48, N = 5)
  Linux 5.4: 563933 (SE +/- 6120.70, N = 5)
  1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf 3.4 - Test: Latency Ping Pong (usec, Fewer Is Better)
  2: 3.798 (SE +/- 0.025, N = 5)
  3: 3.802 (SE +/- 0.018, N = 5)
  Linux 5.4: 3.755 (SE +/- 0.027, N = 5)
  1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf 3.4 - Test: Latency Under Load (usec, Fewer Is Better)
  2: 14.61 (SE +/- 0.10, N = 5)
  3: 14.59 (SE +/- 0.17, N = 6)
  Linux 5.4: 14.71 (SE +/- 0.06, N = 5)
  1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Hierarchical INTegration 1.0 - Test: FLOAT (QUIPs, More Is Better)
  2: 375641035.45 (SE +/- 1833857.81, N = 3)
  3: 374097932.78 (SE +/- 1447224.32, N = 3)
  Linux 5.4: 376705975.84 (SE +/- 918474.47, N = 3)
  1. (CC) gcc options: -O3 -march=native -lm
LibRaw 0.20 - Post-Processing Benchmark (Mpix/sec, More Is Better)
  2: 41.55 (SE +/- 0.28, N = 3)
  3: 41.40 (SE +/- 0.08, N = 3)
  Linux 5.4: 41.87 (SE +/- 0.26, N = 3)
  1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
WebP Image Encode 1.1 - Encode Settings: Default (Encode Time - Seconds, Fewer Is Better)
  2: 1.501 (SE +/- 0.001, N = 3)
  3: 1.494 (SE +/- 0.001, N = 3)
  Linux 5.4: 1.503 (SE +/- 0.003, N = 3)
  1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode 1.1 - Encode Settings: Quality 100 (Encode Time - Seconds, Fewer Is Better)
  2: 2.298 (SE +/- 0.002, N = 3)
  3: 2.305 (SE +/- 0.002, N = 3)
  Linux 5.4: 2.294 (SE +/- 0.007, N = 3)
  1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless (Encode Time - Seconds, Fewer Is Better)
  2: 16.31 (SE +/- 0.03, N = 3)
  3: 16.38 (SE +/- 0.03, N = 3)
  Linux 5.4: 16.22 (SE +/- 0.03, N = 3)
  1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode 1.1 - Encode Settings: Quality 100, Highest Compression (Encode Time - Seconds, Fewer Is Better)
  2: 7.123 (SE +/- 0.009, N = 3)
  3: 7.172 (SE +/- 0.010, N = 3)
  Linux 5.4: 7.141 (SE +/- 0.015, N = 3)
  1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless, Highest Compression (Encode Time - Seconds, Fewer Is Better)
  2: 33.77 (SE +/- 0.02, N = 3)
  3: 34.13 (SE +/- 0.06, N = 3)
  Linux 5.4: 33.53 (SE +/- 0.01, N = 3)
  1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis (Seconds, Fewer Is Better)
  2: 28.46 (SE +/- 0.05, N = 4)
  3: 28.75 (SE +/- 0.07, N = 4)
  Linux 5.4: 28.58 (SE +/- 0.17, N = 4)
  1. (CC) gcc options: -O2 -std=c99
GLmark2 2020.04 - Resolution: 1920 x 1080 (Score, More Is Better)
  2: 8656
  3: 8649
  Linux 5.4: 8656
Timed HMMer Search 3.3.1 - Pfam Database Search (Seconds, Fewer Is Better)
  2: 166.11 (SE +/- 0.25, N = 3)
  3: 165.75 (SE +/- 0.30, N = 3)
  Linux 5.4: 165.00 (SE +/- 0.34, N = 3)
  1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA (Seconds, Fewer Is Better)
  2: 9.056 (SE +/- 0.042, N = 3)
  3: 9.135 (SE +/- 0.030, N = 3)
  Linux 5.4: 9.002 (SE +/- 0.056, N = 3)
  1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
LeelaChessZero 0.26 - Backend: BLAS (Nodes Per Second, More Is Better)
  2: 1505 (SE +/- 12.57, N = 3)
  3: 1474 (SE +/- 14.47, N = 3)
  Linux 5.4: 1516 (SE +/- 12.81, N = 3)
  1. (CXX) g++ options: -flto -pthread
LeelaChessZero 0.26 - Backend: Eigen (Nodes Per Second, More Is Better)
  2: 1446 (SE +/- 20.76, N = 3)
  3: 1483 (SE +/- 15.07, N = 8)
  Linux 5.4: 1472 (SE +/- 17.75, N = 3)
  1. (CXX) g++ options: -flto -pthread
LeelaChessZero 0.26 - Backend: Random (Nodes Per Second, More Is Better)
  2: 193664 (SE +/- 236.07, N = 3)
  3: 192306 (SE +/- 147.36, N = 3)
  Linux 5.4: 194138 (SE +/- 434.65, N = 3)
  1. (CXX) g++ options: -flto -pthread
Dolfyn 0.527 - Computational Fluid Dynamics (Seconds, Fewer Is Better)
  2: 17.04 (SE +/- 0.29, N = 3)
  3: 16.88 (SE +/- 0.19, N = 3)
  Linux 5.4: 16.97 (SE +/- 0.23, N = 3)
RNNoise 2020-06-28 (Seconds, Fewer Is Better)
  2: 18.77 (SE +/- 0.03, N = 3)
  3: 18.74 (SE +/- 0.07, N = 3)
  Linux 5.4: 18.64 (SE +/- 0.02, N = 3)
  1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Mobile Neural Network 2020-09-17 - Model: SqueezeNetV1.0 (ms, Fewer Is Better)
  2: 8.519 (SE +/- 0.181, N = 15; MIN: 7.66 / MAX: 11.88)
  3: 8.664 (SE +/- 0.250, N = 12; MIN: 7.6 / MAX: 12)
  Linux 5.4: 8.451 (SE +/- 0.112, N = 13; MIN: 7.67 / MAX: 10.49)
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 2020-09-17 - Model: resnet-v2-50 (ms, Fewer Is Better)
  2: 32.63 (SE +/- 0.17, N = 15; MIN: 30.5 / MAX: 35.76)
  3: 32.29 (SE +/- 0.17, N = 12; MIN: 30.04 / MAX: 35.43)
  Linux 5.4: 32.80 (SE +/- 0.21, N = 13; MIN: 30.14 / MAX: 35.82)
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 2020-09-17 - Model: MobileNetV2_224 (ms, Fewer Is Better)
  2: 5.442 (SE +/- 0.012, N = 15; MIN: 5.16 / MAX: 6.35)
  3: 5.422 (SE +/- 0.022, N = 12; MIN: 5.18 / MAX: 6.03)
  Linux 5.4: 5.465 (SE +/- 0.029, N = 13; MIN: 5.19 / MAX: 6.36)
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 2020-09-17 - Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
  2: 5.480 (SE +/- 0.022, N = 14; MIN: 5.07 / MAX: 6.39)
  3: 5.480 (SE +/- 0.021, N = 12; MIN: 5.04 / MAX: 6.09)
  Linux 5.4: 5.496 (SE +/- 0.019, N = 13; MIN: 5.09 / MAX: 6.1)
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 2020-09-17 - Model: inception-v3 (ms, Fewer Is Better)
  2: 31.72 (SE +/- 0.20, N = 15; MIN: 30 / MAX: 33.75)
  3: 31.61 (SE +/- 0.09, N = 12; MIN: 30.43 / MAX: 33.67)
  Linux 5.4: 31.88 (SE +/- 0.16, N = 13; MIN: 30.07 / MAX: 33.31)
  1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
  2: 268.81 (SE +/- 0.64, N = 3; MIN: 262.12 / MAX: 297.98)
  3: 268.09 (SE +/- 2.03, N = 3; MIN: 261.24 / MAX: 297.96)
  Linux 5.4: 270.10 (SE +/- 3.67, N = 3; MIN: 262.09 / MAX: 307.37)
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
  2: 244.17 (SE +/- 0.37, N = 3; MIN: 241.07 / MAX: 245.76)
  3: 246.78 (SE +/- 0.33, N = 3; MIN: 243.93 / MAX: 255.19)
  Linux 5.4: 245.84 (SE +/- 0.60, N = 3; MIN: 244.15 / MAX: 249.18)
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Caffe 2020-02-13 - Model: AlexNet - Acceleration: CPU - Iterations: 100 (Milli-Seconds, Fewer Is Better)
  2: 56118 (SE +/- 135.75, N = 3)
  3: 56032 (SE +/- 239.93, N = 3)
  Linux 5.4: 54629 (SE +/- 541.26, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe 2020-02-13 - Model: AlexNet - Acceleration: CPU - Iterations: 200 (Milli-Seconds, Fewer Is Better)
  2: 112360 (SE +/- 157.09, N = 3)
  3: 112011 (SE +/- 122.83, N = 3)
  Linux 5.4: 111761 (SE +/- 186.59, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe 2020-02-13 - Model: GoogleNet - Acceleration: CPU - Iterations: 100 (Milli-Seconds, Fewer Is Better)
  2: 150709 (SE +/- 53.58, N = 3)
  3: 150522 (SE +/- 476.07, N = 3)
  Linux 5.4: 150216 (SE +/- 388.21, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe 2020-02-13 - Model: GoogleNet - Acceleration: CPU - Iterations: 200 (Milli-Seconds, Fewer Is Better)
  2: 301781 (SE +/- 1840.25, N = 3)
  3: 303072 (SE +/- 72.13, N = 3)
  Linux 5.4: 301954 (SE +/- 143.11, N = 3)
  1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN 20200916 - Target: CPU - Model: squeezenet (ms, Fewer Is Better)
  2: 25.35 (SE +/- 0.24, N = 3; MIN: 23.99 / MAX: 29.99)
  3: 25.63 (SE +/- 0.08, N = 3; MIN: 24.27 / MAX: 29.59)
  Linux 5.4: 25.07 (SE +/- 0.10, N = 3; MIN: 23.9 / MAX: 29.34)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: mobilenet (ms, Fewer Is Better)
  2: 27.48 (SE +/- 0.37, N = 3; MIN: 25.29 / MAX: 35.51)
  3: 28.21 (SE +/- 0.19, N = 3; MIN: 26.7 / MAX: 32.72)
  Linux 5.4: 27.40 (SE +/- 0.43, N = 3; MIN: 25.76 / MAX: 32.7)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
  2: 13.47 (SE +/- 0.23, N = 3; MIN: 12.3 / MAX: 18.22)
  3: 13.39 (SE +/- 0.20, N = 3; MIN: 12.33 / MAX: 32.12)
  Linux 5.4: 13.12 (SE +/- 0.31, N = 3; MIN: 12.2 / MAX: 15.14)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
MPV - Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only (FPS, More Is Better)
  2: 2563.28 (SE +/- 18.75, N = 3; MIN: 1200.01 / MAX: 6000.24)
  3: 2568.83 (SE +/- 3.63, N = 3; MIN: 1333.32 / MAX: 6000.24)
  Linux 5.4: 2574.81 (SE +/- 13.10, N = 3; MIN: 1200 / MAX: 6000.24)
  1. mpv 0.32.0
MPV - Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only (FPS, More Is Better)
  2: 1037.84 (SE +/- 1.81, N = 3; MIN: 545.46 / MAX: 2000.02)
  3: 1050.73 (SE +/- 1.72, N = 3; MIN: 545.45 / MAX: 2000.04)
  Linux 5.4: 1046.24 (SE +/- 5.25, N = 3; MIN: 545.46 / MAX: 2000.04)
  1. mpv 0.32.0
NCNN 20200916 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
  2: 14.34 (SE +/- 0.23, N = 3; MIN: 12.98 / MAX: 26.26)
  3: 13.85 (SE +/- 0.10, N = 3; MIN: 13.09 / MAX: 19.5)
  Linux 5.4: 12.96 (SE +/- 0.04, N = 3; MIN: 12.63 / MAX: 14.51)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
  2: 14.34 (SE +/- 0.09, N = 3; MIN: 13.13 / MAX: 18.68)
  3: 14.12 (SE +/- 0.09, N = 3; MIN: 13.32 / MAX: 23.18)
  Linux 5.4: 13.11 (SE +/- 0.08, N = 3; MIN: 12.73 / MAX: 15.23)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: mnasnet (ms, Fewer Is Better)
  2: 14.58 (SE +/- 1.28, N = 3; MIN: 12.24 / MAX: 23.88)
  3: 12.93 (SE +/- 0.05, N = 3; MIN: 12.34 / MAX: 19.09)
  Linux 5.4: 12.37 (SE +/- 0.19, N = 3; MIN: 12.06 / MAX: 14.26)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
  2: 18.21 (SE +/- 0.25, N = 3; MIN: 16.67 / MAX: 23.54)
  3: 17.31 (SE +/- 0.08, N = 3; MIN: 16.65 / MAX: 19.75)
  Linux 5.4: 16.49 (SE +/- 0.02, N = 3; MIN: 16.29 / MAX: 24.66)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: blazeface (ms, Fewer Is Better)
  2: 6.10 (SE +/- 0.08, N = 3; MIN: 5.59 / MAX: 16.3)
  3: 5.88 (SE +/- 0.03, N = 3; MIN: 5.66 / MAX: 6.32)
  Linux 5.4: 5.51 (SE +/- 0.04, N = 3; MIN: 5.29 / MAX: 9.76)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
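For the NCNN CPU blazeface model the three runs report 5.51, 6.10 and 5.88 ms. Expressing each run as a percentage relative to a baseline makes such spreads easier to compare across tests with different units. The sketch below is only an illustration: taking "Linux 5.4" as the baseline and the helper name `percent_change` are assumptions, not part of the exported result.

```python
def percent_change(value, baseline):
    """Percentage change of value relative to baseline."""
    return (value - baseline) / baseline * 100.0


# NCNN, Target: CPU - Model: blazeface, mean latency in ms (fewer is better).
results_ms = {"Linux 5.4": 5.51, "2": 6.10, "3": 5.88}
baseline = results_ms["Linux 5.4"]  # baseline choice is an assumption

changes = {run: percent_change(ms, baseline) for run, ms in results_ms.items()}
for run, pct in changes.items():
    # A positive value means slower than the baseline for a latency metric.
    print(f"{run}: {pct:+.1f}%")
```

For "More Is Better" metrics such as FPS the sign reads the other way round: a positive change is an improvement.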
NCNN 20200916 - Target: CPU - Model: googlenet (ms, Fewer Is Better)
  2: 26.94 (SE +/- 0.42, N = 3; MIN: 25.48 / MAX: 29.13)
  3: 26.31 (SE +/- 0.20, N = 3; MIN: 25.44 / MAX: 33.58)
  Linux 5.4: 25.59 (SE +/- 0.27, N = 3; MIN: 24.94 / MAX: 42.73)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: vgg16 (ms, Fewer Is Better)
  2: 52.14 (SE +/- 1.04, N = 3; MIN: 48.42 / MAX: 60.84)
  3: 52.12 (SE +/- 0.64, N = 3; MIN: 49.36 / MAX: 63.52)
  Linux 5.4: 52.69 (SE +/- 0.56, N = 3; MIN: 50.22 / MAX: 58.21)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
  2: 18.23 (SE +/- 0.34, N = 3; MIN: 17.31 / MAX: 23.72)
  3: 18.47 (SE +/- 0.05, N = 3; MIN: 17.93 / MAX: 19.6)
  Linux 5.4: 18.63 (SE +/- 0.04, N = 3; MIN: 17.97 / MAX: 24.19)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
  2: 11.75 (SE +/- 0.09, N = 3; MIN: 11.17 / MAX: 23.47)
  3: 12.17 (SE +/- 0.19, N = 3; MIN: 11.23 / MAX: 17.3)
  Linux 5.4: 12.16 (SE +/- 0.27, N = 3; MIN: 11.37 / MAX: 14.77)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: resnet50 (ms, Fewer Is Better)
  2: 36.39 (SE +/- 0.52, N = 3; MIN: 34.7 / MAX: 41.24)
  3: 37.34 (SE +/- 0.64, N = 3; MIN: 35.19 / MAX: 51.27)
  Linux 5.4: 36.78 (SE +/- 0.49, N = 3; MIN: 34.95 / MAX: 41.54)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
  2: 35.05 (SE +/- 0.37, N = 3; MIN: 34.11 / MAX: 40.67)
  3: 35.66 (SE +/- 0.35, N = 3; MIN: 34.66 / MAX: 38.72)
  Linux 5.4: 36.40 (SE +/- 0.03, N = 3; MIN: 35.85 / MAX: 41.14)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: squeezenet (ms, Fewer Is Better)
  2: 5.70 (SE +/- 0.02, N = 3; MIN: 5.48 / MAX: 9.78)
  3: 5.74 (SE +/- 0.01, N = 3; MIN: 5.49 / MAX: 11.91)
  Linux 5.4: 5.69 (SE +/- 0.01, N = 3; MIN: 5.51 / MAX: 6.13)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
  2: 9.53 (SE +/- 0.04, N = 3; MIN: 8.35 / MAX: 36.02)
  3: 9.71 (SE +/- 0.13, N = 3; MIN: 8.39 / MAX: 40.44)
  Linux 5.4: 9.68 (SE +/- 0.09, N = 3; MIN: 8.37 / MAX: 29.59)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
  2: 4.53 (SE +/- 0.00, N = 3; MIN: 4.25 / MAX: 5.75)
  3: 4.53 (SE +/- 0.00, N = 3; MIN: 4.23 / MAX: 5.62)
  Linux 5.4: 4.54 (SE +/- 0.00, N = 3; MIN: 4.26 / MAX: 7.47)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
  2: 8.71 (SE +/- 0.20, N = 3; MIN: 7.4 / MAX: 31.92)
  3: 8.71 (SE +/- 0.19, N = 3; MIN: 7.43 / MAX: 30.58)
  Linux 5.4: 8.72 (SE +/- 0.01, N = 3; MIN: 7.38 / MAX: 31.16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
  2: 3.55 (SE +/- 0.11, N = 3; MIN: 3.33 / MAX: 22.77)
  3: 3.43 (SE +/- 0.00, N = 3; MIN: 3.34 / MAX: 3.77)
  Linux 5.4: 3.43 (SE +/- 0.00, N = 3; MIN: 3.35 / MAX: 3.68)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
  2: 4.60 (SE +/- 0.01, N = 3; MIN: 4.35 / MAX: 4.94)
  3: 4.61 (SE +/- 0.01, N = 3; MIN: 4.35 / MAX: 5.07)
  Linux 5.4: 4.61 (SE +/- 0.00, N = 3; MIN: 4.36 / MAX: 4.94)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
  2: 11.46 (SE +/- 0.03, N = 3; MIN: 10.41 / MAX: 42.66)
  3: 11.50 (SE +/- 0.21, N = 3; MIN: 10.38 / MAX: 33.2)
  Linux 5.4: 11.53 (SE +/- 0.22, N = 3; MIN: 10.48 / MAX: 36.8)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
  2: 1.06 (SE +/- 0.01, N = 3; MIN: 1.01 / MAX: 1.56)
  3: 1.08 (SE +/- 0.01, N = 3; MIN: 1.01 / MAX: 1.45)
  Linux 5.4: 1.09 (SE +/- 0.01, N = 3; MIN: 1.01 / MAX: 1.56)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
  2: 8.50 (SE +/- 0.15, N = 3; MIN: 7.1 / MAX: 29.7)
  3: 8.46 (SE +/- 0.05, N = 3; MIN: 7.12 / MAX: 27.97)
  Linux 5.4: 8.45 (SE +/- 0.21, N = 3; MIN: 7.12 / MAX: 32.39)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
  2: 84.12 (SE +/- 0.14, N = 3; MIN: 66.71 / MAX: 108.36)
  3: 83.47 (SE +/- 0.10, N = 3; MIN: 66.56 / MAX: 110.05)
  Linux 5.4: 83.73 (SE +/- 0.45, N = 3; MIN: 66.8 / MAX: 112.01)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
  2: 3.31 (SE +/- 0.01, N = 3; MIN: 3.22 / MAX: 5.09)
  3: 3.30 (SE +/- 0.01, N = 3; MIN: 3.22 / MAX: 4.97)
  Linux 5.4: 3.30 (SE +/- 0.01, N = 3; MIN: 3.23 / MAX: 6.06)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
  2: 30.27 (SE +/- 0.72, N = 3; MIN: 25.44 / MAX: 55.96)
  3: 30.02 (SE +/- 0.32, N = 3; MIN: 25.34 / MAX: 56.39)
  Linux 5.4: 30.49 (SE +/- 0.57, N = 3; MIN: 25.2 / MAX: 56.82)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
  2: 9.82 (SE +/- 0.12, N = 3; MIN: 9 / MAX: 31.97)
  3: 9.58 (SE +/- 0.09, N = 3; MIN: 8.97 / MAX: 23.04)
  Linux 5.4: 9.92 (SE +/- 0.34, N = 3; MIN: 9.02 / MAX: 30.94)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20200916 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
  2: 14.37 (SE +/- 0.06, N = 3; MIN: 12.27 / MAX: 32.22)
  3: 14.39 (SE +/- 0.05, N = 3; MIN: 12.33 / MAX: 39.9)
  Linux 5.4: 14.46 (SE +/- 0.10, N = 3; MIN: 12.36 / MAX: 40.61)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Mlpack Benchmark - Benchmark: scikit_ica (Seconds, Fewer Is Better)
  2: 51.92 (SE +/- 0.45, N = 3)
  3: 52.39 (SE +/- 0.21, N = 3)
  Linux 5.4: 51.50 (SE +/- 0.17, N = 3)
Mlpack Benchmark - Benchmark: scikit_qda (Seconds, Fewer Is Better)
  2: 42.43 (SE +/- 0.36, N = 3)
  3: 42.79 (SE +/- 0.14, N = 3)
  Linux 5.4: 42.09 (SE +/- 0.29, N = 3)
Mlpack Benchmark - Benchmark: scikit_svm (Seconds, Fewer Is Better)
  2: 20.91 (SE +/- 0.04, N = 3)
  3: 20.91 (SE +/- 0.07, N = 3)
  Linux 5.4: 20.90 (SE +/- 0.04, N = 3)
Mlpack Benchmark - Benchmark: scikit_linearridgeregression (Seconds, Fewer Is Better)
  2: 1.62 (SE +/- 0.01, N = 3)
  3: 1.65 (SE +/- 0.02, N = 3)
  Linux 5.4: 1.61 (SE +/- 0.02, N = 3)
GROMACS 2020.3 - Water Benchmark (Ns Per Day, More Is Better)
  2: 3.754 (SE +/- 0.005, N = 3)
  3: 3.753 (SE +/- 0.006, N = 3)
  Linux 5.4: 3.761 (SE +/- 0.005, N = 3)
  1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
LAMMPS Molecular Dynamics Simulator 24Aug2020 - Model: 20k Atoms (ns/day, More Is Better)
  2: 26.29 (SE +/- 0.11, N = 3)
  3: 26.18 (SE +/- 0.09, N = 3)
  Linux 5.4: 26.56 (SE +/- 0.06, N = 3)
  1. (CXX) g++ options: -O3 -pthread -lm
LAMMPS Molecular Dynamics Simulator 24Aug2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
  2: 23.34 (SE +/- 0.35, N = 3)
  3: 23.89 (SE +/- 0.34, N = 15)
  Linux 5.4: 23.68 (SE +/- 0.40, N = 3)
  1. (CXX) g++ options: -O3 -pthread -lm
NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  2: 0.43837 (SE +/- 0.00080, N = 3)
  3: 0.44425 (SE +/- 0.00402, N = 3)
  Linux 5.4: 0.43908 (SE +/- 0.00089, N = 3)
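NAMD reports days/ns (fewer is better), whereas the GROMACS and LAMMPS results report ns/day (more is better). The two units are reciprocals, so the NAMD figures can be put on a more-is-better scale as sketched below. The function name is hypothetical, and the converted values are not comparable across the three packages because each one simulates a different system.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Convert a days/ns figure (fewer is better) to ns/day (more is better)."""
    return 1.0 / days_per_ns


# NAMD 2.14, ATPase Simulation - 327,506 Atoms, days/ns from the result above.
namd_days_per_ns = {"2": 0.43837, "3": 0.44425, "Linux 5.4": 0.43908}

ns_per_day = {
    run: days_per_ns_to_ns_per_day(value)
    for run, value in namd_days_per_ns.items()
}
for run, value in ns_per_day.items():
    print(f"{run}: {value:.4f} ns/day")
```

The ranking is preserved under the conversion: the run with the lowest days/ns has the highest ns/day.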
OpenVINO Model: Face Detection 0106 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP16 - Device: CPU 2 3 Linux 5.4 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 9.64 9.65 9.65 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO Model: Face Detection 0106 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP16 - Device: CPU 2 3 Linux 5.4 700 1400 2100 2800 3500 SE +/- 21.19, N = 3 SE +/- 16.95, N = 3 SE +/- 7.33, N = 3 3286.54 3284.11 3294.43 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO Model: Face Detection 0106 FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP32 - Device: CPU 2 3 Linux 5.4 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 9.56 9.60 9.58 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Face Detection 0106 FP32 - Device: CPU: ms, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 23.91, 7.03, 15.60 (N = 3 each). Results: 3316.90, 3307.26, 3293.71. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Person Detection 0106 FP16 - Device: CPU: FPS, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.04, 0.04, 0.04 (N = 3 each). Results: 6.64, 6.60, 6.68. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Person Detection 0106 FP16 - Device: CPU: ms, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 16.66, 29.15, 34.97 (N = 3 each). Results: 4683.32, 4721.76, 4662.58. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Person Detection 0106 FP32 - Device: CPU: FPS, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.03, 0.02, 0.02 (N = 3 each). Results: 6.52, 6.51, 6.50. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Person Detection 0106 FP32 - Device: CPU: ms, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 26.72, 16.09, 11.05 (N = 3 each). Results: 4752.26, 4782.63, 4799.77. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU: FPS, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 207.95, 175.53, 76.15 (N = 3 each). Results: 33277.84, 33230.50, 33390.81. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU: ms, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.01, 0.01, 0.00 (N = 3 each). Results: 0.93, 0.93, 0.93. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU: FPS, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 275.04, 193.36, 147.89 (N = 3 each). Results: 32931.77, 32913.00, 33014.26. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU: ms, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.01, 0.01, 0.00 (N = 3 each). Results: 0.94, 0.94, 0.94. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread
FFTE 7.0 - N=256, 3D Complex FFT Routine: MFLOPS, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 813.33, 201.66, 375.92 (N = 3 each). Results: 127987.82, 128693.18, 129902.32. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Kripke 1.2.4: Throughput FoM, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 81815.46, 132345.53, 151580.08 (N = 3 each). Results: 44873723, 44476553, 44930063. (CXX) g++ options: -O3 -fopenmp
Monte Carlo Simulations of Ionised Nebulae 2019-03-24 - Input: Dust 2D tau100.0: Seconds, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 1.00, 1.00, 1.00 (N = 3 each). Results: 229, 230, 228. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
Incompact3D 2020-09-17 - Input: Cylinder: Seconds, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 1.34, 1.01, 0.90 (N = 3 each). Results: 183.69, 184.30, 183.00. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
GPAW 20.1 - Input: Carbon Nanotube: Seconds, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.16, 0.09, 0.21 (N = 3 each). Results: 110.65, 110.58, 110.69. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi
Timed LLVM Compilation 10.0 - Time To Compile: Seconds, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.91, 0.41, 1.18 (N = 3 each). Results: 203.64, 203.35, 203.22.
AOM AV1 2.0 - Encoder Mode: Speed 0 Two-Pass: Frames Per Second, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.00, 0.00, 0.00 (N = 3 each). Results: 0.33, 0.33, 0.33. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 2.0 - Encoder Mode: Speed 4 Two-Pass: Frames Per Second, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.00, 0.00, 0.00 (N = 3 each). Results: 2.51, 2.53, 2.53. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 2.0 - Encoder Mode: Speed 6 Realtime: Frames Per Second, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.03, 0.08, 0.06 (N = 3 each). Results: 18.12, 17.86, 18.09. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 2.0 - Encoder Mode: Speed 6 Two-Pass: Frames Per Second, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.00, 0.00, 0.00 (N = 3 each). Results: 3.90, 3.91, 3.90. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 2.0 - Encoder Mode: Speed 8 Realtime: Frames Per Second, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.06, 0.04, 0.04 (N = 3 each). Results: 34.49, 34.69, 34.83. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
RealSR-NCNN 20200818 - Scale: 4x - TAA: No: Seconds, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.03, 0.00, 0.22 (N = 3 each). Results: 16.54, 16.58, 16.88.
VkFFT 2020-09-29: Benchmark Score, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 4.16, 2.85, 39.00 (N = 3 each). Results: 20538, 20532, 20490.
Apache CouchDB 3.1.1 - Bulk Size: 100 - Inserts: 1000 - Rounds: 24: Seconds, Fewer Is Better. Runs: 2, 3, Linux 5.4. SE +/- 0.48, 1.12, 1.01 (N = 3 each). Results: 118.57, 117.93, 118.09. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
InfluxDB 1.8.2 - Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000: val/sec, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 2893.07, 1149.41, 2154.85 (N = 3 each). Results: 1146038.5, 1144571.8, 1148311.0.
InfluxDB 1.8.2 - Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000: val/sec, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 2189.38, 2142.12, 898.05 (N = 3 each). Results: 1501667.1, 1495506.9, 1498935.7.
InfluxDB 1.8.2 - Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000: val/sec, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 3242.42, 5690.48, 2382.01 (N = 3 each). Results: 1539068.1, 1534264.5, 1547196.9.
KeyDB 6.0.16: Ops/sec, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 3068.12, 5879.33, 7622.83 (N = 3 each). Results: 440299.41, 447704.19, 450800.42. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
BYTE Unix Benchmark 3.6 - Computational Test: Dhrystone 2: LPS, More Is Better. Runs: 2, 3, Linux 5.4. SE +/- 631231.56, 365672.33, 97131.78 (N = 3 each). Results: 43503603.1, 44247620.1, 42607098.5.
Phoronix Test Suite v10.8.4