Xeon E3-1245 v5 Ubuntu Linux 5.4 Intel Xeon E3-1245 v5 testing with a MSI C236A WORKSTATION (MS-7998) v1.0 (2.90 BIOS) and MSI Intel HD P530 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2010041-FI-XEONE312446&grt&rdt .
Xeon E3-1245 v5 Ubuntu Linux 5.4 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution Vulkan Linux 5.4 Linux 5.8 Linux 5.9 Git Intel Xeon E3-1245 v5 @ 3.90GHz (4 Cores / 8 Threads) MSI C236A WORKSTATION (MS-7998) v1.0 (2.90 BIOS) Intel Xeon E3-1200 v5/E3-1500 32GB 120GB Samsung SSD 850 MSI Intel HD P530 3GB (1150MHz) Realtek ALC1150 LG Ultra HD Intel I219-V Ubuntu 20.04 5.4.0-37-generic (x86_64) GNOME Shell 3.36.2 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.4 GCC 9.3.0 ext4 3840x2160 5.8.13-050813-generic (x86_64) 1.2.131 5.9.0-050900rc7daily20201003-generic (x86_64) 20201002 GNOME Shell 3.36.4 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xdc Python Details - Python 3.8.2 Security Details - Linux 5.4: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable - Linux 5.8: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable - Linux 5.9 Git: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Xeon E3-1245 v5 Ubuntu Linux 5.4 ai-benchmark: Device Inference Score ai-benchmark: Device Training Score ai-benchmark: Device AI Score aom-av1: Speed 0 Two-Pass aom-av1: Speed 4 Two-Pass aom-av1: Speed 6 Realtime aom-av1: Speed 6 Two-Pass aom-av1: Speed 8 Realtime couchdb: 100 - 1000 - 24 astcenc: Fast astcenc: Medium astcenc: Thorough astcenc: Exhaustive blender: BMW27 - CPU-Only blender: Fishy Cat - CPU-Only byte: Dhrystone 2 caffe: AlexNet - CPU - 100 caffe: AlexNet - CPU - 200 caffe: GoogleNet - CPU - 100 caffe: GoogleNet - CPU - 200 crafty: Elapsed Time deepspeech: CPU dolfyn: Computational Fluid Dynamics ecp-candle: P1B2 ecp-candle: P3B1 ecp-candle: P3B2 espeak: Text-To-Speech Synthesis ffte: N=256, 3D Complex FFT Routine glmark2: 1920 x 1080 glmark2: 3840 x 2160 gpaw: Carbon Nanotube gromacs: Water Benchmark hint: FLOAT hugin: Panorama Photo Assistant + Stitching Time incompact3d: Cylinder influxdb: 4 - 10000 - 2,5000,1 - 10000 influxdb: 64 - 10000 - 2,5000,1 - 10000 influxdb: 1024 - 10000 - 2,5000,1 - 10000 keydb: kripke: lammps: Rhodopsin Protein lczero: BLAS lczero: Eigen avifenc: 0 avifenc: 2 avifenc: 8 avifenc: 10 libraw: Post-Processing Benchmark mlpack: scikit_ica mlpack: scikit_qda mlpack: scikit_svm mlpack: scikit_linearridgeregression mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 montage: Mosaic of M17, K band, 1.5 deg x 1.5 deg mocassin: Dust 2D tau100.0 mpv: Big Buck Bunny Sunflower 4K - Software Only mpv: Big Buck Bunny Sunflower 1080p - Software Only namd: ATPase Simulation - 327,506 Atoms ncnn: CPU - squeezenet ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ocrmypdf: Processing 60 Page PDF Document onednn: IP Batch 1D - f32 - CPU onednn: IP Batch All - f32 - CPU onednn: IP Batch 1D - u8s8f32 - CPU onednn: IP Batch All - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch deconv_1d - f32 - CPU onednn: Deconvolution Batch deconv_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch deconv_1d - u8s8f32 - CPU onednn: Deconvolution Batch deconv_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU realsr-ncnn: 4x - No rnnoise: rodinia: OpenMP LavaMD rodinia: OpenMP HotSpot3D rodinia: OpenMP Leukocyte rodinia: OpenMP CFD Solver rodinia: OpenMP Streamcluster system-decompress-gzip: tensorflow-lite: SqueezeNet tensorflow-lite: Inception V4 tensorflow-lite: NASNet Mobile tensorflow-lite: Mobilenet Float tensorflow-lite: Mobilenet Quant tensorflow-lite: Inception ResNet V2 tesseract-ocr: Time To OCR 7 Images build-apache: Time To Compile build-gdb: Time To Compile hmmer: Pfam Database Search build-linux-kernel: Time To Compile build-llvm: Time To Compile mafft: Multiple Sequence Alignment - LSU RNA tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 tscp: AI Chess Performance vkfft: webp: Default webp: Quality 100 webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 compress-zstd: 3 compress-zstd: 19 Linux 5.4 Linux 5.8 Linux 5.9 Git 664 733 1397 0.22 1.88 14.99 3.00 35.82 175.791 9.02 11.14 72.26 578.17 382.67 532.49 38538549.1 66942 133823 164645 328888 7547320 78.51331 21.425 46.128 1212.787 590.047 33.933 20328.495177544 543 135 598.320 0.504 381569301.12653 73.463 750.917623 870964.9 990095.1 996459.9 393603.00 17835730 2.746 797 717 182.456 108.057 8.199 7.600 27.66 71.10 88.35 27.86 4.29 10.255 56.164 5.601 7.748 62.308 89.559 305 32.10 139.45 3.87433 24.88 28.94 7.79 6.53 4.67 6.66 10.51 2.17 23.08 89.35 23.09 22.56 46.50 39.48 52.13 46.43 14.72 16.32 10.11 15.11 30.86 2.63 42.16 214.21 36.80 57.94 87.11 89.34 55.538 8.17233 111.115 3.77996 49.8609 19.9426 9.25837 14.3651 18.7529 10.3947 7.41923 569.397 285.961 5.15748 7.16575 17.241 27.870 570.414 118.303 254.675 47.393 28.734 3.282 499694 7204167 366947 339412 348999 6517420 27.447 32.732 159.604 125.430 199.635 1493.222 12.268 366.297 344.444 1141277 1141 1.741 2.728 19.361 8.148 47.466 50.735 2386.0 20.5 663 733 1396 0.22 1.87 14.99 2.99 35.57 157.303 9.12 11.14 72.32 578.30 382.25 531.91 38513841.4 67067 133870 165139 330435 7469298 80.60955 21.476 45.575 1226.039 598.284 34.257 20258.596219821 502 119 597.093 0.504 381270686.12418 74.056 754.293904 814462.2 984645.3 993437.4 381234.65 17866357 2.758 678 701 182.788 108.403 8.257 7.639 27.65 68.13 84.76 28.12 4.24 10.373 56.934 5.638 7.788 62.931 90.697 304 32.22 139.88 3.88624 24.76 28.85 7.82 6.53 4.66 6.61 10.44 2.14 23.03 88.95 23.11 22.12 46.45 39.36 43.80 38.59 12.69 14 10.06 13 26.02 2.35 35.48 206.48 31.34 51.30 75.25 79.45 55.393 8.37706 114.236 3.90177 50.9712 19.8862 9.73579 15.1450 18.8285 10.3074 7.42274 548.044 270.665 5.04980 7.19867 283.500 28.130 568.094 118.459 256.916 47.671 28.730 3.339 499845 7204273 366914 339372 348976 6518137 27.639 32.712 157.323 125.495 199.789 1490.327 12.291 366.393 346.500 1134693 1161 1.750 2.742 19.333 8.184 47.260 50.618 2389.3 20.4 665 733 1398 0.22 1.88 14.99 2.99 35.65 161.474 9.07 11.15 72.37 578.22 381.93 532.18 38169338.8 66888 133774 165181 330664 7302256 80.04749 21.537 45.976 1229.33 592.267 34.588 20335.309333098 552 137 602.790 0.502 381134357.52245 74.650 754.269389 805704.5 982092.1 994952.9 377603.40 17848217 2.758 779 719 183.225 108.956 8.316 7.678 27.55 70.06 86.12 27.87 4.33 10.396 57.044 5.605 7.800 62.561 91.033 304 32.28 139.75 3.88703 24.84 28.84 7.78 6.52 4.66 6.62 10.45 2.15 23.07 88.82 23.05 22.41 46.46 39.45 43.80 38.17 12.68 13.99 11.09 13 26.02 2.29 35.47 205.15 31.36 49.78 75.27 79.33 55.509 8.40703 110.716 3.84716 50.2447 19.8822 9.82786 14.7150 18.7935 10.1850 7.40944 558.863 260.964 5.04492 7.20203 283.462 28.088 566.532 118.977 258.772 47.631 28.703 3.355 499825 7202907 367078 339434 349438 6518793 27.567 33.008 160.636 125.660 200.948 1493.850 12.291 366.631 346.140 1131430 1161 1.749 2.756 19.214 8.161 47.190 51.026 2389.0 20.4 OpenBenchmarking.org
AI Benchmark Alpha Device Inference Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device Inference Score Linux 5.4 Linux 5.8 Linux 5.9 Git 140 280 420 560 700 664 663 665
AI Benchmark Alpha Device Training Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device Training Score Linux 5.4 Linux 5.8 Linux 5.9 Git 160 320 480 640 800 733 733 733
AI Benchmark Alpha Device AI Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device AI Score Linux 5.4 Linux 5.8 Linux 5.9 Git 300 600 900 1200 1500 1397 1396 1398
AOM AV1 Encoder Mode: Speed 0 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 0 Two-Pass Linux 5.4 Linux 5.8 Linux 5.9 Git 0.0495 0.099 0.1485 0.198 0.2475 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.22 0.22 0.22 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 4 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 4 Two-Pass Linux 5.4 Linux 5.8 Linux 5.9 Git 0.423 0.846 1.269 1.692 2.115 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.88 1.87 1.88 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Realtime Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 14.99 14.99 14.99 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Two-Pass Linux 5.4 Linux 5.8 Linux 5.9 Git 0.675 1.35 2.025 2.7 3.375 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.00 2.99 2.99 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 8 Realtime Linux 5.4 Linux 5.8 Linux 5.9 Git 8 16 24 32 40 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 35.82 35.57 35.65 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 Linux 5.4 Linux 5.8 Linux 5.9 Git 40 80 120 160 200 SE +/- 1.34, N = 3 SE +/- 2.35, N = 4 SE +/- 2.13, N = 5 175.79 157.30 161.47 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
ASTC Encoder Preset: Fast OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 9.02 9.12 9.07 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 11.14 11.14 11.15 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough Linux 5.4 Linux 5.8 Linux 5.9 Git 16 32 48 64 80 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 72.26 72.32 72.37 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 SE +/- 0.03, N = 3 SE +/- 0.22, N = 3 SE +/- 0.09, N = 3 578.17 578.30 578.22 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: CPU-Only Linux 5.4 Linux 5.8 Linux 5.9 Git 80 160 240 320 400 SE +/- 0.56, N = 3 SE +/- 1.11, N = 3 SE +/- 0.53, N = 3 382.67 382.25 381.93
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: CPU-Only Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 SE +/- 0.85, N = 3 SE +/- 0.55, N = 3 SE +/- 0.81, N = 3 532.49 531.91 532.18
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 Linux 5.4 Linux 5.8 Linux 5.9 Git 8M 16M 24M 32M 40M SE +/- 54862.86, N = 3 SE +/- 6470.73, N = 3 SE +/- 156718.13, N = 3 38538549.1 38513841.4 38169338.8
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 Linux 5.4 Linux 5.8 Linux 5.9 Git 14K 28K 42K 56K 70K SE +/- 46.52, N = 3 SE +/- 13.57, N = 3 SE +/- 46.28, N = 3 66942 67067 66888 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 Linux 5.4 Linux 5.8 Linux 5.9 Git 30K 60K 90K 120K 150K SE +/- 12.01, N = 3 SE +/- 43.43, N = 3 SE +/- 97.27, N = 3 133823 133870 133774 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 Linux 5.4 Linux 5.8 Linux 5.9 Git 40K 80K 120K 160K 200K SE +/- 195.74, N = 3 SE +/- 11.41, N = 3 SE +/- 205.69, N = 3 164645 165139 165181 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 Linux 5.4 Linux 5.8 Linux 5.9 Git 70K 140K 210K 280K 350K SE +/- 211.61, N = 3 SE +/- 96.33, N = 3 SE +/- 315.26, N = 3 328888 330435 330664 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time Linux 5.4 Linux 5.8 Linux 5.9 Git 1.6M 3.2M 4.8M 6.4M 8M SE +/- 35007.79, N = 3 SE +/- 23889.21, N = 3 SE +/- 21668.21, N = 3 7547320 7469298 7302256 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
DeepSpeech Acceleration: CPU OpenBenchmarking.org Seconds, Fewer Is Better DeepSpeech 0.6 Acceleration: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.51, N = 3 SE +/- 0.53, N = 3 78.51 80.61 80.05
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 21.43 21.48 21.54
ECP-CANDLE Benchmark: P1B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.3 Benchmark: P1B2 Linux 5.4 Linux 5.8 Linux 5.9 Git 10 20 30 40 50 46.13 45.58 45.98
ECP-CANDLE Benchmark: P3B1 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.3 Benchmark: P3B1 Linux 5.4 Linux 5.8 Linux 5.9 Git 300 600 900 1200 1500 1212.79 1226.04 1229.33
ECP-CANDLE Benchmark: P3B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.3 Benchmark: P3B2 Linux 5.4 Linux 5.8 Linux 5.9 Git 130 260 390 520 650 590.05 598.28 592.27
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis Linux 5.4 Linux 5.8 Linux 5.9 Git 8 16 24 32 40 SE +/- 0.22, N = 16 SE +/- 0.20, N = 4 SE +/- 0.13, N = 4 33.93 34.26 34.59 1. (CC) gcc options: -O2 -std=c99
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine Linux 5.4 Linux 5.8 Linux 5.9 Git 4K 8K 12K 16K 20K SE +/- 95.32, N = 3 SE +/- 59.99, N = 3 SE +/- 64.79, N = 3 20328.50 20258.60 20335.31 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
GLmark2 Resolution: 1920 x 1080 OpenBenchmarking.org Score, More Is Better GLmark2 2020.04 Resolution: 1920 x 1080 Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 543 502 552
GLmark2 Resolution: 3840 x 2160 OpenBenchmarking.org Score, More Is Better GLmark2 2020.04 Resolution: 3840 x 2160 Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 135 119 137
GPAW Input: Carbon Nanotube OpenBenchmarking.org Seconds, Fewer Is Better GPAW 20.1 Input: Carbon Nanotube Linux 5.4 Linux 5.8 Linux 5.9 Git 130 260 390 520 650 SE +/- 0.74, N = 3 SE +/- 0.45, N = 3 SE +/- 4.33, N = 3 598.32 597.09 602.79 1. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark Linux 5.4 Linux 5.8 Linux 5.9 Git 0.1134 0.2268 0.3402 0.4536 0.567 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 0.504 0.504 0.502 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT Linux 5.4 Linux 5.8 Linux 5.9 Git 80M 160M 240M 320M 400M SE +/- 1362924.61, N = 3 SE +/- 293442.33, N = 3 SE +/- 715889.13, N = 3 381569301.13 381270686.12 381134357.52 1. (CC) gcc options: -O3 -march=native -lm
Hugin Panorama Photo Assistant + Stitching Time OpenBenchmarking.org Seconds, Fewer Is Better Hugin Panorama Photo Assistant + Stitching Time Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.48, N = 3 SE +/- 0.25, N = 3 SE +/- 0.42, N = 3 73.46 74.06 74.65
Incompact3D Input: Cylinder OpenBenchmarking.org Seconds, Fewer Is Better Incompact3D 2020-09-17 Input: Cylinder Linux 5.4 Linux 5.8 Linux 5.9 Git 160 320 480 640 800 SE +/- 1.52, N = 3 SE +/- 0.11, N = 3 SE +/- 0.28, N = 3 750.92 754.29 754.27 1. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Linux 5.4 Linux 5.8 Linux 5.9 Git 200K 400K 600K 800K 1000K SE +/- 11259.18, N = 12 SE +/- 19326.59, N = 12 SE +/- 21169.01, N = 12 870964.9 814462.2 805704.5
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Linux 5.4 Linux 5.8 Linux 5.9 Git 200K 400K 600K 800K 1000K SE +/- 1019.70, N = 3 SE +/- 2768.93, N = 3 SE +/- 4935.24, N = 3 990095.1 984645.3 982092.1
InfluxDB Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Linux 5.4 Linux 5.8 Linux 5.9 Git 200K 400K 600K 800K 1000K SE +/- 1038.31, N = 3 SE +/- 553.23, N = 3 SE +/- 2049.30, N = 3 996459.9 993437.4 994952.9
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 Linux 5.4 Linux 5.8 Linux 5.9 Git 80K 160K 240K 320K 400K SE +/- 767.36, N = 3 SE +/- 387.75, N = 3 SE +/- 359.00, N = 3 393603.00 381234.65 377603.40 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 Linux 5.4 Linux 5.8 Linux 5.9 Git 4M 8M 12M 16M 20M SE +/- 15901.03, N = 3 SE +/- 59668.65, N = 3 SE +/- 85458.77, N = 3 17835730 17866357 17848217 1. (CXX) g++ options: -O3 -fopenmp
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 24Aug2020 Model: Rhodopsin Protein Linux 5.4 Linux 5.8 Linux 5.9 Git 0.6206 1.2412 1.8618 2.4824 3.103 SE +/- 0.004, N = 3 SE +/- 0.005, N = 3 SE +/- 0.007, N = 3 2.746 2.758 2.758 1. (CXX) g++ options: -O3 -pthread -lm
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS Linux 5.4 Linux 5.8 Linux 5.9 Git 200 400 600 800 1000 SE +/- 0.67, N = 3 SE +/- 9.24, N = 3 SE +/- 2.33, N = 3 797 678 779 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen Linux 5.4 Linux 5.8 Linux 5.9 Git 160 320 480 640 800 SE +/- 5.46, N = 3 SE +/- 10.51, N = 9 SE +/- 3.28, N = 3 717 701 719 1. (CXX) g++ options: -flto -pthread
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.7.3 Encoder Speed: 0 Linux 5.4 Linux 5.8 Linux 5.9 Git 40 80 120 160 200 SE +/- 0.69, N = 3 SE +/- 0.73, N = 3 SE +/- 0.02, N = 3 182.46 182.79 183.23 1. (CXX) g++ options: -O3 -fPIC
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.7.3 Encoder Speed: 2 Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.16, N = 3 SE +/- 0.10, N = 3 SE +/- 0.18, N = 3 108.06 108.40 108.96 1. (CXX) g++ options: -O3 -fPIC
libavif avifenc Encoder Speed: 8 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.7.3 Encoder Speed: 8 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.013, N = 3 SE +/- 0.022, N = 3 SE +/- 0.007, N = 3 8.199 8.257 8.316 1. (CXX) g++ options: -O3 -fPIC
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.7.3 Encoder Speed: 10 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.017, N = 3 SE +/- 0.030, N = 3 SE +/- 0.031, N = 3 7.600 7.639 7.678 1. (CXX) g++ options: -O3 -fPIC
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 27.66 27.65 27.55 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica Linux 5.4 Linux 5.8 Linux 5.9 Git 16 32 48 64 80 SE +/- 0.60, N = 3 SE +/- 0.91, N = 3 SE +/- 0.43, N = 3 71.10 68.13 70.06
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 1.98, N = 12 SE +/- 1.29, N = 3 SE +/- 1.27, N = 12 88.35 84.76 86.12
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 0.00, N = 3 27.86 28.12 27.87
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression Linux 5.4 Linux 5.8 Linux 5.9 Git 0.9743 1.9486 2.9229 3.8972 4.8715 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 4 4.29 4.24 4.33
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: SqueezeNetV1.0 Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 10.26 10.37 10.40 MIN: 10.11 / MAX: 16.05 MIN: 10.27 / MAX: 31.91 MIN: 10.23 / MAX: 33.26 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: resnet-v2-50 Linux 5.4 Linux 5.8 Linux 5.9 Git 13 26 39 52 65 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 SE +/- 0.07, N = 3 56.16 56.93 57.04 MIN: 55.4 / MAX: 107.43 MIN: 56.62 / MAX: 80.52 MIN: 56.76 / MAX: 79.83 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: MobileNetV2_224 Linux 5.4 Linux 5.8 Linux 5.9 Git 1.2686 2.5372 3.8058 5.0744 6.343 SE +/- 0.019, N = 3 SE +/- 0.018, N = 3 SE +/- 0.037, N = 3 5.601 5.638 5.605 MIN: 5.47 / MAX: 9.61 MIN: 5.53 / MAX: 13.75 MIN: 5.45 / MAX: 10.27 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: mobilenet-v1-1.0 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.009, N = 3 SE +/- 0.014, N = 3 SE +/- 0.003, N = 3 7.748 7.788 7.800 MIN: 7.67 / MAX: 22.21 MIN: 7.73 / MAX: 12.65 MIN: 7.74 / MAX: 27.87 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: inception-v3 Linux 5.4 Linux 5.8 Linux 5.9 Git 14 28 42 56 70 SE +/- 0.15, N = 3 SE +/- 0.20, N = 3 SE +/- 0.64, N = 3 62.31 62.93 62.56 MIN: 61.37 / MAX: 84.45 MIN: 62.1 / MAX: 89.18 MIN: 60.97 / MAX: 89.35 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Montage Astronomical Image Mosaic Engine Mosaic of M17, K band, 1.5 deg x 1.5 deg OpenBenchmarking.org Seconds, Fewer Is Better Montage Astronomical Image Mosaic Engine 6.0 Mosaic of M17, K band, 1.5 deg x 1.5 deg Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.18, N = 3 89.56 90.70 91.03 1. (CC) gcc options: -std=gnu99 -lcfitsio -lm -O2
Monte Carlo Simulations of Ionised Nebulae Input: Dust 2D tau100.0 OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 Linux 5.4 Linux 5.8 Linux 5.9 Git 70 140 210 280 350 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 305 304 304 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 32.10 32.22 32.28 MIN: 31.41 / MAX: 32.79 MIN: 31.25 / MAX: 33.33 MIN: 31.58 / MAX: 33.49 1. mpv 0.32.0
MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 139.45 139.88 139.75 MIN: 115.16 / MAX: 146.34 MIN: 123.1 / MAX: 146.35 MIN: 119 / MAX: 146.35 1. mpv 0.32.0
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Linux 5.4 Linux 5.8 Linux 5.9 Git 0.8746 1.7492 2.6238 3.4984 4.373 SE +/- 0.00123, N = 3 SE +/- 0.00476, N = 3 SE +/- 0.00417, N = 3 3.87433 3.88624 3.88703
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet Linux 5.4 Linux 5.8 Linux 5.9 Git 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 24.88 24.76 24.84 MIN: 24.67 / MAX: 25.89 MIN: 24.65 / MAX: 36.01 MIN: 24.64 / MAX: 36.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 28.94 28.85 28.84 MIN: 28.7 / MAX: 32.85 MIN: 28.7 / MAX: 41.24 MIN: 28.67 / MAX: 41.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 7.79 7.82 7.78 MIN: 7.67 / MAX: 9.99 MIN: 7.68 / MAX: 20.52 MIN: 7.67 / MAX: 9.94 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 6.53 6.53 6.52 MIN: 6.41 / MAX: 9.17 MIN: 6.43 / MAX: 9.7 MIN: 6.42 / MAX: 8.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 1.0508 2.1016 3.1524 4.2032 5.254 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 4.67 4.66 4.66 MIN: 4.59 / MAX: 8.01 MIN: 4.61 / MAX: 6.54 MIN: 4.59 / MAX: 6.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.66 6.61 6.62 MIN: 6.56 / MAX: 20.23 MIN: 6.56 / MAX: 8.52 MIN: 6.57 / MAX: 8.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 10.51 10.44 10.45 MIN: 10.41 / MAX: 11 MIN: 10.4 / MAX: 10.76 MIN: 10.4 / MAX: 13.27 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface Linux 5.4 Linux 5.8 Linux 5.9 Git 0.4883 0.9766 1.4649 1.9532 2.4415 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.17 2.14 2.15 MIN: 2.13 / MAX: 2.29 MIN: 2.13 / MAX: 2.24 MIN: 2.13 / MAX: 2.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet Linux 5.4 Linux 5.8 Linux 5.9 Git 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 23.08 23.03 23.07 MIN: 22.97 / MAX: 26.36 MIN: 22.94 / MAX: 23.49 MIN: 22.94 / MAX: 35.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 89.35 88.95 88.82 MIN: 89.05 / MAX: 94.32 MIN: 88.67 / MAX: 137.54 MIN: 88.53 / MAX: 101.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 Linux 5.4 Linux 5.8 Linux 5.9 Git 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 23.09 23.11 23.05 MIN: 22.97 / MAX: 26.84 MIN: 22.91 / MAX: 35.18 MIN: 22.9 / MAX: 25.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.22, N = 3 SE +/- 0.01, N = 3 SE +/- 0.28, N = 3 22.56 22.12 22.41 MIN: 22.06 / MAX: 39.49 MIN: 22.04 / MAX: 22.28 MIN: 22.05 / MAX: 59.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 46.50 46.45 46.46 MIN: 46.31 / MAX: 58.67 MIN: 46.3 / MAX: 49.28 MIN: 46.33 / MAX: 51.2 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny Linux 5.4 Linux 5.8 Linux 5.9 Git 9 18 27 36 45 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 39.48 39.36 39.45 MIN: 39.28 / MAX: 53.47 MIN: 39.23 / MAX: 42.19 MIN: 39.23 / MAX: 42.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet Linux 5.4 Linux 5.8 Linux 5.9 Git 12 24 36 48 60 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 52.13 43.80 43.80 MIN: 50.27 / MAX: 53.67 MIN: 43.21 / MAX: 44.65 MIN: 43.39 / MAX: 43.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.43, N = 3 SE +/- 0.25, N = 3 SE +/- 0.24, N = 3 46.43 38.59 38.17 MIN: 44.53 / MAX: 65.27 MIN: 36.59 / MAX: 45.66 MIN: 35.95 / MAX: 45.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 14.72 12.69 12.68 MIN: 14.5 / MAX: 15.08 MIN: 11.98 / MAX: 13.56 MIN: 12.28 / MAX: 12.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 16.32 14.00 13.99 MIN: 16.25 / MAX: 16.42 MIN: 13.24 / MAX: 14.29 MIN: 13.58 / MAX: 14.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.30, N = 3 SE +/- 0.52, N = 3 10.11 10.06 11.09 MIN: 9.97 / MAX: 10.55 MIN: 9.19 / MAX: 10.9 MIN: 10.03 / MAX: 13.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.01, N = 3 15.11 13.00 13.00 MIN: 14.91 / MAX: 15.25 MIN: 12.57 / MAX: 13.06 MIN: 12.53 / MAX: 14.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 30.86 26.02 26.02 MIN: 29.36 / MAX: 35.2 MIN: 25.28 / MAX: 26.55 MIN: 25.31 / MAX: 27.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface Linux 5.4 Linux 5.8 Linux 5.9 Git 0.5918 1.1836 1.7754 2.3672 2.959 SE +/- 0.01, N = 3 SE +/- 0.15, N = 3 SE +/- 0.03, N = 3 2.63 2.35 2.29 MIN: 2.59 / MAX: 2.65 MIN: 2.17 / MAX: 2.76 MIN: 2.14 / MAX: 2.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet Linux 5.4 Linux 5.8 Linux 5.9 Git 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 42.16 35.48 35.47 MIN: 38.96 / MAX: 43.23 MIN: 34.99 / MAX: 35.72 MIN: 34.79 / MAX: 35.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 Linux 5.4 Linux 5.8 Linux 5.9 Git 50 100 150 200 250 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.22, N = 3 214.21 206.48 205.15 MIN: 212.41 / MAX: 218.75 MIN: 204.27 / MAX: 209.41 MIN: 203.28 / MAX: 208.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 Linux 5.4 Linux 5.8 Linux 5.9 Git 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 36.80 31.34 31.36 MIN: 36.41 / MAX: 37.2 MIN: 30.87 / MAX: 31.48 MIN: 30.73 / MAX: 31.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet Linux 5.4 Linux 5.8 Linux 5.9 Git 13 26 39 52 65 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 SE +/- 0.88, N = 3 57.94 51.30 49.78 MIN: 55.09 / MAX: 61.52 MIN: 48.82 / MAX: 54.66 MIN: 46.19 / MAX: 53.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 87.11 75.25 75.27 MIN: 84.69 / MAX: 89 MIN: 74.67 / MAX: 76.95 MIN: 74.66 / MAX: 75.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 89.34 79.45 79.33 MIN: 78.43 / MAX: 101.68 MIN: 78.55 / MAX: 81.51 MIN: 78.43 / MAX: 80.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OCRMyPDF Processing 60 Page PDF Document OpenBenchmarking.org Seconds, Fewer Is Better OCRMyPDF 9.6.0+dfsg Processing 60 Page PDF Document Linux 5.4 Linux 5.8 Linux 5.9 Git 12 24 36 48 60 SE +/- 0.14, N = 3 SE +/- 0.13, N = 3 SE +/- 0.20, N = 3 55.54 55.39 55.51
oneDNN Harness: IP Batch 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.00538, N = 3 SE +/- 0.03078, N = 3 SE +/- 0.03775, N = 3 8.17233 8.37706 8.40703 MIN: 7.82 MIN: 8 MIN: 7.97 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Batch All - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 SE +/- 0.29, N = 3 SE +/- 1.51, N = 3 SE +/- 0.10, N = 3 111.12 114.24 110.72 MIN: 109.55 MIN: 110.66 MIN: 109.51 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 0.8779 1.7558 2.6337 3.5116 4.3895 SE +/- 0.00729, N = 3 SE +/- 0.00186, N = 3 SE +/- 0.00590, N = 3 3.77996 3.90177 3.84716 MIN: 3.67 MIN: 3.79 MIN: 3.73 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.01, N = 3 SE +/- 0.17, N = 3 SE +/- 0.15, N = 3 49.86 50.97 50.24 MIN: 49.38 MIN: 50.11 MIN: 49.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 19.94 19.89 19.88 MIN: 19.8 MIN: 19.73 MIN: 19.73 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.00752, N = 3 SE +/- 0.08955, N = 3 SE +/- 0.10703, N = 3 9.25837 9.73579 9.82786 MIN: 9.19 MIN: 9.52 MIN: 9.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 14.37 15.15 14.72 MIN: 14.12 MIN: 14.79 MIN: 14.45 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 18.75 18.83 18.79 MIN: 18.38 MIN: 18.6 MIN: 18.35 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.01, N = 3 10.39 10.31 10.19 MIN: 10.15 MIN: 10.1 MIN: 10.07 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.02123, N = 3 SE +/- 0.01758, N = 3 SE +/- 0.00534, N = 3 7.41923 7.42274 7.40944 MIN: 7.35 MIN: 7.35 MIN: 7.3 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 SE +/- 0.15, N = 3 SE +/- 0.78, N = 3 SE +/- 3.18, N = 3 569.40 548.04 558.86 MIN: 568.18 MIN: 545.86 MIN: 552.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 60 120 180 240 300 SE +/- 0.67, N = 3 SE +/- 3.43, N = 3 SE +/- 2.39, N = 15 285.96 270.67 260.96 MIN: 283.43 MIN: 261.84 MIN: 247.57 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 1.1604 2.3208 3.4812 4.6416 5.802 SE +/- 0.00807, N = 3 SE +/- 0.01544, N = 3 SE +/- 0.00900, N = 3 5.15748 5.04980 5.04492 MIN: 5.07 MIN: 4.96 MIN: 4.96 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.00597, N = 3 SE +/- 0.01858, N = 3 SE +/- 0.01959, N = 3 7.16575 7.19867 7.20203 MIN: 7.05 MIN: 7.05 MIN: 7.07 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No Linux 5.4 Linux 5.8 Linux 5.9 Git 60 120 180 240 300 SE +/- 0.24, N = 4 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 17.24 283.50 283.46
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.07, N = 3 SE +/- 0.18, N = 3 SE +/- 0.07, N = 3 27.87 28.13 28.09 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 SE +/- 2.84, N = 3 SE +/- 1.01, N = 3 SE +/- 0.14, N = 3 570.41 568.09 566.53 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 SE +/- 0.05, N = 3 SE +/- 0.30, N = 3 SE +/- 0.30, N = 3 118.30 118.46 118.98 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Linux 5.4 Linux 5.8 Linux 5.9 Git 60 120 180 240 300 SE +/- 0.36, N = 3 SE +/- 0.33, N = 3 SE +/- 1.15, N = 3 254.68 256.92 258.77 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 47.39 47.67 47.63 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 28.73 28.73 28.70 1. (CXX) g++ options: -O2 -lOpenCL
System GZIP Decompression OpenBenchmarking.org Seconds, Fewer Is Better System GZIP Decompression Linux 5.4 Linux 5.8 Linux 5.9 Git 0.7549 1.5098 2.2647 3.0196 3.7745 SE +/- 0.001, N = 3 SE +/- 0.035, N = 14 SE +/- 0.046, N = 13 3.282 3.339 3.355
TensorFlow Lite Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: SqueezeNet Linux 5.4 Linux 5.8 Linux 5.9 Git 110K 220K 330K 440K 550K SE +/- 34.51, N = 3 SE +/- 53.76, N = 3 SE +/- 63.00, N = 3 499694 499845 499825
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception V4 Linux 5.4 Linux 5.8 Linux 5.9 Git 1.5M 3M 4.5M 6M 7.5M SE +/- 250.95, N = 3 SE +/- 132.46, N = 3 SE +/- 580.35, N = 3 7204167 7204273 7202907
TensorFlow Lite Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: NASNet Mobile Linux 5.4 Linux 5.8 Linux 5.9 Git 80K 160K 240K 320K 400K SE +/- 60.60, N = 3 SE +/- 108.52, N = 3 SE +/- 77.03, N = 3 366947 366914 367078
TensorFlow Lite Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Float Linux 5.4 Linux 5.8 Linux 5.9 Git 70K 140K 210K 280K 350K SE +/- 59.09, N = 3 SE +/- 39.28, N = 3 SE +/- 48.17, N = 3 339412 339372 339434
TensorFlow Lite Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Quant Linux 5.4 Linux 5.8 Linux 5.9 Git 70K 140K 210K 280K 350K SE +/- 72.59, N = 3 SE +/- 64.01, N = 3 SE +/- 102.05, N = 3 348999 348976 349438
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 Linux 5.4 Linux 5.8 Linux 5.9 Git 1.4M 2.8M 4.2M 5.6M 7M SE +/- 634.06, N = 3 SE +/- 88.19, N = 3 SE +/- 862.41, N = 3 6517420 6518137 6518793
Tesseract OCR Time To OCR 7 Images OpenBenchmarking.org Seconds, Fewer Is Better Tesseract OCR 4.1.1 Time To OCR 7 Images Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 27.45 27.64 27.57
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile Linux 5.4 Linux 5.8 Linux 5.9 Git 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 32.73 32.71 33.01
Timed GDB GNU Debugger Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GDB GNU Debugger Compilation 9.1 Time To Compile Linux 5.4 Linux 5.8 Linux 5.9 Git 40 80 120 160 200 SE +/- 0.20, N = 3 SE +/- 0.25, N = 3 SE +/- 0.30, N = 3 159.60 157.32 160.64
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 125.43 125.50 125.66 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.4 Time To Compile Linux 5.4 Linux 5.8 Linux 5.9 Git 40 80 120 160 200 SE +/- 0.78, N = 3 SE +/- 0.59, N = 3 SE +/- 0.79, N = 3 199.64 199.79 200.95
Timed LLVM Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 10.0 Time To Compile Linux 5.4 Linux 5.8 Linux 5.9 Git 300 600 900 1200 1500 SE +/- 7.38, N = 3 SE +/- 0.97, N = 3 SE +/- 0.27, N = 3 1493.22 1490.33 1493.85
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 12.27 12.29 12.29 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 80 160 240 320 400 SE +/- 0.47, N = 3 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 366.30 366.39 366.63 MIN: 364.37 / MAX: 374.6 MIN: 365.21 / MAX: 367.87 MIN: 365.56 / MAX: 368.1 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 Linux 5.4 Linux 5.8 Linux 5.9 Git 80 160 240 320 400 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.40, N = 3 344.44 346.50 346.14 MIN: 343.7 / MAX: 346.67 MIN: 345.83 / MAX: 347.67 MIN: 344.07 / MAX: 348.21 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance Linux 5.4 Linux 5.8 Linux 5.9 Git 200K 400K 600K 800K 1000K SE +/- 1206.41, N = 5 SE +/- 1191.39, N = 5 SE +/- 1137.79, N = 5 1141277 1134693 1131430 1. (CC) gcc options: -O3 -march=native
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 2020-09-29 Linux 5.4 Linux 5.8 Linux 5.9 Git 200 400 600 800 1000 SE +/- 0.67, N = 3 1141 1161 1161
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default Linux 5.4 Linux 5.8 Linux 5.9 Git 0.3938 0.7876 1.1814 1.5752 1.969 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 1.741 1.750 1.749 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 Linux 5.4 Linux 5.8 Linux 5.9 Git 0.6201 1.2402 1.8603 2.4804 3.1005 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.005, N = 3 2.728 2.742 2.756 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 19.36 19.33 19.21 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.009, N = 3 SE +/- 0.009, N = 3 SE +/- 0.015, N = 3 8.148 8.184 8.161 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.14, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 47.47 47.26 47.19 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
XZ Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 Linux 5.4 Linux 5.8 Linux 5.9 Git 12 24 36 48 60 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 SE +/- 0.37, N = 3 50.74 50.62 51.03 1. (CC) gcc options: -pthread -fvisibility=hidden -O2
Zstd Compression Compression Level: 3 OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.5 Compression Level: 3 Linux 5.4 Linux 5.8 Linux 5.9 Git 500 1000 1500 2000 2500 SE +/- 6.19, N = 3 SE +/- 1.97, N = 3 SE +/- 1.34, N = 3 2386.0 2389.3 2389.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.5 Compression Level: 19 Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.07, N = 3 20.5 20.4 20.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
Phoronix Test Suite v10.8.4