Xeon E3-1245 v5 Ubuntu Linux 5.4 Intel Xeon E3-1245 v5 testing with a MSI C236A WORKSTATION (MS-7998) v1.0 (2.90 BIOS) and MSI Intel HD P530 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2010041-FI-XEONE312446&sro&grr .
Xeon E3-1245 v5 Ubuntu Linux 5.4 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution Vulkan Linux 5.4 Linux 5.8 Linux 5.9 Git Intel Xeon E3-1245 v5 @ 3.90GHz (4 Cores / 8 Threads) MSI C236A WORKSTATION (MS-7998) v1.0 (2.90 BIOS) Intel Xeon E3-1200 v5/E3-1500 32GB 120GB Samsung SSD 850 MSI Intel HD P530 3GB (1150MHz) Realtek ALC1150 LG Ultra HD Intel I219-V Ubuntu 20.04 5.4.0-37-generic (x86_64) GNOME Shell 3.36.2 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.4 GCC 9.3.0 ext4 3840x2160 5.8.13-050813-generic (x86_64) 1.2.131 5.9.0-050900rc7daily20201003-generic (x86_64) 20201002 GNOME Shell 3.36.4 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xdc Python Details - Python 3.8.2 Security Details - Linux 5.4: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable - Linux 5.8: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable - Linux 5.9 Git: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Xeon E3-1245 v5 Ubuntu Linux 5.4 build-llvm: Time To Compile incompact3d: Cylinder gpaw: Carbon Nanotube lczero: Eigen astcenc: Exhaustive rodinia: OpenMP LavaMD blender: Fishy Cat - CPU-Only influxdb: 4 - 10000 - 2,5000,1 - 10000 ai-benchmark: Device AI Score ai-benchmark: Device Training Score ai-benchmark: Device Inference Score mlpack: scikit_qda kripke: ecp-candle: P3B1 blender: BMW27 - CPU-Only vkfft: lczero: BLAS gromacs: Water Benchmark caffe: GoogleNet - CPU - 200 mocassin: Dust 2D tau100.0 mpv: Big Buck Bunny Sunflower 4K - Software Only rodinia: OpenMP Leukocyte couchdb: 100 - 1000 - 24 build-linux-kernel: Time To Compile ecp-candle: P3B2 realsr-ncnn: 4x - No namd: ATPase Simulation - 327,506 Atoms hint: FLOAT avifenc: 0 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU - squeezenet caffe: GoogleNet - CPU - 100 tensorflow-lite: Inception ResNet V2 build-gdb: Time To Compile mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 tensorflow-lite: Inception V4 caffe: AlexNet - CPU - 200 hmmer: Pfam Database Search compress-zstd: 19 byte: Dhrystone 2 rodinia: OpenMP HotSpot3D glmark2: 3840 x 2160 glmark2: 1920 x 1080 avifenc: 2 influxdb: 64 - 10000 - 2,5000,1 - 10000 influxdb: 1024 - 10000 - 2,5000,1 - 10000 ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet ncnn: CPU - squeezenet espeak: Text-To-Speech Synthesis montage: Mosaic of M17, K band, 1.5 deg x 1.5 deg onednn: Recurrent Neural Network Inference - f32 - CPU mlpack: scikit_linearridgeregression astcenc: Thorough hugin: Panorama Photo Assistant + Stitching Time mlpack: scikit_ica keydb: deepspeech: CPU caffe: AlexNet - CPU - 100 mpv: Big Buck Bunny Sunflower 1080p - Software Only tensorflow-lite: NASNet Mobile tensorflow-lite: SqueezeNet tensorflow-lite: Mobilenet Float tensorflow-lite: Mobilenet Quant onednn: IP Batch All - u8s8f32 - CPU onednn: IP Batch All - f32 - CPU ocrmypdf: Processing 60 Page PDF Document compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 webp: Quality 100, Lossless, Highest Compression rodinia: OpenMP CFD Solver aom-av1: Speed 0 Two-Pass libraw: Post-Processing Benchmark aom-av1: Speed 6 Realtime onednn: Recurrent Neural Network Training - f32 - CPU aom-av1: Speed 6 Two-Pass build-apache: Time To Compile mlpack: scikit_svm compress-zstd: 3 rodinia: OpenMP Streamcluster rnnoise: tesseract-ocr: Time To OCR 7 Images crafty: Elapsed Time tnn: CPU - MobileNet v2 aom-av1: Speed 4 Two-Pass tnn: CPU - SqueezeNet v1.1 dolfyn: Computational Fluid Dynamics onednn: Deconvolution Batch deconv_1d - u8s8f32 - CPU onednn: Deconvolution Batch deconv_1d - f32 - CPU webp: Quality 100, Lossless aom-av1: Speed 8 Realtime astcenc: Medium ecp-candle: P1B2 onednn: IP Batch 1D - f32 - CPU onednn: IP Batch 1D - u8s8f32 - CPU mafft: Multiple Sequence Alignment - LSU RNA onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU system-decompress-gzip: astcenc: Fast webp: Quality 100, Highest Compression avifenc: 8 avifenc: 10 lammps: Rhodopsin Protein onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU ffte: N=256, 3D Complex FFT Routine onednn: Deconvolution Batch deconv_3d - u8s8f32 - CPU onednn: Deconvolution Batch deconv_3d - f32 - CPU webp: Quality 100 tscp: AI Chess Performance webp: Default Linux 5.4 Linux 5.8 Linux 5.9 Git 1493.222 750.917623 598.320 717 578.17 570.414 532.49 870964.9 1397 733 664 88.35 17835730 1212.787 382.67 1141 797 0.504 328888 305 32.10 254.675 175.791 199.635 590.047 17.241 3.87433 381569301.12653 182.456 89.34 87.11 57.94 36.80 214.21 42.16 2.63 30.86 15.11 10.11 16.32 14.72 46.43 52.13 164645 6517420 159.604 62.308 7.748 5.601 56.164 10.255 7204167 133823 125.430 20.5 38538549.1 118.303 135 543 108.057 990095.1 996459.9 39.48 46.50 22.56 23.09 89.35 23.08 2.17 10.51 6.66 4.67 6.53 7.79 28.94 24.88 33.933 89.559 285.961 4.29 72.26 73.463 71.10 393603.00 78.51331 66942 139.45 366947 499694 339412 348999 49.8609 111.115 55.538 50.735 47.466 47.393 0.22 27.66 14.99 569.397 3.00 32.732 27.86 2386.0 28.734 27.870 27.447 7547320 366.297 1.88 344.444 21.425 10.3947 9.25837 19.361 35.82 11.14 46.128 8.17233 3.77996 12.268 5.15748 7.16575 3.282 9.02 8.148 8.199 7.600 2.746 18.7529 19.9426 20328.495177544 7.41923 14.3651 2.728 1141277 1.741 1490.327 754.293904 597.093 701 578.30 568.094 531.91 814462.2 1396 733 663 84.76 17866357 1226.039 382.25 1161 678 0.504 330435 304 32.22 256.916 157.303 199.789 598.284 283.500 3.88624 381270686.12418 182.788 79.45 75.25 51.30 31.34 206.48 35.48 2.35 26.02 13 10.06 14 12.69 38.59 43.80 165139 6518137 157.323 62.931 7.788 5.638 56.934 10.373 7204273 133870 125.495 20.4 38513841.4 118.459 119 502 108.403 984645.3 993437.4 39.36 46.45 22.12 23.11 88.95 23.03 2.14 10.44 6.61 4.66 6.53 7.82 28.85 24.76 34.257 90.697 270.665 4.24 72.32 74.056 68.13 381234.65 80.60955 67067 139.88 366914 499845 339372 348976 50.9712 114.236 55.393 50.618 47.260 47.671 0.22 27.65 14.99 548.044 2.99 32.712 28.12 2389.3 28.730 28.130 27.639 7469298 366.393 1.87 346.500 21.476 10.3074 9.73579 19.333 35.57 11.14 45.575 8.37706 3.90177 12.291 5.04980 7.19867 3.339 9.12 8.184 8.257 7.639 2.758 18.8285 19.8862 20258.596219821 7.42274 15.1450 2.742 1134693 1.750 1493.850 754.269389 602.790 719 578.22 566.532 532.18 805704.5 1398 733 665 86.12 17848217 1229.33 381.93 1161 779 0.502 330664 304 32.28 258.772 161.474 200.948 592.267 283.462 3.88703 381134357.52245 183.225 79.33 75.27 49.78 31.36 205.15 35.47 2.29 26.02 13 11.09 13.99 12.68 38.17 43.80 165181 6518793 160.636 62.561 7.800 5.605 57.044 10.396 7202907 133774 125.660 20.4 38169338.8 118.977 137 552 108.956 982092.1 994952.9 39.45 46.46 22.41 23.05 88.82 23.07 2.15 10.45 6.62 4.66 6.52 7.78 28.84 24.84 34.588 91.033 260.964 4.33 72.37 74.650 70.06 377603.40 80.04749 66888 139.75 367078 499825 339434 349438 50.2447 110.716 55.509 51.026 47.190 47.631 0.22 27.55 14.99 558.863 2.99 33.008 27.87 2389.0 28.703 28.088 27.567 7302256 366.631 1.88 346.140 21.537 10.1850 9.82786 19.214 35.65 11.15 45.976 8.40703 3.84716 12.291 5.04492 7.20203 3.355 9.07 8.161 8.316 7.678 2.758 18.7935 19.8822 20335.309333098 7.40944 14.7150 2.756 1131430 1.749 OpenBenchmarking.org
Timed LLVM Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 10.0 Time To Compile Linux 5.4 Linux 5.8 Linux 5.9 Git 300 600 900 1200 1500 SE +/- 7.38, N = 3 SE +/- 0.97, N = 3 SE +/- 0.27, N = 3 1493.22 1490.33 1493.85
Incompact3D Input: Cylinder OpenBenchmarking.org Seconds, Fewer Is Better Incompact3D 2020-09-17 Input: Cylinder Linux 5.4 Linux 5.8 Linux 5.9 Git 160 320 480 640 800 SE +/- 1.52, N = 3 SE +/- 0.11, N = 3 SE +/- 0.28, N = 3 750.92 754.29 754.27 1. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
GPAW Input: Carbon Nanotube OpenBenchmarking.org Seconds, Fewer Is Better GPAW 20.1 Input: Carbon Nanotube Linux 5.4 Linux 5.8 Linux 5.9 Git 130 260 390 520 650 SE +/- 0.74, N = 3 SE +/- 0.45, N = 3 SE +/- 4.33, N = 3 598.32 597.09 602.79 1. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen Linux 5.4 Linux 5.8 Linux 5.9 Git 160 320 480 640 800 SE +/- 5.46, N = 3 SE +/- 10.51, N = 9 SE +/- 3.28, N = 3 717 701 719 1. (CXX) g++ options: -flto -pthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 SE +/- 0.03, N = 3 SE +/- 0.22, N = 3 SE +/- 0.09, N = 3 578.17 578.30 578.22 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 SE +/- 2.84, N = 3 SE +/- 1.01, N = 3 SE +/- 0.14, N = 3 570.41 568.09 566.53 1. (CXX) g++ options: -O2 -lOpenCL
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: CPU-Only Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 SE +/- 0.85, N = 3 SE +/- 0.55, N = 3 SE +/- 0.81, N = 3 532.49 531.91 532.18
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Linux 5.4 Linux 5.8 Linux 5.9 Git 200K 400K 600K 800K 1000K SE +/- 11259.18, N = 12 SE +/- 19326.59, N = 12 SE +/- 21169.01, N = 12 870964.9 814462.2 805704.5
AI Benchmark Alpha Device AI Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device AI Score Linux 5.4 Linux 5.8 Linux 5.9 Git 300 600 900 1200 1500 1397 1396 1398
AI Benchmark Alpha Device Training Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device Training Score Linux 5.4 Linux 5.8 Linux 5.9 Git 160 320 480 640 800 733 733 733
AI Benchmark Alpha Device Inference Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device Inference Score Linux 5.4 Linux 5.8 Linux 5.9 Git 140 280 420 560 700 664 663 665
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 1.98, N = 12 SE +/- 1.29, N = 3 SE +/- 1.27, N = 12 88.35 84.76 86.12
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 Linux 5.4 Linux 5.8 Linux 5.9 Git 4M 8M 12M 16M 20M SE +/- 15901.03, N = 3 SE +/- 59668.65, N = 3 SE +/- 85458.77, N = 3 17835730 17866357 17848217 1. (CXX) g++ options: -O3 -fopenmp
ECP-CANDLE Benchmark: P3B1 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.3 Benchmark: P3B1 Linux 5.4 Linux 5.8 Linux 5.9 Git 300 600 900 1200 1500 1212.79 1226.04 1229.33
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: CPU-Only Linux 5.4 Linux 5.8 Linux 5.9 Git 80 160 240 320 400 SE +/- 0.56, N = 3 SE +/- 1.11, N = 3 SE +/- 0.53, N = 3 382.67 382.25 381.93
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 2020-09-29 Linux 5.4 Linux 5.8 Linux 5.9 Git 200 400 600 800 1000 SE +/- 0.67, N = 3 1141 1161 1161
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS Linux 5.4 Linux 5.8 Linux 5.9 Git 200 400 600 800 1000 SE +/- 0.67, N = 3 SE +/- 9.24, N = 3 SE +/- 2.33, N = 3 797 678 779 1. (CXX) g++ options: -flto -pthread
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark Linux 5.4 Linux 5.8 Linux 5.9 Git 0.1134 0.2268 0.3402 0.4536 0.567 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 0.504 0.504 0.502 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 Linux 5.4 Linux 5.8 Linux 5.9 Git 70K 140K 210K 280K 350K SE +/- 211.61, N = 3 SE +/- 96.33, N = 3 SE +/- 315.26, N = 3 328888 330435 330664 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Monte Carlo Simulations of Ionised Nebulae Input: Dust 2D tau100.0 OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 Linux 5.4 Linux 5.8 Linux 5.9 Git 70 140 210 280 350 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 305 304 304 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 32.10 32.22 32.28 MIN: 31.41 / MAX: 32.79 MIN: 31.25 / MAX: 33.33 MIN: 31.58 / MAX: 33.49 1. mpv 0.32.0
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Linux 5.4 Linux 5.8 Linux 5.9 Git 60 120 180 240 300 SE +/- 0.36, N = 3 SE +/- 0.33, N = 3 SE +/- 1.15, N = 3 254.68 256.92 258.77 1. (CXX) g++ options: -O2 -lOpenCL
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 Linux 5.4 Linux 5.8 Linux 5.9 Git 40 80 120 160 200 SE +/- 1.34, N = 3 SE +/- 2.35, N = 4 SE +/- 2.13, N = 5 175.79 157.30 161.47 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.4 Time To Compile Linux 5.4 Linux 5.8 Linux 5.9 Git 40 80 120 160 200 SE +/- 0.78, N = 3 SE +/- 0.59, N = 3 SE +/- 0.79, N = 3 199.64 199.79 200.95
ECP-CANDLE Benchmark: P3B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.3 Benchmark: P3B2 Linux 5.4 Linux 5.8 Linux 5.9 Git 130 260 390 520 650 590.05 598.28 592.27
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No Linux 5.4 Linux 5.8 Linux 5.9 Git 60 120 180 240 300 SE +/- 0.24, N = 4 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 17.24 283.50 283.46
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Linux 5.4 Linux 5.8 Linux 5.9 Git 0.8746 1.7492 2.6238 3.4984 4.373 SE +/- 0.00123, N = 3 SE +/- 0.00476, N = 3 SE +/- 0.00417, N = 3 3.87433 3.88624 3.88703
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT Linux 5.4 Linux 5.8 Linux 5.9 Git 80M 160M 240M 320M 400M SE +/- 1362924.61, N = 3 SE +/- 293442.33, N = 3 SE +/- 715889.13, N = 3 381569301.13 381270686.12 381134357.52 1. (CC) gcc options: -O3 -march=native -lm
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.7.3 Encoder Speed: 0 Linux 5.4 Linux 5.8 Linux 5.9 Git 40 80 120 160 200 SE +/- 0.69, N = 3 SE +/- 0.73, N = 3 SE +/- 0.02, N = 3 182.46 182.79 183.23 1. (CXX) g++ options: -O3 -fPIC
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 89.34 79.45 79.33 MIN: 78.43 / MAX: 101.68 MIN: 78.55 / MAX: 81.51 MIN: 78.43 / MAX: 80.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 87.11 75.25 75.27 MIN: 84.69 / MAX: 89 MIN: 74.67 / MAX: 76.95 MIN: 74.66 / MAX: 75.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet Linux 5.4 Linux 5.8 Linux 5.9 Git 13 26 39 52 65 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 SE +/- 0.88, N = 3 57.94 51.30 49.78 MIN: 55.09 / MAX: 61.52 MIN: 48.82 / MAX: 54.66 MIN: 46.19 / MAX: 53.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 Linux 5.4 Linux 5.8 Linux 5.9 Git 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 36.80 31.34 31.36 MIN: 36.41 / MAX: 37.2 MIN: 30.87 / MAX: 31.48 MIN: 30.73 / MAX: 31.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 Linux 5.4 Linux 5.8 Linux 5.9 Git 50 100 150 200 250 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.22, N = 3 214.21 206.48 205.15 MIN: 212.41 / MAX: 218.75 MIN: 204.27 / MAX: 209.41 MIN: 203.28 / MAX: 208.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet Linux 5.4 Linux 5.8 Linux 5.9 Git 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 42.16 35.48 35.47 MIN: 38.96 / MAX: 43.23 MIN: 34.99 / MAX: 35.72 MIN: 34.79 / MAX: 35.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface Linux 5.4 Linux 5.8 Linux 5.9 Git 0.5918 1.1836 1.7754 2.3672 2.959 SE +/- 0.01, N = 3 SE +/- 0.15, N = 3 SE +/- 0.03, N = 3 2.63 2.35 2.29 MIN: 2.59 / MAX: 2.65 MIN: 2.17 / MAX: 2.76 MIN: 2.14 / MAX: 2.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 30.86 26.02 26.02 MIN: 29.36 / MAX: 35.2 MIN: 25.28 / MAX: 26.55 MIN: 25.31 / MAX: 27.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.01, N = 3 15.11 13.00 13.00 MIN: 14.91 / MAX: 15.25 MIN: 12.57 / MAX: 13.06 MIN: 12.53 / MAX: 14.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.30, N = 3 SE +/- 0.52, N = 3 10.11 10.06 11.09 MIN: 9.97 / MAX: 10.55 MIN: 9.19 / MAX: 10.9 MIN: 10.03 / MAX: 13.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 16.32 14.00 13.99 MIN: 16.25 / MAX: 16.42 MIN: 13.24 / MAX: 14.29 MIN: 13.58 / MAX: 14.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 14.72 12.69 12.68 MIN: 14.5 / MAX: 15.08 MIN: 11.98 / MAX: 13.56 MIN: 12.28 / MAX: 12.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.43, N = 3 SE +/- 0.25, N = 3 SE +/- 0.24, N = 3 46.43 38.59 38.17 MIN: 44.53 / MAX: 65.27 MIN: 36.59 / MAX: 45.66 MIN: 35.95 / MAX: 45.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet Linux 5.4 Linux 5.8 Linux 5.9 Git 12 24 36 48 60 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 52.13 43.80 43.80 MIN: 50.27 / MAX: 53.67 MIN: 43.21 / MAX: 44.65 MIN: 43.39 / MAX: 43.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 Linux 5.4 Linux 5.8 Linux 5.9 Git 40K 80K 120K 160K 200K SE +/- 195.74, N = 3 SE +/- 11.41, N = 3 SE +/- 205.69, N = 3 164645 165139 165181 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 Linux 5.4 Linux 5.8 Linux 5.9 Git 1.4M 2.8M 4.2M 5.6M 7M SE +/- 634.06, N = 3 SE +/- 88.19, N = 3 SE +/- 862.41, N = 3 6517420 6518137 6518793
Timed GDB GNU Debugger Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GDB GNU Debugger Compilation 9.1 Time To Compile Linux 5.4 Linux 5.8 Linux 5.9 Git 40 80 120 160 200 SE +/- 0.20, N = 3 SE +/- 0.25, N = 3 SE +/- 0.30, N = 3 159.60 157.32 160.64
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: inception-v3 Linux 5.4 Linux 5.8 Linux 5.9 Git 14 28 42 56 70 SE +/- 0.15, N = 3 SE +/- 0.20, N = 3 SE +/- 0.64, N = 3 62.31 62.93 62.56 MIN: 61.37 / MAX: 84.45 MIN: 62.1 / MAX: 89.18 MIN: 60.97 / MAX: 89.35 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: mobilenet-v1-1.0 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.009, N = 3 SE +/- 0.014, N = 3 SE +/- 0.003, N = 3 7.748 7.788 7.800 MIN: 7.67 / MAX: 22.21 MIN: 7.73 / MAX: 12.65 MIN: 7.74 / MAX: 27.87 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: MobileNetV2_224 Linux 5.4 Linux 5.8 Linux 5.9 Git 1.2686 2.5372 3.8058 5.0744 6.343 SE +/- 0.019, N = 3 SE +/- 0.018, N = 3 SE +/- 0.037, N = 3 5.601 5.638 5.605 MIN: 5.47 / MAX: 9.61 MIN: 5.53 / MAX: 13.75 MIN: 5.45 / MAX: 10.27 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: resnet-v2-50 Linux 5.4 Linux 5.8 Linux 5.9 Git 13 26 39 52 65 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 SE +/- 0.07, N = 3 56.16 56.93 57.04 MIN: 55.4 / MAX: 107.43 MIN: 56.62 / MAX: 80.52 MIN: 56.76 / MAX: 79.83 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2020-09-17 Model: SqueezeNetV1.0 Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 10.26 10.37 10.40 MIN: 10.11 / MAX: 16.05 MIN: 10.27 / MAX: 31.91 MIN: 10.23 / MAX: 33.26 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception V4 Linux 5.4 Linux 5.8 Linux 5.9 Git 1.5M 3M 4.5M 6M 7.5M SE +/- 250.95, N = 3 SE +/- 132.46, N = 3 SE +/- 580.35, N = 3 7204167 7204273 7202907
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 Linux 5.4 Linux 5.8 Linux 5.9 Git 30K 60K 90K 120K 150K SE +/- 12.01, N = 3 SE +/- 43.43, N = 3 SE +/- 97.27, N = 3 133823 133870 133774 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 125.43 125.50 125.66 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Zstd Compression Compression Level: 19 OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.5 Compression Level: 19 Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.07, N = 3 20.5 20.4 20.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 Linux 5.4 Linux 5.8 Linux 5.9 Git 8M 16M 24M 32M 40M SE +/- 54862.86, N = 3 SE +/- 6470.73, N = 3 SE +/- 156718.13, N = 3 38538549.1 38513841.4 38169338.8
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 SE +/- 0.05, N = 3 SE +/- 0.30, N = 3 SE +/- 0.30, N = 3 118.30 118.46 118.98 1. (CXX) g++ options: -O2 -lOpenCL
GLmark2 Resolution: 3840 x 2160 OpenBenchmarking.org Score, More Is Better GLmark2 2020.04 Resolution: 3840 x 2160 Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 135 119 137
GLmark2 Resolution: 1920 x 1080 OpenBenchmarking.org Score, More Is Better GLmark2 2020.04 Resolution: 1920 x 1080 Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 543 502 552
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.7.3 Encoder Speed: 2 Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.16, N = 3 SE +/- 0.10, N = 3 SE +/- 0.18, N = 3 108.06 108.40 108.96 1. (CXX) g++ options: -O3 -fPIC
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Linux 5.4 Linux 5.8 Linux 5.9 Git 200K 400K 600K 800K 1000K SE +/- 1019.70, N = 3 SE +/- 2768.93, N = 3 SE +/- 4935.24, N = 3 990095.1 984645.3 982092.1
InfluxDB Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Linux 5.4 Linux 5.8 Linux 5.9 Git 200K 400K 600K 800K 1000K SE +/- 1038.31, N = 3 SE +/- 553.23, N = 3 SE +/- 2049.30, N = 3 996459.9 993437.4 994952.9
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: yolov4-tiny Linux 5.4 Linux 5.8 Linux 5.9 Git 9 18 27 36 45 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 39.48 39.36 39.45 MIN: 39.28 / MAX: 53.47 MIN: 39.23 / MAX: 42.19 MIN: 39.23 / MAX: 42.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet50 Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 46.50 46.45 46.46 MIN: 46.31 / MAX: 58.67 MIN: 46.3 / MAX: 49.28 MIN: 46.33 / MAX: 51.2 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: alexnet Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.22, N = 3 SE +/- 0.01, N = 3 SE +/- 0.28, N = 3 22.56 22.12 22.41 MIN: 22.06 / MAX: 39.49 MIN: 22.04 / MAX: 22.28 MIN: 22.05 / MAX: 59.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: resnet18 Linux 5.4 Linux 5.8 Linux 5.9 Git 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 23.09 23.11 23.05 MIN: 22.97 / MAX: 26.84 MIN: 22.91 / MAX: 35.18 MIN: 22.9 / MAX: 25.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: vgg16 Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 89.35 88.95 88.82 MIN: 89.05 / MAX: 94.32 MIN: 88.67 / MAX: 137.54 MIN: 88.53 / MAX: 101.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: googlenet Linux 5.4 Linux 5.8 Linux 5.9 Git 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 23.08 23.03 23.07 MIN: 22.97 / MAX: 26.36 MIN: 22.94 / MAX: 23.49 MIN: 22.94 / MAX: 35.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: blazeface Linux 5.4 Linux 5.8 Linux 5.9 Git 0.4883 0.9766 1.4649 1.9532 2.4415 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.17 2.14 2.15 MIN: 2.13 / MAX: 2.29 MIN: 2.13 / MAX: 2.24 MIN: 2.13 / MAX: 2.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: efficientnet-b0 Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 10.51 10.44 10.45 MIN: 10.41 / MAX: 11 MIN: 10.4 / MAX: 10.76 MIN: 10.4 / MAX: 13.27 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mnasnet Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.66 6.61 6.62 MIN: 6.56 / MAX: 20.23 MIN: 6.56 / MAX: 8.52 MIN: 6.57 / MAX: 8.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: shufflenet-v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 1.0508 2.1016 3.1524 4.2032 5.254 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 4.67 4.66 4.66 MIN: 4.59 / MAX: 8.01 MIN: 4.61 / MAX: 6.54 MIN: 4.59 / MAX: 6.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v3-v3 - Model: mobilenet-v3 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 6.53 6.53 6.52 MIN: 6.41 / MAX: 9.17 MIN: 6.43 / MAX: 9.7 MIN: 6.42 / MAX: 8.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU-v2-v2 - Model: mobilenet-v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 7.79 7.82 7.78 MIN: 7.67 / MAX: 9.99 MIN: 7.68 / MAX: 20.52 MIN: 7.67 / MAX: 9.94 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: mobilenet Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 28.94 28.85 28.84 MIN: 28.7 / MAX: 32.85 MIN: 28.7 / MAX: 41.24 MIN: 28.67 / MAX: 41.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: CPU - Model: squeezenet Linux 5.4 Linux 5.8 Linux 5.9 Git 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 24.88 24.76 24.84 MIN: 24.67 / MAX: 25.89 MIN: 24.65 / MAX: 36.01 MIN: 24.64 / MAX: 36.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis Linux 5.4 Linux 5.8 Linux 5.9 Git 8 16 24 32 40 SE +/- 0.22, N = 16 SE +/- 0.20, N = 4 SE +/- 0.13, N = 4 33.93 34.26 34.59 1. (CC) gcc options: -O2 -std=c99
Montage Astronomical Image Mosaic Engine Mosaic of M17, K band, 1.5 deg x 1.5 deg OpenBenchmarking.org Seconds, Fewer Is Better Montage Astronomical Image Mosaic Engine 6.0 Mosaic of M17, K band, 1.5 deg x 1.5 deg Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.18, N = 3 89.56 90.70 91.03 1. (CC) gcc options: -std=gnu99 -lcfitsio -lm -O2
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 60 120 180 240 300 SE +/- 0.67, N = 3 SE +/- 3.43, N = 3 SE +/- 2.39, N = 15 285.96 270.67 260.96 MIN: 283.43 MIN: 261.84 MIN: 247.57 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression Linux 5.4 Linux 5.8 Linux 5.9 Git 0.9743 1.9486 2.9229 3.8972 4.8715 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 4 4.29 4.24 4.33
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough Linux 5.4 Linux 5.8 Linux 5.9 Git 16 32 48 64 80 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 72.26 72.32 72.37 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Hugin Panorama Photo Assistant + Stitching Time OpenBenchmarking.org Seconds, Fewer Is Better Hugin Panorama Photo Assistant + Stitching Time Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.48, N = 3 SE +/- 0.25, N = 3 SE +/- 0.42, N = 3 73.46 74.06 74.65
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica Linux 5.4 Linux 5.8 Linux 5.9 Git 16 32 48 64 80 SE +/- 0.60, N = 3 SE +/- 0.91, N = 3 SE +/- 0.43, N = 3 71.10 68.13 70.06
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 Linux 5.4 Linux 5.8 Linux 5.9 Git 80K 160K 240K 320K 400K SE +/- 767.36, N = 3 SE +/- 387.75, N = 3 SE +/- 359.00, N = 3 393603.00 381234.65 377603.40 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
DeepSpeech Acceleration: CPU OpenBenchmarking.org Seconds, Fewer Is Better DeepSpeech 0.6 Acceleration: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.51, N = 3 SE +/- 0.53, N = 3 78.51 80.61 80.05
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 Linux 5.4 Linux 5.8 Linux 5.9 Git 14K 28K 42K 56K 70K SE +/- 46.52, N = 3 SE +/- 13.57, N = 3 SE +/- 46.28, N = 3 66942 67067 66888 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only OpenBenchmarking.org FPS, More Is Better MPV Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 139.45 139.88 139.75 MIN: 115.16 / MAX: 146.34 MIN: 123.1 / MAX: 146.35 MIN: 119 / MAX: 146.35 1. mpv 0.32.0
TensorFlow Lite Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: NASNet Mobile Linux 5.4 Linux 5.8 Linux 5.9 Git 80K 160K 240K 320K 400K SE +/- 60.60, N = 3 SE +/- 108.52, N = 3 SE +/- 77.03, N = 3 366947 366914 367078
TensorFlow Lite Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: SqueezeNet Linux 5.4 Linux 5.8 Linux 5.9 Git 110K 220K 330K 440K 550K SE +/- 34.51, N = 3 SE +/- 53.76, N = 3 SE +/- 63.00, N = 3 499694 499845 499825
TensorFlow Lite Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Float Linux 5.4 Linux 5.8 Linux 5.9 Git 70K 140K 210K 280K 350K SE +/- 59.09, N = 3 SE +/- 39.28, N = 3 SE +/- 48.17, N = 3 339412 339372 339434
TensorFlow Lite Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Quant Linux 5.4 Linux 5.8 Linux 5.9 Git 70K 140K 210K 280K 350K SE +/- 72.59, N = 3 SE +/- 64.01, N = 3 SE +/- 102.05, N = 3 348999 348976 349438
oneDNN Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.01, N = 3 SE +/- 0.17, N = 3 SE +/- 0.15, N = 3 49.86 50.97 50.24 MIN: 49.38 MIN: 50.11 MIN: 49.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Batch All - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 30 60 90 120 150 SE +/- 0.29, N = 3 SE +/- 1.51, N = 3 SE +/- 0.10, N = 3 111.12 114.24 110.72 MIN: 109.55 MIN: 110.66 MIN: 109.51 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OCRMyPDF Processing 60 Page PDF Document OpenBenchmarking.org Seconds, Fewer Is Better OCRMyPDF 9.6.0+dfsg Processing 60 Page PDF Document Linux 5.4 Linux 5.8 Linux 5.9 Git 12 24 36 48 60 SE +/- 0.14, N = 3 SE +/- 0.13, N = 3 SE +/- 0.20, N = 3 55.54 55.39 55.51
XZ Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 Linux 5.4 Linux 5.8 Linux 5.9 Git 12 24 36 48 60 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 SE +/- 0.37, N = 3 50.74 50.62 51.03 1. (CC) gcc options: -pthread -fvisibility=hidden -O2
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.14, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 47.47 47.26 47.19 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver Linux 5.4 Linux 5.8 Linux 5.9 Git 11 22 33 44 55 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 47.39 47.67 47.63 1. (CXX) g++ options: -O2 -lOpenCL
AOM AV1 Encoder Mode: Speed 0 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 0 Two-Pass Linux 5.4 Linux 5.8 Linux 5.9 Git 0.0495 0.099 0.1485 0.198 0.2475 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.22 0.22 0.22 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 27.66 27.65 27.55 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
AOM AV1 Encoder Mode: Speed 6 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Realtime Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 14.99 14.99 14.99 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 120 240 360 480 600 SE +/- 0.15, N = 3 SE +/- 0.78, N = 3 SE +/- 3.18, N = 3 569.40 548.04 558.86 MIN: 568.18 MIN: 545.86 MIN: 552.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 6 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Two-Pass Linux 5.4 Linux 5.8 Linux 5.9 Git 0.675 1.35 2.025 2.7 3.375 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.00 2.99 2.99 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile Linux 5.4 Linux 5.8 Linux 5.9 Git 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 32.73 32.71 33.01
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 0.00, N = 3 27.86 28.12 27.87
Zstd Compression Compression Level: 3 OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.5 Compression Level: 3 Linux 5.4 Linux 5.8 Linux 5.9 Git 500 1000 1500 2000 2500 SE +/- 6.19, N = 3 SE +/- 1.97, N = 3 SE +/- 1.34, N = 3 2386.0 2389.3 2389.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 28.73 28.73 28.70 1. (CXX) g++ options: -O2 -lOpenCL
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.07, N = 3 SE +/- 0.18, N = 3 SE +/- 0.07, N = 3 27.87 28.13 28.09 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Tesseract OCR Time To OCR 7 Images OpenBenchmarking.org Seconds, Fewer Is Better Tesseract OCR 4.1.1 Time To OCR 7 Images Linux 5.4 Linux 5.8 Linux 5.9 Git 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 27.45 27.64 27.57
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time Linux 5.4 Linux 5.8 Linux 5.9 Git 1.6M 3.2M 4.8M 6.4M 8M SE +/- 35007.79, N = 3 SE +/- 23889.21, N = 3 SE +/- 21668.21, N = 3 7547320 7469298 7302256 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 Linux 5.4 Linux 5.8 Linux 5.9 Git 80 160 240 320 400 SE +/- 0.47, N = 3 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 366.30 366.39 366.63 MIN: 364.37 / MAX: 374.6 MIN: 365.21 / MAX: 367.87 MIN: 365.56 / MAX: 368.1 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
AOM AV1 Encoder Mode: Speed 4 Two-Pass OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 4 Two-Pass Linux 5.4 Linux 5.8 Linux 5.9 Git 0.423 0.846 1.269 1.692 2.115 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.88 1.87 1.88 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 Linux 5.4 Linux 5.8 Linux 5.9 Git 80 160 240 320 400 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.40, N = 3 344.44 346.50 346.14 MIN: 343.7 / MAX: 346.67 MIN: 345.83 / MAX: 347.67 MIN: 344.07 / MAX: 348.21 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 21.43 21.48 21.54
oneDNN Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.01, N = 3 10.39 10.31 10.19 MIN: 10.15 MIN: 10.1 MIN: 10.07 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.00752, N = 3 SE +/- 0.08955, N = 3 SE +/- 0.10703, N = 3 9.25837 9.73579 9.82786 MIN: 9.19 MIN: 9.52 MIN: 9.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 19.36 19.33 19.21 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
AOM AV1 Encoder Mode: Speed 8 Realtime OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 8 Realtime Linux 5.4 Linux 5.8 Linux 5.9 Git 8 16 24 32 40 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 35.82 35.57 35.65 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 11.14 11.14 11.15 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ECP-CANDLE Benchmark: P1B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.3 Benchmark: P1B2 Linux 5.4 Linux 5.8 Linux 5.9 Git 10 20 30 40 50 46.13 45.58 45.98
oneDNN Harness: IP Batch 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.00538, N = 3 SE +/- 0.03078, N = 3 SE +/- 0.03775, N = 3 8.17233 8.37706 8.40703 MIN: 7.82 MIN: 8 MIN: 7.97 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 0.8779 1.7558 2.6337 3.5116 4.3895 SE +/- 0.00729, N = 3 SE +/- 0.00186, N = 3 SE +/- 0.00590, N = 3 3.77996 3.90177 3.84716 MIN: 3.67 MIN: 3.79 MIN: 3.73 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 12.27 12.29 12.29 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 1.1604 2.3208 3.4812 4.6416 5.802 SE +/- 0.00807, N = 3 SE +/- 0.01544, N = 3 SE +/- 0.00900, N = 3 5.15748 5.04980 5.04492 MIN: 5.07 MIN: 4.96 MIN: 4.96 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.00597, N = 3 SE +/- 0.01858, N = 3 SE +/- 0.01959, N = 3 7.16575 7.19867 7.20203 MIN: 7.05 MIN: 7.05 MIN: 7.07 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
System GZIP Decompression OpenBenchmarking.org Seconds, Fewer Is Better System GZIP Decompression Linux 5.4 Linux 5.8 Linux 5.9 Git 0.7549 1.5098 2.2647 3.0196 3.7745 SE +/- 0.001, N = 3 SE +/- 0.035, N = 14 SE +/- 0.046, N = 13 3.282 3.339 3.355
ASTC Encoder Preset: Fast OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast Linux 5.4 Linux 5.8 Linux 5.9 Git 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 9.02 9.12 9.07 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.009, N = 3 SE +/- 0.009, N = 3 SE +/- 0.015, N = 3 8.148 8.184 8.161 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
libavif avifenc Encoder Speed: 8 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.7.3 Encoder Speed: 8 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.013, N = 3 SE +/- 0.022, N = 3 SE +/- 0.007, N = 3 8.199 8.257 8.316 1. (CXX) g++ options: -O3 -fPIC
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.7.3 Encoder Speed: 10 Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.017, N = 3 SE +/- 0.030, N = 3 SE +/- 0.031, N = 3 7.600 7.639 7.678 1. (CXX) g++ options: -O3 -fPIC
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 24Aug2020 Model: Rhodopsin Protein Linux 5.4 Linux 5.8 Linux 5.9 Git 0.6206 1.2412 1.8618 2.4824 3.103 SE +/- 0.004, N = 3 SE +/- 0.005, N = 3 SE +/- 0.007, N = 3 2.746 2.758 2.758 1. (CXX) g++ options: -O3 -pthread -lm
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 18.75 18.83 18.79 MIN: 18.38 MIN: 18.6 MIN: 18.35 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 19.94 19.89 19.88 MIN: 19.8 MIN: 19.73 MIN: 19.73 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine Linux 5.4 Linux 5.8 Linux 5.9 Git 4K 8K 12K 16K 20K SE +/- 95.32, N = 3 SE +/- 59.99, N = 3 SE +/- 64.79, N = 3 20328.50 20258.60 20335.31 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
oneDNN Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 2 4 6 8 10 SE +/- 0.02123, N = 3 SE +/- 0.01758, N = 3 SE +/- 0.00534, N = 3 7.41923 7.42274 7.40944 MIN: 7.35 MIN: 7.35 MIN: 7.3 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU Linux 5.4 Linux 5.8 Linux 5.9 Git 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 14.37 15.15 14.72 MIN: 14.12 MIN: 14.79 MIN: 14.45 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 Linux 5.4 Linux 5.8 Linux 5.9 Git 0.6201 1.2402 1.8603 2.4804 3.1005 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.005, N = 3 2.728 2.742 2.756 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance Linux 5.4 Linux 5.8 Linux 5.9 Git 200K 400K 600K 800K 1000K SE +/- 1206.41, N = 5 SE +/- 1191.39, N = 5 SE +/- 1137.79, N = 5 1141277 1134693 1131430 1. (CC) gcc options: -O3 -march=native
WebP Image Encode Encode Settings: Default OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Default Linux 5.4 Linux 5.8 Linux 5.9 Git 0.3938 0.7876 1.1814 1.5752 1.969 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 1.741 1.750 1.749 1. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
Phoronix Test Suite v10.8.4