Core i7 5775C EOY 2020 Intel Core i7-5775C testing with a MSI Z97-G45 GAMING (MS-7821) v1.0 (V2.9 BIOS) and MSI Intel Iris Pro 6200 3GB on Ubuntu 18.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012292-HA-COREI757721&grw&sor .
Core i7 5775C EOY 2020 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Core i7-5775C @ 3.70GHz (4 Cores / 8 Threads) MSI Z97-G45 GAMING (MS-7821) v1.0 (V2.9 BIOS) Intel Broadwell-U DMI 16GB 120GB CT120BX100SSD1 MSI Intel Iris Pro 6200 3GB (1150MHz) Intel Broadwell-U Audio VA2431 Qualcomm Atheros Killer E220x Ubuntu 18.10 5.0.0-999-generic (x86_64) 20190223 GNOME Shell 3.30.2 X Server 1.20.1 modesetting 1.20.1 4.5 Mesa 19.2.0-devel (git-2631fd3 2019-07-24 cosmic-oibaf-ppa) 1.1.102 GCC 8.3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x20 - Thermald 1.7 Java Details - OpenJDK Runtime Environment (build 11.0.3+7-Ubuntu-1ubuntu218.10.1) Python Details - Python 2.7.16 + Python 3.6.8 Security Details - l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
Core i7 5775C EOY 2020 compress-lz4: 1 - Compression Speed compress-lz4: 1 - Decompression Speed compress-lz4: 3 - Compression Speed yquake2: Software CPU - 1920 x 1080 yquake2: OpenGL 3.x - 1920 x 1080 yquake2: OpenGL 1.x - 1920 x 1080 compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 9 - Decompression Speed crafty: Elapsed Time clomp: Static OMP Speedup hint: FLOAT basis: ETC1S basis: UASTC Level 0 basis: UASTC Level 2 basis: UASTC Level 3 basis: UASTC Level 2 + RDO Post-Processing brl-cad: VGR Performance Metric encode-ape: WAV To APE encode-opus: WAV To Opus Encode encode-wavpack: WAV To WavPack astcenc: Fast astcenc: Medium astcenc: Thorough astcenc: Exhaustive betsy: ETC1 - Highest betsy: ETC2 RGB - Highest encode-ogg: WAV To Ogg espeak: Text-To-Speech Synthesis hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA numpy: hpcc: G-HPL hpcc: G-Ffte hpcc: EP-DGEMM hpcc: G-Ptrans hpcc: EP-STREAM Triad hpcc: G-Rand Access hpcc: Rand Ring Latency hpcc: Rand Ring Bandwidth hpcc: Max Ping Pong Bandwidth dolfyn: Computational Fluid Dynamics rnnoise: caffe: AlexNet - CPU - 100 caffe: GoogleNet - CPU - 100 ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m mlpack: scikit_ica mlpack: scikit_qda mlpack: scikit_svm mlpack: scikit_linearridgeregression gromacs: Water Benchmark lammps: Rhodopsin Protein onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU ffte: N=256, 3D Complex FFT Routine sunflow: Global Illumination + Image Synthesis coremark: CoreMark Size 666 - Iterations Per Second build-ffmpeg: Time To Compile stockfish: Total Time asmfish: 1024 Hash Memory, 26 Depth kvazaar: Bosphorus 4K - Slow kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 1080p - Slow kvazaar: Bosphorus 1080p - Medium kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Very Fast kvazaar: Bosphorus 1080p - Ultra Fast x265: Bosphorus 4K x265: Bosphorus 1080p rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 embree: Pathtracer - Crown embree: Pathtracer ISPC - Crown embree: Pathtracer - Asian Dragon embree: Pathtracer ISPC - Asian Dragon indigobench: CPU - Bedroom indigobench: CPU - Supercar build2: Time To Compile build-eigen: Time To Compile phpbench: PHP Benchmark Suite keydb: redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET unpack-firefox: firefox-84.0.source.tar.xz sqlite-speedtest: Timed Time - Size 1,000 byte: Dhrystone 2 1 2 3 4787.47 5489.2 43.28 114.3 553.8 518.3 5408.7 42.62 5451.8 6824944 3.1 381176866.34838 79.260 12.002 81.244 158.998 1001.729 45294 14.503 9.412 16.308 8.42 10.93 72.63 588.79 15.397 16.127 24.080 40.479 114.001 12.524 279.09 82.76190 2.79950 36.01717 1.23725 4.08144 0.03293 0.21109 5.46352 17277.738 23.148 25.670 67035 138193 26.45 7.08 5.99 11.39 6.68 10.20 3.09 19.60 81.26 23.82 21.96 46.99 37.64 31.07 17.30 61.60 99.00 13.35 5.25 0.482 2.878 9.00601 13.1433 5.77578 3.12754 20.7908 12.4843 17.9927 21.2868 13.0131 11.2286 7404.84 3879.97 7400.51 3898.31 3.50897 7399.77 3897.98 6.70028 19803.756124216 2.505 140877.184823 121.027 7419488 11144382 2.01 2.06 8.75 9.02 5.62 10.18 22.68 40.39 6.11 28.06 0.288 0.878 1.183 2.636 4.7126 5.4516 5.6657 6.8017 0.721 1.654 242.554 82.306 574643 411366.57 2368415.09 1661804.39 1421632.92 2215813.5 1660500.87 20.723 72.274 37136699.5 4805.59 5497.5 43.51 114.1 547.1 510.3 5450.2 42.68 5458.7 6860639 3.1 381158594.08639 78.339 11.989 81.226 158.995 1001.520 45275 14.476 9.422 16.298 8.43 10.93 72.60 588.80 14.698 15.314 24.079 41.123 114.069 12.610 275.03 81.58543 2.79007 35.86837 1.23969 4.07380 0.03319 0.21092 5.81487 17131.180 23.227 25.657 66877 138669 26.68 7.07 6.00 11.29 6.60 10.20 3.09 19.49 81.40 23.82 22.01 46.98 37.65 31.15 17.38 60.42 98.58 13.34 5.39 0.484 2.849 8.59358 13.9508 5.61439 3.24606 20.8008 11.7263 16.9585 20.6052 12.8064 11.1951 7396.94 3895.42 7394.59 3884.54 4.27331 7396.63 3882.87 6.33749 19947.561999863 2.487 141109.571045 122.199 7444518 11229182 2.01 2.06 8.77 9.04 5.62 10.16 22.72 40.45 6.14 27.90 0.289 0.882 1.185 2.650 4.7276 5.3988 5.6753 6.8597 0.723 1.655 240.876 82.482 575213 411227.16 1438792.17 1761393.52 1444723.54 1939149.38 1671609.50 21.023 72.628 37026672.1 4792.61 5519.6 43.38 114.7 542.5 527.6 5477.8 42.05 5474.5 6885507 3.1 382533367.81553 78.757 11.998 81.253 158.980 1001.727 45131 14.447 9.416 16.303 8.43 10.93 72.62 588.64 15.956 15.716 24.102 40.939 114.093 12.543 277.38 80.82890 2.78478 36.29173 1.23435 4.07377 0.03312 0.21180 5.36042 17216.067 23.020 25.659 65933 138249 26.68 6.97 5.98 11.31 6.62 10.13 3.06 19.47 81.33 23.72 22.04 47.11 37.90 30.94 17.27 59.83 95.58 13.36 5.28 0.484 2.865 8.49555 14.1075 5.62232 2.67834 21.1590 11.6886 16.0907 21.9084 12.8897 11.2136 7398.38 3876.36 7399.59 3869.51 3.71011 7419.20 3873.60 6.32260 19694.290326945 2.445 141520.579544 121.208 7392870 11302838 2.01 2.06 8.79 9.05 5.64 10.19 22.76 40.49 6.09 27.94 0.288 0.882 1.185 2.653 4.7247 5.3503 5.6695 6.7998 0.723 1.658 238.064 82.334 574799 408847.89 2024710.52 1687783.91 1461476.54 2041822.96 1668121.37 21.303 72.533 37053570.5 OpenBenchmarking.org
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed 2 3 1 1000 2000 3000 4000 5000 SE +/- 18.70, N = 3 SE +/- 5.40, N = 3 SE +/- 14.35, N = 3 4805.59 4792.61 4787.47 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed 3 2 1 1200 2400 3600 4800 6000 SE +/- 19.26, N = 3 SE +/- 14.56, N = 3 SE +/- 33.51, N = 3 5519.6 5497.5 5489.2 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed 2 3 1 10 20 30 40 50 SE +/- 0.00, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 43.51 43.38 43.28 1. (CC) gcc options: -O3
yquake2 Renderer: Software CPU - Resolution: 1920 x 1080 OpenBenchmarking.org Frames Per Second, More Is Better yquake2 7.45 Renderer: Software CPU - Resolution: 1920 x 1080 3 1 2 30 60 90 120 150 SE +/- 0.24, N = 3 SE +/- 0.15, N = 3 SE +/- 0.68, N = 3 114.7 114.3 114.1 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
yquake2 Renderer: OpenGL 3.x - Resolution: 1920 x 1080 OpenBenchmarking.org Frames Per Second, More Is Better yquake2 7.45 Renderer: OpenGL 3.x - Resolution: 1920 x 1080 1 2 3 120 240 360 480 600 SE +/- 2.95, N = 3 SE +/- 4.68, N = 3 SE +/- 3.26, N = 3 553.8 547.1 542.5 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
yquake2 Renderer: OpenGL 1.x - Resolution: 1920 x 1080 OpenBenchmarking.org Frames Per Second, More Is Better yquake2 7.45 Renderer: OpenGL 1.x - Resolution: 1920 x 1080 3 1 2 110 220 330 440 550 SE +/- 3.86, N = 3 SE +/- 9.01, N = 4 SE +/- 5.25, N = 15 527.6 518.3 510.3 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed 3 2 1 1200 2400 3600 4800 6000 SE +/- 9.93, N = 3 SE +/- 4.97, N = 3 SE +/- 4.57, N = 3 5477.8 5450.2 5408.7 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed 2 1 3 10 20 30 40 50 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 SE +/- 0.42, N = 3 42.68 42.62 42.05 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed 3 2 1 1200 2400 3600 4800 6000 SE +/- 5.41, N = 3 SE +/- 11.99, N = 3 SE +/- 6.01, N = 3 5474.5 5458.7 5451.8 1. (CC) gcc options: -O3
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time 3 2 1 1.5M 3M 4.5M 6M 7.5M SE +/- 4260.63, N = 3 SE +/- 16848.33, N = 3 SE +/- 29965.63, N = 3 6885507 6860639 6824944 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup 3 2 1 0.6975 1.395 2.0925 2.79 3.4875 SE +/- 0.03, N = 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 12 3.1 3.1 3.1 1. (CC) gcc options: -fopenmp -O3 -lm
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT 3 1 2 80M 160M 240M 320M 400M SE +/- 82458.49, N = 3 SE +/- 977507.07, N = 3 SE +/- 786027.10, N = 3 382533367.82 381176866.35 381158594.09 1. (CC) gcc options: -O3 -march=native -lm
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: ETC1S 2 3 1 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.52, N = 3 78.34 78.76 79.26 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 0 2 3 1 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 11.99 12.00 12.00 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 2 1 3 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 81.23 81.24 81.25 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 3 3 2 1 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 158.98 159.00 159.00 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 2 + RDO Post-Processing OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 + RDO Post-Processing 2 3 1 200 400 600 800 1000 SE +/- 0.24, N = 3 SE +/- 0.27, N = 3 SE +/- 0.26, N = 3 1001.52 1001.73 1001.73 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric 1 2 3 10K 20K 30K 40K 50K 45294 45275 45131 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
Monkey Audio Encoding WAV To APE OpenBenchmarking.org Seconds, Fewer Is Better Monkey Audio Encoding 3.99.6 WAV To APE 3 2 1 4 8 12 16 20 SE +/- 0.06, N = 5 SE +/- 0.06, N = 5 SE +/- 0.11, N = 5 14.45 14.48 14.50 1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode 1 3 2 3 6 9 12 15 SE +/- 0.019, N = 5 SE +/- 0.011, N = 5 SE +/- 0.014, N = 5 9.412 9.416 9.422 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack 2 3 1 4 8 12 16 20 SE +/- 0.00, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 16.30 16.30 16.31 1. (CXX) g++ options: -rdynamic
ASTC Encoder Preset: Fast OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast 1 2 3 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.42 8.43 8.43 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 10.93 10.93 10.93 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough 2 3 1 16 32 48 64 80 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 72.60 72.62 72.63 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive 3 1 2 130 260 390 520 650 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 588.64 588.79 588.80 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Betsy GPU Compressor Codec: ETC1 - Quality: Highest OpenBenchmarking.org Seconds, Fewer Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest 2 1 3 4 8 12 16 20 SE +/- 0.45, N = 14 SE +/- 0.55, N = 15 SE +/- 0.04, N = 3 14.70 15.40 15.96 1. (CXX) g++ options: -O3 -O2 -lpthread -ldl
Betsy GPU Compressor Codec: ETC2 RGB - Quality: Highest OpenBenchmarking.org Seconds, Fewer Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest 2 3 1 4 8 12 16 20 SE +/- 0.32, N = 15 SE +/- 0.19, N = 15 SE +/- 0.51, N = 13 15.31 15.72 16.13 1. (CXX) g++ options: -O3 -O2 -lpthread -ldl
Ogg Audio Encoding WAV To Ogg OpenBenchmarking.org Seconds, Fewer Is Better Ogg Audio Encoding 1.3.4 WAV To Ogg 2 1 3 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 24.08 24.08 24.10 1. (CC) gcc options: -O2 -ffast-math -fsigned-char
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis 1 3 2 9 18 27 36 45 SE +/- 0.35, N = 16 SE +/- 0.56, N = 6 SE +/- 0.48, N = 4 40.48 40.94 41.12 1. (CC) gcc options: -O2 -std=c99
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 114.00 114.07 114.09 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 3 2 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 12.52 12.54 12.61 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Numpy Benchmark OpenBenchmarking.org Score, More Is Better Numpy Benchmark 1 3 2 60 120 180 240 300 SE +/- 1.04, N = 3 SE +/- 0.46, N = 3 SE +/- 0.54, N = 3 279.09 277.38 275.03
HPC Challenge Test / Class: G-HPL OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-HPL 1 2 3 20 40 60 80 100 SE +/- 1.43, N = 3 SE +/- 0.40, N = 3 SE +/- 0.62, N = 3 82.76 81.59 80.83 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
HPC Challenge Test / Class: G-Ffte OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ffte 1 2 3 0.6299 1.2598 1.8897 2.5196 3.1495 SE +/- 0.00194, N = 3 SE +/- 0.00441, N = 3 SE +/- 0.00289, N = 3 2.79950 2.79007 2.78478 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
HPC Challenge Test / Class: EP-DGEMM OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: EP-DGEMM 3 1 2 8 16 24 32 40 SE +/- 0.40, N = 3 SE +/- 0.20, N = 3 SE +/- 0.08, N = 3 36.29 36.02 35.87 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
HPC Challenge Test / Class: G-Ptrans OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ptrans 2 1 3 0.2789 0.5578 0.8367 1.1156 1.3945 SE +/- 0.00266, N = 3 SE +/- 0.01118, N = 3 SE +/- 0.00515, N = 3 1.23969 1.23725 1.23435 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
HPC Challenge Test / Class: EP-STREAM Triad OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: EP-STREAM Triad 1 2 3 0.9183 1.8366 2.7549 3.6732 4.5915 SE +/- 0.00157, N = 3 SE +/- 0.00200, N = 3 SE +/- 0.00055, N = 3 4.08144 4.07380 4.07377 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
HPC Challenge Test / Class: G-Random Access OpenBenchmarking.org GUP/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Random Access 2 3 1 0.0075 0.015 0.0225 0.03 0.0375 SE +/- 0.00018, N = 3 SE +/- 0.00022, N = 3 SE +/- 0.00010, N = 3 0.03319 0.03312 0.03293 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
HPC Challenge Test / Class: Random Ring Latency OpenBenchmarking.org usecs, Fewer Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Latency 2 1 3 0.0477 0.0954 0.1431 0.1908 0.2385 SE +/- 0.00022, N = 3 SE +/- 0.00015, N = 3 SE +/- 0.00054, N = 3 0.21092 0.21109 0.21180 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
HPC Challenge Test / Class: Random Ring Bandwidth OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Bandwidth 2 1 3 1.3083 2.6166 3.9249 5.2332 6.5415 SE +/- 0.02203, N = 3 SE +/- 0.18268, N = 3 SE +/- 0.18898, N = 3 5.81487 5.46352 5.36042 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
HPC Challenge Test / Class: Max Ping Pong Bandwidth OpenBenchmarking.org MB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Max Ping Pong Bandwidth 1 3 2 4K 8K 12K 16K 20K SE +/- 43.07, N = 3 SE +/- 105.67, N = 3 SE +/- 272.70, N = 3 17277.74 17216.07 17131.18 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.2
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 3 1 2 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 23.02 23.15 23.23
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 2 3 1 6 12 18 24 30 SE +/- 0.22, N = 3 SE +/- 0.20, N = 3 SE +/- 0.22, N = 3 25.66 25.66 25.67 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 3 2 1 14K 28K 42K 56K 70K SE +/- 171.69, N = 3 SE +/- 202.49, N = 3 SE +/- 296.71, N = 3 65933 66877 67035 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lpthread -lglog -lgflags -lsz -lz -ldl -lm -lprotobuf -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 3 2 30K 60K 90K 120K 150K SE +/- 306.29, N = 3 SE +/- 303.12, N = 3 SE +/- 50.29, N = 3 138193 138249 138669 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lpthread -lglog -lgflags -lsz -lz -ldl -lm -lprotobuf -llmdb -lopenblas
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet 1 2 3 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.30, N = 3 SE +/- 0.17, N = 3 26.45 26.68 26.68 MIN: 26.18 / MAX: 39.6 MIN: 26.23 / MAX: 28.16 MIN: 26.16 / MAX: 37.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 3 2 1 2 4 6 8 10 SE +/- 0.13, N = 3 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 6.97 7.07 7.08 MIN: 6.76 / MAX: 8.01 MIN: 6.97 / MAX: 8.06 MIN: 6.91 / MAX: 8.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 3 1 2 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 5.98 5.99 6.00 MIN: 5.93 / MAX: 6.94 MIN: 5.93 / MAX: 6.86 MIN: 5.95 / MAX: 7.09 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 2 3 1 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 11.29 11.31 11.39 MIN: 11.24 / MAX: 12.08 MIN: 11.22 / MAX: 24.44 MIN: 11.31 / MAX: 12.43 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet 2 3 1 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 6.60 6.62 6.68 MIN: 6.55 / MAX: 6.74 MIN: 6.5 / MAX: 6.81 MIN: 6.56 / MAX: 6.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 3 1 2 3 6 9 12 15 SE +/- 0.11, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 10.13 10.20 10.20 MIN: 9.89 / MAX: 11.32 MIN: 10.13 / MAX: 10.32 MIN: 9.86 / MAX: 21.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface 3 1 2 0.6953 1.3906 2.0859 2.7812 3.4765 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 3.06 3.09 3.09 MIN: 3.03 / MAX: 3.14 MIN: 3.03 / MAX: 3.29 MIN: 3.03 / MAX: 7.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet 3 2 1 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 SE +/- 0.02, N = 3 19.47 19.49 19.60 MIN: 19.19 / MAX: 20.46 MIN: 19.14 / MAX: 19.91 MIN: 19.49 / MAX: 19.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 1 3 2 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.13, N = 3 SE +/- 0.09, N = 3 81.26 81.33 81.40 MIN: 80.69 / MAX: 92.8 MIN: 80.91 / MAX: 85.21 MIN: 80.9 / MAX: 92.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 3 1 2 6 12 18 24 30 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 23.72 23.82 23.82 MIN: 23.51 / MAX: 36.61 MIN: 23.6 / MAX: 36.25 MIN: 23.47 / MAX: 25.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet 1 2 3 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 21.96 22.01 22.04 MIN: 21.61 / MAX: 23.2 MIN: 21.84 / MAX: 22.4 MIN: 21.86 / MAX: 22.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 2 1 3 11 22 33 44 55 SE +/- 0.17, N = 3 SE +/- 0.06, N = 3 SE +/- 0.21, N = 3 46.98 46.99 47.11 MIN: 46.59 / MAX: 60.2 MIN: 46.57 / MAX: 49.08 MIN: 46.64 / MAX: 50.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny 1 2 3 9 18 27 36 45 SE +/- 0.11, N = 3 SE +/- 0.18, N = 3 SE +/- 0.04, N = 3 37.64 37.65 37.90 MIN: 37.09 / MAX: 39.96 MIN: 37.19 / MAX: 39.03 MIN: 37.49 / MAX: 40.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd 3 1 2 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 0.05, N = 3 30.94 31.07 31.15 MIN: 30.7 / MAX: 32.44 MIN: 30.77 / MAX: 40.23 MIN: 30.98 / MAX: 33.13 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m 3 1 2 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.22, N = 3 17.27 17.30 17.38 MIN: 17.12 / MAX: 18.45 MIN: 17.18 / MAX: 17.63 MIN: 17.04 / MAX: 98.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica 3 2 1 14 28 42 56 70 SE +/- 0.08, N = 3 SE +/- 0.28, N = 3 SE +/- 0.75, N = 3 59.83 60.42 61.60
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda 3 2 1 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 1.16, N = 3 SE +/- 0.67, N = 3 95.58 98.58 99.00
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm 2 1 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 13.34 13.35 13.36
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression 1 3 2 1.2128 2.4256 3.6384 4.8512 6.064 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 4 5.25 5.28 5.39
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark 3 2 1 0.1089 0.2178 0.3267 0.4356 0.5445 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.484 0.484 0.482 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 3 2 0.6476 1.2952 1.9428 2.5904 3.238 SE +/- 0.006, N = 3 SE +/- 0.010, N = 3 SE +/- 0.007, N = 3 2.878 2.865 2.849 1. (CXX) g++ options: -O3 -pthread -lm
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 3 2 1 3 6 9 12 15 SE +/- 0.00624, N = 3 SE +/- 0.03663, N = 3 SE +/- 0.07657, N = 3 8.49555 8.59358 9.00601 MIN: 8.07 MIN: 8.22 MIN: 8.67 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 4 8 12 16 20 SE +/- 0.10, N = 3 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 13.14 13.95 14.11 MIN: 12.91 MIN: 13.68 MIN: 13.89 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 2 3 1 1.2996 2.5992 3.8988 5.1984 6.498 SE +/- 0.00125, N = 3 SE +/- 0.00754, N = 3 SE +/- 0.00787, N = 3 5.61439 5.62232 5.77578 MIN: 5.59 MIN: 5.59 MIN: 5.72 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 3 1 2 0.7304 1.4608 2.1912 2.9216 3.652 SE +/- 0.01863, N = 3 SE +/- 0.01583, N = 3 SE +/- 0.03533, N = 3 2.67834 3.12754 3.24606 MIN: 2.6 MIN: 3.06 MIN: 3.13 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.27, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 20.79 20.80 21.16 MIN: 19.99 MIN: 20.59 MIN: 20.85 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 3 2 1 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.11, N = 3 11.69 11.73 12.48 MIN: 11.58 MIN: 11.59 MIN: 12.21 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 3 2 1 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 16.09 16.96 17.99 MIN: 15.86 MIN: 16.49 MIN: 17.6 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 2 1 3 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 20.61 21.29 21.91 MIN: 20.41 MIN: 21 MIN: 21.58 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 2 3 1 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 12.81 12.89 13.01 MIN: 12.64 MIN: 12.74 MIN: 12.66 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 2 3 1 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 11.20 11.21 11.23 MIN: 11.17 MIN: 11.16 MIN: 11.19 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 2 3 1 1600 3200 4800 6400 8000 SE +/- 4.24, N = 3 SE +/- 5.94, N = 3 SE +/- 2.74, N = 3 7396.94 7398.38 7404.84 MIN: 7384.58 MIN: 7383.67 MIN: 7395.03 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 3 1 2 800 1600 2400 3200 4000 SE +/- 11.46, N = 3 SE +/- 13.83, N = 3 SE +/- 3.39, N = 3 3876.36 3879.97 3895.42 MIN: 3849.85 MIN: 3849.25 MIN: 3886.76 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 2 3 1 1600 3200 4800 6400 8000 SE +/- 3.54, N = 3 SE +/- 7.47, N = 3 SE +/- 4.52, N = 3 7394.59 7399.59 7400.51 MIN: 7380.26 MIN: 7382.13 MIN: 7385.65 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 3 2 1 800 1600 2400 3200 4000 SE +/- 6.34, N = 3 SE +/- 3.15, N = 3 SE +/- 11.68, N = 3 3869.51 3884.54 3898.31 MIN: 3857.36 MIN: 3876.92 MIN: 3877.01 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 3 2 0.9615 1.923 2.8845 3.846 4.8075 SE +/- 0.03532, N = 3 SE +/- 0.06500, N = 3 SE +/- 0.00958, N = 3 3.50897 3.71011 4.27331 MIN: 3.36 MIN: 3.47 MIN: 4.17 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 2 1 3 1600 3200 4800 6400 8000 SE +/- 3.99, N = 3 SE +/- 5.89, N = 3 SE +/- 5.09, N = 3 7396.63 7399.77 7419.20 MIN: 7381.31 MIN: 7384.45 MIN: 7404.38 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 3 2 1 800 1600 2400 3200 4000 SE +/- 8.25, N = 3 SE +/- 0.98, N = 3 SE +/- 11.47, N = 3 3873.60 3882.87 3897.98 MIN: 3858.23 MIN: 3878.39 MIN: 3871.77 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 3 2 1 2 4 6 8 10 SE +/- 0.01493, N = 3 SE +/- 0.00312, N = 3 SE +/- 0.01488, N = 3 6.32260 6.33749 6.70028 MIN: 6.08 MIN: 6.17 MIN: 6.39 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 2 1 3 4K 8K 12K 16K 20K SE +/- 34.77, N = 3 SE +/- 49.39, N = 3 SE +/- 59.28, N = 3 19947.56 19803.76 19694.29 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Sunflow Rendering System Global Illumination + Image Synthesis OpenBenchmarking.org Seconds, Fewer Is Better Sunflow Rendering System 0.07.2 Global Illumination + Image Synthesis 3 2 1 0.5636 1.1272 1.6908 2.2544 2.818 SE +/- 0.009, N = 3 SE +/- 0.020, N = 3 SE +/- 0.008, N = 3 2.445 2.487 2.505 MIN: 2.34 / MAX: 3.15 MIN: 2.36 / MAX: 3.31 MIN: 2.38 / MAX: 3.2
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 3 2 1 30K 60K 90K 120K 150K SE +/- 224.91, N = 3 SE +/- 334.19, N = 3 SE +/- 280.09, N = 3 141520.58 141109.57 140877.18 1. (CC) gcc options: -O2 -lrt" -lrt
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile 1 3 2 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.17, N = 3 SE +/- 0.47, N = 3 121.03 121.21 122.20
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time 2 1 3 1.6M 3.2M 4.8M 6.4M 8M SE +/- 60261.05, N = 3 SE +/- 44996.61, N = 3 SE +/- 74914.67, N = 3 7444518 7419488 7392870 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth 3 2 1 2M 4M 6M 8M 10M SE +/- 85928.55, N = 3 SE +/- 81955.72, N = 3 SE +/- 69549.32, N = 3 11302838 11229182 11144382
Kvazaar Video Input: Bosphorus 4K - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Slow 3 2 1 0.4523 0.9046 1.3569 1.8092 2.2615 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.01 2.01 2.01 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium 3 2 1 0.4635 0.927 1.3905 1.854 2.3175 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.06 2.06 2.06 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Slow 3 2 1 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 8.79 8.77 8.75 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium 3 2 1 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 9.05 9.04 9.02 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast 3 2 1 1.269 2.538 3.807 5.076 6.345 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 5.64 5.62 5.62 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast 3 1 2 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 10.19 10.18 10.16 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast 3 2 1 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 22.76 22.72 22.68 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast 3 2 1 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 40.49 40.45 40.39 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K 2 1 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 6.14 6.11 6.09 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p 1 3 2 7 14 21 28 35 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 SE +/- 0.15, N = 3 28.06 27.94 27.90 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 1 2 3 1 0.065 0.13 0.195 0.26 0.325 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.289 0.288 0.288
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 5 3 2 1 0.1985 0.397 0.5955 0.794 0.9925 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 0.882 0.882 0.878
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 6 3 2 1 0.2666 0.5332 0.7998 1.0664 1.333 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 1.185 1.185 1.183
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 10 3 2 1 0.5969 1.1938 1.7907 2.3876 2.9845 SE +/- 0.008, N = 3 SE +/- 0.009, N = 3 SE +/- 0.002, N = 3 2.653 2.650 2.636
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Crown 2 3 1 1.0637 2.1274 3.1911 4.2548 5.3185 SE +/- 0.0150, N = 3 SE +/- 0.0010, N = 3 SE +/- 0.0107, N = 3 4.7276 4.7247 4.7126 MIN: 4.69 / MAX: 4.79 MIN: 4.71 / MAX: 4.76 MIN: 4.68 / MAX: 4.76
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Crown 1 2 3 1.2266 2.4532 3.6798 4.9064 6.133 SE +/- 0.0048, N = 3 SE +/- 0.0109, N = 3 SE +/- 0.0687, N = 3 5.4516 5.3988 5.3503 MIN: 5.43 / MAX: 5.51 MIN: 5.36 / MAX: 5.47 MIN: 5.16 / MAX: 5.47
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon 2 3 1 1.2769 2.5538 3.8307 5.1076 6.3845 SE +/- 0.0277, N = 3 SE +/- 0.0224, N = 3 SE +/- 0.0264, N = 3 5.6753 5.6695 5.6657 MIN: 5.59 / MAX: 5.79 MIN: 5.62 / MAX: 5.79 MIN: 5.59 / MAX: 5.79
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon 2 1 3 2 4 6 8 10 SE +/- 0.0255, N = 3 SE +/- 0.0262, N = 3 SE +/- 0.0223, N = 3 6.8597 6.8017 6.7998 MIN: 6.77 / MAX: 6.98 MIN: 6.73 / MAX: 6.95 MIN: 6.73 / MAX: 6.95
IndigoBench Acceleration: CPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Bedroom 3 2 1 0.1627 0.3254 0.4881 0.6508 0.8135 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.723 0.723 0.721
IndigoBench Acceleration: CPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar 3 2 1 0.3731 0.7462 1.1193 1.4924 1.8655 SE +/- 0.002, N = 3 SE +/- 0.006, N = 3 SE +/- 0.005, N = 3 1.658 1.655 1.654
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile 3 2 1 50 100 150 200 250 SE +/- 0.19, N = 3 SE +/- 0.21, N = 3 SE +/- 3.92, N = 3 238.06 240.88 242.55
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile 1 3 2 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.11, N = 3 82.31 82.33 82.48
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite 2 3 1 120K 240K 360K 480K 600K SE +/- 1210.22, N = 3 SE +/- 1507.55, N = 3 SE +/- 503.51, N = 3 575213 574799 574643
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 1 2 3 90K 180K 270K 360K 450K SE +/- 1505.49, N = 3 SE +/- 1060.79, N = 3 SE +/- 57.06, N = 3 411366.57 411227.16 408847.89 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 3 2 500K 1000K 1500K 2000K 2500K SE +/- 28970.62, N = 8 SE +/- 116386.47, N = 12 SE +/- 20132.66, N = 3 2368415.09 2024710.52 1438792.17 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 2 3 1 400K 800K 1200K 1600K 2000K SE +/- 50997.94, N = 15 SE +/- 56308.56, N = 15 SE +/- 57333.88, N = 15 1761393.52 1687783.91 1661804.39 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 3 2 1 300K 600K 900K 1200K 1500K SE +/- 3122.80, N = 3 SE +/- 10846.13, N = 3 SE +/- 23763.10, N = 3 1461476.54 1444723.54 1421632.92 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 3 2 500K 1000K 1500K 2000K 2500K SE +/- 41608.51, N = 3 SE +/- 26746.34, N = 3 SE +/- 51226.02, N = 15 2215813.50 2041822.96 1939149.38 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 2 3 1 400K 800K 1200K 1600K 2000K SE +/- 10885.22, N = 3 SE +/- 19626.30, N = 3 SE +/- 30456.48, N = 3 1671609.50 1668121.37 1660500.87 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Unpacking Firefox Extracting: firefox-84.0.source.tar.xz OpenBenchmarking.org Seconds, Fewer Is Better Unpacking Firefox 84.0 Extracting: firefox-84.0.source.tar.xz 1 2 3 5 10 15 20 25 SE +/- 0.19, N = 4 SE +/- 0.16, N = 4 SE +/- 0.22, N = 11 20.72 21.02 21.30
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 1 3 2 16 32 48 64 80 SE +/- 0.13, N = 3 SE +/- 0.19, N = 3 SE +/- 0.06, N = 3 72.27 72.53 72.63 1. (CC) gcc options: -O2 -ldl -lz -lpthread
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 1 3 2 8M 16M 24M 32M 40M SE +/- 250280.80, N = 3 SE +/- 365197.62, N = 3 SE +/- 317702.14, N = 3 37136699.5 37053570.5 37026672.1
Phoronix Test Suite v10.8.5