Ice Lake Ubuntu 20.04.2 / 20.10 / 21.04 Linux Benchmarks
Tests for a future article. 2 x Intel Xeon Platinum 8380 testing with an Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Clear Linux OS 34630 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2105195-IB-2105188IB39&grs .
System Details

Hardware:
- Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)
- Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)
- Chipset: Intel Device 0998
- Memory: 504GB
- Disk: 800GB INTEL SSDPF21Q800GB
- Graphics: ASPEED
- Monitor: VE228
- Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP

Software:
- Ubuntu 21.04: OS: Ubuntu 21.04 / Kernel: 5.11.0-17-generic (x86_64) / Desktop: GNOME Shell 3.38.4 / Display Server: X Server / Compiler: GCC 10.3.0 / File-System: ext4 / Screen Resolution: 1920x1080
- Ubuntu 20.04.2 LTS: OS: Ubuntu 20.04 / Kernel: 5.8.0-53-generic (x86_64) / Desktop: GNOME Shell 3.36.7 / Display Server: X Server 1.20.9 / Compiler: GCC 9.3.0
- Clear Linux 34630: OS: Clear Linux OS 34630 / Kernel: 5.10.19-1032.native (x86_64) / Desktop: GNOME Shell 40.0 / Display Server: X Server / Compiler: GCC 11.1.1 20210517 releases/gcc-11.1.0-132-g7d91dd2efb + Clang 11.1.0 + LLVM 11.1.0

Kernel Details
- Ubuntu 21.04: Transparent Huge Pages: madvise
- Ubuntu 20.04.2 LTS: Transparent Huge Pages: madvise
- Clear Linux 34630: Transparent Huge Pages: always

Compiler Details
- Ubuntu 21.04: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Ubuntu 20.04.2 LTS: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Clear Linux 34630: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=skylake-avx512

Disk Details
- Ubuntu 21.04: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
- Ubuntu 20.04.2 LTS: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
- Clear Linux 34630: MQ-DEADLINE / relatime,rw,stripe=256 / Block Size: 4096

Processor Details
- Ubuntu 21.04: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270
- Ubuntu 20.04.2 LTS: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270
- Clear Linux 34630: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270

Java Details
- Ubuntu 21.04: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2)
- Ubuntu 20.04.2 LTS: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04)
- Clear Linux 34630: OpenJDK Runtime Environment (build 1.8.0-u252-ga-b00)

Python Details
- Ubuntu 21.04: Python 3.9.4
- Ubuntu 20.04.2 LTS: Python 3.8.5
- Clear Linux 34630: Python 3.9.5

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

Environment Details
- Clear Linux 34630:
  FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags -Wa,-mbranches-within-32B-boundaries"
  CXXFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake -Wa,-mbranches-within-32B-boundaries -fvisibility-inlines-hidden -Wl,--enable-new-dtags"
  FCFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags"
  CFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake -Wa,-mbranches-within-32B-boundaries"
  THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx""
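The kernel and processor details above show two tunables that differ between the Ubuntu and Clear Linux runs: Transparent Huge Pages (madvise vs. always) and the intel_pstate scaling governor (powersave vs. performance). The following is a minimal Python sketch for checking both on a Linux machine before comparing results. It assumes the usual sysfs locations are present; the function names are illustrative and are not part of the Phoronix Test Suite.

```python
from pathlib import Path
from typing import Optional

# Usual Linux sysfs locations; they can be missing in containers or on other kernels.
THP_PATH = Path("/sys/kernel/mm/transparent_hugepage/enabled")
GOVERNOR_PATH = Path("/sys/devices/system/cpu/cpu0/cpufreq/scaling_governor")


def parse_thp_mode(text: str) -> Optional[str]:
    """Return the active THP mode, which the kernel marks with brackets,
    e.g. 'always [madvise] never' -> 'madvise'."""
    for token in text.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    return None


def read_text(path: Path) -> Optional[str]:
    """Read a sysfs file, returning None when it is missing or unreadable."""
    try:
        return path.read_text().strip()
    except OSError:
        return None


def current_settings() -> dict:
    """Collect the two settings; a value is None when it cannot be read."""
    thp_raw = read_text(THP_PATH)
    return {
        "transparent_hugepage": parse_thp_mode(thp_raw) if thp_raw else None,
        "scaling_governor": read_text(GOVERNOR_PATH),
    }


if __name__ == "__main__":
    for name, value in current_settings().items():
        print(f"{name}: {value if value is not None else 'unavailable'}")
```

Only the first CPU's governor is read here; a fuller check would probably iterate over every `cpu*/cpufreq` directory.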
Result Overview

Test | Ubuntu 21.04 | Ubuntu 20.04.2 LTS | Clear Linux 34630
dacapobench: Tradebeans | 16729 | 18060 | 4282
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 28.7517 | 32.6500 | 9.79872
kvazaar: Bosphorus 4K - Very Fast | 13.38 | 13.33 | 40.96
pjsip: OPTIONS, Stateful | 3815 | 3597 | 10360
pjsip: INVITE | 2531 | 1840 | 5296
libgav1: Summer Nature 4K | 19.39 | 19.31 | 54.59
x265: Bosphorus 1080p | 28.33 | 28.50 | 78.42
dacapobench: H2 | 10740 | 11235 | 4132
kvazaar: Bosphorus 4K - Medium | 6.90 | 6.93 | 17.80
svt-hevc: 10 - Bosphorus 1080p | 260.88 | 234.92 | 592.99
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 204.86 | 196.97 | 456.47
x265: Bosphorus 4K | 12.92 | 12.58 | 29.05
tungsten: Non-Exponential | 5.23912 | 5.16631 | 2.40790
svt-av1: Preset 8 - Bosphorus 4K | 29.809 | 28.139 | 60.273
kvazaar: Bosphorus 4K - Ultra Fast | 22.78 | 23.16 | 46.98
svt-hevc: 7 - Bosphorus 1080p | 172.02 | 157.78 | 315.80
tungsten: Water Caustic | 31.2739 | 32.4833 | 18.1712
build-apache: Time To Compile | 35.187 | 35.987 | 22.241
compress-zstd: 3 - Compression Speed | 6148.4 | 4576.8 | 7015.5
avifenc: 6 | 15.383 | 15.501 | 10.237
avifenc: 6, Lossless | 36.764 | 38.524 | 27.246
tensorflow-lite: Mobilenet Float | 41555.1 | 45998.8 | 32984.6
tensorflow-lite: Mobilenet Quant | 43074.5 | 46915.5 | 33677.2
openvkl: vklBenchmarkVdbVolume | 21915837 | 22077090 | 29605565
svt-hevc: 1 - Bosphorus 1080p | 29.57 | 29.14 | 38.51
build-php: Time To Compile | 40.207 | 40.723 | 31.104
phpbench: PHP Benchmark Suite | 715587 | 726623 | 900184
build-linux-kernel: Time To Compile | 24.813 | 25.960 | 20.736
liquid-dsp: 160 - 256 - 57 | 3080633333 | 3106666667 | 3843066667
openvkl: vklBenchmark | 669 | 661 | 815
tensorflow-lite: NASNet Mobile | 81319.3 | 90927.9 | 74734.9
compress-zstd: 19 - Decompression Speed | 2727.6 | 2740.7 | 2253.6
nwchem: C240 Buckyball | 1875.8 | 1825 | 2209.4
node-web-tooling: | 10.79 | 10.94 | 13.06
liquid-dsp: 32 - 256 - 57 | 1631166667 | 1653866667 | 1973100000
liquid-dsp: 16 - 256 - 57 | 826756667 | 832983333 | 999833333
liquid-dsp: 128 - 256 - 57 | 3290766667 | 3316633333 | 3970066667
blogbench: Read | 2253810 | 2224145 | 1908438
tungsten: Hair | 6.52538 | 6.45646 | 5.62201
ospray: NASA Streamlines - SciVis | 125 | 125 | 142.86
ospray: Magnetic Reconnection - SciVis | 109.63 | 110.19 | 125
compress-zstd: 19, Long Mode - Decompression Speed | 2719.4 | 2782.7 | 2447.0
onednn: Convolution Batch Shapes Auto - f32 - CPU | 1.41084 | 1.39626 | 1.58077
compress-zstd: 19, Long Mode - Compression Speed | 47.2 | 43.6 | 49.2
blogbench: Write | 60768 | 63062 | 56009
embree: Pathtracer ISPC - Crown | 67.0395 | 66.8866 | 73.8370
incompact3d: X3D-benchmarking input.i3d | 291.993968 | 319.487874 | 289.551473
ospray: San Miguel - SciVis | 90.91 | 90.91 | 100
compress-zstd: 19 - Compression Speed | 82.3 | 84.0 | 90.1
pjsip: OPTIONS, Stateless | 40082 | 40954 | 43880
coremark: CoreMark Size 666 - Iterations Per Second | 2347453.567360 | 2261054.029261 | 2452833.360945
tensorflow-lite: Inception V4 | 687019 | 716517 | 665666
onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU | 3.27051 | 3.47588 | 3.22981
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 0.231483 | 0.218363 | 0.216048
compress-zstd: 19 - Compression Speed | 82.2 | 83.0 | 88.0
blender: Barbershop - CPU-Only | 108.31 | 109.07 | 102.25
compress-zstd: 19, Long Mode - Compression Speed | 46.0 | 44.8 | 47.5
povray: Trace Time | 9.476 | 9.451 | 8.952
chia-vdf: Square Assembly Optimized | 147240 | 143600 | 151167
openvkl: vklBenchmarkUnstructuredVolume | 1776457 | 1737714 | 1828816
blender: Fishy Cat - CPU-Only | 45.72 | 46.11 | 43.83
mrbayes: Primate Phylogeny Analysis | 170.019 | 178.773 | 175.476
stockfish: Total Time | 180493580 | 173250319 | 182083926
incompact3d: input.i3d 193 Cells Per Direction | 11.3305505 | 11.7113234 | 11.1637303
blender: BMW27 - CPU-Only | 29.61 | 29.54 | 28.23
helsing: 14 digit | 82.465 | 82.638 | 79.118
rodinia: OpenMP LavaMD | 40.406 | 39.522 | 38.693
tensorflow-lite: SqueezeNet | 47808.3 | 48816.5 | 46761.2
onednn: IP Shapes 3D - bf16bf16bf16 - CPU | 1.82051 | 1.80328 | 1.74481
sysbench: RAM / Memory | 12176.80 | 12252.73 | 12670.82
gromacs: MPI CPU - water_GMX50_bare | 9.004 | 8.884 | 9.201
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 0.256555 | 0.249170 | 0.247725
npb: LU.C | 187807.08 | 189606.14 | 194386.02
embree: Pathtracer - Asian Dragon | 81.2911 | 82.6069 | 84.1301
pybench: Total For Average Test Times | 995 | 982 | 1015
chia-vdf: Square Plain C++ | 138967 | 139400 | 143567
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 684.271 | 667.646 | 663.896
ospray: NASA Streamlines - Path Tracer | 27.78 | 27.78 | 28.57
relion: Basic - CPU | 348.838 | 348.604 | 358.316
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 671.541 | 678.041 | 660.198
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 0.362841 | 0.353757 | 0.361521
onednn: IP Shapes 1D - f32 - CPU | 0.945450 | 0.942653 | 0.921930
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 0.193198 | 0.190921 | 0.188906
securemark: SecureMark-TLS | 230272 | 225195 | 229770
onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU | 0.615331 | 0.602936 | 0.614468
onednn: Recurrent Neural Network Training - f32 - CPU | 672.646 | 671.557 | 660.052
lammps: 20k Atoms | 35.725 | 36.300 | 35.632
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 0.846330 | 0.850400 | 0.862061
blender: Pabellon Barcelona - CPU-Only | 88.36 | 88.16 | 86.83
ospray: San Miguel - Path Tracer | 10.42 | 10.42 | 10.60
onednn: IP Shapes 1D - bf16bf16bf16 - CPU | 3.03516 | 3.01186 | 2.98572
onednn: IP Shapes 3D - f32 - CPU | 1.37903 | 1.37432 | 1.35679
embree: Pathtracer ISPC - Asian Dragon | 106.6858 | 106.1435 | 107.7726
onednn: Recurrent Neural Network Inference - f32 - CPU | 437.866 | 431.304 | 434.954
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 439.677 | 433.741 | 433.432
tensorflow-lite: Inception ResNet V2 | 569303 | 576988 | 572684
rodinia: OpenMP HotSpot3D | 105.650 | 104.315 | 104.278
hpcg: | 39.8231 | 39.9669 | 39.4821
blender: Classroom - CPU-Only | 72.28 | 71.92 | 71.51
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 437.972 | 433.426 | 433.798
onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU | 2.10597 | 2.10694 | 2.08771
sysbench: CPU | 213870.87 | 214228.62 | 212303.58
tnn: CPU - SqueezeNet v1.1 | 366.729 | 367.799 | 364.987
namd: ATPase Simulation - 327,506 Atoms | 0.27128 | 0.27083 | 0.27141
ospray: XFrog Forest - Path Tracer | 10.38 | 10.38 | 10.38
ospray: XFrog Forest - SciVis | 18.87 | 18.87 | 18.87
compress-zstd: 8 - Compression Speed | 2096.7 | 2128.7 | 2656.6
cassandra: Writes | 106863 | 103051 | 312015
tnn: CPU - MobileNet v2 | 443.547 | 374.203 | 354.298
pgbench: 100 - 250 - Read Only - Average Latency | 0.266 | 0.263 | 0.254
pgbench: 100 - 250 - Read Only | 946634 | 954214 | 990608
keydb: | 524178.09 | 421171.59 | 503991.58
tungsten: Volumetric Caustic | 13.4481 | 13.8745 | 13.8659
avifenc: 10, Lossless | 8.522 | 8.967 | 5.815
avifenc: 10 | 5.050 | 5.361 | 3.066
asmfish: 1024 Hash Memory, 26 Depth | 172711481 | 171613950 | 174596848
oidn: Memorial | 57.59 | 49.87 | 59.98
dacapobench: Jython | 5320 | 5518 | 3590
java-gradle-perf: Reactor | 366.363 | 382.988 | 301.413
lammps: Rhodopsin Protein | 23.866 | 22.084 | 31.153
rodinia: OpenMP Leukocyte | 62.044 | 60.933 | 40.815
npb: EP.C | 6259.88 | 6011.42 | 5425.40
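The overview mixes units and directions (some tests are "more is better", others "fewer is better"), so raw values cannot be averaged directly. One common way to summarize such a table is to normalize each test to a baseline system and take a geometric mean. The sketch below does this for five rows taken from the overview, with directions taken from the per-test graphs; the choice of rows, the Ubuntu 21.04 baseline, and the function names are assumptions made for illustration, not something the export itself computes.

```python
import math

SYSTEMS = ("Ubuntu 21.04", "Ubuntu 20.04.2 LTS", "Clear Linux 34630")

# (test, higher_is_better, value per system in SYSTEMS order); values copied from the overview.
SAMPLE = [
    ("kvazaar: Bosphorus 4K - Very Fast", True, (13.38, 13.33, 40.96)),
    ("x265: Bosphorus 1080p", True, (28.33, 28.50, 78.42)),
    ("build-apache: Time To Compile", False, (35.187, 35.987, 22.241)),
    ("compress-zstd: 19 - Decompression Speed", True, (2727.6, 2740.7, 2253.6)),
    ("blogbench: Read", True, (2253810, 2224145, 1908438)),
]


def relative_scores(values, higher_is_better, baseline_index=0):
    """Normalize one row so that the baseline is 1.0 and larger always means faster."""
    base = values[baseline_index]
    if higher_is_better:
        return [v / base for v in values]
    return [base / v for v in values]


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def summarize(rows, baseline_index=0):
    """Return {system: geometric mean of its normalized scores across rows}."""
    per_system = [[] for _ in SYSTEMS]
    for _name, higher_is_better, values in rows:
        scores = relative_scores(values, higher_is_better, baseline_index)
        for column, score in zip(per_system, scores):
            column.append(score)
    return {system: geometric_mean(col) for system, col in zip(SYSTEMS, per_system)}


if __name__ == "__main__":
    for system, score in summarize(SAMPLE).items():
        print(f"{system}: {score:.3f}x relative to {SYSTEMS[0]}")
```

A five-row sample says little about the full set of results; the same functions could be fed every row of the overview once each test's direction has been recorded.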
DaCapo Benchmark 9.12-MR1
Java Test: Tradebeans
msec, Fewer Is Better
- Ubuntu 21.04: 16729 (SE +/- 82.67, N = 4)
- Ubuntu 20.04.2 LTS: 18060 (SE +/- 203.38, N = 4)
- Clear Linux 34630: 4282 (SE +/- 51.71, N = 4)

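Because this result is "fewer is better", the speedup is the baseline time divided by the candidate time rather than the other way around. A small Python sketch of that arithmetic, using the Tradebeans values above (the `speedup` helper is a hypothetical name used only for illustration):

```python
def speedup(baseline: float, candidate: float, higher_is_better: bool) -> float:
    """Return how many times faster the candidate is than the baseline."""
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    return candidate / baseline if higher_is_better else baseline / candidate


# DaCapo Tradebeans, msec, fewer is better (values from the result above).
ubuntu_2104_ms = 16729
clear_linux_ms = 4282

ratio = speedup(ubuntu_2104_ms, clear_linux_ms, higher_is_better=False)
print(f"Clear Linux 34630 vs. Ubuntu 21.04: {ratio:.2f}x")
```

The same helper applies to throughput results by passing `higher_is_better=True`.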
oneDNN 2.1.2
Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
- Ubuntu 21.04: 28.75170 (SE +/- 0.28978, N = 15; MIN: 15.46)
- Ubuntu 20.04.2 LTS: 32.65000 (SE +/- 0.49102, N = 15; MIN: 9.3)
- Clear Linux 34630: 9.79872 (SE +/- 0.09474, N = 15; MIN: 6.28); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Very Fast
Frames Per Second, More Is Better
- Ubuntu 21.04: 13.38 (SE +/- 0.02, N = 3); extra flags: -O2
- Ubuntu 20.04.2 LTS: 13.33 (SE +/- 0.02, N = 3); extra flags: -O2
- Clear Linux 34630: 40.96 (SE +/- 0.09, N = 3); extra flags: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

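Every result in this export carries a standard error and a run count per system in the form "SE +/- x, N = y". For readers who want to pull those pairs out of the text programmatically, here is a minimal Python sketch; the regular expression and the function name are assumptions based only on how the entries appear in this file, not a documented Phoronix Test Suite format.

```python
import re
from typing import List, Tuple

# Matches entries such as "SE +/- 0.02, N = 3" as they appear in this export.
SE_PATTERN = re.compile(r"SE \+/- (\d+(?:\.\d+)?), N = (\d+)")


def parse_se_entries(text: str) -> List[Tuple[float, int]]:
    """Return (standard_error, run_count) pairs in the order they occur in text."""
    return [(float(se), int(n)) for se, n in SE_PATTERN.findall(text)]


# Sample taken from the Kvazaar "Very Fast" result.
sample = "SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 13.38 13.33 40.96"

for se, n in parse_se_entries(sample):
    print(f"standard error {se} over {n} runs")
```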
PJSIP 2.11
Method: OPTIONS, Stateful
Responses Per Second, More Is Better
- Ubuntu 21.04: 3815 (SE +/- 6.89, N = 3); extra flags: -lavformat -lavcodec -lswscale -lavutil -lasound -O2
- Ubuntu 20.04.2 LTS: 3597 (SE +/- 13.59, N = 3); extra flags: -lavformat -lavcodec -lswscale -lavutil -lasound -O2
- Clear Linux 34630: 10360 (SE +/- 1.53, N = 3); extra flags: -lopus -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -lSDL2 -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread

PJSIP 2.11
Method: INVITE
Responses Per Second, More Is Better
- Ubuntu 21.04: 2531 (SE +/- 22.31, N = 15); extra flags: -lavformat -lavcodec -lswscale -lavutil -lasound -O2
- Ubuntu 20.04.2 LTS: 1840 (SE +/- 14.31, N = 3); extra flags: -lavformat -lavcodec -lswscale -lavutil -lasound -O2
- Clear Linux 34630: 5296 (SE +/- 20.58, N = 3); extra flags: -lopus -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -lSDL2 -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread

libgav1 0.16.3
Video Input: Summer Nature 4K
FPS, More Is Better
- Ubuntu 21.04: 19.39 (SE +/- 0.15, N = 3)
- Ubuntu 20.04.2 LTS: 19.31 (SE +/- 0.07, N = 3)
- Clear Linux 34630: 54.59 (SE +/- 0.06, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CXX) g++ options: -O3 -lpthread -lrt

x265 3.4
Video Input: Bosphorus 1080p
Frames Per Second, More Is Better
- Ubuntu 21.04: 28.33 (SE +/- 0.34, N = 3)
- Ubuntu 20.04.2 LTS: 28.50 (SE +/- 0.25, N = 3)
- Clear Linux 34630: 78.42 (SE +/- 0.59, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

DaCapo Benchmark 9.12-MR1
Java Test: H2
msec, Fewer Is Better
- Ubuntu 21.04: 10740 (SE +/- 133.93, N = 4)
- Ubuntu 20.04.2 LTS: 11235 (SE +/- 115.71, N = 20)
- Clear Linux 34630: 4132 (SE +/- 53.70, N = 20)

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Medium
Frames Per Second, More Is Better
- Ubuntu 21.04: 6.90 (SE +/- 0.01, N = 3); extra flags: -O2
- Ubuntu 20.04.2 LTS: 6.93 (SE +/- 0.01, N = 3); extra flags: -O2
- Clear Linux 34630: 17.80 (SE +/- 0.03, N = 3); extra flags: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

SVT-HEVC 1.5.0
Tuning: 10 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
- Ubuntu 21.04: 260.88 (SE +/- 2.49, N = 3)
- Ubuntu 20.04.2 LTS: 234.92 (SE +/- 1.73, N = 3)
- Clear Linux 34630: 592.99 (SE +/- 5.64, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9 0.3
Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p
Frames Per Second, More Is Better
- Ubuntu 21.04: 204.86 (SE +/- 2.00, N = 15)
- Ubuntu 20.04.2 LTS: 196.97 (SE +/- 1.73, N = 3)
- Clear Linux 34630: 456.47 (SE +/- 2.91, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265 3.4
Video Input: Bosphorus 4K
Frames Per Second, More Is Better
- Ubuntu 21.04: 12.92 (SE +/- 0.09, N = 3)
- Ubuntu 20.04.2 LTS: 12.58 (SE +/- 0.09, N = 3)
- Clear Linux 34630: 29.05 (SE +/- 0.40, N = 15); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Tungsten Renderer 0.2.2
Scene: Non-Exponential
Seconds, Fewer Is Better
- Ubuntu 21.04: 5.23912 (SE +/- 0.07466, N = 3); extra flags: -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz
- Ubuntu 20.04.2 LTS: 5.16631 (SE +/- 0.02249, N = 3); extra flags: -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz
- Clear Linux 34630: 2.40790 (SE +/- 0.00072, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -ljpeg -lpthread -ldl

SVT-AV1 0.8.7
Encoder Mode: Preset 8 - Input: Bosphorus 4K
Frames Per Second, More Is Better
- Ubuntu 21.04: 29.81 (SE +/- 0.34, N = 3)
- Ubuntu 20.04.2 LTS: 28.14 (SE +/- 0.34, N = 15)
- Clear Linux 34630: 60.27 (SE +/- 0.42, N = 3); extra flags: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Ultra Fast
Frames Per Second, More Is Better
- Ubuntu 21.04: 22.78 (SE +/- 0.11, N = 3); extra flags: -O2
- Ubuntu 20.04.2 LTS: 23.16 (SE +/- 0.24, N = 3); extra flags: -O2
- Clear Linux 34630: 46.98 (SE +/- 0.34, N = 12); extra flags: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -lpthread -lm -lrt

SVT-HEVC 1.5.0
Tuning: 7 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
- Ubuntu 21.04: 172.02 (SE +/- 1.72, N = 3)
- Ubuntu 20.04.2 LTS: 157.78 (SE +/- 1.68, N = 3)
- Clear Linux 34630: 315.80 (SE +/- 2.34, N = 15); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Tungsten Renderer 0.2.2
Scene: Water Caustic
Seconds, Fewer Is Better
- Ubuntu 21.04: 31.27 (SE +/- 0.27, N = 3); extra flags: -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz
- Ubuntu 20.04.2 LTS: 32.48 (SE +/- 0.19, N = 3); extra flags: -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz
- Clear Linux 34630: 18.17 (SE +/- 0.04, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -ljpeg -lpthread -ldl

Timed Apache Compilation 2.4.41
Time To Compile
Seconds, Fewer Is Better
- Ubuntu 21.04: 35.19 (SE +/- 0.05, N = 3)
- Ubuntu 20.04.2 LTS: 35.99 (SE +/- 0.13, N = 3)
- Clear Linux 34630: 22.24 (SE +/- 0.01, N = 3)

Zstd Compression 1.5.0
Compression Level: 3 - Compression Speed
MB/s, More Is Better
- Ubuntu 21.04: 6148.4 (SE +/- 53.61, N = 3); extra flags: -llzma
- Ubuntu 20.04.2 LTS: 4620.1 (SE +/- 7.13, N = 3); extra flags: -llzma
- Clear Linux 34630: 7166.7 (SE +/- 61.86, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -O3 -pthread -lz

libavif avifenc 0.9.0
Encoder Speed: 6
Seconds, Fewer Is Better
- Ubuntu 21.04: 15.38 (SE +/- 0.19, N = 4)
- Ubuntu 20.04.2 LTS: 15.50 (SE +/- 0.14, N = 15)
- Clear Linux 34630: 10.24 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc 0.9.0
Encoder Speed: 6, Lossless
Seconds, Fewer Is Better
- Ubuntu 21.04: 36.76 (SE +/- 0.47, N = 3)
- Ubuntu 20.04.2 LTS: 38.52 (SE +/- 0.38, N = 3)
- Clear Linux 34630: 27.25 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

TensorFlow Lite 2020-08-23
Model: Mobilenet Float
Microseconds, Fewer Is Better
- Ubuntu 21.04: 41555.1 (SE +/- 612.15, N = 15)
- Ubuntu 20.04.2 LTS: 45998.8 (SE +/- 630.61, N = 15)
- Clear Linux 34630: 32984.6 (SE +/- 68.45, N = 3)

TensorFlow Lite 2020-08-23
Model: Mobilenet Quant
Microseconds, Fewer Is Better
- Ubuntu 21.04: 43074.5 (SE +/- 601.06, N = 15)
- Ubuntu 20.04.2 LTS: 46915.5 (SE +/- 706.39, N = 15)
- Clear Linux 34630: 33677.2 (SE +/- 60.78, N = 3)

OpenVKL 0.9
Benchmark: vklBenchmarkVdbVolume
Items / Sec, More Is Better
- Ubuntu 21.04: 21915837 (SE +/- 58341.71, N = 3; MIN: 1062215 / MAX: 153704880)
- Ubuntu 20.04.2 LTS: 22077090 (SE +/- 187763.97, N = 3; MIN: 1030467 / MAX: 169859376)
- Clear Linux 34630: 29605565 (SE +/- 324102.01, N = 3; MIN: 1030715 / MAX: 177194520)

SVT-HEVC 1.5.0
Tuning: 1 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
- Ubuntu 21.04: 29.57 (SE +/- 0.09, N = 3)
- Ubuntu 20.04.2 LTS: 29.14 (SE +/- 0.33, N = 3)
- Clear Linux 34630: 38.51 (SE +/- 0.24, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Timed PHP Compilation 7.4.2
Time To Compile
Seconds, Fewer Is Better
- Ubuntu 21.04: 40.21 (SE +/- 0.52, N = 3)
- Ubuntu 20.04.2 LTS: 40.72 (SE +/- 0.02, N = 3)
- Clear Linux 34630: 31.10 (SE +/- 0.10, N = 3)

PHPBench 0.8.1
PHP Benchmark Suite
Score, More Is Better
- Ubuntu 21.04: 715587 (SE +/- 3328.66, N = 3)
- Ubuntu 20.04.2 LTS: 726623 (SE +/- 5472.66, N = 3)
- Clear Linux 34630: 900184 (SE +/- 1642.74, N = 3)

Timed Linux Kernel Compilation 5.10.20
Time To Compile
Seconds, Fewer Is Better
- Ubuntu 21.04: 24.81 (SE +/- 0.35, N = 13)
- Ubuntu 20.04.2 LTS: 25.96 (SE +/- 0.36, N = 12)
- Clear Linux 34630: 20.74 (SE +/- 0.17, N = 9)

Liquid-DSP 2021.01.31
Threads: 160 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
- Ubuntu 21.04: 3080633333 (SE +/- 15087448.79, N = 3)
- Ubuntu 20.04.2 LTS: 3106666667 (SE +/- 11545032.60, N = 3)
- Clear Linux 34630: 3843066667 (SE +/- 13074699.91, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenVKL 0.9
Benchmark: vklBenchmark
Items / Sec, More Is Better
- Ubuntu 21.04: 669 (SE +/- 2.40, N = 3; MIN: 1 / MAX: 2821)
- Ubuntu 20.04.2 LTS: 661 (SE +/- 5.81, N = 3; MIN: 1 / MAX: 2845)
- Clear Linux 34630: 815 (SE +/- 2.08, N = 3; MIN: 1 / MAX: 3198)

TensorFlow Lite 2020-08-23
Model: NASNet Mobile
Microseconds, Fewer Is Better
- Ubuntu 21.04: 81319.3 (SE +/- 768.19, N = 15)
- Ubuntu 20.04.2 LTS: 90927.9 (SE +/- 1128.79, N = 4)
- Clear Linux 34630: 74734.9 (SE +/- 102.89, N = 3)

Zstd Compression 1.4.9
Compression Level: 19 - Decompression Speed
MB/s, More Is Better
- Ubuntu 21.04: 2727.6 (SE +/- 12.48, N = 3); extra flags: -llzma
- Ubuntu 20.04.2 LTS: 2740.7 (SE +/- 9.85, N = 3); extra flags: -llzma
- Clear Linux 34630: 2253.6 (SE +/- 4.09, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -O3 -pthread -lz

NWChem 7.0.2
Input: C240 Buckyball
Seconds, Fewer Is Better
- Ubuntu 21.04: 1875.8
- Ubuntu 20.04.2 LTS: 1825.0
- Clear Linux 34630: 2209.4
Extra flags (listed once; the export does not identify the system): -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lz
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lm -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Node.js V8 Web Tooling Benchmark
runs/s, More Is Better
- Ubuntu 21.04: 10.79 (SE +/- 0.07, N = 3); Nodejs v12.21.0
- Ubuntu 20.04.2 LTS: 10.94 (SE +/- 0.12, N = 15); Nodejs v10.19.0
- Clear Linux 34630: 13.06 (SE +/- 0.01, N = 3); Nodejs v14.17.0

Liquid-DSP 2021.01.31
Threads: 32 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
- Ubuntu 21.04: 1631166667 (SE +/- 768837.51, N = 3)
- Ubuntu 20.04.2 LTS: 1653866667 (SE +/- 5634516.64, N = 3)
- Clear Linux 34630: 1973100000 (SE +/- 16877598.57, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP 2021.01.31
Threads: 16 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
- Ubuntu 21.04: 826756667 (SE +/- 702242.44, N = 3)
- Ubuntu 20.04.2 LTS: 832983333 (SE +/- 5004752.19, N = 3)
- Clear Linux 34630: 999833333 (SE +/- 2729061.70, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP 2021.01.31
Threads: 128 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
- Ubuntu 21.04: 3290766667 (SE +/- 9583724.63, N = 3)
- Ubuntu 20.04.2 LTS: 3316633333 (SE +/- 8772368.23, N = 3)
- Clear Linux 34630: 3970066667 (SE +/- 36096414.10, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

BlogBench 1.1
Test: Read
Final Score, More Is Better
- Ubuntu 21.04: 2253810 (SE +/- 30037.66, N = 3); extra flags: -O2
- Ubuntu 20.04.2 LTS: 2224145 (SE +/- 21799.22, N = 9); extra flags: -O2
- Clear Linux 34630: 1908438 (SE +/- 12916.51, N = 3); extra flags: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CC) gcc options: -pthread

Tungsten Renderer 0.2.2
Scene: Hair
Seconds, Fewer Is Better
- Ubuntu 21.04: 6.52538 (SE +/- 0.07388, N = 3); extra flags: -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz
- Ubuntu 20.04.2 LTS: 6.45646 (SE +/- 0.08029, N = 15); extra flags: -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz
- Clear Linux 34630: 5.62201 (SE +/- 0.02021, N = 3); extra flags: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -ljpeg -lpthread -ldl

OSPray 1.8.5
Demo: NASA Streamlines - Renderer: SciVis
FPS, More Is Better
- Ubuntu 21.04: 125.00 (MIN: 31.25 / MAX: 142.86)
- Ubuntu 20.04.2 LTS: 125.00 (MIN: 33.33 / MAX: 142.86)
- Clear Linux 34630: 142.86 (MIN: 41.67)
SE +/- 0.00, N = 3 (listed once in the export)

OSPray 1.8.5
Demo: Magnetic Reconnection - Renderer: SciVis
FPS, More Is Better
- Ubuntu 21.04: 109.63 (SE +/- 1.01, N = 15; MIN: 20.83 / MAX: 125)
- Ubuntu 20.04.2 LTS: 110.19 (SE +/- 0.93, N = 12; MIN: 21.74 / MAX: 111.11)
- Clear Linux 34630: 125.00 (SE +/- 0.00, N = 3; MIN: 90.91)

Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 2719.4 (SE +/- 1.06, N = 3)
  Ubuntu 20.04.2 LTS: 2782.7 (SE +/- 5.43, N = 15)
  Clear Linux 34630: 2447.0 (SE +/- 7.42, N = 3)
  Per-system flags (as exported): -llzma -llzma -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -O3 -pthread -lz
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 1.41084 (SE +/- 0.00589, N = 3; MIN: 1.27)
  Ubuntu 20.04.2 LTS: 1.39626 (SE +/- 0.00638, N = 3; MIN: 1.26)
  Clear Linux 34630: 1.58077 (SE +/- 0.01755, N = 3; MIN: 1.38)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 47.2 (SE +/- 0.33, N = 3)
  Ubuntu 20.04.2 LTS: 43.6 (SE +/- 0.39, N = 15)
  Clear Linux 34630: 49.2 (SE +/- 0.18, N = 3)
  Per-system flags (as exported): -llzma -llzma -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -O3 -pthread -lz
BlogBench 1.1 - Test: Write (Final Score, More Is Better)
  Ubuntu 21.04: 60768 (SE +/- 717.36, N = 3)
  Ubuntu 20.04.2 LTS: 63062 (SE +/- 770.50, N = 3)
  Clear Linux 34630: 56009 (SE +/- 74.61, N = 3)
  Per-system flags (as exported): -O2 -O2 -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -pthread
Embree 3.13 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
  Ubuntu 21.04: 67.04 (SE +/- 0.30, N = 3; MIN: 59.72 / MAX: 87.99)
  Ubuntu 20.04.2 LTS: 66.89 (SE +/- 0.70, N = 3; MIN: 59.4 / MAX: 88.24)
  Clear Linux 34630: 73.84 (SE +/- 0.68, N = 3; MIN: 64.62 / MAX: 94.03)
Xcompact3d Incompact3d 2021-03-11 - Input: X3D-benchmarking input.i3d (Seconds, Fewer Is Better)
  Ubuntu 21.04: 291.99 (SE +/- 1.18, N = 3)
  Ubuntu 20.04.2 LTS: 319.49 (SE +/- 3.10, N = 3)
  Clear Linux 34630: 289.55 (SE +/- 0.46, N = 3)
  Per-system flags (as exported): -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
OSPray 1.8.5 - Demo: San Miguel - Renderer: SciVis (FPS, More Is Better)
  Ubuntu 21.04: 90.91 (SE +/- 0.00, N = 3; MIN: 55.56 / MAX: 100)
  Ubuntu 20.04.2 LTS: 90.91 (SE +/- 0.00, N = 3; MIN: 55.56 / MAX: 100)
  Clear Linux 34630: 100.00 (SE +/- 0.00, N = 3; MIN: 52.63 / MAX: 111.11)
Zstd Compression 1.4.9 - Compression Level: 19 - Compression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 82.3 (SE +/- 0.81, N = 3)
  Ubuntu 20.04.2 LTS: 84.0 (SE +/- 0.68, N = 3)
  Clear Linux 34630: 90.1 (SE +/- 0.32, N = 3)
  Per-system flags (as exported): -llzma -llzma -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -O3 -pthread -lz
PJSIP 2.11 - Method: OPTIONS, Stateless (Responses Per Second, More Is Better)
  Ubuntu 21.04: 40082 (SE +/- 533.56, N = 3)
  Ubuntu 20.04.2 LTS: 40954 (SE +/- 352.27, N = 3)
  Clear Linux 34630: 43880 (SE +/- 270.37, N = 3)
  Per-system flags (as exported): -lavformat -lavcodec -lswscale -lavutil -lasound -O2 -lavformat -lavcodec -lswscale -lavutil -lasound -O2 -lopus -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -lSDL2 -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread
Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
  Ubuntu 21.04: 2347453.57 (SE +/- 2872.35, N = 3)
  Ubuntu 20.04.2 LTS: 2261054.03 (SE +/- 24473.66, N = 5)
  Clear Linux 34630: 2452833.36 (SE +/- 6737.83, N = 3)
  Per-system flags (as exported): -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -O2 -lrt" -lrt
TensorFlow Lite 2020-08-23 - Model: Inception V4 (Microseconds, Fewer Is Better)
  Ubuntu 21.04: 687019 (SE +/- 7233.85, N = 5)
  Ubuntu 20.04.2 LTS: 716517 (SE +/- 2002.95, N = 3)
  Clear Linux 34630: 665666 (SE +/- 6494.21, N = 3)
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 3.27051 (SE +/- 0.00929, N = 3; MIN: 3.09)
  Ubuntu 20.04.2 LTS: 3.47588 (SE +/- 0.00414, N = 3; MIN: 3.31)
  Clear Linux 34630: 3.22981 (SE +/- 0.00331, N = 3; MIN: 3.02)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.231483 (SE +/- 0.002322, N = 6; MIN: 0.2)
  Ubuntu 20.04.2 LTS: 0.218363 (SE +/- 0.001876, N = 8; MIN: 0.19)
  Clear Linux 34630: 0.216048 (SE +/- 0.001684, N = 3; MIN: 0.19)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Zstd Compression 1.5.0 - Compression Level: 19 - Compression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 82.2 (SE +/- 1.03, N = 15)
  Ubuntu 20.04.2 LTS: 83.0 (SE +/- 0.70, N = 8)
  Clear Linux 34630: 88.0 (SE +/- 0.66, N = 3)
  Per-system flags (as exported): -llzma -llzma -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -O3 -pthread -lz
Blender 2.92 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
  Ubuntu 21.04: 108.31 (SE +/- 0.26, N = 3)
  Ubuntu 20.04.2 LTS: 109.07 (SE +/- 0.27, N = 3)
  Clear Linux 34630: 102.25 (SE +/- 0.12, N = 3)
Zstd Compression 1.5.0 - Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 46.0 (SE +/- 0.54, N = 15)
  Ubuntu 20.04.2 LTS: 44.8 (SE +/- 0.61, N = 15)
  Clear Linux 34630: 47.5 (SE +/- 0.64, N = 3)
  Per-system flags (as exported): -llzma -llzma -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -O3 -pthread -lz
POV-Ray 3.7.0.7 - Trace Time (Seconds, Fewer Is Better)
  Ubuntu 21.04: 9.476 (SE +/- 0.097, N = 5)
  Ubuntu 20.04.2 LTS: 9.451 (SE +/- 0.084, N = 3)
  Clear Linux 34630: 8.952 (SE +/- 0.016, N = 3)
  Per-system flags (as exported): -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -R/usr/lib -lSDL -lpthread
  1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Chia Blockchain VDF 1.0.1 - Test: Square Assembly Optimized (IPS, More Is Better)
  Ubuntu 21.04: 147240 (SE +/- 1568.63, N = 5)
  Ubuntu 20.04.2 LTS: 143600 (SE +/- 680.69, N = 3)
  Clear Linux 34630: 151167 (SE +/- 240.37, N = 3)
  1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
OpenVKL 0.9 - Benchmark: vklBenchmarkUnstructuredVolume (Items / Sec, More Is Better)
  Ubuntu 21.04: 1776457 (SE +/- 7183.20, N = 3; MIN: 24424 / MAX: 5906521)
  Ubuntu 20.04.2 LTS: 1737714 (SE +/- 17974.50, N = 3; MIN: 21419 / MAX: 5849413)
  Clear Linux 34630: 1828816 (SE +/- 13533.55, N = 3; MIN: 25882 / MAX: 5901670)
Blender 2.92 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
  Ubuntu 21.04: 45.72 (SE +/- 0.15, N = 3)
  Ubuntu 20.04.2 LTS: 46.11 (SE +/- 0.17, N = 3)
  Clear Linux 34630: 43.83 (SE +/- 0.01, N = 3)
Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis (Seconds, Fewer Is Better)
  Ubuntu 21.04: 170.02 (SE +/- 0.83, N = 3)
  Ubuntu 20.04.2 LTS: 178.77 (SE +/- 0.79, N = 3)
  Clear Linux 34630: 175.48 (SE +/- 1.06, N = 3)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
Stockfish 13 - Total Time (Nodes Per Second, More Is Better)
  Ubuntu 21.04: 180493580 (SE +/- 2129761.70, N = 15)
  Ubuntu 20.04.2 LTS: 173250319 (SE +/- 1908422.73, N = 4)
  Clear Linux 34630: 182083926 (SE +/- 1887131.91, N = 3)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
  Ubuntu 21.04: 11.33 (SE +/- 0.04, N = 3)
  Ubuntu 20.04.2 LTS: 11.71 (SE +/- 0.03, N = 3)
  Clear Linux 34630: 11.16 (SE +/- 0.01, N = 3)
  Per-system flags (as exported): -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
Blender 2.92 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
  Ubuntu 21.04: 29.61 (SE +/- 0.04, N = 3)
  Ubuntu 20.04.2 LTS: 29.54 (SE +/- 0.03, N = 3)
  Clear Linux 34630: 28.23 (SE +/- 0.10, N = 3)
Helsing 1.0-beta - Digit Range: 14 digit (Seconds, Fewer Is Better)
  Ubuntu 21.04: 82.47 (SE +/- 0.73, N = 15)
  Ubuntu 20.04.2 LTS: 82.64 (SE +/- 0.11, N = 3)
  Clear Linux 34630: 79.12 (SE +/- 0.09, N = 3)
  1. (CC) gcc options: -O2 -pthread
Rodinia 3.1 - Test: OpenMP LavaMD (Seconds, Fewer Is Better)
  Ubuntu 21.04: 40.41 (SE +/- 0.15, N = 3)
  Ubuntu 20.04.2 LTS: 39.52 (SE +/- 0.07, N = 3)
  Clear Linux 34630: 38.69 (SE +/- 0.46, N = 4)
  1. (CXX) g++ options: -O2 -lOpenCL
TensorFlow Lite 2020-08-23 - Model: SqueezeNet (Microseconds, Fewer Is Better)
  Ubuntu 21.04: 47808.3 (SE +/- 35.00, N = 3)
  Ubuntu 20.04.2 LTS: 48816.5 (SE +/- 510.23, N = 15)
  Clear Linux 34630: 46761.2 (SE +/- 60.24, N = 3)
oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 1.82051 (SE +/- 0.01169, N = 14; MIN: 1.67)
  Ubuntu 20.04.2 LTS: 1.80328 (SE +/- 0.00660, N = 3; MIN: 1.67)
  Clear Linux 34630: 1.74481 (SE +/- 0.00330, N = 3; MIN: 1.61)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Sysbench 1.0.20 - Test: RAM / Memory (MiB/sec, More Is Better)
  Ubuntu 21.04: 12176.80 (SE +/- 145.91, N = 15)
  Ubuntu 20.04.2 LTS: 12252.73 (SE +/- 138.52, N = 15)
  Clear Linux 34630: 12670.82 (SE +/- 126.52, N = 3)
  Per-system flags (as exported): -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
GROMACS 2021.2 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
  Ubuntu 21.04: 9.004 (SE +/- 0.058, N = 3)
  Ubuntu 20.04.2 LTS: 8.884 (SE +/- 0.033, N = 3)
  Clear Linux 34630: 9.201 (SE +/- 0.028, N = 3)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -pthread
oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.256555 (SE +/- 0.002948, N = 3; MIN: 0.23)
  Ubuntu 20.04.2 LTS: 0.249170 (SE +/- 0.003241, N = 3; MIN: 0.22)
  Clear Linux 34630: 0.247725 (SE +/- 0.001913, N = 3; MIN: 0.22)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
  Ubuntu 21.04: 187807.08 (SE +/- 419.66, N = 3)
  Ubuntu 20.04.2 LTS: 189606.14 (SE +/- 1295.33, N = 3)
  Clear Linux 34630: 194386.02 (SE +/- 301.93, N = 3)
  Per-system flags (as exported): -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Ubuntu 21.04: Open MPI 4.1.0
  3. Ubuntu 20.04.2 LTS: Open MPI 4.0.3
  4. Clear Linux 34630: 3.2
Embree 3.13 - Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
  Ubuntu 21.04: 81.29 (SE +/- 0.58, N = 3; MIN: 67.06 / MAX: 91.79)
  Ubuntu 20.04.2 LTS: 82.61 (SE +/- 0.68, N = 9; MIN: 67 / MAX: 93.51)
  Clear Linux 34630: 84.13 (SE +/- 1.14, N = 3; MIN: 68.3 / MAX: 93.51)
PyBench 2018-02-16 - Total For Average Test Times (Milliseconds, Fewer Is Better)
  Ubuntu 21.04: 995 (SE +/- 1.86, N = 3)
  Ubuntu 20.04.2 LTS: 982 (SE +/- 0.88, N = 3)
  Clear Linux 34630: 1015 (SE +/- 1.00, N = 3)
Chia Blockchain VDF 1.0.1 - Test: Square Plain C++ (IPS, More Is Better)
  Ubuntu 21.04: 138967 (SE +/- 88.19, N = 3)
  Ubuntu 20.04.2 LTS: 139400 (SE +/- 585.95, N = 3)
  Clear Linux 34630: 143567 (SE +/- 33.33, N = 3)
  1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 684.27 (SE +/- 5.64, N = 15; MIN: 645.4)
  Ubuntu 20.04.2 LTS: 667.65 (SE +/- 1.50, N = 3; MIN: 642.94)
  Clear Linux 34630: 663.90 (SE +/- 3.59, N = 3; MIN: 627.72)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OSPray 1.8.5 - Demo: NASA Streamlines - Renderer: Path Tracer (FPS, More Is Better)
  Ubuntu 21.04: 27.78 (SE +/- 0.00, N = 3; MIN: 17.24 / MAX: 29.41)
  Ubuntu 20.04.2 LTS: 27.78 (SE +/- 0.00, N = 3; MIN: 16.67 / MAX: 29.41)
  Clear Linux 34630: 28.57 (SE +/- 0.00, N = 3; MIN: 18.87 / MAX: 30.3)
RELION 3.1.1 - Test: Basic - Device: CPU (Seconds, Fewer Is Better)
  Ubuntu 21.04: 348.84 (SE +/- 1.61, N = 3)
  Ubuntu 20.04.2 LTS: 348.60 (SE +/- 0.74, N = 3)
  Clear Linux 34630: 358.32 (SE +/- 0.17, N = 3)
  Per-system flags (as exported): -lmpi_cxx -lmpi_cxx -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi
oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 671.54 (SE +/- 0.16, N = 3; MIN: 647.67)
  Ubuntu 20.04.2 LTS: 678.04 (SE +/- 4.59, N = 15; MIN: 638.98)
  Clear Linux 34630: 660.20 (SE +/- 0.41, N = 3; MIN: 628.82)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.362841 (SE +/- 0.004351, N = 3; MIN: 0.32)
  Ubuntu 20.04.2 LTS: 0.353757 (SE +/- 0.003468, N = 3; MIN: 0.31)
  Clear Linux 34630: 0.361521 (SE +/- 0.001144, N = 3; MIN: 0.32)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.945450 (SE +/- 0.010679, N = 4; MIN: 0.86)
  Ubuntu 20.04.2 LTS: 0.942653 (SE +/- 0.011491, N = 3; MIN: 0.85)
  Clear Linux 34630: 0.921930 (SE +/- 0.004580, N = 3; MIN: 0.84)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.193198 (SE +/- 0.001836, N = 13; MIN: 0.18)
  Ubuntu 20.04.2 LTS: 0.190921 (SE +/- 0.001679, N = 14; MIN: 0.18)
  Clear Linux 34630: 0.188906 (SE +/- 0.000520, N = 3; MIN: 0.18)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SecureMark 1.0.4 - Benchmark: SecureMark-TLS (marks, More Is Better)
  Ubuntu 21.04: 230272 (SE +/- 178.37, N = 3)
  Ubuntu 20.04.2 LTS: 225195 (SE +/- 124.15, N = 3)
  Clear Linux 34630: 229770 (SE +/- 121.72, N = 3)
  1. (CC) gcc options: -pedantic -O3
oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.615331 (SE +/- 0.004554, N = 3; MIN: 0.57)
  Ubuntu 20.04.2 LTS: 0.602936 (SE +/- 0.002882, N = 3; MIN: 0.56)
  Clear Linux 34630: 0.614468 (SE +/- 0.001093, N = 3; MIN: 0.56)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 672.65 (SE +/- 1.73, N = 3; MIN: 646.15)
  Ubuntu 20.04.2 LTS: 671.56 (SE +/- 5.68, N = 8; MIN: 639.68)
  Clear Linux 34630: 660.05 (SE +/- 2.16, N = 3; MIN: 626.35)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: 20k Atoms (ns/day, More Is Better)
  Ubuntu 21.04: 35.73 (SE +/- 0.03, N = 3)
  Ubuntu 20.04.2 LTS: 36.30 (SE +/- 0.04, N = 3)
  Clear Linux 34630: 35.63 (SE +/- 0.05, N = 3)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -pthread -lm
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.846330 (SE +/- 0.008374, N = 6; MIN: 0.8)
  Ubuntu 20.04.2 LTS: 0.850400 (SE +/- 0.008602, N = 6; MIN: 0.8)
  Clear Linux 34630: 0.862061 (SE +/- 0.000743, N = 3; MIN: 0.81)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Blender 2.92 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
  Ubuntu 21.04: 88.36 (SE +/- 0.24, N = 3)
  Ubuntu 20.04.2 LTS: 88.16 (SE +/- 0.21, N = 3)
  Clear Linux 34630: 86.83 (SE +/- 0.08, N = 3)
OSPray 1.8.5 - Demo: San Miguel - Renderer: Path Tracer (FPS, More Is Better)
  Ubuntu 21.04: 10.42 (SE +/- 0.00, N = 3; MIN: 7.87 / MAX: 10.64)
  Ubuntu 20.04.2 LTS: 10.42 (SE +/- 0.00, N = 3; MIN: 7.94 / MAX: 10.53)
  Clear Linux 34630: 10.60 (SE +/- 0.04, N = 3; MIN: 7.58 / MAX: 10.87)
oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 3.03516 (SE +/- 0.02675, N = 12; MIN: 2.86)
  Ubuntu 20.04.2 LTS: 3.01186 (SE +/- 0.03357, N = 3; MIN: 2.84)
  Clear Linux 34630: 2.98572 (SE +/- 0.00391, N = 3; MIN: 2.85)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 1.37903 (SE +/- 0.01131, N = 3; MIN: 1.33)
  Ubuntu 20.04.2 LTS: 1.37432 (SE +/- 0.01216, N = 3; MIN: 1.32)
  Clear Linux 34630: 1.35679 (SE +/- 0.00192, N = 3; MIN: 1.31)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Embree 3.13 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
  Ubuntu 21.04: 106.69 (SE +/- 0.84, N = 10; MIN: 85.33 / MAX: 112.78)
  Ubuntu 20.04.2 LTS: 106.14 (SE +/- 0.91, N = 3; MIN: 84.21 / MAX: 111.87)
  Clear Linux 34630: 107.77 (SE +/- 1.45, N = 3; MIN: 85.87 / MAX: 112.96)
oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 437.87 (SE +/- 0.63, N = 3; MIN: 422.67)
  Ubuntu 20.04.2 LTS: 431.30 (SE +/- 1.48, N = 3; MIN: 414.55)
  Clear Linux 34630: 434.95 (SE +/- 1.15, N = 3; MIN: 417.43)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 439.68 (SE +/- 0.53, N = 3; MIN: 424.75)
  Ubuntu 20.04.2 LTS: 433.74 (SE +/- 1.79, N = 3; MIN: 416.67)
  Clear Linux 34630: 433.43 (SE +/- 2.18, N = 3; MIN: 414.35)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
TensorFlow Lite 2020-08-23 - Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
  Ubuntu 21.04: 569303 (SE +/- 790.57, N = 3)
  Ubuntu 20.04.2 LTS: 576988 (SE +/- 5213.69, N = 3)
  Clear Linux 34630: 572684 (SE +/- 4180.58, N = 3)
Rodinia 3.1 - Test: OpenMP HotSpot3D (Seconds, Fewer Is Better)
  Ubuntu 21.04: 105.65 (SE +/- 0.57, N = 3)
  Ubuntu 20.04.2 LTS: 104.32 (SE +/- 0.14, N = 3)
  Clear Linux 34630: 104.28 (SE +/- 0.11, N = 3)
  1. (CXX) g++ options: -O2 -lOpenCL
High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
  Ubuntu 21.04: 39.82 (SE +/- 0.03, N = 3)
  Ubuntu 20.04.2 LTS: 39.97 (SE +/- 0.19, N = 3)
  Clear Linux 34630: 39.48 (SE +/- 0.01, N = 3)
  Per-system flags (as exported): -lmpi_cxx -lmpi_cxx
  1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi
Blender 2.92 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
  Ubuntu 21.04: 72.28 (SE +/- 0.03, N = 3)
  Ubuntu 20.04.2 LTS: 71.92 (SE +/- 0.10, N = 3)
  Clear Linux 34630: 71.51 (SE +/- 0.10, N = 3)
oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 437.97 (SE +/- 0.96, N = 3; MIN: 422.42)
  Ubuntu 20.04.2 LTS: 433.43 (SE +/- 1.72, N = 3; MIN: 414.41)
  Clear Linux 34630: 433.80 (SE +/- 1.50, N = 3; MIN: 414.44)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 2.10597 (SE +/- 0.01823, N = 3; MIN: 2.03)
  Ubuntu 20.04.2 LTS: 2.10694 (SE +/- 0.01729, N = 3; MIN: 2.03)
  Clear Linux 34630: 2.08771 (SE +/- 0.00152, N = 3; MIN: 2.03)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Sysbench 1.0.20 - Test: CPU (Events Per Second, More Is Better)
  Ubuntu 21.04: 213870.87 (SE +/- 275.74, N = 3)
  Ubuntu 20.04.2 LTS: 214228.62 (SE +/- 252.80, N = 3)
  Clear Linux 34630: 212303.58 (SE +/- 23.74, N = 3)
  Per-system flags (as exported): -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
  Ubuntu 21.04: 366.73 (SE +/- 0.08, N = 3; MIN: 365.95 / MAX: 373.2)
  Ubuntu 20.04.2 LTS: 367.80 (SE +/- 0.53, N = 3; MIN: 367 / MAX: 370.11)
  Clear Linux 34630: 364.99 (SE +/- 0.04, N = 3; MIN: 364.69 / MAX: 366.31)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  Ubuntu 21.04: 0.27128 (SE +/- 0.00049, N = 3)
  Ubuntu 20.04.2 LTS: 0.27083 (SE +/- 0.00044, N = 3)
  Clear Linux 34630: 0.27141 (SE +/- 0.00032, N = 3)
OSPray 1.8.5 - Demo: XFrog Forest - Renderer: Path Tracer (FPS, More Is Better)
  Ubuntu 21.04: 10.38 (SE +/- 0.04, N = 3; MIN: 7.58 / MAX: 10.75)
  Ubuntu 20.04.2 LTS: 10.38 (SE +/- 0.04, N = 3; MIN: 9.17 / MAX: 10.53)
  Clear Linux 34630: 10.38 (SE +/- 0.04, N = 3; MIN: 8.13 / MAX: 10.75)
OSPray 1.8.5 - Demo: XFrog Forest - Renderer: SciVis (FPS, More Is Better)
  Ubuntu 21.04: 18.87 (SE +/- 0.00, N = 3; MIN: 13.33 / MAX: 19.23)
  Ubuntu 20.04.2 LTS: 18.87 (SE +/- 0.00, N = 3; MIN: 12.82 / MAX: 19.23)
  Clear Linux 34630: 18.87 (SE +/- 0.00, N = 3; MIN: 13.16 / MAX: 19.61)
Zstd Compression 1.5.0 - Compression Level: 8 - Compression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 2096.7 (SE +/- 24.65, N = 3)
  Ubuntu 20.04.2 LTS: 2128.7 (SE +/- 25.03, N = 15)
  Clear Linux 34630: 2656.6 (SE +/- 58.63, N = 12)
  Per-system flags (as exported): -llzma -llzma -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -O3 -pthread -lz
Apache Cassandra 3.11.4 - Test: Writes (Op/s, More Is Better)
  Ubuntu 21.04: 106863 (SE +/- 2263.61, N = 15)
  Ubuntu 20.04.2 LTS: 103051 (SE +/- 1368.32, N = 15)
  Clear Linux 34630: 312015 (SE +/- 2265.50, N = 3)
TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
  Ubuntu 21.04: 443.55 (SE +/- 9.40, N = 15; MIN: 367.81 / MAX: 808.75)
  Ubuntu 20.04.2 LTS: 374.20 (SE +/- 0.44, N = 3; MIN: 371.48 / MAX: 419.18)
  Clear Linux 34630: 354.30 (SE +/- 0.15, N = 3; MIN: 353.49 / MAX: 357.29)
  Per-system flags (as exported): -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
PostgreSQL pgbench 13.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
  Ubuntu 21.04: 0.266 (SE +/- 0.003, N = 15)
  Ubuntu 20.04.2 LTS: 0.263 (SE +/- 0.003, N = 4)
  Clear Linux 34630: 0.254 (SE +/- 0.005, N = 15)
  Per-system flags (as exported): -O2 -O2 -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench 13.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Only (TPS, More Is Better)
  Ubuntu 21.04: 946634 (SE +/- 10755.01, N = 15)
  Ubuntu 20.04.2 LTS: 954214 (SE +/- 10643.51, N = 4)
  Clear Linux 34630: 990608 (SE +/- 19682.88, N = 15)
  Per-system flags (as exported): -O2 -O2 -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
KeyDB 6.0.16 (Ops/sec, More Is Better)
  Ubuntu 21.04: 524178.09 (SE +/- 4128.37, N = 3)
  Ubuntu 20.04.2 LTS: 421171.59 (SE +/- 18099.02, N = 15)
  Clear Linux 34630: 503991.58 (SE +/- 2367.62, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Tungsten Renderer 0.2.2 - Scene: Volumetric Caustic (Seconds, Fewer Is Better)
  Ubuntu 21.04: 13.45 (SE +/- 0.44, N = 15)
  Ubuntu 20.04.2 LTS: 13.87 (SE +/- 0.39, N = 15)
  Clear Linux 34630: 13.87 (SE +/- 0.48, N = 15)
  1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -ljpeg -lpthread -ldl
  Additional options, Ubuntu 21.04: -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz
  Additional options, Ubuntu 20.04.2 LTS: -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz
  Additional options, Clear Linux 34630: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
libavif avifenc 0.9.0 - Encoder Speed: 10, Lossless (Seconds, Fewer Is Better)
  Ubuntu 21.04: 8.522 (SE +/- 0.146, N = 15)
  Ubuntu 20.04.2 LTS: 8.967 (SE +/- 0.118, N = 12)
  Clear Linux 34630: 5.815 (SE +/- 0.013, N = 3)
  1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc 0.9.0 - Encoder Speed: 10 (Seconds, Fewer Is Better)
  Ubuntu 21.04: 5.050 (SE +/- 0.074, N = 15)
  Ubuntu 20.04.2 LTS: 5.361 (SE +/- 0.099, N = 15)
  Clear Linux 34630: 3.066 (SE +/- 0.030, N = 15)
  1. (CXX) g++ options: -O3 -fPIC -lm
asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
  Ubuntu 21.04: 172711481 (SE +/- 1805964.50, N = 3)
  Ubuntu 20.04.2 LTS: 171613950 (SE +/- 3374554.91, N = 12)
  Clear Linux 34630: 174596848 (SE +/- 2090545.83, N = 3)
Intel Open Image Denoise 1.2.0 - Scene: Memorial (Images / Sec, More Is Better)
  Ubuntu 21.04: 57.59 (SE +/- 2.57, N = 12)
  Ubuntu 20.04.2 LTS: 49.87 (SE +/- 2.40, N = 12)
  Clear Linux 34630: 59.98 (SE +/- 0.85, N = 3)
DaCapo Benchmark 9.12-MR1 - Java Test: Jython (msec, Fewer Is Better)
  Ubuntu 21.04: 5320 (SE +/- 153.92, N = 20)
  Ubuntu 20.04.2 LTS: 5518 (SE +/- 194.90, N = 20)
  Clear Linux 34630: 3590 (SE +/- 14.30, N = 4)
Java Gradle Build - Gradle Build: Reactor (Seconds, Fewer Is Better)
  Ubuntu 21.04: 366.36 (SE +/- 5.06, N = 9)
  Ubuntu 20.04.2 LTS: 382.99 (SE +/- 10.71, N = 9)
  Clear Linux 34630: 301.41 (SE +/- 8.63, N = 6)
LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
  Ubuntu 21.04: 23.87 (SE +/- 0.02, N = 3)
  Ubuntu 20.04.2 LTS: 22.08 (SE +/- 0.64, N = 12)
  Clear Linux 34630: 31.15 (SE +/- 0.47, N = 15)
  1. (CXX) g++ options: -O3 -pthread -lm
  Additional options, Clear Linux 34630: -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
Rodinia 3.1 - Test: OpenMP Leukocyte (Seconds, Fewer Is Better)
  Ubuntu 21.04: 62.04 (SE +/- 0.67, N = 3)
  Ubuntu 20.04.2 LTS: 60.93 (SE +/- 1.30, N = 15)
  Clear Linux 34630: 40.82 (SE +/- 0.36, N = 15)
  1. (CXX) g++ options: -O2 -lOpenCL
NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
  Ubuntu 21.04: 6259.88 (SE +/- 51.84, N = 15)
  Ubuntu 20.04.2 LTS: 6011.42 (SE +/- 81.23, N = 15)
  Clear Linux 34630: 5425.40 (SE +/- 165.45, N = 12)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Ubuntu 21.04: Open MPI 4.1.0
  3. Ubuntu 20.04.2 LTS: Open MPI 4.0.3
  4. Clear Linux 34630: 3.2
  Additional per-system options as exported: -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake
Phoronix Test Suite v10.8.4