Ice Lake Ubuntu 20.04.2 / 20.10 / 21.04 Linux Benchmarks
Benchmarks for a future article. 2 x Intel Xeon Platinum 8380 testing with an Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED graphics on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2105188-IB-2105170IB02&grr&rdt
System configuration: Ubuntu 21.04
- Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)
- Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)
- Chipset: Intel Device 0998
- Memory: 504GB
- Disk: 800GB INTEL SSDPF21Q800GB
- Graphics: ASPEED
- Monitor: VE228
- Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP
- OS: Ubuntu 21.04
- Kernel: 5.11.0-17-generic (x86_64)
- Desktop: GNOME Shell 3.38.4
- Display Server: X Server
- Compiler: GCC 10.3.0
- File-System: ext4
- Screen Resolution: 1920x1080

System configuration: Ubuntu 20.10 (only the fields that differ from the previous column are listed)
- OS: Ubuntu 20.10
- Kernel: 5.8.0-53-generic (x86_64)
- Desktop: GNOME Shell 3.38.1
- Display Server: X Server 1.20.9
- Compiler: GCC 10.2.0

System configuration: Ubuntu 20.04.2 LTS (only the fields that differ from the previous column are listed)
- OS: Ubuntu 20.04
- Desktop: GNOME Shell 3.36.7
- Compiler: GCC 9.3.0

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- Ubuntu 21.04: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Ubuntu 20.10: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Ubuntu 20.04.2 LTS: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Disk Details
- NONE / errors=remount-ro,relatime,rw / Block Size: 4096

Processor Details
- Scaling Governor: intel_pstate powersave
- CPU Microcode: 0xd000270

Java Details
- Ubuntu 21.04: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2)
- Ubuntu 20.10: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10)
- Ubuntu 20.04.2 LTS: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04)

Python Details
- Ubuntu 21.04: Python 3.9.4
- Ubuntu 20.10: Python 3.8.6
- Ubuntu 20.04.2 LTS: Python 3.8.5

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected
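The Kernel Details and Processor Details entries above (Transparent Huge Pages: madvise, Scaling Governor: powersave) correspond to values that Linux normally exposes through sysfs. The following is a minimal Python sketch for checking the same two settings on another machine before comparing against these results. It assumes the usual sysfs paths are present, and the function names are illustrative only; this is not how the Phoronix Test Suite itself collects the data.

```python
from pathlib import Path
from typing import Optional

# Usual Linux sysfs locations; they may be absent in containers or on other kernels.
THP_PATH = Path("/sys/kernel/mm/transparent_hugepage/enabled")
GOVERNOR_PATH = Path("/sys/devices/system/cpu/cpu0/cpufreq/scaling_governor")


def parse_bracketed(text: str) -> Optional[str]:
    """Return the option wrapped in [brackets], e.g. 'always [madvise] never' -> 'madvise'."""
    for token in text.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    return None


def read_text(path: Path) -> Optional[str]:
    """Read a sysfs file, returning None when it is missing or unreadable."""
    try:
        return path.read_text().strip()
    except OSError:
        return None


def current_settings() -> dict:
    """Collect the active THP mode and the cpu0 scaling governor, if available."""
    thp_raw = read_text(THP_PATH)
    return {
        "transparent_hugepage": parse_bracketed(thp_raw) if thp_raw else None,
        "scaling_governor": read_text(GOVERNOR_PATH),
    }


if __name__ == "__main__":
    for name, value in current_settings().items():
        print(f"{name}: {value}")
```

The governor file reports only the governor name (for example powersave); the driver name (intel_pstate in these results) lives in the neighbouring scaling_driver file.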
Ice Lake Ubuntu 20.04.2 / 20.10 / 21.04 Linux Benchmarks wrf: conus 2.5km java-gradle-perf: Reactor openvkl: vklBenchmarkUnstructuredVolume blogbench: Read openvkl: vklBenchmark wireguard: nwchem: C240 Buckyball onnx: super-resolution-10 - OpenMP CPU yafaray: Total Time For Sample Scene relion: Basic - CPU cassandra: Writes incompact3d: X3D-benchmarking input.i3d tensorflow-lite: Mobilenet Quant tensorflow-lite: Mobilenet Float node-web-tooling: asmfish: 1024 Hash Memory, 26 Depth securemark: SecureMark-TLS onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onnx: bertsquad-10 - OpenMP CPU lammps: 20k Atoms keydb: tensorflow-lite: NASNet Mobile onednn: Recurrent Neural Network Training - f32 - CPU build-llvm: Unix Makefiles pjsip: INVITE aom-av1: Speed 6 Two-Pass - Bosphorus 4K tensorflow-lite: SqueezeNet helsing: 14 digit hpcg: libgav1: Summer Nature 4K compress-zstd: 3, Long Mode - Decompression Speed build-erlang: Time To Compile onednn: Recurrent Neural Network Inference - u8s8f32 - CPU mrbayes: Primate Phylogeny Analysis openfoam: Motorbike 60M compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed cpuminer-opt: Blake-2 S rodinia: OpenMP Leukocyte compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed cpuminer-opt: Skeincoin cpuminer-opt: Quad SHA-256, Pyrite compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed cpuminer-opt: Deepcoin build-llvm: Ninja compress-zstd: 3, Long Mode - Compression Speed tnn: CPU - MobileNet v2 cpuminer-opt: Garlicoin tensorflow-lite: Inception ResNet V2 onnx: fcn-resnet101-11 - OpenMP CPU onnx: yolov4 - OpenMP CPU build-wasmer: Time To Compile onnx: shufflenet-v2-10 - OpenMP CPU blender: Barbershop - CPU-Only build-linux-kernel: Time To Compile svt-av1: Preset 8 - Bosphorus 4K 
onednn: Deconvolution Batch shapes_1d - f32 - CPU build-nodejs: Time To Compile rodinia: OpenMP HotSpot3D cpuminer-opt: Magi cpuminer-opt: Triple SHA-256, Onecoin sysbench: CPU blender: Pabellon Barcelona - CPU-Only kvazaar: Bosphorus 4K - Medium aom-av1: Speed 6 Realtime - Bosphorus 4K pgbench: 100 - 250 - Read Only - Average Latency pgbench: 100 - 250 - Read Only compress-zstd: 3 - Decompression Speed build-godot: Time To Compile cpuminer-opt: x25x onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU tensorflow-lite: Inception V4 blender: Classroom - CPU-Only compress-zstd: 3 - Compression Speed build-php: Time To Compile tungsten: Volumetric Caustic build2: Time To Compile openvkl: vklBenchmarkVdbVolume pjsip: OPTIONS, Stateful john-the-ripper: MD5 stockfish: Total Time dacapobench: H2 compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed pjsip: OPTIONS, Stateless dacapobench: Jython neat: blender: Fishy Cat - CPU-Only x265: Bosphorus 4K ospray: San Miguel - Path Tracer kvazaar: Bosphorus 4K - Very Fast swet: Average tungsten: Water Caustic srsran: OFDM_Test chia-vdf: Square Assembly Optimized srslte: OFDM_Test rodinia: OpenMP LavaMD sysbench: RAM / Memory coremark: CoreMark Size 666 - Iterations Per Second compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed aom-av1: Speed 8 Realtime - Bosphorus 4K srslte: PHY_DL_Test srslte: PHY_DL_Test avifenc: 6, Lossless avifenc: 6 srsran: PHY_DL_Test srsran: PHY_DL_Test dacapobench: Tradebeans build-apache: Time To Compile chia-vdf: Square Plain C++ onednn: IP Shapes 1D - u8s8f32 - CPU aom-av1: Speed 9 Realtime - Bosphorus 4K pgbench: 100 - 250 - Read Write - Average Latency pgbench: 100 - 250 - Read Write namd: ATPase Simulation - 327,506 Atoms avifenc: 10, Lossless onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU gromacs: MPI CPU - water_GMX50_bare blender: BMW27 - 
CPU-Only onednn: IP Shapes 1D - bf16bf16bf16 - CPU john-the-ripper: Blowfish aircrack-ng: onednn: IP Shapes 3D - u8s8f32 - CPU phpbench: PHP Benchmark Suite ospray: San Miguel - SciVis avifenc: 10 onednn: IP Shapes 3D - bf16bf16bf16 - CPU openfoam: Motorbike 30M kvazaar: Bosphorus 4K - Ultra Fast tungsten: Hair x265: Bosphorus 1080p tnn: CPU - SqueezeNet v1.1 ospray: XFrog Forest - Path Tracer kvazaar: Bosphorus 1080p - Medium onednn: IP Shapes 1D - f32 - CPU pybench: Total For Average Test Times svt-hevc: 1 - Bosphorus 1080p liquid-dsp: 64 - 256 - 57 onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU liquid-dsp: 160 - 256 - 57 liquid-dsp: 128 - 256 - 57 liquid-dsp: 32 - 256 - 57 liquid-dsp: 16 - 256 - 57 rodinia: OpenMP Streamcluster oidn: Memorial embree: Pathtracer - Asian Dragon ospray: XFrog Forest - SciVis povray: Trace Time onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p svt-vp9: VMAF Optimized - Bosphorus 1080p onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU ospray: Magnetic Reconnection - SciVis toybrot: C++ Tasks embree: Pathtracer ISPC - Asian Dragon incompact3d: input.i3d 193 Cells Per Direction onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU kvazaar: Bosphorus 1080p - Very Fast npb: EP.C npb: LU.C embree: Pathtracer - Crown build-mplayer: Time To Compile embree: Pathtracer ISPC - Crown ospray: NASA Streamlines - Path Tracer onednn: IP Shapes 3D - f32 - CPU toybrot: C++ Threads toybrot: OpenMP svt-vp9: Visual Quality Optimized - Bosphorus 1080p lammps: Rhodopsin Protein onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU svt-hevc: 10 - Bosphorus 1080p svt-hevc: 7 - Bosphorus 1080p tungsten: Non-Exponential ospray: NASA Streamlines - SciVis 
blogbench: Write Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9903.05 366.363 1776457 2253810 669 681.119 1875.8 6967 86.283 348.838 106863 291.993968 43074.5 41555.1 10.79 172711481 230272 684.271 671.541 500 35.725 524178.09 81319.3 672.646 198.538 2531 4.05 47808.3 82.465 39.8231 19.39 180.485 439.677 170.019 105.35 2719.4 47.2 2708.5 46.0 1395127 62.044 3073.3 2096.7 277683 317291 2615.9 82.2 29191 129.152 266.9 443.547 29473 569303 193 479 44.607 8217 108.31 24.813 29.809 28.7517 102.136 105.650 2735.74 422680 213870.87 88.36 6.90 7.36 0.266 946634 2985.6 75.951 2221.97 437.972 437.866 687019 72.28 6148.4 40.207 13.4481 70.057 21915837 3815 10077667 180493580 10740 2727.6 82.3 40082 5320 45.569 45.72 12.92 10.42 13.38 639155887 31.2739 120766667 147240 121500000 40.406 12176.80 2347453.567360 3283.9 300.2 16.97 86.6 210.9 36.764 15.383 85.1 206.8 16729 35.187 138967 1.25869 23.59 8.739 28712 0.27128 8.522 0.231483 9.004 29.61 3.03516 118366 211170.458 0.445171 715587 90.91 5.050 1.82051 15.05 22.78 6.52538 28.33 366.729 10.38 25.42 0.945450 995 29.57 3044966667 3.27051 0.362841 3080633333 3290766667 1631166667 826756667 7.696 57.59 81.2911 18.87 9.476 2.10597 204.86 210.15 0.193198 0.256555 109.63 8015 106.6858 11.3305505 0.615331 49.20 6259.88 187807.08 64.3015 10.109 67.0395 27.78 1.37903 7118 7438 170.47 23.866 1.41084 0.924756 0.846330 260.88 172.02 5.23912 125 60768 9897.119 401.780 1493294 2245366 604 657.468 1873.7 5673 83.265 348.747 104506 299.195353 58977.6 55489.8 10.37 167218420 226588 730.825 754.982 435 35.506 355593.18 157434 732.235 226.852 1866 3.89 66836.4 83.525 39.6266 19.20 3131.9 183.275 506.972 171.388 104.92 2736.3 45.1 2685.6 41.0 1360453 85.284 3022.1 2004.3 268467 322628 2607.2 81.8 28568 144.787 265.2 606.052 26183 725870 161 356 68.800 7470 109.65 26.529 27.160 35.4561 108.602 104.849 2731.13 402686 213970.43 89.25 6.93 7.34 0.271 927066 2969.1 76.879 2253.80 472.245 468.050 892083 72.53 4516.9 44.011 13.7613 74.317 22032745 3716 
9267667 177624409 11377 2713.8 81.7 40166 5614 55.921 46.91 12.66 10.42 13.20 627367473 31.1264 121300000 147100 120333333 39.838 12184.78 2316974.822459 3235.7 297.4 17.02 84.4 207.6 38.676 15.356 85.3 205.4 17609 37.074 139033 1.40323 23.69 12.100 20742 0.28452 9.451 0.247260 8.851 29.56 3.30961 111351 211559.000 0.471052 717063 90.91 5.855 1.96166 15.01 22.74 6.75881 28.26 366.201 10.31 25.26 1.018895 984 28.63 2157125000 3.50505 0.386794 3086966667 3292133333 1569933333 812776667 8.563 50.81 80.2191 18.87 9.365 2.27008 198.24 201.11 0.206919 0.270802 111.11 7963 101.8558 11.8795649 0.659638 48.57 5881.04 188751.17 64.4236 11.098 66.1552 27.78 1.54442 7165 7806 164.77 21.646 1.57954 1.03102 0.902751 237.32 161.78 5.25085 125 62517 13913.625 382.988 1737714 2224145 661 674.971 1825 6677 76.817 348.604 103051 319.487874 46915.5 45998.8 10.94 171613950 225195 667.646 678.041 485 36.300 421171.59 90927.9 671.557 216.231 1840 3.89 48816.5 82.638 39.9669 19.31 3131.1 175.896 433.741 178.773 104.77 2782.7 43.6 2695.5 44.8 1414359 60.933 2954.1 2128.7 270519 333611 2576.5 83.0 28282 138.523 259.1 374.203 26198 576988 193 509 67.845 8389 109.07 25.960 28.139 32.6500 104.250 104.315 2650.50 405578 214228.62 88.16 6.93 7.41 0.263 954214 2947.3 74.310 2440.72 433.426 431.304 716517 71.92 4620.1 40.723 13.8745 66.330 22077090 3597 10277000 173250319 11235 2740.7 84.0 40954 5518 46.11 12.58 10.42 13.33 579234165 32.4833 121200000 143600 120733333 39.522 12252.73 2261054.029261 3251.5 287.2 16.75 88.6 212.7 38.524 15.501 85.7 203.3 18060 35.987 139400 1.30999 22.82 11.858 21161 0.27083 8.967 0.218363 8.884 29.54 3.01186 116803 0.432599 726623 90.91 5.361 1.80328 14.96 23.16 6.45646 28.50 367.799 10.38 25.66 0.942653 982 29.14 2208933333 3.47588 0.353757 3106666667 3316633333 1653866667 832983333 8.012 49.87 82.6069 18.87 9.451 2.10694 196.97 190.48 0.190921 0.249170 110.19 7871 106.1435 11.7113234 0.602936 49.09 6011.42 189606.14 65.0249 10.618 66.8866 27.78 1.37432 7173 7328 
159.10 22.084 1.39626 0.903312 0.850400 234.92 157.78 5.16631 125 63062 OpenBenchmarking.org
WRF 4.2.2 - Input: conus 2.5km (Seconds, Fewer Is Better)
  Ubuntu 21.04: 9903.05
  Ubuntu 20.10: 9897.12
  Ubuntu 20.04.2 LTS: 13913.63
  Per-configuration options as exported (the export does not label which configuration each set belongs to):
  - -fallow-invalid-boz -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  - -fallow-invalid-boz -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
  1. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
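The three WRF times above can be put on a common scale by dividing against a baseline. The sketch below is a minimal illustration using only the WRF figures from this result file; the helper name `relative_to_baseline` and the choice of Ubuntu 20.04.2 LTS as baseline are assumptions made for the example, not something the export prescribes. For "Fewer Is Better" metrics the ratio is inverted so that values above 1.0 always mean faster than the baseline.

```python
def relative_to_baseline(value: float, baseline: float, fewer_is_better: bool) -> float:
    """Return a speed ratio where values above 1.0 mean `value` beats `baseline`."""
    if value <= 0 or baseline <= 0:
        raise ValueError("results must be positive")
    return baseline / value if fewer_is_better else value / baseline


# WRF 4.2.2, conus 2.5km, in seconds (Fewer Is Better), taken from the result above.
wrf_seconds = {
    "Ubuntu 21.04": 9903.05,
    "Ubuntu 20.10": 9897.12,
    "Ubuntu 20.04.2 LTS": 13913.63,
}

baseline = wrf_seconds["Ubuntu 20.04.2 LTS"]  # baseline choice is arbitrary here
wrf_ratios = {
    name: relative_to_baseline(seconds, baseline, fewer_is_better=True)
    for name, seconds in wrf_seconds.items()
}

for name, ratio in wrf_ratios.items():
    print(f"{name}: {ratio:.3f}x the baseline speed")
```

With these inputs both newer releases come out at roughly 1.4 times the 20.04.2 LTS speed on this test. WRF reports no standard error here, so a single-run ratio like this should be read with some caution.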
Java Gradle Build - Gradle Build: Reactor (Seconds, Fewer Is Better)
  Ubuntu 21.04: 366.36 (SE +/- 5.06, N = 9)
  Ubuntu 20.10: 401.78 (SE +/- 11.03, N = 9)
  Ubuntu 20.04.2 LTS: 382.99 (SE +/- 10.71, N = 9)
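Each result in this file is reported together with "SE +/-" and "N" values. Assuming SE is the conventional standard error of the mean over N runs (sample standard deviation divided by the square root of N), it could be reproduced from raw run times as sketched below. The run times in the example are hypothetical, invented for illustration, since the export does not include the individual runs.

```python
import math
import statistics
from typing import Sequence


def standard_error(samples: Sequence[float]) -> float:
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical run times in seconds, invented for illustration only.
hypothetical_runs = [360.2, 371.5, 349.8, 388.1, 365.0, 352.7, 379.4, 361.9, 368.6]

mean = statistics.mean(hypothetical_runs)
se = standard_error(hypothetical_runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(hypothetical_runs)})")
```

Because the standard error shrinks with the square root of N, results with N = 12 or N = 15 in this file are typically ones where the run-to-run variation was large enough to warrant extra runs.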
OpenVKL 0.9 - Benchmark: vklBenchmarkUnstructuredVolume (Items / Sec, More Is Better)
  Ubuntu 21.04: 1776457 (SE +/- 7183.20, N = 3; MIN: 24424 / MAX: 5906521)
  Ubuntu 20.10: 1493294 (SE +/- 5996.30, N = 3; MIN: 18361 / MAX: 5191710)
  Ubuntu 20.04.2 LTS: 1737714 (SE +/- 17974.50, N = 3; MIN: 21419 / MAX: 5849413)
BlogBench 1.1 - Test: Read (Final Score, More Is Better)
  Ubuntu 21.04: 2253810 (SE +/- 30037.66, N = 3)
  Ubuntu 20.10: 2245366 (SE +/- 19991.91, N = 3)
  Ubuntu 20.04.2 LTS: 2224145 (SE +/- 21799.22, N = 9)
  1. (CC) gcc options: -O2 -pthread
OpenVKL 0.9 - Benchmark: vklBenchmark (Items / Sec, More Is Better)
  Ubuntu 21.04: 669 (SE +/- 2.40, N = 3; MIN: 1 / MAX: 2821)
  Ubuntu 20.10: 604 (SE +/- 7.42, N = 3; MIN: 1 / MAX: 2606)
  Ubuntu 20.04.2 LTS: 661 (SE +/- 5.81, N = 3; MIN: 1 / MAX: 2845)
WireGuard + Linux Networking Stack Stress Test (Seconds, Fewer Is Better)
  Ubuntu 21.04: 681.12 (SE +/- 8.55, N = 3)
  Ubuntu 20.10: 657.47 (SE +/- 7.37, N = 3)
  Ubuntu 20.04.2 LTS: 674.97 (SE +/- 7.42, N = 3)
NWChem 7.0.2 - Input: C240 Buckyball (Seconds, Fewer Is Better)
  Ubuntu 21.04: 1875.8
  Ubuntu 20.10: 1873.7
  Ubuntu 20.04.2 LTS: 1825.0
  Per-configuration options as exported (the export does not label which configuration each set belongs to):
  - -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lz
  - -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lz
  1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lm -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Ubuntu 21.04: 6967 (SE +/- 204.88, N = 12)
  Ubuntu 20.10: 5673 (SE +/- 158.31, N = 12)
  Ubuntu 20.04.2 LTS: 6677 (SE +/- 227.25, N = 12)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
YafaRay 3.4.1 - Total Time For Sample Scene (Seconds, Fewer Is Better)
  Ubuntu 21.04: 86.28 (SE +/- 2.26, N = 15)
  Ubuntu 20.10: 83.27 (SE +/- 2.54, N = 12)
  Ubuntu 20.04.2 LTS: 76.82 (SE +/- 1.59, N = 15)
  1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
RELION 3.1.1 - Test: Basic - Device: CPU (Seconds, Fewer Is Better)
  Ubuntu 21.04: 348.84 (SE +/- 1.61, N = 3)
  Ubuntu 20.10: 348.75 (SE +/- 0.44, N = 3)
  Ubuntu 20.04.2 LTS: 348.60 (SE +/- 0.74, N = 3)
  1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
Apache Cassandra 3.11.4 - Test: Writes (Op/s, More Is Better)
  Ubuntu 21.04: 106863 (SE +/- 2263.61, N = 15)
  Ubuntu 20.10: 104506 (SE +/- 1429.17, N = 3)
  Ubuntu 20.04.2 LTS: 103051 (SE +/- 1368.32, N = 15)
Xcompact3d Incompact3d 2021-03-11 - Input: X3D-benchmarking input.i3d (Seconds, Fewer Is Better)
  Ubuntu 21.04: 291.99 (SE +/- 1.18, N = 3)
  Ubuntu 20.10: 299.20 (SE +/- 1.73, N = 3)
  Ubuntu 20.04.2 LTS: 319.49 (SE +/- 3.10, N = 3)
  Per-configuration options as exported (the export does not label which configuration each set belongs to):
  - -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  - -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
  1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
TensorFlow Lite 2020-08-23 - Model: Mobilenet Quant (Microseconds, Fewer Is Better)
  Ubuntu 21.04: 43074.5 (SE +/- 601.06, N = 15)
  Ubuntu 20.10: 58977.6 (SE +/- 573.26, N = 15)
  Ubuntu 20.04.2 LTS: 46915.5 (SE +/- 706.39, N = 15)
TensorFlow Lite 2020-08-23 - Model: Mobilenet Float (Microseconds, Fewer Is Better)
  Ubuntu 21.04: 41555.1 (SE +/- 612.15, N = 15)
  Ubuntu 20.10: 55489.8 (SE +/- 733.25, N = 15)
  Ubuntu 20.04.2 LTS: 45998.8 (SE +/- 630.61, N = 15)
Node.js V8 Web Tooling Benchmark (runs/s, More Is Better)
  Ubuntu 21.04: 10.79 (SE +/- 0.07, N = 3)
  Ubuntu 20.10: 10.37 (SE +/- 0.13, N = 15)
  Ubuntu 20.04.2 LTS: 10.94 (SE +/- 0.12, N = 15)
  1. Ubuntu 21.04: Nodejs v12.21.0
  2. Ubuntu 20.10: Nodejs v12.18.2
  3. Ubuntu 20.04.2 LTS: Nodejs v10.19.0
asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
  Ubuntu 21.04: 172711481 (SE +/- 1805964.50, N = 3)
  Ubuntu 20.10: 167218420 (SE +/- 748824.48, N = 3)
  Ubuntu 20.04.2 LTS: 171613950 (SE +/- 3374554.91, N = 12)
SecureMark 1.0.4 - Benchmark: SecureMark-TLS (marks, More Is Better)
  Ubuntu 21.04: 230272 (SE +/- 178.37, N = 3)
  Ubuntu 20.10: 226588 (SE +/- 530.29, N = 3)
  Ubuntu 20.04.2 LTS: 225195 (SE +/- 124.15, N = 3)
  1. (CC) gcc options: -pedantic -O3
oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 684.27 (SE +/- 5.64, N = 15; MIN: 645.4)
  Ubuntu 20.10: 730.83 (SE +/- 6.32, N = 15; MIN: 671.61)
  Ubuntu 20.04.2 LTS: 667.65 (SE +/- 1.50, N = 3; MIN: 642.94)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 671.54 (SE +/- 0.16, N = 3; MIN: 647.67)
  Ubuntu 20.10: 754.98 (SE +/- 19.97, N = 12; MIN: 673.86)
  Ubuntu 20.04.2 LTS: 678.04 (SE +/- 4.59, N = 15; MIN: 638.98)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
ONNX Runtime 1.6 - Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Ubuntu 21.04: 500 (SE +/- 6.29, N = 12)
  Ubuntu 20.10: 435 (SE +/- 4.21, N = 3)
  Ubuntu 20.04.2 LTS: 485 (SE +/- 3.18, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: 20k Atoms (ns/day, More Is Better)
  Ubuntu 21.04: 35.73 (SE +/- 0.03, N = 3)
  Ubuntu 20.10: 35.51 (SE +/- 0.19, N = 3)
  Ubuntu 20.04.2 LTS: 36.30 (SE +/- 0.04, N = 3)
  1. (CXX) g++ options: -O3 -pthread -lm
KeyDB 6.0.16 (Ops/sec, More Is Better)
  Ubuntu 21.04: 524178.09 (SE +/- 4128.37, N = 3)
  Ubuntu 20.10: 355593.18 (SE +/- 13586.63, N = 12)
  Ubuntu 20.04.2 LTS: 421171.59 (SE +/- 18099.02, N = 15)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
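When two configurations differ, as in the KeyDB result above, a rough way to judge whether the gap exceeds run-to-run noise is to express the difference in units of the combined standard error. The sketch below is a heuristic under stated assumptions, not a formal significance test and not something the Phoronix Test Suite reports: it treats the two means as independent and ignores the differing N values. The function name `difference_in_se_units` is illustrative.

```python
import math


def difference_in_se_units(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Absolute difference of two means divided by their combined standard error."""
    combined = math.hypot(se_a, se_b)  # sqrt(se_a**2 + se_b**2)
    if combined == 0:
        raise ValueError("combined standard error is zero")
    return abs(mean_a - mean_b) / combined


# KeyDB 6.0.16, Ops/sec, values taken from the result above.
keydb_2104_vs_2010 = difference_in_se_units(524178.09, 4128.37, 355593.18, 13586.63)
keydb_2004_vs_2010 = difference_in_se_units(421171.59, 18099.02, 355593.18, 13586.63)

print(f"21.04 vs 20.10: {keydb_2104_vs_2010:.1f} combined SEs apart")
print(f"20.04.2 vs 20.10: {keydb_2004_vs_2010:.1f} combined SEs apart")
```

A gap of many combined standard errors suggests a real difference between releases, while a gap of around one or two is hard to distinguish from noise.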
TensorFlow Lite 2020-08-23 - Model: NASNet Mobile (Microseconds, Fewer Is Better)
  Ubuntu 21.04: 81319.3 (SE +/- 768.19, N = 15)
  Ubuntu 20.10: 157434.0 (SE +/- 5683.05, N = 15)
  Ubuntu 20.04.2 LTS: 90927.9 (SE +/- 1128.79, N = 4)
oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 672.65 (SE +/- 1.73, N = 3; MIN: 646.15)
  Ubuntu 20.10: 732.24 (SE +/- 5.22, N = 15; MIN: 674.53)
  Ubuntu 20.04.2 LTS: 671.56 (SE +/- 5.68, N = 8; MIN: 639.68)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed LLVM Compilation 12.0 - Build System: Unix Makefiles (Seconds, Fewer Is Better)
  Ubuntu 21.04: 198.54 (SE +/- 1.40, N = 3)
  Ubuntu 20.10: 226.85 (SE +/- 0.98, N = 3)
  Ubuntu 20.04.2 LTS: 216.23 (SE +/- 2.23, N = 3)
PJSIP 2.11 - Method: INVITE (Responses Per Second, More Is Better)
  Ubuntu 21.04: 2531 (SE +/- 22.31, N = 15)
  Ubuntu 20.10: 1866 (SE +/- 14.40, N = 10)
  Ubuntu 20.04.2 LTS: 1840 (SE +/- 14.31, N = 3)
  1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
AOM AV1 3.1 - Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better)
  Ubuntu 21.04: 4.05 (SE +/- 0.01, N = 3)
  Ubuntu 20.10: 3.89 (SE +/- 0.03, N = 3)
  Ubuntu 20.04.2 LTS: 3.89 (SE +/- 0.03, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
TensorFlow Lite 2020-08-23 - Model: SqueezeNet (Microseconds, Fewer Is Better)
  Ubuntu 21.04: 47808.3 (SE +/- 35.00, N = 3)
  Ubuntu 20.10: 66836.4 (SE +/- 1725.85, N = 12)
  Ubuntu 20.04.2 LTS: 48816.5 (SE +/- 510.23, N = 15)
Helsing 1.0-beta - Digit Range: 14 digit (Seconds, Fewer Is Better)
  Ubuntu 21.04: 82.47 (SE +/- 0.73, N = 15)
  Ubuntu 20.10: 83.53 (SE +/- 0.21, N = 3)
  Ubuntu 20.04.2 LTS: 82.64 (SE +/- 0.11, N = 3)
  1. (CC) gcc options: -O2 -pthread
High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
  Ubuntu 21.04: 39.82 (SE +/- 0.03, N = 3)
  Ubuntu 20.10: 39.63 (SE +/- 0.15, N = 3)
  Ubuntu 20.04.2 LTS: 39.97 (SE +/- 0.19, N = 3)
  1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
libgav1 0.16.3 - Video Input: Summer Nature 4K (FPS, More Is Better)
  Ubuntu 21.04: 19.39 (SE +/- 0.15, N = 3)
  Ubuntu 20.10: 19.20 (SE +/- 0.05, N = 3)
  Ubuntu 20.04.2 LTS: 19.31 (SE +/- 0.07, N = 3)
  1. (CXX) g++ options: -O3 -lpthread -lrt
Zstd Compression 1.5.0 - Compression Level: 3, Long Mode - Decompression Speed (MB/s, More Is Better)
  Ubuntu 20.10: 3131.9 (SE +/- 31.17, N = 12)
  Ubuntu 20.04.2 LTS: 3131.1 (SE +/- 14.00, N = 14)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Timed Erlang/OTP Compilation 23.2 - Time To Compile (Seconds, Fewer Is Better)
  Ubuntu 21.04: 180.49 (SE +/- 0.85, N = 3)
  Ubuntu 20.10: 183.28 (SE +/- 1.20, N = 3)
  Ubuntu 20.04.2 LTS: 175.90 (SE +/- 1.32, N = 3)
oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 439.68 (SE +/- 0.53, N = 3; MIN: 424.75)
  Ubuntu 20.10: 506.97 (SE +/- 33.83, N = 15; MIN: 435.95)
  Ubuntu 20.04.2 LTS: 433.74 (SE +/- 1.79, N = 3; MIN: 416.67)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis (Seconds, Fewer Is Better)
  Ubuntu 21.04: 170.02 (SE +/- 0.83, N = 3)
  Ubuntu 20.10: 171.39 (SE +/- 0.41, N = 3)
  Ubuntu 20.04.2 LTS: 178.77 (SE +/- 0.79, N = 3)
  1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
OpenFOAM 8 - Input: Motorbike 60M (Seconds, Fewer Is Better)
  Ubuntu 21.04: 105.35 (SE +/- 0.46, N = 3)
  Ubuntu 20.10: 104.92 (SE +/- 0.07, N = 3)
  Ubuntu 20.04.2 LTS: 104.77 (SE +/- 0.16, N = 3)
  1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 2719.4 (SE +/- 1.06, N = 3)
  Ubuntu 20.10: 2736.3 (SE +/- 5.79, N = 15)
  Ubuntu 20.04.2 LTS: 2782.7 (SE +/- 5.43, N = 15)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 47.2 (SE +/- 0.33, N = 3)
  Ubuntu 20.10: 45.1 (SE +/- 0.39, N = 15)
  Ubuntu 20.04.2 LTS: 43.6 (SE +/- 0.39, N = 15)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0 - Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 2708.5 (SE +/- 5.02, N = 15)
  Ubuntu 20.10: 2685.6 (SE +/- 3.89, N = 3)
  Ubuntu 20.04.2 LTS: 2695.5 (SE +/- 5.48, N = 15)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0 - Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 46.0 (SE +/- 0.54, N = 15)
  Ubuntu 20.10: 41.0 (SE +/- 0.27, N = 3)
  Ubuntu 20.04.2 LTS: 44.8 (SE +/- 0.61, N = 15)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Cpuminer-Opt 3.15.5 - Algorithm: Blake-2 S (kH/s, More Is Better)
  Ubuntu 21.04: 1395127 (SE +/- 21571.08, N = 15)
  Ubuntu 20.10: 1360453 (SE +/- 14563.41, N = 15)
  Ubuntu 20.04.2 LTS: 1414359 (SE +/- 29885.49, N = 15)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Rodinia 3.1 - Test: OpenMP Leukocyte (Seconds, Fewer Is Better)
  Ubuntu 21.04: 62.04 (SE +/- 0.67, N = 3)
  Ubuntu 20.10: 85.28 (SE +/- 0.59, N = 3)
  Ubuntu 20.04.2 LTS: 60.93 (SE +/- 1.30, N = 15)
  1. (CXX) g++ options: -O2 -lOpenCL
Zstd Compression 1.5.0 - Compression Level: 8 - Decompression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 3073.3 (SE +/- 2.77, N = 3)
  Ubuntu 20.10: 3022.1 (SE +/- 6.44, N = 13)
  Ubuntu 20.04.2 LTS: 2954.1 (SE +/- 65.22, N = 15)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0 - Compression Level: 8 - Compression Speed (MB/s, More Is Better)
  Ubuntu 21.04: 2096.7 (SE +/- 24.65, N = 3)
  Ubuntu 20.10: 2004.3 (SE +/- 42.14, N = 15)
  Ubuntu 20.04.2 LTS: 2128.7 (SE +/- 25.03, N = 15)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Cpuminer-Opt 3.15.5 - Algorithm: Skeincoin (kH/s, More Is Better)
  Ubuntu 21.04: 277683 (SE +/- 2421.06, N = 15)
  Ubuntu 20.10: 268467 (SE +/- 3496.83, N = 15)
  Ubuntu 20.04.2 LTS: 270519 (SE +/- 2324.26, N = 13)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Quad SHA-256, Pyrite Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 70K 140K 210K 280K 350K SE +/- 3914.38, N = 15 SE +/- 9923.95, N = 13 SE +/- 5088.38, N = 15 317291 322628 333611 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 3.01, N = 15 SE +/- 9.81, N = 3 SE +/- 6.16, N = 8 2615.9 2607.2 2576.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 1.03, N = 15 SE +/- 0.46, N = 3 SE +/- 0.70, N = 8 82.2 81.8 83.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6K 12K 18K 24K 30K SE +/- 326.37, N = 15 SE +/- 615.22, N = 12 SE +/- 453.46, N = 13 29191 28568 28282 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Ninja Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30 60 90 120 150 SE +/- 0.36, N = 3 SE +/- 1.55, N = 3 SE +/- 0.38, N = 3 129.15 144.79 138.52
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 60 120 180 240 300 SE +/- 1.99, N = 3 SE +/- 8.38, N = 12 SE +/- 4.39, N = 14 266.9 265.2 259.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 130 260 390 520 650 SE +/- 9.40, N = 15 SE +/- 12.21, N = 15 SE +/- 0.44, N = 3 443.55 606.05 374.20 MIN: 367.81 / MAX: 808.75 MIN: 372.96 / MAX: 843.9 MIN: 371.48 / MAX: 419.18 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6K 12K 18K 24K 30K SE +/- 1184.13, N = 12 SE +/- 1085.72, N = 12 SE +/- 855.76, N = 12 29473 26183 26198 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 160K 320K 480K 640K 800K SE +/- 790.57, N = 3 SE +/- 13732.96, N = 12 SE +/- 5213.69, N = 3 569303 725870 576988
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40 80 120 160 200 SE +/- 1.20, N = 3 SE +/- 1.04, N = 3 SE +/- 0.44, N = 3 193 161 193 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 110 220 330 440 550 SE +/- 1.17, N = 3 SE +/- 1.67, N = 3 SE +/- 5.01, N = 3 479 356 509 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Timed Wasmer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 15 30 45 60 75 SE +/- 0.37, N = 3 SE +/- 0.19, N = 3 SE +/- 0.51, N = 11 44.61 68.80 67.85 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2K 4K 6K 8K 10K SE +/- 106.53, N = 3 SE +/- 35.85, N = 3 SE +/- 116.22, N = 3 8217 7470 8389 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Barbershop - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.26, N = 3 SE +/- 0.41, N = 3 SE +/- 0.27, N = 3 108.31 109.65 109.07
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.10.20 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6 12 18 24 30 SE +/- 0.35, N = 13 SE +/- 0.39, N = 14 SE +/- 0.36, N = 12 24.81 26.53 25.96
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 7 14 21 28 35 SE +/- 0.34, N = 3 SE +/- 0.37, N = 15 SE +/- 0.34, N = 15 29.81 27.16 28.14 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 8 16 24 32 40 SE +/- 0.29, N = 15 SE +/- 0.37, N = 15 SE +/- 0.49, N = 15 28.75 35.46 32.65 MIN: 15.46 MIN: 14.26 MIN: 9.3 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 15.11 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.97, N = 3 SE +/- 1.42, N = 3 SE +/- 0.48, N = 3 102.14 108.60 104.25
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.57, N = 3 SE +/- 0.02, N = 3 SE +/- 0.14, N = 3 105.65 104.85 104.32 1. (CXX) g++ options: -O2 -lOpenCL
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 50.25, N = 12 SE +/- 5.62, N = 3 SE +/- 22.13, N = 15 2735.74 2731.13 2650.50 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Triple SHA-256, Onecoin Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 90K 180K 270K 360K 450K SE +/- 1250.96, N = 3 SE +/- 11124.39, N = 12 SE +/- 8447.87, N = 12 422680 402686 405578 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50K 100K 150K 200K 250K SE +/- 275.74, N = 3 SE +/- 253.78, N = 3 SE +/- 252.80, N = 3 213870.87 213970.43 214228.62 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Pabellon Barcelona - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.24, N = 3 SE +/- 0.26, N = 3 SE +/- 0.21, N = 3 88.36 89.25 88.16
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 6.90 6.93 6.93 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 7.36 7.34 7.41 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.061 0.122 0.183 0.244 0.305 SE +/- 0.003, N = 15 SE +/- 0.004, N = 3 SE +/- 0.003, N = 4 0.266 0.271 0.263 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 200K 400K 600K 800K 1000K SE +/- 10755.01, N = 15 SE +/- 12775.84, N = 3 SE +/- 10643.51, N = 4 946634 927066 954214 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 9.20, N = 2 SE +/- 7.67, N = 5 2985.6 2969.1 2947.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.13, N = 3 SE +/- 0.33, N = 3 SE +/- 1.05, N = 3 75.95 76.88 74.31
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 500 1000 1500 2000 2500 SE +/- 24.82, N = 3 SE +/- 25.12, N = 4 SE +/- 33.66, N = 15 2221.97 2253.80 2440.72 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 100 200 300 400 500 SE +/- 0.96, N = 3 SE +/- 1.27, N = 3 SE +/- 1.72, N = 3 437.97 472.25 433.43 MIN: 422.42 MIN: 442.59 MIN: 414.41 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 100 200 300 400 500 SE +/- 0.63, N = 3 SE +/- 2.30, N = 3 SE +/- 1.48, N = 3 437.87 468.05 431.30 MIN: 422.67 MIN: 438.61 MIN: 414.55 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception V4 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 200K 400K 600K 800K 1000K SE +/- 7233.85, N = 5 SE +/- 7218.89, N = 3 SE +/- 2002.95, N = 3 687019 892083 716517
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Classroom - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 72.28 72.53 71.92
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed Ubuntu 20.04.2 LTS Ubuntu 21.04 Ubuntu 20.10 1300 2600 3900 5200 6500 SE +/- 47.03, N = 3 SE +/- 53.61, N = 3 SE +/- 43.45, N = 13 4576.8 6148.4 4516.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.4.2 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 10 20 30 40 50 SE +/- 0.52, N = 3 SE +/- 0.37, N = 9 SE +/- 0.02, N = 3 40.21 44.01 40.72
Tungsten Renderer Scene: Volumetric Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4 8 12 16 20 SE +/- 0.44, N = 15 SE +/- 0.44, N = 15 SE +/- 0.39, N = 15 13.45 13.76 13.87 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 16 32 48 64 80 SE +/- 0.48, N = 3 SE +/- 0.57, N = 3 SE +/- 0.36, N = 3 70.06 74.32 66.33
OpenVKL Benchmark: vklBenchmarkVdbVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 5M 10M 15M 20M 25M SE +/- 58341.71, N = 3 SE +/- 181382.68, N = 3 SE +/- 187763.97, N = 3 21915837 22032745 22077090 MIN: 1062215 / MAX: 153704880 MIN: 1023660 / MAX: 161438904 MIN: 1030467 / MAX: 169859376
PJSIP Method: OPTIONS, Stateful OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateful Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 800 1600 2400 3200 4000 SE +/- 6.89, N = 3 SE +/- 7.67, N = 3 SE +/- 13.59, N = 3 3815 3716 3597 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2M 4M 6M 8M 10M SE +/- 137751.39, N = 3 SE +/- 118341.78, N = 3 SE +/- 7549.83, N = 3 10077667 9267667 10277000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40M 80M 120M 160M 200M SE +/- 2129761.70, N = 15 SE +/- 1600302.62, N = 3 SE +/- 1908422.73, N = 4 180493580 177624409 173250319 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
DaCapo Benchmark Java Test: H2 OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: H2 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2K 4K 6K 8K 10K SE +/- 133.93, N = 4 SE +/- 129.08, N = 4 SE +/- 115.71, N = 20 10740 11377 11235
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 12.48, N = 3 SE +/- 15.59, N = 4 SE +/- 9.85, N = 3 2727.6 2713.8 2740.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.81, N = 3 SE +/- 0.92, N = 4 SE +/- 0.68, N = 3 82.3 81.7 84.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
PJSIP Method: OPTIONS, Stateless OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateless Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9K 18K 27K 36K 45K SE +/- 533.56, N = 3 SE +/- 217.12, N = 3 SE +/- 352.27, N = 3 40082 40166 40954 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
DaCapo Benchmark Java Test: Jython OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Jython Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 1200 2400 3600 4800 6000 SE +/- 153.92, N = 20 SE +/- 174.19, N = 20 SE +/- 194.90, N = 20 5320 5614 5518
Nebular Empirical Analysis Tool OpenBenchmarking.org Seconds, Fewer Is Better Nebular Empirical Analysis Tool 2.3 Ubuntu 21.04 Ubuntu 20.10 13 26 39 52 65 SE +/- 0.19, N = 3 SE +/- 0.44, N = 3 45.57 55.92 1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Fishy Cat - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 11 22 33 44 55 SE +/- 0.15, N = 3 SE +/- 0.44, N = 3 SE +/- 0.17, N = 3 45.72 46.91 46.11
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 12.92 12.66 12.58 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OSPray Demo: San Miguel - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: Path Tracer Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 10.42 10.42 10.42 MIN: 7.87 / MAX: 10.64 MIN: 7.25 / MAX: 10.53 MIN: 7.94 / MAX: 10.53
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 13.38 13.20 13.33 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Swet Average OpenBenchmarking.org Operations Per Second, More Is Better Swet 1.5.16 Average Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 140M 280M 420M 560M 700M SE +/- 1889606.21, N = 3 SE +/- 7650995.20, N = 3 SE +/- 7221883.45, N = 15 639155887 627367473 579234165 1. (CC) gcc options: -lm -lpthread -lcurses -lrt
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 8 16 24 32 40 SE +/- 0.27, N = 3 SE +/- 0.06, N = 3 SE +/- 0.19, N = 3 31.27 31.13 32.48 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.04 Test: OFDM_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30M 60M 90M 120M 150M SE +/- 284800.12, N = 3 SE +/- 305505.05, N = 3 SE +/- 152752.52, N = 3 120766667 121300000 121200000 -lconfig++ -lsctp 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Chia Blockchain VDF Test: Square Assembly Optimized OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.1 Test: Square Assembly Optimized Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30K 60K 90K 120K 150K SE +/- 1568.63, N = 5 SE +/- 900.00, N = 3 SE +/- 680.69, N = 3 147240 147100 143600 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
srsLTE Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsLTE 20.10.1 Test: OFDM_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30M 60M 90M 120M 150M SE +/- 650640.71, N = 3 SE +/- 260341.66, N = 3 SE +/- 176383.42, N = 3 121500000 120333333 120733333 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9 18 27 36 45 SE +/- 0.15, N = 3 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 40.41 39.84 39.52 1. (CXX) g++ options: -O2 -lOpenCL
Sysbench Test: RAM / Memory OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3K 6K 9K 12K 15K SE +/- 145.91, N = 15 SE +/- 151.11, N = 12 SE +/- 138.52, N = 15 12176.80 12184.78 12252.73 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 500K 1000K 1500K 2000K 2500K SE +/- 2872.35, N = 3 SE +/- 3242.58, N = 3 SE +/- 24473.66, N = 5 2347453.57 2316974.82 2261054.03 1. (CC) gcc options: -O2 -lrt -lrt
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 700 1400 2100 2800 3500 SE +/- 8.34, N = 3 SE +/- 15.05, N = 3 SE +/- 7.83, N = 3 3283.9 3235.7 3251.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 70 140 210 280 350 SE +/- 3.33, N = 3 SE +/- 1.00, N = 3 SE +/- 3.27, N = 3 300.2 297.4 287.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.17, N = 3 SE +/- 0.12, N = 3 16.97 17.02 16.75 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
srsLTE Test: PHY_DL_Test OpenBenchmarking.org UE Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.30, N = 3 SE +/- 0.90, N = 3 SE +/- 0.13, N = 3 86.6 84.4 88.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsLTE Test: PHY_DL_Test OpenBenchmarking.org eNb Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50 100 150 200 250 SE +/- 0.84, N = 3 SE +/- 1.35, N = 3 SE +/- 0.18, N = 3 210.9 207.6 212.7 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9 18 27 36 45 SE +/- 0.47, N = 3 SE +/- 0.05, N = 3 SE +/- 0.38, N = 3 36.76 38.68 38.52 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4 8 12 16 20 SE +/- 0.19, N = 4 SE +/- 0.11, N = 3 SE +/- 0.14, N = 15 15.38 15.36 15.50 1. (CXX) g++ options: -O3 -fPIC -lm
srsRAN Test: PHY_DL_Test OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.78, N = 3 SE +/- 0.23, N = 3 SE +/- 0.37, N = 3 85.1 85.3 85.7 -lconfig++ -lsctp 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: PHY_DL_Test OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50 100 150 200 250 SE +/- 0.78, N = 3 SE +/- 0.64, N = 3 SE +/- 0.53, N = 3 206.8 205.4 203.3 -lconfig++ -lsctp 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
DaCapo Benchmark Java Test: Tradebeans OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4K 8K 12K 16K 20K SE +/- 82.67, N = 4 SE +/- 203.85, N = 4 SE +/- 203.38, N = 4 16729 17609 18060
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 35.19 37.07 35.99
Chia Blockchain VDF Test: Square Plain C++ OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.1 Test: Square Plain C++ Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30K 60K 90K 120K 150K SE +/- 88.19, N = 3 SE +/- 484.19, N = 3 SE +/- 585.95, N = 3 138967 139033 139400 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.3157 0.6314 0.9471 1.2628 1.5785 SE +/- 0.00905, N = 15 SE +/- 0.01231, N = 3 SE +/- 0.00863, N = 3 1.25869 1.40323 1.30999 MIN: 0.86 MIN: 0.82 MIN: 0.92 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6 12 18 24 30 SE +/- 0.28, N = 3 SE +/- 0.15, N = 3 SE +/- 0.23, N = 5 23.59 23.69 22.82 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.010, N = 3 SE +/- 0.030, N = 3 SE +/- 0.045, N = 3 8.739 12.100 11.858 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6K 12K 18K 24K 30K SE +/- 43.42, N = 3 SE +/- 45.12, N = 3 SE +/- 84.38, N = 3 28712 20742 21161 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
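The read-write latency and TPS charts above look like two views of the same runs: with a fixed pool of 250 clients, average latency should be roughly the client count divided by throughput. The Python sketch below cross-checks that relation against the reported figures. It assumes the latency and TPS values pair up in the order they are listed; the helper names `expected_latency_ms` and `relative_gap` are illustrative and are not part of pgbench or the Phoronix Test Suite.

```python
# Cross-check: average latency (ms) ~= clients / TPS * 1000 for a closed loop
# of clients that each wait for one transaction at a time.
CLIENTS = 250  # from the test configuration "Clients: 250"

# (TPS, reported average latency in ms), paired in the order listed above.
# Assumption: the two charts list their results in the same order.
READ_WRITE = [(28712, 8.739), (20742, 12.100), (21161, 11.858)]


def expected_latency_ms(clients: int, tps: float) -> float:
    """Illustrative helper: latency implied by a given throughput."""
    return clients / tps * 1000.0


def relative_gap(expected: float, reported: float) -> float:
    """Absolute difference as a fraction of the reported value."""
    return abs(expected - reported) / reported


if __name__ == "__main__":
    for tps, reported in READ_WRITE:
        implied = expected_latency_ms(CLIENTS, tps)
        gap = relative_gap(implied, reported)
        print(f"TPS {tps}: implied {implied:.3f} ms, "
              f"reported {reported:.3f} ms, gap {gap:.2%}")
```

For all three pairs the implied latency lands within one percent of the reported value, so the two charts can be read as a single measurement rather than independent evidence.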
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.064 0.128 0.192 0.256 0.32 SE +/- 0.00049, N = 3 SE +/- 0.00359, N = 3 SE +/- 0.00044, N = 3 0.27128 0.28452 0.27083
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.146, N = 15 SE +/- 0.088, N = 7 SE +/- 0.118, N = 12 8.522 9.451 8.967 1. (CXX) g++ options: -O3 -fPIC -lm
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.0556 0.1112 0.1668 0.2224 0.278 SE +/- 0.002322, N = 6 SE +/- 0.001954, N = 10 SE +/- 0.001876, N = 8 0.231483 0.247260 0.218363 MIN: 0.2 MIN: 0.2 MIN: 0.19 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.058, N = 3 SE +/- 0.014, N = 3 SE +/- 0.033, N = 3 9.004 8.851 8.884 1. (CXX) g++ options: -O3 -pthread
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 29.61 29.56 29.54
oneDNN Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.7447 1.4894 2.2341 2.9788 3.7235 SE +/- 0.02675, N = 12 SE +/- 0.03870, N = 3 SE +/- 0.03357, N = 3 3.03516 3.30961 3.01186 MIN: 2.86 MIN: 2.85 MIN: 2.84 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30K 60K 90K 120K 150K SE +/- 160.48, N = 3 SE +/- 877.45, N = 3 SE +/- 277.78, N = 3 118366 111351 116803 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 Ubuntu 21.04 Ubuntu 20.10 50K 100K 150K 200K 250K SE +/- 66.66, N = 3 SE +/- 149.68, N = 3 211170.46 211559.00 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.106 0.212 0.318 0.424 0.53 SE +/- 0.004513, N = 6 SE +/- 0.003416, N = 15 SE +/- 0.003621, N = 8 0.445171 0.471052 0.432599 MIN: 0.4 MIN: 0.39 MIN: 0.39 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 160K 320K 480K 640K 800K SE +/- 3328.66, N = 3 SE +/- 4029.73, N = 3 SE +/- 5472.66, N = 3 715587 717063 726623
OSPray 1.8.5 - Demo: San Miguel - Renderer: SciVis (FPS, More Is Better)
  Ubuntu 21.04: 90.91 (SE +/- 0.00, N = 3, MIN: 55.56 / MAX: 100)
  Ubuntu 20.10: 90.91 (SE +/- 0.00, N = 3, MIN: 43.48 / MAX: 100)
  Ubuntu 20.04.2 LTS: 90.91 (SE +/- 0.00, N = 3, MIN: 55.56 / MAX: 100)
libavif avifenc 0.9.0 - Encoder Speed: 10 (Seconds, Fewer Is Better)
  Ubuntu 21.04: 5.050 (SE +/- 0.074, N = 15)
  Ubuntu 20.10: 5.855 (SE +/- 0.084, N = 15)
  Ubuntu 20.04.2 LTS: 5.361 (SE +/- 0.099, N = 15)
  1. (CXX) g++ options: -O3 -fPIC -lm
oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 1.82051 (SE +/- 0.01169, N = 14, MIN: 1.67)
  Ubuntu 20.10: 1.96166 (SE +/- 0.01622, N = 9, MIN: 1.66)
  Ubuntu 20.04.2 LTS: 1.80328 (SE +/- 0.00660, N = 3, MIN: 1.67)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenFOAM 8 - Input: Motorbike 30M (Seconds, Fewer Is Better)
  Ubuntu 21.04: 15.05 (SE +/- 0.11, N = 3)
  Ubuntu 20.10: 15.01 (SE +/- 0.06, N = 3)
  Ubuntu 20.04.2 LTS: 14.96 (SE +/- 0.04, N = 3)
  1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
  Ubuntu 21.04: 22.78 (SE +/- 0.11, N = 3)
  Ubuntu 20.10: 22.74 (SE +/- 0.02, N = 3)
  Ubuntu 20.04.2 LTS: 23.16 (SE +/- 0.24, N = 3)
  1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Tungsten Renderer 0.2.2 - Scene: Hair (Seconds, Fewer Is Better)
  Ubuntu 21.04: 6.52538 (SE +/- 0.07388, N = 3)
  Ubuntu 20.10: 6.75881 (SE +/- 0.07152, N = 15)
  Ubuntu 20.04.2 LTS: 6.45646 (SE +/- 0.08029, N = 15)
  1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
x265 3.4 - Video Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Ubuntu 21.04: 28.33 (SE +/- 0.34, N = 3)
  Ubuntu 20.10: 28.26 (SE +/- 0.29, N = 5)
  Ubuntu 20.04.2 LTS: 28.50 (SE +/- 0.25, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
  Ubuntu 21.04: 366.73 (SE +/- 0.08, N = 3, MIN: 365.95 / MAX: 373.2)
  Ubuntu 20.10: 366.20 (SE +/- 0.02, N = 3, MIN: 366.01 / MAX: 367.51)
  Ubuntu 20.04.2 LTS: 367.80 (SE +/- 0.53, N = 3, MIN: 367 / MAX: 370.11)
  1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OSPray 1.8.5 - Demo: XFrog Forest - Renderer: Path Tracer (FPS, More Is Better)
  Ubuntu 21.04: 10.38 (SE +/- 0.04, N = 3, MIN: 7.58 / MAX: 10.75)
  Ubuntu 20.10: 10.31 (SE +/- 0.00, N = 3, MIN: 7.46 / MAX: 10.75)
  Ubuntu 20.04.2 LTS: 10.38 (SE +/- 0.04, N = 3, MIN: 9.17 / MAX: 10.53)
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Medium (Frames Per Second, More Is Better)
  Ubuntu 21.04: 25.42 (SE +/- 0.07, N = 3)
  Ubuntu 20.10: 25.26 (SE +/- 0.04, N = 3)
  Ubuntu 20.04.2 LTS: 25.66 (SE +/- 0.05, N = 3)
  1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.945450 (SE +/- 0.010679, N = 4, MIN: 0.86)
  Ubuntu 20.10: 1.018895 (SE +/- 0.009311, N = 7, MIN: 0.84)
  Ubuntu 20.04.2 LTS: 0.942653 (SE +/- 0.011491, N = 3, MIN: 0.85)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
PyBench 2018-02-16 - Total For Average Test Times (Milliseconds, Fewer Is Better)
  Ubuntu 21.04: 995 (SE +/- 1.86, N = 3)
  Ubuntu 20.10: 984 (SE +/- 2.67, N = 3)
  Ubuntu 20.04.2 LTS: 982 (SE +/- 0.88, N = 3)
SVT-HEVC 1.5.0 - Tuning: 1 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Ubuntu 21.04: 29.57 (SE +/- 0.09, N = 3)
  Ubuntu 20.10: 28.63 (SE +/- 0.18, N = 3)
  Ubuntu 20.04.2 LTS: 29.14 (SE +/- 0.33, N = 3)
  1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Liquid-DSP 2021.01.31 - Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Ubuntu 21.04: 3044966667 (SE +/- 19853994.84, N = 3)
  Ubuntu 20.10: 2157125000 (SE +/- 24378623.66, N = 4)
  Ubuntu 20.04.2 LTS: 2208933333 (SE +/- 4879321.45, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 3.27051 (SE +/- 0.00929, N = 3, MIN: 3.09)
  Ubuntu 20.10: 3.50505 (SE +/- 0.02670, N = 3, MIN: 3.07)
  Ubuntu 20.04.2 LTS: 3.47588 (SE +/- 0.00414, N = 3, MIN: 3.31)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.362841 (SE +/- 0.004351, N = 3, MIN: 0.32)
  Ubuntu 20.10: 0.386794 (SE +/- 0.003521, N = 3, MIN: 0.31)
  Ubuntu 20.04.2 LTS: 0.353757 (SE +/- 0.003468, N = 3, MIN: 0.31)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Liquid-DSP 2021.01.31 - Threads: 160 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Ubuntu 21.04: 3080633333 (SE +/- 15087448.79, N = 3)
  Ubuntu 20.10: 3086966667 (SE +/- 10305392.33, N = 3)
  Ubuntu 20.04.2 LTS: 3106666667 (SE +/- 11545032.60, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31 - Threads: 128 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Ubuntu 21.04: 3290766667 (SE +/- 9583724.63, N = 3)
  Ubuntu 20.10: 3292133333 (SE +/- 13475203.56, N = 3)
  Ubuntu 20.04.2 LTS: 3316633333 (SE +/- 8772368.23, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31 - Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Ubuntu 21.04: 1631166667 (SE +/- 768837.51, N = 3)
  Ubuntu 20.10: 1569933333 (SE +/- 11623730.52, N = 3)
  Ubuntu 20.04.2 LTS: 1653866667 (SE +/- 5634516.64, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Ubuntu 21.04: 826756667 (SE +/- 702242.44, N = 3)
  Ubuntu 20.10: 812776667 (SE +/- 2859827.11, N = 3)
  Ubuntu 20.04.2 LTS: 832983333 (SE +/- 5004752.19, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Rodinia 3.1 - Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
  Ubuntu 21.04: 7.696 (SE +/- 0.081, N = 3)
  Ubuntu 20.10: 8.563 (SE +/- 0.089, N = 3)
  Ubuntu 20.04.2 LTS: 8.012 (SE +/- 0.145, N = 15)
  1. (CXX) g++ options: -O2 -lOpenCL
Intel Open Image Denoise 1.2.0 - Scene: Memorial (Images / Sec, More Is Better)
  Ubuntu 21.04: 57.59 (SE +/- 2.57, N = 12)
  Ubuntu 20.10: 50.81 (SE +/- 2.62, N = 12)
  Ubuntu 20.04.2 LTS: 49.87 (SE +/- 2.40, N = 12)
Embree 3.13 - Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
  Ubuntu 21.04: 81.29 (SE +/- 0.58, N = 3, MIN: 67.06 / MAX: 91.79)
  Ubuntu 20.10: 80.22 (SE +/- 0.89, N = 5, MIN: 64.42 / MAX: 92.94)
  Ubuntu 20.04.2 LTS: 82.61 (SE +/- 0.68, N = 9, MIN: 67 / MAX: 93.51)
OSPray 1.8.5 - Demo: XFrog Forest - Renderer: SciVis (FPS, More Is Better)
  Ubuntu 21.04: 18.87 (SE +/- 0.00, N = 3, MIN: 13.33 / MAX: 19.23)
  Ubuntu 20.10: 18.87 (SE +/- 0.00, N = 3, MIN: 12.05 / MAX: 19.23)
  Ubuntu 20.04.2 LTS: 18.87 (SE +/- 0.00, N = 3, MIN: 12.82 / MAX: 19.23)
POV-Ray 3.7.0.7 - Trace Time (Seconds, Fewer Is Better)
  Ubuntu 21.04: 9.476 (SE +/- 0.097, N = 5)
  Ubuntu 20.10: 9.365 (SE +/- 0.063, N = 3)
  Ubuntu 20.04.2 LTS: 9.451 (SE +/- 0.084, N = 3)
  Additional per-configuration linker options, as listed in the export:
    -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5
    -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5
    -lImath -lHalf -lIex -lIexMath -lpthread
  1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSM -lICE -lX11 -lIlmImf -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 2.10597 (SE +/- 0.01823, N = 3, MIN: 2.03)
  Ubuntu 20.10: 2.27008 (SE +/- 0.02930, N = 15, MIN: 2.03)
  Ubuntu 20.04.2 LTS: 2.10694 (SE +/- 0.01729, N = 3, MIN: 2.03)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Ubuntu 21.04: 204.86 (SE +/- 2.00, N = 15)
  Ubuntu 20.10: 198.24 (SE +/- 1.24, N = 3)
  Ubuntu 20.04.2 LTS: 196.97 (SE +/- 1.73, N = 3)
  1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Ubuntu 21.04: 210.15 (SE +/- 2.00, N = 15)
  Ubuntu 20.10: 201.11 (SE +/- 2.23, N = 3)
  Ubuntu 20.04.2 LTS: 190.48 (SE +/- 2.60, N = 3)
  1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.193198 (SE +/- 0.001836, N = 13, MIN: 0.18)
  Ubuntu 20.10: 0.206919 (SE +/- 0.001494, N = 14, MIN: 0.18)
  Ubuntu 20.04.2 LTS: 0.190921 (SE +/- 0.001679, N = 14, MIN: 0.18)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.256555 (SE +/- 0.002948, N = 3, MIN: 0.23)
  Ubuntu 20.10: 0.270802 (SE +/- 0.003157, N = 4, MIN: 0.23)
  Ubuntu 20.04.2 LTS: 0.249170 (SE +/- 0.003241, N = 3, MIN: 0.22)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OSPray 1.8.5 - Demo: Magnetic Reconnection - Renderer: SciVis (FPS, More Is Better)
  Ubuntu 21.04: 109.63 (SE +/- 1.01, N = 15, MIN: 20.83 / MAX: 125)
  Ubuntu 20.10: 111.11 (SE +/- 0.00, N = 3, MIN: 23.26)
  Ubuntu 20.04.2 LTS: 110.19 (SE +/- 0.93, N = 12, MIN: 21.74 / MAX: 111.11)
toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Tasks (ms, Fewer Is Better)
  Ubuntu 21.04: 8015 (SE +/- 77.75, N = 6)
  Ubuntu 20.10: 7963 (SE +/- 78.83, N = 6)
  Ubuntu 20.04.2 LTS: 7871 (SE +/- 99.62, N = 3)
  1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
Embree 3.13 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
  Ubuntu 21.04: 106.69 (SE +/- 0.84, N = 10, MIN: 85.33 / MAX: 112.78)
  Ubuntu 20.10: 101.86 (SE +/- 0.47, N = 3, MIN: 80.69 / MAX: 111.15)
  Ubuntu 20.04.2 LTS: 106.14 (SE +/- 0.91, N = 3, MIN: 84.21 / MAX: 111.87)
Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
  Ubuntu 21.04: 11.33 (SE +/- 0.04, N = 3)
  Ubuntu 20.10: 11.88 (SE +/- 0.04, N = 3)
  Ubuntu 20.04.2 LTS: 11.71 (SE +/- 0.03, N = 3)
  Additional per-configuration linker options, as listed in the export:
    -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
    -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
  1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.615331 (SE +/- 0.004554, N = 3, MIN: 0.57)
  Ubuntu 20.10: 0.659638 (SE +/- 0.004580, N = 3, MIN: 0.56)
  Ubuntu 20.04.2 LTS: 0.602936 (SE +/- 0.002882, N = 3, MIN: 0.56)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, More Is Better)
  Ubuntu 21.04: 49.20 (SE +/- 0.45, N = 3)
  Ubuntu 20.10: 48.57 (SE +/- 0.20, N = 3)
  Ubuntu 20.04.2 LTS: 49.09 (SE +/- 0.46, N = 3)
  1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
  Ubuntu 21.04: 6259.88 (SE +/- 51.84, N = 15)
  Ubuntu 20.10: 5881.04 (SE +/- 92.04, N = 15)
  Ubuntu 20.04.2 LTS: 6011.42 (SE +/- 81.23, N = 15)
  Additional per-configuration linker options, as listed in the export:
    -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
    -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Ubuntu 21.04: Open MPI 4.1.0
  3. Ubuntu 20.10: Open MPI 4.0.3
  4. Ubuntu 20.04.2 LTS: Open MPI 4.0.3
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
  Ubuntu 21.04: 187807.08 (SE +/- 419.66, N = 3)
  Ubuntu 20.10: 188751.17 (SE +/- 579.11, N = 3)
  Ubuntu 20.04.2 LTS: 189606.14 (SE +/- 1295.33, N = 3)
  Additional per-configuration linker options, as listed in the export:
    -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
    -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Ubuntu 21.04: Open MPI 4.1.0
  3. Ubuntu 20.10: Open MPI 4.0.3
  4. Ubuntu 20.04.2 LTS: Open MPI 4.0.3
Embree 3.13 - Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better)
  Ubuntu 21.04: 64.30 (SE +/- 0.70, N = 3, MIN: 55.83 / MAX: 81.77)
  Ubuntu 20.10: 64.42 (SE +/- 0.77, N = 3, MIN: 56.4 / MAX: 82.25)
  Ubuntu 20.04.2 LTS: 65.02 (SE +/- 0.74, N = 3, MIN: 57.31 / MAX: 81.55)
Timed MPlayer Compilation 1.4 - Time To Compile (Seconds, Fewer Is Better)
  Ubuntu 21.04: 10.11 (SE +/- 0.06, N = 3)
  Ubuntu 20.10: 11.10 (SE +/- 0.06, N = 3)
  Ubuntu 20.04.2 LTS: 10.62 (SE +/- 0.12, N = 3)
Embree 3.13 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
  Ubuntu 21.04: 67.04 (SE +/- 0.30, N = 3, MIN: 59.72 / MAX: 87.99)
  Ubuntu 20.10: 66.16 (SE +/- 0.47, N = 3, MIN: 57.81 / MAX: 87.38)
  Ubuntu 20.04.2 LTS: 66.89 (SE +/- 0.70, N = 3, MIN: 59.4 / MAX: 88.24)
OSPray 1.8.5 - Demo: NASA Streamlines - Renderer: Path Tracer (FPS, More Is Better)
  Ubuntu 21.04: 27.78 (SE +/- 0.00, N = 3, MIN: 17.24 / MAX: 29.41)
  Ubuntu 20.10: 27.78 (SE +/- 0.00, N = 3, MIN: 16.67 / MAX: 29.41)
  Ubuntu 20.04.2 LTS: 27.78 (SE +/- 0.00, N = 3, MIN: 16.67 / MAX: 29.41)
oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 1.37903 (SE +/- 0.01131, N = 3, MIN: 1.33)
  Ubuntu 20.10: 1.54442 (SE +/- 0.02039, N = 3, MIN: 1.31)
  Ubuntu 20.04.2 LTS: 1.37432 (SE +/- 0.01216, N = 3, MIN: 1.32)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Threads (ms, Fewer Is Better)
  Ubuntu 21.04: 7118 (SE +/- 85.39, N = 4)
  Ubuntu 20.10: 7165 (SE +/- 61.67, N = 3)
  Ubuntu 20.04.2 LTS: 7173 (SE +/- 77.87, N = 4)
  1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
toyBrot Fractal Generator 2020-11-18 - Implementation: OpenMP (ms, Fewer Is Better)
  Ubuntu 21.04: 7438 (SE +/- 105.33, N = 3)
  Ubuntu 20.10: 7806 (SE +/- 108.03, N = 3)
  Ubuntu 20.04.2 LTS: 7328 (SE +/- 81.17, N = 4)
  1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
SVT-VP9 0.3 - Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Ubuntu 21.04: 170.47 (SE +/- 2.03, N = 3)
  Ubuntu 20.10: 164.77 (SE +/- 0.47, N = 3)
  Ubuntu 20.04.2 LTS: 159.10 (SE +/- 1.88, N = 4)
  1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
  Ubuntu 21.04: 23.87 (SE +/- 0.02, N = 3)
  Ubuntu 20.10: 21.65 (SE +/- 0.26, N = 15)
  Ubuntu 20.04.2 LTS: 22.08 (SE +/- 0.64, N = 12)
  1. (CXX) g++ options: -O3 -pthread -lm
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 1.41084 (SE +/- 0.00589, N = 3, MIN: 1.27)
  Ubuntu 20.10: 1.57954 (SE +/- 0.01111, N = 3, MIN: 1.24)
  Ubuntu 20.04.2 LTS: 1.39626 (SE +/- 0.00638, N = 3, MIN: 1.26)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.924756 (SE +/- 0.005014, N = 3, MIN: 0.86)
  Ubuntu 20.10: 1.031020 (SE +/- 0.004578, N = 3, MIN: 0.84)
  Ubuntu 20.04.2 LTS: 0.903312 (SE +/- 0.005605, N = 3, MIN: 0.84)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Ubuntu 21.04: 0.846330 (SE +/- 0.008374, N = 6, MIN: 0.8)
  Ubuntu 20.10: 0.902751 (SE +/- 0.008637, N = 6, MIN: 0.8)
  Ubuntu 20.04.2 LTS: 0.850400 (SE +/- 0.008602, N = 6, MIN: 0.8)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Ubuntu 21.04: 260.88 (SE +/- 2.49, N = 3)
  Ubuntu 20.10: 237.32 (SE +/- 2.36, N = 6)
  Ubuntu 20.04.2 LTS: 234.92 (SE +/- 1.73, N = 3)
  1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Ubuntu 21.04: 172.02 (SE +/- 1.72, N = 3)
  Ubuntu 20.10: 161.78 (SE +/- 1.13, N = 3)
  Ubuntu 20.04.2 LTS: 157.78 (SE +/- 1.68, N = 3)
  1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Tungsten Renderer 0.2.2 - Scene: Non-Exponential (Seconds, Fewer Is Better)
  Ubuntu 21.04: 5.23912 (SE +/- 0.07466, N = 3)
  Ubuntu 20.10: 5.25085 (SE +/- 0.04628, N = 3)
  Ubuntu 20.04.2 LTS: 5.16631 (SE +/- 0.02249, N = 3)
  1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
OSPray 1.8.5 - Demo: NASA Streamlines - Renderer: SciVis (FPS, More Is Better)
  Ubuntu 21.04: 125 (MIN: 31.25 / MAX: 142.86)
  Ubuntu 20.10: 125 (MIN: 28.57 / MAX: 142.86)
  Ubuntu 20.04.2 LTS: 125 (MIN: 33.33 / MAX: 142.86)
BlogBench 1.1 - Test: Write (Final Score, More Is Better)
  Ubuntu 21.04: 60768 (SE +/- 717.36, N = 3)
  Ubuntu 20.10: 62517 (SE +/- 419.01, N = 3)
  Ubuntu 20.04.2 LTS: 63062 (SE +/- 770.50, N = 3)
  1. (CC) gcc options: -O2 -pthread
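Each result above is a mean with a standard error (SE) over N runs, so a gap between two releases is easier to judge when it is set against the run-to-run noise. The sketch below is one rough way to do that and is not part of the Phoronix Test Suite output. The helper names `percent_change` and `gap_in_se_units` are hypothetical, and the pairing of values with releases assumes the export lists them in legend order (Ubuntu 21.04 first, then Ubuntu 20.10).

```python
from math import sqrt


def percent_change(new: float, old: float) -> float:
    """Relative change of `new` versus `old`, in percent."""
    return (new - old) / old * 100.0


def gap_in_se_units(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Difference of two means divided by their combined standard error.

    Values well above ~2 in magnitude suggest the gap is larger than
    run-to-run noise; this is a rough screen, not a formal test.
    """
    return (mean_a - mean_b) / sqrt(se_a ** 2 + se_b ** 2)


# (mean, SE) pairs copied from the results above. ASSUMPTION: values are
# listed in legend order, i.e. Ubuntu 21.04 first and Ubuntu 20.10 second.
liquid_dsp_64_2104 = (3044966667.0, 19853994.84)  # samples/s, more is better
liquid_dsp_64_2010 = (2157125000.0, 24378623.66)
blender_2104 = (29.61, 0.04)                      # seconds, fewer is better
blender_2010 = (29.56, 0.14)

liquid_pct = percent_change(liquid_dsp_64_2104[0], liquid_dsp_64_2010[0])
liquid_gap = gap_in_se_units(*liquid_dsp_64_2104, *liquid_dsp_64_2010)
blender_pct = percent_change(blender_2104[0], blender_2010[0])
blender_gap = gap_in_se_units(*blender_2104, *blender_2010)

print(f"Liquid-DSP 64 threads: {liquid_pct:+.1f}% ({liquid_gap:.1f} combined SEs)")
print(f"Blender BMW27:         {blender_pct:+.2f}% ({blender_gap:.2f} combined SEs)")
```

Under these assumptions the Liquid-DSP 64-thread gap is many combined standard errors wide, while the Blender BMW27 gap is well inside one, which matches the impression from the raw numbers.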
Phoronix Test Suite v10.8.4