Ice Lake Ubuntu 20.04.2 / 20.10 / 21.04 Linux Benchmarks Benchmarks for a future article. 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2105188-IB-2105170IB02&grs .
Ice Lake Ubuntu 20.04.2 / 20.10 / 21.04 Linux Benchmarks Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Compiler File-System Screen Resolution Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 800GB INTEL SSDPF21Q800GB ASPEED VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 21.04 5.11.0-17-generic (x86_64) GNOME Shell 3.38.4 X Server GCC 10.3.0 ext4 1920x1080 Ubuntu 20.10 5.8.0-53-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 GCC 10.2.0 Ubuntu 20.04 GNOME Shell 3.36.7 GCC 9.3.0 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - Ubuntu 21.04: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Ubuntu 20.10: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Ubuntu 20.04.2 LTS: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270 Java Details - Ubuntu 21.04: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2) - Ubuntu 20.10: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10) - Ubuntu 20.04.2 LTS: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04) Python Details - Ubuntu 21.04: Python 3.9.4 - Ubuntu 20.10: Python 3.8.6 - Ubuntu 20.04.2 LTS: Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Ice Lake Ubuntu 20.04.2 / 20.10 / 21.04 Linux Benchmarks build-wasmer: Time To Compile onnx: yolov4 - OpenMP CPU liquid-dsp: 64 - 256 - 57 wrf: conus 2.5km pgbench: 100 - 250 - Read Write - Average Latency pgbench: 100 - 250 - Read Write pjsip: INVITE tensorflow-lite: Mobilenet Quant compress-zstd: 3 - Compression Speed tensorflow-lite: Mobilenet Float tensorflow-lite: Inception V4 onednn: Deconvolution Batch shapes_1d - f32 - CPU neat: onnx: fcn-resnet101-11 - OpenMP CPU openvkl: vklBenchmarkUnstructuredVolume onnx: bertsquad-10 - OpenMP CPU build-llvm: Unix Makefiles onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onnx: shufflenet-v2-10 - OpenMP CPU compress-zstd: 19, Long Mode - Compression Speed build-llvm: Ninja build2: Time To Compile onednn: IP Shapes 1D - u8s8f32 - CPU svt-hevc: 10 - Bosphorus 1080p john-the-ripper: MD5 openvkl: vklBenchmark swet: Average svt-vp9: VMAF Optimized - Bosphorus 1080p onednn: IP Shapes 1D - bf16bf16bf16 - CPU cpuminer-opt: x25x build-mplayer: Time To Compile svt-av1: Preset 8 - Bosphorus 4K onednn: Recurrent Neural Network Training - u8s8f32 - CPU build-php: Time To Compile incompact3d: X3D-benchmarking input.i3d onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU svt-hevc: 7 - Bosphorus 1080p onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: IP Shapes 3D - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU compress-zstd: 19, Long Mode - Compression Speed onednn: IP Shapes 1D - f32 - CPU dacapobench: Tradebeans onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU svt-vp9: Visual Quality Optimized - Bosphorus 1080p build-linux-kernel: Time To Compile onednn: Deconvolution Batch shapes_3d - f32 - CPU toybrot: OpenMP build-nodejs: Time To Compile john-the-ripper: Blowfish pjsip: OPTIONS, Stateful dacapobench: H2 node-web-tooling: build-apache: Time To Compile liquid-dsp: 32 - 256 - 57 avifenc: 6, Lossless mrbayes: Primate Phylogeny Analysis namd: ATPase Simulation - 327,506 Atoms srslte: PHY_DL_Test incompact3d: input.i3d 193 Cells Per Direction embree: Pathtracer ISPC - Asian Dragon tungsten: Hair compress-zstd: 8, Long Mode - Compression Speed tungsten: Water Caustic build-erlang: Time To Compile stockfish: Total Time aom-av1: Speed 6 Two-Pass - Bosphorus 4K svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p coremark: CoreMark Size 666 - Iterations Per Second aom-av1: Speed 9 Realtime - Bosphorus 4K blogbench: Write wireguard: build-godot: Time To Compile cpuminer-opt: Skeincoin svt-hevc: 1 - Bosphorus 1080p pgbench: 100 - 250 - Read Only - Average Latency embree: Pathtracer - Asian Dragon pgbench: 100 - 250 - Read Only compress-zstd: 19 - Compression Speed nwchem: C240 Buckyball x265: Bosphorus 4K blender: Fishy Cat - CPU-Only chia-vdf: Square Assembly Optimized liquid-dsp: 16 - 256 - 57 srslte: PHY_DL_Test compress-zstd: 19, Long Mode - Decompression Speed securemark: SecureMark-TLS rodinia: OpenMP LavaMD lammps: 20k Atoms pjsip: OPTIONS, Stateless kvazaar: Bosphorus 4K - Ultra Fast toybrot: C++ Tasks gromacs: MPI CPU - water_GMX50_bare srsran: PHY_DL_Test tungsten: Non-Exponential aom-av1: Speed 8 Realtime - Bosphorus 4K kvazaar: Bosphorus 1080p - Medium phpbench: PHP Benchmark Suite compress-zstd: 19 - Decompression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19 - Compression Speed kvazaar: Bosphorus 4K - Very Fast ospray: Magnetic Reconnection - SciVis embree: Pathtracer ISPC - Crown blogbench: Read pybench: Total For Average Test Times compress-zstd: 3 - Decompression Speed kvazaar: Bosphorus 1080p - Very Fast helsing: 14 digit rodinia: OpenMP HotSpot3D blender: Barbershop - CPU-Only blender: Pabellon Barcelona - CPU-Only povray: Trace Time embree: Pathtracer - Crown compress-zstd: 19 - Decompression Speed libgav1: Summer Nature 4K srslte: OFDM_Test npb: LU.C aom-av1: Speed 6 Realtime - Bosphorus 4K avifenc: 6 hpcg: compress-zstd: 19, Long Mode - Decompression Speed x265: Bosphorus 1080p blender: Classroom - CPU-Only liquid-dsp: 160 - 256 - 57 liquid-dsp: 128 - 256 - 57 toybrot: C++ Threads openvkl: vklBenchmarkVdbVolume srsran: PHY_DL_Test ospray: XFrog Forest - Path Tracer sysbench: RAM / Memory openfoam: Motorbike 30M openfoam: Motorbike 60M srsran: OFDM_Test tnn: CPU - SqueezeNet v1.1 kvazaar: Bosphorus 4K - Medium chia-vdf: Square Plain C++ blender: BMW27 - CPU-Only aircrack-ng: sysbench: CPU relion: Basic - CPU compress-zstd: 3, Long Mode - Decompression Speed ospray: NASA Streamlines - Path Tracer ospray: NASA Streamlines - SciVis ospray: San Miguel - Path Tracer ospray: XFrog Forest - SciVis ospray: San Miguel - SciVis compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed onnx: super-resolution-10 - OpenMP CPU cassandra: Writes tnn: CPU - MobileNet v2 tensorflow-lite: Inception ResNet V2 tensorflow-lite: NASNet Mobile tensorflow-lite: SqueezeNet keydb: cpuminer-opt: Triple SHA-256, Onecoin cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Garlicoin cpuminer-opt: Blake-2 S cpuminer-opt: Deepcoin cpuminer-opt: Magi onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU yafaray: Total Time For Sample Scene tungsten: Volumetric Caustic avifenc: 10, Lossless avifenc: 10 asmfish: 1024 Hash Memory, 26 Depth oidn: Memorial dacapobench: Jython java-gradle-perf: Reactor lammps: Rhodopsin Protein rodinia: OpenMP Streamcluster rodinia: OpenMP Leukocyte npb: EP.C Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 44.607 479 3044966667 9903.05 8.739 28712 2531 43074.5 6148.4 41555.1 687019 28.7517 45.569 193 1776457 500 198.538 0.924756 0.231483 1.41084 1.37903 8217 46.0 129.152 70.057 1.25869 260.88 10077667 669 639155887 210.15 3.03516 2221.97 10.109 29.809 684.271 40.207 291.993968 0.615331 0.362841 672.646 172.02 437.972 0.445171 1.82051 0.256555 437.866 0.193198 47.2 0.945450 16729 2.10597 3.27051 170.47 24.813 0.846330 7438 102.136 118366 3815 10740 10.79 35.187 1631166667 36.764 170.019 0.27128 86.6 11.3305505 106.6858 6.52538 300.2 31.2739 180.485 180493580 4.05 204.86 2347453.567360 23.59 60768 681.119 75.951 277683 29.57 0.266 81.2911 946634 82.3 1875.8 12.92 45.72 147240 826756667 210.9 2719.4 230272 40.406 35.725 40082 22.78 8015 9.004 206.8 5.23912 16.97 25.42 715587 2615.9 3283.9 82.2 13.38 109.63 67.0395 2253810 995 2985.6 49.20 82.465 105.650 108.31 88.36 9.476 64.3015 2727.6 19.39 121500000 187807.08 7.36 15.383 39.8231 2708.5 28.33 72.28 3080633333 3290766667 7118 21915837 85.1 10.38 12176.80 15.05 105.35 120766667 366.729 6.90 138967 29.61 211170.458 213870.87 348.838 27.78 125 10.42 18.87 90.91 266.9 3073.3 2096.7 6967 106863 443.547 569303 81319.3 47808.3 524178.09 422680 317291 29473 1395127 29191 2735.74 671.541 439.677 86.283 13.4481 8.522 5.050 172711481 57.59 5320 366.363 23.866 7.696 62.044 6259.88 68.800 356 2157125000 9897.119 12.100 20742 1866 58977.6 4516.9 55489.8 892083 35.4561 55.921 161 1493294 435 226.852 1.03102 0.247260 1.57954 1.54442 7470 41.0 144.787 74.317 1.40323 237.32 9267667 604 627367473 201.11 3.30961 2253.80 11.098 27.160 730.825 44.011 299.195353 0.659638 0.386794 732.235 161.78 472.245 0.471052 1.96166 0.270802 468.050 0.206919 45.1 1.018895 17609 2.27008 3.50505 164.77 26.529 0.902751 7806 108.602 111351 3716 11377 10.37 37.074 1569933333 38.676 171.388 0.28452 84.4 11.8795649 101.8558 6.75881 297.4 31.1264 183.275 177624409 3.89 198.24 2316974.822459 23.69 62517 657.468 76.879 268467 28.63 0.271 80.2191 927066 81.7 1873.7 12.66 46.91 147100 812776667 207.6 2736.3 226588 39.838 35.506 40166 22.74 7963 8.851 205.4 5.25085 17.02 25.26 717063 2607.2 3235.7 81.8 13.20 111.11 66.1552 2245366 984 2969.1 48.57 83.525 104.849 109.65 89.25 9.365 64.4236 2713.8 19.20 120333333 188751.17 7.34 15.356 39.6266 2685.6 28.26 72.53 3086966667 3292133333 7165 22032745 85.3 10.31 12184.78 15.01 104.92 121300000 366.201 6.93 139033 29.56 211559.000 213970.43 348.747 3131.9 27.78 125 10.42 18.87 90.91 265.2 3022.1 2004.3 5673 104506 606.052 725870 157434 66836.4 355593.18 402686 322628 26183 1360453 28568 2731.13 754.982 506.972 83.265 13.7613 9.451 5.855 167218420 50.81 5614 401.780 21.646 8.563 85.284 5881.04 67.845 509 2208933333 13913.625 11.858 21161 1840 46915.5 4576.8 45998.8 716517 32.6500 193 1737714 485 216.231 0.903312 0.218363 1.39626 1.37432 8389 44.8 138.523 66.330 1.30999 234.92 10277000 661 579234165 190.48 3.01186 2440.72 10.618 28.139 667.646 40.723 319.487874 0.602936 0.353757 671.557 157.78 433.426 0.432599 1.80328 0.249170 431.304 0.190921 43.6 0.942653 18060 2.10694 3.47588 159.10 25.960 0.850400 7328 104.250 116803 3597 11235 10.94 35.987 1653866667 38.524 178.773 0.27083 88.6 11.7113234 106.1435 6.45646 287.2 32.4833 175.896 173250319 3.89 196.97 2261054.029261 22.82 63062 674.971 74.310 270519 29.14 0.263 82.6069 954214 84.0 1825 12.58 46.11 143600 832983333 212.7 2782.7 225195 39.522 36.300 40954 23.16 7871 8.884 203.3 5.16631 16.75 25.66 726623 2576.5 3251.5 83.0 13.33 110.19 66.8866 2224145 982 2947.3 49.09 82.638 104.315 109.07 88.16 9.451 65.0249 2740.7 19.31 120733333 189606.14 7.41 15.501 39.9669 2695.5 28.50 71.92 3106666667 3316633333 7173 22077090 85.7 10.38 12252.73 14.96 104.77 121200000 367.799 6.93 139400 29.54 214228.62 348.604 3131.1 27.78 125 10.42 18.87 90.91 259.1 2954.1 2128.7 6677 103051 374.203 576988 90927.9 48816.5 421171.59 405578 333611 26198 1414359 28282 2650.50 678.041 433.741 76.817 13.8745 8.967 5.361 171613950 49.87 5518 382.988 22.084 8.012 60.933 6011.42 OpenBenchmarking.org
Timed Wasmer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 15 30 45 60 75 SE +/- 0.37, N = 3 SE +/- 0.19, N = 3 SE +/- 0.51, N = 11 44.61 68.80 67.85 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 110 220 330 440 550 SE +/- 1.17, N = 3 SE +/- 1.67, N = 3 SE +/- 5.01, N = 3 479 356 509 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 700M 1400M 2100M 2800M 3500M SE +/- 19853994.84, N = 3 SE +/- 24378623.66, N = 4 SE +/- 4879321.45, N = 3 3044966667 2157125000 2208933333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
WRF Input: conus 2.5km OpenBenchmarking.org Seconds, Fewer Is Better WRF 4.2.2 Input: conus 2.5km Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3K 6K 9K 12K 15K 9903.05 9897.12 13913.63 -fallow-invalid-boz -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz -fallow-invalid-boz -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 1. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.010, N = 3 SE +/- 0.030, N = 3 SE +/- 0.045, N = 3 8.739 12.100 11.858 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6K 12K 18K 24K 30K SE +/- 43.42, N = 3 SE +/- 45.12, N = 3 SE +/- 84.38, N = 3 28712 20742 21161 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PJSIP Method: INVITE OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: INVITE Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 500 1000 1500 2000 2500 SE +/- 22.31, N = 15 SE +/- 14.40, N = 10 SE +/- 14.31, N = 3 2531 1866 1840 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
TensorFlow Lite Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Quant Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 13K 26K 39K 52K 65K SE +/- 601.06, N = 15 SE +/- 573.26, N = 15 SE +/- 706.39, N = 15 43074.5 58977.6 46915.5
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 1300 2600 3900 5200 6500 SE +/- 53.61, N = 3 SE +/- 43.45, N = 13 SE +/- 7.13, N = 3 6148.4 4516.9 4620.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
TensorFlow Lite Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Float Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 12K 24K 36K 48K 60K SE +/- 612.15, N = 15 SE +/- 733.25, N = 15 SE +/- 630.61, N = 15 41555.1 55489.8 45998.8
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception V4 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 200K 400K 600K 800K 1000K SE +/- 7233.85, N = 5 SE +/- 7218.89, N = 3 SE +/- 2002.95, N = 3 687019 892083 716517
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 8 16 24 32 40 SE +/- 0.29, N = 15 SE +/- 0.37, N = 15 SE +/- 0.49, N = 15 28.75 35.46 32.65 MIN: 15.46 MIN: 14.26 MIN: 9.3 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Nebular Empirical Analysis Tool OpenBenchmarking.org Seconds, Fewer Is Better Nebular Empirical Analysis Tool 2.3 Ubuntu 21.04 Ubuntu 20.10 13 26 39 52 65 SE +/- 0.19, N = 3 SE +/- 0.44, N = 3 45.57 55.92 1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40 80 120 160 200 SE +/- 1.20, N = 3 SE +/- 1.04, N = 3 SE +/- 0.44, N = 3 193 161 193 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenVKL Benchmark: vklBenchmarkUnstructuredVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkUnstructuredVolume Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 400K 800K 1200K 1600K 2000K SE +/- 7183.20, N = 3 SE +/- 5996.30, N = 3 SE +/- 17974.50, N = 3 1776457 1493294 1737714 MIN: 24424 / MAX: 5906521 MIN: 18361 / MAX: 5191710 MIN: 21419 / MAX: 5849413
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 110 220 330 440 550 SE +/- 6.29, N = 12 SE +/- 4.21, N = 3 SE +/- 3.18, N = 3 500 435 485 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Timed LLVM Compilation Build System: Unix Makefiles OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Unix Makefiles Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50 100 150 200 250 SE +/- 1.40, N = 3 SE +/- 0.98, N = 3 SE +/- 2.23, N = 3 198.54 226.85 216.23
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.232 0.464 0.696 0.928 1.16 SE +/- 0.005014, N = 3 SE +/- 0.004578, N = 3 SE +/- 0.005605, N = 3 0.924756 1.031020 0.903312 MIN: 0.86 MIN: 0.84 MIN: 0.84 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.0556 0.1112 0.1668 0.2224 0.278 SE +/- 0.002322, N = 6 SE +/- 0.001954, N = 10 SE +/- 0.001876, N = 8 0.231483 0.247260 0.218363 MIN: 0.2 MIN: 0.2 MIN: 0.19 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.3554 0.7108 1.0662 1.4216 1.777 SE +/- 0.00589, N = 3 SE +/- 0.01111, N = 3 SE +/- 0.00638, N = 3 1.41084 1.57954 1.39626 MIN: 1.27 MIN: 1.24 MIN: 1.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.3475 0.695 1.0425 1.39 1.7375 SE +/- 0.01131, N = 3 SE +/- 0.02039, N = 3 SE +/- 0.01216, N = 3 1.37903 1.54442 1.37432 MIN: 1.33 MIN: 1.31 MIN: 1.32 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2K 4K 6K 8K 10K SE +/- 106.53, N = 3 SE +/- 35.85, N = 3 SE +/- 116.22, N = 3 8217 7470 8389 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 10 20 30 40 50 SE +/- 0.54, N = 15 SE +/- 0.27, N = 3 SE +/- 0.61, N = 15 46.0 41.0 44.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Ninja Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30 60 90 120 150 SE +/- 0.36, N = 3 SE +/- 1.55, N = 3 SE +/- 0.38, N = 3 129.15 144.79 138.52
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 16 32 48 64 80 SE +/- 0.48, N = 3 SE +/- 0.57, N = 3 SE +/- 0.36, N = 3 70.06 74.32 66.33
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.3157 0.6314 0.9471 1.2628 1.5785 SE +/- 0.00905, N = 15 SE +/- 0.01231, N = 3 SE +/- 0.00863, N = 3 1.25869 1.40323 1.30999 MIN: 0.86 MIN: 0.82 MIN: 0.92 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 60 120 180 240 300 SE +/- 2.49, N = 3 SE +/- 2.36, N = 6 SE +/- 1.73, N = 3 260.88 237.32 234.92 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2M 4M 6M 8M 10M SE +/- 137751.39, N = 3 SE +/- 118341.78, N = 3 SE +/- 7549.83, N = 3 10077667 9267667 10277000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
OpenVKL Benchmark: vklBenchmark OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmark Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 140 280 420 560 700 SE +/- 2.40, N = 3 SE +/- 7.42, N = 3 SE +/- 5.81, N = 3 669 604 661 MIN: 1 / MAX: 2821 MIN: 1 / MAX: 2606 MIN: 1 / MAX: 2845
Swet Average OpenBenchmarking.org Operations Per Second, More Is Better Swet 1.5.16 Average Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 140M 280M 420M 560M 700M SE +/- 1889606.21, N = 3 SE +/- 7650995.20, N = 3 SE +/- 7221883.45, N = 15 639155887 627367473 579234165 1. (CC) gcc options: -lm -lpthread -lcurses -lrt
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50 100 150 200 250 SE +/- 2.00, N = 15 SE +/- 2.23, N = 3 SE +/- 2.60, N = 3 210.15 201.11 190.48 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.7447 1.4894 2.2341 2.9788 3.7235 SE +/- 0.02675, N = 12 SE +/- 0.03870, N = 3 SE +/- 0.03357, N = 3 3.03516 3.30961 3.01186 MIN: 2.86 MIN: 2.85 MIN: 2.84 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 500 1000 1500 2000 2500 SE +/- 24.82, N = 3 SE +/- 25.12, N = 4 SE +/- 33.66, N = 15 2221.97 2253.80 2440.72 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Timed MPlayer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed MPlayer Compilation 1.4 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 10.11 11.10 10.62
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 7 14 21 28 35 SE +/- 0.34, N = 3 SE +/- 0.37, N = 15 SE +/- 0.34, N = 15 29.81 27.16 28.14 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 160 320 480 640 800 SE +/- 5.64, N = 15 SE +/- 6.32, N = 15 SE +/- 1.50, N = 3 684.27 730.83 667.65 MIN: 645.4 MIN: 671.61 MIN: 642.94 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.4.2 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 10 20 30 40 50 SE +/- 0.52, N = 3 SE +/- 0.37, N = 9 SE +/- 0.02, N = 3 40.21 44.01 40.72
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 70 140 210 280 350 SE +/- 1.18, N = 3 SE +/- 1.73, N = 3 SE +/- 3.10, N = 3 291.99 299.20 319.49 -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.1484 0.2968 0.4452 0.5936 0.742 SE +/- 0.004554, N = 3 SE +/- 0.004580, N = 3 SE +/- 0.002882, N = 3 0.615331 0.659638 0.602936 MIN: 0.57 MIN: 0.56 MIN: 0.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.087 0.174 0.261 0.348 0.435 SE +/- 0.004351, N = 3 SE +/- 0.003521, N = 3 SE +/- 0.003468, N = 3 0.362841 0.386794 0.353757 MIN: 0.32 MIN: 0.31 MIN: 0.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 160 320 480 640 800 SE +/- 1.73, N = 3 SE +/- 5.22, N = 15 SE +/- 5.68, N = 8 672.65 732.24 671.56 MIN: 646.15 MIN: 674.53 MIN: 639.68 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40 80 120 160 200 SE +/- 1.72, N = 3 SE +/- 1.13, N = 3 SE +/- 1.68, N = 3 172.02 161.78 157.78 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 100 200 300 400 500 SE +/- 0.96, N = 3 SE +/- 1.27, N = 3 SE +/- 1.72, N = 3 437.97 472.25 433.43 MIN: 422.42 MIN: 442.59 MIN: 414.41 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.106 0.212 0.318 0.424 0.53 SE +/- 0.004513, N = 6 SE +/- 0.003416, N = 15 SE +/- 0.003621, N = 8 0.445171 0.471052 0.432599 MIN: 0.4 MIN: 0.39 MIN: 0.39 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.4414 0.8828 1.3242 1.7656 2.207 SE +/- 0.01169, N = 14 SE +/- 0.01622, N = 9 SE +/- 0.00660, N = 3 1.82051 1.96166 1.80328 MIN: 1.67 MIN: 1.66 MIN: 1.67 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.0609 0.1218 0.1827 0.2436 0.3045 SE +/- 0.002948, N = 3 SE +/- 0.003157, N = 4 SE +/- 0.003241, N = 3 0.256555 0.270802 0.249170 MIN: 0.23 MIN: 0.23 MIN: 0.22 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 100 200 300 400 500 SE +/- 0.63, N = 3 SE +/- 2.30, N = 3 SE +/- 1.48, N = 3 437.87 468.05 431.30 MIN: 422.67 MIN: 438.61 MIN: 414.55 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.0466 0.0932 0.1398 0.1864 0.233 SE +/- 0.001836, N = 13 SE +/- 0.001494, N = 14 SE +/- 0.001679, N = 14 0.193198 0.206919 0.190921 MIN: 0.18 MIN: 0.18 MIN: 0.18 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 11 22 33 44 55 SE +/- 0.33, N = 3 SE +/- 0.39, N = 15 SE +/- 0.39, N = 15 47.2 45.1 43.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.2293 0.4586 0.6879 0.9172 1.1465 SE +/- 0.010679, N = 4 SE +/- 0.009311, N = 7 SE +/- 0.011491, N = 3 0.945450 1.018895 0.942653 MIN: 0.86 MIN: 0.84 MIN: 0.85 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
DaCapo Benchmark Java Test: Tradebeans OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4K 8K 12K 16K 20K SE +/- 82.67, N = 4 SE +/- 203.85, N = 4 SE +/- 203.38, N = 4 16729 17609 18060
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.5108 1.0216 1.5324 2.0432 2.554 SE +/- 0.01823, N = 3 SE +/- 0.02930, N = 15 SE +/- 0.01729, N = 3 2.10597 2.27008 2.10694 MIN: 2.03 MIN: 2.03 MIN: 2.03 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.7886 1.5772 2.3658 3.1544 3.943 SE +/- 0.00929, N = 3 SE +/- 0.02670, N = 3 SE +/- 0.00414, N = 3 3.27051 3.50505 3.47588 MIN: 3.09 MIN: 3.07 MIN: 3.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40 80 120 160 200 SE +/- 2.03, N = 3 SE +/- 0.47, N = 3 SE +/- 1.88, N = 4 170.47 164.77 159.10 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.10.20 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6 12 18 24 30 SE +/- 0.35, N = 13 SE +/- 0.39, N = 14 SE +/- 0.36, N = 12 24.81 26.53 25.96
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.2031 0.4062 0.6093 0.8124 1.0155 SE +/- 0.008374, N = 6 SE +/- 0.008637, N = 6 SE +/- 0.008602, N = 6 0.846330 0.902751 0.850400 MIN: 0.8 MIN: 0.8 MIN: 0.8 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
toyBrot Fractal Generator Implementation: OpenMP OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2K 4K 6K 8K 10K SE +/- 105.33, N = 3 SE +/- 108.03, N = 3 SE +/- 81.17, N = 4 7438 7806 7328 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 15.11 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.97, N = 3 SE +/- 1.42, N = 3 SE +/- 0.48, N = 3 102.14 108.60 104.25
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30K 60K 90K 120K 150K SE +/- 160.48, N = 3 SE +/- 877.45, N = 3 SE +/- 277.78, N = 3 118366 111351 116803 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
PJSIP Method: OPTIONS, Stateful OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateful Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 800 1600 2400 3200 4000 SE +/- 6.89, N = 3 SE +/- 7.67, N = 3 SE +/- 13.59, N = 3 3815 3716 3597 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
DaCapo Benchmark Java Test: H2 OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: H2 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2K 4K 6K 8K 10K SE +/- 133.93, N = 4 SE +/- 129.08, N = 4 SE +/- 115.71, N = 20 10740 11377 11235
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.13, N = 15 SE +/- 0.12, N = 15 10.79 10.37 10.94 1. Ubuntu 21.04: Nodejs
v12.21.0 2. Ubuntu 20.10: Nodejs
v12.18.2 3. Ubuntu 20.04.2 LTS: Nodejs
v10.19.0
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 35.19 37.07 35.99
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 400M 800M 1200M 1600M 2000M SE +/- 768837.51, N = 3 SE +/- 11623730.52, N = 3 SE +/- 5634516.64, N = 3 1631166667 1569933333 1653866667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9 18 27 36 45 SE +/- 0.47, N = 3 SE +/- 0.05, N = 3 SE +/- 0.38, N = 3 36.76 38.68 38.52 1. (CXX) g++ options: -O3 -fPIC -lm
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40 80 120 160 200 SE +/- 0.83, N = 3 SE +/- 0.41, N = 3 SE +/- 0.79, N = 3 170.02 171.39 178.77 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.064 0.128 0.192 0.256 0.32 SE +/- 0.00049, N = 3 SE +/- 0.00359, N = 3 SE +/- 0.00044, N = 3 0.27128 0.28452 0.27083
srsLTE Test: PHY_DL_Test OpenBenchmarking.org UE Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.30, N = 3 SE +/- 0.90, N = 3 SE +/- 0.13, N = 3 86.6 84.4 88.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 11.33 11.88 11.71 -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.84, N = 10 SE +/- 0.47, N = 3 SE +/- 0.91, N = 3 106.69 101.86 106.14 MIN: 85.33 / MAX: 112.78 MIN: 80.69 / MAX: 111.15 MIN: 84.21 / MAX: 111.87
Tungsten Renderer Scene: Hair OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2 4 6 8 10 SE +/- 0.07388, N = 3 SE +/- 0.07152, N = 15 SE +/- 0.08029, N = 15 6.52538 6.75881 6.45646 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 70 140 210 280 350 SE +/- 3.33, N = 3 SE +/- 1.00, N = 3 SE +/- 3.27, N = 3 300.2 297.4 287.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 8 16 24 32 40 SE +/- 0.27, N = 3 SE +/- 0.06, N = 3 SE +/- 0.19, N = 3 31.27 31.13 32.48 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Timed Erlang/OTP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Erlang/OTP Compilation 23.2 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40 80 120 160 200 SE +/- 0.85, N = 3 SE +/- 1.20, N = 3 SE +/- 1.32, N = 3 180.49 183.28 175.90
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40M 80M 120M 160M 200M SE +/- 2129761.70, N = 15 SE +/- 1600302.62, N = 3 SE +/- 1908422.73, N = 4 180493580 177624409 173250319 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.9113 1.8226 2.7339 3.6452 4.5565 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 4.05 3.89 3.89 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40 80 120 160 200 SE +/- 2.00, N = 15 SE +/- 1.24, N = 3 SE +/- 1.73, N = 3 204.86 198.24 196.97 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 500K 1000K 1500K 2000K 2500K SE +/- 2872.35, N = 3 SE +/- 3242.58, N = 3 SE +/- 24473.66, N = 5 2347453.57 2316974.82 2261054.03 1. (CC) gcc options: -O2 -lrt" -lrt
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6 12 18 24 30 SE +/- 0.28, N = 3 SE +/- 0.15, N = 3 SE +/- 0.23, N = 5 23.59 23.69 22.82 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
BlogBench Test: Write OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Write Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 14K 28K 42K 56K 70K SE +/- 717.36, N = 3 SE +/- 419.01, N = 3 SE +/- 770.50, N = 3 60768 62517 63062 1. (CC) gcc options: -O2 -pthread
WireGuard + Linux Networking Stack Stress Test OpenBenchmarking.org Seconds, Fewer Is Better WireGuard + Linux Networking Stack Stress Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 150 300 450 600 750 SE +/- 8.55, N = 3 SE +/- 7.37, N = 3 SE +/- 7.42, N = 3 681.12 657.47 674.97
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.13, N = 3 SE +/- 0.33, N = 3 SE +/- 1.05, N = 3 75.95 76.88 74.31
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 60K 120K 180K 240K 300K SE +/- 2421.06, N = 15 SE +/- 3496.83, N = 15 SE +/- 2324.26, N = 13 277683 268467 270519 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 7 14 21 28 35 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 SE +/- 0.33, N = 3 29.57 28.63 29.14 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 0.061 0.122 0.183 0.244 0.305 SE +/- 0.003, N = 15 SE +/- 0.004, N = 3 SE +/- 0.003, N = 4 0.266 0.271 0.263 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.58, N = 3 SE +/- 0.89, N = 5 SE +/- 0.68, N = 9 81.29 80.22 82.61 MIN: 67.06 / MAX: 91.79 MIN: 64.42 / MAX: 92.94 MIN: 67 / MAX: 93.51
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 200K 400K 600K 800K 1000K SE +/- 10755.01, N = 15 SE +/- 12775.84, N = 3 SE +/- 10643.51, N = 4 946634 927066 954214 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.81, N = 3 SE +/- 0.92, N = 4 SE +/- 0.68, N = 3 82.3 81.7 84.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
NWChem Input: C240 Buckyball OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 400 800 1200 1600 2000 1875.8 1873.7 1825.0 -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lz -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lz 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lm -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 12.92 12.66 12.58 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Fishy Cat - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 11 22 33 44 55 SE +/- 0.15, N = 3 SE +/- 0.44, N = 3 SE +/- 0.17, N = 3 45.72 46.91 46.11
Chia Blockchain VDF Test: Square Assembly Optimized OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.1 Test: Square Assembly Optimized Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30K 60K 90K 120K 150K SE +/- 1568.63, N = 5 SE +/- 900.00, N = 3 SE +/- 680.69, N = 3 147240 147100 143600 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 200M 400M 600M 800M 1000M SE +/- 702242.44, N = 3 SE +/- 2859827.11, N = 3 SE +/- 5004752.19, N = 3 826756667 812776667 832983333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
srsLTE Test: PHY_DL_Test OpenBenchmarking.org eNb Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50 100 150 200 250 SE +/- 0.84, N = 3 SE +/- 1.35, N = 3 SE +/- 0.18, N = 3 210.9 207.6 212.7 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 1.06, N = 3 SE +/- 5.79, N = 15 SE +/- 5.43, N = 15 2719.4 2736.3 2782.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
SecureMark Benchmark: SecureMark-TLS OpenBenchmarking.org marks, More Is Better SecureMark 1.0.4 Benchmark: SecureMark-TLS Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50K 100K 150K 200K 250K SE +/- 178.37, N = 3 SE +/- 530.29, N = 3 SE +/- 124.15, N = 3 230272 226588 225195 1. (CC) gcc options: -pedantic -O3
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9 18 27 36 45 SE +/- 0.15, N = 3 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 40.41 39.84 39.52 1. (CXX) g++ options: -O2 -lOpenCL
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 SE +/- 0.04, N = 3 35.73 35.51 36.30 1. (CXX) g++ options: -O3 -pthread -lm
PJSIP Method: OPTIONS, Stateless OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateless Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9K 18K 27K 36K 45K SE +/- 533.56, N = 3 SE +/- 217.12, N = 3 SE +/- 352.27, N = 3 40082 40166 40954 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6 12 18 24 30 SE +/- 0.11, N = 3 SE +/- 0.02, N = 3 SE +/- 0.24, N = 3 22.78 22.74 23.16 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2K 4K 6K 8K 10K SE +/- 77.75, N = 6 SE +/- 78.83, N = 6 SE +/- 99.62, N = 3 8015 7963 7871 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.058, N = 3 SE +/- 0.014, N = 3 SE +/- 0.033, N = 3 9.004 8.851 8.884 1. (CXX) g++ options: -O3 -pthread
srsRAN Test: PHY_DL_Test OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50 100 150 200 250 SE +/- 0.78, N = 3 SE +/- 0.64, N = 3 SE +/- 0.53, N = 3 206.8 205.4 203.3 -lconfig++ -lsctp 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Tungsten Renderer Scene: Non-Exponential OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 1.1814 2.3628 3.5442 4.7256 5.907 SE +/- 0.07466, N = 3 SE +/- 0.04628, N = 3 SE +/- 0.02249, N = 3 5.23912 5.25085 5.16631 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.17, N = 3 SE +/- 0.12, N = 3 16.97 17.02 16.75 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6 12 18 24 30 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 25.42 25.26 25.66 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 160K 320K 480K 640K 800K SE +/- 3328.66, N = 3 SE +/- 4029.73, N = 3 SE +/- 5472.66, N = 3 715587 717063 726623
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 3.01, N = 15 SE +/- 9.81, N = 3 SE +/- 6.16, N = 8 2615.9 2607.2 2576.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 700 1400 2100 2800 3500 SE +/- 8.34, N = 3 SE +/- 15.05, N = 3 SE +/- 7.83, N = 3 3283.9 3235.7 3251.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 1.03, N = 15 SE +/- 0.46, N = 3 SE +/- 0.70, N = 8 82.2 81.8 83.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 13.38 13.20 13.33 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OSPray Demo: Magnetic Reconnection - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: SciVis Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 1.01, N = 15 SE +/- 0.00, N = 3 SE +/- 0.93, N = 12 109.63 111.11 110.19 MIN: 20.83 / MAX: 125 MIN: 23.26 MIN: 21.74 / MAX: 111.11
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Crown Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 15 30 45 60 75 SE +/- 0.30, N = 3 SE +/- 0.47, N = 3 SE +/- 0.70, N = 3 67.04 66.16 66.89 MIN: 59.72 / MAX: 87.99 MIN: 57.81 / MAX: 87.38 MIN: 59.4 / MAX: 88.24
BlogBench Test: Read OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Read Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 500K 1000K 1500K 2000K 2500K SE +/- 30037.66, N = 3 SE +/- 19991.91, N = 3 SE +/- 21799.22, N = 9 2253810 2245366 2224145 1. (CC) gcc options: -O2 -pthread
PyBench Total For Average Test Times OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 200 400 600 800 1000 SE +/- 1.86, N = 3 SE +/- 2.67, N = 3 SE +/- 0.88, N = 3 995 984 982
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 9.20, N = 2 SE +/- 7.67, N = 5 2985.6 2969.1 2947.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 11 22 33 44 55 SE +/- 0.45, N = 3 SE +/- 0.20, N = 3 SE +/- 0.46, N = 3 49.20 48.57 49.09 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Helsing Digit Range: 14 digit OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 14 digit Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.73, N = 15 SE +/- 0.21, N = 3 SE +/- 0.11, N = 3 82.47 83.53 82.64 1. (CC) gcc options: -O2 -pthread
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.57, N = 3 SE +/- 0.02, N = 3 SE +/- 0.14, N = 3 105.65 104.85 104.32 1. (CXX) g++ options: -O2 -lOpenCL
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Barbershop - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.26, N = 3 SE +/- 0.41, N = 3 SE +/- 0.27, N = 3 108.31 109.65 109.07
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Pabellon Barcelona - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.24, N = 3 SE +/- 0.26, N = 3 SE +/- 0.21, N = 3 88.36 89.25 88.16
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.097, N = 5 SE +/- 0.063, N = 3 SE +/- 0.084, N = 3 9.476 9.365 9.451 -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lImath -lHalf -lIex -lIexMath -lpthread 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSM -lICE -lX11 -lIlmImf -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Crown Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 15 30 45 60 75 SE +/- 0.70, N = 3 SE +/- 0.77, N = 3 SE +/- 0.74, N = 3 64.30 64.42 65.02 MIN: 55.83 / MAX: 81.77 MIN: 56.4 / MAX: 82.25 MIN: 57.31 / MAX: 81.55
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 12.48, N = 3 SE +/- 15.59, N = 4 SE +/- 9.85, N = 3 2727.6 2713.8 2740.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 19.39 19.20 19.31 1. (CXX) g++ options: -O3 -lpthread -lrt
srsLTE Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsLTE 20.10.1 Test: OFDM_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30M 60M 90M 120M 150M SE +/- 650640.71, N = 3 SE +/- 260341.66, N = 3 SE +/- 176383.42, N = 3 121500000 120333333 120733333 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40K 80K 120K 160K 200K SE +/- 419.66, N = 3 SE +/- 579.11, N = 3 SE +/- 1295.33, N = 3 187807.08 188751.17 189606.14 -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Ubuntu 21.04: Open MPI 4.1.0 3. Ubuntu 20.10: Open MPI 4.0.3 4. Ubuntu 20.04.2 LTS: Open MPI 4.0.3
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 7.36 7.34 7.41 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4 8 12 16 20 SE +/- 0.19, N = 4 SE +/- 0.11, N = 3 SE +/- 0.14, N = 15 15.38 15.36 15.50 1. (CXX) g++ options: -O3 -fPIC -lm
High Performance Conjugate Gradient OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.15, N = 3 SE +/- 0.19, N = 3 39.82 39.63 39.97 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 5.02, N = 15 SE +/- 3.89, N = 3 SE +/- 5.48, N = 15 2708.5 2685.6 2695.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 7 14 21 28 35 SE +/- 0.34, N = 3 SE +/- 0.29, N = 5 SE +/- 0.25, N = 3 28.33 28.26 28.50 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Classroom - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 72.28 72.53 71.92
Liquid-DSP Threads: 160 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 160 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 700M 1400M 2100M 2800M 3500M SE +/- 15087448.79, N = 3 SE +/- 10305392.33, N = 3 SE +/- 11545032.60, N = 3 3080633333 3086966667 3106666667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 700M 1400M 2100M 2800M 3500M SE +/- 9583724.63, N = 3 SE +/- 13475203.56, N = 3 SE +/- 8772368.23, N = 3 3290766667 3292133333 3316633333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 1500 3000 4500 6000 7500 SE +/- 85.39, N = 4 SE +/- 61.67, N = 3 SE +/- 77.87, N = 4 7118 7165 7173 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
OpenVKL Benchmark: vklBenchmarkVdbVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 5M 10M 15M 20M 25M SE +/- 58341.71, N = 3 SE +/- 181382.68, N = 3 SE +/- 187763.97, N = 3 21915837 22032745 22077090 MIN: 1062215 / MAX: 153704880 MIN: 1023660 / MAX: 161438904 MIN: 1030467 / MAX: 169859376
srsRAN Test: PHY_DL_Test OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.78, N = 3 SE +/- 0.23, N = 3 SE +/- 0.37, N = 3 85.1 85.3 85.7 -lconfig++ -lsctp 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
OSPray Demo: XFrog Forest - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: Path Tracer Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 10.38 10.31 10.38 MIN: 7.58 / MAX: 10.75 MIN: 7.46 / MAX: 10.75 MIN: 9.17 / MAX: 10.53
Sysbench Test: RAM / Memory OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3K 6K 9K 12K 15K SE +/- 145.91, N = 15 SE +/- 151.11, N = 12 SE +/- 138.52, N = 15 12176.80 12184.78 12252.73 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 15.05 15.01 14.96 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.46, N = 3 SE +/- 0.07, N = 3 SE +/- 0.16, N = 3 105.35 104.92 104.77 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.04 Test: OFDM_Test Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30M 60M 90M 120M 150M SE +/- 284800.12, N = 3 SE +/- 305505.05, N = 3 SE +/- 152752.52, N = 3 120766667 121300000 121200000 -lconfig++ -lsctp 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 80 160 240 320 400 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 SE +/- 0.53, N = 3 366.73 366.20 367.80 MIN: 365.95 / MAX: 373.2 MIN: 366.01 / MAX: 367.51 MIN: 367 / MAX: 370.11 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 6.90 6.93 6.93 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Chia Blockchain VDF Test: Square Plain C++ OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.1 Test: Square Plain C++ Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30K 60K 90K 120K 150K SE +/- 88.19, N = 3 SE +/- 484.19, N = 3 SE +/- 585.95, N = 3 138967 139033 139400 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 29.61 29.56 29.54
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 Ubuntu 21.04 Ubuntu 20.10 50K 100K 150K 200K 250K SE +/- 66.66, N = 3 SE +/- 149.68, N = 3 211170.46 211559.00 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 50K 100K 150K 200K 250K SE +/- 275.74, N = 3 SE +/- 253.78, N = 3 SE +/- 252.80, N = 3 213870.87 213970.43 214228.62 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
RELION Test: Basic - Device: CPU OpenBenchmarking.org Seconds, Fewer Is Better RELION 3.1.1 Test: Basic - Device: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 80 160 240 320 400 SE +/- 1.61, N = 3 SE +/- 0.44, N = 3 SE +/- 0.74, N = 3 348.84 348.75 348.60 1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Decompression Speed Ubuntu 20.10 Ubuntu 20.04.2 LTS 700 1400 2100 2800 3500 SE +/- 31.17, N = 12 SE +/- 14.00, N = 14 3131.9 3131.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
OSPray Demo: NASA Streamlines - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: Path Tracer Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 7 14 21 28 35 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 27.78 27.78 27.78 MIN: 17.24 / MAX: 29.41 MIN: 16.67 / MAX: 29.41 MIN: 16.67 / MAX: 29.41
OSPray Demo: NASA Streamlines - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: SciVis Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30 60 90 120 150 125 125 125 MIN: 31.25 / MAX: 142.86 MIN: 28.57 / MAX: 142.86 MIN: 33.33 / MAX: 142.86
OSPray Demo: San Miguel - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: Path Tracer Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 10.42 10.42 10.42 MIN: 7.87 / MAX: 10.64 MIN: 7.25 / MAX: 10.53 MIN: 7.94 / MAX: 10.53
OSPray Demo: XFrog Forest - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: SciVis Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 18.87 18.87 18.87 MIN: 13.33 / MAX: 19.23 MIN: 12.05 / MAX: 19.23 MIN: 12.82 / MAX: 19.23
OSPray Demo: San Miguel - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: SciVis Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 90.91 90.91 90.91 MIN: 55.56 / MAX: 100 MIN: 43.48 / MAX: 100 MIN: 55.56 / MAX: 100
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 60 120 180 240 300 SE +/- 1.99, N = 3 SE +/- 8.38, N = 12 SE +/- 4.39, N = 14 266.9 265.2 259.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 700 1400 2100 2800 3500 SE +/- 2.77, N = 3 SE +/- 6.44, N = 13 SE +/- 65.22, N = 15 3073.3 3022.1 2954.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 500 1000 1500 2000 2500 SE +/- 24.65, N = 3 SE +/- 42.14, N = 15 SE +/- 25.03, N = 15 2096.7 2004.3 2128.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 1500 3000 4500 6000 7500 SE +/- 204.88, N = 12 SE +/- 158.31, N = 12 SE +/- 227.25, N = 12 6967 5673 6677 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 3.11.4 Test: Writes Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20K 40K 60K 80K 100K SE +/- 2263.61, N = 15 SE +/- 1429.17, N = 3 SE +/- 1368.32, N = 15 106863 104506 103051
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 130 260 390 520 650 SE +/- 9.40, N = 15 SE +/- 12.21, N = 15 SE +/- 0.44, N = 3 443.55 606.05 374.20 MIN: 367.81 / MAX: 808.75 MIN: 372.96 / MAX: 843.9 MIN: 371.48 / MAX: 419.18 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 160K 320K 480K 640K 800K SE +/- 790.57, N = 3 SE +/- 13732.96, N = 12 SE +/- 5213.69, N = 3 569303 725870 576988
TensorFlow Lite Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: NASNet Mobile Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 30K 60K 90K 120K 150K SE +/- 768.19, N = 15 SE +/- 5683.05, N = 15 SE +/- 1128.79, N = 4 81319.3 157434.0 90927.9
TensorFlow Lite Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: SqueezeNet Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 14K 28K 42K 56K 70K SE +/- 35.00, N = 3 SE +/- 1725.85, N = 12 SE +/- 510.23, N = 15 47808.3 66836.4 48816.5
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 110K 220K 330K 440K 550K SE +/- 4128.37, N = 3 SE +/- 13586.63, N = 12 SE +/- 18099.02, N = 15 524178.09 355593.18 421171.59 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Triple SHA-256, Onecoin Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 90K 180K 270K 360K 450K SE +/- 1250.96, N = 3 SE +/- 11124.39, N = 12 SE +/- 8447.87, N = 12 422680 402686 405578 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Quad SHA-256, Pyrite Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 70K 140K 210K 280K 350K SE +/- 3914.38, N = 15 SE +/- 9923.95, N = 13 SE +/- 5088.38, N = 15 317291 322628 333611 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6K 12K 18K 24K 30K SE +/- 1184.13, N = 12 SE +/- 1085.72, N = 12 SE +/- 855.76, N = 12 29473 26183 26198 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 300K 600K 900K 1200K 1500K SE +/- 21571.08, N = 15 SE +/- 14563.41, N = 15 SE +/- 29885.49, N = 15 1395127 1360453 1414359 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6K 12K 18K 24K 30K SE +/- 326.37, N = 15 SE +/- 615.22, N = 12 SE +/- 453.46, N = 13 29191 28568 28282 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 600 1200 1800 2400 3000 SE +/- 50.25, N = 12 SE +/- 5.62, N = 3 SE +/- 22.13, N = 15 2735.74 2731.13 2650.50 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 160 320 480 640 800 SE +/- 0.16, N = 3 SE +/- 19.97, N = 12 SE +/- 4.59, N = 15 671.54 754.98 678.04 MIN: 647.67 MIN: 673.86 MIN: 638.98 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 110 220 330 440 550 SE +/- 0.53, N = 3 SE +/- 33.83, N = 15 SE +/- 1.79, N = 3 439.68 506.97 433.74 MIN: 424.75 MIN: 435.95 MIN: 416.67 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 2.26, N = 15 SE +/- 2.54, N = 12 SE +/- 1.59, N = 15 86.28 83.27 76.82 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
Tungsten Renderer Scene: Volumetric Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 4 8 12 16 20 SE +/- 0.44, N = 15 SE +/- 0.44, N = 15 SE +/- 0.39, N = 15 13.45 13.76 13.87 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 3 6 9 12 15 SE +/- 0.146, N = 15 SE +/- 0.088, N = 7 SE +/- 0.118, N = 12 8.522 9.451 8.967 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 1.3174 2.6348 3.9522 5.2696 6.587 SE +/- 0.074, N = 15 SE +/- 0.084, N = 15 SE +/- 0.099, N = 15 5.050 5.855 5.361 1. (CXX) g++ options: -O3 -fPIC -lm
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 40M 80M 120M 160M 200M SE +/- 1805964.50, N = 3 SE +/- 748824.48, N = 3 SE +/- 3374554.91, N = 12 172711481 167218420 171613950
Intel Open Image Denoise Scene: Memorial OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.2.0 Scene: Memorial Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 13 26 39 52 65 SE +/- 2.57, N = 12 SE +/- 2.62, N = 12 SE +/- 2.40, N = 12 57.59 50.81 49.87
DaCapo Benchmark Java Test: Jython OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Jython Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 1200 2400 3600 4800 6000 SE +/- 153.92, N = 20 SE +/- 174.19, N = 20 SE +/- 194.90, N = 20 5320 5614 5518
Java Gradle Build Gradle Build: Reactor OpenBenchmarking.org Seconds, Fewer Is Better Java Gradle Build Gradle Build: Reactor Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 90 180 270 360 450 SE +/- 5.06, N = 9 SE +/- 11.03, N = 9 SE +/- 10.71, N = 9 366.36 401.78 382.99
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.26, N = 15 SE +/- 0.64, N = 12 23.87 21.65 22.08 1. (CXX) g++ options: -O3 -pthread -lm
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 2 4 6 8 10 SE +/- 0.081, N = 3 SE +/- 0.089, N = 3 SE +/- 0.145, N = 15 7.696 8.563 8.012 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 20 40 60 80 100 SE +/- 0.67, N = 3 SE +/- 0.59, N = 3 SE +/- 1.30, N = 15 62.04 85.28 60.93 1. (CXX) g++ options: -O2 -lOpenCL
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C Ubuntu 21.04 Ubuntu 20.10 Ubuntu 20.04.2 LTS 1300 2600 3900 5200 6500 SE +/- 51.84, N = 15 SE +/- 92.04, N = 15 SE +/- 81.23, N = 15 6259.88 5881.04 6011.42 -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Ubuntu 21.04: Open MPI 4.1.0 3. Ubuntu 20.10: Open MPI 4.0.3 4. Ubuntu 20.04.2 LTS: Open MPI 4.0.3
Phoronix Test Suite v10.8.4