ice-lake-ubuntu 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2105170-IB-2105155IB65&rdt .
ice-lake-ubuntu Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Compiler File-System Screen Resolution Ubuntu 21.04 Ubuntu 20.10 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 800GB INTEL SSDPF21Q800GB ASPEED VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 21.04 5.11.0-17-generic (x86_64) GNOME Shell 3.38.4 X Server GCC 10.3.0 ext4 1920x1080 Ubuntu 20.10 5.8.0-53-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 GCC 10.2.0 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - Ubuntu 21.04: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Ubuntu 20.10: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270 Java Details - Ubuntu 21.04: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2) - Ubuntu 20.10: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.10) Python Details - Ubuntu 21.04: Python 3.9.4 - Ubuntu 20.10: Python 3.8.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
ice-lake-ubuntu blogbench: Read blogbench: Write wireguard: hpcg: npb: EP.C npb: EP.D npb: LU.C rodinia: OpenMP LavaMD rodinia: OpenMP HotSpot3D rodinia: OpenMP Leukocyte rodinia: OpenMP CFD Solver rodinia: OpenMP Streamcluster namd: ATPase Simulation - 327,506 Atoms neat: toybrot: TBB toybrot: OpenMP toybrot: C++ Tasks toybrot: C++ Threads mrbayes: Primate Phylogeny Analysis nwchem: C240 Buckyball incompact3d: X3D-benchmarking input.i3d incompact3d: input.i3d 193 Cells Per Direction openfoam: Motorbike 30M openfoam: Motorbike 60M relion: Basic - CPU lammps: 20k Atoms lammps: Rhodopsin Protein libgav1: Chimera 1080p libgav1: Summer Nature 4K libgav1: Summer Nature 1080p chia-vdf: Square Plain C++ chia-vdf: Square Assembly Optimized java-gradle-perf: Reactor dacapobench: H2 dacapobench: Jython dacapobench: Tradebeans compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed srslte: OFDM_Test srslte: PHY_DL_Test srslte: PHY_DL_Test john-the-ripper: Blowfish john-the-ripper: MD5 node-express-loadtest: ospray: San Miguel - SciVis ospray: XFrog Forest - SciVis ospray: San Miguel - Path Tracer ospray: NASA Streamlines - SciVis ospray: XFrog Forest - Path Tracer ospray: Magnetic Reconnection - SciVis ospray: NASA Streamlines - Path Tracer ospray: Magnetic Reconnection - Path Tracer aom-av1: Speed 6 Realtime - Bosphorus 4K aom-av1: Speed 6 Two-Pass - Bosphorus 4K aom-av1: Speed 8 Realtime - Bosphorus 4K aom-av1: Speed 9 Realtime - Bosphorus 4K embree: Pathtracer - Crown embree: Pathtracer ISPC - Crown embree: Pathtracer - Asian Dragon embree: Pathtracer ISPC - Asian Dragon kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 1080p - Medium kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Very Fast kvazaar: Bosphorus 1080p - Ultra Fast svt-av1: Preset 4 - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K svt-hevc: 1 - Bosphorus 1080p svt-hevc: 7 - Bosphorus 1080p svt-hevc: 10 - Bosphorus 1080p svt-vp9: VMAF Optimized - Bosphorus 1080p svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p svt-vp9: Visual Quality Optimized - Bosphorus 1080p x265: Bosphorus 4K x265: Bosphorus 1080p oidn: Memorial openvkl: vklBenchmark openvkl: vklBenchmarkVdbVolume openvkl: vklBenchmarkStructuredVolume openvkl: vklBenchmarkUnstructuredVolume coremark: CoreMark Size 666 - Iterations Per Second stockfish: Total Time asmfish: 1024 Hash Memory, 26 Depth swet: Average pjsip: INVITE pjsip: OPTIONS, Stateful pjsip: OPTIONS, Stateless avifenc: 6 avifenc: 10 avifenc: 6, Lossless avifenc: 10, Lossless build-apache: Time To Compile build-godot: Time To Compile build-linux-kernel: Time To Compile build-llvm: Ninja build-llvm: Unix Makefiles build-mplayer: Time To Compile build-nodejs: Time To Compile build-php: Time To Compile build2: Time To Compile povray: Trace Time tungsten: Hair tungsten: Water Caustic tungsten: Non-Exponential tungsten: Volumetric Caustic yafaray: Total Time For Sample Scene onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: IP Shapes 1D - bf16bf16bf16 - CPU onednn: IP Shapes 3D - bf16bf16bf16 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU numpy: build-eigen: Time To Compile build-erlang: Time To Compile build-wasmer: Time To Compile helsing: 14 digit aircrack-ng: cpuminer-opt: Magi cpuminer-opt: x25x cpuminer-opt: Deepcoin cpuminer-opt: Ringcoin cpuminer-opt: Blake-2 S cpuminer-opt: Garlicoin cpuminer-opt: Skeincoin cpuminer-opt: Myriad-Groestl cpuminer-opt: LBC, LBRY Credits cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Triple SHA-256, Onecoin securemark: SecureMark-TLS node-web-tooling: liquid-dsp: 16 - 256 - 57 liquid-dsp: 32 - 256 - 57 liquid-dsp: 64 - 256 - 57 liquid-dsp: 128 - 256 - 57 liquid-dsp: 160 - 256 - 57 keydb: tensorflow-lite: SqueezeNet tensorflow-lite: Inception V4 tensorflow-lite: NASNet Mobile tensorflow-lite: Mobilenet Float tensorflow-lite: Mobilenet Quant tensorflow-lite: Inception ResNet V2 pgbench: 100 - 250 - Read Only pgbench: 100 - 250 - Read Only - Average Latency pgbench: 100 - 250 - Read Write pgbench: 100 - 250 - Read Write - Average Latency wrf: conus 2.5km tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 sysbench: RAM / Memory sysbench: CPU cassandra: Writes blender: BMW27 - CPU-Only blender: Classroom - CPU-Only blender: Fishy Cat - CPU-Only blender: Barbershop - CPU-Only blender: Pabellon Barcelona - CPU-Only onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU pybench: Total For Average Test Times phpbench: PHP Benchmark Suite compress-zstd: 3 - Compression Speed compress-zstd: 3 - Decompression Speed compress-zstd: 8 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed gromacs: MPI CPU - water_GMX50_bare srsran: OFDM_Test srsran: PHY_DL_Test srsran: PHY_DL_Test compress-zstd: 3, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 2253810 60768 681.119 39.8231 6259.88 8659.20 187807.08 40.406 105.650 62.044 4.807 7.696 0.27128 45.569 6999 7438 8015 7118 170.019 1875.8 291.993968 11.3305505 15.05 105.35 348.838 35.725 23.866 34.39 19.39 41.83 138967 147240 366.363 10740 5320 16729 82.3 2727.6 296.3 3193.9 47.2 2719.4 121500000 210.9 86.6 118366 10077667 5554 90.91 18.87 10.42 125 10.38 109.63 27.78 477.78 7.36 4.05 16.97 23.59 64.3015 67.0395 81.2911 106.6858 6.90 25.42 13.38 22.78 49.20 85.41 2.845 29.809 29.57 172.02 260.88 210.15 204.86 170.47 12.92 28.33 57.59 669 21915837 73429487 1776457 2347453.567360 180493580 172711481 639155887 2531 3815 40082 15.383 5.050 36.764 8.522 35.187 75.951 24.813 129.152 198.538 10.109 102.136 40.207 70.057 9.476 6.52538 31.2739 5.23912 13.4481 86.283 0.945450 1.37903 1.25869 0.445171 3.03516 1.82051 1.41084 28.7517 0.846330 0.924756 0.362841 0.193198 672.646 437.866 684.271 2.10597 3.27051 3.60538 439.677 0.256555 671.541 437.972 0.231483 0.615331 332.22 87.461 180.485 44.607 82.465 211170.458 2735.74 2221.97 29191 3487.91 1395127 29473 277683 40753 163627 317291 422680 230272 10.79 826756667 1631166667 3044966667 3290766667 3080633333 524178.09 47808.3 687019 81319.3 41555.1 43074.5 569303 946634 0.266 28712 8.739 9903.05 443.547 366.729 12176.80 213870.87 106863 29.61 72.28 45.72 108.31 88.36 479 500 193 8217 6967 995 715587 6148.4 2985.6 2096.7 3073.3 82.2 2615.9 266.9 300.2 3283.9 46.0 2708.5 9.004 120766667 206.8 85.1 2245366 62517 657.468 39.6266 5881.04 9183.40 188751.17 39.838 104.849 85.284 4.941 8.563 0.28452 55.921 6946 7806 7963 7165 171.388 1873.7 299.195353 11.8795649 15.01 104.92 348.747 35.506 21.646 34.93 19.20 42.30 139033 147100 401.780 11377 5614 17609 81.7 2713.8 300.1 3192.5 45.1 2736.3 120333333 207.6 84.4 111351 9267667 5765 90.91 18.87 10.42 125 10.31 111.11 27.78 466.67 7.34 3.89 17.02 23.69 64.4236 66.1552 80.2191 101.8558 6.93 25.26 13.20 22.74 48.57 86.46 2.796 27.160 28.63 161.78 237.32 201.11 198.24 164.77 12.66 28.26 50.81 604 22032745 80648255 1493294 2316974.822459 177624409 167218420 627367473 1866 3716 40166 15.356 5.855 38.676 9.451 37.074 76.879 26.529 144.787 226.852 11.098 108.602 44.011 74.317 9.365 6.75881 31.1264 5.25085 13.7613 83.265 1.018895 1.54442 1.40323 0.471052 3.30961 1.96166 1.57954 35.4561 0.902751 1.03102 0.386794 0.206919 732.235 468.050 730.825 2.27008 3.50505 11.68014 506.972 0.270802 754.982 472.245 0.247260 0.659638 387.62 81.774 183.275 68.800 83.525 211559.000 2731.13 2253.80 28568 3954.42 1360453 26183 268467 46086 151978 322628 402686 226588 10.37 812776667 1569933333 2157125000 3292133333 3086966667 355593.18 66836.4 892083 157434 55489.8 58977.6 725870 927066 0.271 20742 12.100 9897.119 606.052 366.201 12184.78 213970.43 104506 29.56 72.53 46.91 109.65 89.25 356 435 161 7470 5673 984 717063 4516.9 2969.1 2004.3 3022.1 81.8 2607.2 265.2 297.4 3235.7 41.0 2685.6 8.851 121300000 205.4 85.3 3131.9 OpenBenchmarking.org
BlogBench Test: Read OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Read Ubuntu 21.04 Ubuntu 20.10 500K 1000K 1500K 2000K 2500K SE +/- 30037.66, N = 3 SE +/- 19991.91, N = 3 2253810 2245366 1. (CC) gcc options: -O2 -pthread
BlogBench Test: Write OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Write Ubuntu 21.04 Ubuntu 20.10 13K 26K 39K 52K 65K SE +/- 717.36, N = 3 SE +/- 419.01, N = 3 60768 62517 1. (CC) gcc options: -O2 -pthread
WireGuard + Linux Networking Stack Stress Test OpenBenchmarking.org Seconds, Fewer Is Better WireGuard + Linux Networking Stack Stress Test Ubuntu 21.04 Ubuntu 20.10 150 300 450 600 750 SE +/- 8.55, N = 3 SE +/- 7.37, N = 3 681.12 657.47
High Performance Conjugate Gradient OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 Ubuntu 21.04 Ubuntu 20.10 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.15, N = 3 39.82 39.63 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C Ubuntu 21.04 Ubuntu 20.10 1300 2600 3900 5200 6500 SE +/- 51.84, N = 15 SE +/- 92.04, N = 15 6259.88 5881.04 -levent_core -levent 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_pthreads -lutil -lm -lrt -lz 2. Ubuntu 21.04: Open MPI 4.1.0 3. Ubuntu 20.10: Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D Ubuntu 21.04 Ubuntu 20.10 2K 4K 6K 8K 10K SE +/- 33.68, N = 3 SE +/- 121.01, N = 3 8659.20 9183.40 -levent_core -levent 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_pthreads -lutil -lm -lrt -lz 2. Ubuntu 21.04: Open MPI 4.1.0 3. Ubuntu 20.10: Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C Ubuntu 21.04 Ubuntu 20.10 40K 80K 120K 160K 200K SE +/- 419.66, N = 3 SE +/- 579.11, N = 3 187807.08 188751.17 -levent_core -levent 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_pthreads -lutil -lm -lrt -lz 2. Ubuntu 21.04: Open MPI 4.1.0 3. Ubuntu 20.10: Open MPI 4.0.3
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Ubuntu 21.04 Ubuntu 20.10 9 18 27 36 45 SE +/- 0.15, N = 3 SE +/- 0.11, N = 3 40.41 39.84 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.57, N = 3 SE +/- 0.02, N = 3 105.65 104.85 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.67, N = 3 SE +/- 0.59, N = 3 62.04 85.28 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver Ubuntu 21.04 Ubuntu 20.10 1.1117 2.2234 3.3351 4.4468 5.5585 SE +/- 0.071, N = 12 SE +/- 0.065, N = 15 4.807 4.941 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Ubuntu 21.04 Ubuntu 20.10 2 4 6 8 10 SE +/- 0.081, N = 3 SE +/- 0.089, N = 3 7.696 8.563 1. (CXX) g++ options: -O2 -lOpenCL
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Ubuntu 21.04 Ubuntu 20.10 0.064 0.128 0.192 0.256 0.32 SE +/- 0.00049, N = 3 SE +/- 0.00359, N = 3 0.27128 0.28452
Nebular Empirical Analysis Tool OpenBenchmarking.org Seconds, Fewer Is Better Nebular Empirical Analysis Tool 2.3 Ubuntu 21.04 Ubuntu 20.10 13 26 39 52 65 SE +/- 0.19, N = 3 SE +/- 0.44, N = 3 45.57 55.92 1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio
toyBrot Fractal Generator Implementation: TBB OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: TBB Ubuntu 21.04 Ubuntu 20.10 1500 3000 4500 6000 7500 SE +/- 91.17, N = 15 SE +/- 68.57, N = 15 6999 6946 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
toyBrot Fractal Generator Implementation: OpenMP OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP Ubuntu 21.04 Ubuntu 20.10 2K 4K 6K 8K 10K SE +/- 105.33, N = 3 SE +/- 108.03, N = 3 7438 7806 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks Ubuntu 21.04 Ubuntu 20.10 2K 4K 6K 8K 10K SE +/- 77.75, N = 6 SE +/- 78.83, N = 6 8015 7963 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads Ubuntu 21.04 Ubuntu 20.10 1500 3000 4500 6000 7500 SE +/- 85.39, N = 4 SE +/- 61.67, N = 3 7118 7165 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Ubuntu 21.04 Ubuntu 20.10 40 80 120 160 200 SE +/- 0.83, N = 3 SE +/- 0.41, N = 3 170.02 171.39 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
NWChem Input: C240 Buckyball OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball Ubuntu 21.04 Ubuntu 20.10 400 800 1200 1600 2000 1875.8 1873.7 -levent_core -levent 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_pthreads -lutil -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d Ubuntu 21.04 Ubuntu 20.10 70 140 210 280 350 SE +/- 1.18, N = 3 SE +/- 1.73, N = 3 291.99 299.20 -levent_core -levent 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_pthreads -lutil -lm -lrt -lz
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 11.33 11.88 -levent_core -levent 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_pthreads -lutil -lm -lrt -lz
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M Ubuntu 21.04 Ubuntu 20.10 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 15.05 15.01 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.46, N = 3 SE +/- 0.07, N = 3 105.35 104.92 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
RELION Test: Basic - Device: CPU OpenBenchmarking.org Seconds, Fewer Is Better RELION 3.1.1 Test: Basic - Device: CPU Ubuntu 21.04 Ubuntu 20.10 80 160 240 320 400 SE +/- 1.61, N = 3 SE +/- 0.44, N = 3 348.84 348.75 1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms Ubuntu 21.04 Ubuntu 20.10 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 35.73 35.51 1. (CXX) g++ options: -O3 -pthread -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein Ubuntu 21.04 Ubuntu 20.10 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.26, N = 15 23.87 21.65 1. (CXX) g++ options: -O3 -pthread -lm
libgav1 Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p Ubuntu 21.04 Ubuntu 20.10 8 16 24 32 40 SE +/- 0.19, N = 3 SE +/- 0.10, N = 3 34.39 34.93 1. (CXX) g++ options: -O3 -lpthread -lrt
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 4K Ubuntu 21.04 Ubuntu 20.10 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.05, N = 3 19.39 19.20 1. (CXX) g++ options: -O3 -lpthread -lrt
libgav1 Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 1080p Ubuntu 21.04 Ubuntu 20.10 10 20 30 40 50 SE +/- 0.43, N = 3 SE +/- 0.04, N = 3 41.83 42.30 1. (CXX) g++ options: -O3 -lpthread -lrt
Chia Blockchain VDF Test: Square Plain C++ OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.1 Test: Square Plain C++ Ubuntu 21.04 Ubuntu 20.10 30K 60K 90K 120K 150K SE +/- 88.19, N = 3 SE +/- 484.19, N = 3 138967 139033 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
Chia Blockchain VDF Test: Square Assembly Optimized OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.1 Test: Square Assembly Optimized Ubuntu 21.04 Ubuntu 20.10 30K 60K 90K 120K 150K SE +/- 1568.63, N = 5 SE +/- 900.00, N = 3 147240 147100 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
Java Gradle Build Gradle Build: Reactor OpenBenchmarking.org Seconds, Fewer Is Better Java Gradle Build Gradle Build: Reactor Ubuntu 21.04 Ubuntu 20.10 90 180 270 360 450 SE +/- 5.06, N = 9 SE +/- 11.03, N = 9 366.36 401.78
DaCapo Benchmark Java Test: H2 OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: H2 Ubuntu 21.04 Ubuntu 20.10 2K 4K 6K 8K 10K SE +/- 133.93, N = 4 SE +/- 129.08, N = 4 10740 11377
DaCapo Benchmark Java Test: Jython OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Jython Ubuntu 21.04 Ubuntu 20.10 1200 2400 3600 4800 6000 SE +/- 153.92, N = 20 SE +/- 174.19, N = 20 5320 5614
DaCapo Benchmark Java Test: Tradebeans OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans Ubuntu 21.04 Ubuntu 20.10 4K 8K 12K 16K 20K SE +/- 82.67, N = 4 SE +/- 203.85, N = 4 16729 17609
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.81, N = 3 SE +/- 0.92, N = 4 82.3 81.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 600 1200 1800 2400 3000 SE +/- 12.48, N = 3 SE +/- 15.59, N = 4 2727.6 2713.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 70 140 210 280 350 SE +/- 3.49, N = 4 SE +/- 8.82, N = 14 296.3 300.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 700 1400 2100 2800 3500 SE +/- 6.74, N = 4 SE +/- 5.83, N = 14 3193.9 3192.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 11 22 33 44 55 SE +/- 0.33, N = 3 SE +/- 0.39, N = 15 47.2 45.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 600 1200 1800 2400 3000 SE +/- 1.06, N = 3 SE +/- 5.79, N = 15 2719.4 2736.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
srsLTE Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsLTE 20.10.1 Test: OFDM_Test Ubuntu 21.04 Ubuntu 20.10 30M 60M 90M 120M 150M SE +/- 650640.71, N = 3 SE +/- 260341.66, N = 3 121500000 120333333 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsLTE Test: PHY_DL_Test OpenBenchmarking.org eNb Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 50 100 150 200 250 SE +/- 0.84, N = 3 SE +/- 1.35, N = 3 210.9 207.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsLTE Test: PHY_DL_Test OpenBenchmarking.org UE Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.30, N = 3 SE +/- 0.90, N = 3 86.6 84.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish Ubuntu 21.04 Ubuntu 20.10 30K 60K 90K 120K 150K SE +/- 160.48, N = 3 SE +/- 877.45, N = 3 118366 111351 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Ubuntu 21.04 Ubuntu 20.10 2M 4M 6M 8M 10M SE +/- 137751.39, N = 3 SE +/- 118341.78, N = 3 10077667 9267667 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Node.js Express HTTP Load Test OpenBenchmarking.org Requests Per Second, More Is Better Node.js Express HTTP Load Test Ubuntu 21.04 Ubuntu 20.10 1200 2400 3600 4800 6000 SE +/- 161.00, N = 15 SE +/- 203.22, N = 12 5554 5765 1. Ubuntu 21.04: Nodejs
v12.21.0 2. Ubuntu 20.10: Nodejs
v12.18.2
OSPray Demo: San Miguel - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: SciVis Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 90.91 90.91 MIN: 55.56 / MAX: 100 MIN: 43.48 / MAX: 100
OSPray Demo: XFrog Forest - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: SciVis Ubuntu 21.04 Ubuntu 20.10 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 18.87 18.87 MIN: 13.33 / MAX: 19.23 MIN: 12.05 / MAX: 19.23
OSPray Demo: San Miguel - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: Path Tracer Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 10.42 10.42 MIN: 7.87 / MAX: 10.64 MIN: 7.25 / MAX: 10.53
OSPray Demo: NASA Streamlines - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: SciVis Ubuntu 21.04 Ubuntu 20.10 30 60 90 120 150 125 125 MIN: 31.25 / MAX: 142.86 MIN: 28.57 / MAX: 142.86
OSPray Demo: XFrog Forest - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: Path Tracer Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 10.38 10.31 MIN: 7.58 / MAX: 10.75 MIN: 7.46 / MAX: 10.75
OSPray Demo: Magnetic Reconnection - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: SciVis Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 1.01, N = 15 SE +/- 0.00, N = 3 109.63 111.11 MIN: 20.83 / MAX: 125 MIN: 23.26
OSPray Demo: NASA Streamlines - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: Path Tracer Ubuntu 21.04 Ubuntu 20.10 7 14 21 28 35 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 27.78 27.78 MIN: 17.24 / MAX: 29.41 MIN: 16.67 / MAX: 29.41
OSPray Demo: Magnetic Reconnection - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: Path Tracer Ubuntu 21.04 Ubuntu 20.10 100 200 300 400 500 SE +/- 15.14, N = 15 SE +/- 17.82, N = 15 477.78 466.67 MIN: 142.86 / MAX: 1000 MIN: 142.86 / MAX: 1000
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 7.36 7.34 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 0.9113 1.8226 2.7339 3.6452 4.5565 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 4.05 3.89 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.17, N = 3 16.97 17.02 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 6 12 18 24 30 SE +/- 0.28, N = 3 SE +/- 0.15, N = 3 23.59 23.69 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Crown Ubuntu 21.04 Ubuntu 20.10 14 28 42 56 70 SE +/- 0.70, N = 3 SE +/- 0.77, N = 3 64.30 64.42 MIN: 55.83 / MAX: 81.77 MIN: 56.4 / MAX: 82.25
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Crown Ubuntu 21.04 Ubuntu 20.10 15 30 45 60 75 SE +/- 0.30, N = 3 SE +/- 0.47, N = 3 67.04 66.16 MIN: 59.72 / MAX: 87.99 MIN: 57.81 / MAX: 87.38
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.58, N = 3 SE +/- 0.89, N = 5 81.29 80.22 MIN: 67.06 / MAX: 91.79 MIN: 64.42 / MAX: 92.94
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.84, N = 10 SE +/- 0.47, N = 3 106.69 101.86 MIN: 85.33 / MAX: 112.78 MIN: 80.69 / MAX: 111.15
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Ubuntu 21.04 Ubuntu 20.10 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 6.90 6.93 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium Ubuntu 21.04 Ubuntu 20.10 6 12 18 24 30 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 25.42 25.26 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 13.38 13.20 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Ubuntu 21.04 Ubuntu 20.10 5 10 15 20 25 SE +/- 0.11, N = 3 SE +/- 0.02, N = 3 22.78 22.74 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast Ubuntu 21.04 Ubuntu 20.10 11 22 33 44 55 SE +/- 0.45, N = 3 SE +/- 0.20, N = 3 49.20 48.57 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.67, N = 3 SE +/- 0.21, N = 3 85.41 86.46 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 0.6401 1.2802 1.9203 2.5604 3.2005 SE +/- 0.037, N = 3 SE +/- 0.002, N = 3 2.845 2.796 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 7 14 21 28 35 SE +/- 0.34, N = 3 SE +/- 0.37, N = 15 29.81 27.16 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 7 14 21 28 35 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 29.57 28.63 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 40 80 120 160 200 SE +/- 1.72, N = 3 SE +/- 1.13, N = 3 172.02 161.78 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 60 120 180 240 300 SE +/- 2.49, N = 3 SE +/- 2.36, N = 6 260.88 237.32 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 50 100 150 200 250 SE +/- 2.00, N = 15 SE +/- 2.23, N = 3 210.15 201.11 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 40 80 120 160 200 SE +/- 2.00, N = 15 SE +/- 1.24, N = 3 204.86 198.24 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 40 80 120 160 200 SE +/- 2.03, N = 3 SE +/- 0.47, N = 3 170.47 164.77 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 12.92 12.66 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p Ubuntu 21.04 Ubuntu 20.10 7 14 21 28 35 SE +/- 0.34, N = 3 SE +/- 0.29, N = 5 28.33 28.26 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Intel Open Image Denoise Scene: Memorial OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.2.0 Scene: Memorial Ubuntu 21.04 Ubuntu 20.10 13 26 39 52 65 SE +/- 2.57, N = 12 SE +/- 2.62, N = 12 57.59 50.81
OpenVKL Benchmark: vklBenchmark OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmark Ubuntu 21.04 Ubuntu 20.10 140 280 420 560 700 SE +/- 2.40, N = 3 SE +/- 7.42, N = 3 669 604 MIN: 1 / MAX: 2821 MIN: 1 / MAX: 2606
OpenVKL Benchmark: vklBenchmarkVdbVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume Ubuntu 21.04 Ubuntu 20.10 5M 10M 15M 20M 25M SE +/- 58341.71, N = 3 SE +/- 181382.68, N = 3 21915837 22032745 MIN: 1062215 / MAX: 153704880 MIN: 1023660 / MAX: 161438904
OpenVKL Benchmark: vklBenchmarkStructuredVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkStructuredVolume Ubuntu 21.04 Ubuntu 20.10 20M 40M 60M 80M 100M SE +/- 65330.82, N = 3 SE +/- 2593649.80, N = 12 73429487 80648255 MIN: 1371963 / MAX: 676014012 MIN: 1000000 / MAX: 1324247328
OpenVKL Benchmark: vklBenchmarkUnstructuredVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkUnstructuredVolume Ubuntu 21.04 Ubuntu 20.10 400K 800K 1200K 1600K 2000K SE +/- 7183.20, N = 3 SE +/- 5996.30, N = 3 1776457 1493294 MIN: 24424 / MAX: 5906521 MIN: 18361 / MAX: 5191710
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second Ubuntu 21.04 Ubuntu 20.10 500K 1000K 1500K 2000K 2500K SE +/- 2872.35, N = 3 SE +/- 3242.58, N = 3 2347453.57 2316974.82 1. (CC) gcc options: -O2 -lrt" -lrt
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time Ubuntu 21.04 Ubuntu 20.10 40M 80M 120M 160M 200M SE +/- 2129761.70, N = 15 SE +/- 1600302.62, N = 3 180493580 177624409 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth Ubuntu 21.04 Ubuntu 20.10 40M 80M 120M 160M 200M SE +/- 1805964.50, N = 3 SE +/- 748824.48, N = 3 172711481 167218420
Swet Average OpenBenchmarking.org Operations Per Second, More Is Better Swet 1.5.16 Average Ubuntu 21.04 Ubuntu 20.10 140M 280M 420M 560M 700M SE +/- 1889606.21, N = 3 SE +/- 7650995.20, N = 3 639155887 627367473 1. (CC) gcc options: -lm -lpthread -lcurses -lrt
PJSIP Method: INVITE OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: INVITE Ubuntu 21.04 Ubuntu 20.10 500 1000 1500 2000 2500 SE +/- 22.31, N = 15 SE +/- 14.40, N = 10 2531 1866 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
PJSIP Method: OPTIONS, Stateful OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateful Ubuntu 21.04 Ubuntu 20.10 800 1600 2400 3200 4000 SE +/- 6.89, N = 3 SE +/- 7.67, N = 3 3815 3716 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
PJSIP Method: OPTIONS, Stateless OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateless Ubuntu 21.04 Ubuntu 20.10 9K 18K 27K 36K 45K SE +/- 533.56, N = 3 SE +/- 217.12, N = 3 40082 40166 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 Ubuntu 21.04 Ubuntu 20.10 4 8 12 16 20 SE +/- 0.19, N = 4 SE +/- 0.11, N = 3 15.38 15.36 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Ubuntu 21.04 Ubuntu 20.10 1.3174 2.6348 3.9522 5.2696 6.587 SE +/- 0.074, N = 15 SE +/- 0.084, N = 15 5.050 5.855 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Ubuntu 21.04 Ubuntu 20.10 9 18 27 36 45 SE +/- 0.47, N = 3 SE +/- 0.05, N = 3 36.76 38.68 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.146, N = 15 SE +/- 0.088, N = 7 8.522 9.451 1. (CXX) g++ options: -O3 -fPIC -lm
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile Ubuntu 21.04 Ubuntu 20.10 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 35.19 37.07
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.13, N = 3 SE +/- 0.33, N = 3 75.95 76.88
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.10.20 Time To Compile Ubuntu 21.04 Ubuntu 20.10 6 12 18 24 30 SE +/- 0.35, N = 13 SE +/- 0.39, N = 14 24.81 26.53
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Ninja Ubuntu 21.04 Ubuntu 20.10 30 60 90 120 150 SE +/- 0.36, N = 3 SE +/- 1.55, N = 3 129.15 144.79
Timed LLVM Compilation Build System: Unix Makefiles OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Unix Makefiles Ubuntu 21.04 Ubuntu 20.10 50 100 150 200 250 SE +/- 1.40, N = 3 SE +/- 0.98, N = 3 198.54 226.85
Timed MPlayer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed MPlayer Compilation 1.4 Time To Compile Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 10.11 11.10
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 15.11 Time To Compile Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.97, N = 3 SE +/- 1.42, N = 3 102.14 108.60
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.4.2 Time To Compile Ubuntu 21.04 Ubuntu 20.10 10 20 30 40 50 SE +/- 0.52, N = 3 SE +/- 0.37, N = 9 40.21 44.01
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Ubuntu 21.04 Ubuntu 20.10 16 32 48 64 80 SE +/- 0.48, N = 3 SE +/- 0.57, N = 3 70.06 74.32
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.097, N = 5 SE +/- 0.063, N = 3 9.476 9.365 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Tungsten Renderer Scene: Hair OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair Ubuntu 21.04 Ubuntu 20.10 2 4 6 8 10 SE +/- 0.07388, N = 3 SE +/- 0.07152, N = 15 6.52538 6.75881 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic Ubuntu 21.04 Ubuntu 20.10 7 14 21 28 35 SE +/- 0.27, N = 3 SE +/- 0.06, N = 3 31.27 31.13 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Non-Exponential OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential Ubuntu 21.04 Ubuntu 20.10 1.1814 2.3628 3.5442 4.7256 5.907 SE +/- 0.07466, N = 3 SE +/- 0.04628, N = 3 5.23912 5.25085 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Volumetric Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic Ubuntu 21.04 Ubuntu 20.10 4 8 12 16 20 SE +/- 0.44, N = 15 SE +/- 0.44, N = 15 13.45 13.76 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 2.26, N = 15 SE +/- 2.54, N = 12 86.28 83.27 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.2293 0.4586 0.6879 0.9172 1.1465 SE +/- 0.010679, N = 4 SE +/- 0.009311, N = 7 0.945450 1.018895 MIN: 0.86 MIN: 0.84 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.3475 0.695 1.0425 1.39 1.7375 SE +/- 0.01131, N = 3 SE +/- 0.02039, N = 3 1.37903 1.54442 MIN: 1.33 MIN: 1.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.3157 0.6314 0.9471 1.2628 1.5785 SE +/- 0.00905, N = 15 SE +/- 0.01231, N = 3 1.25869 1.40323 MIN: 0.86 MIN: 0.82 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.106 0.212 0.318 0.424 0.53 SE +/- 0.004513, N = 6 SE +/- 0.003416, N = 15 0.445171 0.471052 MIN: 0.4 MIN: 0.39 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.7447 1.4894 2.2341 2.9788 3.7235 SE +/- 0.02675, N = 12 SE +/- 0.03870, N = 3 3.03516 3.30961 MIN: 2.86 MIN: 2.85 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.4414 0.8828 1.3242 1.7656 2.207 SE +/- 0.01169, N = 14 SE +/- 0.01622, N = 9 1.82051 1.96166 MIN: 1.67 MIN: 1.66 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.3554 0.7108 1.0662 1.4216 1.777 SE +/- 0.00589, N = 3 SE +/- 0.01111, N = 3 1.41084 1.57954 MIN: 1.27 MIN: 1.24 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 8 16 24 32 40 SE +/- 0.29, N = 15 SE +/- 0.37, N = 15 28.75 35.46 MIN: 15.46 MIN: 14.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.2031 0.4062 0.6093 0.8124 1.0155 SE +/- 0.008374, N = 6 SE +/- 0.008637, N = 6 0.846330 0.902751 MIN: 0.8 MIN: 0.8 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.232 0.464 0.696 0.928 1.16 SE +/- 0.005014, N = 3 SE +/- 0.004578, N = 3 0.924756 1.031020 MIN: 0.86 MIN: 0.84 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.087 0.174 0.261 0.348 0.435 SE +/- 0.004351, N = 3 SE +/- 0.003521, N = 3 0.362841 0.386794 MIN: 0.32 MIN: 0.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.0466 0.0932 0.1398 0.1864 0.233 SE +/- 0.001836, N = 13 SE +/- 0.001494, N = 14 0.193198 0.206919 MIN: 0.18 MIN: 0.18 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 160 320 480 640 800 SE +/- 1.73, N = 3 SE +/- 5.22, N = 15 672.65 732.24 MIN: 646.15 MIN: 674.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 100 200 300 400 500 SE +/- 0.63, N = 3 SE +/- 2.30, N = 3 437.87 468.05 MIN: 422.67 MIN: 438.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 160 320 480 640 800 SE +/- 5.64, N = 15 SE +/- 6.32, N = 15 684.27 730.83 MIN: 645.4 MIN: 671.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.5108 1.0216 1.5324 2.0432 2.554 SE +/- 0.01823, N = 3 SE +/- 0.02930, N = 15 2.10597 2.27008 MIN: 2.03 MIN: 2.03 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.7886 1.5772 2.3658 3.1544 3.943 SE +/- 0.00929, N = 3 SE +/- 0.02670, N = 3 3.27051 3.50505 MIN: 3.09 MIN: 3.07 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.03296, N = 14 SE +/- 3.41513, N = 12 3.60538 11.68014 MIN: 3.48 MIN: 3.48 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 110 220 330 440 550 SE +/- 0.53, N = 3 SE +/- 33.83, N = 15 439.68 506.97 MIN: 424.75 MIN: 435.95 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.0609 0.1218 0.1827 0.2436 0.3045 SE +/- 0.002948, N = 3 SE +/- 0.003157, N = 4 0.256555 0.270802 MIN: 0.23 MIN: 0.23 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 160 320 480 640 800 SE +/- 0.16, N = 3 SE +/- 19.97, N = 12 671.54 754.98 MIN: 647.67 MIN: 673.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 100 200 300 400 500 SE +/- 0.96, N = 3 SE +/- 1.27, N = 3 437.97 472.25 MIN: 422.42 MIN: 442.59 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.0556 0.1112 0.1668 0.2224 0.278 SE +/- 0.002322, N = 6 SE +/- 0.001954, N = 10 0.231483 0.247260 MIN: 0.2 MIN: 0.2 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU Ubuntu 21.04 Ubuntu 20.10 0.1484 0.2968 0.4452 0.5936 0.742 SE +/- 0.004554, N = 3 SE +/- 0.004580, N = 3 0.615331 0.659638 MIN: 0.57 MIN: 0.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Numpy Benchmark OpenBenchmarking.org Score, More Is Better Numpy Benchmark Ubuntu 21.04 Ubuntu 20.10 80 160 240 320 400 SE +/- 2.48, N = 12 SE +/- 2.90, N = 12 332.22 387.62
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.84, N = 3 SE +/- 0.34, N = 3 87.46 81.77
Timed Erlang/OTP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Erlang/OTP Compilation 23.2 Time To Compile Ubuntu 21.04 Ubuntu 20.10 40 80 120 160 200 SE +/- 0.85, N = 3 SE +/- 1.20, N = 3 180.49 183.28
Timed Wasmer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile Ubuntu 21.04 Ubuntu 20.10 15 30 45 60 75 SE +/- 0.37, N = 3 SE +/- 0.19, N = 3 44.61 68.80 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc
Helsing Digit Range: 14 digit OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 14 digit Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.73, N = 15 SE +/- 0.21, N = 3 82.47 83.53 1. (CC) gcc options: -O2 -pthread
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 Ubuntu 21.04 Ubuntu 20.10 50K 100K 150K 200K 250K SE +/- 66.66, N = 3 SE +/- 149.68, N = 3 211170.46 211559.00 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi Ubuntu 21.04 Ubuntu 20.10 600 1200 1800 2400 3000 SE +/- 50.25, N = 12 SE +/- 5.62, N = 3 2735.74 2731.13 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x Ubuntu 21.04 Ubuntu 20.10 500 1000 1500 2000 2500 SE +/- 24.82, N = 3 SE +/- 25.12, N = 4 2221.97 2253.80 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin Ubuntu 21.04 Ubuntu 20.10 6K 12K 18K 24K 30K SE +/- 326.37, N = 15 SE +/- 615.22, N = 12 29191 28568 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Ringcoin Ubuntu 21.04 Ubuntu 20.10 800 1600 2400 3200 4000 SE +/- 99.60, N = 15 SE +/- 168.93, N = 15 3487.91 3954.42 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S Ubuntu 21.04 Ubuntu 20.10 300K 600K 900K 1200K 1500K SE +/- 21571.08, N = 15 SE +/- 14563.41, N = 15 1395127 1360453 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin Ubuntu 21.04 Ubuntu 20.10 6K 12K 18K 24K 30K SE +/- 1184.13, N = 12 SE +/- 1085.72, N = 12 29473 26183 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin Ubuntu 21.04 Ubuntu 20.10 60K 120K 180K 240K 300K SE +/- 2421.06, N = 15 SE +/- 3496.83, N = 15 277683 268467 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Myriad-Groestl Ubuntu 21.04 Ubuntu 20.10 10K 20K 30K 40K 50K SE +/- 1514.24, N = 15 SE +/- 1259.45, N = 15 40753 46086 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: LBC, LBRY Credits Ubuntu 21.04 Ubuntu 20.10 40K 80K 120K 160K 200K SE +/- 2699.74, N = 15 SE +/- 3093.55, N = 12 163627 151978 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Quad SHA-256, Pyrite Ubuntu 21.04 Ubuntu 20.10 70K 140K 210K 280K 350K SE +/- 3914.38, N = 15 SE +/- 9923.95, N = 13 317291 322628 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Triple SHA-256, Onecoin Ubuntu 21.04 Ubuntu 20.10 90K 180K 270K 360K 450K SE +/- 1250.96, N = 3 SE +/- 11124.39, N = 12 422680 402686 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SecureMark Benchmark: SecureMark-TLS OpenBenchmarking.org marks, More Is Better SecureMark 1.0.4 Benchmark: SecureMark-TLS Ubuntu 21.04 Ubuntu 20.10 50K 100K 150K 200K 250K SE +/- 178.37, N = 3 SE +/- 530.29, N = 3 230272 226588 1. (CC) gcc options: -pedantic -O3
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.13, N = 15 10.79 10.37 1. Ubuntu 21.04: Nodejs
v12.21.0 2. Ubuntu 20.10: Nodejs
v12.18.2
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 200M 400M 600M 800M 1000M SE +/- 702242.44, N = 3 SE +/- 2859827.11, N = 3 826756667 812776667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 300M 600M 900M 1200M 1500M SE +/- 768837.51, N = 3 SE +/- 11623730.52, N = 3 1631166667 1569933333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 700M 1400M 2100M 2800M 3500M SE +/- 19853994.84, N = 3 SE +/- 24378623.66, N = 4 3044966667 2157125000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 700M 1400M 2100M 2800M 3500M SE +/- 9583724.63, N = 3 SE +/- 13475203.56, N = 3 3290766667 3292133333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 160 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 160 - Buffer Length: 256 - Filter Length: 57 Ubuntu 21.04 Ubuntu 20.10 700M 1400M 2100M 2800M 3500M SE +/- 15087448.79, N = 3 SE +/- 10305392.33, N = 3 3080633333 3086966667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 Ubuntu 21.04 Ubuntu 20.10 110K 220K 330K 440K 550K SE +/- 4128.37, N = 3 SE +/- 13586.63, N = 12 524178.09 355593.18 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
TensorFlow Lite Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: SqueezeNet Ubuntu 21.04 Ubuntu 20.10 14K 28K 42K 56K 70K SE +/- 35.00, N = 3 SE +/- 1725.85, N = 12 47808.3 66836.4
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception V4 Ubuntu 21.04 Ubuntu 20.10 200K 400K 600K 800K 1000K SE +/- 7233.85, N = 5 SE +/- 7218.89, N = 3 687019 892083
TensorFlow Lite Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: NASNet Mobile Ubuntu 21.04 Ubuntu 20.10 30K 60K 90K 120K 150K SE +/- 768.19, N = 15 SE +/- 5683.05, N = 15 81319.3 157434.0
TensorFlow Lite Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Float Ubuntu 21.04 Ubuntu 20.10 12K 24K 36K 48K 60K SE +/- 612.15, N = 15 SE +/- 733.25, N = 15 41555.1 55489.8
TensorFlow Lite Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Quant Ubuntu 21.04 Ubuntu 20.10 13K 26K 39K 52K 65K SE +/- 601.06, N = 15 SE +/- 573.26, N = 15 43074.5 58977.6
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 Ubuntu 21.04 Ubuntu 20.10 160K 320K 480K 640K 800K SE +/- 790.57, N = 3 SE +/- 13732.96, N = 12 569303 725870
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only Ubuntu 21.04 Ubuntu 20.10 200K 400K 600K 800K 1000K SE +/- 10755.01, N = 15 SE +/- 12775.84, N = 3 946634 927066 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency Ubuntu 21.04 Ubuntu 20.10 0.061 0.122 0.183 0.244 0.305 SE +/- 0.003, N = 15 SE +/- 0.004, N = 3 0.266 0.271 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write Ubuntu 21.04 Ubuntu 20.10 6K 12K 18K 24K 30K SE +/- 43.42, N = 3 SE +/- 45.12, N = 3 28712 20742 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 13.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.010, N = 3 SE +/- 0.030, N = 3 8.739 12.100 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
WRF Input: conus 2.5km OpenBenchmarking.org Seconds, Fewer Is Better WRF 4.2.2 Input: conus 2.5km Ubuntu 21.04 Ubuntu 20.10 2K 4K 6K 8K 10K 9903.05 9897.12 -levent_core -levent 1. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_pthreads -lutil -lm -lrt -lz
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 Ubuntu 21.04 Ubuntu 20.10 130 260 390 520 650 SE +/- 9.40, N = 15 SE +/- 12.21, N = 15 443.55 606.05 MIN: 367.81 / MAX: 808.75 MIN: 372.96 / MAX: 843.9 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 Ubuntu 21.04 Ubuntu 20.10 80 160 240 320 400 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 366.73 366.20 MIN: 365.95 / MAX: 373.2 MIN: 366.01 / MAX: 367.51 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Sysbench Test: RAM / Memory OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory Ubuntu 21.04 Ubuntu 20.10 3K 6K 9K 12K 15K SE +/- 145.91, N = 15 SE +/- 151.11, N = 12 12176.80 12184.78 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU Ubuntu 21.04 Ubuntu 20.10 50K 100K 150K 200K 250K SE +/- 275.74, N = 3 SE +/- 253.78, N = 3 213870.87 213970.43 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 3.11.4 Test: Writes Ubuntu 21.04 Ubuntu 20.10 20K 40K 60K 80K 100K SE +/- 2263.61, N = 15 SE +/- 1429.17, N = 3 106863 104506
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 29.61 29.56
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Classroom - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 72.28 72.53
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Fishy Cat - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 11 22 33 44 55 SE +/- 0.15, N = 3 SE +/- 0.44, N = 3 45.72 46.91
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Barbershop - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.26, N = 3 SE +/- 0.41, N = 3 108.31 109.65
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Pabellon Barcelona - Compute: CPU-Only Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.24, N = 3 SE +/- 0.26, N = 3 88.36 89.25
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 100 200 300 400 500 SE +/- 1.17, N = 3 SE +/- 1.67, N = 3 479 356 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 110 220 330 440 550 SE +/- 6.29, N = 12 SE +/- 4.21, N = 3 500 435 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 40 80 120 160 200 SE +/- 1.20, N = 3 SE +/- 1.04, N = 3 193 161 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 2K 4K 6K 8K 10K SE +/- 106.53, N = 3 SE +/- 35.85, N = 3 8217 7470 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU Ubuntu 21.04 Ubuntu 20.10 1500 3000 4500 6000 7500 SE +/- 204.88, N = 12 SE +/- 158.31, N = 12 6967 5673 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
PyBench Total For Average Test Times OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times Ubuntu 21.04 Ubuntu 20.10 200 400 600 800 1000 SE +/- 1.86, N = 3 SE +/- 2.67, N = 3 995 984
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite Ubuntu 21.04 Ubuntu 20.10 150K 300K 450K 600K 750K SE +/- 3328.66, N = 3 SE +/- 4029.73, N = 3 715587 717063
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 1300 2600 3900 5200 6500 SE +/- 53.61, N = 3 SE +/- 43.45, N = 13 6148.4 4516.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 600 1200 1800 2400 3000 SE +/- 9.20, N = 2 SE +/- 7.67, N = 5 2985.6 2969.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 500 1000 1500 2000 2500 SE +/- 24.65, N = 3 SE +/- 42.14, N = 15 2096.7 2004.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 700 1400 2100 2800 3500 SE +/- 2.77, N = 3 SE +/- 6.44, N = 13 3073.3 3022.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 1.03, N = 15 SE +/- 0.46, N = 3 82.2 81.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 600 1200 1800 2400 3000 SE +/- 3.01, N = 15 SE +/- 9.81, N = 3 2615.9 2607.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 60 120 180 240 300 SE +/- 1.99, N = 3 SE +/- 8.38, N = 12 266.9 265.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 70 140 210 280 350 SE +/- 3.33, N = 3 SE +/- 1.00, N = 3 300.2 297.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 700 1400 2100 2800 3500 SE +/- 8.34, N = 3 SE +/- 15.05, N = 3 3283.9 3235.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed Ubuntu 21.04 Ubuntu 20.10 10 20 30 40 50 SE +/- 0.54, N = 15 SE +/- 0.27, N = 3 46.0 41.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed Ubuntu 21.04 Ubuntu 20.10 600 1200 1800 2400 3000 SE +/- 5.02, N = 15 SE +/- 3.89, N = 3 2708.5 2685.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare Ubuntu 21.04 Ubuntu 20.10 3 6 9 12 15 SE +/- 0.058, N = 3 SE +/- 0.014, N = 3 9.004 8.851 1. (CXX) g++ options: -O3 -pthread
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.04 Test: OFDM_Test Ubuntu 21.04 Ubuntu 20.10 30M 60M 90M 120M 150M SE +/- 284800.12, N = 3 SE +/- 305505.05, N = 3 120766667 121300000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: PHY_DL_Test OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 50 100 150 200 250 SE +/- 0.78, N = 3 SE +/- 0.64, N = 3 206.8 205.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: PHY_DL_Test OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: PHY_DL_Test Ubuntu 21.04 Ubuntu 20.10 20 40 60 80 100 SE +/- 0.78, N = 3 SE +/- 0.23, N = 3 85.1 85.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3, Long Mode - Decompression Speed Ubuntu 20.10 700 1400 2100 2800 3500 SE +/- 31.17, N = 12 3131.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Phoronix Test Suite v10.8.4