Xeon Platinum Ubuntu 19.04 vs 21.04

2 x Intel Xeon Platinum 8280 comparison for a future article looking at the performance change over two years.

HTML result view exported from: https://openbenchmarking.org/result/2102114-HA-CASCADE0202.

Xeon Platinum Ubuntu 19.04 vs 21.04ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionVulkanUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC112 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads)GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS)Intel Sky Lake-E DMI3 Registers378GB280GB INTEL SSDPED1D280GAllvmpipe2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbEUbuntu 19.045.0.0-38-generic (x86_64)GNOME Shell 3.32.2X Server 1.20.43.3 Mesa 19.0.8 (LLVM 8.0 256 bits)GCC 8.3.0ext41024x768ASPEEDUbuntu 21.045.8.0-36-generic (x86_64)GNOME Shell 3.38.2X Server 1.20.91.0.2GCC 10.2.1 202101305.11.0-051100rc7daily20210209-generic (x86_64) 20210208GCC 11.0.0 20210115OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- Ubuntu 19.04: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Ubuntu 21.04 Dev: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-YIlI3N/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-YIlI3N/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Ubuntu 21.04 Dev + Linux 5.11: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-YIlI3N/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-YIlI3N/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Ubuntu 21.04 Dev + Linux 5.11 + GCC11: --build=x86_64-linux-gnu --disable-nls --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=yes,extra,rtl --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-snapshot-nHdSaM/gcc-snapshot-20210115/debian/tmp-nvptx/usr/lib/gcc-snapshot,amdgcn-amdhsa=/build/gcc-snapshot-nHdSaM/gcc-snapshot-20210115/debian/tmp-gcn/usr/lib/gcc-snapshot --enable-plugin --enable-shared --host=x86_64-linux-gnu --program-prefix= --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Ubuntu 19.04: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x500002c- Ubuntu 21.04 Dev: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5003003- Ubuntu 21.04 Dev + Linux 5.11: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5003003- Ubuntu 21.04 Dev + Linux 5.11 + GCC11: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5003003Python Details- Ubuntu 19.04: Python 2.7.16 + Python 3.7.3- Ubuntu 21.04 Dev: Python 3.9.1+- Ubuntu 21.04 Dev + Linux 5.11: Python 3.9.1+- Ubuntu 21.04 Dev + Linux 5.11 + GCC11: Python 3.9.1+Security Details- Ubuntu 19.04: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + tsx_async_abort: Mitigation of TSX disabled - Ubuntu 21.04 Dev: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled - Ubuntu 21.04 Dev + Linux 5.11: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled - Ubuntu 21.04 Dev + Linux 5.11 + GCC11: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Xeon Platinum Ubuntu 19.04 vs 21.04hpcg: npb: EP.Drodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocyterodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusternamd: ATPase Simulation - 327,506 Atomsdolfyn: Computational Fluid Dynamicsneat: pennant: leblancbigmafft: Multiple Sequence Alignment - LSU RNAopenfoam: Motorbike 60Mlammps: 20k Atomslibraw: Post-Processing Benchmarkarrayfire: Conjugate Gradient CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Ultra Fastsvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080px265: Bosphorus 1080pmt-dgemm: Sustained Floating-Point Rateoidn: Memorialopenvkl: vklBenchmarkopenvkl: vklBenchmarkVdbVolumeopenvkl: vklBenchmarkStructuredVolumehimeno: Poisson Pressure Solveryafaray: Total Time For Sample Scenenumpy: ngspice: C2670ngspice: C7552financebench: Repo OpenMPfinancebench: Bonds OpenMPaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddingaskap: Hogbom Clean OpenMPtensorflow-lite: SqueezeNettensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Floattensorflow-lite: Mobilenet Quanttensorflow-lite: Inception ResNet V2mysqlslap: 128mysqlslap: 256mysqlslap: 512pgbench: 100 - 250 - Read Writepgbench: 100 - 250 - Read Write - Average Latencysqlite-speedtest: Timed Time - Size 1,000redis: SADDredis: LPUSHredis: GETredis: SETtnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1plaidml: No - Inference - VGG19 - CPUplaidml: No - Inference - ResNet 50 - CPUblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyonnx: yolov4 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUpybench: Total For Average Test Timespyperformance: 2to3pyperformance: chaospyperformance: pathlibpyperformance: raytracepyperformance: json_loadspyperformance: crypto_pyaespyperformance: regex_compilepyperformance: python_startuppyperformance: django_templatepyperformance: pickle_pure_pythonappleseed: Emilyphpbench: PHP Benchmark Suitekripke: v-ray: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1129.03954069.10114.03662.0647.26011.9600.3659722.89232.9588.46201813.049168.5226.65438.618.4833.173604.58332558.126923.534576.4880.3319910.81479916.8828.63258.83267.36222.7834.5018.71889135.4745624812767750779133994.504177223.468344.63163.824135.57351894.80468788678.11458410262.112085.0615.30667294.4104739.153606.757234.61011558599499521434525.77164.9071901359.651486892.542201881.921747022.39365.997329.55523.494.7238.38104.2231740213861566410108034010918.746826.110917311.354.1445157.874552633177935025774741929.16676262.31109.62462.8145.82311.3580.3686121.11931.4468.54103011.285169.4625.81541.967.8663.188744.38790546.314988.091549.2100.2910250.82434316.9629.09265.20266.01226.5634.7619.78559235.3846526865549792144884024.81869397.926364.56137.291120.02741538.34765658793.12500010396.912439.1824.37364866.8101158.152927.855274.61050017610527507459435.45663.9562155059.331630926.212550238.751907194.92354.344326.37824.124.7138.10104.4835040414466206262107133210317.144725.41041629.7947.1417158.4574656702161032487274723429.25096504.82109.77951.6186.83911.6750.3661121.76127.0938.54639511.849169.6825.86043.057.2773.193444.38671543.337983.392548.0770.2946850.82245917.0829.22270.03278.26233.0334.8819.85475435.6248526512800756699573983.46210098.502355.32137.504126.14741697.77864659180.92447910436.212438.7663.72464059.997422.351553.752709.61022900618507527496935.04564.3732152898.171643872.292445380.331887004.08348.787327.05323.524.6738.17104.2734141214367736155106933810317.444825.710416211.247.3419155.5478896682451042515174664029.18476230.86112.55051.6196.80911.6240.3692420.84727.5658.19978512.161170.2526.75241.957.1033.184174.25314559.360981.081547.8150.2927440.82080717.1629.49273.04277.39226.8134.9120.15767735.2146427008330755770884073.143218100.875366.08135.827126.29540971.98697957588.26822910416.512382.8665.19563798.796474.049849.052690.6103071863952652662.8352227428.421695910.252643078.501993808.87350.536328.66523.304.5937.93104.3434538714167306947107233910417.244725.610516311.347.3422155.82633366650110581295047033OpenBenchmarking.org

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11714212835SE +/- 0.15, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 329.0429.1729.2529.181. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1114002800420056007000SE +/- 18.13, N = 3SE +/- 160.71, N = 13SE +/- 73.13, N = 4SE +/- 4.94, N = 34069.106262.316504.826230.86-lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz-lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz-lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Ubuntu 19.04: Open MPI 3.1.33. Ubuntu 21.04 Dev: Open MPI 4.1.04. Ubuntu 21.04 Dev + Linux 5.11: Open MPI 4.1.05. Ubuntu 21.04 Dev + Linux 5.11 + GCC11: Open MPI 4.1.0

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11306090120150SE +/- 1.32, N = 4SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.81, N = 3114.04109.62109.78112.551. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC111428425670SE +/- 0.71, N = 13SE +/- 0.24, N = 3SE +/- 0.41, N = 3SE +/- 0.59, N = 1562.0662.8151.6251.621. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11246810SE +/- 0.090, N = 15SE +/- 0.129, N = 15SE +/- 0.010, N = 3SE +/- 0.012, N = 37.2605.8236.8396.8091. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC113691215SE +/- 0.02, N = 3SE +/- 0.09, N = 15SE +/- 0.12, N = 3SE +/- 0.09, N = 1511.9611.3611.6811.621. (CXX) g++ options: -O2 -lOpenCL

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC110.08310.16620.24930.33240.4155SE +/- 0.00094, N = 3SE +/- 0.00131, N = 3SE +/- 0.00050, N = 3SE +/- 0.00222, N = 30.365970.368610.366110.36924

Dolfyn

Computational Fluid Dynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11510152025SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 322.8921.1221.7620.85

Nebular Empirical Analysis Tool

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2020-02-29Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11816243240SE +/- 0.26, N = 15SE +/- 0.21, N = 15SE +/- 0.05, N = 3SE +/- 0.39, N = 332.9631.4527.0927.571. (F9X) gfortran options: -cpp -ffree-line-length-0 -Jsource/ -fopenmp -O3 -fno-backtrace

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11246810SE +/- 0.013902, N = 3SE +/- 0.039683, N = 3SE +/- 0.004992, N = 3SE +/- 0.055343, N = 38.4620188.5410308.5463958.1997851. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNAUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC113691215SE +/- 0.18, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 313.0511.2911.8512.161. (CC) gcc options: -std=c99 -O3 -lm -lpthread

OpenFOAM

Input: Motorbike 60M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60MUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC114080120160200SE +/- 0.34, N = 3SE +/- 0.34, N = 3SE +/- 0.23, N = 3SE +/- 0.16, N = 3168.52169.46169.68170.25-lfoamToVTK -lfileFormats-lfoamToVTK -lfileFormats-lfoamToVTK -lfileFormats-ldecompose -lmetisDecomp -lscotchDecomp -lregionModels1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -llagrangian -lgenericPatchFields -lOpenFOAM -ldl -lm

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 326.6525.8225.8626.751. (CXX) g++ options: -O3 -march=native -pthread -lm

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC111020304050SE +/- 0.05, N = 3SE +/- 0.22, N = 3SE +/- 0.15, N = 3SE +/- 0.11, N = 338.6141.9643.0541.951. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

ArrayFire

Test: Conjugate Gradient CPU

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11246810SE +/- 0.739, N = 15SE +/- 0.182, N = 15SE +/- 0.086, N = 15SE +/- 0.099, N = 158.4837.8667.2777.1031. (CXX) g++ options: -O3 -march=native -rdynamic

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC110.71851.4372.15552.8743.5925SE +/- 0.01090, N = 3SE +/- 0.01304, N = 3SE +/- 0.00395, N = 3SE +/- 0.01203, N = 33.173603.188743.193443.18417MIN: 2.94MIN: 2.96MIN: 2.97MIN: 2.951. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC111.03122.06243.09364.12485.156SE +/- 0.00382, N = 3SE +/- 0.00385, N = 3SE +/- 0.00438, N = 3SE +/- 0.00276, N = 34.583324.387904.386714.25314MIN: 4.45MIN: 4.25MIN: 4.24MIN: 4.121. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11120240360480600SE +/- 5.67, N = 3SE +/- 3.68, N = 3SE +/- 2.43, N = 3SE +/- 7.60, N = 3558.13546.31543.34559.36MIN: 543.23MIN: 533.79MIN: 530.81MIN: 536.061. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC112004006008001000SE +/- 0.79, N = 3SE +/- 9.48, N = 6SE +/- 4.40, N = 3SE +/- 1.76, N = 3923.53988.09983.39981.08MIN: 910.01MIN: 952.93MIN: 964.72MIN: 966.891. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11120240360480600SE +/- 9.03, N = 15SE +/- 1.81, N = 3SE +/- 3.91, N = 3SE +/- 0.48, N = 3576.49549.21548.08547.82MIN: 541.14MIN: 537.59MIN: 532.31MIN: 534.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC110.07470.14940.22410.29880.3735SE +/- 0.002873, N = 8SE +/- 0.003010, N = 4SE +/- 0.003609, N = 4SE +/- 0.003168, N = 30.3319910.2910250.2946850.292744MIN: 0.27MIN: 0.25MIN: 0.25MIN: 0.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC110.18550.3710.55650.7420.9275SE +/- 0.001869, N = 3SE +/- 0.003410, N = 3SE +/- 0.000961, N = 3SE +/- 0.000418, N = 30.8147990.8243430.8224590.820807MIN: 0.76MIN: 0.76MIN: 0.77MIN: 0.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1148121620SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 316.8816.9617.0817.161. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11714212835SE +/- 0.22, N = 3SE +/- 0.07, N = 3SE +/- 0.16, N = 3SE +/- 0.06, N = 328.6329.0929.2229.491. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080pUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1160120180240300SE +/- 3.52, N = 3SE +/- 1.72, N = 3SE +/- 2.34, N = 15SE +/- 1.55, N = 3258.83265.20270.03273.041. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1160120180240300SE +/- 3.58, N = 3SE +/- 3.60, N = 3SE +/- 1.92, N = 15SE +/- 2.29, N = 3267.36266.01278.26277.391. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1150100150200250SE +/- 2.60, N = 3SE +/- 1.89, N = 9SE +/- 2.15, N = 15SE +/- 1.69, N = 3222.78226.56233.03226.811. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11816243240SE +/- 0.14, N = 3SE +/- 0.18, N = 3SE +/- 0.16, N = 3SE +/- 0.20, N = 334.5034.7634.8834.911. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11510152025SE +/- 0.08, N = 3SE +/- 0.25, N = 3SE +/- 0.20, N = 5SE +/- 0.01, N = 318.7219.7919.8520.161. (CC) gcc options: -O3 -march=native -fopenmp

Intel Open Image Denoise

Scene: Memorial

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11816243240SE +/- 0.41, N = 3SE +/- 0.05, N = 3SE +/- 0.20, N = 3SE +/- 0.19, N = 335.4735.3835.6235.21

OpenVKL

Benchmark: vklBenchmark

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11100200300400500SE +/- 4.20, N = 9SE +/- 5.62, N = 4SE +/- 1.45, N = 3SE +/- 6.49, N = 3456465485464MIN: 1 / MAX: 1812MIN: 1 / MAX: 1811MIN: 1 / MAX: 1790MIN: 1 / MAX: 1832

OpenVKL

Benchmark: vklBenchmarkVdbVolume

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkVdbVolumeUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC116M12M18M24M30MSE +/- 70798.88, N = 3SE +/- 217704.15, N = 3SE +/- 65172.45, N = 3SE +/- 94239.71, N = 324812767268655492651280027008330MIN: 868934 / MAX: 162479232MIN: 1021515 / MAX: 200332152MIN: 1022996 / MAX: 184673880MIN: 1049658 / MAX: 186492960

OpenVKL

Benchmark: vklBenchmarkStructuredVolume

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkStructuredVolumeUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1120M40M60M80M100MSE +/- 1084350.33, N = 15SE +/- 907902.42, N = 15SE +/- 887912.44, N = 4SE +/- 515321.24, N = 375077913792144887566995775577088MIN: 1073646 / MAX: 803527920MIN: 1183368 / MAX: 1019150352MIN: 1211791 / MAX: 633286944MIN: 1183326 / MAX: 645253056

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC119001800270036004500SE +/- 0.36, N = 3SE +/- 5.05, N = 3SE +/- 2.71, N = 3SE +/- 1.60, N = 33994.504024.823983.464073.141. (CC) gcc options: -O3 -march=native -mavx2

YafaRay

Total Time For Sample Scene

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample SceneUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1150100150200250SE +/- 16.23, N = 12SE +/- 0.79, N = 3SE +/- 0.82, N = 3SE +/- 1.79, N = 15223.4797.9398.50100.88-lboost_system -lboost_filesystem -lboost_locale-lpthread-lpthread-lpthread1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1180160240320400SE +/- 1.31, N = 3SE +/- 0.40, N = 3SE +/- 2.59, N = 3SE +/- 0.49, N = 3344.63364.56355.32366.08

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC114080120160200SE +/- 1.27, N = 12SE +/- 1.52, N = 3SE +/- 0.55, N = 3SE +/- 0.39, N = 3163.82137.29137.50135.831. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11306090120150SE +/- 1.71, N = 3SE +/- 0.07, N = 3SE +/- 0.66, N = 3SE +/- 2.20, N = 12135.57120.03126.15126.301. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1111K22K33K44K55KSE +/- 498.18, N = 3SE +/- 180.05, N = 3SE +/- 131.71, N = 3SE +/- 231.02, N = 351894.8041538.3541697.7840971.991. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1120K40K60K80K100KSE +/- 101.32, N = 3SE +/- 107.08, N = 3SE +/- 53.90, N = 3SE +/- 83.74, N = 388678.1158793.1359180.9257588.271. (CXX) g++ options: -O3 -march=native -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC112K4K6K8K10KSE +/- 66.19, N = 3SE +/- 19.67, N = 3SE +/- 0.00, N = 3SE +/- 19.67, N = 310262.110396.910436.210416.51. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC113K6K9K12K15KSE +/- 78.97, N = 3SE +/- 55.90, N = 3SE +/- 28.13, N = 3SE +/- 27.77, N = 312085.012439.112438.712382.81. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC112004006008001000SE +/- 17.36, N = 15SE +/- 8.96, N = 3SE +/- 1.47, N = 3SE +/- 1.47, N = 3615.31824.37663.72665.201. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1114K28K42K56K70KSE +/- 669.07, N = 15SE +/- 715.03, N = 4SE +/- 618.86, N = 3SE +/- 332.53, N = 367294.464866.864059.963798.7

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1120K40K60K80K100KSE +/- 2444.48, N = 12SE +/- 974.76, N = 6SE +/- 760.68, N = 10SE +/- 469.23, N = 3104739.1101158.197422.396474.0

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1111K22K33K44K55KSE +/- 675.73, N = 12SE +/- 444.73, N = 15SE +/- 381.91, N = 11SE +/- 396.71, N = 353606.752927.851553.749849.0

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1112K24K36K48K60KSE +/- 727.18, N = 15SE +/- 580.61, N = 15SE +/- 147.50, N = 3SE +/- 335.56, N = 357234.655274.652709.652690.6

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11200K400K600K800K1000KSE +/- 17126.87, N = 15SE +/- 9914.36, N = 3SE +/- 8164.03, N = 3SE +/- 10731.38, N = 51011558105001710229001030718

MariaDB

Clients: 128

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 128Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11140280420560700SE +/- 2.90, N = 3SE +/- 2.27, N = 3SE +/- 0.38, N = 3SE +/- 1.78, N = 3599610618639-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-shared -lrt -lstdc++1. (CXX) g++ options: -O3 -march=native -pie -fPIC -fstack-protector -O2 -lpthread -lz -lm -ldl

MariaDB

Clients: 256

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 256Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11110220330440550SE +/- 1.09, N = 3SE +/- 0.61, N = 3SE +/- 0.23, N = 3SE +/- 0.24, N = 3499527507526-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-shared -lrt -lstdc++1. (CXX) g++ options: -O3 -march=native -pie -fPIC -fstack-protector -O2 -lpthread -lz -lm -ldl

MariaDB

Clients: 512

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 512Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11110220330440550SE +/- 0.18, N = 3SE +/- 0.56, N = 3SE +/- 0.33, N = 3SE +/- 1.29, N = 3521507527526-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-llzma -lbz2 -lsnappy -lnuma -lcrypt -lssl -lcrypto-shared -lrt -lstdc++1. (CXX) g++ options: -O3 -march=native -pie -fPIC -fstack-protector -O2 -lpthread -lz -lm -ldl

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.1111K22K33K44K55KSE +/- 269.58, N = 3SE +/- 140.45, N = 3SE +/- 126.36, N = 34345245943496931. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.111.29852.5973.89555.1946.4925SE +/- 0.037, N = 3SE +/- 0.017, N = 3SE +/- 0.013, N = 35.7715.4565.0451. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC111428425670SE +/- 0.15, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 364.9163.9664.3762.841. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11500K1000K1500K2000K2500KSE +/- 39203.08, N = 12SE +/- 22931.99, N = 3SE +/- 14200.11, N = 3SE +/- 8203.23, N = 31901359.652155059.332152898.172227428.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11400K800K1200K1600K2000KSE +/- 12883.11, N = 3SE +/- 4419.84, N = 3SE +/- 3153.11, N = 3SE +/- 776.26, N = 31486892.541630926.211643872.291695910.251. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11600K1200K1800K2400K3000KSE +/- 70535.69, N = 15SE +/- 6330.39, N = 3SE +/- 13811.23, N = 3SE +/- 11499.67, N = 32201881.922550238.752445380.332643078.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11400K800K1200K1600K2000KSE +/- 30273.24, N = 15SE +/- 14871.09, N = 3SE +/- 24706.77, N = 3SE +/- 22443.34, N = 31747022.391907194.921887004.081993808.871. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1180160240320400SE +/- 1.21, N = 3SE +/- 3.23, N = 3SE +/- 0.41, N = 3SE +/- 1.10, N = 3366.00354.34348.79350.54MIN: 354.49 / MAX: 501.68MIN: 345.92 / MAX: 489.99MIN: 346.29 / MAX: 410.44MIN: 347.02 / MAX: 522.091. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1170140210280350SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.69, N = 3SE +/- 0.74, N = 3329.56326.38327.05328.67MIN: 329.04 / MAX: 331.16MIN: 325.29 / MAX: 327.12MIN: 326.19 / MAX: 330.75MIN: 326.96 / MAX: 331.191. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11612182430SE +/- 0.25, N = 15SE +/- 0.21, N = 15SE +/- 0.32, N = 15SE +/- 0.30, N = 1523.4924.1223.5223.30

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC111.0622.1243.1864.2485.31SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 34.724.714.674.59

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-OnlyUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11918273645SE +/- 0.17, N = 3SE +/- 0.24, N = 3SE +/- 0.17, N = 3SE +/- 0.08, N = 338.3838.1038.1737.93

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CPU-OnlyUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1120406080100SE +/- 0.38, N = 3SE +/- 0.15, N = 3SE +/- 0.20, N = 3SE +/- 0.14, N = 3104.22104.48104.27104.34

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1180160240320400SE +/- 3.24, N = 3SE +/- 1.59, N = 3SE +/- 3.62, N = 3SE +/- 1.48, N = 33173503413451. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1190180270360450SE +/- 2.57, N = 3SE +/- 2.47, N = 3SE +/- 4.92, N = 12SE +/- 1.92, N = 34024044123871. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11306090120150SE +/- 0.58, N = 3SE +/- 1.09, N = 3SE +/- 0.60, N = 3SE +/- 1.09, N = 31381441431411. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1115003000450060007500SE +/- 35.31, N = 3SE +/- 70.00, N = 12SE +/- 26.43, N = 3SE +/- 19.77, N = 361566620677367301. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1115003000450060007500SE +/- 33.38, N = 3SE +/- 54.74, N = 3SE +/- 78.69, N = 12SE +/- 72.47, N = 364106262615569471. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test TimesUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC112004006008001000SE +/- 2.52, N = 3SE +/- 2.52, N = 3SE +/- 1.67, N = 31080107110691072

PyPerformance

Benchmark: 2to3

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: 2to3Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1170140210280350340332338339

PyPerformance

Benchmark: chaos

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: chaosUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1120406080100SE +/- 0.33, N = 3SE +/- 0.33, N = 3109103103104

PyPerformance

Benchmark: pathlib

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pathlibUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11510152025SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 318.717.117.417.2

PyPerformance

Benchmark: raytrace

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: raytraceUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11100200300400500SE +/- 0.33, N = 3SE +/- 1.00, N = 3SE +/- 1.20, N = 3468447448447

PyPerformance

Benchmark: json_loads

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: json_loadsUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11612182430SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 326.125.425.725.6

PyPerformance

Benchmark: crypto_pyaes

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: crypto_pyaesUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1120406080100109104104105

PyPerformance

Benchmark: regex_compile

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: regex_compileUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC114080120160200173162162163

PyPerformance

Benchmark: python_startup

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: python_startupUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC113691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 311.309.7911.2011.30

PyPerformance

Benchmark: django_template

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: django_templateUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC111224364860SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.12, N = 354.147.147.347.3

PyPerformance

Benchmark: pickle_pure_python

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pickle_pure_pythonUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11100200300400500SE +/- 0.88, N = 3SE +/- 1.45, N = 3445417419422

Appleseed

Scene: Emily

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilyUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC114080120160200157.87158.46155.55155.83

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark SuiteUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC11140K280K420K560K700KSE +/- 321.84, N = 3SE +/- 1341.16, N = 3SE +/- 2936.03, N = 3SE +/- 1019.45, N = 3633177670216668245666501

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4Ubuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1120M40M60M80M100MSE +/- 1474539.43, N = 15SE +/- 1522978.65, N = 15SE +/- 860592.92, N = 15SE +/- 1086593.46, N = 4935025771032487271042515171058129501. (CXX) g++ options: -O3 -march=native -fopenmp

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPUUbuntu 19.04Ubuntu 21.04 DevUbuntu 21.04 Dev + Linux 5.11Ubuntu 21.04 Dev + Linux 5.11 + GCC1110K20K30K40K50KSE +/- 166.62, N = 3SE +/- 121.64, N = 3SE +/- 571.93, N = 3SE +/- 467.11, N = 347419472344664047033


Phoronix Test Suite v10.8.4