Cascade Lake Clear Linux 2021

2 x Intel Xeon Platinum 8280 testing with a GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) and llvmpipe on Ubuntu 21.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2103273-IB-2103266IB69.

Cascade Lake Clear Linux 2021ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionClear Linux 34420Ubuntu 21.04 Dev2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads)GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS)Intel Sky Lake-E DMI3 Registers378GB280GB INTEL SSDPED1D280GAllvmpipeVE2282 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbEClear Linux OS 344205.10.19-1032.native (x86_64)GNOME Shell 3.38.4X Server 1.20.104.5 Mesa 20.3.4 (LLVM 10.0.1 256 bits)GCC 10.2.1 20210324 releases/gcc-10.2.0-1013-g592388d4f6 + Clang 10.0.1 + LLVM 10.0.1ext41920x1080Ubuntu 21.045.11.0-11-generic (x86_64)GNOME Shell 3.38.3X Server4.5 Mesa 21.0.0 (LLVM 11.0.1 256 bits)GCC 10.2.1 20210320OpenBenchmarking.orgKernel Details- Clear Linux 34420: Transparent Huge Pages: always- Ubuntu 21.04 Dev: Transparent Huge Pages: madviseEnvironment Details- Clear Linux 34420: FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags -Wa,-mbranches-within-32B-boundaries" CXXFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake -Wa,-mbranches-within-32B-boundaries -fvisibility-inlines-hidden -Wl,--enable-new-dtags" MESA_GLSL_CACHE_DISABLE=0 FCFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags" CFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake -Wa,-mbranches-within-32B-boundaries" THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx"" Compiler Details- Clear Linux 34420: --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell - Ubuntu 21.04 Dev: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-DjbZbO/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-DjbZbO/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Clear Linux 34420: Scaling Governor: intel_pstate performance - CPU Microcode: 0x5003006- Ubuntu 21.04 Dev: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5003003Python Details- Clear Linux 34420: Python 3.9.2Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Cascade Lake Clear Linux 2021npb: EP.Dnpb: LU.Crodinia: OpenMP LavaMDrodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocyterodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusternamd: ATPase Simulation - 327,506 Atomsincompact3d: X3D-benchmarking input.i3dincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speeddav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitospray: San Miguel - Path Tracerospray: XFrog Forest - Path Tracerospray: NASA Streamlines - Path Tracerospray: Magnetic Reconnection - Path Tracerembree: Pathtracer - Crownembree: Pathtracer ISPC - Crownembree: Pathtracer - Asian Dragonembree: Pathtracer ISPC - Asian Dragonsvt-av1: Enc Mode 0 - 1080psvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 8 - 1080psvt-hevc: 1 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080poidn: Memorialopenvkl: vklBenchmarkstockfish: Total Timeavifenc: 2avifenc: 6avifenc: 10avifenc: 6, Losslessavifenc: 10, Losslessonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUsynthmark: VoiceMark_100liquid-dsp: 1 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 64 - 256 - 57liquid-dsp: 112 - 256 - 57gromacs: water_GMX50_baretensorflow-lite: SqueezeNettensorflow-lite: Inception V4tensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Floattensorflow-lite: Mobilenet Quanttensorflow-lite: Inception ResNet V2mnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3tnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1indigobench: CPU - Bedroomindigobench: CPU - Supercarblender: BMW27 - CPU-Onlyblender: Fishy Cat - CPU-Onlyv-ray: CPUcompress-zstd: 8, Long Mode - Decompression SpeedClear Linux 34420Ubuntu 21.04 Dev6759.15125567.3353.547111.14552.6796.54112.9010.37170458.1748254.4892463719.33319662.520.884.014.2148.006096.946.816135.484.31889.2834.838.92087.91038.26396.461010.38240.096.626.6718.5250051.386954.708961.211770.48200.1449.00974.63230.09270.36480.50348.26356.90291.2437.4451113619031730.12111.7343.66928.7136.4421.718881.0387143.623752.110613.818930.3837720.309397900.3393.195914.039064.42830476.656895.445477.7150.3302180.833080614.457587123338228633331518633333231590000028536333335.75961863.588311497445.751318.349894.18116388.15725.7334.7502.67232.477330.630330.1598.57620.20237.8455.71490826207.05136533.7658.053116.00863.7197.73413.7260.40085467.6321114.4697863319.01316642.430.813.513.6148.196297.044.836308.074.42306.2325.540.22297.9419.62244.27451.23101.706.496.6218.1850050.096154.133759.188370.71580.1247.68854.99428.55222.57329.16268.71266.40209.9335.7540413280094234.86115.0855.51234.6598.7251.651791.055464.063723.008054.174840.4492780.3830701092.423.517895.005844.67069613.0721087.27615.2920.4122640.942426547.498489456677011933331320366667213316666723634000005.710105347127901928241783813.383725.411515908.59929.3204.8803.23934.881434.391343.3458.29719.57938.5296.03465732814.2OpenBenchmarking.org

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DClear Linux 34420Ubuntu 21.04 Dev14002800420056007000SE +/- 29.20, N = 3SE +/- 21.90, N = 36759.156207.05-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake-lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Clear Linux 34420: 3.23. Ubuntu 21.04 Dev: Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CClear Linux 34420Ubuntu 21.04 Dev30K60K90K120K150KSE +/- 241.57, N = 3SE +/- 92.89, N = 3125567.33136533.76-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake-lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Clear Linux 34420: 3.23. Ubuntu 21.04 Dev: Open MPI 4.1.0

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDClear Linux 34420Ubuntu 21.04 Dev1326395265SE +/- 0.10, N = 3SE +/- 0.17, N = 353.5558.051. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DClear Linux 34420Ubuntu 21.04 Dev306090120150SE +/- 0.17, N = 3SE +/- 1.17, N = 6111.15116.011. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteClear Linux 34420Ubuntu 21.04 Dev1428425670SE +/- 1.63, N = 15SE +/- 0.49, N = 352.6863.721. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverClear Linux 34420Ubuntu 21.04 Dev246810SE +/- 0.127, N = 15SE +/- 0.087, N = 156.5417.7341. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterClear Linux 34420Ubuntu 21.04 Dev48121620SE +/- 0.13, N = 5SE +/- 0.29, N = 1212.9013.731. (CXX) g++ options: -O2 -lOpenCL

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsClear Linux 34420Ubuntu 21.04 Dev0.09020.18040.27060.36080.451SE +/- 0.00145, N = 3SE +/- 0.00344, N = 150.371700.40085

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dClear Linux 34420Ubuntu 21.04 Dev100200300400500SE +/- 0.16, N = 3SE +/- 6.63, N = 3458.17467.63-lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per DirectionClear Linux 34420Ubuntu 21.04 Dev1.01012.02023.03034.04045.0505SE +/- 0.00386211, N = 3SE +/- 0.04397043, N = 34.489246374.46978633-lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionClear Linux 34420Ubuntu 21.04 Dev510152025SE +/- 0.02, N = 3SE +/- 0.08, N = 319.3319.01-lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: KostyaClear Linux 34420Ubuntu 21.04 Dev0.5671.1341.7012.2682.835SE +/- 0.01, N = 3SE +/- 0.00, N = 32.522.43-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandomClear Linux 34420Ubuntu 21.04 Dev0.1980.3960.5940.7920.99SE +/- 0.00, N = 3SE +/- 0.00, N = 30.880.81-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweetsClear Linux 34420Ubuntu 21.04 Dev0.90231.80462.70693.60924.5115SE +/- 0.01, N = 3SE +/- 0.01, N = 34.013.51-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserIDClear Linux 34420Ubuntu 21.04 Dev0.94731.89462.84193.78924.7365SE +/- 0.01, N = 3SE +/- 0.01, N = 34.213.61-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CXX) g++ options: -O3 -pthread

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedClear Linux 34420Ubuntu 21.04 Dev1122334455SE +/- 0.03, N = 3SE +/- 0.01, N = 348.0048.191. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedClear Linux 34420Ubuntu 21.04 Dev13002600390052006500SE +/- 177.02, N = 3SE +/- 121.63, N = 36096.96297.01. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedClear Linux 34420Ubuntu 21.04 Dev1122334455SE +/- 0.16, N = 3SE +/- 0.01, N = 346.8144.831. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedClear Linux 34420Ubuntu 21.04 Dev14002800420056007000SE +/- 83.53, N = 3SE +/- 86.84, N = 36135.46308.01. (CC) gcc options: -O3

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression SpeedClear Linux 34420Ubuntu 21.04 Dev20406080100SE +/- 0.68, N = 15SE +/- 0.86, N = 484.374.4-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression SpeedClear Linux 34420Ubuntu 21.04 Dev5001000150020002500SE +/- 1.54, N = 15SE +/- 12.81, N = 41889.22306.2-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression SpeedClear Linux 34420Ubuntu 21.04 Dev2004006008001000SE +/- 2.10, N = 3SE +/- 2.07, N = 15834.8325.5-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression SpeedClear Linux 34420Ubuntu 21.04 Dev918273645SE +/- 0.43, N = 4SE +/- 0.47, N = 1538.940.2-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression SpeedClear Linux 34420Ubuntu 21.04 Dev5001000150020002500SE +/- 2.64, N = 4SE +/- 8.87, N = 152087.92297.9-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake-llzma1. (CC) gcc options: -O3 -pthread -lz

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080pClear Linux 34420Ubuntu 21.04 Dev2004006008001000SE +/- 8.00, N = 10SE +/- 1.21, N = 31038.26419.62-O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 367.3 / MAX: 1324.25MIN: 231.67 / MAX: 545.481. (CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4KClear Linux 34420Ubuntu 21.04 Dev90180270360450SE +/- 1.88, N = 3SE +/- 0.46, N = 3396.46244.27-O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 163.78 / MAX: 435.2MIN: 101.68 / MAX: 272.31. (CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080pClear Linux 34420Ubuntu 21.04 Dev2004006008001000SE +/- 4.74, N = 3SE +/- 0.16, N = 31010.38451.23-O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 375.41 / MAX: 1131.64MIN: 169.52 / MAX: 513.731. (CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bitClear Linux 34420Ubuntu 21.04 Dev50100150200250SE +/- 0.21, N = 3SE +/- 0.18, N = 3240.09101.70-O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 170.42 / MAX: 370.95MIN: 79.78 / MAX: 152.751. (CC) gcc options: -pthread

OSPray

Demo: San Miguel - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerClear Linux 34420Ubuntu 21.04 Dev246810SE +/- 0.00, N = 3SE +/- 0.00, N = 36.626.49MIN: 5.08 / MAX: 6.67MIN: 4.69 / MAX: 6.54

OSPray

Demo: XFrog Forest - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: Path TracerClear Linux 34420Ubuntu 21.04 Dev246810SE +/- 0.00, N = 3SE +/- 0.00, N = 36.676.62MIN: 5.38 / MAX: 6.71MIN: 5.92 / MAX: 6.67

OSPray

Demo: NASA Streamlines - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerClear Linux 34420Ubuntu 21.04 Dev510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 318.5218.18MIN: 14.29 / MAX: 19.23MIN: 12.35 / MAX: 18.52

OSPray

Demo: Magnetic Reconnection - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: Path TracerClear Linux 34420Ubuntu 21.04 Dev110220330440550500500MIN: 250MIN: 111.11

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: CrownClear Linux 34420Ubuntu 21.04 Dev1224364860SE +/- 0.28, N = 3SE +/- 0.38, N = 351.3950.10MIN: 48.91 / MAX: 53.74MIN: 45.6 / MAX: 53.58

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: CrownClear Linux 34420Ubuntu 21.04 Dev1224364860SE +/- 0.20, N = 3SE +/- 0.55, N = 354.7154.13MIN: 52.61 / MAX: 57.82MIN: 49.99 / MAX: 57.62

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian DragonClear Linux 34420Ubuntu 21.04 Dev1428425670SE +/- 0.42, N = 3SE +/- 0.80, N = 361.2159.19MIN: 59 / MAX: 63.14MIN: 56.39 / MAX: 62.22

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian DragonClear Linux 34420Ubuntu 21.04 Dev1632486480SE +/- 0.56, N = 3SE +/- 0.04, N = 370.4870.72MIN: 67.48 / MAX: 74.56MIN: 68 / MAX: 73.5

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pClear Linux 34420Ubuntu 21.04 Dev0.03240.06480.09720.12960.162SE +/- 0.001, N = 3SE +/- 0.000, N = 30.1440.1241. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pClear Linux 34420Ubuntu 21.04 Dev3691215SE +/- 0.032, N = 3SE +/- 0.060, N = 39.0097.6881. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pClear Linux 34420Ubuntu 21.04 Dev20406080100SE +/- 0.78, N = 5SE +/- 0.35, N = 374.6354.991. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pClear Linux 34420Ubuntu 21.04 Dev714212835SE +/- 0.10, N = 3SE +/- 0.04, N = 330.0928.55-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pClear Linux 34420Ubuntu 21.04 Dev60120180240300SE +/- 2.46, N = 3SE +/- 2.15, N = 3270.36222.57-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pClear Linux 34420Ubuntu 21.04 Dev100200300400500SE +/- 5.37, N = 3SE +/- 2.14, N = 3480.50329.16-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pClear Linux 34420Ubuntu 21.04 Dev80160240320400SE +/- 2.43, N = 3SE +/- 2.04, N = 3348.26268.71-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pClear Linux 34420Ubuntu 21.04 Dev80160240320400SE +/- 1.54, N = 3SE +/- 2.74, N = 3356.90266.40-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pClear Linux 34420Ubuntu 21.04 Dev60120180240300SE +/- 1.15, N = 3SE +/- 1.85, N = 8291.24209.93-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Intel Open Image Denoise

Scene: Memorial

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialClear Linux 34420Ubuntu 21.04 Dev918273645SE +/- 0.28, N = 3SE +/- 0.37, N = 337.4435.75

OpenVKL

Benchmark: vklBenchmark

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkClear Linux 34420Ubuntu 21.04 Dev110220330440550SE +/- 2.91, N = 3SE +/- 4.45, N = 9511404MIN: 1 / MAX: 1812MIN: 1 / MAX: 1613

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total TimeClear Linux 34420Ubuntu 21.04 Dev30M60M90M120M150MSE +/- 1037933.70, N = 15SE +/- 1671020.36, N = 15136190317132800942-pipe -fexceptions -fstack-protector -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CXX) g++ options: -lgcov -m64 -lpthread -O3 -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2Clear Linux 34420Ubuntu 21.04 Dev816243240SE +/- 0.42, N = 3SE +/- 0.20, N = 330.1234.861. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6Clear Linux 34420Ubuntu 21.04 Dev48121620SE +/- 0.11, N = 7SE +/- 0.13, N = 1511.7315.091. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10Clear Linux 34420Ubuntu 21.04 Dev1.24022.48043.72064.96086.201SE +/- 0.020, N = 3SE +/- 0.042, N = 153.6695.5121. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessClear Linux 34420Ubuntu 21.04 Dev816243240SE +/- 0.32, N = 5SE +/- 0.06, N = 328.7134.661. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, LosslessClear Linux 34420Ubuntu 21.04 Dev246810SE +/- 0.039, N = 3SE +/- 0.016, N = 36.4428.7251. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.38670.77341.16011.54681.9335SE +/- 0.01330, N = 3SE +/- 0.01259, N = 151.718881.65179-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 1.52MIN: 1.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.23750.4750.71250.951.1875SE +/- 0.013211, N = 15SE +/- 0.004748, N = 31.0387141.055460-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 0.69MIN: 0.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.91431.82862.74293.65724.5715SE +/- 0.01053, N = 3SE +/- 0.03832, N = 33.623754.06372-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 3.41MIN: 3.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.67681.35362.03042.70723.384SE +/- 0.00683, N = 3SE +/- 0.12420, N = 152.110613.00805-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 2.02MIN: 2.041. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.93931.87862.81793.75724.6965SE +/- 0.02129, N = 3SE +/- 0.03967, N = 33.818934.17484-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 3.72MIN: 3.551. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.10110.20220.30330.40440.5055SE +/- 0.001058, N = 3SE +/- 0.006464, N = 30.3837720.449278-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 0.34MIN: 0.341. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.08620.17240.25860.34480.431SE +/- 0.003975, N = 15SE +/- 0.002916, N = 30.3093970.383070-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 0.28MIN: 0.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev2004006008001000SE +/- 3.62, N = 3SE +/- 9.54, N = 7900.341092.42-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 883.53MIN: 1002.121. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.79151.5832.37453.1663.9575SE +/- 0.00586, N = 3SE +/- 0.03291, N = 33.195913.51789-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 2.99MIN: 2.941. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev1.12632.25263.37894.50525.6315SE +/- 0.01370, N = 3SE +/- 0.00960, N = 34.039065.00584-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 3.94MIN: 4.261. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev1.05092.10183.15274.20365.2545SE +/- 0.00032, N = 3SE +/- 0.01675, N = 34.428304.67069-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 4.39MIN: 4.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev130260390520650SE +/- 1.18, N = 3SE +/- 7.21, N = 3476.66613.07-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 466MIN: 556.951. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev2004006008001000SE +/- 2.11, N = 3SE +/- 5.62, N = 3895.451087.27-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 885.58MIN: 1031.431. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev130260390520650SE +/- 2.69, N = 3SE +/- 8.22, N = 3477.72615.29-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 464.94MIN: 568.581. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.09280.18560.27840.37120.464SE +/- 0.002990, N = 7SE +/- 0.004170, N = 30.3302180.412264-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 0.28MIN: 0.281. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUClear Linux 34420Ubuntu 21.04 Dev0.2120.4240.6360.8481.06SE +/- 0.006956, N = 3SE +/- 0.004903, N = 30.8330800.942426-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 0.77MIN: 0.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Clear Linux 34420Ubuntu 21.04 Dev130260390520650SE +/- 0.92, N = 3SE +/- 0.98, N = 3614.46547.501. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57Clear Linux 34420Ubuntu 21.04 Dev13M26M39M52M65MSE +/- 6887.99, N = 3SE +/- 478618.06, N = 65871233348945667-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57Clear Linux 34420Ubuntu 21.04 Dev200M400M600M800M1000MSE +/- 1029827.39, N = 3SE +/- 2162963.50, N = 3822863333701193333-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57Clear Linux 34420Ubuntu 21.04 Dev300M600M900M1200M1500MSE +/- 3507293.99, N = 3SE +/- 9745141.24, N = 315186333331320366667-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57Clear Linux 34420Ubuntu 21.04 Dev500M1000M1500M2000M2500MSE +/- 22409893.65, N = 3SE +/- 33333.33, N = 323159000002133166667-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 112 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 112 - Buffer Length: 256 - Filter Length: 57Clear Linux 34420Ubuntu 21.04 Dev600M1200M1800M2400M3000MSE +/- 3573202.73, N = 3SE +/- 5168171.82, N = 328536333332363400000-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bareClear Linux 34420Ubuntu 21.04 Dev1.29582.59163.88745.18326.479SE +/- 0.020, N = 3SE +/- 0.018, N = 35.7595.710-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake1. (CXX) g++ options: -O3 -pthread

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetClear Linux 34420Ubuntu 21.04 Dev20K40K60K80K100KSE +/- 312.31, N = 3SE +/- 1493.06, N = 361863.5105347.0

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4Clear Linux 34420Ubuntu 21.04 Dev300K600K900K1200K1500KSE +/- 3472.47, N = 3SE +/- 12025.89, N = 78831141279019

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileClear Linux 34420Ubuntu 21.04 Dev60K120K180K240K300KSE +/- 1598.38, N = 15SE +/- 3037.32, N = 1597445.7282417.0

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatClear Linux 34420Ubuntu 21.04 Dev20K40K60K80K100KSE +/- 591.27, N = 3SE +/- 1637.30, N = 1251318.383813.3

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantClear Linux 34420Ubuntu 21.04 Dev20K40K60K80K100KSE +/- 584.51, N = 3SE +/- 1321.28, N = 1549894.183725.4

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2Clear Linux 34420Ubuntu 21.04 Dev200K400K600K800K1000KSE +/- 5456.47, N = 3SE +/- 3000.84, N = 38116381151590

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0Clear Linux 34420Ubuntu 21.04 Dev246810SE +/- 0.099, N = 15SE +/- 0.073, N = 158.1578.599-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 5.82 / MAX: 12.06MIN: 7.15 / MAX: 17.811. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50Clear Linux 34420Ubuntu 21.04 Dev714212835SE +/- 0.13, N = 15SE +/- 0.13, N = 1525.7329.32-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 24.56 / MAX: 67.2MIN: 27.43 / MAX: 115.461. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224Clear Linux 34420Ubuntu 21.04 Dev1.0982.1963.2944.3925.49SE +/- 0.059, N = 15SE +/- 0.057, N = 154.7504.880-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 3.76 / MAX: 9.4MIN: 3.83 / MAX: 16.181. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.0Clear Linux 34420Ubuntu 21.04 Dev0.72881.45762.18642.91523.644SE +/- 0.010, N = 15SE +/- 0.038, N = 152.6723.239-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 2.48 / MAX: 4.13MIN: 2.44 / MAX: 9.71. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3Clear Linux 34420Ubuntu 21.04 Dev816243240SE +/- 0.11, N = 15SE +/- 0.13, N = 1532.4834.88-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 29.61 / MAX: 94.15MIN: 33.07 / MAX: 87.631. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2Clear Linux 34420Ubuntu 21.04 Dev90180270360450SE +/- 0.21, N = 3SE +/- 3.34, N = 3330.63434.39-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 329.78 / MAX: 333.62MIN: 363.56 / MAX: 546.491. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1Clear Linux 34420Ubuntu 21.04 Dev70140210280350SE +/- 0.12, N = 3SE +/- 0.22, N = 3330.16343.35-pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake - MIN: 329.51 / MAX: 332.56MIN: 341.6 / MAX: 345.441. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: BedroomClear Linux 34420Ubuntu 21.04 Dev246810SE +/- 0.013, N = 3SE +/- 0.011, N = 38.5768.297

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: SupercarClear Linux 34420Ubuntu 21.04 Dev510152025SE +/- 0.11, N = 3SE +/- 0.04, N = 320.2019.58

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: CPU-OnlyClear Linux 34420Ubuntu 21.04 Dev918273645SE +/- 0.10, N = 3SE +/- 0.14, N = 337.8438.52

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: CPU-OnlyClear Linux 34420Ubuntu 21.04 Dev20406080100SE +/- 0.05, N = 3SE +/- 36.38, N = 1255.7196.03

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPUClear Linux 34420Ubuntu 21.04 Dev11K22K33K44K55KSE +/- 217.08, N = 3SE +/- 231.66, N = 34908246573

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression SpeedUbuntu 21.04 Dev6001200180024003000SE +/- 4.70, N = 152814.21. (CC) gcc options: -O3 -pthread -lz -llzma


Phoronix Test Suite v10.8.4