Xeon Gold 6226R March 2021

Intel Xeon Gold 6226R testing with a Supermicro X11SPL-F v1.02 (3.1 BIOS) and ASPEED on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2103284-PTS-XEONGOLD61&grs&sor.

Xeon Gold 6226R March 2021ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen Resolution123Intel Xeon Gold 6226R @ 3.90GHz (16 Cores / 32 Threads)Supermicro X11SPL-F v1.02 (3.1 BIOS)Intel Sky Lake-E DMI3 Registers188GB280GB INTEL SSDPED1D280GAASPEED2 x Intel I210Ubuntu 20.105.11.0-rc4-max-boost-inv-patch (x86_64) 20210121GNOME Shell 3.38.1X Server 1.20.9GCC 10.2.0ext41024x768OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003003Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Xeon Gold 6226R March 2021dav1d: Summer Nature 1080pgnuradio: FIR Filterjpegxl-decode: Allviennacl: CPU BLAS - dGEMM-NTdav1d: Summer Nature 4Kgnuradio: Hilbert Transformviennacl: CPU BLAS - dGEMM-TTstockfish: Total Timeviennacl: CPU BLAS - dGEMM-TNaom-av1: Speed 9 Realtime - Bosphorus 1080paom-av1: Speed 9 Realtime - Bosphorus 4Kgnuradio: Signal Source (Cosine)toybrot: TBBonednn: Deconvolution Batch shapes_3d - f32 - CPUaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 4Kcompress-zstd: 19 - Compression Speedonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dDOTaom-av1: Speed 8 Realtime - Bosphorus 4Kjpegxl: JPEG - 5avifenc: 10gnuradio: Five Back to Back FIR Filtersngspice: C2670svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080pcompress-zstd: 3, Long Mode - Compression Speedjpegxl: JPEG - 8simdjson: Kostyaonednn: IP Shapes 3D - bf16bf16bf16 - CPUaom-av1: Speed 8 Realtime - Bosphorus 1080pcompress-zstd: 3 - Decompression Speedincompact3d: input.i3d 129 Cells Per Directioncompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19 - Decompression Speedavifenc: 6, Losslessonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUcompress-zstd: 3 - Compression Speedonednn: Recurrent Neural Network Inference - f32 - CPUliquid-dsp: 2 - 256 - 57jpegxl-decode: 1viennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYjpegxl: PNG - 8avifenc: 10, Losslesscompress-zstd: 8 - Compression Speedsvt-hevc: 10 - Bosphorus 1080pngspice: C7552avifenc: 2jpegxl: PNG - 7aom-av1: Speed 6 Realtime - Bosphorus 1080ponednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUbuild-linux-kernel: Time To Compiletoybrot: OpenMPavifenc: 0onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUbotan: ChaCha20Poly1305svt-hevc: 7 - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080pjpegxl: JPEG - 7svt-vp9: VMAF Optimized - Bosphorus 1080psrslte: OFDM_Testonednn: IP Shapes 3D - u8s8f32 - CPUavifenc: 6viennacl: CPU BLAS - sCOPYonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUviennacl: CPU BLAS - dAXPYgcrypt: basis: ETC1Sbuild-mesa: Time To Compileonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUjpegxl: PNG - 5liquid-dsp: 4 - 256 - 57viennacl: CPU BLAS - dGEMV-Tastcenc: Mediumviennacl: CPU BLAS - dCOPYcompress-zstd: 3, Long Mode - Decompression Speedbotan: ChaCha20Poly1305 - Decryptonednn: Recurrent Neural Network Training - u8s8f32 - CPUsrslte: PHY_DL_Testluaradio: Complex Phaseluaradio: FM Deemphasis Filterluaradio: Five Back to Back FIR Filterstoybrot: C++ Tasksonednn: IP Shapes 3D - f32 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUliquid-dsp: 1 - 256 - 57onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUbotan: CAST-256viennacl: CPU BLAS - dGEMV-Npennant: sedovbigsvt-hevc: 1 - Bosphorus 1080ponednn: IP Shapes 1D - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUpennant: leblancbigcompress-zstd: 8, Long Mode - Decompression Speedbotan: CAST-256 - Decryptonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUaom-av1: Speed 4 Two-Pass - Bosphorus 4Kliquid-dsp: 8 - 256 - 57botan: KASUMIliquid-dsp: 16 - 256 - 57botan: AES-256botan: Blowfishaom-av1: Speed 6 Two-Pass - Bosphorus 1080pastcenc: Thoroughonednn: IP Shapes 1D - f32 - CPUbasis: UASTC Level 0build-erlang: Time To Compileonednn: Recurrent Neural Network Training - f32 - CPUgromacs: water_GMX50_barebotan: AES-256 - Decryptbotan: Blowfish - Decryptsimdjson: DistinctUserIDbasis: UASTC Level 2srslte: PHY_DL_Testbotan: Twofishbasis: UASTC Level 3compress-zstd: 8 - Decompression Speedonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUbotan: Twofish - Decryptincompact3d: input.i3d 193 Cells Per Directionbuild-nodejs: Time To Compileluaradio: Hilbert Transformtoybrot: C++ Threadsbotan: KASUMI - Decryptastcenc: Exhaustiveliquid-dsp: 32 - 256 - 57gmpbench: Total Timeaom-av1: Speed 4 Two-Pass - Bosphorus 1080paom-av1: Speed 0 Two-Pass - Bosphorus 1080paom-av1: Speed 0 Two-Pass - Bosphorus 4Ksimdjson: PartialTweetssimdjson: LargeRandonednn: Deconvolution Batch shapes_1d - f32 - CPUgnuradio: FM Deemphasis Filtergnuradio: IIR Filter123269.99533.9198.3556.2174.26323.859.23870274158.248.8521.882257.7293763.472387.524.19590.90602257.310816.9343.464.504726.7200.497267.34367.420.752.342.8367838.543069.414.0785799429.540.72729.22670.745.573993.8182913.21008.888893400032.081241250.637.448686.5307.85171.35637.7447.1611.0912.36166.692907170.1851764.31670.062152.36201.9641.56267.031128000001.3921514.23283.74.61416105236.84828.83142.660.597683990.54440.731770200001166.715571.73303.5662.7251771.9978.3469.5357.9857.6286183.4270112.7549492190001.09663128.31599.656.2035810.410.5413642.2703427.251853269.1128.8185.881320.5810514.700482.3634371000083.9616389500003647.61407.1718.4710.63312.551489.041121.21779.671.8063659.312403.444.0325.422190.6332.66845.2133114.69.83488332.01752.4951591329.50664.92816381.21478.29817288100004757.73.360.340.133.380.835.55575645.1412.6271.02522.9179.7655.5172.64308.557.94024854957.748.1223.022152.7282873.408387.784.1360.40.93017555.710816.7442.364.612717.5205.123269.85359.220.302.322.8975638.143115.113.8286765436.741.42677.62701.145.7321005.652935.01003.289039833332.201251250.647.463676.2311.16171.17238.2897.1211.0912.423565.8502888871.0741784.29662.065152.04204.1342.04264.141138666671.4069014.30683.94.57869106235.25528.92742.8620.592321999.49040.791782100001166.713571.73291.7657.5301785.7277.8466.1359.7853.9288033.4171712.6919489783331.09014128.91799.856.4599910.380.5387772.2612627.367803254.0129.4095.880650.5818144.688142.3734262333383.6156382733333645.653408.1098.4610.66722.542749.035120.8161774.741.8093661.517404.4474.0325.465190.4333.13545.2993117.59.84755332.57452.5711772329.81665.02820181.24778.30657285566674759.53.360.340.133.380.834.50935588.5448.3317.53479.1178.4250.9187.9297.754.74122171154.846.1722.952193.4281343.33587.814.05610.90389757.111116.4942.764.619735.4202.75273.5359.320.692.372.8892438.943133.414.1164236438.241.52721.42721.446.4151011.782883991.1598979100031.681261270.637.35678.3312.5173.73538.2317.0610.9412.524566.7212869370.4431786.55663.9153.85201.7841.91266.291140000001.3926114.1784.54.62231105237.49129.09542.4760.597077995.40841.091785700001156.770872.33318.7658.1281779.4377.7468.5357.1851.8287463.4051912.6749492870001.09422129.00699.356.4781510.430.5393352.2720727.378543258.6129.0185.855890.57934.680432.3634406000083.8536363700003633.666408.6918.4910.66972.547769.011120.8111779.421.8043651.744404.0194.0425.479190.2332.44845.2113120.39.83093332.23852.5831146330.035652816481.31478.36747282400004756.23.360.340.133.380.833.98082531.6435.9OpenBenchmarking.org

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080p32170140210280350SE +/- 0.83, N = 3317.53271.02269.99MIN: 291.94 / MAX: 371.15MIN: 239.91 / MAX: 295.67MIN: 246.57 / MAX: 294.051. (CC) gcc options: -pthread

GNU Radio

Test: FIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filter123120240360480600SE +/- 12.50, N = 3533.9522.9479.11. 3.8.1.0

JPEG XL Decoding

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.3CPU Threads: All1234080120160200SE +/- 1.38, N = 3198.35179.76178.42

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NT1231326395265SE +/- 0.57, N = 356.255.550.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4K3124080120160200SE +/- 0.57, N = 3187.90174.26172.64MIN: 172.38 / MAX: 222.92MIN: 157.96 / MAX: 208.32MIN: 139.18 / MAX: 203.21. (CC) gcc options: -pthread

GNU Radio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transform12370140210280350SE +/- 5.15, N = 3323.8308.5297.71. 3.8.1.0

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TT1231326395265SE +/- 1.65, N = 359.257.954.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time3219M18M27M36M45MSE +/- 343002.27, N = 34122171140248549387027411. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TN1231326395265SE +/- 1.21, N = 358.257.754.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p1231122334455SE +/- 0.69, N = 348.8548.1246.171. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K231612182430SE +/- 0.09, N = 323.0222.9521.881. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

GNU Radio

Test: Signal Source (Cosine)

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)1325001000150020002500SE +/- 54.15, N = 32257.72193.42152.71. 3.8.1.0

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBB3216K12K18K24K30KSE +/- 71.70, N = 32813428287293761. (CXX) g++ options: -O3 -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU3210.78131.56262.34393.12523.9065SE +/- 0.03666, N = 33.335803.408383.47238MIN: 3.17MIN: 3.17MIN: 3.171. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K321246810SE +/- 0.01, N = 37.817.787.521. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K1230.94281.88562.82843.77124.714SE +/- 0.05, N = 34.194.134.051. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression Speed3211428425670SE +/- 0.67, N = 361.060.459.01. (CC) gcc options: -O3 -pthread -lz -llzma

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3120.20930.41860.62790.83721.0465SE +/- 0.013165, N = 30.9038970.9060220.930175MIN: 0.83MIN: 0.84MIN: 0.841. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NN1321326395265SE +/- 1.50, N = 357.357.155.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOT32120406080100SE +/- 0.58, N = 31111081081. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K12348121620SE +/- 0.17, N = 316.9316.7416.491. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 51321020304050SE +/- 0.23, N = 343.4642.7642.361. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 101231.03932.07863.11794.15725.1965SE +/- 0.015, N = 34.5044.6124.6191. (CXX) g++ options: -O3 -fPIC -lm

GNU Radio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filters312160320480640800SE +/- 12.25, N = 3735.4726.7717.51. 3.8.1.0

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C267013250100150200250SE +/- 1.40, N = 3200.50202.75205.121. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p32160120180240300SE +/- 2.98, N = 3273.50269.85267.341. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Compression Speed13280160240320400SE +/- 1.95, N = 3367.4359.3359.21. (CC) gcc options: -O3 -pthread -lz -llzma

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 8132510152025SE +/- 0.19, N = 320.7520.6920.301. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: Kostya3120.53331.06661.59992.13322.6665SE +/- 0.01, N = 32.372.342.321. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU1320.6521.3041.9562.6083.26SE +/- 0.00361, N = 32.836782.889242.89756MIN: 2.55MIN: 2.57MIN: 2.551. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p312918273645SE +/- 0.41, N = 338.9438.5438.141. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Decompression Speed3217001400210028003500SE +/- 9.25, N = 23133.43115.13069.41. (CC) gcc options: -O3 -pthread -lz -llzma

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction21348121620SE +/- 0.20, N = 313.8314.0814.121. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression Speed32190180270360450SE +/- 1.59, N = 3438.2436.7429.51. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression Speed321918273645SE +/- 0.15, N = 341.541.440.71. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression Speed1326001200180024003000SE +/- 16.34, N = 32729.22721.42677.61. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression Speed3216001200180024003000SE +/- 15.06, N = 32721.42701.12670.71. (CC) gcc options: -O3 -pthread -lz -llzma

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, Lossless1231122334455SE +/- 0.12, N = 345.5745.7346.421. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1232004006008001000SE +/- 0.52, N = 3993.821005.651011.78MIN: 968.8MIN: 980.08MIN: 985.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Compression Speed2136001200180024003000SE +/- 17.00, N = 32935.02913.22883.01. (CC) gcc options: -O3 -pthread -lz -llzma

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU3212004006008001000SE +/- 0.67, N = 3991.161003.281008.88MIN: 968.71MIN: 978.75MIN: 982.031. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 5723120M40M60M80M100MSE +/- 69848.25, N = 39039833389791000889340001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

JPEG XL Decoding

CPU Threads: 1

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.3CPU Threads: 1213714212835SE +/- 0.20, N = 332.2032.0831.68

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOT3213060901201501261251241. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPY321306090120150SE +/- 0.58, N = 31271251251. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 82310.1440.2880.4320.5760.72SE +/- 0.00, N = 30.640.630.631. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, Lossless312246810SE +/- 0.020, N = 37.3507.4487.4631. (CXX) g++ options: -O3 -fPIC -lm

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Compression Speed132150300450600750SE +/- 1.10, N = 3686.5678.3676.21. (CC) gcc options: -O3 -pthread -lz -llzma

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p32170140210280350SE +/- 1.08, N = 3312.50311.16307.851. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C75522134080120160200SE +/- 1.27, N = 3171.17171.36173.741. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2132918273645SE +/- 0.21, N = 337.7438.2338.291. (CXX) g++ options: -O3 -fPIC -lm

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 7123246810SE +/- 0.03, N = 37.167.127.061. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p2133691215SE +/- 0.01, N = 311.0911.0910.941. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU1233691215SE +/- 0.07, N = 312.3612.4212.52MIN: 11.45MIN: 11.45MIN: 11.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To Compile2131530456075SE +/- 0.38, N = 365.8566.6966.72

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP3216K12K18K24K30KSE +/- 142.73, N = 32869328888290711. (CXX) g++ options: -O3 -lpthread

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 01321632486480SE +/- 0.36, N = 370.1970.4471.071. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU123400800120016002000SE +/- 4.93, N = 31764.311784.291786.55MIN: 1737.4MIN: 1739.16MIN: 1755.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305132140280420560700SE +/- 1.77, N = 3670.06663.90662.071. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p312306090120150SE +/- 0.36, N = 3153.85152.36152.041. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p2134080120160200SE +/- 0.48, N = 3204.13201.96201.781. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 72311020304050SE +/- 0.43, N = 342.0441.9141.561. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p13260120180240300SE +/- 1.43, N = 3267.03266.29264.141. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

srsLTE

Test: OFDM_Test

OpenBenchmarking.orgSamples / Second, More Is BettersrsLTE 20.10.1Test: OFDM_Test32120M40M60M80M100MSE +/- 348010.22, N = 31140000001138666671128000001. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1320.31660.63320.94981.26641.583SE +/- 0.01012, N = 31.392151.392611.40690MIN: 1.22MIN: 1.21MIN: 1.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 631248121620SE +/- 0.03, N = 314.1714.2314.311. (CXX) g++ options: -O3 -fPIC -lm

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPY32120406080100SE +/- 0.09, N = 384.583.983.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU2131.042.083.124.165.2SE +/- 0.01741, N = 34.578694.614164.62231MIN: 4.13MIN: 4.16MIN: 4.171. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPY231204060801001061051051. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.921350100150200250SE +/- 0.71, N = 3235.26236.85237.491. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1S123714212835SE +/- 0.12, N = 328.8328.9329.101. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile3121020304050SE +/- 0.03, N = 342.4842.6642.86

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU2310.13450.2690.40350.5380.6725SE +/- 0.002509, N = 30.5923210.5970770.597683MIN: 0.51MIN: 0.52MIN: 0.521. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1322004006008001000SE +/- 4.21, N = 3990.54995.41999.49MIN: 959.7MIN: 971.26MIN: 966.551. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 5321918273645SE +/- 0.18, N = 341.0940.7940.731. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 5732140M80M120M160M200MSE +/- 507181.76, N = 31785700001782100001770200001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-T2133060901201501161161151. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium213246810SE +/- 0.0030, N = 36.71356.71556.77081. (CXX) g++ options: -O3 -flto -pthread

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPY3211632486480SE +/- 0.15, N = 372.371.771.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Decompression Speed3127001400210028003500SE +/- 15.06, N = 33318.73303.53291.71. (CC) gcc options: -O3 -pthread -lz -llzma

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decrypt132140280420560700SE +/- 0.48, N = 3662.73658.13657.531. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU132400800120016002000SE +/- 1.23, N = 31771.991779.431785.72MIN: 1740.69MIN: 1751.55MIN: 1752.231. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgUE Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Test12320406080100SE +/- 0.10, N = 378.377.877.71. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

LuaRadio

Test: Complex Phase

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phase132100200300400500SE +/- 0.97, N = 3469.5468.5466.1

LuaRadio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filter21380160240320400SE +/- 0.09, N = 3359.7357.9357.1

LuaRadio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filters1232004006008001000SE +/- 2.65, N = 3857.6853.9851.8

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks1326K12K18K24K30KSE +/- 160.28, N = 32861828746288031. (CXX) g++ options: -O3 -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU3210.77111.54222.31333.08443.8555SE +/- 0.00459, N = 33.405193.417173.42701MIN: 3.12MIN: 3.11MIN: 3.111. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU3213691215SE +/- 0.02, N = 312.6712.6912.75MIN: 12.37MIN: 12.41MIN: 12.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 5731211M22M33M44M55MSE +/- 214667.44, N = 34928700049219000489783331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU2310.24670.49340.74010.98681.2335SE +/- 0.00360, N = 31.090141.094221.09663MIN: 0.94MIN: 0.95MIN: 0.951. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256321306090120150SE +/- 0.06, N = 3129.01128.92128.321. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-N21320406080100SE +/- 0.59, N = 399.899.699.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig1231326395265SE +/- 0.02, N = 356.2056.4656.481. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p3123691215SE +/- 0.03, N = 310.4310.4110.381. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU2310.12180.24360.36540.48720.609SE +/- 0.000426, N = 30.5387770.5393350.541364MIN: 0.47MIN: 0.48MIN: 0.471. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU2130.51121.02241.53362.04482.556SE +/- 0.00606, N = 32.261262.270342.27207MIN: 1.97MIN: 2.01MIN: 2.011. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig123612182430SE +/- 0.01, N = 327.2527.3727.381. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression Speed1327001400210028003500SE +/- 16.76, N = 33269.13258.63254.01. (CC) gcc options: -O3 -pthread -lz -llzma

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decrypt231306090120150SE +/- 0.02, N = 3129.41129.02128.821. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU3211.32332.64663.96995.29326.6165SE +/- 0.01210, N = 35.855895.880655.88132MIN: 5.47MIN: 5.44MIN: 5.431. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU3120.13090.26180.39270.52360.6545SE +/- 0.001705, N = 30.5793000.5810510.581814MIN: 0.52MIN: 0.52MIN: 0.521. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU3211.05762.11523.17284.23045.288SE +/- 0.00914, N = 34.680434.688144.70048MIN: 4.27MIN: 4.24MIN: 4.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K2310.53331.06661.59992.13322.6665SE +/- 0.01, N = 32.372.362.361. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 5731270M140M210M280M350MSE +/- 416706.66, N = 33440600003437100003426233331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI13220406080100SE +/- 0.31, N = 383.9683.8583.621. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57123140M280M420M560M700MSE +/- 451897.24, N = 36389500006382733336363700001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-2561238001600240032004000SE +/- 0.22, N = 33647.613645.653633.671. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish32190180270360450SE +/- 0.24, N = 3408.69408.11407.171. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p312246810SE +/- 0.01, N = 38.498.478.461. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough1233691215SE +/- 0.01, N = 310.6310.6710.671. (CXX) g++ options: -O3 -flto -pthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU2310.57411.14821.72232.29642.8705SE +/- 0.00338, N = 32.542742.547762.55148MIN: 2.26MIN: 2.27MIN: 2.271. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 03213691215SE +/- 0.006, N = 39.0119.0359.0411. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Timed Erlang/OTP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To Compile321306090120150SE +/- 0.32, N = 3120.81120.82121.20

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU231400800120016002000SE +/- 5.17, N = 31774.741779.421779.67MIN: 1741.55MIN: 1751.79MIN: 1752.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare2130.4070.8141.2211.6282.035SE +/- 0.003, N = 31.8091.8061.8041. (CXX) g++ options: -O3 -pthread

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decrypt2138001600240032004000SE +/- 0.85, N = 33661.523659.313651.741. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decrypt23190180270360450SE +/- 0.15, N = 3404.45404.02403.441. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserID3210.9091.8182.7273.6364.545SE +/- 0.00, N = 34.044.034.031. (CXX) g++ options: -O3 -pthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2123612182430SE +/- 0.04, N = 325.4225.4725.481. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgeNb Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Test1234080120160200SE +/- 0.29, N = 3190.6190.4190.21. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish21370140210280350SE +/- 0.60, N = 3333.14332.67332.451. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 33121020304050SE +/- 0.05, N = 345.2145.2145.301. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Decompression Speed3217001400210028003500SE +/- 3.38, N = 33120.33117.53114.61. (CC) gcc options: -O3 -pthread -lz -llzma

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU3123691215SE +/- 0.00857, N = 39.830939.834889.84755MIN: 9.11MIN: 9.08MIN: 9.081. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decrypt23170140210280350SE +/- 0.24, N = 3332.57332.24332.021. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction1231224364860SE +/- 0.28, N = 352.5052.5752.581. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compile12370140210280350SE +/- 0.41, N = 3329.51329.82330.04

LuaRadio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transform3211530456075SE +/- 0.03, N = 365.065.064.9

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads1326K12K18K24K30KSE +/- 43.02, N = 32816328164282011. (CXX) g++ options: -O3 -lpthread

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decrypt32120406080100SE +/- 0.35, N = 381.3181.2581.211. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive12320406080100SE +/- 0.01, N = 378.3078.3178.371. (CXX) g++ options: -O3 -flto -pthread

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57123160M320M480M640M800MSE +/- 131698.31, N = 37288100007285566677282400001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time213100020003000400050004759.54757.74756.21. (CC) gcc options: -O3 -fomit-frame-pointer -lm

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p3210.7561.5122.2683.0243.78SE +/- 0.00, N = 33.363.363.361. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p3210.07650.1530.22950.3060.3825SE +/- 0.00, N = 30.340.340.341. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K3210.02930.05860.08790.11720.1465SE +/- 0.00, N = 30.130.130.131. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweets3210.76051.5212.28153.0423.8025SE +/- 0.00, N = 33.383.383.381. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandom3210.18680.37360.56040.74720.934SE +/- 0.00, N = 30.830.830.831. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU3211.252.53.7556.25SE +/- 0.09201, N = 153.980824.509355.55575MIN: 3.27MIN: 3.22MIN: 3.261. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

GNU Radio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filter123140280420560700SE +/- 22.66, N = 3645.1588.5531.61. 3.8.1.0

GNU Radio

Test: IIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filter231100200300400500SE +/- 15.90, N = 3448.3435.9412.61. 3.8.1.0


Phoronix Test Suite v10.8.5