990KS March

Intel Core i9-9900KS testing with a ASUS PRIME Z390-A (1502 BIOS) and ASUS Intel UHD 630 CFL GT2 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104012-IB-990KSMARC41&grs&sro.

990KS MarchProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen Resolution123Intel Core i9-9900KS @ 5.00GHz (8 Cores / 16 Threads)ASUS PRIME Z390-A (1502 BIOS)Intel Cannon Lake PCH32GB240GB Corsair Force MP510ASUS Intel UHD 630 CFL GT2 3GB (1200MHz)Realtek ALC1220G237HLIntel I219-VUbuntu 20.045.9.0-050900rc8daily20201005-generic (x86_64) 20201004GNOME Shell 3.36.2X Server 1.20.84.6 Mesa 20.2.6OpenCL 2.1GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xcc - Thermald 1.9.1 Python Details- Python 2.7.18rc1 + Python 3.8.2Security Details- itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

990KS Marchcompress-zstd: 8, Long Mode - Compression Speedaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 1080paom-av1: Speed 0 Two-Pass - Bosphorus 1080pshoc: OpenCL - Triadopenscad: Projector Mount Swiveldav1d: Chimera 1080p 10-bitaom-av1: Speed 9 Realtime - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 1080pcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 8 - Compression Speedsrslte: OFDM_Teststockfish: Total Timeonednn: IP Shapes 3D - f32 - CPUcompress-zstd: 19, Long Mode - Compression Speedmnn: SqueezeNetV1.0viennacl: OpenCL BLAS - sCOPYaom-av1: Speed 9 Realtime - Bosphorus 1080pmnn: inception-v3basis: UASTC Level 0onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUsrslte: PHY_DL_Testsrslte: PHY_DL_Testmnn: MobileNetV2_224onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUaom-av1: Speed 4 Two-Pass - Bosphorus 4Kluaradio: Complex Phasecompress-zstd: 8, Long Mode - Decompression Speedmnn: mobilenet-v1-1.0svt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080pshoc: OpenCL - S3Dopenscad: Leonardo Phone Case Slimavifenc: 0aom-av1: Speed 6 Two-Pass - Bosphorus 4Kmnn: resnet-v2-50openscad: Retro Carcompress-zstd: 19, Long Mode - Decompression Speedviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dCOPYcompress-zstd: 3 - Compression Speedgnuradio: Five Back to Back FIR Filtersluaradio: FM Deemphasis Filtersysbench: RAM / Memorycompress-zstd: 3 - Decompression Speedviennacl: OpenCL BLAS - dGEMV-Ndav1d: Summer Nature 4Kgmpbench: Total Timesvt-vp9: Visual Quality Optimized - Bosphorus 1080pavifenc: 10gnuradio: FIR Filteraom-av1: Speed 6 Realtime - Bosphorus 1080ponednn: IP Shapes 3D - u8s8f32 - CPUliquid-dsp: 2 - 256 - 57luaradio: Five Back to Back FIR Filtersonednn: Deconvolution Batch shapes_3d - f32 - CPUluaradio: Hilbert Transformbuild-erlang: Time To Compileavifenc: 6, Losslessaom-av1: Speed 4 Two-Pass - Bosphorus 1080psimdjson: DistinctUserIDonednn: Deconvolution Batch shapes_1d - f32 - CPUopenscad: Mini-ITX Caseavifenc: 6svt-hevc: 10 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080pbasis: ETC1Sastcenc: Mediumshoc: OpenCL - Bus Speed Readbackgnuradio: Hilbert Transformcompress-zstd: 19 - Decompression Speedviennacl: CPU BLAS - sCOPYbotan: Twofishbotan: ChaCha20Poly1305onednn: IP Shapes 1D - f32 - CPUsimdjson: Kostyadav1d: Summer Nature 1080paom-av1: Speed 6 Realtime - Bosphorus 4Kincompact3d: input.i3d 129 Cells Per Directionviennacl: CPU BLAS - dGEMM-NTcompress-zstd: 19 - Compression Speedviennacl: OpenCL BLAS - dGEMV-Tliquid-dsp: 4 - 256 - 57viennacl: OpenCL BLAS - dAXPYavifenc: 10, Losslessviennacl: CPU BLAS - dGEMM-TNgnuradio: Signal Source (Cosine)viennacl: OpenCL BLAS - sDOTbasis: UASTC Level 2dav1d: Chimera 1080ptoybrot: C++ Tasksonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUviennacl: CPU BLAS - sAXPYsimdjson: PartialTweetsliquid-dsp: 8 - 256 - 57viennacl: CPU BLAS - dGEMV-Nbuild-linux-kernel: Time To Compilecompress-zstd: 3, Long Mode - Decompression Speedopenscad: Pistolonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUavifenc: 2onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUshoc: OpenCL - Reductiongnuradio: IIR Filtertoybrot: C++ Threadstoybrot: TBBshoc: OpenCL - Bus Speed Downloadonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUsvt-hevc: 1 - Bosphorus 1080ponednn: Convolution Batch Shapes Auto - f32 - CPUbotan: AES-256 - Decryptbotan: Twofish - Decryptshoc: OpenCL - FFT SPgnuradio: FM Deemphasis Filteronednn: IP Shapes 1D - u8s8f32 - CPUincompact3d: input.i3d 193 Cells Per Directioncompress-zstd: 8 - Decompression Speedbuild-mesa: Time To Compilebotan: KASUMI - Decryptastcenc: Exhaustivebotan: Blowfishbuild-nodejs: Time To Compileastcenc: Thoroughonednn: Recurrent Neural Network Training - f32 - CPUsysbench: CPUbasis: UASTC Level 3onednn: Recurrent Neural Network Inference - u8s8f32 - CPUtoybrot: OpenMPliquid-dsp: 16 - 256 - 57botan: Blowfish - Decryptbotan: ChaCha20Poly1305 - Decryptshoc: OpenCL - GEMM SGEMM_Nonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUliquid-dsp: 1 - 256 - 57onednn: Recurrent Neural Network Training - u8s8f32 - CPUbotan: CAST-256 - Decryptshoc: OpenCL - Texture Read Bandwidthbotan: KASUMIshoc: OpenCL - Max SP Flopsshoc: OpenCL - MD5 Hashbotan: AES-256botan: CAST-256systemd-boot-total: Userspacesystemd-boot-total: Firmwaresystemd-boot-total: Loadersystemd-boot-total: Kernelsystemd-boot-total: Totalviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - sAXPYviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTaom-av1: Speed 0 Two-Pass - Bosphorus 4Ksimdjson: LargeRand123395.138.6819.680.3815.605392.931160.9147.78124.391164.0332.6126700000237778788.2610331.94.89932.9141.4829.4477.1723.97706293.2112.72.6723413.973.42704.24609.82.329173.31176.2920.993116.97572.0746.3425.30817.2593957.516.534.02379.41190.2486.828391.474186.537.9185.136250.3136.763.154822.325.061.832151301833331297.96.6399387.2113.00062.6746.764.538.8389841.90812.952230.27106.1923.4555.794929.0495618.03969.028.2436.078916.1083.577403.1648.2613.1634.337596934.134.534.825085666735.35.77836.53003.237.733.461733.644475614.033542.84.3845271333345.594.7144433.4100.7671854.8837.2533.4400336.7934666.1448544405829.21193.154336.8514.28304806.102437.29715.3682836.71.66747121.6911704312.456.415107.861113.8873543.597477.09714.92073408.1619380.3463.5431852.2643866500333333539.238908.311245.9201852.132.14490676296673414.52172.09156.9099112.9801760.700.38614815.972171.9542210915679355519412405015.715.818.93834.635.534.846.544.139.326.247.90.121.07378.437.9419.110.3715.233693.908164.4048.72122.641156.9327.6128466667235015918.1469531.84.88233.3140.3929.6707.0913.93198291.8112.22.6553445.233.41700.24601.32.333172.24175.2720.851816.94172.2586.3225.25417.3463963.916.534.12374.51195.5487.728491.744164.538.1184.256221.8136.223.160820.024.951.829101300466671292.46.6465987.1112.55862.3896.734.558.8650142.05313.005229.42105.9023.3905.775729.0520618.33983.128.2436.124913.0253.574553.10649.6713.1234.439722734.234.634.725157666735.35.77136.43011.437.633.374731.764484213.998842.94.3845207333345.694.5694437.7100.9781851.2237.3043.4335936.7270664.9447744413529.16413.151576.8414.30314811.181437.28815.3757837.71.66579121.8260734308.956.468107.764113.7734543.385476.80114.90593405.3219382.7163.5601850.8243854500263333539.017907.952245.8851851.142.14385676323333413.02172.07656.8932112.9431761.070.38614814.828171.9512210915679355519412405015.715.818.93834.635.534.846.544.139.326.247.90.121.07387.237.1219.210.3815.251894.966163.7648.78121.961177.1327.5128500000238322088.1678831.54.94333.3139.8229.7957.0893.93216295.0113.42.6833415.353.39698.14574.82.346172.10175.0620.847917.05471.7996.3025.41417.3673939.416.434.22388.41188.7489.528338.594180.337.9184.166217.6136.053.170818.325.071.823401306633331291.86.6155787.5112.48462.3966.754.548.8761842.08412.964229.36105.7823.3675.773729.1531620.23981.928.3434.656914.4543.565773.09647.6313.1434.384600334.134.634.825147333335.45.78736.53011.237.633.387732.834472914.010642.84.3745308666745.594.7684442.7100.8131854.4037.3263.4391436.7633665.0448054408529.17663.149436.8414.29364812.421436.74415.3573837.31.6655121.7424724307.756.475107.878113.8933543.951477.28414.91033405.9919367.3063.5931851.3043838500016667539.337908.474245.7831852.052.14452675996673413.88172.15056.9175112.9851760.610.3864815.617171.9582210915679355519412405015.715.818.93834.635.534.846.544.139.326.247.90.121.07OpenBenchmarking.org

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression Speed12390180270360450SE +/- 4.78, N = 3SE +/- 2.00, N = 3SE +/- 2.94, N = 3395.1378.4387.21. (CC) gcc options: -O3 -pthread -lz -llzma

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K123918273645SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.51, N = 438.6837.9437.121. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p123510152025SE +/- 0.17, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 319.6819.1119.211. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p1230.08550.1710.25650.3420.4275SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.380.370.381. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triad12348121620SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 315.6115.2315.251. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenSCAD

Render: Projector Mount Swivel

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Projector Mount Swivel12320406080100SE +/- 0.21, N = 3SE +/- 0.42, N = 3SE +/- 0.08, N = 392.9393.9194.971. OpenSCAD version 2019.05

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bit1234080120160200SE +/- 0.13, N = 3SE +/- 1.75, N = 3SE +/- 1.42, N = 3160.91164.40163.76MIN: 103.41 / MAX: 410.16MIN: 103.73 / MAX: 392.07MIN: 104.03 / MAX: 384.961. (CC) gcc options: -pthread -lm

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K1231122334455SE +/- 0.53, N = 15SE +/- 0.10, N = 3SE +/- 0.07, N = 347.7848.7248.781. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p123306090120150SE +/- 0.42, N = 3SE +/- 1.89, N = 3SE +/- 1.32, N = 12124.39122.64121.961. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Compression Speed12330060090012001500SE +/- 5.14, N = 3SE +/- 10.72, N = 3SE +/- 6.67, N = 31164.01156.91177.11. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Compression Speed12370140210280350SE +/- 1.66, N = 3SE +/- 2.23, N = 3SE +/- 0.88, N = 3332.6327.6327.51. (CC) gcc options: -O3 -pthread -lz -llzma

srsLTE

Test: OFDM_Test

OpenBenchmarking.orgSamples / Second, More Is BettersrsLTE 20.10.1Test: OFDM_Test12330M60M90M120M150MSE +/- 416333.20, N = 3SE +/- 821245.67, N = 3SE +/- 665832.81, N = 31267000001284666671285000001. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time1235M10M15M20M25MSE +/- 294389.71, N = 4SE +/- 299604.32, N = 3SE +/- 377829.22, N = 32377787823501591238322081. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU123246810SE +/- 0.00549, N = 3SE +/- 0.00679, N = 3SE +/- 0.01559, N = 38.261038.146958.16788MIN: 8.14MIN: 8MIN: 8.031. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression Speed123714212835SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 331.931.831.51. (CC) gcc options: -O3 -pthread -lz -llzma

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.01231.11222.22443.33664.44885.561SE +/- 0.029, N = 3SE +/- 0.030, N = 3SE +/- 0.027, N = 34.8994.8824.943MIN: 4.71 / MAX: 7.4MIN: 4.68 / MAX: 5.58MIN: 4.65 / MAX: 16.51. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPY123816243240SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 332.933.333.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p123306090120150SE +/- 1.63, N = 6SE +/- 0.46, N = 3SE +/- 2.02, N = 4141.48140.39139.821. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3123714212835SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.16, N = 329.4529.6729.80MIN: 29.17 / MAX: 40.18MIN: 29.48 / MAX: 42.09MIN: 29.45 / MAX: 42.011. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0123246810SE +/- 0.000, N = 3SE +/- 0.003, N = 3SE +/- 0.005, N = 37.1727.0917.0891. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.89481.78962.68443.57924.474SE +/- 0.02149, N = 3SE +/- 0.01212, N = 3SE +/- 0.01771, N = 33.977063.931983.93216MIN: 3.65MIN: 3.64MIN: 3.621. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgeNb Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Test12360120180240300SE +/- 0.19, N = 3SE +/- 0.73, N = 3SE +/- 0.84, N = 3293.2291.8295.01. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgUE Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Test123306090120150SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 3112.7112.2113.41. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_2241230.60371.20741.81112.41483.0185SE +/- 0.063, N = 3SE +/- 0.051, N = 3SE +/- 0.063, N = 32.6722.6552.683MIN: 2.34 / MAX: 4.31MIN: 2.35 / MAX: 3.81MIN: 2.37 / MAX: 4.911. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1237001400210028003500SE +/- 2.52, N = 3SE +/- 26.13, N = 3SE +/- 0.66, N = 33413.973445.233415.35MIN: 3402.03MIN: 3408.23MIN: 3409.681. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K1230.76951.5392.30853.0783.8475SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 33.423.413.391. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

LuaRadio

Test: Complex Phase

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phase123150300450600750SE +/- 5.92, N = 3SE +/- 6.74, N = 3SE +/- 11.13, N = 3704.2700.2698.1

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression Speed12310002000300040005000SE +/- 2.70, N = 3SE +/- 1.73, N = 3SE +/- 22.35, N = 34609.84601.34574.81. (CC) gcc options: -O3 -pthread -lz -llzma

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.01230.52791.05581.58372.11162.6395SE +/- 0.007, N = 3SE +/- 0.006, N = 3SE +/- 0.010, N = 32.3292.3332.346MIN: 2.26 / MAX: 4.96MIN: 2.27 / MAX: 3.55MIN: 2.28 / MAX: 3.631. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p1234080120160200SE +/- 0.19, N = 3SE +/- 0.43, N = 3SE +/- 0.26, N = 3173.31172.24172.101. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p1234080120160200SE +/- 0.31, N = 3SE +/- 0.47, N = 3SE +/- 0.09, N = 3176.29175.27175.061. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3D123510152025SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 320.9920.8520.851. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenSCAD

Render: Leonardo Phone Case Slim

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Leonardo Phone Case Slim12348121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 316.9816.9417.051. OpenSCAD version 2019.05

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 01231632486480SE +/- 0.16, N = 3SE +/- 0.25, N = 3SE +/- 0.26, N = 372.0772.2671.801. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 36.346.326.301. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50123612182430SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 325.3125.2525.41MIN: 25.08 / MAX: 37.21MIN: 25.09 / MAX: 35.04MIN: 25.17 / MAX: 37.21. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenSCAD

Render: Retro Car

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Retro Car12348121620SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 317.2617.3517.371. OpenSCAD version 2019.05

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression Speed1239001800270036004500SE +/- 9.00, N = 3SE +/- 0.96, N = 3SE +/- 5.83, N = 33957.53963.93939.41. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NT12348121620SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.516.516.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPY123816243240SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 334.034.134.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Compression Speed1235001000150020002500SE +/- 12.12, N = 3SE +/- 16.90, N = 3SE +/- 12.46, N = 32379.42374.52388.41. (CC) gcc options: -O3 -pthread -lz -llzma

GNU Radio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filters12330060090012001500SE +/- 12.45, N = 3SE +/- 9.72, N = 3SE +/- 15.25, N = 31190.21195.51188.71. 3.8.1.0

LuaRadio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filter123110220330440550SE +/- 1.86, N = 3SE +/- 1.98, N = 3SE +/- 0.27, N = 3486.8487.7489.5

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory1236K12K18K24K30KSE +/- 98.12, N = 3SE +/- 138.44, N = 3SE +/- 41.95, N = 328391.4728491.7428338.591. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Decompression Speed1239001800270036004500SE +/- 4.56, N = 3SE +/- 19.69, N = 3SE +/- 1.88, N = 34186.54164.54180.31. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-N123918273645SE +/- 0.00, N = 337.938.137.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4K1234080120160200SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.19, N = 3185.13184.25184.16MIN: 174.56 / MAX: 210.52MIN: 173.91 / MAX: 209.75MIN: 173.76 / MAX: 209.161. (CC) gcc options: -pthread -lm

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time123130026003900520065006250.36221.86217.61. (CC) gcc options: -O3 -fomit-frame-pointer -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p123306090120150SE +/- 0.26, N = 3SE +/- 0.13, N = 3SE +/- 0.35, N = 3136.76136.22136.051. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 101230.71331.42662.13992.85323.5665SE +/- 0.004, N = 3SE +/- 0.010, N = 3SE +/- 0.012, N = 33.1543.1603.1701. (CXX) g++ options: -O3 -fPIC -lm

GNU Radio

Test: FIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filter1232004006008001000SE +/- 1.65, N = 3SE +/- 3.57, N = 3SE +/- 2.94, N = 3822.3820.0818.31. 3.8.1.0

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p123612182430SE +/- 0.24, N = 3SE +/- 0.29, N = 5SE +/- 0.22, N = 325.0624.9525.071. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.41220.82441.23661.64882.061SE +/- 0.00346, N = 3SE +/- 0.00543, N = 3SE +/- 0.00185, N = 31.832151.829101.82340MIN: 1.79MIN: 1.78MIN: 1.781. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 5712330M60M90M120M150MSE +/- 40551.75, N = 3SE +/- 367891.89, N = 3SE +/- 35276.68, N = 31301833331300466671306633331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

LuaRadio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filters12330060090012001500SE +/- 2.64, N = 3SE +/- 3.21, N = 3SE +/- 4.98, N = 31297.91292.41291.8

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU123246810SE +/- 0.00984, N = 3SE +/- 0.01698, N = 3SE +/- 0.00264, N = 36.639936.646596.61557MIN: 6MIN: 6MIN: 5.991. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LuaRadio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transform12320406080100SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.32, N = 387.287.187.5

Timed Erlang/OTP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To Compile123306090120150SE +/- 0.14, N = 3SE +/- 0.40, N = 3SE +/- 0.31, N = 3113.00112.56112.48

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, Lossless1231428425670SE +/- 0.15, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 362.6762.3962.401. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36.766.736.751. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserID1231.02382.04763.07144.09525.119SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.534.554.541. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU123246810SE +/- 0.00652, N = 3SE +/- 0.01330, N = 3SE +/- 0.00592, N = 38.838988.865018.87618MIN: 4.68MIN: 4.67MIN: 4.771. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenSCAD

Render: Mini-ITX Case

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Mini-ITX Case1231020304050SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 341.9142.0542.081. OpenSCAD version 2019.05

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 61233691215SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 312.9513.0112.961. (CXX) g++ options: -O3 -fPIC -lm

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p12350100150200250SE +/- 0.11, N = 3SE +/- 0.19, N = 3SE +/- 0.43, N = 3230.27229.42229.361. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p12320406080100SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3106.19105.90105.781. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1S123612182430SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 323.4623.3923.371. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium1231.30392.60783.91175.21566.5195SE +/- 0.0231, N = 3SE +/- 0.0051, N = 3SE +/- 0.0062, N = 35.79495.77575.77371. (CXX) g++ options: -O3 -flto -pthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readback123714212835SE +/- 0.22, N = 3SE +/- 0.10, N = 3SE +/- 0.20, N = 329.0529.0529.151. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

GNU Radio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transform123130260390520650SE +/- 1.07, N = 3SE +/- 2.62, N = 3SE +/- 1.23, N = 3618.0618.3620.21. 3.8.1.0

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression Speed1239001800270036004500SE +/- 10.42, N = 3SE +/- 1.35, N = 3SE +/- 4.27, N = 33969.03983.13981.91. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPY123714212835SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 328.228.228.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish12390180270360450SE +/- 0.16, N = 3SE +/- 0.12, N = 3SE +/- 1.83, N = 3436.08436.12434.661. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly13051232004006008001000SE +/- 0.12, N = 3SE +/- 3.02, N = 3SE +/- 1.80, N = 3916.11913.03914.451. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1230.80491.60982.41473.21964.0245SE +/- 0.01205, N = 3SE +/- 0.00753, N = 3SE +/- 0.00251, N = 33.577403.574553.56577MIN: 3.23MIN: 3.24MIN: 3.261. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: Kostya1230.69751.3952.09252.793.4875SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33.103.103.091. (CXX) g++ options: -O3 -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080p123140280420560700SE +/- 1.42, N = 3SE +/- 0.67, N = 3SE +/- 0.54, N = 3648.26649.67647.63MIN: 589.55 / MAX: 721.08MIN: 588.3 / MAX: 717.46MIN: 582.84 / MAX: 717.061. (CC) gcc options: -pthread -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K1233691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 313.1613.1213.141. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction123816243240SE +/- 0.35, N = 3SE +/- 0.34, N = 3SE +/- 0.31, N = 334.3434.4434.381. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NT123816243240SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 334.134.234.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression Speed123816243240SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 334.534.634.61. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-T123816243240SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 334.834.734.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 5712350M100M150M200M250MSE +/- 265476.51, N = 3SE +/- 375470.08, N = 3SE +/- 321679.62, N = 32508566672515766672514733331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPY123816243240SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 335.335.335.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, Lossless1231.30212.60423.90635.20846.5105SE +/- 0.013, N = 3SE +/- 0.013, N = 3SE +/- 0.012, N = 35.7785.7715.7871. (CXX) g++ options: -O3 -fPIC -lm

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TN123816243240SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 336.536.436.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU Radio

Test: Signal Source (Cosine)

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)1236001200180024003000SE +/- 3.36, N = 3SE +/- 2.10, N = 3SE +/- 2.72, N = 33003.23011.43011.21. 3.8.1.0

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOT123918273645SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 337.737.637.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2123816243240SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 333.4633.3733.391. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p123160320480640800SE +/- 1.00, N = 3SE +/- 2.01, N = 3SE +/- 0.90, N = 3733.64731.76732.83MIN: 538.11 / MAX: 1144.18MIN: 538.01 / MAX: 1138.02MIN: 537.85 / MAX: 1141.981. (CC) gcc options: -pthread -lm

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks12310K20K30K40K50KSE +/- 16.42, N = 3SE +/- 118.82, N = 3SE +/- 38.73, N = 34475644842447291. (CXX) g++ options: -O3 -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU12348121620SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 314.0314.0014.01MIN: 13.92MIN: 13.9MIN: 13.881. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPY1231020304050SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 342.842.942.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweets1230.98551.9712.95653.9424.9275SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.384.384.371. (CXX) g++ options: -O3 -pthread

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57123100M200M300M400M500MSE +/- 1523508.38, N = 3SE +/- 514274.03, N = 3SE +/- 2209451.92, N = 34527133334520733334530866671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-N1231020304050SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 345.545.645.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To Compile12320406080100SE +/- 0.78, N = 3SE +/- 0.60, N = 3SE +/- 0.47, N = 394.7194.5794.77

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Decompression Speed12310002000300040005000SE +/- 7.93, N = 3SE +/- 5.89, N = 3SE +/- 1.29, N = 34433.44437.74442.71. (CC) gcc options: -O3 -pthread -lz -llzma

OpenSCAD

Render: Pistol

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Pistol12320406080100SE +/- 0.07, N = 3SE +/- 0.21, N = 3SE +/- 0.09, N = 3100.77100.98100.811. OpenSCAD version 2019.05

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU123400800120016002000SE +/- 3.26, N = 3SE +/- 1.24, N = 3SE +/- 1.53, N = 31854.881851.221854.40MIN: 1847.25MIN: 1841.63MIN: 1846.771. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2123918273645SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 337.2537.3037.331. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.7741.5482.3223.0963.87SE +/- 0.00410, N = 3SE +/- 0.00274, N = 3SE +/- 0.00188, N = 33.440033.433593.43914MIN: 3.15MIN: 3.17MIN: 3.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Reduction123816243240SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 336.7936.7336.761. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

GNU Radio

Test: IIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filter123140280420560700SE +/- 1.14, N = 3SE +/- 1.34, N = 3SE +/- 0.81, N = 3666.1664.9665.01. 3.8.1.0

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads12310K20K30K40K50KSE +/- 27.00, N = 3SE +/- 25.27, N = 3SE +/- 36.98, N = 34485444774448051. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBB1239K18K27K36K45KSE +/- 59.38, N = 3SE +/- 72.84, N = 3SE +/- 82.93, N = 34405844135440851. (CXX) g++ options: -O3 -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Download123714212835SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.16, N = 329.2129.1629.181. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.70971.41942.12912.83883.5485SE +/- 0.00701, N = 3SE +/- 0.00442, N = 3SE +/- 0.00951, N = 33.154333.151573.14943MIN: 3.1MIN: 3.1MIN: 3.091. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p123246810SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.856.846.841. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU12348121620SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 314.2814.3014.29MIN: 14.2MIN: 14.2MIN: 14.21. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decrypt12310002000300040005000SE +/- 4.51, N = 3SE +/- 0.85, N = 3SE +/- 0.04, N = 34806.104811.184812.421. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decrypt12390180270360450SE +/- 0.18, N = 3SE +/- 0.15, N = 3SE +/- 0.95, N = 3437.30437.29436.741. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SP12348121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 315.3715.3815.361. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

GNU Radio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filter1232004006008001000SE +/- 2.38, N = 3SE +/- 1.84, N = 3SE +/- 1.55, N = 3836.7837.7837.31. 3.8.1.0

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.37520.75041.12561.50081.876SE +/- 0.00165, N = 3SE +/- 0.00242, N = 3SE +/- 0.00123, N = 31.667471.665791.66550MIN: 1.51MIN: 1.51MIN: 1.511. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction123306090120150SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3121.69121.83121.741. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Decompression Speed1239001800270036004500SE +/- 11.68, N = 3SE +/- 5.34, N = 3SE +/- 5.58, N = 34312.44308.94307.71. (CC) gcc options: -O3 -pthread -lz -llzma

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile1231326395265SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 356.4256.4756.48

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decrypt12320406080100SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3107.86107.76107.881. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive123306090120150SE +/- 0.12, N = 3SE +/- 0.14, N = 3SE +/- 0.16, N = 3113.89113.77113.891. (CXX) g++ options: -O3 -flto -pthread

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish123120240360480600SE +/- 0.05, N = 3SE +/- 0.40, N = 3SE +/- 0.10, N = 3543.60543.39543.951. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compile123100200300400500SE +/- 0.32, N = 3SE +/- 0.34, N = 3SE +/- 0.19, N = 3477.10476.80477.28

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough12348121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 314.9214.9114.911. (CXX) g++ options: -O3 -flto -pthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1237001400210028003500SE +/- 5.97, N = 3SE +/- 3.80, N = 3SE +/- 4.35, N = 33408.163405.323405.99MIN: 3391.46MIN: 3391.39MIN: 3393.271. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU1234K8K12K16K20KSE +/- 14.24, N = 3SE +/- 20.02, N = 3SE +/- 16.54, N = 319380.3419382.7119367.301. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 31231428425670SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 363.5463.5663.591. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU123400800120016002000SE +/- 0.58, N = 3SE +/- 0.16, N = 3SE +/- 1.14, N = 31852.261850.821851.30MIN: 1846.85MIN: 1844.82MIN: 1843.571. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP1239K18K27K36K45KSE +/- 13.86, N = 3SE +/- 16.82, N = 3SE +/- 19.34, N = 34386643854438381. (CXX) g++ options: -O3 -lpthread

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57123110M220M330M440M550MSE +/- 324054.18, N = 3SE +/- 188355.46, N = 3SE +/- 199360.09, N = 35003333335002633335000166671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decrypt123120240360480600SE +/- 0.04, N = 3SE +/- 0.37, N = 3SE +/- 0.03, N = 3539.24539.02539.341. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decrypt1232004006008001000SE +/- 0.56, N = 3SE +/- 0.82, N = 3SE +/- 0.54, N = 3908.31907.95908.471. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_N12350100150200250SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3245.92245.89245.781. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU123400800120016002000SE +/- 0.88, N = 3SE +/- 1.72, N = 3SE +/- 0.20, N = 31852.131851.141852.05MIN: 1846.26MIN: 1843.95MIN: 1846.881. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.48260.96521.44781.93042.413SE +/- 0.00353, N = 3SE +/- 0.00224, N = 3SE +/- 0.00152, N = 32.144902.143852.14452MIN: 1.94MIN: 1.97MIN: 1.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 5712314M28M42M56M70MSE +/- 881.92, N = 3SE +/- 1763.83, N = 3SE +/- 27834.83, N = 36762966767632333675996671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1237001400210028003500SE +/- 3.64, N = 3SE +/- 0.58, N = 3SE +/- 0.89, N = 33414.523413.023413.88MIN: 3404.15MIN: 3406.86MIN: 3407.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decrypt1234080120160200SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3172.09172.08172.151. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidth1231326395265SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 356.9156.8956.921. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI123306090120150SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3112.98112.94112.991. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flops123400800120016002000SE +/- 1.38, N = 3SE +/- 0.55, N = 3SE +/- 0.83, N = 31760.701761.071760.611. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hash1230.08690.17380.26070.34760.4345SE +/- 0.0000, N = 3SE +/- 0.0000, N = 3SE +/- 0.0000, N = 30.38610.38610.38601. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-25612310002000300040005000SE +/- 0.17, N = 3SE +/- 0.54, N = 3SE +/- 0.22, N = 34815.974814.834815.621. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-2561234080120160200SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3171.95171.95171.961. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Systemd Total Boot Time

Test: Userspace

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Userspace1235K10K15K20K25K221092210922109

Systemd Total Boot Time

Test: Firmware

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Firmware1233K6K9K12K15K156791567915679

Systemd Total Boot Time

Test: Loader

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Loader1238001600240032004000355535553555

Systemd Total Boot Time

Test: Kernel

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Kernel123400800120016002000194119411941

Systemd Total Boot Time

Test: Total

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Total1235K10K15K20K25K240502405024050

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TT12348121620SE +/- 0.00, N = 2SE +/- 0.00, N = 2SE +/- 0.00, N = 215.715.715.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TN12348121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 315.815.815.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NN123510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 318.918.918.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOT1239182736453838381. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPY123816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 334.634.634.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TT123816243240SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 335.535.535.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NN123816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 334.834.834.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-T1231122334455SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 346.546.546.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOT1231020304050SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 344.144.144.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPY123918273645SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 339.339.339.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPY123612182430SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 326.226.226.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOT1231122334455SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 347.947.947.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K1230.0270.0540.0810.1080.135SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.120.120.121. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandom1230.24080.48160.72240.96321.204SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.071.071.071. (CXX) g++ options: -O3 -pthread


Phoronix Test Suite v10.8.4