990KS March

Intel Core i9-9900KS testing with a ASUS PRIME Z390-A (1502 BIOS) and ASUS Intel UHD 630 CFL GT2 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104012-IB-990KSMARC41&grs&sor.

990KS MarchProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen Resolution123Intel Core i9-9900KS @ 5.00GHz (8 Cores / 16 Threads)ASUS PRIME Z390-A (1502 BIOS)Intel Cannon Lake PCH32GB240GB Corsair Force MP510ASUS Intel UHD 630 CFL GT2 3GB (1200MHz)Realtek ALC1220G237HLIntel I219-VUbuntu 20.045.9.0-050900rc8daily20201005-generic (x86_64) 20201004GNOME Shell 3.36.2X Server 1.20.84.6 Mesa 20.2.6OpenCL 2.1GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xcc - Thermald 1.9.1 Python Details- Python 2.7.18rc1 + Python 3.8.2Security Details- itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

990KS Marchcompress-zstd: 8, Long Mode - Compression Speedaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 1080paom-av1: Speed 0 Two-Pass - Bosphorus 1080pshoc: OpenCL - Triadopenscad: Projector Mount Swiveldav1d: Chimera 1080p 10-bitaom-av1: Speed 9 Realtime - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 1080pcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 8 - Compression Speedsrslte: OFDM_Teststockfish: Total Timeonednn: IP Shapes 3D - f32 - CPUcompress-zstd: 19, Long Mode - Compression Speedmnn: SqueezeNetV1.0viennacl: OpenCL BLAS - sCOPYaom-av1: Speed 9 Realtime - Bosphorus 1080pmnn: inception-v3basis: UASTC Level 0onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUsrslte: PHY_DL_Testsrslte: PHY_DL_Testmnn: MobileNetV2_224onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUaom-av1: Speed 4 Two-Pass - Bosphorus 4Kluaradio: Complex Phasecompress-zstd: 8, Long Mode - Decompression Speedmnn: mobilenet-v1-1.0svt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080pshoc: OpenCL - S3Dopenscad: Leonardo Phone Case Slimavifenc: 0aom-av1: Speed 6 Two-Pass - Bosphorus 4Kmnn: resnet-v2-50openscad: Retro Carcompress-zstd: 19, Long Mode - Decompression Speedviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dCOPYcompress-zstd: 3 - Compression Speedgnuradio: Five Back to Back FIR Filtersluaradio: FM Deemphasis Filtersysbench: RAM / Memorycompress-zstd: 3 - Decompression Speedviennacl: OpenCL BLAS - dGEMV-Ndav1d: Summer Nature 4Kgmpbench: Total Timesvt-vp9: Visual Quality Optimized - Bosphorus 1080pavifenc: 10gnuradio: FIR Filteraom-av1: Speed 6 Realtime - Bosphorus 1080ponednn: IP Shapes 3D - u8s8f32 - CPUliquid-dsp: 2 - 256 - 57luaradio: Five Back to Back FIR Filtersonednn: Deconvolution Batch shapes_3d - f32 - CPUluaradio: Hilbert Transformbuild-erlang: Time To Compileavifenc: 6, Losslessaom-av1: Speed 4 Two-Pass - Bosphorus 1080psimdjson: DistinctUserIDonednn: Deconvolution Batch shapes_1d - f32 - CPUopenscad: Mini-ITX Caseavifenc: 6svt-hevc: 10 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080pbasis: ETC1Sastcenc: Mediumshoc: OpenCL - Bus Speed Readbackgnuradio: Hilbert Transformcompress-zstd: 19 - Decompression Speedviennacl: CPU BLAS - sCOPYbotan: Twofishbotan: ChaCha20Poly1305onednn: IP Shapes 1D - f32 - CPUsimdjson: Kostyadav1d: Summer Nature 1080paom-av1: Speed 6 Realtime - Bosphorus 4Kincompact3d: input.i3d 129 Cells Per Directionviennacl: CPU BLAS - dGEMM-NTcompress-zstd: 19 - Compression Speedviennacl: OpenCL BLAS - dGEMV-Tliquid-dsp: 4 - 256 - 57viennacl: OpenCL BLAS - dAXPYavifenc: 10, Losslessviennacl: CPU BLAS - dGEMM-TNgnuradio: Signal Source (Cosine)viennacl: OpenCL BLAS - sDOTbasis: UASTC Level 2dav1d: Chimera 1080ptoybrot: C++ Tasksonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUviennacl: CPU BLAS - sAXPYsimdjson: PartialTweetsliquid-dsp: 8 - 256 - 57viennacl: CPU BLAS - dGEMV-Nbuild-linux-kernel: Time To Compilecompress-zstd: 3, Long Mode - Decompression Speedopenscad: Pistolonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUavifenc: 2onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUshoc: OpenCL - Reductiongnuradio: IIR Filtertoybrot: C++ Threadstoybrot: TBBshoc: OpenCL - Bus Speed Downloadonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUsvt-hevc: 1 - Bosphorus 1080ponednn: Convolution Batch Shapes Auto - f32 - CPUbotan: AES-256 - Decryptbotan: Twofish - Decryptshoc: OpenCL - FFT SPgnuradio: FM Deemphasis Filteronednn: IP Shapes 1D - u8s8f32 - CPUincompact3d: input.i3d 193 Cells Per Directioncompress-zstd: 8 - Decompression Speedbuild-mesa: Time To Compilebotan: KASUMI - Decryptastcenc: Exhaustivebotan: Blowfishbuild-nodejs: Time To Compileastcenc: Thoroughonednn: Recurrent Neural Network Training - f32 - CPUsysbench: CPUbasis: UASTC Level 3onednn: Recurrent Neural Network Inference - u8s8f32 - CPUtoybrot: OpenMPliquid-dsp: 16 - 256 - 57botan: Blowfish - Decryptbotan: ChaCha20Poly1305 - Decryptshoc: OpenCL - GEMM SGEMM_Nonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUliquid-dsp: 1 - 256 - 57onednn: Recurrent Neural Network Training - u8s8f32 - CPUbotan: CAST-256 - Decryptshoc: OpenCL - Texture Read Bandwidthbotan: KASUMIshoc: OpenCL - Max SP Flopsshoc: OpenCL - MD5 Hashbotan: AES-256botan: CAST-256systemd-boot-total: Userspacesystemd-boot-total: Firmwaresystemd-boot-total: Loadersystemd-boot-total: Kernelsystemd-boot-total: Totalviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - sAXPYviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTaom-av1: Speed 0 Two-Pass - Bosphorus 4Ksimdjson: LargeRand123395.138.6819.680.3815.605392.931160.9147.78124.391164.0332.6126700000237778788.2610331.94.89932.9141.4829.4477.1723.97706293.2112.72.6723413.973.42704.24609.82.329173.31176.2920.993116.97572.0746.3425.30817.2593957.516.534.02379.41190.2486.828391.474186.537.9185.136250.3136.763.154822.325.061.832151301833331297.96.6399387.2113.00062.6746.764.538.8389841.90812.952230.27106.1923.4555.794929.0495618.03969.028.2436.078916.1083.577403.1648.2613.1634.337596934.134.534.825085666735.35.77836.53003.237.733.461733.644475614.033542.84.3845271333345.594.7144433.4100.7671854.8837.2533.4400336.7934666.1448544405829.21193.154336.8514.28304806.102437.29715.3682836.71.66747121.6911704312.456.415107.861113.8873543.597477.09714.92073408.1619380.3463.5431852.2643866500333333539.238908.311245.9201852.132.14490676296673414.52172.09156.9099112.9801760.700.38614815.972171.9542210915679355519412405015.715.818.93834.635.534.846.544.139.326.247.90.121.07378.437.9419.110.3715.233693.908164.4048.72122.641156.9327.6128466667235015918.1469531.84.88233.3140.3929.6707.0913.93198291.8112.22.6553445.233.41700.24601.32.333172.24175.2720.851816.94172.2586.3225.25417.3463963.916.534.12374.51195.5487.728491.744164.538.1184.256221.8136.223.160820.024.951.829101300466671292.46.6465987.1112.55862.3896.734.558.8650142.05313.005229.42105.9023.3905.775729.0520618.33983.128.2436.124913.0253.574553.10649.6713.1234.439722734.234.634.725157666735.35.77136.43011.437.633.374731.764484213.998842.94.3845207333345.694.5694437.7100.9781851.2237.3043.4335936.7270664.9447744413529.16413.151576.8414.30314811.181437.28815.3757837.71.66579121.8260734308.956.468107.764113.7734543.385476.80114.90593405.3219382.7163.5601850.8243854500263333539.017907.952245.8851851.142.14385676323333413.02172.07656.8932112.9431761.070.38614814.828171.9512210915679355519412405015.715.818.93834.635.534.846.544.139.326.247.90.121.07387.237.1219.210.3815.251894.966163.7648.78121.961177.1327.5128500000238322088.1678831.54.94333.3139.8229.7957.0893.93216295.0113.42.6833415.353.39698.14574.82.346172.10175.0620.847917.05471.7996.3025.41417.3673939.416.434.22388.41188.7489.528338.594180.337.9184.166217.6136.053.170818.325.071.823401306633331291.86.6155787.5112.48462.3966.754.548.8761842.08412.964229.36105.7823.3675.773729.1531620.23981.928.3434.656914.4543.565773.09647.6313.1434.384600334.134.634.825147333335.45.78736.53011.237.633.387732.834472914.010642.84.3745308666745.594.7684442.7100.8131854.4037.3263.4391436.7633665.0448054408529.17663.149436.8414.29364812.421436.74415.3573837.31.6655121.7424724307.756.475107.878113.8933543.951477.28414.91033405.9919367.3063.5931851.3043838500016667539.337908.474245.7831852.052.14452675996673413.88172.15056.9175112.9851760.610.3864815.617171.9582210915679355519412405015.715.818.93834.635.534.846.544.139.326.247.90.121.07OpenBenchmarking.org

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression Speed13290180270360450SE +/- 4.78, N = 3SE +/- 2.94, N = 3SE +/- 2.00, N = 3395.1387.2378.41. (CC) gcc options: -O3 -pthread -lz -llzma

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K123918273645SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.51, N = 438.6837.9437.121. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p132510152025SE +/- 0.17, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 319.6819.2119.111. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p3120.08550.1710.25650.3420.4275SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.380.380.371. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triad13248121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 315.6115.2515.231. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenSCAD

Render: Projector Mount Swivel

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Projector Mount Swivel12320406080100SE +/- 0.21, N = 3SE +/- 0.42, N = 3SE +/- 0.08, N = 392.9393.9194.971. OpenSCAD version 2019.05

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bit2314080120160200SE +/- 1.75, N = 3SE +/- 1.42, N = 3SE +/- 0.13, N = 3164.40163.76160.91MIN: 103.73 / MAX: 392.07MIN: 104.03 / MAX: 384.96MIN: 103.41 / MAX: 410.161. (CC) gcc options: -pthread -lm

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K3211122334455SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.53, N = 1548.7848.7247.781. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p123306090120150SE +/- 0.42, N = 3SE +/- 1.89, N = 3SE +/- 1.32, N = 12124.39122.64121.961. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Compression Speed31230060090012001500SE +/- 6.67, N = 3SE +/- 5.14, N = 3SE +/- 10.72, N = 31177.11164.01156.91. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Compression Speed12370140210280350SE +/- 1.66, N = 3SE +/- 2.23, N = 3SE +/- 0.88, N = 3332.6327.6327.51. (CC) gcc options: -O3 -pthread -lz -llzma

srsLTE

Test: OFDM_Test

OpenBenchmarking.orgSamples / Second, More Is BettersrsLTE 20.10.1Test: OFDM_Test32130M60M90M120M150MSE +/- 665832.81, N = 3SE +/- 821245.67, N = 3SE +/- 416333.20, N = 31285000001284666671267000001. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time3125M10M15M20M25MSE +/- 377829.22, N = 3SE +/- 294389.71, N = 4SE +/- 299604.32, N = 32383220823777878235015911. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU231246810SE +/- 0.00679, N = 3SE +/- 0.01559, N = 3SE +/- 0.00549, N = 38.146958.167888.26103MIN: 8MIN: 8.03MIN: 8.141. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression Speed123714212835SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 331.931.831.51. (CC) gcc options: -O3 -pthread -lz -llzma

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.02131.11222.22443.33664.44885.561SE +/- 0.030, N = 3SE +/- 0.029, N = 3SE +/- 0.027, N = 34.8824.8994.943MIN: 4.68 / MAX: 5.58MIN: 4.71 / MAX: 7.4MIN: 4.65 / MAX: 16.51. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPY321816243240SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 333.333.332.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p123306090120150SE +/- 1.63, N = 6SE +/- 0.46, N = 3SE +/- 2.02, N = 4141.48140.39139.821. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3123714212835SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.16, N = 329.4529.6729.80MIN: 29.17 / MAX: 40.18MIN: 29.48 / MAX: 42.09MIN: 29.45 / MAX: 42.011. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0321246810SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.000, N = 37.0897.0917.1721. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU2310.89481.78962.68443.57924.474SE +/- 0.01212, N = 3SE +/- 0.01771, N = 3SE +/- 0.02149, N = 33.931983.932163.97706MIN: 3.64MIN: 3.62MIN: 3.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgeNb Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Test31260120180240300SE +/- 0.84, N = 3SE +/- 0.19, N = 3SE +/- 0.73, N = 3295.0293.2291.81. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgUE Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Test312306090120150SE +/- 0.12, N = 3SE +/- 0.09, N = 3SE +/- 0.13, N = 3113.4112.7112.21. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_2242130.60371.20741.81112.41483.0185SE +/- 0.051, N = 3SE +/- 0.063, N = 3SE +/- 0.063, N = 32.6552.6722.683MIN: 2.35 / MAX: 3.81MIN: 2.34 / MAX: 4.31MIN: 2.37 / MAX: 4.911. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1327001400210028003500SE +/- 2.52, N = 3SE +/- 0.66, N = 3SE +/- 26.13, N = 33413.973415.353445.23MIN: 3402.03MIN: 3409.68MIN: 3408.231. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K1230.76951.5392.30853.0783.8475SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 33.423.413.391. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

LuaRadio

Test: Complex Phase

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phase123150300450600750SE +/- 5.92, N = 3SE +/- 6.74, N = 3SE +/- 11.13, N = 3704.2700.2698.1

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression Speed12310002000300040005000SE +/- 2.70, N = 3SE +/- 1.73, N = 3SE +/- 22.35, N = 34609.84601.34574.81. (CC) gcc options: -O3 -pthread -lz -llzma

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.01230.52791.05581.58372.11162.6395SE +/- 0.007, N = 3SE +/- 0.006, N = 3SE +/- 0.010, N = 32.3292.3332.346MIN: 2.26 / MAX: 4.96MIN: 2.27 / MAX: 3.55MIN: 2.28 / MAX: 3.631. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p1234080120160200SE +/- 0.19, N = 3SE +/- 0.43, N = 3SE +/- 0.26, N = 3173.31172.24172.101. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p1234080120160200SE +/- 0.31, N = 3SE +/- 0.47, N = 3SE +/- 0.09, N = 3176.29175.27175.061. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3D123510152025SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 320.9920.8520.851. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenSCAD

Render: Leonardo Phone Case Slim

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Leonardo Phone Case Slim21348121620SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 316.9416.9817.051. OpenSCAD version 2019.05

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 03121632486480SE +/- 0.26, N = 3SE +/- 0.16, N = 3SE +/- 0.25, N = 371.8072.0772.261. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 36.346.326.301. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50213612182430SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 325.2525.3125.41MIN: 25.09 / MAX: 35.04MIN: 25.08 / MAX: 37.21MIN: 25.17 / MAX: 37.21. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenSCAD

Render: Retro Car

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Retro Car12348121620SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 317.2617.3517.371. OpenSCAD version 2019.05

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression Speed2139001800270036004500SE +/- 0.96, N = 3SE +/- 9.00, N = 3SE +/- 5.83, N = 33963.93957.53939.41. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NT21348121620SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 316.516.516.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPY321816243240SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 334.234.134.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Compression Speed3125001000150020002500SE +/- 12.46, N = 3SE +/- 12.12, N = 3SE +/- 16.90, N = 32388.42379.42374.51. (CC) gcc options: -O3 -pthread -lz -llzma

GNU Radio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filters21330060090012001500SE +/- 9.72, N = 3SE +/- 12.45, N = 3SE +/- 15.25, N = 31195.51190.21188.71. 3.8.1.0

LuaRadio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filter321110220330440550SE +/- 0.27, N = 3SE +/- 1.98, N = 3SE +/- 1.86, N = 3489.5487.7486.8

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory2136K12K18K24K30KSE +/- 138.44, N = 3SE +/- 98.12, N = 3SE +/- 41.95, N = 328491.7428391.4728338.591. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Decompression Speed1329001800270036004500SE +/- 4.56, N = 3SE +/- 1.88, N = 3SE +/- 19.69, N = 34186.54180.34164.51. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-N231918273645SE +/- 0.00, N = 338.137.937.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4K1234080120160200SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.19, N = 3185.13184.25184.16MIN: 174.56 / MAX: 210.52MIN: 173.91 / MAX: 209.75MIN: 173.76 / MAX: 209.161. (CC) gcc options: -pthread -lm

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time123130026003900520065006250.36221.86217.61. (CC) gcc options: -O3 -fomit-frame-pointer -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p123306090120150SE +/- 0.26, N = 3SE +/- 0.13, N = 3SE +/- 0.35, N = 3136.76136.22136.051. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 101230.71331.42662.13992.85323.5665SE +/- 0.004, N = 3SE +/- 0.010, N = 3SE +/- 0.012, N = 33.1543.1603.1701. (CXX) g++ options: -O3 -fPIC -lm

GNU Radio

Test: FIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filter1232004006008001000SE +/- 1.65, N = 3SE +/- 3.57, N = 3SE +/- 2.94, N = 3822.3820.0818.31. 3.8.1.0

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p312612182430SE +/- 0.22, N = 3SE +/- 0.24, N = 3SE +/- 0.29, N = 525.0725.0624.951. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU3210.41220.82441.23661.64882.061SE +/- 0.00185, N = 3SE +/- 0.00543, N = 3SE +/- 0.00346, N = 31.823401.829101.83215MIN: 1.78MIN: 1.78MIN: 1.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 5731230M60M90M120M150MSE +/- 35276.68, N = 3SE +/- 40551.75, N = 3SE +/- 367891.89, N = 31306633331301833331300466671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

LuaRadio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filters12330060090012001500SE +/- 2.64, N = 3SE +/- 3.21, N = 3SE +/- 4.98, N = 31297.91292.41291.8

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU312246810SE +/- 0.00264, N = 3SE +/- 0.00984, N = 3SE +/- 0.01698, N = 36.615576.639936.64659MIN: 5.99MIN: 6MIN: 61. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LuaRadio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transform31220406080100SE +/- 0.32, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 387.587.287.1

Timed Erlang/OTP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To Compile321306090120150SE +/- 0.31, N = 3SE +/- 0.40, N = 3SE +/- 0.14, N = 3112.48112.56113.00

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, Lossless2311428425670SE +/- 0.11, N = 3SE +/- 0.14, N = 3SE +/- 0.15, N = 362.3962.4062.671. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p132246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 36.766.756.731. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserID2311.02382.04763.07144.09525.119SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.554.544.531. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU123246810SE +/- 0.00652, N = 3SE +/- 0.01330, N = 3SE +/- 0.00592, N = 38.838988.865018.87618MIN: 4.68MIN: 4.67MIN: 4.771. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenSCAD

Render: Mini-ITX Case

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Mini-ITX Case1231020304050SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 341.9142.0542.081. OpenSCAD version 2019.05

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 61323691215SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 312.9512.9613.011. (CXX) g++ options: -O3 -fPIC -lm

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p12350100150200250SE +/- 0.11, N = 3SE +/- 0.19, N = 3SE +/- 0.43, N = 3230.27229.42229.361. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p12320406080100SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3106.19105.90105.781. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1S321612182430SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 323.3723.3923.461. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium3211.30392.60783.91175.21566.5195SE +/- 0.0062, N = 3SE +/- 0.0051, N = 3SE +/- 0.0231, N = 35.77375.77575.79491. (CXX) g++ options: -O3 -flto -pthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readback321714212835SE +/- 0.20, N = 3SE +/- 0.10, N = 3SE +/- 0.22, N = 329.1529.0529.051. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

GNU Radio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transform321130260390520650SE +/- 1.23, N = 3SE +/- 2.62, N = 3SE +/- 1.07, N = 3620.2618.3618.01. 3.8.1.0

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression Speed2319001800270036004500SE +/- 1.35, N = 3SE +/- 4.27, N = 3SE +/- 10.42, N = 33983.13981.93969.01. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPY321714212835SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 328.328.228.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish21390180270360450SE +/- 0.12, N = 3SE +/- 0.16, N = 3SE +/- 1.83, N = 3436.12436.08434.661. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly13051322004006008001000SE +/- 0.12, N = 3SE +/- 1.80, N = 3SE +/- 3.02, N = 3916.11914.45913.031. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU3210.80491.60982.41473.21964.0245SE +/- 0.00251, N = 3SE +/- 0.00753, N = 3SE +/- 0.01205, N = 33.565773.574553.57740MIN: 3.26MIN: 3.24MIN: 3.231. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: Kostya2130.69751.3952.09252.793.4875SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33.103.103.091. (CXX) g++ options: -O3 -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080p213140280420560700SE +/- 0.67, N = 3SE +/- 1.42, N = 3SE +/- 0.54, N = 3649.67648.26647.63MIN: 588.3 / MAX: 717.46MIN: 589.55 / MAX: 721.08MIN: 582.84 / MAX: 717.061. (CC) gcc options: -pthread -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K1323691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 313.1613.1413.121. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction132816243240SE +/- 0.35, N = 3SE +/- 0.31, N = 3SE +/- 0.34, N = 334.3434.3834.441. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NT231816243240SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 334.234.134.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression Speed321816243240SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 334.634.634.51. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-T312816243240SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 334.834.834.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 5723150M100M150M200M250MSE +/- 375470.08, N = 3SE +/- 321679.62, N = 3SE +/- 265476.51, N = 32515766672514733332508566671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPY321816243240SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 335.435.335.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, Lossless2131.30212.60423.90635.20846.5105SE +/- 0.013, N = 3SE +/- 0.013, N = 3SE +/- 0.012, N = 35.7715.7785.7871. (CXX) g++ options: -O3 -fPIC -lm

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TN312816243240SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 336.536.536.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU Radio

Test: Signal Source (Cosine)

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)2316001200180024003000SE +/- 2.10, N = 3SE +/- 2.72, N = 3SE +/- 3.36, N = 33011.43011.23003.21. 3.8.1.0

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOT132918273645SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 337.737.637.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2231816243240SE +/- 0.10, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 333.3733.3933.461. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p132160320480640800SE +/- 1.00, N = 3SE +/- 0.90, N = 3SE +/- 2.01, N = 3733.64732.83731.76MIN: 538.11 / MAX: 1144.18MIN: 537.85 / MAX: 1141.98MIN: 538.01 / MAX: 1138.021. (CC) gcc options: -pthread -lm

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks31210K20K30K40K50KSE +/- 38.73, N = 3SE +/- 16.42, N = 3SE +/- 118.82, N = 34472944756448421. (CXX) g++ options: -O3 -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU23148121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 314.0014.0114.03MIN: 13.9MIN: 13.88MIN: 13.921. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPY2311020304050SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 342.942.842.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweets2130.98551.9712.95653.9424.9275SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.384.384.371. (CXX) g++ options: -O3 -pthread

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57312100M200M300M400M500MSE +/- 2209451.92, N = 3SE +/- 1523508.38, N = 3SE +/- 514274.03, N = 34530866674527133334520733331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-N2311020304050SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 345.645.545.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To Compile21320406080100SE +/- 0.60, N = 3SE +/- 0.78, N = 3SE +/- 0.47, N = 394.5794.7194.77

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Decompression Speed32110002000300040005000SE +/- 1.29, N = 3SE +/- 5.89, N = 3SE +/- 7.93, N = 34442.74437.74433.41. (CC) gcc options: -O3 -pthread -lz -llzma

OpenSCAD

Render: Pistol

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenSCADRender: Pistol13220406080100SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.21, N = 3100.77100.81100.981. OpenSCAD version 2019.05

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU231400800120016002000SE +/- 1.24, N = 3SE +/- 1.53, N = 3SE +/- 3.26, N = 31851.221854.401854.88MIN: 1841.63MIN: 1846.77MIN: 1847.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2123918273645SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 337.2537.3037.331. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU2310.7741.5482.3223.0963.87SE +/- 0.00274, N = 3SE +/- 0.00188, N = 3SE +/- 0.00410, N = 33.433593.439143.44003MIN: 3.17MIN: 3.18MIN: 3.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Reduction132816243240SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 336.7936.7636.731. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

GNU Radio

Test: IIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filter132140280420560700SE +/- 1.14, N = 3SE +/- 0.81, N = 3SE +/- 1.34, N = 3666.1665.0664.91. 3.8.1.0

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads23110K20K30K40K50KSE +/- 25.27, N = 3SE +/- 36.98, N = 3SE +/- 27.00, N = 34477444805448541. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBB1329K18K27K36K45KSE +/- 59.38, N = 3SE +/- 82.93, N = 3SE +/- 72.84, N = 34405844085441351. (CXX) g++ options: -O3 -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Download132714212835SE +/- 0.10, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 329.2129.1829.161. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU3210.70971.41942.12912.83883.5485SE +/- 0.00951, N = 3SE +/- 0.00442, N = 3SE +/- 0.00701, N = 33.149433.151573.15433MIN: 3.09MIN: 3.1MIN: 3.11. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p132246810SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.856.846.841. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU13248121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 314.2814.2914.30MIN: 14.2MIN: 14.2MIN: 14.21. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decrypt32110002000300040005000SE +/- 0.04, N = 3SE +/- 0.85, N = 3SE +/- 4.51, N = 34812.424811.184806.101. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decrypt12390180270360450SE +/- 0.18, N = 3SE +/- 0.15, N = 3SE +/- 0.95, N = 3437.30437.29436.741. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SP21348121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 315.3815.3715.361. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

GNU Radio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filter2312004006008001000SE +/- 1.84, N = 3SE +/- 1.55, N = 3SE +/- 2.38, N = 3837.7837.3836.71. 3.8.1.0

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3210.37520.75041.12561.50081.876SE +/- 0.00123, N = 3SE +/- 0.00242, N = 3SE +/- 0.00165, N = 31.665501.665791.66747MIN: 1.51MIN: 1.51MIN: 1.511. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction132306090120150SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3121.69121.74121.831. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Decompression Speed1239001800270036004500SE +/- 11.68, N = 3SE +/- 5.34, N = 3SE +/- 5.58, N = 34312.44308.94307.71. (CC) gcc options: -O3 -pthread -lz -llzma

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile1231326395265SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 356.4256.4756.48

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decrypt31220406080100SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3107.88107.86107.761. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive213306090120150SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 0.16, N = 3113.77113.89113.891. (CXX) g++ options: -O3 -flto -pthread

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish312120240360480600SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.40, N = 3543.95543.60543.391. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compile213100200300400500SE +/- 0.34, N = 3SE +/- 0.32, N = 3SE +/- 0.19, N = 3476.80477.10477.28

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough23148121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 314.9114.9114.921. (CXX) g++ options: -O3 -flto -pthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU2317001400210028003500SE +/- 3.80, N = 3SE +/- 4.35, N = 3SE +/- 5.97, N = 33405.323405.993408.16MIN: 3391.39MIN: 3393.27MIN: 3391.461. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU2134K8K12K16K20KSE +/- 20.02, N = 3SE +/- 14.24, N = 3SE +/- 16.54, N = 319382.7119380.3419367.301. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 31231428425670SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 363.5463.5663.591. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU231400800120016002000SE +/- 0.16, N = 3SE +/- 1.14, N = 3SE +/- 0.58, N = 31850.821851.301852.26MIN: 1844.82MIN: 1843.57MIN: 1846.851. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP3219K18K27K36K45KSE +/- 19.34, N = 3SE +/- 16.82, N = 3SE +/- 13.86, N = 34383843854438661. (CXX) g++ options: -O3 -lpthread

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57123110M220M330M440M550MSE +/- 324054.18, N = 3SE +/- 188355.46, N = 3SE +/- 199360.09, N = 35003333335002633335000166671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decrypt312120240360480600SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.37, N = 3539.34539.24539.021. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decrypt3122004006008001000SE +/- 0.54, N = 3SE +/- 0.56, N = 3SE +/- 0.82, N = 3908.47908.31907.951. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_N12350100150200250SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3245.92245.89245.781. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU231400800120016002000SE +/- 1.72, N = 3SE +/- 0.20, N = 3SE +/- 0.88, N = 31851.141852.051852.13MIN: 1843.95MIN: 1846.88MIN: 1846.261. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU2310.48260.96521.44781.93042.413SE +/- 0.00224, N = 3SE +/- 0.00152, N = 3SE +/- 0.00353, N = 32.143852.144522.14490MIN: 1.97MIN: 1.97MIN: 1.941. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 5721314M28M42M56M70MSE +/- 1763.83, N = 3SE +/- 881.92, N = 3SE +/- 27834.83, N = 36763233367629667675996671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU2317001400210028003500SE +/- 0.58, N = 3SE +/- 0.89, N = 3SE +/- 3.64, N = 33413.023413.883414.52MIN: 3406.86MIN: 3407.32MIN: 3404.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decrypt3124080120160200SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3172.15172.09172.081. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidth3121326395265SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 356.9256.9156.891. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI312306090120150SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3112.99112.98112.941. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flops213400800120016002000SE +/- 0.55, N = 3SE +/- 1.38, N = 3SE +/- 0.83, N = 31761.071760.701760.611. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hash2130.08690.17380.26070.34760.4345SE +/- 0.0000, N = 3SE +/- 0.0000, N = 3SE +/- 0.0000, N = 30.38610.38610.38601. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-25613210002000300040005000SE +/- 0.17, N = 3SE +/- 0.22, N = 3SE +/- 0.54, N = 34815.974815.624814.831. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-2563124080120160200SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3171.96171.95171.951. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Systemd Total Boot Time

Test: Userspace

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Userspace1235K10K15K20K25K221092210922109

Systemd Total Boot Time

Test: Firmware

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Firmware1233K6K9K12K15K156791567915679

Systemd Total Boot Time

Test: Loader

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Loader1238001600240032004000355535553555

Systemd Total Boot Time

Test: Kernel

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Kernel123400800120016002000194119411941

Systemd Total Boot Time

Test: Total

OpenBenchmarking.orgms, Fewer Is BetterSystemd Total Boot TimeTest: Total1235K10K15K20K25K240502405024050

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TT32148121620SE +/- 0.00, N = 2SE +/- 0.00, N = 2SE +/- 0.00, N = 215.715.715.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TN32148121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 315.815.815.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NN321510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 318.918.918.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOT3219182736453838381. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPY321816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 334.634.634.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TT321816243240SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 335.535.535.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NN321816243240SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 334.834.834.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-T3211122334455SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 346.546.546.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOT3211020304050SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 344.144.144.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPY321918273645SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 339.339.339.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPY321612182430SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 326.226.226.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOT3211122334455SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 347.947.947.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K3210.0270.0540.0810.1080.135SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.120.120.121. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandom3210.24080.48160.72240.96321.204SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.071.071.071. (CXX) g++ options: -O3 -pthread


Phoronix Test Suite v10.8.5