990KS March

Intel Core i9-9900KS testing with an ASUS PRIME Z390-A (1502 BIOS) and ASUS Intel UHD 630 CFL GT2 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104012-IB-990KSMARC41&rdt&grr.

990KS March system details (runs 1, 2, 3)

Processor: Intel Core i9-9900KS @ 5.00GHz (8 Cores / 16 Threads)
Motherboard: ASUS PRIME Z390-A (1502 BIOS)
Chipset: Intel Cannon Lake PCH
Memory: 32GB
Disk: 240GB Corsair Force MP510
Graphics: ASUS Intel UHD 630 CFL GT2 3GB (1200MHz)
Audio: Realtek ALC1220
Monitor: G237HL
Network: Intel I219-V
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc8daily20201005-generic (x86_64) 20201004
Desktop: GNOME Shell 3.36.2
Display Server: X Server 1.20.8
OpenGL: 4.6 Mesa 20.2.6
OpenCL: OpenCL 2.1
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: intel_pstate powersave; CPU Microcode: 0xcc; Thermald 1.9.1
Python Details: Python 2.7.18rc1 + Python 3.8.2
Security Details: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

990KS March: overview table of all test results for runs 1, 2, and 3. The individual results appear under each test heading below.

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 15.11 (Seconds, Fewer Is Better)
Run 1: 477.10 (SE +/- 0.32, N = 3)
Run 2: 476.80 (SE +/- 0.34, N = 3)
Run 3: 477.28 (SE +/- 0.19, N = 3)
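
Each result in this export reports a mean and a standard error (SE) over N samples per run. As a rough reading aid, the sketch below checks whether two runs differ by more than their combined standard errors, using the Timed Node.js Compilation figures above. The helper names (`within_noise`, `spread_percent`) and the threshold of two combined standard errors are illustrative assumptions, not part of the Phoronix Test Suite.

```python
from math import sqrt

# (mean seconds, standard error) copied from the Timed Node.js Compilation result.
runs = {
    "1": (477.10, 0.32),
    "2": (476.80, 0.34),
    "3": (477.28, 0.19),
}


def within_noise(a, b, k=2.0):
    """Return True if two (mean, se) results differ by less than k combined SEs.

    The threshold k=2.0 is an assumed rule of thumb, not a PTS setting.
    """
    (mean_a, se_a), (mean_b, se_b) = a, b
    combined_se = sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) < k * combined_se


def spread_percent(results):
    """Return the gap between the best and worst mean as a percentage of the smallest mean."""
    means = [mean for mean, _ in results.values()]
    return (max(means) - min(means)) / min(means) * 100.0


if __name__ == "__main__":
    print("Run 1 vs run 2 within noise:", within_noise(runs["1"], runs["2"]))
    print("Spread across runs: %.2f%%" % spread_percent(runs))
```

With N = 3 samples per run the standard errors are themselves coarse estimates, so a check like this is only a first filter before treating a difference between runs as real.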

GNU Radio

Test: Hilbert Transform

GNU Radio (MiB/s, More Is Better)
Run 1: 618.0 (SE +/- 1.07, N = 3)
Run 2: 618.3 (SE +/- 2.62, N = 3)
Run 3: 620.2 (SE +/- 1.23, N = 3)
1. 3.8.1.0

GNU Radio

Test: FM Deemphasis Filter

GNU Radio (MiB/s, More Is Better)
Run 1: 836.7 (SE +/- 2.38, N = 3)
Run 2: 837.7 (SE +/- 1.84, N = 3)
Run 3: 837.3 (SE +/- 1.55, N = 3)
1. 3.8.1.0

GNU Radio

Test: IIR Filter

GNU Radio (MiB/s, More Is Better)
Run 1: 666.1 (SE +/- 1.14, N = 3)
Run 2: 664.9 (SE +/- 1.34, N = 3)
Run 3: 665.0 (SE +/- 0.81, N = 3)
1. 3.8.1.0

GNU Radio

Test: FIR Filter

GNU Radio (MiB/s, More Is Better)
Run 1: 822.3 (SE +/- 1.65, N = 3)
Run 2: 820.0 (SE +/- 3.57, N = 3)
Run 3: 818.3 (SE +/- 2.94, N = 3)
1. 3.8.1.0

GNU Radio

Test: Signal Source (Cosine)

GNU Radio (MiB/s, More Is Better)
Run 1: 3003.2 (SE +/- 3.36, N = 3)
Run 2: 3011.4 (SE +/- 2.10, N = 3)
Run 3: 3011.2 (SE +/- 2.72, N = 3)
1. 3.8.1.0

GNU Radio

Test: Five Back to Back FIR Filters

GNU Radio (MiB/s, More Is Better)
Run 1: 1190.2 (SE +/- 12.45, N = 3)
Run 2: 1195.5 (SE +/- 9.72, N = 3)
Run 3: 1188.7 (SE +/- 15.25, N = 3)
1. 3.8.1.0

LuaRadio

Test: Complex Phase

LuaRadio 0.9.1 (MiB/s, More Is Better)
Run 1: 704.2 (SE +/- 5.92, N = 3)
Run 2: 700.2 (SE +/- 6.74, N = 3)
Run 3: 698.1 (SE +/- 11.13, N = 3)

LuaRadio

Test: Hilbert Transform

LuaRadio 0.9.1 (MiB/s, More Is Better)
Run 1: 87.2 (SE +/- 0.03, N = 3)
Run 2: 87.1 (SE +/- 0.06, N = 3)
Run 3: 87.5 (SE +/- 0.32, N = 3)

LuaRadio

Test: FM Deemphasis Filter

LuaRadio 0.9.1 (MiB/s, More Is Better)
Run 1: 486.8 (SE +/- 1.86, N = 3)
Run 2: 487.7 (SE +/- 1.98, N = 3)
Run 3: 489.5 (SE +/- 0.27, N = 3)

LuaRadio

Test: Five Back to Back FIR Filters

LuaRadio 0.9.1 (MiB/s, More Is Better)
Run 1: 1297.9 (SE +/- 2.64, N = 3)
Run 2: 1292.4 (SE +/- 3.21, N = 3)
Run 3: 1291.8 (SE +/- 4.98, N = 3)

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
Run 1: 3.42 (SE +/- 0.02, N = 3)
Run 2: 3.41 (SE +/- 0.02, N = 3)
Run 3: 3.39 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
Run 1: 20.99 (SE +/- 0.04, N = 3)
Run 2: 20.85 (SE +/- 0.04, N = 3)
Run 3: 20.85 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
Run 1: 0.12 (SE +/- 0.00, N = 3)
Run 2: 0.12 (SE +/- 0.00, N = 3)
Run 3: 0.12 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.2.1 (GMPbench Score, More Is Better)
Run 1: 6250.3
Run 2: 6221.8
Run 3: 6217.6
1. (CC) gcc options: -O3 -fomit-frame-pointer -lm

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
Run 1: 6.34 (SE +/- 0.01, N = 3)
Run 2: 6.32 (SE +/- 0.02, N = 3)
Run 3: 6.30 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
Run 1: 121.69 (SE +/- 0.01, N = 3)
Run 2: 121.83 (SE +/- 0.03, N = 3)
Run 3: 121.74 (SE +/- 0.05, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.4 (Seconds, Fewer Is Better)
Run 1: 113.89 (SE +/- 0.12, N = 3)
Run 2: 113.77 (SE +/- 0.14, N = 3)
Run 3: 113.89 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

Timed Erlang/OTP Compilation

Time To Compile

Timed Erlang/OTP Compilation 23.2 (Seconds, Fewer Is Better)
Run 1: 113.00 (SE +/- 0.14, N = 3)
Run 2: 112.56 (SE +/- 0.40, N = 3)
Run 3: 112.48 (SE +/- 0.31, N = 3)

OpenSCAD

Render: Pistol

OpenSCAD (Seconds, Fewer Is Better)
Run 1: 100.77 (SE +/- 0.07, N = 3)
Run 2: 100.98 (SE +/- 0.21, N = 3)
Run 3: 100.81 (SE +/- 0.09, N = 3)
1. OpenSCAD version 2019.05

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
Run 1: 6.76 (SE +/- 0.01, N = 3)
Run 2: 6.73 (SE +/- 0.02, N = 3)
Run 3: 6.75 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 5.10.20 (Seconds, Fewer Is Better)
Run 1: 94.71 (SE +/- 0.78, N = 3)
Run 2: 94.57 (SE +/- 0.60, N = 3)
Run 3: 94.77 (SE +/- 0.47, N = 3)

OpenSCAD

Render: Projector Mount Swivel

OpenSCAD (Seconds, Fewer Is Better)
Run 1: 92.93 (SE +/- 0.21, N = 3)
Run 2: 93.91 (SE +/- 0.42, N = 3)
Run 3: 94.97 (SE +/- 0.08, N = 3)
1. OpenSCAD version 2019.05

Sysbench

Test: CPU

Sysbench 1.0.20 (Events Per Second, More Is Better)
Run 1: 19380.34 (SE +/- 14.24, N = 3)
Run 2: 19382.71 (SE +/- 20.02, N = 3)
Run 3: 19367.30 (SE +/- 16.54, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
Run 1: 6.85 (SE +/- 0.01, N = 3)
Run 2: 6.84 (SE +/- 0.00, N = 3)
Run 3: 6.84 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.2 (FPS, More Is Better)
Run 1: 160.91 (SE +/- 0.13, N = 3; MIN: 103.41 / MAX: 410.16)
Run 2: 164.40 (SE +/- 1.75, N = 3; MIN: 103.73 / MAX: 392.07)
Run 3: 163.76 (SE +/- 1.42, N = 3; MIN: 104.03 / MAX: 384.96)
1. (CC) gcc options: -pthread -lm

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
Run 1: 56.91 (SE +/- 0.02, N = 3)
Run 2: 56.89 (SE +/- 0.01, N = 3)
Run 3: 56.92 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 1: 3413.97 (SE +/- 2.52, N = 3; MIN: 3402.03)
Run 2: 3445.23 (SE +/- 26.13, N = 3; MIN: 3408.23)
Run 3: 3415.35 (SE +/- 0.66, N = 3; MIN: 3409.68)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 1: 3414.52 (SE +/- 3.64, N = 3; MIN: 3404.15)
Run 2: 3413.02 (SE +/- 0.58, N = 3; MIN: 3406.86)
Run 3: 3413.88 (SE +/- 0.89, N = 3; MIN: 3407.32)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 1: 3408.16 (SE +/- 5.97, N = 3; MIN: 3391.46)
Run 2: 3405.32 (SE +/- 3.80, N = 3; MIN: 3391.39)
Run 3: 3405.99 (SE +/- 4.35, N = 3; MIN: 3393.27)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
Run 1: 1760.70 (SE +/- 1.38, N = 3)
Run 2: 1761.07 (SE +/- 0.55, N = 3)
Run 3: 1760.61 (SE +/- 0.83, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 1: 1854.88 (SE +/- 3.26, N = 3; MIN: 1847.25)
Run 2: 1851.22 (SE +/- 1.24, N = 3; MIN: 1841.63)
Run 3: 1854.40 (SE +/- 1.53, N = 3; MIN: 1846.77)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 1: 1852.13 (SE +/- 0.88, N = 3; MIN: 1846.26)
Run 2: 1851.14 (SE +/- 1.72, N = 3; MIN: 1843.95)
Run 3: 1852.05 (SE +/- 0.20, N = 3; MIN: 1846.88)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 1: 1852.26 (SE +/- 0.58, N = 3; MIN: 1846.85)
Run 2: 1850.82 (SE +/- 0.16, N = 3; MIN: 1844.82)
Run 3: 1851.30 (SE +/- 1.14, N = 3; MIN: 1843.57)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
Run 1: 29.45 (SE +/- 0.08, N = 3; MIN: 29.17 / MAX: 40.18)
Run 2: 29.67 (SE +/- 0.04, N = 3; MIN: 29.48 / MAX: 42.09)
Run 3: 29.80 (SE +/- 0.16, N = 3; MIN: 29.45 / MAX: 42.01)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
Run 1: 2.329 (SE +/- 0.007, N = 3; MIN: 2.26 / MAX: 4.96)
Run 2: 2.333 (SE +/- 0.006, N = 3; MIN: 2.27 / MAX: 3.55)
Run 3: 2.346 (SE +/- 0.010, N = 3; MIN: 2.28 / MAX: 3.63)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
Run 1: 2.672 (SE +/- 0.063, N = 3; MIN: 2.34 / MAX: 4.31)
Run 2: 2.655 (SE +/- 0.051, N = 3; MIN: 2.35 / MAX: 3.81)
Run 3: 2.683 (SE +/- 0.063, N = 3; MIN: 2.37 / MAX: 4.91)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
Run 1: 25.31 (SE +/- 0.04, N = 3; MIN: 25.08 / MAX: 37.21)
Run 2: 25.25 (SE +/- 0.02, N = 3; MIN: 25.09 / MAX: 35.04)
Run 3: 25.41 (SE +/- 0.05, N = 3; MIN: 25.17 / MAX: 37.2)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
Run 1: 4.899 (SE +/- 0.029, N = 3; MIN: 4.71 / MAX: 7.4)
Run 2: 4.882 (SE +/- 0.030, N = 3; MIN: 4.68 / MAX: 5.58)
Run 3: 4.943 (SE +/- 0.027, N = 3; MIN: 4.65 / MAX: 16.5)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

libavif avifenc

Encoder Speed: 0

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
Run 1: 72.07 (SE +/- 0.16, N = 3)
Run 2: 72.26 (SE +/- 0.25, N = 3)
Run 3: 71.80 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

simdjson

Throughput Test: DistinctUserID

simdjson 0.8.2 (GB/s, More Is Better)
Run 1: 4.53 (SE +/- 0.00, N = 3)
Run 2: 4.55 (SE +/- 0.01, N = 3)
Run 3: 4.54 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

simdjson 0.8.2 (GB/s, More Is Better)
Run 1: 4.38 (SE +/- 0.00, N = 3)
Run 2: 4.38 (SE +/- 0.00, N = 3)
Run 3: 4.37 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13 (Seconds, Fewer Is Better)
Run 1: 63.54 (SE +/- 0.03, N = 3)
Run 2: 63.56 (SE +/- 0.04, N = 3)
Run 3: 63.59 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
Run 1: 62.67 (SE +/- 0.15, N = 3)
Run 2: 62.39 (SE +/- 0.11, N = 3)
Run 3: 62.40 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

simdjson

Throughput Test: Kostya

simdjson 0.8.2 (GB/s, More Is Better)
Run 1: 3.10 (SE +/- 0.00, N = 3)
Run 2: 3.10 (SE +/- 0.00, N = 3)
Run 3: 3.09 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -pthread

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 21.0 (Seconds, Fewer Is Better)
Run 1: 56.42 (SE +/- 0.02, N = 3)
Run 2: 56.47 (SE +/- 0.04, N = 3)
Run 3: 56.48 (SE +/- 0.04, N = 3)

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
Run 1: 0.38 (SE +/- 0.00, N = 3)
Run 2: 0.37 (SE +/- 0.00, N = 3)
Run 3: 0.38 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 3957.5 (SE +/- 9.00, N = 3)
Run 2: 3963.9 (SE +/- 0.96, N = 3)
Run 3: 3939.4 (SE +/- 5.83, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 31.9 (SE +/- 0.12, N = 3)
Run 2: 31.8 (SE +/- 0.06, N = 3)
Run 3: 31.5 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

simdjson

Throughput Test: LargeRandom

simdjson 0.8.2 (GB/s, More Is Better)
Run 1: 1.07 (SE +/- 0.00, N = 3)
Run 2: 1.07 (SE +/- 0.00, N = 3)
Run 3: 1.07 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 3969.0 (SE +/- 10.42, N = 3)
Run 2: 3983.1 (SE +/- 1.35, N = 3)
Run 3: 3981.9 (SE +/- 4.27, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 34.5 (SE +/- 0.12, N = 3)
Run 2: 34.6 (SE +/- 0.07, N = 3)
Run 3: 34.6 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
Run 1: 13.16 (SE +/- 0.02, N = 3)
Run 2: 13.12 (SE +/- 0.01, N = 3)
Run 3: 13.14 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

toyBrot Fractal Generator

Implementation: C++ Threads

toyBrot Fractal Generator 2020-11-18 (ms, Fewer Is Better)
Run 1: 44854 (SE +/- 27.00, N = 3)
Run 2: 44774 (SE +/- 25.27, N = 3)
Run 3: 44805 (SE +/- 36.98, N = 3)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: C++ Tasks

toyBrot Fractal Generator 2020-11-18 (ms, Fewer Is Better)
Run 1: 44756 (SE +/- 16.42, N = 3)
Run 2: 44842 (SE +/- 118.82, N = 3)
Run 3: 44729 (SE +/- 38.73, N = 3)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: TBB

toyBrot Fractal Generator 2020-11-18 (ms, Fewer Is Better)
Run 1: 44058 (SE +/- 59.38, N = 3)
Run 2: 44135 (SE +/- 72.84, N = 3)
Run 3: 44085 (SE +/- 82.93, N = 3)
1. (CXX) g++ options: -O3 -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GHash/s, More Is Better)
Run 1: 0.3861 (SE +/- 0.0000, N = 3)
Run 2: 0.3861 (SE +/- 0.0000, N = 3)
Run 3: 0.3860 (SE +/- 0.0000, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

toyBrot Fractal Generator

Implementation: OpenMP

toyBrot Fractal Generator 2020-11-18 (ms, Fewer Is Better)
Run 1: 43866 (SE +/- 13.86, N = 3)
Run 2: 43854 (SE +/- 16.82, N = 3)
Run 3: 43838 (SE +/- 19.34, N = 3)
1. (CXX) g++ options: -O3 -lpthread

OpenSCAD

Render: Mini-ITX Case

OpenSCAD (Seconds, Fewer Is Better)
Run 1: 41.91 (SE +/- 0.07, N = 3)
Run 2: 42.05 (SE +/- 0.12, N = 3)
Run 3: 42.08 (SE +/- 0.02, N = 3)
1. OpenSCAD version 2019.05

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
Run 1: 23777878 (SE +/- 294389.71, N = 4)
Run 2: 23501591 (SE +/- 299604.32, N = 3)
Run 3: 23832208 (SE +/- 377829.22, N = 3)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

srsLTE

Test: PHY_DL_Test

srsLTE 20.10.1 (UE Mb/s, More Is Better)
Run 1: 112.7 (SE +/- 0.09, N = 3)
Run 2: 112.2 (SE +/- 0.13, N = 3)
Run 3: 113.4 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

srsLTE

Test: PHY_DL_Test

srsLTE 20.10.1 (eNb Mb/s, More Is Better)
Run 1: 293.2 (SE +/- 0.19, N = 3)
Run 2: 291.8 (SE +/- 0.73, N = 3)
Run 3: 295.0 (SE +/- 0.84, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
Run 1: 19.68 (SE +/- 0.17, N = 3)
Run 2: 19.11 (SE +/- 0.08, N = 3)
Run 3: 19.21 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
Run 1: 37.25 (SE +/- 0.06, N = 3)
Run 2: 37.30 (SE +/- 0.11, N = 3)
Run 3: 37.33 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

srsLTE

Test: OFDM_Test

srsLTE 20.10.1 (Samples / Second, More Is Better)
Run 1: 126700000 (SE +/- 416333.20, N = 3)
Run 2: 128466667 (SE +/- 821245.67, N = 3)
Run 3: 128500000 (SE +/- 665832.81, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 4312.4 (SE +/- 11.68, N = 3)
Run 2: 4308.9 (SE +/- 5.34, N = 3)
Run 3: 4307.7 (SE +/- 5.58, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 332.6 (SE +/- 1.66, N = 3)
Run 2: 327.6 (SE +/- 2.23, N = 3)
Run 3: 327.5 (SE +/- 0.88, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 4609.8 (SE +/- 2.70, N = 3)
Run 2: 4601.3 (SE +/- 1.73, N = 3)
Run 3: 4574.8 (SE +/- 22.35, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 395.1 (SE +/- 4.78, N = 3)
Run 2: 378.4 (SE +/- 2.00, N = 3)
Run 3: 387.2 (SE +/- 2.94, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
Run 1: 34.34 (SE +/- 0.35, N = 3)
Run 2: 34.44 (SE +/- 0.34, N = 3)
Run 3: 34.38 (SE +/- 0.31, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
  1: 4433.4 (SE +/- 7.93, N = 3)
  2: 4437.7 (SE +/- 5.89, N = 3)
  3: 4442.7 (SE +/- 1.29, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
  1: 1164.0 (SE +/- 5.14, N = 3)
  2: 1156.9 (SE +/- 10.72, N = 3)
  3: 1177.1 (SE +/- 6.67, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
  1: 15.7 (SE +/- 0.00, N = 2)
  2: 15.7 (SE +/- 0.00, N = 2)
  3: 15.7 (SE +/- 0.00, N = 2)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
  1: 15.8 (SE +/- 0.03, N = 3)
  2: 15.8 (SE +/- 0.03, N = 3)
  3: 15.8 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
  1: 16.5 (SE +/- 0.00, N = 3)
  2: 16.5 (SE +/- 0.03, N = 3)
  3: 16.4 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
  1: 18.9 (SE +/- 0.00, N = 3)
  2: 18.9 (SE +/- 0.00, N = 3)
  3: 18.9 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 34.8 (SE +/- 0.00, N = 3)
  2: 34.7 (SE +/- 0.03, N = 3)
  3: 34.8 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 37.9
  2: 38.1
  3: 37.9
  SE +/- 0.00, N = 3 (listed once in the export, without indicating which result it belongs to)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 38
  2: 38
  3: 38
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 35.3 (SE +/- 0.03, N = 3)
  2: 35.3 (SE +/- 0.00, N = 3)
  3: 35.4 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 34.0 (SE +/- 0.03, N = 3)
  2: 34.1 (SE +/- 0.03, N = 3)
  3: 34.2 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 37.7 (SE +/- 0.00, N = 3)
  2: 37.6 (SE +/- 0.03, N = 3)
  3: 37.6 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 34.6 (SE +/- 0.00, N = 3)
  2: 34.6 (SE +/- 0.00, N = 3)
  3: 34.6 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 32.9 (SE +/- 0.06, N = 3)
  2: 33.3 (SE +/- 0.10, N = 3)
  3: 33.3 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 3 - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
  1: 4186.5 (SE +/- 4.56, N = 3)
  2: 4164.5 (SE +/- 19.69, N = 3)
  3: 4180.3 (SE +/- 1.88, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3 - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
  1: 2379.4 (SE +/- 12.12, N = 3)
  2: 2374.5 (SE +/- 16.90, N = 3)
  3: 2388.4 (SE +/- 12.46, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.13 (Seconds, Fewer Is Better)
  1: 33.46 (SE +/- 0.02, N = 3)
  2: 33.37 (SE +/- 0.10, N = 3)
  3: 33.39 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Botan

Test: AES-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
  1: 4806.10 (SE +/- 4.51, N = 3)
  2: 4811.18 (SE +/- 0.85, N = 3)
  3: 4812.42 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

Botan 2.17.3 (MiB/s, More Is Better)
  1: 4815.97 (SE +/- 0.17, N = 3)
  2: 4814.83 (SE +/- 0.54, N = 3)
  3: 4815.62 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
  1: 47.78 (SE +/- 0.53, N = 15)
  2: 48.72 (SE +/- 0.10, N = 3)
  3: 48.78 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
  1: 908.31 (SE +/- 0.56, N = 3)
  2: 907.95 (SE +/- 0.82, N = 3)
  3: 908.47 (SE +/- 0.54, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

Botan 2.17.3 (MiB/s, More Is Better)
  1: 916.11 (SE +/- 0.12, N = 3)
  2: 913.03 (SE +/- 3.02, N = 3)
  3: 914.45 (SE +/- 1.80, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
  1: 539.24 (SE +/- 0.04, N = 3)
  2: 539.02 (SE +/- 0.37, N = 3)
  3: 539.34 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

Botan 2.17.3 (MiB/s, More Is Better)
  1: 543.60 (SE +/- 0.05, N = 3)
  2: 543.39 (SE +/- 0.40, N = 3)
  3: 543.95 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
  1: 437.30 (SE +/- 0.18, N = 3)
  2: 437.29 (SE +/- 0.15, N = 3)
  3: 436.74 (SE +/- 0.95, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

Botan 2.17.3 (MiB/s, More Is Better)
  1: 436.08 (SE +/- 0.16, N = 3)
  2: 436.12 (SE +/- 0.12, N = 3)
  3: 434.66 (SE +/- 1.83, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
  1: 172.09 (SE +/- 0.01, N = 3)
  2: 172.08 (SE +/- 0.01, N = 3)
  3: 172.15 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

Botan 2.17.3 (MiB/s, More Is Better)
  1: 171.95 (SE +/- 0.01, N = 3)
  2: 171.95 (SE +/- 0.00, N = 3)
  3: 171.96 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
  1: 107.86 (SE +/- 0.03, N = 3)
  2: 107.76 (SE +/- 0.07, N = 3)
  3: 107.88 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

Botan 2.17.3 (MiB/s, More Is Better)
  1: 112.98 (SE +/- 0.05, N = 3)
  2: 112.94 (SE +/- 0.05, N = 3)
  3: 112.99 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
  1: 25.06 (SE +/- 0.24, N = 3)
  2: 24.95 (SE +/- 0.29, N = 5)
  3: 25.07 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

ViennaCL

Test: CPU BLAS - dGEMM-TT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
  1: 35.5 (SE +/- 0.03, N = 3)
  2: 35.5 (SE +/- 0.03, N = 3)
  3: 35.5 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
  1: 36.5 (SE +/- 0.03, N = 3)
  2: 36.4 (SE +/- 0.07, N = 3)
  3: 36.5 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
  1: 34.1 (SE +/- 0.03, N = 3)
  2: 34.2 (SE +/- 0.07, N = 3)
  3: 34.1 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
  1: 34.8 (SE +/- 0.00, N = 3)
  2: 34.8 (SE +/- 0.00, N = 3)
  3: 34.8 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 46.5 (SE +/- 0.03, N = 3)
  2: 46.5 (SE +/- 0.00, N = 3)
  3: 46.5 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 45.5 (SE +/- 0.00, N = 3)
  2: 45.6 (SE +/- 0.03, N = 3)
  3: 45.5 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 44.1 (SE +/- 0.03, N = 3)
  2: 44.1 (SE +/- 0.00, N = 3)
  3: 44.1 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 39.3 (SE +/- 0.00, N = 3)
  2: 39.3 (SE +/- 0.00, N = 3)
  3: 39.3 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 26.2 (SE +/- 0.03, N = 3)
  2: 26.2 (SE +/- 0.00, N = 3)
  3: 26.2 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 47.9 (SE +/- 0.06, N = 3)
  2: 47.9 (SE +/- 0.03, N = 3)
  3: 47.9 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 42.8 (SE +/- 0.03, N = 3)
  2: 42.9 (SE +/- 0.07, N = 3)
  3: 42.8 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
  1: 28.2 (SE +/- 0.03, N = 3)
  2: 28.2 (SE +/- 0.03, N = 3)
  3: 28.3 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Basis Universal

Settings: ETC1S

Basis Universal 1.13 (Seconds, Fewer Is Better)
  1: 23.46 (SE +/- 0.03, N = 3)
  2: 23.39 (SE +/- 0.08, N = 3)
  3: 23.37 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
  1: 245.92 (SE +/- 0.06, N = 3)
  2: 245.89 (SE +/- 0.06, N = 3)
  3: 245.78 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 8.83898 (SE +/- 0.00652, N = 3; MIN: 4.68)
  2: 8.86501 (SE +/- 0.01330, N = 3; MIN: 4.67)
  3: 8.87618 (SE +/- 0.00592, N = 3; MIN: 4.77)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 2.14490 (SE +/- 0.00353, N = 3; MIN: 1.94)
  2: 2.14385 (SE +/- 0.00224, N = 3; MIN: 1.97)
  3: 2.14452 (SE +/- 0.00152, N = 3; MIN: 1.97)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.2 (FPS, More Is Better)
  1: 185.13 (SE +/- 0.14, N = 3; MIN: 174.56 / MAX: 210.52)
  2: 184.25 (SE +/- 0.13, N = 3; MIN: 173.91 / MAX: 209.75)
  3: 184.16 (SE +/- 0.19, N = 3; MIN: 173.76 / MAX: 209.16)
1. (CC) gcc options: -pthread -lm

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
  1: 250856667 (SE +/- 265476.51, N = 3)
  2: 251576667 (SE +/- 375470.08, N = 3)
  3: 251473333 (SE +/- 321679.62, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
  1: 130183333 (SE +/- 40551.75, N = 3)
  2: 130046667 (SE +/- 367891.89, N = 3)
  3: 130663333 (SE +/- 35276.68, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
  1: 67629667 (SE +/- 881.92, N = 3)
  2: 67632333 (SE +/- 1763.83, N = 3)
  3: 67599667 (SE +/- 27834.83, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
  1: 500333333 (SE +/- 324054.18, N = 3)
  2: 500263333 (SE +/- 188355.46, N = 3)
  3: 500016667 (SE +/- 199360.09, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
  1: 452713333 (SE +/- 1523508.38, N = 3)
  2: 452073333 (SE +/- 514274.03, N = 3)
  3: 453086667 (SE +/- 2209451.92, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.4 (Seconds, Fewer Is Better)
  1: 14.92 (SE +/- 0.01, N = 3)
  2: 14.91 (SE +/- 0.02, N = 3)
  3: 14.91 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
  1: 38.68 (SE +/- 0.05, N = 3)
  2: 37.94 (SE +/- 0.08, N = 3)
  3: 37.12 (SE +/- 0.51, N = 4)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenSCAD

Render: Retro Car

OpenSCAD (Seconds, Fewer Is Better)
  1: 17.26 (SE +/- 0.02, N = 3)
  2: 17.35 (SE +/- 0.10, N = 3)
  3: 17.37 (SE +/- 0.02, N = 3)
1. OpenSCAD version 2019.05

OpenSCAD

Render: Leonardo Phone Case Slim

OpenSCAD (Seconds, Fewer Is Better)
  1: 16.98 (SE +/- 0.01, N = 3)
  2: 16.94 (SE +/- 0.02, N = 3)
  3: 17.05 (SE +/- 0.01, N = 3)
1. OpenSCAD version 2019.05

dav1d

Video Input: Chimera 1080p

dav1d 0.8.2 (FPS, More Is Better)
  1: 733.64 (SE +/- 1.00, N = 3; MIN: 538.11 / MAX: 1144.18)
  2: 731.76 (SE +/- 2.01, N = 3; MIN: 538.01 / MAX: 1138.02)
  3: 732.83 (SE +/- 0.90, N = 3; MIN: 537.85 / MAX: 1141.98)
1. (CC) gcc options: -pthread -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 3.57740 (SE +/- 0.01205, N = 3; MIN: 3.23)
  2: 3.57455 (SE +/- 0.00753, N = 3; MIN: 3.24)
  3: 3.56577 (SE +/- 0.00251, N = 3; MIN: 3.26)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 1.66747 (SE +/- 0.00165, N = 3; MIN: 1.51)
  2: 1.66579 (SE +/- 0.00242, N = 3; MIN: 1.51)
  3: 1.66550 (SE +/- 0.00123, N = 3; MIN: 1.51)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 6

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
  1: 12.95 (SE +/- 0.03, N = 3)
  2: 13.01 (SE +/- 0.04, N = 3)
  3: 12.96 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 3.15433 (SE +/- 0.00701, N = 3; MIN: 3.1)
  2: 3.15157 (SE +/- 0.00442, N = 3; MIN: 3.1)
  3: 3.14943 (SE +/- 0.00951, N = 3; MIN: 3.09)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 3.44003 (SE +/- 0.00410, N = 3; MIN: 3.15)
  2: 3.43359 (SE +/- 0.00274, N = 3; MIN: 3.17)
  3: 3.43914 (SE +/- 0.00188, N = 3; MIN: 3.18)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
  1: 36.79 (SE +/- 0.05, N = 3)
  2: 36.73 (SE +/- 0.07, N = 3)
  3: 36.76 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
  1: 124.39 (SE +/- 0.42, N = 3)
  2: 122.64 (SE +/- 1.89, N = 3)
  3: 121.96 (SE +/- 1.32, N = 12)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 8.26103 (SE +/- 0.00549, N = 3; MIN: 8.14)
  2: 8.14695 (SE +/- 0.00679, N = 3; MIN: 8)
  3: 8.16788 (SE +/- 0.01559, N = 3; MIN: 8.03)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 1.83215 (SE +/- 0.00346, N = 3; MIN: 1.79)
  2: 1.82910 (SE +/- 0.00543, N = 3; MIN: 1.78)
  3: 1.82340 (SE +/- 0.00185, N = 3; MIN: 1.78)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.13 (Seconds, Fewer Is Better)
  1: 7.172 (SE +/- 0.000, N = 3)
  2: 7.091 (SE +/- 0.003, N = 3)
  3: 7.089 (SE +/- 0.005, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
  1: 141.48 (SE +/- 1.63, N = 6)
  2: 140.39 (SE +/- 0.46, N = 3)
  3: 139.82 (SE +/- 2.02, N = 4)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 14.28 (SE +/- 0.02, N = 3; MIN: 14.2)
  2: 14.30 (SE +/- 0.03, N = 3; MIN: 14.2)
  3: 14.29 (SE +/- 0.02, N = 3; MIN: 14.2)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 14.03 (SE +/- 0.03, N = 3; MIN: 13.92)
  2: 14.00 (SE +/- 0.01, N = 3; MIN: 13.9)
  3: 14.01 (SE +/- 0.02, N = 3; MIN: 13.88)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
  1: 15.37 (SE +/- 0.01, N = 3)
  2: 15.38 (SE +/- 0.01, N = 3)
  3: 15.36 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
  1: 106.19 (SE +/- 0.13, N = 3)
  2: 105.90 (SE +/- 0.06, N = 3)
  3: 105.78 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.2 (FPS, More Is Better)
  1: 648.26 (SE +/- 1.42, N = 3; MIN: 589.55 / MAX: 721.08)
  2: 649.67 (SE +/- 0.67, N = 3; MIN: 588.3 / MAX: 717.46)
  3: 647.63 (SE +/- 0.54, N = 3; MIN: 582.84 / MAX: 717.06)
1. (CC) gcc options: -pthread -lm

ASTC Encoder

Preset: Medium

ASTC Encoder 2.4 (Seconds, Fewer Is Better)
  1: 5.7949 (SE +/- 0.0231, N = 3)
  2: 5.7757 (SE +/- 0.0051, N = 3)
  3: 5.7737 (SE +/- 0.0062, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
  1: 5.778 (SE +/- 0.013, N = 3)
  2: 5.771 (SE +/- 0.013, N = 3)
  3: 5.787 (SE +/- 0.012, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
  1: 136.76 (SE +/- 0.26, N = 3)
  2: 136.22 (SE +/- 0.13, N = 3)
  3: 136.05 (SE +/- 0.35, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
  1: 173.31 (SE +/- 0.19, N = 3)
  2: 172.24 (SE +/- 0.43, N = 3)
  3: 172.10 (SE +/- 0.26, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
  1: 176.29 (SE +/- 0.31, N = 3)
  2: 175.27 (SE +/- 0.47, N = 3)
  3: 175.06 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Sysbench

Test: RAM / Memory

Sysbench 1.0.20 (MiB/sec, More Is Better)
  1: 28391.47 (SE +/- 98.12, N = 3)
  2: 28491.74 (SE +/- 138.44, N = 3)
  3: 28338.59 (SE +/- 41.95, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

libavif avifenc

Encoder Speed: 10

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
  1: 3.154 (SE +/- 0.004, N = 3)
  2: 3.160 (SE +/- 0.010, N = 3)
  3: 3.170 (SE +/- 0.012, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 6.63993 (SE +/- 0.00984, N = 3; MIN: 6)
  2: 6.64659 (SE +/- 0.01698, N = 3; MIN: 6)
  3: 6.61557 (SE +/- 0.00264, N = 3; MIN: 5.99)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
  1: 3.97706 (SE +/- 0.02149, N = 3; MIN: 3.65)
  2: 3.93198 (SE +/- 0.01212, N = 3; MIN: 3.64)
  3: 3.93216 (SE +/- 0.01771, N = 3; MIN: 3.62)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
  1: 230.27 (SE +/- 0.11, N = 3)
  2: 229.42 (SE +/- 0.19, N = 3)
  3: 229.36 (SE +/- 0.43, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
  1: 15.61 (SE +/- 0.03, N = 3)
  2: 15.23 (SE +/- 0.01, N = 3)
  3: 15.25 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
  1: 29.05 (SE +/- 0.22, N = 3)
  2: 29.05 (SE +/- 0.10, N = 3)
  3: 29.15 (SE +/- 0.20, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
  1: 29.21 (SE +/- 0.10, N = 3)
  2: 29.16 (SE +/- 0.05, N = 3)
  3: 29.18 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Systemd Total Boot Time

Test: Userspace

Systemd Total Boot Time (ms, Fewer Is Better)
  1: 22109
  2: 22109
  3: 22109

Systemd Total Boot Time

Test: Firmware

Systemd Total Boot Time (ms, Fewer Is Better)
  1: 15679
  2: 15679
  3: 15679

Systemd Total Boot Time

Test: Loader

Systemd Total Boot Time (ms, Fewer Is Better)
  1: 3555
  2: 3555
  3: 3555

Systemd Total Boot Time

Test: Kernel

Systemd Total Boot Time (ms, Fewer Is Better)
  1: 1941
  2: 1941
  3: 1941

Systemd Total Boot Time

Test: Total

Systemd Total Boot Time (ms, Fewer Is Better)
  1: 24050
  2: 24050
  3: 24050


Phoronix Test Suite v10.8.5