2700 march AMD Ryzen 7 2700 Eight-Core testing with a Gigabyte AB350N-Gaming WIFI-CF (F20 BIOS) and HIS AMD Radeon HD 6450/7450/8450 / R5 230 OEM 1GB on Ubuntu 19.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103315-IB-2700MARCH64&grs .
2700 march Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution 1 2 3 AMD Ryzen 7 2700 Eight-Core @ 3.20GHz (8 Cores / 16 Threads) Gigabyte AB350N-Gaming WIFI-CF (F20 BIOS) AMD 17h 16GB 120GB ADATA SU700 HIS AMD Radeon HD 6450/7450/8450 / R5 230 OEM 1GB AMD Caicos HDMI Audio DELL S2409W Realtek RTL8111/8168/8411 + Intel 3165 Ubuntu 19.10 5.9.0-050900rc7daily20201004-generic (x86_64) 20201003 GNOME Shell 3.34.1 X Server 1.20.5 3.3 Mesa 19.2.8 (LLVM 9.0.0) GCC 9.2.1 20191008 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800820b Python Details - Python 2.7.17 + Python 3.7.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
2700 march mnn: inception-v3 incompact3d: input.i3d 129 Cells Per Direction onednn: Recurrent Neural Network Training - f32 - CPU incompact3d: input.i3d 193 Cells Per Direction mnn: SqueezeNetV1.0 openscad: Projector Mount Swivel onednn: IP Shapes 3D - f32 - CPU stockfish: Total Time aom-av1: Speed 6 Realtime - Bosphorus 1080p mnn: resnet-v2-50 aom-av1: Speed 8 Realtime - Bosphorus 4K srslte: PHY_DL_Test openscad: Pistol sysbench: RAM / Memory luaradio: Hilbert Transform svt-vp9: Visual Quality Optimized - Bosphorus 1080p onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU aom-av1: Speed 6 Two-Pass - Bosphorus 4K openscad: Mini-ITX Case openscad: Retro Car luaradio: FM Deemphasis Filter luaradio: Five Back to Back FIR Filters basis: UASTC Level 0 onednn: Recurrent Neural Network Inference - u8s8f32 - CPU mnn: mobilenet-v1-1.0 viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-NN svt-vp9: VMAF Optimized - Bosphorus 1080p viennacl: CPU BLAS - dGEMM-TT viennacl: CPU BLAS - dGEMM-TN botan: AES-256 - Decrypt liquid-dsp: 2 - 256 - 57 viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - sCOPY aom-av1: Speed 4 Two-Pass - Bosphorus 1080p mnn: MobileNetV2_224 srslte: OFDM_Test viennacl: CPU BLAS - dDOT srslte: PHY_DL_Test svt-hevc: 7 - Bosphorus 1080p aom-av1: Speed 4 Two-Pass - Bosphorus 4K onednn: IP Shapes 1D - f32 - CPU build-linux-kernel: Time To Compile simdjson: Kostya onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU botan: ChaCha20Poly1305 botan: AES-256 onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - dAXPY aom-av1: Speed 6 Realtime - Bosphorus 4K luaradio: Complex Phase sysbench: CPU liquid-dsp: 1 - 256 - 57 build-nodejs: Time To Compile onednn: Recurrent Neural Network Inference - f32 - CPU liquid-dsp: 16 - 256 - 57 onednn: Recurrent Neural Network Training - u8s8f32 - CPU svt-hevc: 10 - Bosphorus 1080p onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU basis: ETC1S onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU viennacl: CPU BLAS - sDOT gmpbench: Total Time onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU botan: ChaCha20Poly1305 - Decrypt viennacl: CPU BLAS - dGEMV-N build-erlang: Time To Compile viennacl: CPU BLAS - dGEMV-T basis: UASTC Level 2 onednn: Deconvolution Batch shapes_1d - f32 - CPU svt-hevc: 1 - Bosphorus 1080p aom-av1: Speed 6 Two-Pass - Bosphorus 1080p liquid-dsp: 4 - 256 - 57 onednn: IP Shapes 1D - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU openscad: Leonardo Phone Case Slim liquid-dsp: 8 - 256 - 57 botan: Twofish - Decrypt svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p onednn: IP Shapes 3D - u8s8f32 - CPU basis: UASTC Level 3 botan: CAST-256 botan: Blowfish - Decrypt botan: Twofish botan: CAST-256 - Decrypt botan: Blowfish aom-av1: Speed 9 Realtime - Bosphorus 4K onednn: Convolution Batch Shapes Auto - f32 - CPU botan: KASUMI - Decrypt botan: KASUMI systemd-boot-total: Userspace systemd-boot-total: Firmware systemd-boot-total: Loader systemd-boot-total: Kernel systemd-boot-total: Total aom-av1: Speed 0 Two-Pass - Bosphorus 1080p aom-av1: Speed 0 Two-Pass - Bosphorus 4K simdjson: DistinctUserID simdjson: PartialTweets simdjson: LargeRand aom-av1: Speed 9 Realtime - Bosphorus 1080p aom-av1: Speed 8 Realtime - Bosphorus 1080p 1 2 3 54.185 44.3870951 8187.48 157.428889 10.149 170.706 13.3764 18133486 15.17 45.858 24.32 82.0 150.193 7979.40 92.1 77.66 8211.26 3.82 65.386 27.135 387.9 480.3 9.246 4343.32 4.918 32.7 33.3 93.63 34.2 34.7 5483.849 111863333 18.4 18.5 3.74 5.398 77866667 39.0 216.0 72.79 2.00 8.40798 112.365 2.08 4366.85 14.5525 484.192 5427.381 11.1635 27.6 27.8 8.61 551.7 14106.47 56320333 614.671 4355.03 438133333 8186.05 149.71 8.37857 31.194 4.98031 39.1 4923.1 24.4047 482.849 41.4 170.803 43.2 40.534 10.8615 5.46 11.14 209113333 8.10530 5.37265 25.401 378376667 352.194 98.92 2.79376 77.200 141.152 431.594 352.929 141.164 431.385 30.40 23.7916 87.844 91.552 23295 11240 5212 2761 26056 0.22 0.07 3.08 2.84 0.87 64.75 54.48 55.770 44.9321429 8294.96 160.035952 10.215 169.994 13.2109 18333344 15.07 46.225 24.58 81.4 151.185 8016.57 92.8 78.16 8166.93 3.85 65.864 27.118 386.8 483.4 9.219 4370.98 4.887 32.7 33.5 93.71 34.3 34.5 5461.957 111236667 18.5 18.4 3.73 5.415 78266667 39.2 214.9 72.89 2.01 8.42009 112.390 2.08 4351.79 14.5886 482.250 5406.600 11.1749 27.5 27.7 8.60 553.6 14107.97 56235333 615.075 4340.82 437706667 8197.44 150.09 8.38646 31.134 4.96746 39.0 4917.2 24.3446 481.666 41.4 171.209 43.2 40.502 10.8822 5.46 11.16 209460000 8.10551 5.37691 25.388 378373333 351.992 99.05 2.79736 77.178 141.211 431.245 353.089 141.194 431.348 30.39 23.7977 87.864 91.562 23295 11240 5212 2761 26056 0.22 0.07 3.08 2.84 0.87 64.84 54.90 54.033 44.1209208 8149.80 157.363368 10.066 172.293 13.2285 18110615 14.99 45.685 24.59 82.3 151.735 8052.05 92.0 78.29 8147.02 3.84 65.652 27.296 389.3 481.8 9.187 4346.16 4.904 32.5 33.3 93.16 34.4 34.5 5493.612 111496667 18.4 18.5 3.75 5.387 78100000 39.2 216.0 73.16 2.01 8.37864 111.852 2.09 4347.64 14.6134 484.226 5426.079 11.1327 27.5 27.7 8.63 552.3 14154.66 56134333 613.065 4341.46 439076667 8172.53 149.66 8.36378 31.113 4.97285 39.0 4929.8 24.3779 482.571 41.3 170.802 43.1 40.450 10.8773 5.47 11.16 209336667 8.11780 5.36897 25.425 377836667 352.489 99.03 2.79570 77.122 141.271 431.376 353.065 141.131 431.517 30.40 23.7932 87.857 91.553 23295 11240 5212 2761 26056 0.22 0.07 3.08 2.84 0.87 62.67 55.96 OpenBenchmarking.org
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: inception-v3 1 2 3 13 26 39 52 65 SE +/- 0.40, N = 3 SE +/- 1.06, N = 3 SE +/- 0.14, N = 3 54.19 55.77 54.03 MIN: 53.22 / MAX: 69.33 MIN: 53.52 / MAX: 70.99 MIN: 53.61 / MAX: 68.4 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 1 2 3 10 20 30 40 50 SE +/- 0.60, N = 3 SE +/- 0.46, N = 3 SE +/- 0.07, N = 3 44.39 44.93 44.12 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 2K 4K 6K 8K 10K SE +/- 3.41, N = 3 SE +/- 80.20, N = 3 SE +/- 16.54, N = 3 8187.48 8294.96 8149.80 MIN: 8175.57 MIN: 8168.26 MIN: 8112.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction 1 2 3 40 80 120 160 200 SE +/- 0.13, N = 3 SE +/- 1.44, N = 12 SE +/- 0.10, N = 3 157.43 160.04 157.36 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: SqueezeNetV1.0 1 2 3 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 10.15 10.22 10.07 MIN: 9.96 / MAX: 15.18 MIN: 9.92 / MAX: 25.01 MIN: 9.92 / MAX: 12.59 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenSCAD Render: Projector Mount Swivel OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Projector Mount Swivel 1 2 3 40 80 120 160 200 SE +/- 0.17, N = 3 SE +/- 1.17, N = 3 SE +/- 0.80, N = 3 170.71 169.99 172.29 1. OpenSCAD version 2019.01-RC2
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 13.38 13.21 13.23 MIN: 12.91 MIN: 12.88 MIN: 12.9 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time 1 2 3 4M 8M 12M 16M 20M SE +/- 262210.51, N = 4 SE +/- 135386.39, N = 3 SE +/- 240236.23, N = 5 18133486 18333344 18110615 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p 1 2 3 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.12, N = 3 15.17 15.07 14.99 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: resnet-v2-50 1 2 3 10 20 30 40 50 SE +/- 0.14, N = 3 SE +/- 0.48, N = 3 SE +/- 0.13, N = 3 45.86 46.23 45.69 MIN: 45.28 / MAX: 65.36 MIN: 45.1 / MAX: 60.21 MIN: 45.03 / MAX: 60.75 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K 1 2 3 6 12 18 24 30 SE +/- 0.35, N = 4 SE +/- 0.04, N = 3 SE +/- 0.11, N = 3 24.32 24.58 24.59 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
srsLTE Test: PHY_DL_Test OpenBenchmarking.org UE Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test 1 2 3 20 40 60 80 100 SE +/- 0.20, N = 3 SE +/- 0.60, N = 3 SE +/- 0.13, N = 3 82.0 81.4 82.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lm -lfftw3f -lmbedcrypto
OpenSCAD Render: Pistol OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Pistol 1 2 3 30 60 90 120 150 SE +/- 0.29, N = 3 SE +/- 0.45, N = 3 SE +/- 0.85, N = 3 150.19 151.19 151.74 1. OpenSCAD version 2019.01-RC2
Sysbench Test: RAM / Memory OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory 1 2 3 2K 4K 6K 8K 10K SE +/- 21.59, N = 3 SE +/- 7.05, N = 3 SE +/- 22.63, N = 3 7979.40 8016.57 8052.05 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
LuaRadio Test: Hilbert Transform OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Hilbert Transform 1 2 3 20 40 60 80 100 SE +/- 0.23, N = 3 SE +/- 0.23, N = 3 SE +/- 0.23, N = 3 92.1 92.8 92.0
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p 1 2 3 20 40 60 80 100 SE +/- 0.23, N = 3 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 77.66 78.16 78.29 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 2K 4K 6K 8K 10K SE +/- 15.18, N = 3 SE +/- 8.62, N = 3 SE +/- 18.94, N = 3 8211.26 8166.93 8147.02 MIN: 8168.07 MIN: 8149.6 MIN: 8105.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K 1 2 3 0.8663 1.7326 2.5989 3.4652 4.3315 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.82 3.85 3.84 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenSCAD Render: Mini-ITX Case OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Mini-ITX Case 1 2 3 15 30 45 60 75 SE +/- 0.15, N = 3 SE +/- 0.24, N = 3 SE +/- 0.24, N = 3 65.39 65.86 65.65 1. OpenSCAD version 2019.01-RC2
OpenSCAD Render: Retro Car OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Retro Car 1 2 3 6 12 18 24 30 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 27.14 27.12 27.30 1. OpenSCAD version 2019.01-RC2
LuaRadio Test: FM Deemphasis Filter OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: FM Deemphasis Filter 1 2 3 80 160 240 320 400 SE +/- 0.55, N = 3 SE +/- 1.30, N = 3 SE +/- 1.13, N = 3 387.9 386.8 389.3
LuaRadio Test: Five Back to Back FIR Filters OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Five Back to Back FIR Filters 1 2 3 100 200 300 400 500 SE +/- 2.34, N = 3 SE +/- 1.63, N = 3 SE +/- 0.71, N = 3 480.3 483.4 481.8
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 1 2 3 3 6 9 12 15 SE +/- 0.007, N = 3 SE +/- 0.015, N = 3 SE +/- 0.005, N = 3 9.246 9.219 9.187 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 2 3 900 1800 2700 3600 4500 SE +/- 14.69, N = 3 SE +/- 13.60, N = 3 SE +/- 7.09, N = 3 4343.32 4370.98 4346.16 MIN: 4304.78 MIN: 4346.29 MIN: 4326.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: mobilenet-v1-1.0 1 2 3 1.1066 2.2132 3.3198 4.4264 5.533 SE +/- 0.034, N = 3 SE +/- 0.055, N = 3 SE +/- 0.009, N = 3 4.918 4.887 4.904 MIN: 4.83 / MAX: 6.07 MIN: 4.79 / MAX: 8.37 MIN: 4.85 / MAX: 19.45 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT 1 2 3 8 16 24 32 40 SE +/- 0.36, N = 3 SE +/- 0.35, N = 3 SE +/- 0.42, N = 3 32.7 32.7 32.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN 1 2 3 8 16 24 32 40 SE +/- 0.29, N = 3 SE +/- 0.40, N = 3 SE +/- 0.40, N = 3 33.3 33.5 33.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p 1 2 3 20 40 60 80 100 SE +/- 1.49, N = 3 SE +/- 1.35, N = 3 SE +/- 1.39, N = 3 93.63 93.71 93.16 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT 1 2 3 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.12, N = 3 SE +/- 0.19, N = 3 34.2 34.3 34.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN 1 2 3 8 16 24 32 40 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 34.7 34.5 34.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Botan Test: AES-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt 1 2 3 1200 2400 3600 4800 6000 SE +/- 4.48, N = 3 SE +/- 16.12, N = 3 SE +/- 4.78, N = 3 5483.85 5461.96 5493.61 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Liquid-DSP Threads: 2 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 2 - Buffer Length: 256 - Filter Length: 57 1 2 3 20M 40M 60M 80M 100M SE +/- 462793.45, N = 3 SE +/- 287595.39, N = 3 SE +/- 362276.75, N = 3 111863333 111236667 111496667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY 1 2 3 5 10 15 20 25 SE +/- 0.00, N = 2 18.4 18.5 18.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY 1 2 3 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 18.5 18.4 18.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p 1 2 3 0.8438 1.6876 2.5314 3.3752 4.219 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.74 3.73 3.75 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: MobileNetV2_224 1 2 3 1.2184 2.4368 3.6552 4.8736 6.092 SE +/- 0.047, N = 3 SE +/- 0.077, N = 3 SE +/- 0.021, N = 3 5.398 5.415 5.387 MIN: 5.27 / MAX: 11.32 MIN: 5.3 / MAX: 24.9 MIN: 5.31 / MAX: 7.46 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
srsLTE Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsLTE 20.10.1 Test: OFDM_Test 1 2 3 20M 40M 60M 80M 100M SE +/- 371184.29, N = 3 SE +/- 88191.71, N = 3 SE +/- 305505.05, N = 3 77866667 78266667 78100000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lm -lfftw3f -lmbedcrypto
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT 1 2 3 9 18 27 36 45 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 39.0 39.2 39.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
srsLTE Test: PHY_DL_Test OpenBenchmarking.org eNb Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test 1 2 3 50 100 150 200 250 SE +/- 0.39, N = 3 SE +/- 0.17, N = 3 SE +/- 0.46, N = 3 216.0 214.9 216.0 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lm -lfftw3f -lmbedcrypto
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p 1 2 3 16 32 48 64 80 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 72.79 72.89 73.16 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K 1 2 3 0.4523 0.9046 1.3569 1.8092 2.2615 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.00 2.01 2.01 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.00658, N = 3 SE +/- 0.00479, N = 3 SE +/- 0.01107, N = 3 8.40798 8.42009 8.37864 MIN: 8.2 MIN: 8.19 MIN: 8.09 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.10.20 Time To Compile 1 2 3 30 60 90 120 150 SE +/- 1.03, N = 3 SE +/- 1.50, N = 3 SE +/- 0.94, N = 3 112.37 112.39 111.85
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: Kostya 1 2 3 0.4703 0.9406 1.4109 1.8812 2.3515 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 2.08 2.08 2.09 1. (CXX) g++ options: -O3 -pthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 900 1800 2700 3600 4500 SE +/- 5.40, N = 3 SE +/- 6.88, N = 3 SE +/- 2.52, N = 3 4366.85 4351.79 4347.64 MIN: 4351.4 MIN: 4329.09 MIN: 4335.4 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 14.55 14.59 14.61 MIN: 14.15 MIN: 14.16 MIN: 13.97 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Botan Test: ChaCha20Poly1305 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 1 2 3 100 200 300 400 500 SE +/- 0.30, N = 3 SE +/- 1.00, N = 3 SE +/- 0.20, N = 3 484.19 482.25 484.23 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 1 2 3 1200 2400 3600 4800 6000 SE +/- 1.56, N = 3 SE +/- 10.44, N = 3 SE +/- 1.41, N = 3 5427.38 5406.60 5426.08 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 11.16 11.17 11.13 MIN: 11.09 MIN: 11.08 MIN: 11.05 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY 1 2 3 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 27.6 27.5 27.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY 1 2 3 7 14 21 28 35 SE +/- 0.05, N = 2 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 27.8 27.7 27.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 8.61 8.60 8.63 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
LuaRadio Test: Complex Phase OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Complex Phase 1 2 3 120 240 360 480 600 SE +/- 1.45, N = 3 SE +/- 1.66, N = 3 SE +/- 1.02, N = 3 551.7 553.6 552.3
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU 1 2 3 3K 6K 9K 12K 15K SE +/- 0.53, N = 3 SE +/- 0.20, N = 3 SE +/- 6.91, N = 3 14106.47 14107.97 14154.66 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 1 2 3 12M 24M 36M 48M 60M SE +/- 221002.51, N = 3 SE +/- 183205.11, N = 3 SE +/- 290873.59, N = 3 56320333 56235333 56134333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 15.11 Time To Compile 1 2 3 130 260 390 520 650 SE +/- 1.62, N = 3 SE +/- 1.55, N = 3 SE +/- 1.22, N = 3 614.67 615.08 613.07
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 900 1800 2700 3600 4500 SE +/- 6.89, N = 3 SE +/- 13.08, N = 3 SE +/- 12.70, N = 3 4355.03 4340.82 4341.46 MIN: 4334.34 MIN: 4309.94 MIN: 4314.23 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 1 2 3 90M 180M 270M 360M 450M SE +/- 653715.87, N = 3 SE +/- 698721.05, N = 3 SE +/- 745035.42, N = 3 438133333 437706667 439076667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 2 3 2K 4K 6K 8K 10K SE +/- 18.76, N = 3 SE +/- 18.96, N = 3 SE +/- 14.12, N = 3 8186.05 8197.44 8172.53 MIN: 8154.67 MIN: 8159.65 MIN: 8141.85 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p 1 2 3 30 60 90 120 150 SE +/- 0.38, N = 3 SE +/- 0.36, N = 3 SE +/- 0.38, N = 3 149.71 150.09 149.66 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.01097, N = 3 SE +/- 0.00543, N = 3 SE +/- 0.01793, N = 3 8.37857 8.38646 8.36378 MIN: 8.25 MIN: 8.25 MIN: 8.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S 1 2 3 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 31.19 31.13 31.11 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 2 3 1.1206 2.2412 3.3618 4.4824 5.603 SE +/- 0.00815, N = 3 SE +/- 0.00255, N = 3 SE +/- 0.00159, N = 3 4.98031 4.96746 4.97285 MIN: 4.9 MIN: 4.89 MIN: 4.88 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT 1 2 3 9 18 27 36 45 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 39.1 39.0 39.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
GNU GMP GMPbench Total Time OpenBenchmarking.org GMPbench Score, More Is Better GNU GMP GMPbench 6.2.1 Total Time 1 2 3 1100 2200 3300 4400 5500 4923.1 4917.2 4929.8 1. (CC) gcc options: -O3 -fomit-frame-pointer -lm
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 2 3 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 24.40 24.34 24.38 MIN: 23.71 MIN: 23.78 MIN: 23.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Botan Test: ChaCha20Poly1305 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt 1 2 3 100 200 300 400 500 SE +/- 0.25, N = 3 SE +/- 0.72, N = 3 SE +/- 0.25, N = 3 482.85 481.67 482.57 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N 1 2 3 9 18 27 36 45 SE +/- 0.15, N = 3 SE +/- 0.18, N = 3 SE +/- 0.18, N = 3 41.4 41.4 41.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Timed Erlang/OTP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Erlang/OTP Compilation 23.2 Time To Compile 1 2 3 40 80 120 160 200 SE +/- 0.10, N = 3 SE +/- 0.12, N = 3 SE +/- 0.15, N = 3 170.80 171.21 170.80
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T 1 2 3 10 20 30 40 50 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.07, N = 3 43.2 43.2 43.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 1 2 3 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.11, N = 3 40.53 40.50 40.45 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 10.86 10.88 10.88 MIN: 10.55 MIN: 10.47 MIN: 10.54 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p 1 2 3 1.2308 2.4616 3.6924 4.9232 6.154 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 5.46 5.46 5.47 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 11.14 11.16 11.16 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Liquid-DSP Threads: 4 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 1 2 3 40M 80M 120M 160M 200M SE +/- 384071.75, N = 3 SE +/- 308598.98, N = 3 SE +/- 626905.98, N = 3 209113333 209460000 209336667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.00328, N = 3 SE +/- 0.00340, N = 3 SE +/- 0.01300, N = 3 8.10530 8.10551 8.11780 MIN: 7.13 MIN: 7.12 MIN: 7.78 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 2 3 1.2098 2.4196 3.6294 4.8392 6.049 SE +/- 0.00225, N = 3 SE +/- 0.01071, N = 3 SE +/- 0.00128, N = 3 5.37265 5.37691 5.36897 MIN: 5.28 MIN: 5.24 MIN: 5.28 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenSCAD Render: Leonardo Phone Case Slim OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Leonardo Phone Case Slim 1 2 3 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 25.40 25.39 25.43 1. OpenSCAD version 2019.01-RC2
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 1 2 3 80M 160M 240M 320M 400M SE +/- 54873.59, N = 3 SE +/- 140988.57, N = 3 SE +/- 471322.02, N = 3 378376667 378373333 377836667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Botan Test: Twofish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt 1 2 3 80 160 240 320 400 SE +/- 0.21, N = 3 SE +/- 0.01, N = 3 SE +/- 0.14, N = 3 352.19 351.99 352.49 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p 1 2 3 20 40 60 80 100 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 SE +/- 0.39, N = 3 98.92 99.05 99.03 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.6294 1.2588 1.8882 2.5176 3.147 SE +/- 0.00339, N = 3 SE +/- 0.00485, N = 3 SE +/- 0.00436, N = 3 2.79376 2.79736 2.79570 MIN: 2.57 MIN: 2.59 MIN: 2.6 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 1 2 3 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 77.20 77.18 77.12 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Botan Test: CAST-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 1 2 3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 141.15 141.21 141.27 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt 1 2 3 90 180 270 360 450 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 431.59 431.25 431.38 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish 1 2 3 80 160 240 320 400 SE +/- 0.24, N = 3 SE +/- 0.07, N = 3 SE +/- 0.21, N = 3 352.93 353.09 353.07 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt 1 2 3 30 60 90 120 150 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 141.16 141.19 141.13 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish 1 2 3 90 180 270 360 450 SE +/- 0.04, N = 3 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 431.39 431.35 431.52 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K 1 2 3 7 14 21 28 35 SE +/- 0.32, N = 8 SE +/- 0.30, N = 9 SE +/- 0.35, N = 6 30.40 30.39 30.40 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 23.79 23.80 23.79 MIN: 23.24 MIN: 23.38 MIN: 23.46 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Botan Test: KASUMI - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt 1 2 3 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 87.84 87.86 87.86 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI 1 2 3 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 91.55 91.56 91.55 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Systemd Total Boot Time Test: Userspace OpenBenchmarking.org ms, Fewer Is Better Systemd Total Boot Time Test: Userspace 1 2 3 5K 10K 15K 20K 25K 23295 23295 23295
Systemd Total Boot Time Test: Firmware OpenBenchmarking.org ms, Fewer Is Better Systemd Total Boot Time Test: Firmware 1 2 3 2K 4K 6K 8K 10K 11240 11240 11240
Systemd Total Boot Time Test: Loader OpenBenchmarking.org ms, Fewer Is Better Systemd Total Boot Time Test: Loader 1 2 3 1100 2200 3300 4400 5500 5212 5212 5212
Systemd Total Boot Time Test: Kernel OpenBenchmarking.org ms, Fewer Is Better Systemd Total Boot Time Test: Kernel 1 2 3 600 1200 1800 2400 3000 2761 2761 2761
Systemd Total Boot Time Test: Total OpenBenchmarking.org ms, Fewer Is Better Systemd Total Boot Time Test: Total 1 2 3 6K 12K 18K 24K 30K 26056 26056 26056
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p 1 2 3 0.0495 0.099 0.1485 0.198 0.2475 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.22 0.22 0.22 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K 1 2 3 0.0158 0.0316 0.0474 0.0632 0.079 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.07 0.07 0.07 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: DistinctUserID 1 2 3 0.693 1.386 2.079 2.772 3.465 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.08 3.08 3.08 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: PartialTweets 1 2 3 0.639 1.278 1.917 2.556 3.195 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 2.84 2.84 2.84 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.8.2 Throughput Test: LargeRandom 1 2 3 0.1958 0.3916 0.5874 0.7832 0.979 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.87 0.87 0.87 1. (CXX) g++ options: -O3 -pthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p 1 2 3 14 28 42 56 70 SE +/- 1.45, N = 15 SE +/- 1.43, N = 15 SE +/- 1.64, N = 12 64.75 64.84 62.67 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.0 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p 1 2 3 13 26 39 52 65 SE +/- 0.92, N = 15 SE +/- 0.83, N = 15 SE +/- 0.50, N = 3 54.48 54.90 55.96 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Phoronix Test Suite v10.8.4