2700 march

AMD Ryzen 7 2700 Eight-Core testing with a Gigabyte AB350N-Gaming WIFI-CF (F20 BIOS) and HIS AMD Radeon HD 6450/7450/8450 / R5 230 OEM 1GB on Ubuntu 19.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103315-IB-2700MARCH64&grr .
2700 march: system details for runs 1, 2, 3

Processor: AMD Ryzen 7 2700 Eight-Core @ 3.20GHz (8 Cores / 16 Threads)
Motherboard: Gigabyte AB350N-Gaming WIFI-CF (F20 BIOS)
Chipset: AMD 17h
Memory: 16GB
Disk: 120GB ADATA SU700
Graphics: HIS AMD Radeon HD 6450/7450/8450 / R5 230 OEM 1GB
Audio: AMD Caicos HDMI Audio
Monitor: DELL S2409W
Network: Realtek RTL8111/8168/8411 + Intel 3165
OS: Ubuntu 19.10
Kernel: 5.9.0-050900rc7daily20201004-generic (x86_64) 20201003
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
OpenGL: 3.3 Mesa 19.2.8 (LLVM 9.0.0)
Compiler: GCC 9.2.1 20191008
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details:
Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
CPU Microcode: 0x800820b

Python Details: Python 2.7.17 + Python 3.7.5

Security Details:
itlb_multihit: Not affected
l1tf: Not affected
mds: Not affected
meltdown: Not affected
spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
srbds: Not affected
tsx_async_abort: Not affected
2700 march: result summary

Test | 1 | 2 | 3
build-nodejs: Time To Compile | 614.671 | 615.075 | 613.065
aom-av1: Speed 4 Two-Pass - Bosphorus 4K | 2.00 | 2.01 | 2.01
incompact3d: input.i3d 193 Cells Per Direction | 157.428889 | 160.035952 | 157.363368
aom-av1: Speed 0 Two-Pass - Bosphorus 4K | 0.07 | 0.07 | 0.07
luaradio: Complex Phase | 551.7 | 553.6 | 552.3
luaradio: Hilbert Transform | 92.1 | 92.8 | 92.0
luaradio: FM Deemphasis Filter | 387.9 | 386.8 | 389.3
luaradio: Five Back to Back FIR Filters | 480.3 | 483.4 | 481.8
aom-av1: Speed 6 Two-Pass - Bosphorus 4K | 3.82 | 3.85 | 3.84
aom-av1: Speed 4 Two-Pass - Bosphorus 1080p | 3.74 | 3.73 | 3.75
openscad: Projector Mount Swivel | 170.706 | 169.994 | 172.293
build-erlang: Time To Compile | 170.803 | 171.209 | 170.802
openscad: Pistol | 150.193 | 151.185 | 151.735
gmpbench: Total Time | 4923.1 | 4917.2 | 4929.8
mnn: inception-v3 | 54.185 | 55.770 | 54.033
mnn: mobilenet-v1-1.0 | 4.918 | 4.887 | 4.904
mnn: MobileNetV2_224 | 5.398 | 5.415 | 5.387
mnn: resnet-v2-50 | 45.858 | 46.225 | 45.685
mnn: SqueezeNetV1.0 | 10.149 | 10.215 | 10.066
build-linux-kernel: Time To Compile | 112.365 | 112.390 | 111.852
svt-hevc: 1 - Bosphorus 1080p | 5.46 | 5.46 | 5.47
onednn: Recurrent Neural Network Training - f32 - CPU | 8187.48 | 8294.96 | 8149.80
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 8211.26 | 8166.93 | 8147.02
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 8186.05 | 8197.44 | 8172.53
aom-av1: Speed 0 Two-Pass - Bosphorus 1080p | 0.22 | 0.22 | 0.22
sysbench: CPU | 14106.47 | 14107.97 | 14154.66
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 4366.85 | 4351.79 | 4347.64
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 4343.32 | 4370.98 | 4346.16
onednn: Recurrent Neural Network Inference - f32 - CPU | 4355.03 | 4340.82 | 4341.46
basis: UASTC Level 3 | 77.200 | 77.178 | 77.122
simdjson: PartialTweets | 2.84 | 2.84 | 2.84
simdjson: DistinctUserID | 3.08 | 3.08 | 3.08
aom-av1: Speed 6 Realtime - Bosphorus 4K | 8.61 | 8.60 | 8.63
aom-av1: Speed 6 Two-Pass - Bosphorus 1080p | 11.14 | 11.16 | 11.16
stockfish: Total Time | 18133486 | 18333344 | 18110615
openscad: Mini-ITX Case | 65.386 | 65.864 | 65.652
simdjson: Kostya | 2.08 | 2.08 | 2.09
srslte: OFDM_Test | 77866667 | 78266667 | 78100000
aom-av1: Speed 9 Realtime - Bosphorus 4K | 30.40 | 30.39 | 30.40
simdjson: LargeRand | 0.87 | 0.87 | 0.87
srslte: PHY_DL_Test | 82.0 | 81.4 | 82.3
srslte: PHY_DL_Test | 216.0 | 214.9 | 216.0
aom-av1: Speed 9 Realtime - Bosphorus 1080p | 64.75 | 64.84 | 62.67
incompact3d: input.i3d 129 Cells Per Direction | 44.3870951 | 44.9321429 | 44.1209208
aom-av1: Speed 8 Realtime - Bosphorus 1080p | 54.48 | 54.90 | 55.96
basis: UASTC Level 2 | 40.534 | 40.502 | 40.450
aom-av1: Speed 6 Realtime - Bosphorus 1080p | 15.17 | 15.07 | 14.99
botan: AES-256 - Decrypt | 5483.849 | 5461.957 | 5493.612
botan: AES-256 | 5427.381 | 5406.600 | 5426.079
basis: ETC1S | 31.194 | 31.134 | 31.113
botan: Blowfish - Decrypt | 431.594 | 431.245 | 431.376
botan: Blowfish | 431.385 | 431.348 | 431.517
botan: ChaCha20Poly1305 - Decrypt | 482.849 | 481.666 | 482.571
botan: ChaCha20Poly1305 | 484.192 | 482.250 | 484.226
botan: Twofish - Decrypt | 352.194 | 351.992 | 352.489
botan: Twofish | 352.929 | 353.089 | 353.065
botan: CAST-256 - Decrypt | 141.164 | 141.194 | 141.131
botan: CAST-256 | 141.152 | 141.211 | 141.271
botan: KASUMI - Decrypt | 87.844 | 87.864 | 87.857
botan: KASUMI | 91.552 | 91.562 | 91.553
aom-av1: Speed 8 Realtime - Bosphorus 4K | 24.32 | 24.58 | 24.59
openscad: Retro Car | 27.135 | 27.118 | 27.296
viennacl: CPU BLAS - dCOPY | 18.4 | 18.5 | 18.4
viennacl: CPU BLAS - dGEMM-TT | 34.2 | 34.3 | 34.4
viennacl: CPU BLAS - dGEMM-TN | 34.7 | 34.5 | 34.5
viennacl: CPU BLAS - dGEMM-NT | 32.7 | 32.7 | 32.5
viennacl: CPU BLAS - dGEMM-NN | 33.3 | 33.5 | 33.3
viennacl: CPU BLAS - dGEMV-T | 43.2 | 43.2 | 43.1
viennacl: CPU BLAS - dGEMV-N | 41.4 | 41.4 | 41.3
viennacl: CPU BLAS - dDOT | 39.0 | 39.2 | 39.2
viennacl: CPU BLAS - dAXPY | 27.8 | 27.7 | 27.7
viennacl: CPU BLAS - sDOT | 39.1 | 39.0 | 39.0
viennacl: CPU BLAS - sAXPY | 27.6 | 27.5 | 27.5
viennacl: CPU BLAS - sCOPY | 18.5 | 18.4 | 18.5
openscad: Leonardo Phone Case Slim | 25.401 | 25.388 | 25.425
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 10.8615 | 10.8822 | 10.8773
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 8.37857 | 8.38646 | 8.36378
liquid-dsp: 2 - 256 - 57 | 111863333 | 111236667 | 111496667
liquid-dsp: 1 - 256 - 57 | 56320333 | 56235333 | 56134333
liquid-dsp: 4 - 256 - 57 | 209113333 | 209460000 | 209336667
liquid-dsp: 16 - 256 - 57 | 438133333 | 437706667 | 439076667
liquid-dsp: 8 - 256 - 57 | 378376667 | 378373333 | 377836667
onednn: IP Shapes 1D - f32 - CPU | 8.40798 | 8.42009 | 8.37864
onednn: IP Shapes 1D - u8s8f32 - CPU | 8.10530 | 8.10551 | 8.11780
sysbench: RAM / Memory | 7979.40 | 8016.57 | 8052.05
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 5.37265 | 5.37691 | 5.36897
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 4.98031 | 4.96746 | 4.97285
onednn: IP Shapes 3D - f32 - CPU | 13.3764 | 13.2109 | 13.2285
basis: UASTC Level 0 | 9.246 | 9.219 | 9.187
onednn: IP Shapes 3D - u8s8f32 - CPU | 2.79376 | 2.79736 | 2.79570
svt-hevc: 7 - Bosphorus 1080p | 72.79 | 72.89 | 73.16
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 77.66 | 78.16 | 78.29
svt-vp9: VMAF Optimized - Bosphorus 1080p | 93.63 | 93.71 | 93.16
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 98.92 | 99.05 | 99.03
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 24.4047 | 24.3446 | 24.3779
onednn: Convolution Batch Shapes Auto - f32 - CPU | 23.7916 | 23.7977 | 23.7932
svt-hevc: 10 - Bosphorus 1080p | 149.71 | 150.09 | 149.66
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 14.5525 | 14.5886 | 14.6134
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 11.1635 | 11.1749 | 11.1327
systemd-boot-total: Total | 26056 | 26056 | 26056
systemd-boot-total: Firmware | 11240 | 11240 | 11240
systemd-boot-total: Loader | 5212 | 5212 | 5212
systemd-boot-total: Kernel | 2761 | 2761 | 2761
systemd-boot-total: Userspace | 23295 | 23295 | 23295
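The three result columns are repeat runs on one system, so a quick way to read the summary table is to check how far the runs drift apart on each row. The following Python sketch does that for four rows copied from the table. `relative_spread` is a hypothetical helper name, not something the Phoronix Test Suite provides.

```python
from statistics import mean

# A few rows copied from the summary table: (run 1, run 2, run 3).
rows = {
    "build-nodejs: Time To Compile": (614.671, 615.075, 613.065),
    "incompact3d: input.i3d 193 Cells Per Direction": (157.428889, 160.035952, 157.363368),
    "aom-av1: Speed 9 Realtime - Bosphorus 1080p": (64.75, 64.84, 62.67),
    "botan: KASUMI": (91.552, 91.562, 91.553),
}


def relative_spread(values):
    """Return (max - min) / mean of the runs, as a percentage."""
    return 100.0 * (max(values) - min(values)) / mean(values)


for name, values in rows.items():
    print(f"{name}: {relative_spread(values):.2f}% spread")
```

Rows with a spread well above the others are the ones worth checking against the per-test SE and N values that follow.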
Timed Node.js Compilation 15.11 | Time To Compile | Seconds, Fewer Is Better
1: 614.67 (SE +/- 1.62, N = 3)
2: 615.08 (SE +/- 1.55, N = 3)
3: 613.07 (SE +/- 1.22, N = 3)

AOM AV1 3.0 | Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K | Frames Per Second, More Is Better
1: 2.00 (SE +/- 0.00, N = 3)
2: 2.01 (SE +/- 0.00, N = 3)
3: 2.01 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Xcompact3d Incompact3d 2021-03-11 | Input: input.i3d 193 Cells Per Direction | Seconds, Fewer Is Better
1: 157.43 (SE +/- 0.13, N = 3)
2: 160.04 (SE +/- 1.44, N = 12)
3: 157.36 (SE +/- 0.10, N = 3)
(F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
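In the Incompact3d 193-cell result, run 2 was repeated 12 times and reports a much larger SE than the other two runs. Assuming "SE" here means the standard error of the mean (SE = s / sqrt(N), which is an assumption about the export rather than something stated in it), the sketch below recovers the implied sample standard deviation and expresses the gap between runs 1 and 2 in units of their combined standard error. Both function names are hypothetical.

```python
import math


def stddev_from_se(se, n):
    """Invert SE = s / sqrt(N) to get the sample standard deviation s."""
    return se * math.sqrt(n)


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Difference of two means divided by their combined standard error."""
    combined_se = math.hypot(se_a, se_b)  # sqrt(se_a**2 + se_b**2)
    return (mean_b - mean_a) / combined_se


# Run 2: SE +/- 1.44 with N = 12.
print(f"implied stddev of run 2: {stddev_from_se(1.44, 12):.2f} s")

# Run 1 (157.43, SE 0.13) against run 2 (160.04, SE 1.44).
print(f"run 2 - run 1: {difference_in_se_units(157.43, 0.13, 160.04, 1.44):.2f} combined SEs")
```

A gap under roughly two combined standard errors is usually read as run-to-run noise rather than a real difference, though with so few samples this is only a rough guide.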
AOM AV1 3.0 | Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K | Frames Per Second, More Is Better
1: 0.07 (SE +/- 0.00, N = 3)
2: 0.07 (SE +/- 0.00, N = 3)
3: 0.07 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

LuaRadio 0.9.1 | Test: Complex Phase | MiB/s, More Is Better
1: 551.7 (SE +/- 1.45, N = 3)
2: 553.6 (SE +/- 1.66, N = 3)
3: 552.3 (SE +/- 1.02, N = 3)

LuaRadio 0.9.1 | Test: Hilbert Transform | MiB/s, More Is Better
1: 92.1 (SE +/- 0.23, N = 3)
2: 92.8 (SE +/- 0.23, N = 3)
3: 92.0 (SE +/- 0.23, N = 3)

LuaRadio 0.9.1 | Test: FM Deemphasis Filter | MiB/s, More Is Better
1: 387.9 (SE +/- 0.55, N = 3)
2: 386.8 (SE +/- 1.30, N = 3)
3: 389.3 (SE +/- 1.13, N = 3)

LuaRadio 0.9.1 | Test: Five Back to Back FIR Filters | MiB/s, More Is Better
1: 480.3 (SE +/- 2.34, N = 3)
2: 483.4 (SE +/- 1.63, N = 3)
3: 481.8 (SE +/- 0.71, N = 3)

AOM AV1 3.0 | Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K | Frames Per Second, More Is Better
1: 3.82 (SE +/- 0.00, N = 3)
2: 3.85 (SE +/- 0.01, N = 3)
3: 3.84 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1 3.0 | Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p | Frames Per Second, More Is Better
1: 3.74 (SE +/- 0.01, N = 3)
2: 3.73 (SE +/- 0.00, N = 3)
3: 3.75 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenSCAD | Render: Projector Mount Swivel | Seconds, Fewer Is Better
1: 170.71 (SE +/- 0.17, N = 3)
2: 169.99 (SE +/- 1.17, N = 3)
3: 172.29 (SE +/- 0.80, N = 3)
OpenSCAD version 2019.01-RC2

Timed Erlang/OTP Compilation 23.2 | Time To Compile | Seconds, Fewer Is Better
1: 170.80 (SE +/- 0.10, N = 3)
2: 171.21 (SE +/- 0.12, N = 3)
3: 170.80 (SE +/- 0.15, N = 3)

OpenSCAD | Render: Pistol | Seconds, Fewer Is Better
1: 150.19 (SE +/- 0.29, N = 3)
2: 151.19 (SE +/- 0.45, N = 3)
3: 151.74 (SE +/- 0.85, N = 3)
OpenSCAD version 2019.01-RC2

GNU GMP GMPbench 6.2.1 | Total Time | GMPbench Score, More Is Better
1: 4923.1
2: 4917.2
3: 4929.8
(CC) gcc options: -O3 -fomit-frame-pointer -lm
Mobile Neural Network 1.1.3 | Model: inception-v3 | ms, Fewer Is Better
1: 54.19 (SE +/- 0.40, N = 3; MIN: 53.22 / MAX: 69.33)
2: 55.77 (SE +/- 1.06, N = 3; MIN: 53.52 / MAX: 70.99)
3: 54.03 (SE +/- 0.14, N = 3; MIN: 53.61 / MAX: 68.4)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.3 | Model: mobilenet-v1-1.0 | ms, Fewer Is Better
1: 4.918 (SE +/- 0.034, N = 3; MIN: 4.83 / MAX: 6.07)
2: 4.887 (SE +/- 0.055, N = 3; MIN: 4.79 / MAX: 8.37)
3: 4.904 (SE +/- 0.009, N = 3; MIN: 4.85 / MAX: 19.45)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.3 | Model: MobileNetV2_224 | ms, Fewer Is Better
1: 5.398 (SE +/- 0.047, N = 3; MIN: 5.27 / MAX: 11.32)
2: 5.415 (SE +/- 0.077, N = 3; MIN: 5.3 / MAX: 24.9)
3: 5.387 (SE +/- 0.021, N = 3; MIN: 5.31 / MAX: 7.46)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.3 | Model: resnet-v2-50 | ms, Fewer Is Better
1: 45.86 (SE +/- 0.14, N = 3; MIN: 45.28 / MAX: 65.36)
2: 46.23 (SE +/- 0.48, N = 3; MIN: 45.1 / MAX: 60.21)
3: 45.69 (SE +/- 0.13, N = 3; MIN: 45.03 / MAX: 60.75)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network 1.1.3 | Model: SqueezeNetV1.0 | ms, Fewer Is Better
1: 10.15 (SE +/- 0.05, N = 3; MIN: 9.96 / MAX: 15.18)
2: 10.22 (SE +/- 0.14, N = 3; MIN: 9.92 / MAX: 25.01)
3: 10.07 (SE +/- 0.03, N = 3; MIN: 9.92 / MAX: 12.59)
(CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Timed Linux Kernel Compilation 5.10.20 | Time To Compile | Seconds, Fewer Is Better
1: 112.37 (SE +/- 1.03, N = 3)
2: 112.39 (SE +/- 1.50, N = 3)
3: 111.85 (SE +/- 0.94, N = 3)

SVT-HEVC 1.5.0 | Tuning: 1 - Input: Bosphorus 1080p | Frames Per Second, More Is Better
1: 5.46 (SE +/- 0.01, N = 3)
2: 5.46 (SE +/- 0.02, N = 3)
3: 5.47 (SE +/- 0.02, N = 3)
(CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

oneDNN 2.1.2 | Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU | ms, Fewer Is Better
1: 8187.48 (SE +/- 3.41, N = 3; MIN: 8175.57)
2: 8294.96 (SE +/- 80.20, N = 3; MIN: 8168.26)
3: 8149.80 (SE +/- 16.54, N = 3; MIN: 8112.13)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 2.1.2 | Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU | ms, Fewer Is Better
1: 8211.26 (SE +/- 15.18, N = 3; MIN: 8168.07)
2: 8166.93 (SE +/- 8.62, N = 3; MIN: 8149.6)
3: 8147.02 (SE +/- 18.94, N = 3; MIN: 8105.5)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 2.1.2 | Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU | ms, Fewer Is Better
1: 8186.05 (SE +/- 18.76, N = 3; MIN: 8154.67)
2: 8197.44 (SE +/- 18.96, N = 3; MIN: 8159.65)
3: 8172.53 (SE +/- 14.12, N = 3; MIN: 8141.85)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1 3.0 | Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p | Frames Per Second, More Is Better
1: 0.22 (SE +/- 0.00, N = 3)
2: 0.22 (SE +/- 0.00, N = 3)
3: 0.22 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Sysbench 1.0.20 | Test: CPU | Events Per Second, More Is Better
1: 14106.47 (SE +/- 0.53, N = 3)
2: 14107.97 (SE +/- 0.20, N = 3)
3: 14154.66 (SE +/- 6.91, N = 3)
(CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

oneDNN 2.1.2 | Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU | ms, Fewer Is Better
1: 4366.85 (SE +/- 5.40, N = 3; MIN: 4351.4)
2: 4351.79 (SE +/- 6.88, N = 3; MIN: 4329.09)
3: 4347.64 (SE +/- 2.52, N = 3; MIN: 4335.4)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 2.1.2 | Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU | ms, Fewer Is Better
1: 4343.32 (SE +/- 14.69, N = 3; MIN: 4304.78)
2: 4370.98 (SE +/- 13.60, N = 3; MIN: 4346.29)
3: 4346.16 (SE +/- 7.09, N = 3; MIN: 4326.61)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 2.1.2 | Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU | ms, Fewer Is Better
1: 4355.03 (SE +/- 6.89, N = 3; MIN: 4334.34)
2: 4340.82 (SE +/- 13.08, N = 3; MIN: 4309.94)
3: 4341.46 (SE +/- 12.70, N = 3; MIN: 4314.23)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Basis Universal 1.13 | Settings: UASTC Level 3 | Seconds, Fewer Is Better
1: 77.20 (SE +/- 0.04, N = 3)
2: 77.18 (SE +/- 0.06, N = 3)
3: 77.12 (SE +/- 0.07, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

simdjson 0.8.2 | Throughput Test: PartialTweets | GB/s, More Is Better
1: 2.84 (SE +/- 0.00, N = 3)
2: 2.84 (SE +/- 0.00, N = 3)
3: 2.84 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

simdjson 0.8.2 | Throughput Test: DistinctUserID | GB/s, More Is Better
1: 3.08 (SE +/- 0.00, N = 3)
2: 3.08 (SE +/- 0.00, N = 3)
3: 3.08 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

AOM AV1 3.0 | Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K | Frames Per Second, More Is Better
1: 8.61 (SE +/- 0.01, N = 3)
2: 8.60 (SE +/- 0.03, N = 3)
3: 8.63 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1 3.0 | Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p | Frames Per Second, More Is Better
1: 11.14 (SE +/- 0.02, N = 3)
2: 11.16 (SE +/- 0.02, N = 3)
3: 11.16 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Stockfish 13 | Total Time | Nodes Per Second, More Is Better
1: 18133486 (SE +/- 262210.51, N = 4)
2: 18333344 (SE +/- 135386.39, N = 3)
3: 18110615 (SE +/- 240236.23, N = 5)
(CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

OpenSCAD | Render: Mini-ITX Case | Seconds, Fewer Is Better
1: 65.39 (SE +/- 0.15, N = 3)
2: 65.86 (SE +/- 0.24, N = 3)
3: 65.65 (SE +/- 0.24, N = 3)
OpenSCAD version 2019.01-RC2

simdjson 0.8.2 | Throughput Test: Kostya | GB/s, More Is Better
1: 2.08 (SE +/- 0.01, N = 3)
2: 2.08 (SE +/- 0.01, N = 3)
3: 2.09 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

srsLTE 20.10.1 | Test: OFDM_Test | Samples / Second, More Is Better
1: 77866667 (SE +/- 371184.29, N = 3)
2: 78266667 (SE +/- 88191.71, N = 3)
3: 78100000 (SE +/- 305505.05, N = 3)
(CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lm -lfftw3f -lmbedcrypto

AOM AV1 3.0 | Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K | Frames Per Second, More Is Better
1: 30.40 (SE +/- 0.32, N = 8)
2: 30.39 (SE +/- 0.30, N = 9)
3: 30.40 (SE +/- 0.35, N = 6)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson 0.8.2 | Throughput Test: LargeRandom | GB/s, More Is Better
1: 0.87 (SE +/- 0.00, N = 3)
2: 0.87 (SE +/- 0.00, N = 3)
3: 0.87 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

srsLTE 20.10.1 | Test: PHY_DL_Test | UE Mb/s, More Is Better
1: 82.0 (SE +/- 0.20, N = 3)
2: 81.4 (SE +/- 0.60, N = 3)
3: 82.3 (SE +/- 0.13, N = 3)
(CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lm -lfftw3f -lmbedcrypto

srsLTE 20.10.1 | Test: PHY_DL_Test | eNb Mb/s, More Is Better
1: 216.0 (SE +/- 0.39, N = 3)
2: 214.9 (SE +/- 0.17, N = 3)
3: 216.0 (SE +/- 0.46, N = 3)
(CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lm -lfftw3f -lmbedcrypto

AOM AV1 3.0 | Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p | Frames Per Second, More Is Better
1: 64.75 (SE +/- 1.45, N = 15)
2: 64.84 (SE +/- 1.43, N = 15)
3: 62.67 (SE +/- 1.64, N = 12)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Xcompact3d Incompact3d 2021-03-11 | Input: input.i3d 129 Cells Per Direction | Seconds, Fewer Is Better
1: 44.39 (SE +/- 0.60, N = 3)
2: 44.93 (SE +/- 0.46, N = 3)
3: 44.12 (SE +/- 0.07, N = 3)
(F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

AOM AV1 3.0 | Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p | Frames Per Second, More Is Better
1: 54.48 (SE +/- 0.92, N = 15)
2: 54.90 (SE +/- 0.83, N = 15)
3: 55.96 (SE +/- 0.50, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Basis Universal 1.13 | Settings: UASTC Level 2 | Seconds, Fewer Is Better
1: 40.53 (SE +/- 0.08, N = 3)
2: 40.50 (SE +/- 0.07, N = 3)
3: 40.45 (SE +/- 0.11, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

AOM AV1 3.0 | Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p | Frames Per Second, More Is Better
1: 15.17 (SE +/- 0.04, N = 3)
2: 15.07 (SE +/- 0.04, N = 3)
3: 14.99 (SE +/- 0.12, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Botan 2.17.3 | Test: AES-256 - Decrypt | MiB/s, More Is Better
1: 5483.85 (SE +/- 4.48, N = 3)
2: 5461.96 (SE +/- 16.12, N = 3)
3: 5493.61 (SE +/- 4.78, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: AES-256 | MiB/s, More Is Better
1: 5427.38 (SE +/- 1.56, N = 3)
2: 5406.60 (SE +/- 10.44, N = 3)
3: 5426.08 (SE +/- 1.41, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Basis Universal 1.13 | Settings: ETC1S | Seconds, Fewer Is Better
1: 31.19 (SE +/- 0.10, N = 3)
2: 31.13 (SE +/- 0.02, N = 3)
3: 31.11 (SE +/- 0.04, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Botan 2.17.3 | Test: Blowfish - Decrypt | MiB/s, More Is Better
1: 431.59 (SE +/- 0.10, N = 3)
2: 431.25 (SE +/- 0.09, N = 3)
3: 431.38 (SE +/- 0.09, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: Blowfish | MiB/s, More Is Better
1: 431.39 (SE +/- 0.04, N = 3)
2: 431.35 (SE +/- 0.16, N = 3)
3: 431.52 (SE +/- 0.14, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: ChaCha20Poly1305 - Decrypt | MiB/s, More Is Better
1: 482.85 (SE +/- 0.25, N = 3)
2: 481.67 (SE +/- 0.72, N = 3)
3: 482.57 (SE +/- 0.25, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: ChaCha20Poly1305 | MiB/s, More Is Better
1: 484.19 (SE +/- 0.30, N = 3)
2: 482.25 (SE +/- 1.00, N = 3)
3: 484.23 (SE +/- 0.20, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: Twofish - Decrypt | MiB/s, More Is Better
1: 352.19 (SE +/- 0.21, N = 3)
2: 351.99 (SE +/- 0.01, N = 3)
3: 352.49 (SE +/- 0.14, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: Twofish | MiB/s, More Is Better
1: 352.93 (SE +/- 0.24, N = 3)
2: 353.09 (SE +/- 0.07, N = 3)
3: 353.07 (SE +/- 0.21, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: CAST-256 - Decrypt | MiB/s, More Is Better
1: 141.16 (SE +/- 0.06, N = 3)
2: 141.19 (SE +/- 0.02, N = 3)
3: 141.13 (SE +/- 0.04, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: CAST-256 | MiB/s, More Is Better
1: 141.15 (SE +/- 0.02, N = 3)
2: 141.21 (SE +/- 0.04, N = 3)
3: 141.27 (SE +/- 0.09, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: KASUMI - Decrypt | MiB/s, More Is Better
1: 87.84 (SE +/- 0.01, N = 3)
2: 87.86 (SE +/- 0.01, N = 3)
3: 87.86 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3 | Test: KASUMI | MiB/s, More Is Better
1: 91.55 (SE +/- 0.03, N = 3)
2: 91.56 (SE +/- 0.01, N = 3)
3: 91.55 (SE +/- 0.02, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
AOM AV1 3.0 | Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K | Frames Per Second, More Is Better
1: 24.32 (SE +/- 0.35, N = 4)
2: 24.58 (SE +/- 0.04, N = 3)
3: 24.59 (SE +/- 0.11, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenSCAD | Render: Retro Car | Seconds, Fewer Is Better
1: 27.14 (SE +/- 0.04, N = 3)
2: 27.12 (SE +/- 0.02, N = 3)
3: 27.30 (SE +/- 0.01, N = 3)
OpenSCAD version 2019.01-RC2

ViennaCL 1.7.1 | Test: CPU BLAS - dCOPY | GB/s, More Is Better
1: 18.4
2: 18.5
3: 18.4
Only one SE entry was exported for this test (SE +/- 0.00, N = 2); the export does not say which run it belongs to.
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - dGEMM-TT | GFLOPs/s, More Is Better
1: 34.2 (SE +/- 0.06, N = 3)
2: 34.3 (SE +/- 0.12, N = 3)
3: 34.4 (SE +/- 0.19, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - dGEMM-TN | GFLOPs/s, More Is Better
1: 34.7 (SE +/- 0.06, N = 3)
2: 34.5 (SE +/- 0.07, N = 3)
3: 34.5 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - dGEMM-NT | GFLOPs/s, More Is Better
1: 32.7 (SE +/- 0.36, N = 3)
2: 32.7 (SE +/- 0.35, N = 3)
3: 32.5 (SE +/- 0.42, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - dGEMM-NN | GFLOPs/s, More Is Better
1: 33.3 (SE +/- 0.29, N = 3)
2: 33.5 (SE +/- 0.40, N = 3)
3: 33.3 (SE +/- 0.40, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - dGEMV-T | GB/s, More Is Better
1: 43.2 (SE +/- 0.00, N = 3)
2: 43.2 (SE +/- 0.00, N = 3)
3: 43.1 (SE +/- 0.07, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - dGEMV-N | GB/s, More Is Better
1: 41.4 (SE +/- 0.15, N = 3)
2: 41.4 (SE +/- 0.18, N = 3)
3: 41.3 (SE +/- 0.18, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - dDOT | GB/s, More Is Better
1: 39.0 (SE +/- 0.07, N = 3)
2: 39.2 (SE +/- 0.07, N = 3)
3: 39.2 (SE +/- 0.10, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - dAXPY | GB/s, More Is Better
1: 27.8 (SE +/- 0.05, N = 2)
2: 27.7 (SE +/- 0.03, N = 3)
3: 27.7 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - sDOT | GB/s, More Is Better
1: 39.1 (SE +/- 0.07, N = 3)
2: 39.0 (SE +/- 0.09, N = 3)
3: 39.0 (SE +/- 0.09, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - sAXPY | GB/s, More Is Better
1: 27.6 (SE +/- 0.03, N = 3)
2: 27.5 (SE +/- 0.03, N = 3)
3: 27.5 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL 1.7.1 | Test: CPU BLAS - sCOPY | GB/s, More Is Better
1: 18.5 (SE +/- 0.00, N = 3)
2: 18.4 (SE +/- 0.06, N = 3)
3: 18.5 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenSCAD | Render: Leonardo Phone Case Slim | Seconds, Fewer Is Better
1: 25.40 (SE +/- 0.08, N = 3)
2: 25.39 (SE +/- 0.09, N = 3)
3: 25.43 (SE +/- 0.07, N = 3)
OpenSCAD version 2019.01-RC2
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). 1: 10.86 (SE +/- 0.02, N = 3, MIN: 10.55); 2: 10.88 (SE +/- 0.01, N = 3, MIN: 10.47); 3: 10.88 (SE +/- 0.01, N = 3, MIN: 10.54). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). 1: 8.37857 (SE +/- 0.01097, N = 3, MIN: 8.25); 2: 8.38646 (SE +/- 0.00543, N = 3, MIN: 8.25); 3: 8.36378 (SE +/- 0.01793, N = 3, MIN: 8.25). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Liquid-DSP 2021.01.31 - Threads: 2 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). 1: 111863333 (SE +/- 462793.45, N = 3); 2: 111236667 (SE +/- 287595.39, N = 3); 3: 111496667 (SE +/- 362276.75, N = 3). (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31 - Threads: 1 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). 1: 56320333 (SE +/- 221002.51, N = 3); 2: 56235333 (SE +/- 183205.11, N = 3); 3: 56134333 (SE +/- 290873.59, N = 3). (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31 - Threads: 4 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). 1: 209113333 (SE +/- 384071.75, N = 3); 2: 209460000 (SE +/- 308598.98, N = 3); 3: 209336667 (SE +/- 626905.98, N = 3). (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). 1: 438133333 (SE +/- 653715.87, N = 3); 2: 437706667 (SE +/- 698721.05, N = 3); 3: 439076667 (SE +/- 745035.42, N = 3). (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31 - Threads: 8 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). 1: 378376667 (SE +/- 54873.59, N = 3); 2: 378373333 (SE +/- 140988.57, N = 3); 3: 377836667 (SE +/- 471322.02, N = 3). (CC) gcc options: -O3 -pthread -lm -lc -lliquid
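As a rough aid for reading the five Liquid-DSP results together, the sketch below computes the speedup and per-thread efficiency of each thread count relative to the single-thread result for run 1. The throughput figures are copied from the results above; the helper name `scaling` is hypothetical and is not part of the Phoronix Test Suite or Liquid-DSP.

```python
# Run 1 Liquid-DSP throughput (samples/s), copied from the results above.
# Keys are thread counts; buffer length 256 and filter length 57 in every case.
run1 = {1: 56320333, 2: 111863333, 4: 209113333, 8: 378376667, 16: 438133333}


def scaling(results):
    """Return {threads: (speedup, efficiency)} relative to the 1-thread result."""
    base = results[1]
    return {t: (v / base, v / base / t) for t, v in sorted(results.items())}


if __name__ == "__main__":
    for threads, (speedup, eff) in scaling(run1).items():
        print(f"{threads:>2} threads: {speedup:5.2f}x speedup, {eff:6.1%} efficiency")
```

The test system is listed as 8 cores / 16 threads, so a smaller gain from 8 to 16 threads than from 4 to 8 would be consistent with the extra threads landing on SMT siblings rather than additional physical cores.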
oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). 1: 8.40798 (SE +/- 0.00658, N = 3, MIN: 8.2); 2: 8.42009 (SE +/- 0.00479, N = 3, MIN: 8.19); 3: 8.37864 (SE +/- 0.01107, N = 3, MIN: 8.09). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). 1: 8.10530 (SE +/- 0.00328, N = 3, MIN: 7.13); 2: 8.10551 (SE +/- 0.00340, N = 3, MIN: 7.12); 3: 8.11780 (SE +/- 0.01300, N = 3, MIN: 7.78). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Sysbench 1.0.20 - Test: RAM / Memory (MiB/sec, More Is Better). 1: 7979.40 (SE +/- 21.59, N = 3); 2: 8016.57 (SE +/- 7.05, N = 3); 3: 8052.05 (SE +/- 22.63, N = 3). (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). 1: 5.37265 (SE +/- 0.00225, N = 3, MIN: 5.28); 2: 5.37691 (SE +/- 0.01071, N = 3, MIN: 5.24); 3: 5.36897 (SE +/- 0.00128, N = 3, MIN: 5.28). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). 1: 4.98031 (SE +/- 0.00815, N = 3, MIN: 4.9); 2: 4.96746 (SE +/- 0.00255, N = 3, MIN: 4.89); 3: 4.97285 (SE +/- 0.00159, N = 3, MIN: 4.88). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). 1: 13.38 (SE +/- 0.01, N = 3, MIN: 12.91); 2: 13.21 (SE +/- 0.01, N = 3, MIN: 12.88); 3: 13.23 (SE +/- 0.01, N = 3, MIN: 12.9). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Basis Universal 1.13 - Settings: UASTC Level 0 (Seconds, Fewer Is Better). 1: 9.246 (SE +/- 0.007, N = 3); 2: 9.219 (SE +/- 0.015, N = 3); 3: 9.187 (SE +/- 0.005, N = 3). (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). 1: 2.79376 (SE +/- 0.00339, N = 3, MIN: 2.57); 2: 2.79736 (SE +/- 0.00485, N = 3, MIN: 2.59); 3: 2.79570 (SE +/- 0.00436, N = 3, MIN: 2.6). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). 1: 72.79 (SE +/- 0.11, N = 3); 2: 72.89 (SE +/- 0.07, N = 3); 3: 73.16 (SE +/- 0.21, N = 3). (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 0.3 - Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better). 1: 77.66 (SE +/- 0.23, N = 3); 2: 78.16 (SE +/- 0.09, N = 3); 3: 78.29 (SE +/- 0.18, N = 3). (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better). 1: 93.63 (SE +/- 1.49, N = 3); 2: 93.71 (SE +/- 1.35, N = 3); 3: 93.16 (SE +/- 1.39, N = 3). (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better). 1: 98.92 (SE +/- 0.14, N = 3); 2: 99.05 (SE +/- 0.19, N = 3); 3: 99.03 (SE +/- 0.39, N = 3). (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). 1: 24.40 (SE +/- 0.03, N = 3, MIN: 23.71); 2: 24.34 (SE +/- 0.02, N = 3, MIN: 23.78); 3: 24.38 (SE +/- 0.02, N = 3, MIN: 23.61). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). 1: 23.79 (SE +/- 0.01, N = 3, MIN: 23.24); 2: 23.80 (SE +/- 0.01, N = 3, MIN: 23.38); 3: 23.79 (SE +/- 0.01, N = 3, MIN: 23.46). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). 1: 149.71 (SE +/- 0.38, N = 3); 2: 150.09 (SE +/- 0.36, N = 3); 3: 149.66 (SE +/- 0.38, N = 3). (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). 1: 14.55 (SE +/- 0.01, N = 3, MIN: 14.15); 2: 14.59 (SE +/- 0.02, N = 3, MIN: 14.16); 3: 14.61 (SE +/- 0.04, N = 3, MIN: 13.97). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). 1: 11.16 (SE +/- 0.01, N = 3, MIN: 11.09); 2: 11.17 (SE +/- 0.03, N = 3, MIN: 11.08); 3: 11.13 (SE +/- 0.02, N = 3, MIN: 11.05). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Systemd Total Boot Time - Test: Total (ms, Fewer Is Better). 1: 26056; 2: 26056; 3: 26056
Systemd Total Boot Time - Test: Firmware (ms, Fewer Is Better). 1: 11240; 2: 11240; 3: 11240
Systemd Total Boot Time - Test: Loader (ms, Fewer Is Better). 1: 5212; 2: 5212; 3: 5212
Systemd Total Boot Time - Test: Kernel (ms, Fewer Is Better). 1: 2761; 2: 2761; 3: 2761
Systemd Total Boot Time - Test: Userspace (ms, Fewer Is Better). 1: 23295; 2: 23295; 3: 23295
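A quick arithmetic check on the boot-time figures: in these results the reported Total equals Kernel plus Userspace, and the Firmware and Loader stages are not included in it. The sketch below only verifies that from the numbers above; the variable names are illustrative, and whether the test profile always defines Total this way is an assumption this export does not confirm.

```python
# Systemd boot-time stages in ms, copied from the results above
# (the values are identical for runs 1, 2 and 3).
stages = {"Firmware": 11240, "Loader": 5212, "Kernel": 2761, "Userspace": 23295}
reported_total = 26056

kernel_plus_userspace = stages["Kernel"] + stages["Userspace"]
all_stages = sum(stages.values())

print("Kernel + Userspace:", kernel_plus_userspace)
print("All four stages:", all_stages)
print("Kernel + Userspace matches reported Total:", kernel_plus_userspace == reported_total)
```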
Phoronix Test Suite v10.8.4