2700 march
AMD Ryzen 7 2700 Eight-Core testing with a Gigabyte AB350N-Gaming WIFI-CF (F20 BIOS) and HIS AMD Radeon HD 6450/7450/8450 / R5 230 OEM 1GB on Ubuntu 19.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103315-IB-2700MARCH64&sro .
2700 march: system details (runs 1, 2, 3)
Processor: AMD Ryzen 7 2700 Eight-Core @ 3.20GHz (8 Cores / 16 Threads)
Motherboard: Gigabyte AB350N-Gaming WIFI-CF (F20 BIOS)
Chipset: AMD 17h
Memory: 16GB
Disk: 120GB ADATA SU700
Graphics: HIS AMD Radeon HD 6450/7450/8450 / R5 230 OEM 1GB
Audio: AMD Caicos HDMI Audio
Monitor: DELL S2409W
Network: Realtek RTL8111/8168/8411 + Intel 3165
OS: Ubuntu 19.10
Kernel: 5.9.0-050900rc7daily20201004-generic (x86_64) 20201003
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
OpenGL: 3.3 Mesa 19.2.8 (LLVM 9.0.0)
Compiler: GCC 9.2.1 20191008
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800820b
Python Details - Python 2.7.17 + Python 3.7.5
Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
2700 march: result overview
Test | 1 | 2 | 3
incompact3d: input.i3d 129 Cells Per Direction | 44.3870951 | 44.9321429 | 44.1209208
incompact3d: input.i3d 193 Cells Per Direction | 157.428889 | 160.035952 | 157.363368
simdjson: Kostya | 2.08 | 2.08 | 2.09
simdjson: LargeRand | 0.87 | 0.87 | 0.87
simdjson: PartialTweets | 2.84 | 2.84 | 2.84
simdjson: DistinctUserID | 3.08 | 3.08 | 3.08
gmpbench: Total Time | 4923.1 | 4917.2 | 4929.8
srslte: OFDM_Test | 77866667 | 78266667 | 78100000
srslte: PHY_DL_Test (eNb Mb/s) | 216.0 | 214.9 | 216.0
srslte: PHY_DL_Test (UE Mb/s) | 82.0 | 81.4 | 82.3
botan: KASUMI | 91.552 | 91.562 | 91.553
botan: KASUMI - Decrypt | 87.844 | 87.864 | 87.857
botan: AES-256 | 5427.381 | 5406.600 | 5426.079
botan: AES-256 - Decrypt | 5483.849 | 5461.957 | 5493.612
botan: Twofish | 352.929 | 353.089 | 353.065
botan: Twofish - Decrypt | 352.194 | 351.992 | 352.489
botan: Blowfish | 431.385 | 431.348 | 431.517
botan: Blowfish - Decrypt | 431.594 | 431.245 | 431.376
botan: CAST-256 | 141.152 | 141.211 | 141.271
botan: CAST-256 - Decrypt | 141.164 | 141.194 | 141.131
botan: ChaCha20Poly1305 | 484.192 | 482.250 | 484.226
botan: ChaCha20Poly1305 - Decrypt | 482.849 | 481.666 | 482.571
luaradio: Five Back to Back FIR Filters | 480.3 | 483.4 | 481.8
luaradio: FM Deemphasis Filter | 387.9 | 386.8 | 389.3
luaradio: Hilbert Transform | 92.1 | 92.8 | 92.0
luaradio: Complex Phase | 551.7 | 553.6 | 552.3
aom-av1: Speed 0 Two-Pass - Bosphorus 4K | 0.07 | 0.07 | 0.07
aom-av1: Speed 4 Two-Pass - Bosphorus 4K | 2.00 | 2.01 | 2.01
aom-av1: Speed 6 Realtime - Bosphorus 4K | 8.61 | 8.60 | 8.63
aom-av1: Speed 6 Two-Pass - Bosphorus 4K | 3.82 | 3.85 | 3.84
aom-av1: Speed 8 Realtime - Bosphorus 4K | 24.32 | 24.58 | 24.59
aom-av1: Speed 9 Realtime - Bosphorus 4K | 30.40 | 30.39 | 30.40
aom-av1: Speed 0 Two-Pass - Bosphorus 1080p | 0.22 | 0.22 | 0.22
aom-av1: Speed 4 Two-Pass - Bosphorus 1080p | 3.74 | 3.73 | 3.75
aom-av1: Speed 6 Realtime - Bosphorus 1080p | 15.17 | 15.07 | 14.99
aom-av1: Speed 6 Two-Pass - Bosphorus 1080p | 11.14 | 11.16 | 11.16
aom-av1: Speed 8 Realtime - Bosphorus 1080p | 54.48 | 54.90 | 55.96
aom-av1: Speed 9 Realtime - Bosphorus 1080p | 64.75 | 64.84 | 62.67
svt-hevc: 1 - Bosphorus 1080p | 5.46 | 5.46 | 5.47
svt-hevc: 7 - Bosphorus 1080p | 72.79 | 72.89 | 73.16
svt-hevc: 10 - Bosphorus 1080p | 149.71 | 150.09 | 149.66
svt-vp9: VMAF Optimized - Bosphorus 1080p | 93.63 | 93.71 | 93.16
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 98.92 | 99.05 | 99.03
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 77.66 | 78.16 | 78.29
stockfish: Total Time | 18133486 | 18333344 | 18110615
build-linux-kernel: Time To Compile | 112.365 | 112.390 | 111.852
build-nodejs: Time To Compile | 614.671 | 615.075 | 613.065
onednn: IP Shapes 1D - f32 - CPU | 8.40798 | 8.42009 | 8.37864
onednn: IP Shapes 3D - f32 - CPU | 13.3764 | 13.2109 | 13.2285
onednn: IP Shapes 1D - u8s8f32 - CPU | 8.10530 | 8.10551 | 8.11780
onednn: IP Shapes 3D - u8s8f32 - CPU | 2.79376 | 2.79736 | 2.79570
onednn: Convolution Batch Shapes Auto - f32 - CPU | 23.7916 | 23.7977 | 23.7932
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 10.8615 | 10.8822 | 10.8773
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 14.5525 | 14.5886 | 14.6134
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 24.4047 | 24.3446 | 24.3779
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 8.37857 | 8.38646 | 8.36378
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 11.1635 | 11.1749 | 11.1327
onednn: Recurrent Neural Network Training - f32 - CPU | 8187.48 | 8294.96 | 8149.80
onednn: Recurrent Neural Network Inference - f32 - CPU | 4355.03 | 4340.82 | 4341.46
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 8186.05 | 8197.44 | 8172.53
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 4343.32 | 4370.98 | 4346.16
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 5.37265 | 5.37691 | 5.36897
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 8211.26 | 8166.93 | 8147.02
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 4366.85 | 4351.79 | 4347.64
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 4.98031 | 4.96746 | 4.97285
build-erlang: Time To Compile | 170.803 | 171.209 | 170.802
liquid-dsp: 1 - 256 - 57 | 56320333 | 56235333 | 56134333
liquid-dsp: 2 - 256 - 57 | 111863333 | 111236667 | 111496667
liquid-dsp: 4 - 256 - 57 | 209113333 | 209460000 | 209336667
liquid-dsp: 8 - 256 - 57 | 378376667 | 378373333 | 377836667
liquid-dsp: 16 - 256 - 57 | 438133333 | 437706667 | 439076667
viennacl: CPU BLAS - sCOPY | 18.5 | 18.4 | 18.5
viennacl: CPU BLAS - sAXPY | 27.6 | 27.5 | 27.5
viennacl: CPU BLAS - sDOT | 39.1 | 39.0 | 39.0
viennacl: CPU BLAS - dAXPY | 27.8 | 27.7 | 27.7
viennacl: CPU BLAS - dDOT | 39.0 | 39.2 | 39.2
viennacl: CPU BLAS - dGEMV-N | 41.4 | 41.4 | 41.3
viennacl: CPU BLAS - dGEMV-T | 43.2 | 43.2 | 43.1
viennacl: CPU BLAS - dGEMM-NN | 33.3 | 33.5 | 33.3
viennacl: CPU BLAS - dGEMM-NT | 32.7 | 32.7 | 32.5
viennacl: CPU BLAS - dGEMM-TN | 34.7 | 34.5 | 34.5
viennacl: CPU BLAS - dGEMM-TT | 34.2 | 34.3 | 34.4
viennacl: CPU BLAS - dCOPY | 18.4 | 18.5 | 18.4
basis: ETC1S | 31.194 | 31.134 | 31.113
basis: UASTC Level 0 | 9.246 | 9.219 | 9.187
basis: UASTC Level 2 | 40.534 | 40.502 | 40.450
basis: UASTC Level 3 | 77.200 | 77.178 | 77.122
openscad: Pistol | 150.193 | 151.185 | 151.735
openscad: Retro Car | 27.135 | 27.118 | 27.296
openscad: Mini-ITX Case | 65.386 | 65.864 | 65.652
openscad: Projector Mount Swivel | 170.706 | 169.994 | 172.293
openscad: Leonardo Phone Case Slim | 25.401 | 25.388 | 25.425
mnn: SqueezeNetV1.0 | 10.149 | 10.215 | 10.066
mnn: resnet-v2-50 | 45.858 | 46.225 | 45.685
mnn: MobileNetV2_224 | 5.398 | 5.415 | 5.387
mnn: mobilenet-v1-1.0 | 4.918 | 4.887 | 4.904
mnn: inception-v3 | 54.185 | 55.770 | 54.033
sysbench: RAM / Memory | 7979.40 | 8016.57 | 8052.05
sysbench: CPU | 14106.47 | 14107.97 | 14154.66
systemd-boot-total: Total | 26056 | 26056 | 26056
systemd-boot-total: Kernel | 2761 | 2761 | 2761
systemd-boot-total: Loader | 5212 | 5212 | 5212
systemd-boot-total: Firmware | 11240 | 11240 | 11240
systemd-boot-total: Userspace | 23295 | 23295 | 23295
Xcompact3d Incompact3d 2021-03-11
Seconds, Fewer Is Better
Input | 1 | 2 | 3
input.i3d 129 Cells Per Direction | 44.39 (SE +/- 0.60, N = 3) | 44.93 (SE +/- 0.46, N = 3) | 44.12 (SE +/- 0.07, N = 3)
input.i3d 193 Cells Per Direction | 157.43 (SE +/- 0.13, N = 3) | 160.04 (SE +/- 1.44, N = 12) | 157.36 (SE +/- 0.10, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
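The SE and N figures reported with each result give a rough way to judge whether two runs really differ. The sketch below is an illustration, not part of the Phoronix Test Suite output: it takes the means and standard errors of the 193-cell Xcompact3d result and expresses the gap between two runs in units of their combined standard error. The helper name `gap_in_standard_errors` is made up for this example. Treating a ratio below about 2 as "within noise" is a common rule of thumb assumed here, not a formal significance test.

```python
import math

# Means (seconds) and standard errors copied from the
# "input.i3d 193 Cells Per Direction" result for runs 1, 2 and 3.
runs = {
    "1": (157.43, 0.13),
    "2": (160.04, 1.44),
    "3": (157.36, 0.10),
}


def gap_in_standard_errors(a, b):
    """Return |mean_a - mean_b| divided by the combined standard error.

    Hypothetical helper for illustration; assumes the two runs are independent.
    """
    (mean_a, se_a), (mean_b, se_b) = a, b
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


ratio_1_2 = gap_in_standard_errors(runs["1"], runs["2"])
ratio_1_3 = gap_in_standard_errors(runs["1"], runs["3"])

print(f"run 1 vs run 2: {ratio_1_2:.2f} combined standard errors apart")
print(f"run 1 vs run 3: {ratio_1_3:.2f} combined standard errors apart")
```

Both ratios come out below 2, so under this rule of thumb the slower mean of run 2 (which also needed N = 12) is not clearly separated from runs 1 and 3.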
simdjson 0.8.2
GB/s, More Is Better
Throughput Test | 1 | 2 | 3
Kostya | 2.08 (SE +/- 0.01, N = 3) | 2.08 (SE +/- 0.01, N = 3) | 2.09 (SE +/- 0.00, N = 3)
LargeRandom | 0.87 (SE +/- 0.00, N = 3) | 0.87 (SE +/- 0.00, N = 3) | 0.87 (SE +/- 0.00, N = 3)
PartialTweets | 2.84 (SE +/- 0.00, N = 3) | 2.84 (SE +/- 0.00, N = 3) | 2.84 (SE +/- 0.00, N = 3)
DistinctUserID | 3.08 (SE +/- 0.00, N = 3) | 3.08 (SE +/- 0.00, N = 3) | 3.08 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread
GNU GMP GMPbench 6.2.1
GMPbench Score, More Is Better
Test | 1 | 2 | 3
Total Time | 4923.1 | 4917.2 | 4929.8
1. (CC) gcc options: -O3 -fomit-frame-pointer -lm
srsLTE 20.10.1
More Is Better
Test (unit) | 1 | 2 | 3
OFDM_Test (Samples / Second) | 77866667 (SE +/- 371184.29, N = 3) | 78266667 (SE +/- 88191.71, N = 3) | 78100000 (SE +/- 305505.05, N = 3)
PHY_DL_Test (eNb Mb/s) | 216.0 (SE +/- 0.39, N = 3) | 214.9 (SE +/- 0.17, N = 3) | 216.0 (SE +/- 0.46, N = 3)
PHY_DL_Test (UE Mb/s) | 82.0 (SE +/- 0.20, N = 3) | 81.4 (SE +/- 0.60, N = 3) | 82.3 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lm -lfftw3f -lmbedcrypto
Botan 2.17.3
MiB/s, More Is Better
Test | 1 | 2 | 3
KASUMI | 91.55 (SE +/- 0.03, N = 3) | 91.56 (SE +/- 0.01, N = 3) | 91.55 (SE +/- 0.02, N = 3)
KASUMI - Decrypt | 87.84 (SE +/- 0.01, N = 3) | 87.86 (SE +/- 0.01, N = 3) | 87.86 (SE +/- 0.00, N = 3)
AES-256 | 5427.38 (SE +/- 1.56, N = 3) | 5406.60 (SE +/- 10.44, N = 3) | 5426.08 (SE +/- 1.41, N = 3)
AES-256 - Decrypt | 5483.85 (SE +/- 4.48, N = 3) | 5461.96 (SE +/- 16.12, N = 3) | 5493.61 (SE +/- 4.78, N = 3)
Twofish | 352.93 (SE +/- 0.24, N = 3) | 353.09 (SE +/- 0.07, N = 3) | 353.07 (SE +/- 0.21, N = 3)
Twofish - Decrypt | 352.19 (SE +/- 0.21, N = 3) | 351.99 (SE +/- 0.01, N = 3) | 352.49 (SE +/- 0.14, N = 3)
Blowfish | 431.39 (SE +/- 0.04, N = 3) | 431.35 (SE +/- 0.16, N = 3) | 431.52 (SE +/- 0.14, N = 3)
Blowfish - Decrypt | 431.59 (SE +/- 0.10, N = 3) | 431.25 (SE +/- 0.09, N = 3) | 431.38 (SE +/- 0.09, N = 3)
CAST-256 | 141.15 (SE +/- 0.02, N = 3) | 141.21 (SE +/- 0.04, N = 3) | 141.27 (SE +/- 0.09, N = 3)
CAST-256 - Decrypt | 141.16 (SE +/- 0.06, N = 3) | 141.19 (SE +/- 0.02, N = 3) | 141.13 (SE +/- 0.04, N = 3)
ChaCha20Poly1305 | 484.19 (SE +/- 0.30, N = 3) | 482.25 (SE +/- 1.00, N = 3) | 484.23 (SE +/- 0.20, N = 3)
ChaCha20Poly1305 - Decrypt | 482.85 (SE +/- 0.25, N = 3) | 481.67 (SE +/- 0.72, N = 3) | 482.57 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
LuaRadio 0.9.1
MiB/s, More Is Better
Test | 1 | 2 | 3
Five Back to Back FIR Filters | 480.3 (SE +/- 2.34, N = 3) | 483.4 (SE +/- 1.63, N = 3) | 481.8 (SE +/- 0.71, N = 3)
FM Deemphasis Filter | 387.9 (SE +/- 0.55, N = 3) | 386.8 (SE +/- 1.30, N = 3) | 389.3 (SE +/- 1.13, N = 3)
Hilbert Transform | 92.1 (SE +/- 0.23, N = 3) | 92.8 (SE +/- 0.23, N = 3) | 92.0 (SE +/- 0.23, N = 3)
Complex Phase | 551.7 (SE +/- 1.45, N = 3) | 553.6 (SE +/- 1.66, N = 3) | 552.3 (SE +/- 1.02, N = 3)
AOM AV1 3.0
Frames Per Second, More Is Better
Encoder Mode - Input | 1 | 2 | 3
Speed 0 Two-Pass - Bosphorus 4K | 0.07 (SE +/- 0.00, N = 3) | 0.07 (SE +/- 0.00, N = 3) | 0.07 (SE +/- 0.00, N = 3)
Speed 4 Two-Pass - Bosphorus 4K | 2.00 (SE +/- 0.00, N = 3) | 2.01 (SE +/- 0.00, N = 3) | 2.01 (SE +/- 0.00, N = 3)
Speed 6 Realtime - Bosphorus 4K | 8.61 (SE +/- 0.01, N = 3) | 8.60 (SE +/- 0.03, N = 3) | 8.63 (SE +/- 0.01, N = 3)
Speed 6 Two-Pass - Bosphorus 4K | 3.82 (SE +/- 0.00, N = 3) | 3.85 (SE +/- 0.01, N = 3) | 3.84 (SE +/- 0.01, N = 3)
Speed 8 Realtime - Bosphorus 4K | 24.32 (SE +/- 0.35, N = 4) | 24.58 (SE +/- 0.04, N = 3) | 24.59 (SE +/- 0.11, N = 3)
Speed 9 Realtime - Bosphorus 4K | 30.40 (SE +/- 0.32, N = 8) | 30.39 (SE +/- 0.30, N = 9) | 30.40 (SE +/- 0.35, N = 6)
Speed 0 Two-Pass - Bosphorus 1080p | 0.22 (SE +/- 0.00, N = 3) | 0.22 (SE +/- 0.00, N = 3) | 0.22 (SE +/- 0.00, N = 3)
Speed 4 Two-Pass - Bosphorus 1080p | 3.74 (SE +/- 0.01, N = 3) | 3.73 (SE +/- 0.00, N = 3) | 3.75 (SE +/- 0.01, N = 3)
Speed 6 Realtime - Bosphorus 1080p | 15.17 (SE +/- 0.04, N = 3) | 15.07 (SE +/- 0.04, N = 3) | 14.99 (SE +/- 0.12, N = 3)
Speed 6 Two-Pass - Bosphorus 1080p | 11.14 (SE +/- 0.02, N = 3) | 11.16 (SE +/- 0.02, N = 3) | 11.16 (SE +/- 0.04, N = 3)
Speed 8 Realtime - Bosphorus 1080p | 54.48 (SE +/- 0.92, N = 15) | 54.90 (SE +/- 0.83, N = 15) | 55.96 (SE +/- 0.50, N = 3)
Speed 9 Realtime - Bosphorus 1080p | 64.75 (SE +/- 1.45, N = 15) | 64.84 (SE +/- 1.43, N = 15) | 62.67 (SE +/- 1.64, N = 12)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-HEVC 1.5.0
Frames Per Second, More Is Better
Tuning - Input | 1 | 2 | 3
1 - Bosphorus 1080p | 5.46 (SE +/- 0.01, N = 3) | 5.46 (SE +/- 0.02, N = 3) | 5.47 (SE +/- 0.02, N = 3)
7 - Bosphorus 1080p | 72.79 (SE +/- 0.11, N = 3) | 72.89 (SE +/- 0.07, N = 3) | 73.16 (SE +/- 0.21, N = 3)
10 - Bosphorus 1080p | 149.71 (SE +/- 0.38, N = 3) | 150.09 (SE +/- 0.36, N = 3) | 149.66 (SE +/- 0.38, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 0.3
Frames Per Second, More Is Better
Tuning - Input | 1 | 2 | 3
VMAF Optimized - Bosphorus 1080p | 93.63 (SE +/- 1.49, N = 3) | 93.71 (SE +/- 1.35, N = 3) | 93.16 (SE +/- 1.39, N = 3)
PSNR/SSIM Optimized - Bosphorus 1080p | 98.92 (SE +/- 0.14, N = 3) | 99.05 (SE +/- 0.19, N = 3) | 99.03 (SE +/- 0.39, N = 3)
Visual Quality Optimized - Bosphorus 1080p | 77.66 (SE +/- 0.23, N = 3) | 78.16 (SE +/- 0.09, N = 3) | 78.29 (SE +/- 0.18, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Stockfish 13
Nodes Per Second, More Is Better
Test | 1 | 2 | 3
Total Time | 18133486 (SE +/- 262210.51, N = 4) | 18333344 (SE +/- 135386.39, N = 3) | 18110615 (SE +/- 240236.23, N = 5)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
Timed Linux Kernel Compilation 5.10.20
Seconds, Fewer Is Better
Test | 1 | 2 | 3
Time To Compile | 112.37 (SE +/- 1.03, N = 3) | 112.39 (SE +/- 1.50, N = 3) | 111.85 (SE +/- 0.94, N = 3)
Timed Node.js Compilation 15.11
Seconds, Fewer Is Better
Test | 1 | 2 | 3
Time To Compile | 614.67 (SE +/- 1.62, N = 3) | 615.08 (SE +/- 1.55, N = 3) | 613.07 (SE +/- 1.22, N = 3)
oneDNN 2.1.2 (Engine: CPU)
ms, Fewer Is Better
Harness - Data Type | 1 | 2 | 3
IP Shapes 1D - f32 | 8.40798 (SE +/- 0.00658, N = 3, MIN: 8.2) | 8.42009 (SE +/- 0.00479, N = 3, MIN: 8.19) | 8.37864 (SE +/- 0.01107, N = 3, MIN: 8.09)
IP Shapes 3D - f32 | 13.38 (SE +/- 0.01, N = 3, MIN: 12.91) | 13.21 (SE +/- 0.01, N = 3, MIN: 12.88) | 13.23 (SE +/- 0.01, N = 3, MIN: 12.9)
IP Shapes 1D - u8s8f32 | 8.10530 (SE +/- 0.00328, N = 3, MIN: 7.13) | 8.10551 (SE +/- 0.00340, N = 3, MIN: 7.12) | 8.11780 (SE +/- 0.01300, N = 3, MIN: 7.78)
IP Shapes 3D - u8s8f32 | 2.79376 (SE +/- 0.00339, N = 3, MIN: 2.57) | 2.79736 (SE +/- 0.00485, N = 3, MIN: 2.59) | 2.79570 (SE +/- 0.00436, N = 3, MIN: 2.6)
Convolution Batch Shapes Auto - f32 | 23.79 (SE +/- 0.01, N = 3, MIN: 23.24) | 23.80 (SE +/- 0.01, N = 3, MIN: 23.38) | 23.79 (SE +/- 0.01, N = 3, MIN: 23.46)
Deconvolution Batch shapes_1d - f32 | 10.86 (SE +/- 0.02, N = 3, MIN: 10.55) | 10.88 (SE +/- 0.01, N = 3, MIN: 10.47) | 10.88 (SE +/- 0.01, N = 3, MIN: 10.54)
Deconvolution Batch shapes_3d - f32 | 14.55 (SE +/- 0.01, N = 3, MIN: 14.15) | 14.59 (SE +/- 0.02, N = 3, MIN: 14.16) | 14.61 (SE +/- 0.04, N = 3, MIN: 13.97)
Convolution Batch Shapes Auto - u8s8f32 | 24.40 (SE +/- 0.03, N = 3, MIN: 23.71) | 24.34 (SE +/- 0.02, N = 3, MIN: 23.78) | 24.38 (SE +/- 0.02, N = 3, MIN: 23.61)
Deconvolution Batch shapes_1d - u8s8f32 | 8.37857 (SE +/- 0.01097, N = 3, MIN: 8.25) | 8.38646 (SE +/- 0.00543, N = 3, MIN: 8.25) | 8.36378 (SE +/- 0.01793, N = 3, MIN: 8.25)
Deconvolution Batch shapes_3d - u8s8f32 | 11.16 (SE +/- 0.01, N = 3, MIN: 11.09) | 11.17 (SE +/- 0.03, N = 3, MIN: 11.08) | 11.13 (SE +/- 0.02, N = 3, MIN: 11.05)
Recurrent Neural Network Training - f32 | 8187.48 (SE +/- 3.41, N = 3, MIN: 8175.57) | 8294.96 (SE +/- 80.20, N = 3, MIN: 8168.26) | 8149.80 (SE +/- 16.54, N = 3, MIN: 8112.13)
Recurrent Neural Network Inference - f32 | 4355.03 (SE +/- 6.89, N = 3, MIN: 4334.34) | 4340.82 (SE +/- 13.08, N = 3, MIN: 4309.94) | 4341.46 (SE +/- 12.70, N = 3, MIN: 4314.23)
Recurrent Neural Network Training - u8s8f32 | 8186.05 (SE +/- 18.76, N = 3, MIN: 8154.67) | 8197.44 (SE +/- 18.96, N = 3, MIN: 8159.65) | 8172.53 (SE +/- 14.12, N = 3, MIN: 8141.85)
Recurrent Neural Network Inference - u8s8f32 | 4343.32 (SE +/- 14.69, N = 3, MIN: 4304.78) | 4370.98 (SE +/- 13.60, N = 3, MIN: 4346.29) | 4346.16 (SE +/- 7.09, N = 3, MIN: 4326.61)
Matrix Multiply Batch Shapes Transformer - f32 | 5.37265 (SE +/- 0.00225, N = 3, MIN: 5.28) | 5.37691 (SE +/- 0.01071, N = 3, MIN: 5.24) | 5.36897 (SE +/- 0.00128, N = 3, MIN: 5.28)
Recurrent Neural Network Training - bf16bf16bf16 | 8211.26 (SE +/- 15.18, N = 3, MIN: 8168.07) | 8166.93 (SE +/- 8.62, N = 3, MIN: 8149.6) | 8147.02 (SE +/- 18.94, N = 3, MIN: 8105.5)
Recurrent Neural Network Inference - bf16bf16bf16 | 4366.85 (SE +/- 5.40, N = 3, MIN: 4351.4) | 4351.79 (SE +/- 6.88, N = 3, MIN: 4329.09) | 4347.64 (SE +/- 2.52, N = 3, MIN: 4335.4)
Matrix Multiply Batch Shapes Transformer - u8s8f32 | 4.98031 (SE +/- 0.00815, N = 3, MIN: 4.9) | 4.96746 (SE +/- 0.00255, N = 3, MIN: 4.89) | 4.97285 (SE +/- 0.00159, N = 3, MIN: 4.88)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed Erlang/OTP Compilation 23.2
Seconds, Fewer Is Better
Test | 1 | 2 | 3
Time To Compile | 170.80 (SE +/- 0.10, N = 3) | 171.21 (SE +/- 0.12, N = 3) | 170.80 (SE +/- 0.15, N = 3)
Liquid-DSP 2021.01.31 - Threads: 1 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better) - Run 1: 56320333 (SE +/- 221002.51, N = 3); Run 2: 56235333 (SE +/- 183205.11, N = 3); Run 3: 56134333 (SE +/- 290873.59, N = 3).
Liquid-DSP 2021.01.31 - Threads: 2 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better) - Run 1: 111863333 (SE +/- 462793.45, N = 3); Run 2: 111236667 (SE +/- 287595.39, N = 3); Run 3: 111496667 (SE +/- 362276.75, N = 3).
Liquid-DSP 2021.01.31 - Threads: 4 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better) - Run 1: 209113333 (SE +/- 384071.75, N = 3); Run 2: 209460000 (SE +/- 308598.98, N = 3); Run 3: 209336667 (SE +/- 626905.98, N = 3).
Liquid-DSP 2021.01.31 - Threads: 8 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better) - Run 1: 378376667 (SE +/- 54873.59, N = 3); Run 2: 378373333 (SE +/- 140988.57, N = 3); Run 3: 377836667 (SE +/- 471322.02, N = 3).
Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better) - Run 1: 438133333 (SE +/- 653715.87, N = 3); Run 2: 437706667 (SE +/- 698721.05, N = 3); Run 3: 439076667 (SE +/- 745035.42, N = 3).
Liquid-DSP build, all thread counts: (CC) gcc options: -O3 -pthread -lm -lc -lliquid
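The Liquid-DSP figures above cover 1 to 16 threads on a CPU listed as 8 cores / 16 threads, so a rough scaling estimate can be derived from them. The following is a minimal sketch that uses only the run 1 throughput values; the names `throughput`, `speedup` and `efficiency` are illustrative and not part of the Phoronix Test Suite output.

```python
# Liquid-DSP throughput (samples/s) for run 1, copied from the results above.
throughput = {
    1: 56320333,
    2: 111863333,
    4: 209113333,
    8: 378376667,
    16: 438133333,
}

baseline = throughput[1]

# Speedup relative to the single-thread result.
speedup = {threads: value / baseline for threads, value in throughput.items()}

# Parallel efficiency: speedup divided by the number of threads used.
efficiency = {threads: speedup[threads] / threads for threads in throughput}

for threads in sorted(throughput):
    print(f"{threads:>2} threads: speedup {speedup[threads]:.2f}x, "
          f"efficiency {efficiency[threads]:.0%}")
```

Swapping in the run 2 or run 3 values gives a quick check on whether the scaling estimate is stable across runs.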
ViennaCL 1.7.1 - Test: CPU BLAS - sCOPY (GB/s, More Is Better) - Run 1: 18.5 (SE +/- 0.00, N = 3); Run 2: 18.4 (SE +/- 0.06, N = 3); Run 3: 18.5 (SE +/- 0.03, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - sAXPY (GB/s, More Is Better) - Run 1: 27.6 (SE +/- 0.03, N = 3); Run 2: 27.5 (SE +/- 0.03, N = 3); Run 3: 27.5 (SE +/- 0.03, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - sDOT (GB/s, More Is Better) - Run 1: 39.1 (SE +/- 0.07, N = 3); Run 2: 39.0 (SE +/- 0.09, N = 3); Run 3: 39.0 (SE +/- 0.09, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dAXPY (GB/s, More Is Better) - Run 1: 27.8 (SE +/- 0.05, N = 2); Run 2: 27.7 (SE +/- 0.03, N = 3); Run 3: 27.7 (SE +/- 0.03, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dDOT (GB/s, More Is Better) - Run 1: 39.0 (SE +/- 0.07, N = 3); Run 2: 39.2 (SE +/- 0.07, N = 3); Run 3: 39.2 (SE +/- 0.10, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-N (GB/s, More Is Better) - Run 1: 41.4 (SE +/- 0.15, N = 3); Run 2: 41.4 (SE +/- 0.18, N = 3); Run 3: 41.3 (SE +/- 0.18, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-T (GB/s, More Is Better) - Run 1: 43.2 (SE +/- 0.00, N = 3); Run 2: 43.2 (SE +/- 0.00, N = 3); Run 3: 43.1 (SE +/- 0.07, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-NN (GFLOPs/s, More Is Better) - Run 1: 33.3 (SE +/- 0.29, N = 3); Run 2: 33.5 (SE +/- 0.40, N = 3); Run 3: 33.3 (SE +/- 0.40, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-NT (GFLOPs/s, More Is Better) - Run 1: 32.7 (SE +/- 0.36, N = 3); Run 2: 32.7 (SE +/- 0.35, N = 3); Run 3: 32.5 (SE +/- 0.42, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-TN (GFLOPs/s, More Is Better) - Run 1: 34.7 (SE +/- 0.06, N = 3); Run 2: 34.5 (SE +/- 0.07, N = 3); Run 3: 34.5 (SE +/- 0.03, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-TT (GFLOPs/s, More Is Better) - Run 1: 34.2 (SE +/- 0.06, N = 3); Run 2: 34.3 (SE +/- 0.12, N = 3); Run 3: 34.4 (SE +/- 0.19, N = 3).
ViennaCL 1.7.1 - Test: CPU BLAS - dCOPY (GB/s, More Is Better) - Run 1: 18.4; Run 2: 18.5; Run 3: 18.4. A single SE +/- 0.00, N = 2 is reported, without an identifiable run.
ViennaCL build, all tests: (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Basis Universal 1.13 - Settings: ETC1S (Seconds, Fewer Is Better) - Run 1: 31.19 (SE +/- 0.10, N = 3); Run 2: 31.13 (SE +/- 0.02, N = 3); Run 3: 31.11 (SE +/- 0.04, N = 3).
Basis Universal 1.13 - Settings: UASTC Level 0 (Seconds, Fewer Is Better) - Run 1: 9.246 (SE +/- 0.007, N = 3); Run 2: 9.219 (SE +/- 0.015, N = 3); Run 3: 9.187 (SE +/- 0.005, N = 3).
Basis Universal 1.13 - Settings: UASTC Level 2 (Seconds, Fewer Is Better) - Run 1: 40.53 (SE +/- 0.08, N = 3); Run 2: 40.50 (SE +/- 0.07, N = 3); Run 3: 40.45 (SE +/- 0.11, N = 3).
Basis Universal 1.13 - Settings: UASTC Level 3 (Seconds, Fewer Is Better) - Run 1: 77.20 (SE +/- 0.04, N = 3); Run 2: 77.18 (SE +/- 0.06, N = 3); Run 3: 77.12 (SE +/- 0.07, N = 3).
Basis Universal build, all settings: (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenSCAD - Render: Pistol (Seconds, Fewer Is Better) - Run 1: 150.19 (SE +/- 0.29, N = 3); Run 2: 151.19 (SE +/- 0.45, N = 3); Run 3: 151.74 (SE +/- 0.85, N = 3).
OpenSCAD - Render: Retro Car (Seconds, Fewer Is Better) - Run 1: 27.14 (SE +/- 0.04, N = 3); Run 2: 27.12 (SE +/- 0.02, N = 3); Run 3: 27.30 (SE +/- 0.01, N = 3).
OpenSCAD - Render: Mini-ITX Case (Seconds, Fewer Is Better) - Run 1: 65.39 (SE +/- 0.15, N = 3); Run 2: 65.86 (SE +/- 0.24, N = 3); Run 3: 65.65 (SE +/- 0.24, N = 3).
OpenSCAD - Render: Projector Mount Swivel (Seconds, Fewer Is Better) - Run 1: 170.71 (SE +/- 0.17, N = 3); Run 2: 169.99 (SE +/- 1.17, N = 3); Run 3: 172.29 (SE +/- 0.80, N = 3).
OpenSCAD - Render: Leonardo Phone Case Slim (Seconds, Fewer Is Better) - Run 1: 25.40 (SE +/- 0.08, N = 3); Run 2: 25.39 (SE +/- 0.09, N = 3); Run 3: 25.43 (SE +/- 0.07, N = 3).
OpenSCAD, all renders: OpenSCAD version 2019.01-RC2
Mobile Neural Network 1.1.3 - Model: SqueezeNetV1.0 (ms, Fewer Is Better) - Run 1: 10.15 (SE +/- 0.05, N = 3, MIN: 9.96 / MAX: 15.18); Run 2: 10.22 (SE +/- 0.14, N = 3, MIN: 9.92 / MAX: 25.01); Run 3: 10.07 (SE +/- 0.03, N = 3, MIN: 9.92 / MAX: 12.59).
Mobile Neural Network 1.1.3 - Model: resnet-v2-50 (ms, Fewer Is Better) - Run 1: 45.86 (SE +/- 0.14, N = 3, MIN: 45.28 / MAX: 65.36); Run 2: 46.23 (SE +/- 0.48, N = 3, MIN: 45.1 / MAX: 60.21); Run 3: 45.69 (SE +/- 0.13, N = 3, MIN: 45.03 / MAX: 60.75).
Mobile Neural Network 1.1.3 - Model: MobileNetV2_224 (ms, Fewer Is Better) - Run 1: 5.398 (SE +/- 0.047, N = 3, MIN: 5.27 / MAX: 11.32); Run 2: 5.415 (SE +/- 0.077, N = 3, MIN: 5.3 / MAX: 24.9); Run 3: 5.387 (SE +/- 0.021, N = 3, MIN: 5.31 / MAX: 7.46).
Mobile Neural Network 1.1.3 - Model: mobilenet-v1-1.0 (ms, Fewer Is Better) - Run 1: 4.918 (SE +/- 0.034, N = 3, MIN: 4.83 / MAX: 6.07); Run 2: 4.887 (SE +/- 0.055, N = 3, MIN: 4.79 / MAX: 8.37); Run 3: 4.904 (SE +/- 0.009, N = 3, MIN: 4.85 / MAX: 19.45).
Mobile Neural Network 1.1.3 - Model: inception-v3 (ms, Fewer Is Better) - Run 1: 54.19 (SE +/- 0.40, N = 3, MIN: 53.22 / MAX: 69.33); Run 2: 55.77 (SE +/- 1.06, N = 3, MIN: 53.52 / MAX: 70.99); Run 3: 54.03 (SE +/- 0.14, N = 3, MIN: 53.61 / MAX: 68.4).
Mobile Neural Network build, all models: (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Sysbench 1.0.20 - Test: RAM / Memory (MiB/sec, More Is Better) - Run 1: 7979.40 (SE +/- 21.59, N = 3); Run 2: 8016.57 (SE +/- 7.05, N = 3); Run 3: 8052.05 (SE +/- 22.63, N = 3).
Sysbench 1.0.20 - Test: CPU (Events Per Second, More Is Better) - Run 1: 14106.47 (SE +/- 0.53, N = 3); Run 2: 14107.97 (SE +/- 0.20, N = 3); Run 3: 14154.66 (SE +/- 6.91, N = 3).
Sysbench build, both tests: (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Systemd Total Boot Time - Test: Total (ms, Fewer Is Better) - Run 1: 26056; Run 2: 26056; Run 3: 26056.
Systemd Total Boot Time - Test: Kernel (ms, Fewer Is Better) - Run 1: 2761; Run 2: 2761; Run 3: 2761.
Systemd Total Boot Time - Test: Loader (ms, Fewer Is Better) - Run 1: 5212; Run 2: 5212; Run 3: 5212.
Systemd Total Boot Time - Test: Firmware (ms, Fewer Is Better) - Run 1: 11240; Run 2: 11240; Run 3: 11240.
Systemd Total Boot Time - Test: Userspace (ms, Fewer Is Better) - Run 1: 23295; Run 2: 23295; Run 3: 23295.
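As an arithmetic check on the boot-time figures above, the reported Total matches the sum of the Kernel and Userspace stages, not the sum of all four stages. The sketch below only reproduces that arithmetic from the listed values. Whether the test profile deliberately excludes Firmware and Loader from Total is an assumption that this export does not confirm, and the variable names are illustrative.

```python
# Boot-stage times in ms, copied from the results above (identical in all three runs).
stages_ms = {
    "firmware": 11240,
    "loader": 5212,
    "kernel": 2761,
    "userspace": 23295,
}
reported_total_ms = 26056

# Candidate sums to compare against the reported total.
kernel_plus_userspace = stages_ms["kernel"] + stages_ms["userspace"]
all_stages = sum(stages_ms.values())

print(f"kernel + userspace = {kernel_plus_userspace} ms")
print(f"all four stages    = {all_stages} ms")
print(f"reported total     = {reported_total_ms} ms")
```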
Phoronix Test Suite v10.8.4