2400G 2021 AMD Ryzen 5 2400G testing with a MSI B350M GAMING PRO (MS-7A39) v1.0 (2.NM BIOS) and MSI AMD Radeon Vega / Mobile 2GB on Ubuntu 19.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101160-HA-2400G202136&sro&grr .
2400G 2021 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 4 AMD Ryzen 5 2400G @ 3.60GHz (4 Cores / 8 Threads) MSI B350M GAMING PRO (MS-7A39) v1.0 (2.NM BIOS) AMD Raven/Raven2 6GB 120GB Corsair Force MP500 MSI AMD Radeon Vega / Mobile 2GB (1250/1467MHz) AMD Raven/Raven2/Fenghuang G237HL Realtek RTL8111/8168/8411 Ubuntu 19.10 5.3.0-64-generic (x86_64) GNOME Shell 3.34.1 X Server 1.20.5 modesetting 1.20.5 4.5 Mesa 19.2.8 (LLVM 9.0.0) 1.1.107 GCC 9.2.1 20191008 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8101016 Java Details - OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-2ubuntu219.10) Python Details - Python 2.7.17rc1 + Python 3.7.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
2400G 2021 astcenc: Exhaustive cp2k: Fayalite-FIST Data kvazaar: Bosphorus 4K - Medium build-godot: Time To Compile build2: Time To Compile numpy: onednn: Recurrent Neural Network Training - f32 - CPU asmfish: 1024 Hash Memory, 26 Depth dav1d: Chimera 1080p 10-bit compress-lz4: 3 - Decompression Speed compress-lz4: 3 - Compression Speed compress-lz4: 1 - Decompression Speed compress-lz4: 1 - Compression Speed cloverleaf: Lagrangian-Eulerian Hydrodynamics compress-lz4: 9 - Decompression Speed compress-lz4: 9 - Compression Speed mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 espeak: Text-To-Speech Synthesis unpack-firefox: firefox-84.0.source.tar.xz clomp: Static OMP Speedup ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet build-ffmpeg: Time To Compile hmmer: Pfam Database Search kvazaar: Bosphorus 4K - Very Fast x265: Bosphorus 4K onednn: Recurrent Neural Network Inference - f32 - CPU node-web-tooling: stockfish: Total Time cryptsetup: Twofish-XTS 512b Decryption cryptsetup: Twofish-XTS 512b Encryption cryptsetup: Serpent-XTS 512b Decryption cryptsetup: Serpent-XTS 512b Encryption cryptsetup: AES-XTS 512b Decryption cryptsetup: AES-XTS 512b Encryption cryptsetup: Twofish-XTS 256b Decryption cryptsetup: Twofish-XTS 256b Encryption cryptsetup: Serpent-XTS 256b Decryption cryptsetup: Serpent-XTS 256b Encryption cryptsetup: AES-XTS 256b Decryption cryptsetup: AES-XTS 256b Encryption cryptsetup: PBKDF2-whirlpool cryptsetup: PBKDF2-sha512 build-eigen: Time To Compile sqlite-speedtest: Timed Time - Size 1,000 kvazaar: Bosphorus 1080p - Medium rav1e: 5 astcenc: Thorough kvazaar: Bosphorus 4K - Ultra Fast rav1e: 1 indigobench: CPU - Bedroom indigobench: CPU - Supercar simdjson: LargeRand simdjson: PartialTweets simdjson: DistinctUserID rav1e: 6 dav1d: Summer Nature 4K dav1d: Chimera 1080p simdjson: Kostya redis: LPUSH onednn: Deconvolution Batch shapes_1d - f32 - CPU redis: LPOP phpbench: PHP Benchmark Suite rav1e: 10 redis: SADD kvazaar: Bosphorus 1080p - Very Fast redis: GET crafty: Elapsed Time x265: Bosphorus 1080p coremark: CoreMark Size 666 - Iterations Per Second sunflow: Global Illumination + Image Synthesis redis: SET encode-wavpack: WAV To WavPack tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 kvazaar: Bosphorus 1080p - Ultra Fast dav1d: Summer Nature 1080p onednn: IP Shapes 1D - f32 - CPU astcenc: Medium encode-opus: WAV To Opus Encode onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU amg: onednn: IP Shapes 3D - f32 - CPU astcenc: Fast lammps: Rhodopsin Protein onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU 1 2 3 4 546.60 1355.877 1.88 308.135 296.225 281.21 8155.36 11318679 65.31 8702.4 43.29 8890.8 8182.46 176.50 8727.9 44.38 72.434 7.198 5.386 57.704 11.444 34.580 23.585 2 19.47 43.27 56.71 66.32 21.60 25.30 103.36 30.50 3.02 15.70 10.05 11.15 9.47 12.00 48.74 131.199 126.297 4.96 5.31 7519.82 9.09 7702974 383.8 384.5 368.2 353.2 1559.4 1559.4 384.1 384.4 368.0 341.8 1759.5 1756.2 617890 1358386 82.186 79.071 8.13 0.848 65.14 8.93 0.312 0.773 1.613 0.34 0.44 0.45 1.088 70.29 242.68 0.4 1120777.87 20.0218 2089551.59 517313 2.569 1625716.67 19.90 2002260.29 6937983 22.75 146076.571361 2.515 1360473.30 14.328 294.501 275.705 34.91 236.91 15.9974 9.86 8.409 6.89532 203466800 9.76165 8.45 2.857 21.5866 34.9925 546.99 1338.944 1.88 305.665 298.542 280.17 8342.65 11219779 65.22 8704.0 44.68 8831.0 8199.81 171.79 8697.2 42.96 74.404 7.204 5.381 58.927 12.761 34.794 23.466 2 19.48 42.28 56.74 66.08 22.42 25.30 104.71 30.59 3.05 15.93 10.58 11.02 9.77 11.59 49.12 130.791 126.319 4.98 5.32 7498.52 9.05 7764480 385.3 386.4 368.3 339.3 1563.9 1562.7 385.3 386.4 370.5 369.3 1766.3 1762.9 619487 1421589 82.110 80.731 8.12 0.853 65.12 8.95 0.313 0.774 1.617 0.34 0.44 0.45 1.078 70.41 242.71 0.4 1134102.71 19.8197 1192686.09 520765 2.593 1613560.77 19.78 1837364.39 6918794 22.54 146125.960019 2.561 1410438.04 14.364 292.846 274.071 34.79 237.53 16.0547 9.83 8.540 6.99297 204210267 9.98826 8.37 2.864 21.4621 35.0847 545.83 1351.369 1.88 305.722 297.008 280.40 8103.95 11301303 65.26 8704.8 43.27 8778.0 8099.62 182.03 8710.5 43.10 73.221 7.235 5.446 57.507 11.317 35.192 23.220 2 19.50 43.30 56.78 66.85 21.93 25.28 104.15 31.59 3.38 16.07 10.27 10.98 9.53 12.33 49.26 130.614 126.143 4.96 5.30 7499.59 8.97 7646379 374.2 372.5 360.4 349.7 1532.5 1530.9 374.1 371.9 360.5 354.2 1726.2 1681.4 619727 1420249 82.353 78.973 8.12 0.848 64.90 8.94 0.317 0.778 1.626 0.34 0.44 0.45 1.090 70.50 242.04 0.4 1150279.50 19.9791 1214876.92 520417 2.575 1575249.37 19.91 1826116.87 6936477 22.55 146576.086059 2.547 1411275.42 14.394 291.122 273.146 34.91 236.72 16.0860 9.84 8.382 6.91070 203312833 9.93337 8.36 2.913 21.5901 35.0568 1354.311 1.88 8394.91 65.23 8674.7 43.95 8744.0 8045.71 181.70 8694.3 44.41 2 126.645 4.96 5.30 7501.96 7750184 8.12 0.853 8.95 0.320 0.34 0.44 0.45 1.077 70.47 242.98 0.4 20.4562 2.577 19.88 6923515 22.84 146261.358184 35.11 237.31 16.0591 6.89369 203289100 9.96883 2.898 21.7181 35.1873 OpenBenchmarking.org
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive 1 2 3 120 240 360 480 600 SE +/- 0.53, N = 3 SE +/- 0.59, N = 3 SE +/- 0.78, N = 3 546.60 546.99 545.83 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
CP2K Molecular Dynamics Fayalite-FIST Data OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 8.1 Fayalite-FIST Data 1 2 3 4 300 600 900 1200 1500 1355.88 1338.94 1351.37 1354.31
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium 1 2 3 4 0.423 0.846 1.269 1.692 2.115 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1.88 1.88 1.88 1.88 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 1 2 3 70 140 210 280 350 SE +/- 1.18, N = 3 SE +/- 0.24, N = 3 SE +/- 0.40, N = 3 308.14 305.67 305.72
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile 1 2 3 70 140 210 280 350 SE +/- 2.45, N = 3 SE +/- 4.17, N = 3 SE +/- 3.18, N = 3 296.23 298.54 297.01
Numpy Benchmark OpenBenchmarking.org Score, More Is Better Numpy Benchmark 1 2 3 60 120 180 240 300 SE +/- 0.40, N = 3 SE +/- 0.50, N = 3 SE +/- 0.47, N = 3 281.21 280.17 280.40
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 4 2K 4K 6K 8K 10K SE +/- 93.64, N = 3 SE +/- 74.29, N = 11 SE +/- 104.65, N = 15 SE +/- 113.91, N = 3 8155.36 8342.65 8103.95 8394.91 MIN: 7535.58 MIN: 7400.41 MIN: 7388.27 MIN: 7807.51 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth 1 2 3 2M 4M 6M 8M 10M SE +/- 142536.84, N = 3 SE +/- 32130.07, N = 3 SE +/- 65567.52, N = 3 11318679 11219779 11301303
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 1 2 3 4 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 65.31 65.22 65.26 65.23 MIN: 43.39 / MAX: 153.22 MIN: 43.37 / MAX: 150.79 MIN: 43.24 / MAX: 150.77 MIN: 43.38 / MAX: 150.04 1. (CC) gcc options: -pthread
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed 1 2 3 4 2K 4K 6K 8K 10K SE +/- 9.85, N = 3 SE +/- 24.99, N = 3 SE +/- 11.38, N = 12 SE +/- 4.59, N = 15 8702.4 8704.0 8704.8 8674.7 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed 1 2 3 4 10 20 30 40 50 SE +/- 0.67, N = 3 SE +/- 0.31, N = 3 SE +/- 0.73, N = 12 SE +/- 0.58, N = 15 43.29 44.68 43.27 43.95 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed 1 2 3 4 2K 4K 6K 8K 10K SE +/- 31.60, N = 3 SE +/- 23.72, N = 3 SE +/- 61.19, N = 3 SE +/- 45.93, N = 3 8890.8 8831.0 8778.0 8744.0 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed 1 2 3 4 2K 4K 6K 8K 10K SE +/- 57.23, N = 3 SE +/- 42.47, N = 3 SE +/- 103.87, N = 3 SE +/- 82.51, N = 3 8182.46 8199.81 8099.62 8045.71 1. (CC) gcc options: -O3
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 2 3 4 40 80 120 160 200 SE +/- 0.62, N = 3 SE +/- 0.28, N = 3 SE +/- 0.14, N = 3 SE +/- 0.09, N = 3 176.50 171.79 182.03 181.70 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed 1 2 3 4 2K 4K 6K 8K 10K SE +/- 19.50, N = 3 SE +/- 7.28, N = 15 SE +/- 8.77, N = 3 SE +/- 7.53, N = 7 8727.9 8697.2 8710.5 8694.3 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed 1 2 3 4 10 20 30 40 50 SE +/- 0.49, N = 3 SE +/- 0.61, N = 15 SE +/- 0.66, N = 3 SE +/- 0.48, N = 7 44.38 42.96 43.10 44.41 1. (CC) gcc options: -O3
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 1 2 3 20 40 60 80 100 SE +/- 0.39, N = 3 SE +/- 1.31, N = 3 SE +/- 1.07, N = 3 72.43 74.40 73.22 MIN: 70.53 / MAX: 86.1 MIN: 70.84 / MAX: 120.63 MIN: 70.71 / MAX: 88.15 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 1 2 3 2 4 6 8 10 SE +/- 0.083, N = 3 SE +/- 0.074, N = 3 SE +/- 0.094, N = 3 7.198 7.204 7.235 MIN: 6.98 / MAX: 20.14 MIN: 7 / MAX: 9.44 MIN: 6.99 / MAX: 8.95 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 3 1.2254 2.4508 3.6762 4.9016 6.127 SE +/- 0.013, N = 3 SE +/- 0.011, N = 3 SE +/- 0.049, N = 3 5.386 5.381 5.446 MIN: 5.32 / MAX: 17.3 MIN: 5.32 / MAX: 16.01 MIN: 5.34 / MAX: 18.36 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 2 3 13 26 39 52 65 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 SE +/- 0.15, N = 3 57.70 58.93 57.51 MIN: 56.97 / MAX: 97.24 MIN: 58.25 / MAX: 71.96 MIN: 56.78 / MAX: 70.54 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 2 3 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 11.44 12.76 11.32 MIN: 11.27 / MAX: 24.54 MIN: 12.63 / MAX: 14.44 MIN: 11.18 / MAX: 13.37 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis 1 2 3 8 16 24 32 40 SE +/- 0.31, N = 20 SE +/- 0.26, N = 20 SE +/- 0.51, N = 4 34.58 34.79 35.19 1. (CC) gcc options: -O2 -std=c99
Unpacking Firefox Extracting: firefox-84.0.source.tar.xz OpenBenchmarking.org Seconds, Fewer Is Better Unpacking Firefox 84.0 Extracting: firefox-84.0.source.tar.xz 1 2 3 6 12 18 24 30 SE +/- 0.36, N = 20 SE +/- 0.41, N = 20 SE +/- 0.40, N = 16 23.59 23.47 23.22
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup 1 2 3 4 0.45 0.9 1.35 1.8 2.25 2 2 2 2 1. (CC) gcc options: -fopenmp -O3 -lm
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.16, N = 3 19.47 19.48 19.50 MIN: 19.23 / MAX: 21.64 MIN: 19.18 / MAX: 31.02 MIN: 19.16 / MAX: 23.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd 1 2 3 10 20 30 40 50 SE +/- 0.41, N = 3 SE +/- 0.25, N = 3 SE +/- 0.13, N = 3 43.27 42.28 43.30 MIN: 36.99 / MAX: 55.23 MIN: 36.93 / MAX: 50.75 MIN: 37.21 / MAX: 48.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny 1 2 3 13 26 39 52 65 SE +/- 0.14, N = 3 SE +/- 0.15, N = 3 SE +/- 0.13, N = 3 56.71 56.74 56.78 MIN: 53.66 / MAX: 69.53 MIN: 54.12 / MAX: 68.2 MIN: 53.9 / MAX: 66.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 1 2 3 15 30 45 60 75 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 SE +/- 0.61, N = 3 66.32 66.08 66.85 MIN: 65.96 / MAX: 76.23 MIN: 65.7 / MAX: 77.86 MIN: 65.73 / MAX: 78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet 1 2 3 5 10 15 20 25 SE +/- 0.42, N = 3 SE +/- 0.81, N = 3 SE +/- 0.96, N = 3 21.60 22.42 21.93 MIN: 20.81 / MAX: 28.41 MIN: 20.82 / MAX: 31.14 MIN: 20.79 / MAX: 24.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 25.30 25.30 25.28 MIN: 25 / MAX: 28.34 MIN: 24.92 / MAX: 35.73 MIN: 24.92 / MAX: 37.74 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 1 2 3 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.37, N = 3 SE +/- 0.51, N = 3 103.36 104.71 104.15 MIN: 101.57 / MAX: 120.6 MIN: 101.58 / MAX: 116.21 MIN: 101.58 / MAX: 119.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet 1 2 3 7 14 21 28 35 SE +/- 1.26, N = 3 SE +/- 1.12, N = 3 SE +/- 1.18, N = 3 30.50 30.59 31.59 MIN: 28.43 / MAX: 47.49 MIN: 28.35 / MAX: 34.7 MIN: 28.8 / MAX: 36.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface 1 2 3 0.7605 1.521 2.2815 3.042 3.8025 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 3.02 3.05 3.38 MIN: 2.77 / MAX: 3.46 MIN: 2.78 / MAX: 3.47 MIN: 2.87 / MAX: 3.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 1 2 3 4 8 12 16 20 SE +/- 0.14, N = 3 SE +/- 0.41, N = 3 SE +/- 0.41, N = 3 15.70 15.93 16.07 MIN: 15.2 / MAX: 19.7 MIN: 15.16 / MAX: 18.44 MIN: 15.12 / MAX: 18.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet 1 2 3 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.13, N = 3 10.05 10.58 10.27 MIN: 9.75 / MAX: 11.86 MIN: 10.21 / MAX: 12.32 MIN: 9.7 / MAX: 21.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 1 2 3 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 11.15 11.02 10.98 MIN: 10.9 / MAX: 23.16 MIN: 10.78 / MAX: 13.17 MIN: 10.8 / MAX: 12.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 9.47 9.77 9.53 MIN: 9 / MAX: 21.33 MIN: 9.05 / MAX: 13.2 MIN: 8.98 / MAX: 11.5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 3 6 9 12 15 SE +/- 0.26, N = 3 SE +/- 0.47, N = 3 SE +/- 0.17, N = 3 12.00 11.59 12.33 MIN: 11.15 / MAX: 17.47 MIN: 10.26 / MAX: 15.27 MIN: 11.34 / MAX: 17.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet 1 2 3 11 22 33 44 55 SE +/- 0.21, N = 3 SE +/- 0.18, N = 3 SE +/- 0.47, N = 3 48.74 49.12 49.26 MIN: 47.01 / MAX: 52.41 MIN: 46.96 / MAX: 60.99 MIN: 46.93 / MAX: 52.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile 1 2 3 30 60 90 120 150 SE +/- 0.48, N = 3 SE +/- 0.33, N = 3 SE +/- 0.19, N = 3 131.20 130.79 130.61
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 4 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.29, N = 3 SE +/- 0.45, N = 3 SE +/- 0.26, N = 3 126.30 126.32 126.14 126.65 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast 1 2 3 4 1.1205 2.241 3.3615 4.482 5.6025 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 4.96 4.98 4.96 4.96 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K 1 2 3 4 1.197 2.394 3.591 4.788 5.985 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 5.31 5.32 5.30 5.30 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 4 1600 3200 4800 6400 8000 SE +/- 17.90, N = 3 SE +/- 8.54, N = 3 SE +/- 14.23, N = 3 SE +/- 27.70, N = 3 7519.82 7498.52 7499.59 7501.96 MIN: 7453.06 MIN: 7439.82 MIN: 7425.51 MIN: 7396.37 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark 1 2 3 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 9.09 9.05 8.97 1. Nodejs
v10.15.2
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time 1 2 3 4 1.7M 3.4M 5.1M 6.8M 8.5M SE +/- 95465.05, N = 3 SE +/- 128808.86, N = 3 SE +/- 81107.31, N = 3 SE +/- 89981.20, N = 3 7702974 7764480 7646379 7750184 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
Cryptsetup Twofish-XTS 512b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Decryption 1 2 3 80 160 240 320 400 SE +/- 1.40, N = 12 SE +/- 0.10, N = 2 SE +/- 6.58, N = 3 383.8 385.3 374.2
Cryptsetup Twofish-XTS 512b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 512b Encryption 1 2 3 80 160 240 320 400 SE +/- 1.72, N = 13 SE +/- 0.00, N = 2 SE +/- 7.88, N = 3 384.5 386.4 372.5
Cryptsetup Serpent-XTS 512b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Decryption 1 2 3 80 160 240 320 400 SE +/- 1.22, N = 15 SE +/- 1.76, N = 3 SE +/- 5.35, N = 3 368.2 368.3 360.4
Cryptsetup Serpent-XTS 512b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 512b Encryption 1 2 3 80 160 240 320 400 SE +/- 5.87, N = 15 SE +/- 17.20, N = 3 SE +/- 8.24, N = 3 353.2 339.3 349.7
Cryptsetup AES-XTS 512b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Decryption 1 2 3 300 600 900 1200 1500 SE +/- 3.88, N = 15 SE +/- 3.70, N = 3 SE +/- 18.68, N = 3 1559.4 1563.9 1532.5
Cryptsetup AES-XTS 512b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 512b Encryption 1 2 3 300 600 900 1200 1500 SE +/- 4.00, N = 15 SE +/- 4.19, N = 3 SE +/- 19.24, N = 3 1559.4 1562.7 1530.9
Cryptsetup Twofish-XTS 256b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Decryption 1 2 3 80 160 240 320 400 SE +/- 1.19, N = 15 SE +/- 0.00, N = 3 SE +/- 6.67, N = 3 384.1 385.3 374.1
Cryptsetup Twofish-XTS 256b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Twofish-XTS 256b Encryption 1 2 3 80 160 240 320 400 SE +/- 1.73, N = 15 SE +/- 0.07, N = 3 SE +/- 7.32, N = 3 384.4 386.4 371.9
Cryptsetup Serpent-XTS 256b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Decryption 1 2 3 80 160 240 320 400 SE +/- 1.28, N = 15 SE +/- 0.30, N = 3 SE +/- 6.08, N = 3 368.0 370.5 360.5
Cryptsetup Serpent-XTS 256b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup Serpent-XTS 256b Encryption 1 2 3 80 160 240 320 400 SE +/- 4.94, N = 15 SE +/- 4.12, N = 3 SE +/- 3.15, N = 3 341.8 369.3 354.2
Cryptsetup AES-XTS 256b Decryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Decryption 1 2 3 400 800 1200 1600 2000 SE +/- 5.86, N = 15 SE +/- 6.26, N = 3 SE +/- 27.72, N = 3 1759.5 1766.3 1726.2
Cryptsetup AES-XTS 256b Encryption OpenBenchmarking.org MiB/s, More Is Better Cryptsetup AES-XTS 256b Encryption 1 2 3 400 800 1200 1600 2000 SE +/- 6.44, N = 15 SE +/- 3.18, N = 3 SE +/- 49.12, N = 3 1756.2 1762.9 1681.4
Cryptsetup PBKDF2-whirlpool OpenBenchmarking.org Iterations Per Second, More Is Better Cryptsetup PBKDF2-whirlpool 1 2 3 130K 260K 390K 520K 650K SE +/- 1649.33, N = 15 SE +/- 1355.87, N = 3 SE +/- 733.33, N = 3 617890 619487 619727
Cryptsetup PBKDF2-sha512 OpenBenchmarking.org Iterations Per Second, More Is Better Cryptsetup PBKDF2-sha512 1 2 3 300K 600K 900K 1200K 1500K SE +/- 15050.06, N = 15 SE +/- 8936.67, N = 3 SE +/- 6334.57, N = 3 1358386 1421589 1420249
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile 1 2 3 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 82.19 82.11 82.35
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 1 2 3 20 40 60 80 100 SE +/- 0.86, N = 3 SE +/- 0.40, N = 3 SE +/- 0.81, N = 3 79.07 80.73 78.97 1. (CC) gcc options: -O2 -ldl -lz -lpthread
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium 1 2 3 4 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.13 8.12 8.12 8.12 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 3 4 0.1919 0.3838 0.5757 0.7676 0.9595 SE +/- 0.004, N = 3 SE +/- 0.000, N = 3 SE +/- 0.005, N = 3 SE +/- 0.006, N = 3 0.848 0.853 0.848 0.853
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough 1 2 3 15 30 45 60 75 SE +/- 0.23, N = 3 SE +/- 0.24, N = 3 SE +/- 0.22, N = 3 65.14 65.12 64.90 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast 1 2 3 4 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 8.93 8.95 8.94 8.95 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 1 2 3 4 0.072 0.144 0.216 0.288 0.36 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.312 0.313 0.317 0.320
IndigoBench Acceleration: CPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Bedroom 1 2 3 0.1751 0.3502 0.5253 0.7004 0.8755 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 0.773 0.774 0.778
IndigoBench Acceleration: CPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar 1 2 3 0.3659 0.7318 1.0977 1.4636 1.8295 SE +/- 0.003, N = 3 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 1.613 1.617 1.626
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom 1 2 3 4 0.0765 0.153 0.2295 0.306 0.3825 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.34 0.34 0.34 0.34 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets 1 2 3 4 0.099 0.198 0.297 0.396 0.495 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.44 0.44 0.44 0.44 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID 1 2 3 4 0.1013 0.2026 0.3039 0.4052 0.5065 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.45 0.45 0.45 0.45 1. (CXX) g++ options: -O3 -pthread
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 2 3 4 0.2453 0.4906 0.7359 0.9812 1.2265 SE +/- 0.007, N = 3 SE +/- 0.002, N = 3 SE +/- 0.006, N = 3 SE +/- 0.005, N = 3 1.088 1.078 1.090 1.077
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 1 2 3 4 16 32 48 64 80 SE +/- 0.11, N = 3 SE +/- 0.12, N = 3 SE +/- 0.14, N = 3 SE +/- 0.08, N = 3 70.29 70.41 70.50 70.47 MIN: 66.9 / MAX: 76.26 MIN: 66.94 / MAX: 76.37 MIN: 67.04 / MAX: 76.76 MIN: 67.07 / MAX: 76.57 1. (CC) gcc options: -pthread
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 1 2 3 4 50 100 150 200 250 SE +/- 0.67, N = 3 SE +/- 0.62, N = 3 SE +/- 0.95, N = 3 SE +/- 0.88, N = 3 242.68 242.71 242.04 242.98 MIN: 179.52 / MAX: 416.5 MIN: 179.42 / MAX: 421.76 MIN: 179.53 / MAX: 426.81 MIN: 179.71 / MAX: 424.73 1. (CC) gcc options: -pthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya 1 2 3 4 0.09 0.18 0.27 0.36 0.45 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.4 0.4 0.4 0.4 1. (CXX) g++ options: -O3 -pthread
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 200K 400K 600K 800K 1000K SE +/- 18152.00, N = 15 SE +/- 14151.84, N = 15 SE +/- 15401.38, N = 5 1120777.87 1134102.71 1150279.50 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 2 3 4 5 10 15 20 25 SE +/- 0.16, N = 3 SE +/- 0.26, N = 3 SE +/- 0.23, N = 3 SE +/- 0.23, N = 15 20.02 19.82 19.98 20.46 MIN: 18.88 MIN: 18.83 MIN: 18.83 MIN: 18.86 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 24605.67, N = 15 SE +/- 16947.09, N = 15 SE +/- 18592.80, N = 3 2089551.59 1192686.09 1214876.92 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite 1 2 3 110K 220K 330K 440K 550K SE +/- 4398.89, N = 3 SE +/- 2872.36, N = 3 SE +/- 3539.26, N = 3 517313 520765 520417
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 1 2 3 4 0.5834 1.1668 1.7502 2.3336 2.917 SE +/- 0.012, N = 3 SE +/- 0.004, N = 3 SE +/- 0.014, N = 3 SE +/- 0.008, N = 3 2.569 2.593 2.575 2.577
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 300K 600K 900K 1200K 1500K SE +/- 23486.62, N = 13 SE +/- 29790.86, N = 13 SE +/- 8565.40, N = 3 1625716.67 1613560.77 1575249.37 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast 1 2 3 4 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 19.90 19.78 19.91 19.88 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 41912.87, N = 12 SE +/- 16816.34, N = 10 SE +/- 10868.80, N = 3 2002260.29 1837364.39 1826116.87 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time 1 2 3 4 1.5M 3M 4.5M 6M 7.5M SE +/- 7119.37, N = 3 SE +/- 24247.95, N = 3 SE +/- 15347.30, N = 3 SE +/- 6793.89, N = 3 6937983 6918794 6936477 6923515 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p 1 2 3 4 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 SE +/- 0.01, N = 3 SE +/- 0.13, N = 3 22.75 22.54 22.55 22.84 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 1 2 3 4 30K 60K 90K 120K 150K SE +/- 82.97, N = 3 SE +/- 203.24, N = 3 SE +/- 29.09, N = 3 SE +/- 108.15, N = 3 146076.57 146125.96 146576.09 146261.36 1. (CC) gcc options: -O2 -lrt" -lrt
Sunflow Rendering System Global Illumination + Image Synthesis OpenBenchmarking.org Seconds, Fewer Is Better Sunflow Rendering System 0.07.2 Global Illumination + Image Synthesis 1 2 3 0.5762 1.1524 1.7286 2.3048 2.881 SE +/- 0.014, N = 3 SE +/- 0.015, N = 3 SE +/- 0.034, N = 3 2.515 2.561 2.547 MIN: 2.3 / MAX: 3.5 MIN: 2.32 / MAX: 3.38 MIN: 2.31 / MAX: 3.25
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 300K 600K 900K 1200K 1500K SE +/- 16819.77, N = 15 SE +/- 13500.87, N = 3 SE +/- 22164.72, N = 3 1360473.30 1410438.04 1411275.42 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 SE +/- 0.06, N = 5 14.33 14.36 14.39 1. (CXX) g++ options: -rdynamic
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 60 120 180 240 300 SE +/- 0.91, N = 3 SE +/- 0.75, N = 3 SE +/- 0.96, N = 3 294.50 292.85 291.12 MIN: 285.67 / MAX: 317.77 MIN: 282.82 / MAX: 346.97 MIN: 282.45 / MAX: 314.39 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 60 120 180 240 300 SE +/- 0.50, N = 3 SE +/- 0.78, N = 3 SE +/- 0.02, N = 3 275.71 274.07 273.15 MIN: 273.13 / MAX: 289 MIN: 272.51 / MAX: 277.76 MIN: 272.72 / MAX: 273.64 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast 1 2 3 4 8 16 24 32 40 SE +/- 0.14, N = 3 SE +/- 0.04, N = 3 SE +/- 0.23, N = 3 SE +/- 0.13, N = 3 34.91 34.79 34.91 35.11 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 1 2 3 4 50 100 150 200 250 SE +/- 0.96, N = 3 SE +/- 0.57, N = 3 SE +/- 0.28, N = 3 SE +/- 0.59, N = 3 236.91 237.53 236.72 237.31 MIN: 214.92 / MAX: 258.69 MIN: 210.44 / MAX: 259.41 MIN: 211.76 / MAX: 257.76 MIN: 215.04 / MAX: 257.78 1. (CC) gcc options: -pthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 2 3 4 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.11, N = 3 SE +/- 0.20, N = 3 SE +/- 0.19, N = 3 16.00 16.05 16.09 16.06 MIN: 14.3 MIN: 14.16 MIN: 14.31 MIN: 14.23 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 9.86 9.83 9.84 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode 1 2 3 2 4 6 8 10 SE +/- 0.047, N = 5 SE +/- 0.063, N = 5 SE +/- 0.027, N = 5 8.409 8.540 8.382 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 2 3 4 2 4 6 8 10 SE +/- 0.01723, N = 3 SE +/- 0.05526, N = 3 SE +/- 0.00553, N = 3 SE +/- 0.00433, N = 3 6.89532 6.99297 6.91070 6.89369 MIN: 6.63 MIN: 6.73 MIN: 6.71 MIN: 6.69 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 1 2 3 4 40M 80M 120M 160M 200M SE +/- 127171.79, N = 3 SE +/- 442597.72, N = 3 SE +/- 84515.21, N = 3 SE +/- 170844.97, N = 3 203466800 204210267 203312833 203289100 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 4 3 6 9 12 15 SE +/- 0.02437, N = 3 SE +/- 0.02016, N = 3 SE +/- 0.01413, N = 3 SE +/- 0.01524, N = 3 9.76165 9.98826 9.93337 9.96883 MIN: 9.53 MIN: 9.76 MIN: 9.73 MIN: 9.8 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
ASTC Encoder Preset: Fast OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.45 8.37 8.36 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 4 0.6554 1.3108 1.9662 2.6216 3.277 SE +/- 0.032, N = 3 SE +/- 0.031, N = 3 SE +/- 0.002, N = 3 SE +/- 0.013, N = 3 2.857 2.864 2.913 2.898 1. (CXX) g++ options: -O3 -pthread -lm
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 4 5 10 15 20 25 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 21.59 21.46 21.59 21.72 MIN: 19.99 MIN: 20.02 MIN: 20.01 MIN: 20.05 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 2 3 4 8 16 24 32 40 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 34.99 35.08 35.06 35.19 MIN: 30.29 MIN: 32.66 MIN: 30.53 MIN: 32.2 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Phoronix Test Suite v10.8.4