6800K May Intel Core i7-6800K testing with a MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS) and Zotac NVIDIA NV137 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2105106-IB-6800KMAY414&grr&sor .
6800K May Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution 1 2 3 Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads) MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS) Intel Xeon E7 v4/Xeon 16GB 120GB TOSHIBA TR150 Zotac NVIDIA NV137 2GB Realtek ALC1150 G237HL Intel I218-LM + Intel I210 Ubuntu 20.10 5.8.0-41-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 nouveau 4.3 Mesa 20.2.1 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038 Python Details - Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
6800K May build-llvm: Unix Makefiles build-llvm: Ninja helsing: 14 digit build-nodejs: Time To Compile blender: Classroom - CPU-Only gnuradio: Hilbert Transform gnuradio: FM Deemphasis Filter gnuradio: IIR Filter gnuradio: FIR Filter gnuradio: Signal Source (Cosine) gnuradio: Five Back to Back FIR Filters toktx: UASTC 4 + Zstd Compression 19 cp2k: Fayalite-FIST securemark: SecureMark-TLS jpegxl: PNG - 8 blender: Fishy Cat - CPU-Only aom-av1: Speed 4 Two-Pass - Bosphorus 4K xmrig: Monero - 1M gromacs: water_GMX50_bare mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 aom-av1: Speed 0 Two-Pass - Bosphorus 4K xmrig: Wownero - 1M blender: BMW27 - CPU-Only libgav1: Chimera 1080p 10-bit luaradio: Complex Phase luaradio: Hilbert Transform luaradio: FM Deemphasis Filter luaradio: Five Back to Back FIR Filters aom-av1: Speed 6 Two-Pass - Bosphorus 4K mrbayes: Primate Phylogeny Analysis astcenc: Exhaustive jpegxl: PNG - 7 gmpbench: Total Time aom-av1: Speed 4 Two-Pass - Bosphorus 1080p build-linux-kernel: Time To Compile build-erlang: Time To Compile build-wasmer: Time To Compile svt-hevc: 1 - Bosphorus 1080p incompact3d: input.i3d 129 Cells Per Direction dav1d: Chimera 1080p 10-bit incompact3d: input.i3d 193 Cells Per Direction hmmer: Pfam Database Search openscad: Pistol openscad: Projector Mount Swivel avifenc: 0 onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU basis: UASTC Level 3 avifenc: 6, Lossless embree: Pathtracer - Asian Dragon Obj vosk: build-mesa: Time To Compile embree: Pathtracer - Crown onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU embree: Pathtracer ISPC - Asian Dragon Obj aom-av1: Speed 0 Two-Pass - Bosphorus 1080p jpegxl-decode: 1 libgav1: Summer Nature 4K sysbench: CPU embree: Pathtracer ISPC - Crown embree: Pathtracer - Asian Dragon aom-av1: Speed 6 Realtime - Bosphorus 4K libgav1: Chimera 1080p aom-av1: Speed 6 Two-Pass - Bosphorus 1080p embree: Pathtracer ISPC - Asian Dragon compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed viennacl: CPU BLAS - dGEMM-TT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sCOPY jpegxl-decode: All pjsip: INVITE pjsip: OPTIONS, Stateful avifenc: 2 openscad: Retro Car srslte: OFDM_Test compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed basis: UASTC Level 2 srslte: PHY_DL_Test srslte: PHY_DL_Test openscad: Mini-ITX Case stockfish: Total Time qmcpack: simple-H2O liquid-dsp: 8 - 256 - 57 liquid-dsp: 2 - 256 - 57 aom-av1: Speed 6 Realtime - Bosphorus 1080p pjsip: OPTIONS, Stateless stream: Copy chia-vdf: Square Plain C++ compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 3 - Decompression Speed compress-zstd: 3 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed liquid-dsp: 4 - 256 - 57 compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed dav1d: Summer Nature 4K basis: ETC1S jpegxl: PNG - 5 dav1d: Chimera 1080p chia-vdf: Square Assembly Optimized jpegxl: JPEG - 5 astcenc: Thorough draco: Church Facade toktx: UASTC 3 + Zstd Compression 19 basis: UASTC Level 0 libgav1: Summer Nature 1080p aom-av1: Speed 8 Realtime - Bosphorus 4K jpegxl: JPEG - 7 tjbench: Decompression Throughput openscad: Leonardo Phone Case Slim toktx: UASTC 3 avifenc: 6 toktx: Zstd Compression 19 onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU liquid-dsp: 1 - 256 - 57 liquid-dsp: 12 - 256 - 57 aom-av1: Speed 9 Realtime - Bosphorus 4K onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU jpegxl: JPEG - 8 onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU svt-hevc: 7 - Bosphorus 1080p dav1d: Summer Nature 1080p svt-vp9: VMAF Optimized - Bosphorus 1080p helsing: 12 digit onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU avifenc: 10, Lossless svt-vp9: Visual Quality Optimized - Bosphorus 1080p draco: Lion aom-av1: Speed 8 Realtime - Bosphorus 1080p astcenc: Medium aom-av1: Speed 9 Realtime - Bosphorus 1080p svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU sysbench: RAM / Memory svt-hevc: 10 - Bosphorus 1080p avifenc: 10 toktx: Zstd Compression 9 onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU stream: Add stream: Triad stream: Scale 1 2 3 1394.792 1368.459 918.416 888.501 856.95 335.3 593 436 532.9 2194.1 504.8 655.567 1347.639 205983 0.59 401.68 1.87 2565.9 0.566 74.299 7.042 6.371 59.04 12.364 0.07 3445.7 285.35 35.5 481.3 82.9 365.6 651.6 3.52 173.544 198.6593 5.77 4608.3 3.8 176.899 169.146 165.544 3.84 41.1415367 87.27 156.713745 151.69 139.345 129.726 122.957 11062.7 11040.2 11018.3 113.168 111.766 7.106 25.697 100.103 6.4186 7382.83 7353.4 7328.24 8.0582 0.21 29.17 39.91 10354.19 7.2156 7.5488 7.4 112.23 10.89 8.8669 2783.8 21.3 20.9 19.5 20 20.2 24.3 24.6 23.1 33.6 22.6 18 29.2 19.9 129.91 3779 6406 63.797 24.251 79500000 2810.6 26.7 59.031 78.5 197 58.362 13973741 55.883 255830000 98947000 13.38 48893 43325.4 119000 3161.7 673.2 2988.7 1739.9 2996.4 317.2 188270000 3215.6 336.2 104.3 35.089 39.88 369.44 150500 43 25.9384 9048 29.179 10.899 135.37 24.26 42.45 171.679919 23.438 23.051 21.756 21.532 19.4447 10.3818 52021000 284670000 34.84 12.7942 9.09941 19.5 7.11325 9.70189 55.68 332.31 89.33 10.27 9.84608 3.36519 8.545 75.25 6335 77.27 8.0446 92.81 95.25 16.9558 17.1722 16736.89 121.24 5.133 3.412 16.0755 10.7622 29342.3 29294 27218.3 1397.413 1370.702 918.952 889.312 858.13 334.6 595.3 449.8 519.7 2176.4 490.9 655.439 1335.09 206085 0.59 400.78 1.87 2580.7 0.565 73.524 7.058 6.203 57.332 12.127 0.07 3438.5 286.64 35.40 482.7 82.8 366.4 654.7 3.53 174.592 198.6162 5.76 4608 3.81 173.552 168.784 164.766 3.84 43.4190303 87.13 149.263896 151.321 139.800 129.572 123.708 11053.1 11061.1 11106.7 113.846 112.325 6.9095 23.408 98.819 6.3377 7367.46 7368.77 7369.73 7.8579 0.21 29.63 39.94 10352.60 7.1725 7.5463 7.38 111.79 10.87 8.8919 2766.1 20.8 18.9 21.3 19.6 20.7 24.4 24.8 23.5 33.9 22.5 17.4 28.6 19.9 128.81 3775 6419 63.954 24.442 80466667 2802.3 26.7 59.076 77.3 195.5 58.322 13605923 55.279 277256667 98514333 13.42 49132 43524.3 118900 3135.9 669.8 2982.2 1773.5 3031.7 317.8 190735000 3216.1 337.5 104.38 35.228 40.26 371.37 153300 41.92 25.9345 9045 28.904 10.923 135.33 24.15 42.30 171.397632 23.357 23.220 21.716 21.526 19.4434 10.3787 52412000 286536667 34.83 12.7342 9.03875 20.10 7.11114 9.78194 57.43 332.59 92.51 10.304 9.90494 3.57796 8.603 75.16 6333 76.87 8.0747 92.42 95.30 16.8938 17.1720 16779.82 121.87 5.227 3.412 15.6669 10.7842 29554.4 29324.2 27311.6 1398.367 1370.148 919.967 889.461 860.12 335.4 597.7 448.9 525.1 2226.4 478.0 655.462 1341.451 199607 0.60 400.10 1.87 2587.9 0.566 73.004 7.867 6.039 57.526 11.519 0.07 3443.9 285.71 35.42 479.1 82.5 357.4 658.6 3.53 180.163 198.5244 5.76 4519.2 3.80 174.277 169.186 165.459 3.84 42.5190188 87.03 150.013595 151.195 139.503 130.000 123.797 11060.2 11082.8 11089.2 113.366 112.435 7.0486 23.327 98.898 6.4339 7347.50 7333.18 7372.41 7.9602 0.21 29.40 39.94 10358.79 7.1957 7.6012 7.37 112.18 10.85 8.8853 2785.3 21.1 19.6 21.5 19.1 19.6 24.0 24.4 23.7 33.7 22.6 17.7 29.1 20.1 126.70 3781 6414 64.068 24.181 80833333 2788.7 26.4 59.189 78 196.8 58.713 13737688 55.582 269162667 97365167 13.35 49207 43699.5 118933 3159.3 668.3 2983.7 1755.8 3031.6 323.6 190431667 3211.5 337.8 104.17 35.023 39.89 369.97 153233 41.74 25.9505 9189 28.877 11.040 135.24 24.21 42.05 170.100454 23.491 23.374 21.787 21.496 19.3952 10.3930 52477667 286083333 34.89 12.7154 9.05168 19.76 7.09340 9.76810 57.37 332.73 92.74 10.234 9.96725 3.56832 8.663 74.80 6331 76.56 8.0995 92.46 95.25 17.0378 17.2335 16673.45 121.55 5.173 3.394 15.6227 10.7538 29759.5 29521.1 27580.0 OpenBenchmarking.org
Timed LLVM Compilation Build System: Unix Makefiles OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Unix Makefiles 1 2 3 300 600 900 1200 1500 SE +/- 5.10, N = 3 SE +/- 1.12, N = 3 1394.79 1397.41 1398.37
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Ninja 1 3 2 300 600 900 1200 1500 SE +/- 0.07, N = 3 SE +/- 1.68, N = 3 1368.46 1370.15 1370.70
Helsing Digit Range: 14 digit OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 14 digit 1 2 3 200 400 600 800 1000 SE +/- 0.54, N = 3 SE +/- 0.13, N = 3 918.42 918.95 919.97 1. (CC) gcc options: -O2 -pthread
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 15.11 Time To Compile 1 2 3 200 400 600 800 1000 SE +/- 0.14, N = 3 SE +/- 0.33, N = 3 888.50 889.31 889.46
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Classroom - Compute: CPU-Only 1 2 3 200 400 600 800 1000 SE +/- 0.62, N = 3 SE +/- 1.77, N = 3 856.95 858.13 860.12
GNU Radio Test: Hilbert Transform OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Hilbert Transform 3 1 2 70 140 210 280 350 SE +/- 1.35, N = 3 SE +/- 1.51, N = 9 335.4 335.3 334.6 1. 3.8.1.0
GNU Radio Test: FM Deemphasis Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FM Deemphasis Filter 3 2 1 130 260 390 520 650 SE +/- 8.03, N = 3 SE +/- 4.77, N = 9 597.7 595.3 593.0 1. 3.8.1.0
GNU Radio Test: IIR Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: IIR Filter 2 3 1 100 200 300 400 500 SE +/- 1.40, N = 9 SE +/- 2.30, N = 3 449.8 448.9 436.0 1. 3.8.1.0
GNU Radio Test: FIR Filter OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: FIR Filter 1 3 2 120 240 360 480 600 SE +/- 2.59, N = 3 SE +/- 5.14, N = 9 532.9 525.1 519.7 1. 3.8.1.0
GNU Radio Test: Signal Source (Cosine) OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Signal Source (Cosine) 3 1 2 500 1000 1500 2000 2500 SE +/- 13.74, N = 3 SE +/- 20.50, N = 9 2226.4 2194.1 2176.4 1. 3.8.1.0
GNU Radio Test: Five Back to Back FIR Filters OpenBenchmarking.org MiB/s, More Is Better GNU Radio Test: Five Back to Back FIR Filters 1 2 3 110 220 330 440 550 SE +/- 6.36, N = 9 SE +/- 4.01, N = 3 504.8 490.9 478.0 1. 3.8.1.0
KTX-Software toktx Settings: UASTC 4 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 4 + Zstd Compression 19 2 3 1 140 280 420 560 700 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 655.44 655.46 655.57
CP2K Molecular Dynamics Input: Fayalite-FIST OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 8.1 Input: Fayalite-FIST 2 3 1 300 600 900 1200 1500 1335.09 1341.45 1347.64
SecureMark Benchmark: SecureMark-TLS OpenBenchmarking.org marks, More Is Better SecureMark 1.0.4 Benchmark: SecureMark-TLS 2 1 3 40K 80K 120K 160K 200K SE +/- 181.65, N = 3 SE +/- 5046.21, N = 9 206085 205983 199607 1. (CC) gcc options: -pedantic -O3
JPEG XL Input: PNG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 8 3 2 1 0.135 0.27 0.405 0.54 0.675 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.60 0.59 0.59 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Fishy Cat - Compute: CPU-Only 3 2 1 90 180 270 360 450 SE +/- 0.22, N = 3 SE +/- 0.27, N = 3 400.10 400.78 401.68
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K 3 2 1 0.4208 0.8416 1.2624 1.6832 2.104 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.87 1.87 1.87 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Monero - Hash Count: 1M 3 2 1 600 1200 1800 2400 3000 SE +/- 4.53, N = 3 SE +/- 3.21, N = 3 2587.9 2580.7 2565.9 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
GROMACS Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021 Input: water_GMX50_bare 3 1 2 0.1274 0.2548 0.3822 0.5096 0.637 SE +/- 0.007, N = 4 SE +/- 0.006, N = 3 0.566 0.566 0.565 1. (CXX) g++ options: -O3 -pthread
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: inception-v3 3 2 1 16 32 48 64 80 SE +/- 0.90, N = 3 SE +/- 0.45, N = 9 73.00 73.52 74.30 MIN: 56.13 / MAX: 217.15 MIN: 57.58 / MAX: 234.51 MIN: 63.07 / MAX: 247.5 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: mobilenet-v1-1.0 1 2 3 2 4 6 8 10 SE +/- 0.054, N = 9 SE +/- 0.799, N = 3 7.042 7.058 7.867 MIN: 6.48 / MAX: 25.93 MIN: 4.84 / MAX: 31.92 MIN: 4.68 / MAX: 31.89 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: MobileNetV2_224 3 2 1 2 4 6 8 10 SE +/- 0.012, N = 3 SE +/- 0.130, N = 9 6.039 6.203 6.371 MIN: 5.58 / MAX: 23.61 MIN: 3.99 / MAX: 30.96 MIN: 3.99 / MAX: 30.98 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: resnet-v2-50 2 3 1 13 26 39 52 65 SE +/- 0.31, N = 9 SE +/- 0.82, N = 3 57.33 57.53 59.04 MIN: 42.93 / MAX: 183.34 MIN: 43.37 / MAX: 182.6 MIN: 47.53 / MAX: 248.6 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.3 Model: SqueezeNetV1.0 3 2 1 3 6 9 12 15 SE +/- 0.14, N = 3 SE +/- 0.35, N = 9 11.52 12.13 12.36 MIN: 8.33 / MAX: 53.25 MIN: 7.02 / MAX: 50.71 MIN: 7.66 / MAX: 47.8 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K 3 2 1 0.0158 0.0316 0.0474 0.0632 0.079 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.07 0.07 0.07 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Wownero - Hash Count: 1M 1 3 2 700 1400 2100 2800 3500 SE +/- 3.71, N = 3 SE +/- 13.36, N = 3 3445.7 3443.9 3438.5 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only 1 3 2 60 120 180 240 300 SE +/- 0.07, N = 3 SE +/- 0.16, N = 3 285.35 285.71 286.64
libgav1 Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p 10-bit 1 3 2 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 35.50 35.42 35.40 1. (CXX) g++ options: -O3 -lpthread -lrt
LuaRadio Test: Complex Phase OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Complex Phase 2 1 3 100 200 300 400 500 SE +/- 0.93, N = 3 SE +/- 2.75, N = 3 482.7 481.3 479.1
LuaRadio Test: Hilbert Transform OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Hilbert Transform 1 2 3 20 40 60 80 100 SE +/- 0.12, N = 3 SE +/- 0.38, N = 3 82.9 82.8 82.5
LuaRadio Test: FM Deemphasis Filter OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: FM Deemphasis Filter 2 1 3 80 160 240 320 400 SE +/- 0.27, N = 3 SE +/- 11.52, N = 3 366.4 365.6 357.4
LuaRadio Test: Five Back to Back FIR Filters OpenBenchmarking.org MiB/s, More Is Better LuaRadio 0.9.1 Test: Five Back to Back FIR Filters 3 2 1 140 280 420 560 700 SE +/- 0.20, N = 3 SE +/- 4.23, N = 3 658.6 654.7 651.6
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K 3 2 1 0.7943 1.5886 2.3829 3.1772 3.9715 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.53 3.53 3.52 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis 1 2 3 40 80 120 160 200 SE +/- 1.69, N = 3 SE +/- 1.97, N = 5 173.54 174.59 180.16 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive 3 2 1 40 80 120 160 200 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 198.52 198.62 198.66 1. (CXX) g++ options: -O3 -flto -pthread
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 7 1 3 2 1.2983 2.5966 3.8949 5.1932 6.4915 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.77 5.76 5.76 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
GNU GMP GMPbench Total Time OpenBenchmarking.org GMPbench Score, More Is Better GNU GMP GMPbench 6.2.1 Total Time 1 2 3 1000 2000 3000 4000 5000 4608.3 4608.0 4519.2 1. (CC) gcc options: -O3 -fomit-frame-pointer -lm
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p 2 3 1 0.8573 1.7146 2.5719 3.4292 4.2865 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.81 3.80 3.80 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.10.20 Time To Compile 2 3 1 40 80 120 160 200 SE +/- 1.19, N = 3 SE +/- 1.62, N = 3 173.55 174.28 176.90
Timed Erlang/OTP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Erlang/OTP Compilation 23.2 Time To Compile 2 1 3 40 80 120 160 200 SE +/- 0.21, N = 3 SE +/- 0.21, N = 3 168.78 169.15 169.19
Timed Wasmer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile 2 3 1 40 80 120 160 200 SE +/- 0.63, N = 3 SE +/- 1.08, N = 3 164.77 165.46 165.54 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p 3 2 1 0.864 1.728 2.592 3.456 4.32 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.84 3.84 3.84 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 1 3 2 10 20 30 40 50 SE +/- 0.35, N = 9 SE +/- 0.29, N = 15 41.14 42.52 43.42 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Chimera 1080p 10-bit 1 2 3 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 87.27 87.13 87.03 MIN: 57.49 / MAX: 206.69 MIN: 57.31 / MAX: 206.69 MIN: 57.18 / MAX: 209.79 1. (CC) gcc options: -pthread -lm
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction 2 3 1 30 60 90 120 150 SE +/- 0.63, N = 3 SE +/- 0.64, N = 3 149.26 150.01 156.71 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.2 Pfam Database Search 3 2 1 30 60 90 120 150 SE +/- 0.10, N = 3 SE +/- 0.11, N = 3 151.20 151.32 151.69 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm -lmpi
OpenSCAD Render: Pistol OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Pistol 1 3 2 30 60 90 120 150 SE +/- 0.20, N = 3 SE +/- 0.29, N = 3 139.35 139.50 139.80 1. OpenSCAD version 2019.05
OpenSCAD Render: Projector Mount Swivel OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Projector Mount Swivel 2 1 3 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.54, N = 3 129.57 129.73 130.00 1. OpenSCAD version 2019.05
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 0 1 2 3 30 60 90 120 150 SE +/- 0.51, N = 3 SE +/- 0.22, N = 3 122.96 123.71 123.80 1. (CXX) g++ options: -O3 -fPIC -lm
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 2 3 1 2K 4K 6K 8K 10K SE +/- 16.13, N = 3 SE +/- 10.66, N = 3 11053.1 11060.2 11062.7 MIN: 10763.5 MIN: 10798.1 MIN: 10779.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 2K 4K 6K 8K 10K SE +/- 13.40, N = 3 SE +/- 10.08, N = 3 11040.2 11061.1 11082.8 MIN: 10813.1 MIN: 10800.2 MIN: 10778.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 3 2 2K 4K 6K 8K 10K SE +/- 20.98, N = 3 SE +/- 27.42, N = 3 11018.3 11089.2 11106.7 MIN: 10752.6 MIN: 10779.6 MIN: 10805 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 1 3 2 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.60, N = 3 113.17 113.37 113.85 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless 1 2 3 30 60 90 120 150 SE +/- 0.28, N = 3 SE +/- 0.51, N = 3 111.77 112.33 112.44 1. (CXX) g++ options: -O3 -fPIC -lm
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon Obj 1 3 2 2 4 6 8 10 SE +/- 0.0278, N = 3 SE +/- 0.0326, N = 3 7.1060 7.0486 6.9095 MIN: 7.08 / MAX: 7.17 MIN: 6.93 / MAX: 7.17 MIN: 6.79 / MAX: 7.12
VOSK Speech Recognition Toolkit OpenBenchmarking.org Seconds, Fewer Is Better VOSK Speech Recognition Toolkit 0.3.21 3 2 1 6 12 18 24 30 SE +/- 0.20, N = 15 SE +/- 0.21, N = 15 23.33 23.41 25.70
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile 2 3 1 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.09, N = 3 98.82 98.90 100.10
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Crown 3 1 2 2 4 6 8 10 SE +/- 0.0158, N = 3 SE +/- 0.0011, N = 3 6.4339 6.4186 6.3377 MIN: 6.37 / MAX: 6.54 MIN: 6.38 / MAX: 6.5 MIN: 6.28 / MAX: 6.47
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 3 2 1 1600 3200 4800 6400 8000 SE +/- 6.08, N = 3 SE +/- 8.78, N = 3 7347.50 7367.46 7382.83 MIN: 7067.91 MIN: 7086.46 MIN: 7128.16 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 3 1 2 1600 3200 4800 6400 8000 SE +/- 10.21, N = 3 SE +/- 11.77, N = 3 7333.18 7353.40 7368.77 MIN: 7032.31 MIN: 7100.35 MIN: 7106.81 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 1600 3200 4800 6400 8000 SE +/- 9.15, N = 3 SE +/- 4.84, N = 3 7328.24 7369.73 7372.41 MIN: 7044.31 MIN: 7085.09 MIN: 7094.3 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon Obj 1 3 2 2 4 6 8 10 SE +/- 0.0144, N = 3 SE +/- 0.0092, N = 3 8.0582 7.9602 7.8579 MIN: 8.02 / MAX: 8.14 MIN: 7.9 / MAX: 8.09 MIN: 7.79 / MAX: 8
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p 3 2 1 0.0473 0.0946 0.1419 0.1892 0.2365 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.21 0.21 0.21 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: 1 2 3 1 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 29.63 29.40 29.17
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 4K 3 2 1 9 18 27 36 45 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 39.94 39.94 39.91 1. (CXX) g++ options: -O3 -lpthread -lrt
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU 3 1 2 2K 4K 6K 8K 10K SE +/- 2.02, N = 3 SE +/- 2.89, N = 3 10358.79 10354.19 10352.60 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Crown 1 3 2 2 4 6 8 10 SE +/- 0.0160, N = 3 SE +/- 0.0186, N = 3 7.2156 7.1957 7.1725 MIN: 7.17 / MAX: 7.32 MIN: 7.12 / MAX: 7.32 MIN: 7.09 / MAX: 7.31
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon 3 1 2 2 4 6 8 10 SE +/- 0.0353, N = 3 SE +/- 0.0542, N = 3 7.6012 7.5488 7.5463 MIN: 7.51 / MAX: 7.76 MIN: 7.51 / MAX: 7.62 MIN: 7.46 / MAX: 7.75
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 7.40 7.38 7.37 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
libgav1 Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p 1 3 2 30 60 90 120 150 SE +/- 0.35, N = 3 SE +/- 0.31, N = 3 112.23 112.18 111.79 1. (CXX) g++ options: -O3 -lpthread -lrt
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 10.89 10.87 10.85 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon 2 3 1 2 4 6 8 10 SE +/- 0.0200, N = 3 SE +/- 0.0342, N = 3 8.8919 8.8853 8.8669 MIN: 8.83 / MAX: 9.03 MIN: 8.78 / MAX: 9.04 MIN: 8.83 / MAX: 8.95
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Decompression Speed 3 1 2 600 1200 1800 2400 3000 SE +/- 0.75, N = 3 SE +/- 17.80, N = 3 2785.3 2783.8 2766.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19, Long Mode - Compression Speed 1 3 2 5 10 15 20 25 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 21.3 21.1 20.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT 1 3 2 5 10 15 20 25 SE +/- 0.38, N = 12 SE +/- 0.49, N = 3 20.9 19.6 18.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN 3 2 1 5 10 15 20 25 SE +/- 0.42, N = 11 SE +/- 1.18, N = 3 21.5 21.3 19.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT 1 2 3 5 10 15 20 25 SE +/- 0.46, N = 3 SE +/- 0.39, N = 11 20.0 19.6 19.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN 2 1 3 5 10 15 20 25 SE +/- 0.70, N = 3 SE +/- 0.40, N = 10 20.7 20.2 19.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T 2 1 3 6 12 18 24 30 SE +/- 0.26, N = 3 SE +/- 0.15, N = 10 24.4 24.3 24.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N 2 1 3 6 12 18 24 30 SE +/- 0.29, N = 3 SE +/- 0.10, N = 12 24.8 24.6 24.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT 3 2 1 6 12 18 24 30 SE +/- 0.18, N = 12 SE +/- 0.50, N = 3 23.7 23.5 23.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY 2 3 1 8 16 24 32 40 SE +/- 0.35, N = 3 SE +/- 0.12, N = 12 33.9 33.7 33.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY 3 1 2 5 10 15 20 25 SE +/- 0.07, N = 12 SE +/- 0.15, N = 3 22.6 22.6 22.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT 1 3 2 4 8 12 16 20 SE +/- 0.17, N = 12 SE +/- 0.55, N = 3 18.0 17.7 17.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY 1 3 2 7 14 21 28 35 SE +/- 0.11, N = 12 SE +/- 0.20, N = 3 29.2 29.1 28.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY 3 2 1 5 10 15 20 25 SE +/- 0.14, N = 12 SE +/- 0.10, N = 3 20.1 19.9 19.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.3 CPU Threads: All 1 2 3 30 60 90 120 150 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 129.91 128.81 126.70
PJSIP Method: INVITE OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: INVITE 3 1 2 800 1600 2400 3200 4000 SE +/- 3.28, N = 3 SE +/- 1.20, N = 3 3781 3779 3775 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
PJSIP Method: OPTIONS, Stateful OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateful 2 3 1 1400 2800 4200 5600 7000 SE +/- 4.04, N = 3 SE +/- 0.88, N = 3 6419 6414 6406 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 2 1 2 3 14 28 42 56 70 SE +/- 0.31, N = 3 SE +/- 0.04, N = 3 63.80 63.95 64.07 1. (CXX) g++ options: -O3 -fPIC -lm
OpenSCAD Render: Retro Car OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Retro Car 3 1 2 6 12 18 24 30 SE +/- 0.01, N = 3 SE +/- 0.20, N = 14 24.18 24.25 24.44 1. OpenSCAD version 2019.05
srsLTE Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsLTE 20.10.1 Test: OFDM_Test 3 2 1 20M 40M 60M 80M 100M SE +/- 384418.75, N = 3 SE +/- 633333.33, N = 3 80833333 80466667 79500000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Decompression Speed 1 2 3 600 1200 1800 2400 3000 SE +/- 3.02, N = 3 SE +/- 10.31, N = 3 2810.6 2802.3 2788.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 19 - Compression Speed 2 1 3 6 12 18 24 30 SE +/- 0.29, N = 3 SE +/- 0.27, N = 3 26.7 26.7 26.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 1 2 3 13 26 39 52 65 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 59.03 59.08 59.19 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
srsLTE Test: PHY_DL_Test OpenBenchmarking.org UE Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test 1 3 2 20 40 60 80 100 SE +/- 0.24, N = 3 78.5 78.0 77.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsLTE Test: PHY_DL_Test OpenBenchmarking.org eNb Mb/s, More Is Better srsLTE 20.10.1 Test: PHY_DL_Test 1 3 2 40 80 120 160 200 SE +/- 0.27, N = 3 SE +/- 1.19, N = 3 197.0 196.8 195.5 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
OpenSCAD Render: Mini-ITX Case OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Mini-ITX Case 2 1 3 13 26 39 52 65 SE +/- 0.09, N = 3 SE +/- 0.50, N = 3 58.32 58.36 58.71 1. OpenSCAD version 2019.05
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time 1 3 2 3M 6M 9M 12M 15M SE +/- 62832.49, N = 3 SE +/- 72340.52, N = 3 13973741 13737688 13605923 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.11 Input: simple-H2O 2 3 1 13 26 39 52 65 SE +/- 0.21, N = 3 SE +/- 0.69, N = 3 55.28 55.58 55.88 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 2 3 1 60M 120M 180M 240M 300M SE +/- 2271668.50, N = 3 SE +/- 2636553.03, N = 15 277256667 269162667 255830000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 2 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 2 - Buffer Length: 256 - Filter Length: 57 1 2 3 20M 40M 60M 80M 100M SE +/- 471745.81, N = 3 SE +/- 1085656.09, N = 12 98947000 98514333 97365167 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p 2 1 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 13.42 13.38 13.35 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
PJSIP Method: OPTIONS, Stateless OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateless 3 2 1 11K 22K 33K 44K 55K SE +/- 35.35, N = 3 SE +/- 96.85, N = 3 49207 49132 48893 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
Stream Type: Copy OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Copy 3 2 1 9K 18K 27K 36K 45K SE +/- 128.05, N = 5 SE +/- 41.44, N = 5 43699.5 43524.3 43325.4 1. (CC) gcc options: -O3 -march=native -fopenmp
Chia Blockchain VDF Test: Square Plain C++ OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.1 Test: Square Plain C++ 1 3 2 30K 60K 90K 120K 150K SE +/- 33.33, N = 3 SE +/- 57.74, N = 3 119000 118933 118900 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Decompression Speed 1 3 2 700 1400 2100 2800 3500 SE +/- 1.14, N = 3 SE +/- 19.54, N = 3 3161.7 3159.3 3135.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3, Long Mode - Compression Speed 1 2 3 150 300 450 600 750 SE +/- 3.37, N = 3 SE +/- 5.03, N = 3 673.2 669.8 668.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3 - Decompression Speed 1 3 2 600 1200 1800 2400 3000 SE +/- 2.22, N = 3 SE +/- 0.78, N = 3 2988.7 2983.7 2982.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 3 - Compression Speed 2 3 1 400 800 1200 1600 2000 SE +/- 9.22, N = 3 SE +/- 9.91, N = 3 1773.5 1755.8 1739.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Decompression Speed 2 3 1 700 1400 2100 2800 3500 SE +/- 1.79, N = 3 SE +/- 1.06, N = 3 3031.7 3031.6 2996.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8 - Compression Speed 3 2 1 70 140 210 280 350 SE +/- 0.90, N = 3 SE +/- 1.37, N = 3 323.6 317.8 317.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
Liquid-DSP Threads: 4 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 2 3 1 40M 80M 120M 160M 200M SE +/- 1897283.58, N = 6 SE +/- 1876216.79, N = 6 190735000 190431667 188270000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Decompression Speed 2 1 3 700 1400 2100 2800 3500 SE +/- 2.82, N = 3 SE +/- 4.71, N = 3 3216.1 3215.6 3211.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.4.9 Compression Level: 8, Long Mode - Compression Speed 3 2 1 70 140 210 280 350 SE +/- 0.55, N = 3 SE +/- 1.06, N = 3 337.8 337.5 336.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 4K 2 1 3 20 40 60 80 100 SE +/- 0.16, N = 3 SE +/- 0.15, N = 3 104.38 104.30 104.17 MIN: 98.48 / MAX: 117.64 MIN: 98.53 / MAX: 116.86 MIN: 98.31 / MAX: 117.28 1. (CC) gcc options: -pthread -lm
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S 3 1 2 8 16 24 32 40 SE +/- 0.09, N = 3 SE +/- 0.19, N = 3 35.02 35.09 35.23 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: PNG - Encode Speed: 5 2 3 1 9 18 27 36 45 SE +/- 0.20, N = 3 SE +/- 0.07, N = 3 40.26 39.89 39.88 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Chimera 1080p 2 3 1 80 160 240 320 400 SE +/- 0.08, N = 3 SE +/- 0.78, N = 3 371.37 369.97 369.44 MIN: 273.86 / MAX: 598.69 MIN: 272.55 / MAX: 578.08 MIN: 273.31 / MAX: 554.87 1. (CC) gcc options: -pthread -lm
Chia Blockchain VDF Test: Square Assembly Optimized OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.1 Test: Square Assembly Optimized 2 3 1 30K 60K 90K 120K 150K SE +/- 1001.67, N = 3 SE +/- 1328.32, N = 3 153300 153233 150500 1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 5 1 2 3 10 20 30 40 50 SE +/- 0.25, N = 3 SE +/- 0.14, N = 3 43.00 41.92 41.74 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Thorough 2 1 3 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 25.93 25.94 25.95 1. (CXX) g++ options: -O3 -flto -pthread
Google Draco Model: Church Facade OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.4.1 Model: Church Facade 2 1 3 2K 4K 6K 8K 10K SE +/- 7.51, N = 3 SE +/- 112.19, N = 15 9045 9048 9189 1. (CXX) g++ options: -O3
KTX-Software toktx Settings: UASTC 3 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 + Zstd Compression 19 3 2 1 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 28.88 28.90 29.18
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.11, N = 14 10.90 10.92 11.04 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
libgav1 Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 1080p 1 2 3 30 60 90 120 150 SE +/- 0.05, N = 3 SE +/- 0.54, N = 3 135.37 135.33 135.24 1. (CXX) g++ options: -O3 -lpthread -lrt
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K 1 3 2 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 24.26 24.21 24.15 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 7 1 2 3 10 20 30 40 50 SE +/- 0.44, N = 3 SE +/- 0.16, N = 3 42.45 42.30 42.05 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
libjpeg-turbo tjbench Test: Decompression Throughput OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput 1 2 3 40 80 120 160 200 SE +/- 0.61, N = 3 SE +/- 0.71, N = 3 171.68 171.40 170.10 1. (CC) gcc options: -O3 -rdynamic
OpenSCAD Render: Leonardo Phone Case Slim OpenBenchmarking.org Seconds, Fewer Is Better OpenSCAD Render: Leonardo Phone Case Slim 2 1 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 23.36 23.44 23.49 1. OpenSCAD version 2019.05
KTX-Software toktx Settings: UASTC 3 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 1 2 3 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.28, N = 3 23.05 23.22 23.37
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 2 1 3 5 10 15 20 25 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 21.72 21.76 21.79 1. (CXX) g++ options: -O3 -fPIC -lm
KTX-Software toktx Settings: Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: Zstd Compression 19 3 2 1 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 21.50 21.53 21.53
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 3 2 1 5 10 15 20 25 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 19.40 19.44 19.44 MIN: 8.12 MIN: 8.16 MIN: 9.35 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 2 1 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 10.38 10.38 10.39 MIN: 5.14 MIN: 5.16 MIN: 5.15 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 3 2 1 11M 22M 33M 44M 55M SE +/- 13678.86, N = 3 SE +/- 44508.43, N = 3 52477667 52412000 52021000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 12 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 12 - Buffer Length: 256 - Filter Length: 57 2 3 1 60M 120M 180M 240M 300M SE +/- 112150.69, N = 3 SE +/- 163333.33, N = 3 286536667 286083333 284670000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K 3 1 2 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 34.89 34.84 34.83 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 3 2 1 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 12.72 12.73 12.79 MIN: 7.47 MIN: 6.1 MIN: 8.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 2 3 1 3 6 9 12 15 SE +/- 0.02692, N = 3 SE +/- 0.03152, N = 3 9.03875 9.05168 9.09941 MIN: 3.85 MIN: 3.92 MIN: 3.91 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
JPEG XL Input: JPEG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.3 Input: JPEG - Encode Speed: 8 2 3 1 5 10 15 20 25 SE +/- 0.10, N = 3 SE +/- 0.20, N = 3 20.10 19.76 19.50 1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 3 2 1 2 4 6 8 10 SE +/- 0.02595, N = 3 SE +/- 0.01221, N = 3 7.09340 7.11114 7.11325 MIN: 3.38 MIN: 3.38 MIN: 3.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 3 2 3 6 9 12 15 SE +/- 0.03293, N = 3 SE +/- 0.03538, N = 3 9.70189 9.76810 9.78194 MIN: 4.61 MIN: 4.62 MIN: 4.62 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p 2 3 1 13 26 39 52 65 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 57.43 57.37 55.68 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.2 Video Input: Summer Nature 1080p 3 2 1 70 140 210 280 350 SE +/- 1.02, N = 3 SE +/- 0.62, N = 3 332.73 332.59 332.31 MIN: 281.58 / MAX: 363.48 MIN: 290.31 / MAX: 364.3 MIN: 294.32 / MAX: 361.09 1. (CC) gcc options: -pthread -lm
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p 3 2 1 20 40 60 80 100 SE +/- 0.97, N = 5 SE +/- 0.97, N = 5 92.74 92.51 89.33 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Helsing Digit Range: 12 digit OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 12 digit 3 1 2 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 10.23 10.27 10.30 1. (CC) gcc options: -O2 -pthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.04476, N = 3 SE +/- 0.04181, N = 3 9.84608 9.90494 9.96725 MIN: 8.59 MIN: 8.69 MIN: 8.68 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 3 2 0.805 1.61 2.415 3.22 4.025 SE +/- 0.01805, N = 3 SE +/- 0.02094, N = 3 3.36519 3.56832 3.57796 MIN: 3.1 MIN: 3.05 MIN: 3.12 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless 1 2 3 2 4 6 8 10 SE +/- 0.030, N = 3 SE +/- 0.010, N = 3 8.545 8.603 8.663 1. (CXX) g++ options: -O3 -fPIC -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p 1 2 3 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.23, N = 3 75.25 75.16 74.80 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Google Draco Model: Lion OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.4.1 Model: Lion 3 2 1 1400 2800 4200 5600 7000 SE +/- 8.84, N = 3 SE +/- 15.84, N = 3 6331 6333 6335 1. (CXX) g++ options: -O3
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p 1 2 3 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.30, N = 3 77.27 76.87 76.56 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Medium 1 2 3 2 4 6 8 10 SE +/- 0.0129, N = 3 SE +/- 0.0322, N = 3 8.0446 8.0747 8.0995 1. (CXX) g++ options: -O3 -flto -pthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p 1 3 2 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 92.81 92.46 92.42 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p 2 3 1 20 40 60 80 100 SE +/- 0.15, N = 3 SE +/- 0.22, N = 3 95.30 95.25 95.25 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 2 1 3 4 8 12 16 20 SE +/- 0.13, N = 3 SE +/- 0.07, N = 3 16.89 16.96 17.04 MIN: 13.96 MIN: 14.02 MIN: 13.85 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 2 1 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 17.17 17.17 17.23 MIN: 13.2 MIN: 12.97 MIN: 12.69 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Sysbench Test: RAM / Memory OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory 2 1 3 4K 8K 12K 16K 20K SE +/- 47.60, N = 3 SE +/- 92.41, N = 3 16779.82 16736.89 16673.45 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p 2 3 1 30 60 90 120 150 SE +/- 0.24, N = 3 SE +/- 0.29, N = 3 121.87 121.55 121.24 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 1 3 2 1.1761 2.3522 3.5283 4.7044 5.8805 SE +/- 0.006, N = 3 SE +/- 0.051, N = 3 5.133 5.173 5.227 1. (CXX) g++ options: -O3 -fPIC -lm
KTX-Software toktx Settings: Zstd Compression 9 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: Zstd Compression 9 3 1 2 0.7677 1.5354 2.3031 3.0708 3.8385 SE +/- 0.005, N = 3 SE +/- 0.006, N = 3 3.394 3.412 3.412
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 3 2 1 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 15.62 15.67 16.08 MIN: 11.89 MIN: 11.93 MIN: 11.92 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 3 1 2 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 10.75 10.76 10.78 MIN: 8.21 MIN: 8.2 MIN: 8.22 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Stream Type: Add OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add 3 2 1 6K 12K 18K 24K 30K SE +/- 12.57, N = 5 SE +/- 15.91, N = 5 29759.5 29554.4 29342.3 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Triad OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad 3 2 1 6K 12K 18K 24K 30K SE +/- 12.68, N = 5 SE +/- 4.27, N = 5 29521.1 29324.2 29294.0 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Scale OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale 3 2 1 6K 12K 18K 24K 30K SE +/- 6.54, N = 5 SE +/- 41.98, N = 5 27580.0 27311.6 27218.3 1. (CC) gcc options: -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.4