7980XE Feb 2921 Intel Core i9-7980XE testing with a ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102162-HA-7980XEFEB73&grr .
7980XE Feb 2921 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 4 5 Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads) ASUS PRIME X299-A (2002 BIOS) Intel Sky Lake-E DMI3 Registers 16GB Samsung SSD 970 EVO 500GB Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz) Realtek ALC1220 G237HL Intel I219-V Ubuntu 20.10 5.8.0-36-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 4.6 Mesa 20.2.6 (LLVM 11.0.0) 1.2.131 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08 Graphics Details - GLAMOR Python Details - Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
7980XE Feb 2921 lammps: 20k Atoms webp2: Quality 100, Lossless Compression onnx: bertsquad-10 - OpenMP CPU jpegxl: PNG - 8 onnx: super-resolution-10 - OpenMP CPU webp2: Quality 95, Compression Effort 7 ior: 32MB - Default Test Directory ior: 64MB - Default Test Directory paraview: Many Spheres - 1920 x 1080 paraview: Many Spheres - 1920 x 1080 gcrypt: webp2: Quality 75, Compression Effort 7 ior: 8MB - Default Test Directory ngspice: C2670 ngspice: C7552 quantlib: askap: tConvolve MT - Degridding askap: tConvolve MT - Gridding mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 jpegxl: PNG - 7 onnx: fcn-resnet101-11 - OpenMP CPU onnx: yolov4 - OpenMP CPU ior: 16MB - Default Test Directory onnx: shufflenet-v2-10 - OpenMP CPU gromacs: water_GMX50_bare build-godot: Time To Compile cloverleaf: Lagrangian-Eulerian Hydrodynamics jpegxl-decode: 1 v-ray: CPU npb: EP.D gnupg: 2.7GB Sample File Encryption financebench: Bonds OpenMP ior: 4MB - Default Test Directory rav1e: 5 rav1e: 1 pennant: sedovbig jpegxl-decode: All financebench: Repo OpenMP rav1e: 6 npb: LU.C lzbench: XZ 0 - Decompression lzbench: XZ 0 - Compression qmcpack: simple-H2O pennant: leblancbig rav1e: 10 ior: 2MB - Default Test Directory etcpak: ETC1 + Dithering synthmark: VoiceMark_100 lzbench: Crush 0 - Decompression lzbench: Crush 0 - Compression jpegxl: JPEG - 5 etcpak: ETC2 jpegxl: PNG - 5 tnn: CPU - MobileNet v2 lzbench: Zstd 8 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 1 - Compression npb: EP.C tnn: CPU - SqueezeNet v1.1 lzbench: Brotli 2 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 0 - Compression jpegxl: JPEG - 7 askap: Hogbom Clean OpenMP paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Contour - 1920 x 1080 redis: SADD redis: SET redis: LPUSH etcpak: ETC1 redis: LPOP paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Volume - 1920 x 1080 lzbench: Libdeflate 1 - Compression redis: GET lammps: Rhodopsin Protein askap: tConvolve OpenMP - Degridding askap: tConvolve OpenMP - Gridding jpegxl: JPEG - 8 webp2: Quality 100, Compression Effort 5 lulesh: etcpak: DXT1 webp2: Default 1 2 3 4 5 11.176 593.066 630 0.71 7531 282.047 493.82 492.85 481.692 4.80 212.796 155.740 525.44 155.596 138.661 2222.4 2618.53 1815.85 44.157 2.676 4.201 37.184 7.108 7.86 143 553 481.36 9436 1.699 98.889 85.54 31.81 17828 2140.50 67.390 55767.164062 887.94 0.975 0.358 53.14447 175.22 39726.164063 1.269 47323.88 115 42 38.340 34.29865 2.677 592.90 303.570 553.640 483 111 48.64 183.995 52.50 361.235 1792 85 1767 495 2081.02 316.358 711 189 613 469 48.50 376.885 864.682 82.97 2153944.42 1912809.71 1712504.00 315.616 2698309.25 1156.973 72.31 237 2539377.42 9.483 3973.97 2458.07 23.19 9.195 6695.0413 1365.386 3.369 11.210 590.293 649 0.70 7249 280.636 474.62 480.71 481.834 4.81 212.057 156.550 530.83 151.978 137.582 2222.6 2622.75 1820.90 48.330 2.618 4.634 37.345 7.232 7.93 145 555 484.06 9471 1.698 98.932 84.88 32.39 17872 2241.41 67.175 55622.968750 466.30 0.976 0.359 52.66769 178.23 39461.821614 1.272 47734.40 115 42 39.048 33.68917 2.675 396.56 303.514 555.295 484 109 49.08 182.977 53.03 360.923 1801 86 1767 495 2001.11 319.049 711 189 612 470 49.22 377.362 864.937 83.00 2168410.90 1952727.25 1642807.13 313.703 1769937.88 1156.216 72.26 238 2415143.75 9.084 3973.97 2420.91 23.25 9.110 6707.8596 1387.302 3.363 11.195 596.408 658 0.7 7080 282.575 470.37 489.25 481.420 4.80 211.787 157.528 354.01 153.061 139.573 2207.1 2633.11 1813.56 45.970 2.686 4.348 37.283 7.355 7.93 144 557 442.47 9541 1.698 98.872 84.21 32.26 17842 2202.89 67.124 55666.716146 467.76 0.975 0.359 52.70973 177.88 39681.566406 1.270 47778.76 114 42 38.489 33.55896 2.694 429.36 303.105 553.553 484 110 48.89 183.424 53.44 361.454 1795 85 1772 496 2033.28 317.136 709 189 612 472 49.41 377.835 864.804 82.98 2159575.21 1889099.97 1695317.29 314.662 1797918.46 1155.674 72.23 238 2431811.58 9.408 3973.97 2428.72 23.43 9.075 6678.4880 1385.842 3.325 11.205 593.530 659 0.70 7457 279.573 484.50 490.66 468.456 4.67 211.895 154.880 527.45 154.214 139.032 2215.6 2633.32 1821.82 48.375 2.662 4.513 37.607 7.281 7.92 144 555 500.39 9533 1.700 98.964 84.27 32.17 17742 2112.54 67.151 55656.115885 493.98 0.976 0.360 52.76675 178.61 39350.223958 1.269 47482.70 115 42 39.795 33.40175 2.679 419.08 290.726 548.556 483 109 48.85 182.722 53.11 361.425 1803 86 1771 496 1997.31 318.208 708 188 612 469 48.85 378.788 865.027 83.01 2158064.17 1906086.42 1699000.00 314.271 1761464.25 1157.110 72.32 237 2442355.92 9.099 3973.97 2420.64 23.37 9.309 6697.3184 1376.344 3.347 11.216 590.211 652 0.7 6939 283.331 516.94 493.41 481.347 4.80 212.963 156.753 510.05 153.099 139.017 2216.0 2609.75 1821.36 48.640 2.645 4.546 37.338 7.272 7.91 145 555 506.02 9519 1.703 98.873 85.20 32.30 17849 2214.08 67.102 55876.180990 478.01 0.977 0.359 52.84683 178.47 39275.548177 1.274 47360.36 114 42 38.621 33.93533 2.676 437.60 303.890 552.792 483 110 49.13 183.118 53.69 361.488 1800 85 1770 494 1999.52 319.314 709 189 611 470 49.18 378.802 864.654 82.97 2179924.5 1923099.42 1682254.45 313.693 1770711.21 1156.869 72.31 237 2443276.75 8.901 3973.97 2442.85 23.37 9.216 6708.2322 1377.894 3.350 OpenBenchmarking.org
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms 1 2 3 4 5 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 11.18 11.21 11.20 11.21 11.22 1. (CXX) g++ options: -O3 -pthread -lm
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 1 2 3 4 5 130 260 390 520 650 SE +/- 1.66, N = 3 SE +/- 0.45, N = 3 SE +/- 0.58, N = 3 SE +/- 1.92, N = 3 SE +/- 0.58, N = 3 593.07 590.29 596.41 593.53 590.21 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 4 5 140 280 420 560 700 SE +/- 5.46, N = 3 SE +/- 11.30, N = 12 SE +/- 13.29, N = 12 SE +/- 13.34, N = 12 SE +/- 11.09, N = 12 630 649 658 659 652 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
JPEG XL Input: PNG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 8 1 2 3 4 5 0.1598 0.3196 0.4794 0.6392 0.799 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.71 0.70 0.70 0.70 0.70 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 4 5 1600 3200 4800 6400 8000 SE +/- 57.46, N = 3 SE +/- 133.82, N = 12 SE +/- 173.63, N = 9 SE +/- 109.17, N = 3 SE +/- 212.04, N = 12 7531 7249 7080 7457 6939 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 1 2 3 4 5 60 120 180 240 300 SE +/- 1.07, N = 3 SE +/- 0.54, N = 3 SE +/- 2.02, N = 3 SE +/- 1.22, N = 3 SE +/- 2.02, N = 3 282.05 280.64 282.58 279.57 283.33 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
IOR Block Size: 32MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 32MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 3.89, N = 3 SE +/- 4.72, N = 8 SE +/- 5.73, N = 3 SE +/- 4.09, N = 12 SE +/- 4.67, N = 3 493.82 474.62 470.37 484.50 516.94 MIN: 245.23 / MAX: 1245.29 MIN: 202.58 / MAX: 1355.46 MIN: 197.76 / MAX: 1370.54 MIN: 196.66 / MAX: 1538.21 MIN: 413.96 / MAX: 1183.64 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 64MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 64MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 3.92, N = 3 SE +/- 2.46, N = 3 SE +/- 4.79, N = 3 SE +/- 4.68, N = 3 SE +/- 6.10, N = 3 492.85 480.71 489.25 490.66 493.41 MIN: 402.07 / MAX: 1345.65 MIN: 301.17 / MAX: 1079.47 MIN: 374.86 / MAX: 1033.08 MIN: 348.43 / MAX: 1036.5 MIN: 246.91 / MAX: 1088.06 1. (CC) gcc options: -O2 -lm -pthread -lmpi
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 1 2 3 4 5 100 200 300 400 500 SE +/- 0.37, N = 3 SE +/- 0.39, N = 3 SE +/- 0.27, N = 3 SE +/- 12.92, N = 11 SE +/- 0.34, N = 3 481.69 481.83 481.42 468.46 481.35
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 1 2 3 4 5 1.0823 2.1646 3.2469 4.3292 5.4115 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.13, N = 11 SE +/- 0.00, N = 3 4.80 4.81 4.80 4.67 4.80
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 4 5 50 100 150 200 250 SE +/- 0.63, N = 3 SE +/- 0.34, N = 3 SE +/- 0.06, N = 3 SE +/- 0.49, N = 3 SE +/- 0.29, N = 3 212.80 212.06 211.79 211.90 212.96 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 1 2 3 4 5 30 60 90 120 150 SE +/- 1.77, N = 3 SE +/- 0.15, N = 3 SE +/- 0.29, N = 3 SE +/- 0.66, N = 3 SE +/- 0.73, N = 3 155.74 156.55 157.53 154.88 156.75 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
IOR Block Size: 8MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 8MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 25.39, N = 15 SE +/- 5.42, N = 15 SE +/- 4.96, N = 15 SE +/- 4.21, N = 14 SE +/- 6.12, N = 5 525.44 530.83 354.01 527.45 510.05 MIN: 290.23 / MAX: 1447.59 MIN: 248.03 / MAX: 1386.96 MIN: 189.98 / MAX: 1385.03 MIN: 251.24 / MAX: 1378.27 MIN: 222.93 / MAX: 1266.18 1. (CC) gcc options: -O2 -lm -pthread -lmpi
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 1 2 3 4 5 30 60 90 120 150 SE +/- 2.33, N = 3 SE +/- 1.75, N = 3 SE +/- 1.44, N = 3 SE +/- 2.23, N = 3 SE +/- 1.61, N = 3 155.60 151.98 153.06 154.21 153.10 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 1 2 3 4 5 30 60 90 120 150 SE +/- 0.78, N = 3 SE +/- 0.10, N = 3 SE +/- 1.23, N = 3 SE +/- 1.21, N = 3 SE +/- 2.04, N = 3 138.66 137.58 139.57 139.03 139.02 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 17.98, N = 13 SE +/- 19.83, N = 12 SE +/- 27.77, N = 12 SE +/- 21.07, N = 12 SE +/- 28.27, N = 12 2222.4 2222.6 2207.1 2215.6 2216.0 1. (CXX) g++ options: -O3 -march=native -rdynamic
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 4 5 600 1200 1800 2400 3000 SE +/- 10.95, N = 3 SE +/- 4.46, N = 3 SE +/- 7.85, N = 3 SE +/- 4.51, N = 3 SE +/- 11.98, N = 3 2618.53 2622.75 2633.11 2633.32 2609.75 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 2.23, N = 3 SE +/- 0.35, N = 3 SE +/- 2.62, N = 3 SE +/- 0.61, N = 3 SE +/- 0.31, N = 3 1815.85 1820.90 1813.56 1821.82 1821.36 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 1 2 3 4 5 11 22 33 44 55 SE +/- 1.80, N = 5 SE +/- 0.10, N = 3 SE +/- 2.44, N = 3 SE +/- 0.11, N = 3 SE +/- 0.04, N = 3 44.16 48.33 45.97 48.38 48.64 MIN: 40.29 / MAX: 51.28 MIN: 47.9 / MAX: 50.26 MIN: 40.58 / MAX: 48.74 MIN: 47.91 / MAX: 48.97 MIN: 48.25 / MAX: 49.28 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 1 2 3 4 5 0.6044 1.2088 1.8132 2.4176 3.022 SE +/- 0.029, N = 5 SE +/- 0.019, N = 3 SE +/- 0.045, N = 3 SE +/- 0.010, N = 3 SE +/- 0.011, N = 3 2.676 2.618 2.686 2.662 2.645 MIN: 2.44 / MAX: 3.19 MIN: 2.5 / MAX: 2.92 MIN: 2.42 / MAX: 3.11 MIN: 2.48 / MAX: 2.98 MIN: 2.44 / MAX: 3.04 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 3 4 5 1.0427 2.0854 3.1281 4.1708 5.2135 SE +/- 0.129, N = 5 SE +/- 0.086, N = 3 SE +/- 0.205, N = 3 SE +/- 0.041, N = 3 SE +/- 0.008, N = 3 4.201 4.634 4.348 4.513 4.546 MIN: 3.68 / MAX: 4.8 MIN: 4.2 / MAX: 4.99 MIN: 3.62 / MAX: 4.88 MIN: 4.22 / MAX: 4.96 MIN: 4.28 / MAX: 4.91 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 2 3 4 5 9 18 27 36 45 SE +/- 0.32, N = 5 SE +/- 0.12, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 37.18 37.35 37.28 37.61 37.34 MIN: 35.67 / MAX: 38.06 MIN: 36.97 / MAX: 37.99 MIN: 36.85 / MAX: 38.11 MIN: 37.16 / MAX: 38.09 MIN: 36.95 / MAX: 37.85 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 2 3 4 5 2 4 6 8 10 SE +/- 0.091, N = 5 SE +/- 0.085, N = 3 SE +/- 0.110, N = 3 SE +/- 0.070, N = 3 SE +/- 0.007, N = 3 7.108 7.232 7.355 7.281 7.272 MIN: 6.52 / MAX: 7.68 MIN: 6.81 / MAX: 7.85 MIN: 6.8 / MAX: 7.87 MIN: 6.97 / MAX: 7.74 MIN: 7 / MAX: 7.67 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 7 1 2 3 4 5 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 7.86 7.93 7.93 7.92 7.91 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 4 5 30 60 90 120 150 SE +/- 0.44, N = 3 SE +/- 0.33, N = 3 SE +/- 0.87, N = 3 SE +/- 0.44, N = 3 SE +/- 0.17, N = 3 143 145 144 144 145 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 3 4 5 120 240 360 480 600 SE +/- 1.09, N = 3 SE +/- 0.29, N = 3 SE +/- 2.25, N = 3 SE +/- 1.64, N = 3 SE +/- 0.88, N = 3 553 555 557 555 555 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
IOR Block Size: 16MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 16MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 6.27, N = 3 SE +/- 5.80, N = 15 SE +/- 6.54, N = 3 SE +/- 1.65, N = 3 SE +/- 7.12, N = 3 481.36 484.06 442.47 500.39 506.02 MIN: 316.02 / MAX: 1504.02 MIN: 224.25 / MAX: 1505.62 MIN: 217.24 / MAX: 1379.75 MIN: 315.3 / MAX: 1210.23 MIN: 308.33 / MAX: 1247.46 1. (CC) gcc options: -O2 -lm -pthread -lmpi
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 4 5 2K 4K 6K 8K 10K SE +/- 35.99, N = 3 SE +/- 30.21, N = 3 SE +/- 36.96, N = 3 SE +/- 46.43, N = 3 SE +/- 54.82, N = 3 9436 9471 9541 9533 9519 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
GROMACS Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021 Input: water_GMX50_bare 1 2 3 4 5 0.3832 0.7664 1.1496 1.5328 1.916 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 1.699 1.698 1.698 1.700 1.703 1. (CXX) g++ options: -O3 -pthread
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 1 2 3 4 5 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.29, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 98.89 98.93 98.87 98.96 98.87
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 2 3 4 5 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 SE +/- 0.12, N = 3 SE +/- 0.08, N = 3 85.54 84.88 84.21 84.27 85.20 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: 1 1 2 3 4 5 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 31.81 32.39 32.26 32.17 32.30
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU 1 2 3 4 5 4K 8K 12K 16K 20K SE +/- 47.06, N = 3 SE +/- 70.27, N = 3 SE +/- 6.44, N = 3 SE +/- 54.08, N = 3 SE +/- 84.18, N = 3 17828 17872 17842 17742 17849
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 10.76, N = 3 SE +/- 3.15, N = 3 SE +/- 34.24, N = 3 SE +/- 35.18, N = 3 SE +/- 28.36, N = 4 2140.50 2241.41 2202.89 2112.54 2214.08 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 1 2 3 4 5 15 30 45 60 75 SE +/- 0.36, N = 3 SE +/- 0.12, N = 3 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 SE +/- 0.15, N = 3 67.39 67.18 67.12 67.15 67.10 1. (CC) gcc options: -O2
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 1 2 3 4 5 12K 24K 36K 48K 60K SE +/- 86.69, N = 3 SE +/- 37.92, N = 3 SE +/- 83.09, N = 3 SE +/- 86.41, N = 3 SE +/- 53.09, N = 3 55767.16 55622.97 55666.72 55656.12 55876.18 1. (CXX) g++ options: -O3 -march=native -fopenmp
IOR Block Size: 4MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 4MB - Disk Target: Default Test Directory 1 2 3 4 5 200 400 600 800 1000 SE +/- 1.87, N = 3 SE +/- 5.42, N = 15 SE +/- 45.69, N = 12 SE +/- 6.63, N = 15 SE +/- 5.81, N = 6 887.94 466.30 467.76 493.98 478.01 MIN: 629.82 / MAX: 1334.23 MIN: 212.95 / MAX: 1329.58 MIN: 188.74 / MAX: 1435.44 MIN: 240.44 / MAX: 1344.87 MIN: 236.85 / MAX: 1266.85 1. (CC) gcc options: -O2 -lm -pthread -lmpi
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 3 4 5 0.2198 0.4396 0.6594 0.8792 1.099 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.975 0.976 0.975 0.976 0.977
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 1 2 3 4 5 0.081 0.162 0.243 0.324 0.405 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.358 0.359 0.359 0.360 0.359
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 1 2 3 4 5 12 24 36 48 60 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 SE +/- 0.36, N = 3 SE +/- 0.21, N = 3 SE +/- 0.26, N = 3 53.14 52.67 52.71 52.77 52.85 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: All 1 2 3 4 5 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.25, N = 3 SE +/- 0.10, N = 3 SE +/- 0.45, N = 3 SE +/- 0.19, N = 3 175.22 178.23 177.88 178.61 178.47
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 1 2 3 4 5 9K 18K 27K 36K 45K SE +/- 210.55, N = 3 SE +/- 76.80, N = 3 SE +/- 151.55, N = 3 SE +/- 69.89, N = 3 SE +/- 20.70, N = 3 39726.16 39461.82 39681.57 39350.22 39275.55 1. (CXX) g++ options: -O3 -march=native -fopenmp
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 2 3 4 5 0.2867 0.5734 0.8601 1.1468 1.4335 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 1.269 1.272 1.270 1.269 1.274
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 4 5 10K 20K 30K 40K 50K SE +/- 154.44, N = 3 SE +/- 7.96, N = 3 SE +/- 139.20, N = 3 SE +/- 376.58, N = 3 SE +/- 44.69, N = 3 47323.88 47734.40 47778.76 47482.70 47360.36 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 4 5 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 115 115 114 115 114 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1 2 3 4 5 10 20 30 40 50 SE +/- 0.33, N = 3 42 42 42 42 42 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 4 5 9 18 27 36 45 SE +/- 0.09, N = 3 SE +/- 0.45, N = 3 SE +/- 0.23, N = 3 SE +/- 0.68, N = 3 SE +/- 0.24, N = 3 38.34 39.05 38.49 39.80 38.62 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 1 2 3 4 5 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.13, N = 3 SE +/- 0.17, N = 3 SE +/- 0.29, N = 3 34.30 33.69 33.56 33.40 33.94 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 1 2 3 4 5 0.6062 1.2124 1.8186 2.4248 3.031 SE +/- 0.006, N = 3 SE +/- 0.012, N = 3 SE +/- 0.005, N = 3 SE +/- 0.005, N = 3 SE +/- 0.007, N = 3 2.677 2.675 2.694 2.679 2.676
IOR Block Size: 2MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 2MB - Disk Target: Default Test Directory 1 2 3 4 5 130 260 390 520 650 SE +/- 5.79, N = 3 SE +/- 4.65, N = 3 SE +/- 7.93, N = 15 SE +/- 10.65, N = 12 SE +/- 7.25, N = 15 592.90 396.56 429.36 419.08 437.60 MIN: 431.01 / MAX: 1028.65 MIN: 214.64 / MAX: 1028.95 MIN: 220.22 / MAX: 1149.54 MIN: 152.75 / MAX: 1057.83 MIN: 162.6 / MAX: 1036.24 1. (CC) gcc options: -O2 -lm -pthread -lmpi
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 4 5 70 140 210 280 350 SE +/- 0.86, N = 3 SE +/- 0.32, N = 3 SE +/- 0.60, N = 3 SE +/- 7.07, N = 15 SE +/- 0.09, N = 3 303.57 303.51 303.11 290.73 303.89 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 4 5 120 240 360 480 600 SE +/- 2.84, N = 3 SE +/- 1.36, N = 3 SE +/- 3.76, N = 3 SE +/- 2.23, N = 3 SE +/- 1.67, N = 3 553.64 555.30 553.55 548.56 552.79 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 4 5 100 200 300 400 500 SE +/- 0.67, N = 3 SE +/- 1.53, N = 3 SE +/- 0.88, N = 3 SE +/- 1.15, N = 3 SE +/- 1.86, N = 3 483 484 484 483 483 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 4 5 20 40 60 80 100 SE +/- 0.58, N = 3 SE +/- 1.00, N = 3 SE +/- 0.67, N = 3 111 109 110 109 110 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 5 1 2 3 4 5 11 22 33 44 55 SE +/- 0.07, N = 3 SE +/- 0.26, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 48.64 49.08 48.89 48.85 49.13 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 4 5 40 80 120 160 200 SE +/- 0.08, N = 3 SE +/- 0.28, N = 3 SE +/- 0.16, N = 3 SE +/- 0.12, N = 3 SE +/- 0.23, N = 3 184.00 182.98 183.42 182.72 183.12 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 5 1 2 3 4 5 12 24 36 48 60 SE +/- 0.14, N = 3 SE +/- 0.15, N = 3 SE +/- 0.33, N = 3 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 52.50 53.03 53.44 53.11 53.69 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 4 5 80 160 240 320 400 SE +/- 0.37, N = 3 SE +/- 0.27, N = 3 SE +/- 0.39, N = 3 SE +/- 0.38, N = 3 SE +/- 0.31, N = 3 361.24 360.92 361.45 361.43 361.49 MIN: 357.28 / MAX: 391.58 MIN: 356.72 / MAX: 372.66 MIN: 356.32 / MAX: 386.75 MIN: 356.24 / MAX: 380.38 MIN: 357.11 / MAX: 384.74 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 3.33, N = 3 SE +/- 1.86, N = 3 SE +/- 3.38, N = 3 SE +/- 3.71, N = 3 1792 1801 1795 1803 1800 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 4 5 20 40 60 80 100 85 86 85 86 85 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 6.00, N = 3 SE +/- 3.06, N = 3 SE +/- 1.76, N = 3 SE +/- 1.73, N = 3 1767 1767 1772 1771 1770 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 4 5 110 220 330 440 550 SE +/- 2.19, N = 3 SE +/- 2.00, N = 3 SE +/- 2.67, N = 3 SE +/- 3.67, N = 3 SE +/- 3.67, N = 3 495 495 496 496 494 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 23.92, N = 15 SE +/- 51.51, N = 15 SE +/- 51.69, N = 12 SE +/- 32.10, N = 15 SE +/- 46.79, N = 15 2081.02 2001.11 2033.28 1997.31 1999.52 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 4 5 70 140 210 280 350 SE +/- 0.46, N = 3 SE +/- 0.57, N = 3 SE +/- 0.62, N = 3 SE +/- 0.59, N = 3 SE +/- 0.43, N = 3 316.36 319.05 317.14 318.21 319.31 MIN: 313.3 / MAX: 320.26 MIN: 313 / MAX: 340.64 MIN: 314.18 / MAX: 323.54 MIN: 316.15 / MAX: 320.81 MIN: 316.94 / MAX: 338.61 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 4 5 150 300 450 600 750 SE +/- 0.88, N = 3 711 711 709 708 709 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 4 5 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 189 189 189 188 189 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 4 5 130 260 390 520 650 SE +/- 1.20, N = 3 SE +/- 2.73, N = 3 SE +/- 1.00, N = 3 SE +/- 0.67, N = 3 613 612 612 612 611 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 4 5 100 200 300 400 500 SE +/- 0.88, N = 3 SE +/- 1.67, N = 3 SE +/- 1.20, N = 3 SE +/- 1.33, N = 3 SE +/- 0.88, N = 3 469 470 472 469 470 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 7 1 2 3 4 5 11 22 33 44 55 SE +/- 0.11, N = 3 SE +/- 0.28, N = 3 SE +/- 0.07, N = 3 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 48.50 49.22 49.41 48.85 49.18 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 1 2 3 4 5 80 160 240 320 400 SE +/- 0.47, N = 3 SE +/- 0.82, N = 3 SE +/- 0.48, N = 3 SE +/- 0.00, N = 3 SE +/- 1.66, N = 3 376.89 377.36 377.84 378.79 378.80 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 5 200 400 600 800 1000 SE +/- 0.51, N = 3 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 SE +/- 0.39, N = 3 SE +/- 0.27, N = 3 864.68 864.94 864.80 865.03 864.65
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 5 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 82.97 83.00 82.98 83.01 82.97
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 4 5 500K 1000K 1500K 2000K 2500K SE +/- 18441.13, N = 3 SE +/- 27569.90, N = 5 SE +/- 34602.53, N = 3 SE +/- 20189.91, N = 3 SE +/- 11360.66, N = 3 2153944.42 2168410.90 2159575.21 2158064.17 2179924.50 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 4 5 400K 800K 1200K 1600K 2000K SE +/- 15512.42, N = 3 SE +/- 23318.24, N = 3 SE +/- 24702.46, N = 4 SE +/- 30239.19, N = 3 SE +/- 21792.45, N = 3 1912809.71 1952727.25 1889099.97 1906086.42 1923099.42 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 4 5 400K 800K 1200K 1600K 2000K SE +/- 10755.06, N = 3 SE +/- 13174.46, N = 3 SE +/- 9034.55, N = 3 SE +/- 10458.55, N = 3 SE +/- 10827.27, N = 3 1712504.00 1642807.13 1695317.29 1699000.00 1682254.45 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 4 5 70 140 210 280 350 SE +/- 0.30, N = 3 SE +/- 0.85, N = 3 SE +/- 1.01, N = 3 SE +/- 0.49, N = 3 SE +/- 0.49, N = 3 315.62 313.70 314.66 314.27 313.69 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 4 5 600K 1200K 1800K 2400K 3000K SE +/- 29192.48, N = 3 SE +/- 2566.99, N = 3 SE +/- 19169.69, N = 3 SE +/- 15108.16, N = 3 SE +/- 18070.29, N = 3 2698309.25 1769937.88 1797918.46 1761464.25 1770711.21 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 5 200 400 600 800 1000 SE +/- 0.52, N = 3 SE +/- 0.36, N = 3 SE +/- 0.26, N = 3 SE +/- 0.60, N = 3 SE +/- 0.98, N = 3 1156.97 1156.22 1155.67 1157.11 1156.87
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 5 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 72.31 72.26 72.23 72.32 72.31
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 4 5 50 100 150 200 250 SE +/- 0.58, N = 3 237 238 238 237 237 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 4 5 500K 1000K 1500K 2000K 2500K SE +/- 22779.82, N = 3 SE +/- 31640.05, N = 3 SE +/- 17482.31, N = 3 SE +/- 18752.77, N = 3 SE +/- 23178.69, N = 3 2539377.42 2415143.75 2431811.58 2442355.92 2443276.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 4 5 3 6 9 12 15 SE +/- 0.370, N = 12 SE +/- 0.433, N = 15 SE +/- 0.361, N = 12 SE +/- 0.334, N = 15 SE +/- 0.306, N = 15 9.483 9.084 9.408 9.099 8.901 1. (CXX) g++ options: -O3 -pthread -lm
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 1 2 3 4 5 900 1800 2700 3600 4500 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3973.97 3973.97 3973.97 3973.97 3973.97 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 19.92, N = 3 SE +/- 22.21, N = 3 SE +/- 32.33, N = 3 SE +/- 12.71, N = 3 SE +/- 12.94, N = 3 2458.07 2420.91 2428.72 2420.64 2442.85 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
JPEG XL Input: JPEG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 8 1 2 3 4 5 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 23.19 23.25 23.43 23.37 23.37 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 1 2 3 4 5 3 6 9 12 15 SE +/- 0.105, N = 3 SE +/- 0.020, N = 3 SE +/- 0.024, N = 3 SE +/- 0.141, N = 3 SE +/- 0.149, N = 3 9.195 9.110 9.075 9.309 9.216 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 1 2 3 4 5 1400 2800 4200 5600 7000 SE +/- 21.98, N = 3 SE +/- 16.05, N = 3 SE +/- 43.78, N = 3 SE +/- 39.16, N = 3 SE +/- 30.71, N = 3 6695.04 6707.86 6678.49 6697.32 6708.23 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 4 5 300 600 900 1200 1500 SE +/- 13.09, N = 3 SE +/- 3.63, N = 3 SE +/- 4.38, N = 3 SE +/- 3.03, N = 3 SE +/- 2.52, N = 3 1365.39 1387.30 1385.84 1376.34 1377.89 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 1 2 3 4 5 0.758 1.516 2.274 3.032 3.79 SE +/- 0.034, N = 3 SE +/- 0.012, N = 3 SE +/- 0.027, N = 3 SE +/- 0.049, N = 3 SE +/- 0.057, N = 3 3.369 3.363 3.325 3.347 3.350 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Phoronix Test Suite v10.8.5