7980XE Feb 2921
Intel Core i9-7980XE testing with an ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102162-HA-7980XEFEB73&grw .
7980XE Feb 2921: System Details
Runs: 1, 2, 3, 4, 5
Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)
Motherboard: ASUS PRIME X299-A (2002 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 16GB
Disk: Samsung SSD 970 EVO 500GB
Graphics: Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz)
Audio: Realtek ALC1220
Monitor: G237HL
Network: Intel I219-V
OS: Ubuntu 20.10
Kernel: 5.8.0-36-generic (x86_64)
Desktop: GNOME Shell 3.38.1
Display Server: X Server 1.20.9
OpenGL: 4.6 Mesa 20.2.6 (LLVM 11.0.0)
Vulkan: 1.2.131
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080
Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Details: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08
Graphics Details: GLAMOR
Python Details: Python 3.8.6
Security Details: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
7980XE Feb 2921: Results Overview
Test | 1 | 2 | 3 | 4 | 5
lzbench: XZ 0 - Compression | 42 | 42 | 42 | 42 | 42
lzbench: XZ 0 - Decompression | 115 | 115 | 114 | 115 | 114
lzbench: Zstd 1 - Compression | 495 | 495 | 496 | 496 | 494
lzbench: Zstd 1 - Decompression | 1767 | 1767 | 1772 | 1771 | 1770
lzbench: Crush 0 - Compression | 111 | 109 | 110 | 109 | 110
lzbench: Crush 0 - Decompression | 483 | 484 | 484 | 483 | 483
lzbench: Brotli 0 - Compression | 469 | 470 | 472 | 469 | 470
ior: 64MB - Default Test Directory | 492.85 | 480.71 | 489.25 | 490.66 | 493.41
lzbench: Zstd 8 - Compression | 85 | 86 | 85 | 86 | 85
ior: 32MB - Default Test Directory | 493.82 | 474.62 | 470.37 | 484.50 | 516.94
ior: 16MB - Default Test Directory | 481.36 | 484.06 | 442.47 | 500.39 | 506.02
ior: 8MB - Default Test Directory | 525.44 | 530.83 | 354.01 | 527.45 | 510.05
lzbench: Zstd 8 - Decompression | 1792 | 1801 | 1795 | 1803 | 1800
lzbench: Brotli 0 - Decompression | 613 | 612 | 612 | 612 | 611
lzbench: Brotli 2 - Compression | 189 | 189 | 189 | 188 | 189
lzbench: Brotli 2 - Decompression | 711 | 711 | 709 | 708 | 709
lzbench: Libdeflate 1 - Compression | 237 | 238 | 238 | 237 | 237
ngspice: C2670 | 155.596 | 151.978 | 153.061 | 154.214 | 153.099
ngspice: C7552 | 138.661 | 137.582 | 139.573 | 139.032 | 139.017
etcpak: DXT1 | 1365.386 | 1387.302 | 1385.842 | 1376.344 | 1377.894
etcpak: ETC1 | 315.616 | 313.703 | 314.662 | 314.271 | 313.693
etcpak: ETC2 | 183.995 | 182.977 | 183.424 | 182.722 | 183.118
etcpak: ETC1 + Dithering | 303.570 | 303.514 | 303.105 | 290.726 | 303.890
jpegxl: PNG - 5 | 52.50 | 53.03 | 53.44 | 53.11 | 53.69
jpegxl: PNG - 7 | 7.86 | 7.93 | 7.93 | 7.92 | 7.91
jpegxl: PNG - 8 | 0.71 | 0.70 | 0.7 | 0.70 | 0.7
jpegxl: JPEG - 5 | 48.64 | 49.08 | 48.89 | 48.85 | 49.13
jpegxl: JPEG - 7 | 48.50 | 49.22 | 49.41 | 48.85 | 49.18
jpegxl: JPEG - 8 | 23.19 | 23.25 | 23.43 | 23.37 | 23.37
jpegxl-decode: 1 | 31.81 | 32.39 | 32.26 | 32.17 | 32.30
jpegxl-decode: All | 175.22 | 178.23 | 177.88 | 178.61 | 178.47
webp2: Default | 3.369 | 3.363 | 3.325 | 3.347 | 3.350
webp2: Quality 75, Compression Effort 7 | 155.740 | 156.550 | 157.528 | 154.880 | 156.753
webp2: Quality 95, Compression Effort 7 | 282.047 | 280.636 | 282.575 | 279.573 | 283.331
webp2: Quality 100, Compression Effort 5 | 9.195 | 9.110 | 9.075 | 9.309 | 9.216
webp2: Quality 100, Lossless Compression | 593.066 | 590.293 | 596.408 | 593.530 | 590.211
synthmark: VoiceMark_100 | 553.640 | 555.295 | 553.553 | 548.556 | 552.792
ior: 4MB - Default Test Directory | 887.94 | 466.30 | 467.76 | 493.98 | 478.01
gcrypt: | 212.796 | 212.057 | 211.787 | 211.895 | 212.963
quantlib: | 2222.4 | 2222.6 | 2207.1 | 2215.6 | 2216.0
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 85.54 | 84.88 | 84.21 | 84.27 | 85.20
mnn: SqueezeNetV1.0 | 7.108 | 7.232 | 7.355 | 7.281 | 7.272
mnn: resnet-v2-50 | 37.184 | 37.345 | 37.283 | 37.607 | 37.338
mnn: MobileNetV2_224 | 4.201 | 4.634 | 4.348 | 4.513 | 4.546
mnn: mobilenet-v1-1.0 | 2.676 | 2.618 | 2.686 | 2.662 | 2.645
mnn: inception-v3 | 44.157 | 48.330 | 45.970 | 48.375 | 48.640
onnx: yolov4 - OpenMP CPU | 553 | 555 | 557 | 555 | 555
onnx: bertsquad-10 - OpenMP CPU | 630 | 649 | 658 | 659 | 652
onnx: fcn-resnet101-11 - OpenMP CPU | 143 | 145 | 144 | 144 | 145
onnx: shufflenet-v2-10 - OpenMP CPU | 9436 | 9471 | 9541 | 9533 | 9519
onnx: super-resolution-10 - OpenMP CPU | 7531 | 7249 | 7080 | 7457 | 6939
tnn: CPU - MobileNet v2 | 361.235 | 360.923 | 361.454 | 361.425 | 361.488
tnn: CPU - SqueezeNet v1.1 | 316.358 | 319.049 | 317.136 | 318.208 | 319.314
gromacs: water_GMX50_bare | 1.699 | 1.698 | 1.698 | 1.700 | 1.703
lammps: 20k Atoms | 11.176 | 11.210 | 11.195 | 11.205 | 11.216
lammps: Rhodopsin Protein | 9.483 | 9.084 | 9.408 | 9.099 | 8.901
npb: EP.C | 2081.02 | 2001.11 | 2033.28 | 1997.31 | 1999.52
npb: EP.D | 2140.50 | 2241.41 | 2202.89 | 2112.54 | 2214.08
npb: LU.C | 47323.88 | 47734.40 | 47778.76 | 47482.70 | 47360.36
askap: tConvolve MT - Gridding | 1815.85 | 1820.90 | 1813.56 | 1821.82 | 1821.36
askap: tConvolve MT - Degridding | 2618.53 | 2622.75 | 2633.11 | 2633.32 | 2609.75
askap: tConvolve OpenMP - Gridding | 2458.07 | 2420.91 | 2428.72 | 2420.64 | 2442.85
askap: tConvolve OpenMP - Degridding | 3973.97 | 3973.97 | 3973.97 | 3973.97 | 3973.97
askap: Hogbom Clean OpenMP | 376.885 | 377.362 | 377.835 | 378.788 | 378.802
pennant: sedovbig | 53.14447 | 52.66769 | 52.70973 | 52.76675 | 52.84683
pennant: leblancbig | 34.29865 | 33.68917 | 33.55896 | 33.40175 | 33.93533
lulesh: | 6695.0413 | 6707.8596 | 6678.4880 | 6697.3184 | 6708.2322
qmcpack: simple-H2O | 38.340 | 39.048 | 38.489 | 39.795 | 38.621
v-ray: CPU | 17828 | 17872 | 17842 | 17742 | 17849
rav1e: 1 | 0.358 | 0.359 | 0.359 | 0.360 | 0.359
rav1e: 5 | 0.975 | 0.976 | 0.975 | 0.976 | 0.977
rav1e: 6 | 1.269 | 1.272 | 1.270 | 1.269 | 1.274
rav1e: 10 | 2.677 | 2.675 | 2.694 | 2.679 | 2.676
build-godot: Time To Compile | 98.889 | 98.932 | 98.872 | 98.964 | 98.873
financebench: Repo OpenMP | 39726.164063 | 39461.821614 | 39681.566406 | 39350.223958 | 39275.548177
redis: LPOP | 2698309.25 | 1769937.88 | 1797918.46 | 1761464.25 | 1770711.21
redis: SADD | 2153944.42 | 2168410.90 | 2159575.21 | 2158064.17 | 2179924.5
redis: LPUSH | 1712504.00 | 1642807.13 | 1695317.29 | 1699000.00 | 1682254.45
redis: GET | 2539377.42 | 2415143.75 | 2431811.58 | 2442355.92 | 2443276.75
redis: SET | 1912809.71 | 1952727.25 | 1889099.97 | 1906086.42 | 1923099.42
gnupg: 2.7GB Sample File Encryption | 67.390 | 67.175 | 67.124 | 67.151 | 67.102
paraview: Many Spheres - 1920 x 1080 | 4.80 | 4.81 | 4.80 | 4.67 | 4.80
paraview: Many Spheres - 1920 x 1080 | 481.692 | 481.834 | 481.420 | 468.456 | 481.347
paraview: Wavelet Volume - 1920 x 1080 | 72.31 | 72.26 | 72.23 | 72.32 | 72.31
paraview: Wavelet Volume - 1920 x 1080 | 1156.973 | 1156.216 | 1155.674 | 1157.110 | 1156.869
paraview: Wavelet Contour - 1920 x 1080 | 82.97 | 83.00 | 82.98 | 83.01 | 82.97
paraview: Wavelet Contour - 1920 x 1080 | 864.682 | 864.937 | 864.804 | 865.027 | 864.654
ior: 2MB - Default Test Directory | 592.90 | 396.56 | 429.36 | 419.08 | 437.60
financebench: Bonds OpenMP | 55767.164062 | 55622.968750 | 55666.716146 | 55656.115885 | 55876.180990
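One way to read the overview is to check how far the five runs drift apart for each test. The sketch below is only an illustration and is not part of the Phoronix Test Suite export: the helper name `relative_spread`, the `runs` dictionary and the choice of rows are assumptions made here, while the numbers are copied from the overview above.

```python
from statistics import mean


def relative_spread(values):
    """Return (max - min) / mean for one test across its runs."""
    avg = mean(values)
    if avg == 0:
        raise ValueError("mean is zero; relative spread is undefined")
    return (max(values) - min(values)) / avg


# A few rows copied from the overview; the selection is arbitrary.
runs = {
    "lzbench: XZ 0 - Compression": [42, 42, 42, 42, 42],
    "ior: 8MB - Default Test Directory": [525.44, 530.83, 354.01, 527.45, 510.05],
    "ior: 4MB - Default Test Directory": [887.94, 466.30, 467.76, 493.98, 478.01],
    "gromacs: water_GMX50_bare": [1.699, 1.698, 1.698, 1.700, 1.703],
}

# Print the tests from the largest to the smallest run-to-run spread.
ranked = sorted(runs.items(), key=lambda item: relative_spread(item[1]), reverse=True)
for name, values in ranked:
    print(f"{name}: {relative_spread(values):.1%}")
```

With these rows the IOR disk tests sort to the top and the CPU-bound tests to the bottom, which matches what the raw numbers suggest; a spread metric like this says nothing about the cause of the variation.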
lzbench 1.8, Test: XZ 0 - Process: Compression (MB/s, More Is Better). Runs 1 to 5: 42, 42, 42, 42, 42. Reported SE: +/- 0.33 (N = 3). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: XZ 0 - Process: Decompression (MB/s, More Is Better). Runs 1 to 5: 115, 115, 114, 115, 114. Reported SE: +/- 0.33, 0.33 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 1 - Process: Compression (MB/s, More Is Better). Runs 1 to 5: 495, 495, 496, 496, 494. Reported SE: +/- 2.19, 2.00, 2.67, 3.67, 3.67 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 1 - Process: Decompression (MB/s, More Is Better). Runs 1 to 5: 1767, 1767, 1772, 1771, 1770. Reported SE: +/- 6.00, 3.06, 1.76, 1.73 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Crush 0 - Process: Compression (MB/s, More Is Better). Runs 1 to 5: 111, 109, 110, 109, 110. Reported SE: +/- 0.58, 1.00, 0.67 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Crush 0 - Process: Decompression (MB/s, More Is Better). Runs 1 to 5: 483, 484, 484, 483, 483. Reported SE: +/- 0.67, 1.53, 0.88, 1.15, 1.86 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 0 - Process: Compression (MB/s, More Is Better). Runs 1 to 5: 469, 470, 472, 469, 470. Reported SE: +/- 0.88, 1.67, 1.20, 1.33, 0.88 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
IOR 3.3.0, Block Size: 64MB - Disk Target: Default Test Directory (MB/s, More Is Better). Runs 1 to 5: 492.85, 480.71, 489.25, 490.66, 493.41. Reported SE: +/- 3.92, 2.46, 4.79, 4.68, 6.10 (N = 3 each). Reported MIN / MAX: 402.07 / 1345.65, 301.17 / 1079.47, 374.86 / 1033.08, 348.43 / 1036.5, 246.91 / 1088.06. Notes: 1. (CC) gcc options: -O2 -lm -pthread -lmpi
lzbench 1.8, Test: Zstd 8 - Process: Compression (MB/s, More Is Better). Runs 1 to 5: 85, 86, 85, 86, 85. Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
IOR 3.3.0, Block Size: 32MB - Disk Target: Default Test Directory (MB/s, More Is Better). Runs 1 to 5: 493.82, 474.62, 470.37, 484.50, 516.94. Reported SE: +/- 3.89 (N = 3), 4.72 (N = 8), 5.73 (N = 3), 4.09 (N = 12), 4.67 (N = 3). Reported MIN / MAX: 245.23 / 1245.29, 202.58 / 1355.46, 197.76 / 1370.54, 196.66 / 1538.21, 413.96 / 1183.64. Notes: 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR 3.3.0, Block Size: 16MB - Disk Target: Default Test Directory (MB/s, More Is Better). Runs 1 to 5: 481.36, 484.06, 442.47, 500.39, 506.02. Reported SE: +/- 6.27 (N = 3), 5.80 (N = 15), 6.54 (N = 3), 1.65 (N = 3), 7.12 (N = 3). Reported MIN / MAX: 316.02 / 1504.02, 224.25 / 1505.62, 217.24 / 1379.75, 315.3 / 1210.23, 308.33 / 1247.46. Notes: 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR 3.3.0, Block Size: 8MB - Disk Target: Default Test Directory (MB/s, More Is Better). Runs 1 to 5: 525.44, 530.83, 354.01, 527.45, 510.05. Reported SE: +/- 25.39 (N = 15), 5.42 (N = 15), 4.96 (N = 15), 4.21 (N = 14), 6.12 (N = 5). Reported MIN / MAX: 290.23 / 1447.59, 248.03 / 1386.96, 189.98 / 1385.03, 251.24 / 1378.27, 222.93 / 1266.18. Notes: 1. (CC) gcc options: -O2 -lm -pthread -lmpi
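The "SE +/- x, N = n" figures attached to each result read as a standard error over N samples. The export does not include the individual samples, so the sketch below only shows the conventional calculation (sample standard deviation divided by the square root of N) on made-up numbers. The function name `standard_error` and the sample values are hypothetical, and the assumption that the Phoronix Test Suite uses exactly this formula is not confirmed by this document.

```python
import math
import statistics


def standard_error(samples):
    """Conventional standard error: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-sample throughputs in MB/s; NOT taken from the export.
hypothetical_samples = [520.0, 530.0, 540.0]

se = standard_error(hypothetical_samples)
avg = statistics.mean(hypothetical_samples)
print(f"{avg:.2f} MB/s, SE +/- {se:.2f}, N = {len(hypothetical_samples)}")
```

Under this definition, more samples at the same scatter give a smaller SE, so entries with different N are not directly comparable as measures of scatter.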
lzbench 1.8, Test: Zstd 8 - Process: Decompression (MB/s, More Is Better). Runs 1 to 5: 1792, 1801, 1795, 1803, 1800. Reported SE: +/- 3.33, 1.86, 3.38, 3.71 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 0 - Process: Decompression (MB/s, More Is Better). Runs 1 to 5: 613, 612, 612, 612, 611. Reported SE: +/- 1.20, 2.73, 1.00, 0.67 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 2 - Process: Compression (MB/s, More Is Better). Runs 1 to 5: 189, 189, 189, 188, 189. Reported SE: +/- 0.33, 0.67 (N = 3 each). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 2 - Process: Decompression (MB/s, More Is Better). Runs 1 to 5: 711, 711, 709, 708, 709. Reported SE: +/- 0.88 (N = 3). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better). Runs 1 to 5: 237, 238, 238, 237, 237. Reported SE: +/- 0.58 (N = 3). Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Ngspice 34, Circuit: C2670 (Seconds, Fewer Is Better). Runs 1 to 5: 155.60, 151.98, 153.06, 154.21, 153.10. Reported SE: +/- 2.33, 1.75, 1.44, 2.23, 1.61 (N = 3 each). Notes: 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Ngspice 34, Circuit: C7552 (Seconds, Fewer Is Better). Runs 1 to 5: 138.66, 137.58, 139.57, 139.03, 139.02. Reported SE: +/- 0.78, 0.10, 1.23, 1.21, 2.04 (N = 3 each). Notes: 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Etcpak 0.7, Configuration: DXT1 (Mpx/s, More Is Better). Runs 1 to 5: 1365.39, 1387.30, 1385.84, 1376.34, 1377.89. Reported SE: +/- 13.09, 3.63, 4.38, 3.03, 2.52 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC1 (Mpx/s, More Is Better). Runs 1 to 5: 315.62, 313.70, 314.66, 314.27, 313.69. Reported SE: +/- 0.30, 0.85, 1.01, 0.49, 0.49 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC2 (Mpx/s, More Is Better). Runs 1 to 5: 184.00, 182.98, 183.42, 182.72, 183.12. Reported SE: +/- 0.08, 0.28, 0.16, 0.12, 0.23 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC1 + Dithering (Mpx/s, More Is Better). Runs 1 to 5: 303.57, 303.51, 303.11, 290.73, 303.89. Reported SE: +/- 0.86 (N = 3), 0.32 (N = 3), 0.60 (N = 3), 7.07 (N = 15), 0.09 (N = 3). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
JPEG XL 0.3.1, Input: PNG - Encode Speed: 5 (MP/s, More Is Better). Runs 1 to 5: 52.50, 53.03, 53.44, 53.11, 53.69. Reported SE: +/- 0.14, 0.15, 0.33, 0.16, 0.13 (N = 3 each). Notes: 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL 0.3.1, Input: PNG - Encode Speed: 7 (MP/s, More Is Better). Runs 1 to 5: 7.86, 7.93, 7.93, 7.92, 7.91. Reported SE: +/- 0.02, 0.01, 0.02, 0.02, 0.01 (N = 3 each). Notes: 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL 0.3.1, Input: PNG - Encode Speed: 8 (MP/s, More Is Better). Runs 1 to 5: 0.71, 0.70, 0.70, 0.70, 0.70. Reported SE: +/- 0.00, 0.01, 0.00, 0.00, 0.00 (N = 3 each). Notes: 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL 0.3.1, Input: JPEG - Encode Speed: 5 (MP/s, More Is Better). Runs 1 to 5: 48.64, 49.08, 48.89, 48.85, 49.13. Reported SE: +/- 0.07, 0.26, 0.03, 0.07, 0.09 (N = 3 each). Notes: 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL 0.3.1, Input: JPEG - Encode Speed: 7 (MP/s, More Is Better). Runs 1 to 5: 48.50, 49.22, 49.41, 48.85, 49.18. Reported SE: +/- 0.11, 0.28, 0.07, 0.19, 0.14 (N = 3 each). Notes: 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL 0.3.1, Input: JPEG - Encode Speed: 8 (MP/s, More Is Better). Runs 1 to 5: 23.19, 23.25, 23.43, 23.37, 23.37. Reported SE: +/- 0.05, 0.08, 0.09, 0.09, 0.14 (N = 3 each). Notes: 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Decoding 0.3.1, CPU Threads: 1 (MP/s, More Is Better). Runs 1 to 5: 31.81, 32.39, 32.26, 32.17, 32.30. Reported SE: +/- 0.03, 0.04, 0.06, 0.04, 0.08 (N = 3 each).
JPEG XL Decoding 0.3.1, CPU Threads: All (MP/s, More Is Better). Runs 1 to 5: 175.22, 178.23, 177.88, 178.61, 178.47. Reported SE: +/- 0.25, 0.25, 0.10, 0.45, 0.19 (N = 3 each).
WebP2 Image Encode 20210126, Encode Settings: Default (Seconds, Fewer Is Better). Runs 1 to 5: 3.369, 3.363, 3.325, 3.347, 3.350. Reported SE: +/- 0.034, 0.012, 0.027, 0.049, 0.057 (N = 3 each). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode 20210126, Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better). Runs 1 to 5: 155.74, 156.55, 157.53, 154.88, 156.75. Reported SE: +/- 1.77, 0.15, 0.29, 0.66, 0.73 (N = 3 each). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode 20210126, Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better). Runs 1 to 5: 282.05, 280.64, 282.58, 279.57, 283.33. Reported SE: +/- 1.07, 0.54, 2.02, 1.22, 2.02 (N = 3 each). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode 20210126, Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better). Runs 1 to 5: 9.195, 9.110, 9.075, 9.309, 9.216. Reported SE: +/- 0.105, 0.020, 0.024, 0.141, 0.149 (N = 3 each). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode 20210126, Encode Settings: Quality 100, Lossless Compression (Seconds, Fewer Is Better). Runs 1 to 5: 593.07, 590.29, 596.41, 593.53, 590.21. Reported SE: +/- 1.66, 0.45, 0.58, 1.92, 0.58 (N = 3 each). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Google SynthMark 20201109, Test: VoiceMark_100 (Voices, More Is Better). Runs 1 to 5: 553.64, 555.30, 553.55, 548.56, 552.79. Reported SE: +/- 2.84, 1.36, 3.76, 2.23, 1.67 (N = 3 each). Notes: 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
IOR 3.3.0, Block Size: 4MB - Disk Target: Default Test Directory (MB/s, More Is Better). Runs 1 to 5: 887.94, 466.30, 467.76, 493.98, 478.01. Reported SE: +/- 1.87 (N = 3), 5.42 (N = 15), 45.69 (N = 12), 6.63 (N = 15), 5.81 (N = 6). Reported MIN / MAX: 629.82 / 1334.23, 212.95 / 1329.58, 188.74 / 1435.44, 240.44 / 1344.87, 236.85 / 1266.85. Notes: 1. (CC) gcc options: -O2 -lm -pthread -lmpi
Gcrypt Library 1.9 (Seconds, Fewer Is Better). Runs 1 to 5: 212.80, 212.06, 211.79, 211.90, 212.96. Reported SE: +/- 0.63, 0.34, 0.06, 0.49, 0.29 (N = 3 each). Notes: 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
QuantLib 1.21 (MFLOPS, More Is Better). Runs 1 to 5: 2222.4, 2222.6, 2207.1, 2215.6, 2216.0. Reported SE: +/- 17.98 (N = 13), 19.83 (N = 12), 27.77 (N = 12), 21.07 (N = 12), 28.27 (N = 12). Notes: 1. (CXX) g++ options: -O3 -march=native -rdynamic
CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better). Runs 1 to 5: 85.54, 84.88, 84.21, 84.27, 85.20. Reported SE: +/- 0.18, 0.09, 0.18, 0.12, 0.08 (N = 3 each). Notes: 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Mobile Neural Network 1.1.1, Model: SqueezeNetV1.0 (ms, Fewer Is Better). Runs 1 to 5: 7.108, 7.232, 7.355, 7.281, 7.272. Reported SE: +/- 0.091 (N = 5), 0.085 (N = 3), 0.110 (N = 3), 0.070 (N = 3), 0.007 (N = 3). Reported MIN / MAX: 6.52 / 7.68, 6.81 / 7.85, 6.8 / 7.87, 6.97 / 7.74, 7 / 7.67. Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1, Model: resnet-v2-50 (ms, Fewer Is Better). Runs 1 to 5: 37.18, 37.35, 37.28, 37.61, 37.34. Reported SE: +/- 0.32 (N = 5), 0.12 (N = 3), 0.11 (N = 3), 0.11 (N = 3), 0.08 (N = 3). Reported MIN / MAX: 35.67 / 38.06, 36.97 / 37.99, 36.85 / 38.11, 37.16 / 38.09, 36.95 / 37.85. Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1, Model: MobileNetV2_224 (ms, Fewer Is Better). Runs 1 to 5: 4.201, 4.634, 4.348, 4.513, 4.546. Reported SE: +/- 0.129 (N = 5), 0.086 (N = 3), 0.205 (N = 3), 0.041 (N = 3), 0.008 (N = 3). Reported MIN / MAX: 3.68 / 4.8, 4.2 / 4.99, 3.62 / 4.88, 4.22 / 4.96, 4.28 / 4.91. Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1, Model: mobilenet-v1-1.0 (ms, Fewer Is Better). Runs 1 to 5: 2.676, 2.618, 2.686, 2.662, 2.645. Reported SE: +/- 0.029 (N = 5), 0.019 (N = 3), 0.045 (N = 3), 0.010 (N = 3), 0.011 (N = 3). Reported MIN / MAX: 2.44 / 3.19, 2.5 / 2.92, 2.42 / 3.11, 2.48 / 2.98, 2.44 / 3.04. Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1, Model: inception-v3 (ms, Fewer Is Better). Runs 1 to 5: 44.16, 48.33, 45.97, 48.38, 48.64. Reported SE: +/- 1.80 (N = 5), 0.10 (N = 3), 2.44 (N = 3), 0.11 (N = 3), 0.04 (N = 3). Reported MIN / MAX: 40.29 / 51.28, 47.9 / 50.26, 40.58 / 48.74, 47.91 / 48.97, 48.25 / 49.28. Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ONNX Runtime 1.6, Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better). Runs 1 to 5: 553, 555, 557, 555, 555. Reported SE: +/- 1.09, 0.29, 2.25, 1.64, 0.88 (N = 3 each). Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better). Runs 1 to 5: 630, 649, 658, 659, 652. Reported SE: +/- 5.46 (N = 3), 11.30 (N = 12), 13.29 (N = 12), 13.34 (N = 12), 11.09 (N = 12). Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better). Runs 1 to 5: 143, 145, 144, 144, 145. Reported SE: +/- 0.44, 0.33, 0.87, 0.44, 0.17 (N = 3 each). Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better). Runs 1 to 5: 9436, 9471, 9541, 9533, 9519. Reported SE: +/- 35.99, 30.21, 36.96, 46.43, 54.82 (N = 3 each). Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better). Runs 1 to 5: 7531, 7249, 7080, 7457, 6939. Reported SE: +/- 57.46 (N = 3), 133.82 (N = 12), 173.63 (N = 9), 109.17 (N = 3), 212.04 (N = 12). Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
TNN 0.2.3, Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better). Runs 1 to 5: 361.24, 360.92, 361.45, 361.43, 361.49. Reported SE: +/- 0.37, 0.27, 0.39, 0.38, 0.31 (N = 3 each). Reported MIN / MAX: 357.28 / 391.58, 356.72 / 372.66, 356.32 / 386.75, 356.24 / 380.38, 357.11 / 384.74. Notes: 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN 0.2.3, Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better). Runs 1 to 5: 316.36, 319.05, 317.14, 318.21, 319.31. Reported SE: +/- 0.46, 0.57, 0.62, 0.59, 0.43 (N = 3 each). Reported MIN / MAX: 313.3 / 320.26, 313 / 340.64, 314.18 / 323.54, 316.15 / 320.81, 316.94 / 338.61. Notes: 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
GROMACS 2021, Input: water_GMX50_bare (Ns Per Day, More Is Better). Runs 1 to 5: 1.699, 1.698, 1.698, 1.700, 1.703. Reported SE: +/- 0.003, 0.002, 0.002, 0.003, 0.002 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -pthread
LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: 20k Atoms (ns/day, More Is Better). Runs 1 to 5: 11.18, 11.21, 11.20, 11.21, 11.22. Reported SE: +/- 0.03, 0.01, 0.05, 0.02, 0.03 (N = 3 each). Notes: 1. (CXX) g++ options: -O3 -pthread -lm
LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: Rhodopsin Protein (ns/day, More Is Better). Runs 1 to 5: 9.483, 9.084, 9.408, 9.099, 8.901. Reported SE: +/- 0.370 (N = 12), 0.433 (N = 15), 0.361 (N = 12), 0.334 (N = 15), 0.306 (N = 15). Notes: 1. (CXX) g++ options: -O3 -pthread -lm
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better). Runs 1 to 5: 2081.02, 2001.11, 2033.28, 1997.31, 1999.52. Reported SE: +/- 23.92 (N = 15), 51.51 (N = 15), 51.69 (N = 12), 32.10 (N = 15), 46.79 (N = 15). Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better). Runs 1 to 5: 2140.50, 2241.41, 2202.89, 2112.54, 2214.08. Reported SE: +/- 10.76 (N = 3), 3.15 (N = 3), 34.24 (N = 3), 35.18 (N = 3), 28.36 (N = 4). Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better). Runs 1 to 5: 47323.88, 47734.40, 47778.76, 47482.70, 47360.36. Reported SE: +/- 154.44, 7.96, 139.20, 376.58, 44.69 (N = 3 each). Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 2.23, N = 3 SE +/- 0.35, N = 3 SE +/- 2.62, N = 3 SE +/- 0.61, N = 3 SE +/- 0.31, N = 3 1815.85 1820.90 1813.56 1821.82 1821.36 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 4 5 600 1200 1800 2400 3000 SE +/- 10.95, N = 3 SE +/- 4.46, N = 3 SE +/- 7.85, N = 3 SE +/- 4.51, N = 3 SE +/- 11.98, N = 3 2618.53 2622.75 2633.11 2633.32 2609.75 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: tConvolve OpenMP - Gridding
Million Grid Points Per Second, More Is Better
  1: 2458.07 (SE +/- 19.92, N = 3)
  2: 2420.91 (SE +/- 22.21, N = 3)
  3: 2428.72 (SE +/- 32.33, N = 3)
  4: 2420.64 (SE +/- 12.71, N = 3)
  5: 2442.85 (SE +/- 12.94, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: tConvolve OpenMP - Degridding
Million Grid Points Per Second, More Is Better
  1: 3973.97 (SE +/- 0.00, N = 3)
  2: 3973.97 (SE +/- 0.00, N = 3)
  3: 3973.97 (SE +/- 0.00, N = 3)
  4: 3973.97 (SE +/- 0.00, N = 3)
  5: 3973.97 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: Hogbom Clean OpenMP
Iterations Per Second, More Is Better
  1: 376.89 (SE +/- 0.47, N = 3)
  2: 377.36 (SE +/- 0.82, N = 3)
  3: 377.84 (SE +/- 0.48, N = 3)
  4: 378.79 (SE +/- 0.00, N = 3)
  5: 378.80 (SE +/- 1.66, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Pennant 1.0.1 - Test: sedovbig
Hydro Cycle Time - Seconds, Fewer Is Better
  1: 53.14 (SE +/- 0.12, N = 3)
  2: 52.67 (SE +/- 0.13, N = 3)
  3: 52.71 (SE +/- 0.36, N = 3)
  4: 52.77 (SE +/- 0.21, N = 3)
  5: 52.85 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Pennant 1.0.1 - Test: leblancbig
Hydro Cycle Time - Seconds, Fewer Is Better
  1: 34.30 (SE +/- 0.03, N = 3)
  2: 33.69 (SE +/- 0.09, N = 3)
  3: 33.56 (SE +/- 0.13, N = 3)
  4: 33.40 (SE +/- 0.17, N = 3)
  5: 33.94 (SE +/- 0.29, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
LULESH 2.0.3
z/s, More Is Better
  1: 6695.04 (SE +/- 21.98, N = 3)
  2: 6707.86 (SE +/- 16.05, N = 3)
  3: 6678.49 (SE +/- 43.78, N = 3)
  4: 6697.32 (SE +/- 39.16, N = 3)
  5: 6708.23 (SE +/- 30.71, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
QMCPACK 3.10 - Input: simple-H2O
Total Execution Time - Seconds, Fewer Is Better
  1: 38.34 (SE +/- 0.09, N = 3)
  2: 39.05 (SE +/- 0.45, N = 3)
  3: 38.49 (SE +/- 0.23, N = 3)
  4: 39.80 (SE +/- 0.68, N = 3)
  5: 38.62 (SE +/- 0.24, N = 3)
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Chaos Group V-RAY 5 - Mode: CPU
vsamples, More Is Better
  1: 17828 (SE +/- 47.06, N = 3)
  2: 17872 (SE +/- 70.27, N = 3)
  3: 17842 (SE +/- 6.44, N = 3)
  4: 17742 (SE +/- 54.08, N = 3)
  5: 17849 (SE +/- 84.18, N = 3)
rav1e 0.4 - Speed: 1
Frames Per Second, More Is Better
  1: 0.358 (SE +/- 0.000, N = 3)
  2: 0.359 (SE +/- 0.000, N = 3)
  3: 0.359 (SE +/- 0.000, N = 3)
  4: 0.360 (SE +/- 0.000, N = 3)
  5: 0.359 (SE +/- 0.000, N = 3)
rav1e 0.4 - Speed: 5
Frames Per Second, More Is Better
  1: 0.975 (SE +/- 0.002, N = 3)
  2: 0.976 (SE +/- 0.003, N = 3)
  3: 0.975 (SE +/- 0.002, N = 3)
  4: 0.976 (SE +/- 0.001, N = 3)
  5: 0.977 (SE +/- 0.001, N = 3)
rav1e 0.4 - Speed: 6
Frames Per Second, More Is Better
  1: 1.269 (SE +/- 0.001, N = 3)
  2: 1.272 (SE +/- 0.001, N = 3)
  3: 1.270 (SE +/- 0.001, N = 3)
  4: 1.269 (SE +/- 0.002, N = 3)
  5: 1.274 (SE +/- 0.001, N = 3)
rav1e 0.4 - Speed: 10
Frames Per Second, More Is Better
  1: 2.677 (SE +/- 0.006, N = 3)
  2: 2.675 (SE +/- 0.012, N = 3)
  3: 2.694 (SE +/- 0.005, N = 3)
  4: 2.679 (SE +/- 0.005, N = 3)
  5: 2.676 (SE +/- 0.007, N = 3)
Timed Godot Game Engine Compilation 3.2.3 - Time To Compile
Seconds, Fewer Is Better
  1: 98.89 (SE +/- 0.06, N = 3)
  2: 98.93 (SE +/- 0.29, N = 3)
  3: 98.87 (SE +/- 0.11, N = 3)
  4: 98.96 (SE +/- 0.11, N = 3)
  5: 98.87 (SE +/- 0.11, N = 3)
FinanceBench 2016-07-25 - Benchmark: Repo OpenMP
ms, Fewer Is Better
  1: 39726.16 (SE +/- 210.55, N = 3)
  2: 39461.82 (SE +/- 76.80, N = 3)
  3: 39681.57 (SE +/- 151.55, N = 3)
  4: 39350.22 (SE +/- 69.89, N = 3)
  5: 39275.55 (SE +/- 20.70, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp
Redis 6.0.9 - Test: LPOP
Requests Per Second, More Is Better
  1: 2698309.25 (SE +/- 29192.48, N = 3)
  2: 1769937.88 (SE +/- 2566.99, N = 3)
  3: 1797918.46 (SE +/- 19169.69, N = 3)
  4: 1761464.25 (SE +/- 15108.16, N = 3)
  5: 1770711.21 (SE +/- 18070.29, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: SADD
Requests Per Second, More Is Better
  1: 2153944.42 (SE +/- 18441.13, N = 3)
  2: 2168410.90 (SE +/- 27569.90, N = 5)
  3: 2159575.21 (SE +/- 34602.53, N = 3)
  4: 2158064.17 (SE +/- 20189.91, N = 3)
  5: 2179924.50 (SE +/- 11360.66, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: LPUSH
Requests Per Second, More Is Better
  1: 1712504.00 (SE +/- 10755.06, N = 3)
  2: 1642807.13 (SE +/- 13174.46, N = 3)
  3: 1695317.29 (SE +/- 9034.55, N = 3)
  4: 1699000.00 (SE +/- 10458.55, N = 3)
  5: 1682254.45 (SE +/- 10827.27, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: GET
Requests Per Second, More Is Better
  1: 2539377.42 (SE +/- 22779.82, N = 3)
  2: 2415143.75 (SE +/- 31640.05, N = 3)
  3: 2431811.58 (SE +/- 17482.31, N = 3)
  4: 2442355.92 (SE +/- 18752.77, N = 3)
  5: 2443276.75 (SE +/- 23178.69, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: SET
Requests Per Second, More Is Better
  1: 1912809.71 (SE +/- 15512.42, N = 3)
  2: 1952727.25 (SE +/- 23318.24, N = 3)
  3: 1889099.97 (SE +/- 24702.46, N = 4)
  4: 1906086.42 (SE +/- 30239.19, N = 3)
  5: 1923099.42 (SE +/- 21792.45, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
GnuPG 2.2.27 - 2.7GB Sample File Encryption
Seconds, Fewer Is Better
  1: 67.39 (SE +/- 0.36, N = 3)
  2: 67.18 (SE +/- 0.12, N = 3)
  3: 67.12 (SE +/- 0.12, N = 3)
  4: 67.15 (SE +/- 0.20, N = 3)
  5: 67.10 (SE +/- 0.15, N = 3)
1. (CC) gcc options: -O2
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080
Frames / Sec, More Is Better
  1: 4.80 (SE +/- 0.00, N = 3)
  2: 4.81 (SE +/- 0.00, N = 3)
  3: 4.80 (SE +/- 0.00, N = 3)
  4: 4.67 (SE +/- 0.13, N = 11)
  5: 4.80 (SE +/- 0.00, N = 3)
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080
MiPolys / Sec, More Is Better
  1: 481.69 (SE +/- 0.37, N = 3)
  2: 481.83 (SE +/- 0.39, N = 3)
  3: 481.42 (SE +/- 0.27, N = 3)
  4: 468.46 (SE +/- 12.92, N = 11)
  5: 481.35 (SE +/- 0.34, N = 3)
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080
Frames / Sec, More Is Better
  1: 72.31 (SE +/- 0.03, N = 3)
  2: 72.26 (SE +/- 0.02, N = 3)
  3: 72.23 (SE +/- 0.02, N = 3)
  4: 72.32 (SE +/- 0.04, N = 3)
  5: 72.31 (SE +/- 0.06, N = 3)
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080
MiVoxels / Sec, More Is Better
  1: 1156.97 (SE +/- 0.52, N = 3)
  2: 1156.22 (SE +/- 0.36, N = 3)
  3: 1155.67 (SE +/- 0.26, N = 3)
  4: 1157.11 (SE +/- 0.60, N = 3)
  5: 1156.87 (SE +/- 0.98, N = 3)
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080
Frames / Sec, More Is Better
  1: 82.97 (SE +/- 0.05, N = 3)
  2: 83.00 (SE +/- 0.02, N = 3)
  3: 82.98 (SE +/- 0.01, N = 3)
  4: 83.01 (SE +/- 0.04, N = 3)
  5: 82.97 (SE +/- 0.03, N = 3)
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080
MiPolys / Sec, More Is Better
  1: 864.68 (SE +/- 0.51, N = 3)
  2: 864.94 (SE +/- 0.16, N = 3)
  3: 864.80 (SE +/- 0.14, N = 3)
  4: 865.03 (SE +/- 0.39, N = 3)
  5: 864.65 (SE +/- 0.27, N = 3)
IOR 3.3.0 - Block Size: 2MB - Disk Target: Default Test Directory
MB/s, More Is Better
  1: 592.90 (SE +/- 5.79, N = 3; MIN: 431.01 / MAX: 1028.65)
  2: 396.56 (SE +/- 4.65, N = 3; MIN: 214.64 / MAX: 1028.95)
  3: 429.36 (SE +/- 7.93, N = 15; MIN: 220.22 / MAX: 1149.54)
  4: 419.08 (SE +/- 10.65, N = 12; MIN: 152.75 / MAX: 1057.83)
  5: 437.60 (SE +/- 7.25, N = 15; MIN: 162.6 / MAX: 1036.24)
1. (CC) gcc options: -O2 -lm -pthread -lmpi
FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP
ms, Fewer Is Better
  1: 55767.16 (SE +/- 86.69, N = 3)
  2: 55622.97 (SE +/- 37.92, N = 3)
  3: 55666.72 (SE +/- 83.09, N = 3)
  4: 55656.12 (SE +/- 86.41, N = 3)
  5: 55876.18 (SE +/- 53.09, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.5