7980XE Feb 2921 Intel Core i9-7980XE testing with a ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102162-HA-7980XEFEB73&grt .
7980XE Feb 2921 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 4 5 Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads) ASUS PRIME X299-A (2002 BIOS) Intel Sky Lake-E DMI3 Registers 16GB Samsung SSD 970 EVO 500GB Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz) Realtek ALC1220 G237HL Intel I219-V Ubuntu 20.10 5.8.0-36-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 4.6 Mesa 20.2.6 (LLVM 11.0.0) 1.2.131 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08 Graphics Details - GLAMOR Python Details - Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
7980XE Feb 2921 askap: tConvolve MT - Gridding askap: tConvolve MT - Degridding askap: tConvolve OpenMP - Gridding askap: tConvolve OpenMP - Degridding askap: Hogbom Clean OpenMP v-ray: CPU cloverleaf: Lagrangian-Eulerian Hydrodynamics etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 etcpak: ETC1 + Dithering financebench: Repo OpenMP financebench: Bonds OpenMP gcrypt: gnupg: 2.7GB Sample File Encryption synthmark: VoiceMark_100 gromacs: water_GMX50_bare ior: 2MB - Default Test Directory ior: 4MB - Default Test Directory ior: 8MB - Default Test Directory ior: 16MB - Default Test Directory ior: 32MB - Default Test Directory ior: 64MB - Default Test Directory jpegxl: PNG - 5 jpegxl: PNG - 7 jpegxl: PNG - 8 jpegxl: JPEG - 5 jpegxl: JPEG - 7 jpegxl: JPEG - 8 jpegxl-decode: 1 jpegxl-decode: All lammps: 20k Atoms lammps: Rhodopsin Protein lulesh: lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 npb: EP.C npb: EP.D npb: LU.C ngspice: C2670 ngspice: C7552 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU paraview: Many Spheres - 1920 x 1080 paraview: Many Spheres - 1920 x 1080 paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Contour - 1920 x 1080 pennant: sedovbig pennant: leblancbig qmcpack: simple-H2O quantlib: rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET build-godot: Time To Compile tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 webp2: Default webp2: Quality 75, Compression Effort 7 webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression 1 2 3 4 5 1815.85 2618.53 2458.07 3973.97 376.885 17828 85.54 1365.386 315.616 183.995 303.570 39726.164063 55767.164062 212.796 67.390 553.640 1.699 592.90 887.94 525.44 481.36 493.82 492.85 52.50 7.86 0.71 48.64 48.50 23.19 31.81 175.22 11.176 9.483 6695.0413 42 115 495 1767 85 1792 111 483 469 613 189 711 237 7.108 37.184 4.201 2.676 44.157 2081.02 2140.50 47323.88 155.596 138.661 553 630 143 9436 7531 4.80 481.692 72.31 1156.973 82.97 864.682 53.14447 34.29865 38.340 2222.4 0.358 0.975 1.269 2.677 2698309.25 2153944.42 1712504.00 2539377.42 1912809.71 98.889 361.235 316.358 3.369 155.740 282.047 9.195 593.066 1820.90 2622.75 2420.91 3973.97 377.362 17872 84.88 1387.302 313.703 182.977 303.514 39461.821614 55622.968750 212.057 67.175 555.295 1.698 396.56 466.30 530.83 484.06 474.62 480.71 53.03 7.93 0.70 49.08 49.22 23.25 32.39 178.23 11.210 9.084 6707.8596 42 115 495 1767 86 1801 109 484 470 612 189 711 238 7.232 37.345 4.634 2.618 48.330 2001.11 2241.41 47734.40 151.978 137.582 555 649 145 9471 7249 4.81 481.834 72.26 1156.216 83.00 864.937 52.66769 33.68917 39.048 2222.6 0.359 0.976 1.272 2.675 1769937.88 2168410.90 1642807.13 2415143.75 1952727.25 98.932 360.923 319.049 3.363 156.550 280.636 9.110 590.293 1813.56 2633.11 2428.72 3973.97 377.835 17842 84.21 1385.842 314.662 183.424 303.105 39681.566406 55666.716146 211.787 67.124 553.553 1.698 429.36 467.76 354.01 442.47 470.37 489.25 53.44 7.93 0.7 48.89 49.41 23.43 32.26 177.88 11.195 9.408 6678.4880 42 114 496 1772 85 1795 110 484 472 612 189 709 238 7.355 37.283 4.348 2.686 45.970 2033.28 2202.89 47778.76 153.061 139.573 557 658 144 9541 7080 4.80 481.420 72.23 1155.674 82.98 864.804 52.70973 33.55896 38.489 2207.1 0.359 0.975 1.270 2.694 1797918.46 2159575.21 1695317.29 2431811.58 1889099.97 98.872 361.454 317.136 3.325 157.528 282.575 9.075 596.408 1821.82 2633.32 2420.64 3973.97 378.788 17742 84.27 1376.344 314.271 182.722 290.726 39350.223958 55656.115885 211.895 67.151 548.556 1.700 419.08 493.98 527.45 500.39 484.50 490.66 53.11 7.92 0.70 48.85 48.85 23.37 32.17 178.61 11.205 9.099 6697.3184 42 115 496 1771 86 1803 109 483 469 612 188 708 237 7.281 37.607 4.513 2.662 48.375 1997.31 2112.54 47482.70 154.214 139.032 555 659 144 9533 7457 4.67 468.456 72.32 1157.110 83.01 865.027 52.76675 33.40175 39.795 2215.6 0.360 0.976 1.269 2.679 1761464.25 2158064.17 1699000.00 2442355.92 1906086.42 98.964 361.425 318.208 3.347 154.880 279.573 9.309 593.530 1821.36 2609.75 2442.85 3973.97 378.802 17849 85.20 1377.894 313.693 183.118 303.890 39275.548177 55876.180990 212.963 67.102 552.792 1.703 437.60 478.01 510.05 506.02 516.94 493.41 53.69 7.91 0.7 49.13 49.18 23.37 32.30 178.47 11.216 8.901 6708.2322 42 114 494 1770 85 1800 110 483 470 611 189 709 237 7.272 37.338 4.546 2.645 48.640 1999.52 2214.08 47360.36 153.099 139.017 555 652 145 9519 6939 4.80 481.347 72.31 1156.869 82.97 864.654 52.84683 33.93533 38.621 2216.0 0.359 0.977 1.274 2.676 1770711.21 2179924.5 1682254.45 2443276.75 1923099.42 98.873 361.488 319.314 3.350 156.753 283.331 9.216 590.211 OpenBenchmarking.org
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 2.23, N = 3 SE +/- 0.35, N = 3 SE +/- 2.62, N = 3 SE +/- 0.61, N = 3 SE +/- 0.31, N = 3 1815.85 1820.90 1813.56 1821.82 1821.36 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 4 5 600 1200 1800 2400 3000 SE +/- 10.95, N = 3 SE +/- 4.46, N = 3 SE +/- 7.85, N = 3 SE +/- 4.51, N = 3 SE +/- 11.98, N = 3 2618.53 2622.75 2633.11 2633.32 2609.75 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 19.92, N = 3 SE +/- 22.21, N = 3 SE +/- 32.33, N = 3 SE +/- 12.71, N = 3 SE +/- 12.94, N = 3 2458.07 2420.91 2428.72 2420.64 2442.85 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 1 2 3 4 5 900 1800 2700 3600 4500 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3973.97 3973.97 3973.97 3973.97 3973.97 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 1 2 3 4 5 80 160 240 320 400 SE +/- 0.47, N = 3 SE +/- 0.82, N = 3 SE +/- 0.48, N = 3 SE +/- 0.00, N = 3 SE +/- 1.66, N = 3 376.89 377.36 377.84 378.79 378.80 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU 1 2 3 4 5 4K 8K 12K 16K 20K SE +/- 47.06, N = 3 SE +/- 70.27, N = 3 SE +/- 6.44, N = 3 SE +/- 54.08, N = 3 SE +/- 84.18, N = 3 17828 17872 17842 17742 17849
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 2 3 4 5 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 SE +/- 0.12, N = 3 SE +/- 0.08, N = 3 85.54 84.88 84.21 84.27 85.20 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 4 5 300 600 900 1200 1500 SE +/- 13.09, N = 3 SE +/- 3.63, N = 3 SE +/- 4.38, N = 3 SE +/- 3.03, N = 3 SE +/- 2.52, N = 3 1365.39 1387.30 1385.84 1376.34 1377.89 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 4 5 70 140 210 280 350 SE +/- 0.30, N = 3 SE +/- 0.85, N = 3 SE +/- 1.01, N = 3 SE +/- 0.49, N = 3 SE +/- 0.49, N = 3 315.62 313.70 314.66 314.27 313.69 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 4 5 40 80 120 160 200 SE +/- 0.08, N = 3 SE +/- 0.28, N = 3 SE +/- 0.16, N = 3 SE +/- 0.12, N = 3 SE +/- 0.23, N = 3 184.00 182.98 183.42 182.72 183.12 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 4 5 70 140 210 280 350 SE +/- 0.86, N = 3 SE +/- 0.32, N = 3 SE +/- 0.60, N = 3 SE +/- 7.07, N = 15 SE +/- 0.09, N = 3 303.57 303.51 303.11 290.73 303.89 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 1 2 3 4 5 9K 18K 27K 36K 45K SE +/- 210.55, N = 3 SE +/- 76.80, N = 3 SE +/- 151.55, N = 3 SE +/- 69.89, N = 3 SE +/- 20.70, N = 3 39726.16 39461.82 39681.57 39350.22 39275.55 1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 1 2 3 4 5 12K 24K 36K 48K 60K SE +/- 86.69, N = 3 SE +/- 37.92, N = 3 SE +/- 83.09, N = 3 SE +/- 86.41, N = 3 SE +/- 53.09, N = 3 55767.16 55622.97 55666.72 55656.12 55876.18 1. (CXX) g++ options: -O3 -march=native -fopenmp
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 4 5 50 100 150 200 250 SE +/- 0.63, N = 3 SE +/- 0.34, N = 3 SE +/- 0.06, N = 3 SE +/- 0.49, N = 3 SE +/- 0.29, N = 3 212.80 212.06 211.79 211.90 212.96 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 1 2 3 4 5 15 30 45 60 75 SE +/- 0.36, N = 3 SE +/- 0.12, N = 3 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 SE +/- 0.15, N = 3 67.39 67.18 67.12 67.15 67.10 1. (CC) gcc options: -O2
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 4 5 120 240 360 480 600 SE +/- 2.84, N = 3 SE +/- 1.36, N = 3 SE +/- 3.76, N = 3 SE +/- 2.23, N = 3 SE +/- 1.67, N = 3 553.64 555.30 553.55 548.56 552.79 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
GROMACS Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021 Input: water_GMX50_bare 1 2 3 4 5 0.3832 0.7664 1.1496 1.5328 1.916 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 1.699 1.698 1.698 1.700 1.703 1. (CXX) g++ options: -O3 -pthread
IOR Block Size: 2MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 2MB - Disk Target: Default Test Directory 1 2 3 4 5 130 260 390 520 650 SE +/- 5.79, N = 3 SE +/- 4.65, N = 3 SE +/- 7.93, N = 15 SE +/- 10.65, N = 12 SE +/- 7.25, N = 15 592.90 396.56 429.36 419.08 437.60 MIN: 431.01 / MAX: 1028.65 MIN: 214.64 / MAX: 1028.95 MIN: 220.22 / MAX: 1149.54 MIN: 152.75 / MAX: 1057.83 MIN: 162.6 / MAX: 1036.24 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 4MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 4MB - Disk Target: Default Test Directory 1 2 3 4 5 200 400 600 800 1000 SE +/- 1.87, N = 3 SE +/- 5.42, N = 15 SE +/- 45.69, N = 12 SE +/- 6.63, N = 15 SE +/- 5.81, N = 6 887.94 466.30 467.76 493.98 478.01 MIN: 629.82 / MAX: 1334.23 MIN: 212.95 / MAX: 1329.58 MIN: 188.74 / MAX: 1435.44 MIN: 240.44 / MAX: 1344.87 MIN: 236.85 / MAX: 1266.85 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 8MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 8MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 25.39, N = 15 SE +/- 5.42, N = 15 SE +/- 4.96, N = 15 SE +/- 4.21, N = 14 SE +/- 6.12, N = 5 525.44 530.83 354.01 527.45 510.05 MIN: 290.23 / MAX: 1447.59 MIN: 248.03 / MAX: 1386.96 MIN: 189.98 / MAX: 1385.03 MIN: 251.24 / MAX: 1378.27 MIN: 222.93 / MAX: 1266.18 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 16MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 16MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 6.27, N = 3 SE +/- 5.80, N = 15 SE +/- 6.54, N = 3 SE +/- 1.65, N = 3 SE +/- 7.12, N = 3 481.36 484.06 442.47 500.39 506.02 MIN: 316.02 / MAX: 1504.02 MIN: 224.25 / MAX: 1505.62 MIN: 217.24 / MAX: 1379.75 MIN: 315.3 / MAX: 1210.23 MIN: 308.33 / MAX: 1247.46 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 32MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 32MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 3.89, N = 3 SE +/- 4.72, N = 8 SE +/- 5.73, N = 3 SE +/- 4.09, N = 12 SE +/- 4.67, N = 3 493.82 474.62 470.37 484.50 516.94 MIN: 245.23 / MAX: 1245.29 MIN: 202.58 / MAX: 1355.46 MIN: 197.76 / MAX: 1370.54 MIN: 196.66 / MAX: 1538.21 MIN: 413.96 / MAX: 1183.64 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 64MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 64MB - Disk Target: Default Test Directory 1 2 3 4 5 110 220 330 440 550 SE +/- 3.92, N = 3 SE +/- 2.46, N = 3 SE +/- 4.79, N = 3 SE +/- 4.68, N = 3 SE +/- 6.10, N = 3 492.85 480.71 489.25 490.66 493.41 MIN: 402.07 / MAX: 1345.65 MIN: 301.17 / MAX: 1079.47 MIN: 374.86 / MAX: 1033.08 MIN: 348.43 / MAX: 1036.5 MIN: 246.91 / MAX: 1088.06 1. (CC) gcc options: -O2 -lm -pthread -lmpi
JPEG XL Input: PNG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 5 1 2 3 4 5 12 24 36 48 60 SE +/- 0.14, N = 3 SE +/- 0.15, N = 3 SE +/- 0.33, N = 3 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 52.50 53.03 53.44 53.11 53.69 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: PNG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 7 1 2 3 4 5 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 7.86 7.93 7.93 7.92 7.91 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: PNG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: PNG - Encode Speed: 8 1 2 3 4 5 0.1598 0.3196 0.4794 0.6392 0.799 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.71 0.70 0.70 0.70 0.70 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: JPEG - Encode Speed: 5 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 5 1 2 3 4 5 11 22 33 44 55 SE +/- 0.07, N = 3 SE +/- 0.26, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 48.64 49.08 48.89 48.85 49.13 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: JPEG - Encode Speed: 7 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 7 1 2 3 4 5 11 22 33 44 55 SE +/- 0.11, N = 3 SE +/- 0.28, N = 3 SE +/- 0.07, N = 3 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 48.50 49.22 49.41 48.85 49.18 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Input: JPEG - Encode Speed: 8 OpenBenchmarking.org MP/s, More Is Better JPEG XL 0.3.1 Input: JPEG - Encode Speed: 8 1 2 3 4 5 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 23.19 23.25 23.43 23.37 23.37 1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Decoding CPU Threads: 1 OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: 1 1 2 3 4 5 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 31.81 32.39 32.26 32.17 32.30
JPEG XL Decoding CPU Threads: All OpenBenchmarking.org MP/s, More Is Better JPEG XL Decoding 0.3.1 CPU Threads: All 1 2 3 4 5 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.25, N = 3 SE +/- 0.10, N = 3 SE +/- 0.45, N = 3 SE +/- 0.19, N = 3 175.22 178.23 177.88 178.61 178.47
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms 1 2 3 4 5 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 11.18 11.21 11.20 11.21 11.22 1. (CXX) g++ options: -O3 -pthread -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 4 5 3 6 9 12 15 SE +/- 0.370, N = 12 SE +/- 0.433, N = 15 SE +/- 0.361, N = 12 SE +/- 0.334, N = 15 SE +/- 0.306, N = 15 9.483 9.084 9.408 9.099 8.901 1. (CXX) g++ options: -O3 -pthread -lm
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 1 2 3 4 5 1400 2800 4200 5600 7000 SE +/- 21.98, N = 3 SE +/- 16.05, N = 3 SE +/- 43.78, N = 3 SE +/- 39.16, N = 3 SE +/- 30.71, N = 3 6695.04 6707.86 6678.49 6697.32 6708.23 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1 2 3 4 5 10 20 30 40 50 SE +/- 0.33, N = 3 42 42 42 42 42 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 4 5 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 115 115 114 115 114 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 4 5 110 220 330 440 550 SE +/- 2.19, N = 3 SE +/- 2.00, N = 3 SE +/- 2.67, N = 3 SE +/- 3.67, N = 3 SE +/- 3.67, N = 3 495 495 496 496 494 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 6.00, N = 3 SE +/- 3.06, N = 3 SE +/- 1.76, N = 3 SE +/- 1.73, N = 3 1767 1767 1772 1771 1770 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 4 5 20 40 60 80 100 85 86 85 86 85 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 3.33, N = 3 SE +/- 1.86, N = 3 SE +/- 3.38, N = 3 SE +/- 3.71, N = 3 1792 1801 1795 1803 1800 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 4 5 20 40 60 80 100 SE +/- 0.58, N = 3 SE +/- 1.00, N = 3 SE +/- 0.67, N = 3 111 109 110 109 110 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 4 5 100 200 300 400 500 SE +/- 0.67, N = 3 SE +/- 1.53, N = 3 SE +/- 0.88, N = 3 SE +/- 1.15, N = 3 SE +/- 1.86, N = 3 483 484 484 483 483 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 4 5 100 200 300 400 500 SE +/- 0.88, N = 3 SE +/- 1.67, N = 3 SE +/- 1.20, N = 3 SE +/- 1.33, N = 3 SE +/- 0.88, N = 3 469 470 472 469 470 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 4 5 130 260 390 520 650 SE +/- 1.20, N = 3 SE +/- 2.73, N = 3 SE +/- 1.00, N = 3 SE +/- 0.67, N = 3 613 612 612 612 611 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 4 5 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 189 189 189 188 189 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 4 5 150 300 450 600 750 SE +/- 0.88, N = 3 711 711 709 708 709 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 4 5 50 100 150 200 250 SE +/- 0.58, N = 3 237 238 238 237 237 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 2 3 4 5 2 4 6 8 10 SE +/- 0.091, N = 5 SE +/- 0.085, N = 3 SE +/- 0.110, N = 3 SE +/- 0.070, N = 3 SE +/- 0.007, N = 3 7.108 7.232 7.355 7.281 7.272 MIN: 6.52 / MAX: 7.68 MIN: 6.81 / MAX: 7.85 MIN: 6.8 / MAX: 7.87 MIN: 6.97 / MAX: 7.74 MIN: 7 / MAX: 7.67 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 2 3 4 5 9 18 27 36 45 SE +/- 0.32, N = 5 SE +/- 0.12, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 37.18 37.35 37.28 37.61 37.34 MIN: 35.67 / MAX: 38.06 MIN: 36.97 / MAX: 37.99 MIN: 36.85 / MAX: 38.11 MIN: 37.16 / MAX: 38.09 MIN: 36.95 / MAX: 37.85 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 3 4 5 1.0427 2.0854 3.1281 4.1708 5.2135 SE +/- 0.129, N = 5 SE +/- 0.086, N = 3 SE +/- 0.205, N = 3 SE +/- 0.041, N = 3 SE +/- 0.008, N = 3 4.201 4.634 4.348 4.513 4.546 MIN: 3.68 / MAX: 4.8 MIN: 4.2 / MAX: 4.99 MIN: 3.62 / MAX: 4.88 MIN: 4.22 / MAX: 4.96 MIN: 4.28 / MAX: 4.91 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 1 2 3 4 5 0.6044 1.2088 1.8132 2.4176 3.022 SE +/- 0.029, N = 5 SE +/- 0.019, N = 3 SE +/- 0.045, N = 3 SE +/- 0.010, N = 3 SE +/- 0.011, N = 3 2.676 2.618 2.686 2.662 2.645 MIN: 2.44 / MAX: 3.19 MIN: 2.5 / MAX: 2.92 MIN: 2.42 / MAX: 3.11 MIN: 2.48 / MAX: 2.98 MIN: 2.44 / MAX: 3.04 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 1 2 3 4 5 11 22 33 44 55 SE +/- 1.80, N = 5 SE +/- 0.10, N = 3 SE +/- 2.44, N = 3 SE +/- 0.11, N = 3 SE +/- 0.04, N = 3 44.16 48.33 45.97 48.38 48.64 MIN: 40.29 / MAX: 51.28 MIN: 47.9 / MAX: 50.26 MIN: 40.58 / MAX: 48.74 MIN: 47.91 / MAX: 48.97 MIN: 48.25 / MAX: 49.28 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 23.92, N = 15 SE +/- 51.51, N = 15 SE +/- 51.69, N = 12 SE +/- 32.10, N = 15 SE +/- 46.79, N = 15 2081.02 2001.11 2033.28 1997.31 1999.52 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 10.76, N = 3 SE +/- 3.15, N = 3 SE +/- 34.24, N = 3 SE +/- 35.18, N = 3 SE +/- 28.36, N = 4 2140.50 2241.41 2202.89 2112.54 2214.08 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 4 5 10K 20K 30K 40K 50K SE +/- 154.44, N = 3 SE +/- 7.96, N = 3 SE +/- 139.20, N = 3 SE +/- 376.58, N = 3 SE +/- 44.69, N = 3 47323.88 47734.40 47778.76 47482.70 47360.36 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 1 2 3 4 5 30 60 90 120 150 SE +/- 2.33, N = 3 SE +/- 1.75, N = 3 SE +/- 1.44, N = 3 SE +/- 2.23, N = 3 SE +/- 1.61, N = 3 155.60 151.98 153.06 154.21 153.10 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 1 2 3 4 5 30 60 90 120 150 SE +/- 0.78, N = 3 SE +/- 0.10, N = 3 SE +/- 1.23, N = 3 SE +/- 1.21, N = 3 SE +/- 2.04, N = 3 138.66 137.58 139.57 139.03 139.02 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 3 4 5 120 240 360 480 600 SE +/- 1.09, N = 3 SE +/- 0.29, N = 3 SE +/- 2.25, N = 3 SE +/- 1.64, N = 3 SE +/- 0.88, N = 3 553 555 557 555 555 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 4 5 140 280 420 560 700 SE +/- 5.46, N = 3 SE +/- 11.30, N = 12 SE +/- 13.29, N = 12 SE +/- 13.34, N = 12 SE +/- 11.09, N = 12 630 649 658 659 652 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 4 5 30 60 90 120 150 SE +/- 0.44, N = 3 SE +/- 0.33, N = 3 SE +/- 0.87, N = 3 SE +/- 0.44, N = 3 SE +/- 0.17, N = 3 143 145 144 144 145 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 4 5 2K 4K 6K 8K 10K SE +/- 35.99, N = 3 SE +/- 30.21, N = 3 SE +/- 36.96, N = 3 SE +/- 46.43, N = 3 SE +/- 54.82, N = 3 9436 9471 9541 9533 9519 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 4 5 1600 3200 4800 6400 8000 SE +/- 57.46, N = 3 SE +/- 133.82, N = 12 SE +/- 173.63, N = 9 SE +/- 109.17, N = 3 SE +/- 212.04, N = 12 7531 7249 7080 7457 6939 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 1 2 3 4 5 1.0823 2.1646 3.2469 4.3292 5.4115 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.13, N = 11 SE +/- 0.00, N = 3 4.80 4.81 4.80 4.67 4.80
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 1 2 3 4 5 100 200 300 400 500 SE +/- 0.37, N = 3 SE +/- 0.39, N = 3 SE +/- 0.27, N = 3 SE +/- 12.92, N = 11 SE +/- 0.34, N = 3 481.69 481.83 481.42 468.46 481.35
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 5 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 72.31 72.26 72.23 72.32 72.31
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 5 200 400 600 800 1000 SE +/- 0.52, N = 3 SE +/- 0.36, N = 3 SE +/- 0.26, N = 3 SE +/- 0.60, N = 3 SE +/- 0.98, N = 3 1156.97 1156.22 1155.67 1157.11 1156.87
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 5 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 82.97 83.00 82.98 83.01 82.97
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 5 200 400 600 800 1000 SE +/- 0.51, N = 3 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 SE +/- 0.39, N = 3 SE +/- 0.27, N = 3 864.68 864.94 864.80 865.03 864.65
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 1 2 3 4 5 12 24 36 48 60 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 SE +/- 0.36, N = 3 SE +/- 0.21, N = 3 SE +/- 0.26, N = 3 53.14 52.67 52.71 52.77 52.85 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 1 2 3 4 5 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.13, N = 3 SE +/- 0.17, N = 3 SE +/- 0.29, N = 3 34.30 33.69 33.56 33.40 33.94 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 4 5 9 18 27 36 45 SE +/- 0.09, N = 3 SE +/- 0.45, N = 3 SE +/- 0.23, N = 3 SE +/- 0.68, N = 3 SE +/- 0.24, N = 3 38.34 39.05 38.49 39.80 38.62 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 4 5 500 1000 1500 2000 2500 SE +/- 17.98, N = 13 SE +/- 19.83, N = 12 SE +/- 27.77, N = 12 SE +/- 21.07, N = 12 SE +/- 28.27, N = 12 2222.4 2222.6 2207.1 2215.6 2216.0 1. (CXX) g++ options: -O3 -march=native -rdynamic
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 1 2 3 4 5 0.081 0.162 0.243 0.324 0.405 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.358 0.359 0.359 0.360 0.359
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 3 4 5 0.2198 0.4396 0.6594 0.8792 1.099 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.975 0.976 0.975 0.976 0.977
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 2 3 4 5 0.2867 0.5734 0.8601 1.1468 1.4335 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 1.269 1.272 1.270 1.269 1.274
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 1 2 3 4 5 0.6062 1.2124 1.8186 2.4248 3.031 SE +/- 0.006, N = 3 SE +/- 0.012, N = 3 SE +/- 0.005, N = 3 SE +/- 0.005, N = 3 SE +/- 0.007, N = 3 2.677 2.675 2.694 2.679 2.676
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 4 5 600K 1200K 1800K 2400K 3000K SE +/- 29192.48, N = 3 SE +/- 2566.99, N = 3 SE +/- 19169.69, N = 3 SE +/- 15108.16, N = 3 SE +/- 18070.29, N = 3 2698309.25 1769937.88 1797918.46 1761464.25 1770711.21 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 4 5 500K 1000K 1500K 2000K 2500K SE +/- 18441.13, N = 3 SE +/- 27569.90, N = 5 SE +/- 34602.53, N = 3 SE +/- 20189.91, N = 3 SE +/- 11360.66, N = 3 2153944.42 2168410.90 2159575.21 2158064.17 2179924.50 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 4 5 400K 800K 1200K 1600K 2000K SE +/- 10755.06, N = 3 SE +/- 13174.46, N = 3 SE +/- 9034.55, N = 3 SE +/- 10458.55, N = 3 SE +/- 10827.27, N = 3 1712504.00 1642807.13 1695317.29 1699000.00 1682254.45 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 4 5 500K 1000K 1500K 2000K 2500K SE +/- 22779.82, N = 3 SE +/- 31640.05, N = 3 SE +/- 17482.31, N = 3 SE +/- 18752.77, N = 3 SE +/- 23178.69, N = 3 2539377.42 2415143.75 2431811.58 2442355.92 2443276.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 4 5 400K 800K 1200K 1600K 2000K SE +/- 15512.42, N = 3 SE +/- 23318.24, N = 3 SE +/- 24702.46, N = 4 SE +/- 30239.19, N = 3 SE +/- 21792.45, N = 3 1912809.71 1952727.25 1889099.97 1906086.42 1923099.42 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 1 2 3 4 5 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.29, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 98.89 98.93 98.87 98.96 98.87
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 4 5 80 160 240 320 400 SE +/- 0.37, N = 3 SE +/- 0.27, N = 3 SE +/- 0.39, N = 3 SE +/- 0.38, N = 3 SE +/- 0.31, N = 3 361.24 360.92 361.45 361.43 361.49 MIN: 357.28 / MAX: 391.58 MIN: 356.72 / MAX: 372.66 MIN: 356.32 / MAX: 386.75 MIN: 356.24 / MAX: 380.38 MIN: 357.11 / MAX: 384.74 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 4 5 70 140 210 280 350 SE +/- 0.46, N = 3 SE +/- 0.57, N = 3 SE +/- 0.62, N = 3 SE +/- 0.59, N = 3 SE +/- 0.43, N = 3 316.36 319.05 317.14 318.21 319.31 MIN: 313.3 / MAX: 320.26 MIN: 313 / MAX: 340.64 MIN: 314.18 / MAX: 323.54 MIN: 316.15 / MAX: 320.81 MIN: 316.94 / MAX: 338.61 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 1 2 3 4 5 0.758 1.516 2.274 3.032 3.79 SE +/- 0.034, N = 3 SE +/- 0.012, N = 3 SE +/- 0.027, N = 3 SE +/- 0.049, N = 3 SE +/- 0.057, N = 3 3.369 3.363 3.325 3.347 3.350 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 1 2 3 4 5 30 60 90 120 150 SE +/- 1.77, N = 3 SE +/- 0.15, N = 3 SE +/- 0.29, N = 3 SE +/- 0.66, N = 3 SE +/- 0.73, N = 3 155.74 156.55 157.53 154.88 156.75 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 1 2 3 4 5 60 120 180 240 300 SE +/- 1.07, N = 3 SE +/- 0.54, N = 3 SE +/- 2.02, N = 3 SE +/- 1.22, N = 3 SE +/- 2.02, N = 3 282.05 280.64 282.58 279.57 283.33 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 1 2 3 4 5 3 6 9 12 15 SE +/- 0.105, N = 3 SE +/- 0.020, N = 3 SE +/- 0.024, N = 3 SE +/- 0.141, N = 3 SE +/- 0.149, N = 3 9.195 9.110 9.075 9.309 9.216 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 1 2 3 4 5 130 260 390 520 650 SE +/- 1.66, N = 3 SE +/- 0.45, N = 3 SE +/- 0.58, N = 3 SE +/- 1.92, N = 3 SE +/- 0.58, N = 3 593.07 590.29 596.41 593.53 590.21 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Phoronix Test Suite v10.8.5