7980XE Feb 2921

Intel Core i9-7980XE testing with an ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102162-HA-7980XEFEB73&grs&sor

Result identifiers: 1, 2, 3, 4, 5

Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)
Motherboard: ASUS PRIME X299-A (2002 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 16GB
Disk: Samsung SSD 970 EVO 500GB
Graphics: Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz)
Audio: Realtek ALC1220
Monitor: G237HL
Network: Intel I219-V
OS: Ubuntu 20.10
Kernel: 5.8.0-36-generic (x86_64)
Desktop: GNOME Shell 3.38.1
Display Server: X Server 1.20.9
OpenGL: 4.6 Mesa 20.2.6 (LLVM 11.0.0)
Vulkan: 1.2.131
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08
Graphics Details - GLAMOR
Python Details - Python 3.8.6
Security Details -
  itlb_multihit: KVM: Mitigation of VMX disabled
  l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
  mds: Mitigation of Clear buffers; SMT vulnerable
  meltdown: Mitigation of PTI
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
  srbds: Not affected
  tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

Test | 1 | 2 | 3 | 4 | 5
redis: LPOP | 2698309.25 | 1769937.88 | 1797918.46 | 1761464.25 | 1770711.21
ior: 16MB - Default Test Directory | 481.36 | 484.06 | 442.47 | 500.39 | 506.02
ior: 32MB - Default Test Directory | 493.82 | 474.62 | 470.37 | 484.50 | 516.94
npb: EP.D | 2140.50 | 2241.41 | 2202.89 | 2112.54 | 2214.08
redis: GET | 2539377.42 | 2415143.75 | 2431811.58 | 2442355.92 | 2443276.75
redis: LPUSH | 1712504.00 | 1642807.13 | 1695317.29 | 1699000.00 | 1682254.45
qmcpack: simple-H2O | 38.340 | 39.048 | 38.489 | 39.795 | 38.621
mnn: SqueezeNetV1.0 | 7.108 | 7.232 | 7.355 | 7.281 | 7.272
redis: SET | 1912809.71 | 1952727.25 | 1889099.97 | 1906086.42 | 1923099.42
pennant: leblancbig | 34.29865 | 33.68917 | 33.55896 | 33.40175 | 33.93533
ior: 64MB - Default Test Directory | 492.85 | 480.71 | 489.25 | 490.66 | 493.41
mnn: mobilenet-v1-1.0 | 2.676 | 2.618 | 2.686 | 2.662 | 2.645
webp2: Quality 100, Compression Effort 5 | 9.195 | 9.110 | 9.075 | 9.309 | 9.216
ngspice: C2670 | 155.596 | 151.978 | 153.061 | 154.214 | 153.099
jpegxl: PNG - 5 | 52.50 | 53.03 | 53.44 | 53.11 | 53.69
jpegxl-decode: All | 175.22 | 178.23 | 177.88 | 178.61 | 178.47
jpegxl: JPEG - 7 | 48.50 | 49.22 | 49.41 | 48.85 | 49.18
lzbench: Crush 0 - Compression | 111 | 109 | 110 | 109 | 110
jpegxl-decode: 1 | 31.81 | 32.39 | 32.26 | 32.17 | 32.30
webp2: Quality 75, Compression Effort 7 | 155.740 | 156.550 | 157.528 | 154.880 | 156.753
etcpak: DXT1 | 1365.386 | 1387.302 | 1385.842 | 1376.344 | 1377.894
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 85.54 | 84.88 | 84.21 | 84.27 | 85.20
askap: tConvolve OpenMP - Gridding | 2458.07 | 2420.91 | 2428.72 | 2420.64 | 2442.85
ngspice: C7552 | 138.661 | 137.582 | 139.573 | 139.032 | 139.017
jpegxl: PNG - 8 | 0.71 | 0.70 | 0.7 | 0.70 | 0.7
onnx: fcn-resnet101-11 - OpenMP CPU | 143 | 145 | 144 | 144 | 145
webp2: Quality 95, Compression Effort 7 | 282.047 | 280.636 | 282.575 | 279.573 | 283.331
webp2: Default | 3.369 | 3.363 | 3.325 | 3.347 | 3.350
synthmark: VoiceMark_100 | 553.640 | 555.295 | 553.553 | 548.556 | 552.792
redis: SADD | 2153944.42 | 2168410.90 | 2159575.21 | 2158064.17 | 2179924.5
lzbench: Zstd 8 - Compression | 85 | 86 | 85 | 86 | 85
financebench: Repo OpenMP | 39726.164063 | 39461.821614 | 39681.566406 | 39350.223958 | 39275.548177
mnn: resnet-v2-50 | 37.184 | 37.345 | 37.283 | 37.607 | 37.338
onnx: shufflenet-v2-10 - OpenMP CPU | 9436 | 9471 | 9541 | 9533 | 9519
webp2: Quality 100, Lossless Compression | 593.066 | 590.293 | 596.408 | 593.530 | 590.211
jpegxl: JPEG - 8 | 23.19 | 23.25 | 23.43 | 23.37 | 23.37
jpegxl: JPEG - 5 | 48.64 | 49.08 | 48.89 | 48.85 | 49.13
npb: LU.C | 47323.88 | 47734.40 | 47778.76 | 47482.70 | 47360.36
tnn: CPU - SqueezeNet v1.1 | 316.358 | 319.049 | 317.136 | 318.208 | 319.314
pennant: sedovbig | 53.14447 | 52.66769 | 52.70973 | 52.76675 | 52.84683
askap: tConvolve MT - Degridding | 2618.53 | 2622.75 | 2633.11 | 2633.32 | 2609.75
jpegxl: PNG - 7 | 7.86 | 7.93 | 7.93 | 7.92 | 7.91
lzbench: XZ 0 - Decompression | 115 | 115 | 114 | 115 | 114
v-ray: CPU | 17828 | 17872 | 17842 | 17742 | 17849
onnx: yolov4 - OpenMP CPU | 553 | 555 | 557 | 555 | 555
rav1e: 10 | 2.677 | 2.675 | 2.694 | 2.679 | 2.676
quantlib: | 2222.4 | 2222.6 | 2207.1 | 2215.6 | 2216.0
etcpak: ETC2 | 183.995 | 182.977 | 183.424 | 182.722 | 183.118
lzbench: Brotli 0 - Compression | 469 | 470 | 472 | 469 | 470
lzbench: Zstd 8 - Decompression | 1792 | 1801 | 1795 | 1803 | 1800
etcpak: ETC1 | 315.616 | 313.703 | 314.662 | 314.271 | 313.693
rav1e: 1 | 0.358 | 0.359 | 0.359 | 0.360 | 0.359
gcrypt: | 212.796 | 212.057 | 211.787 | 211.895 | 212.963
lzbench: Brotli 2 - Compression | 189 | 189 | 189 | 188 | 189
askap: Hogbom Clean OpenMP | 376.885 | 377.362 | 377.835 | 378.788 | 378.802
askap: tConvolve MT - Gridding | 1815.85 | 1820.90 | 1813.56 | 1821.82 | 1821.36
financebench: Bonds OpenMP | 55767.164062 | 55622.968750 | 55666.716146 | 55656.115885 | 55876.180990
lulesh: | 6695.0413 | 6707.8596 | 6678.4880 | 6697.3184 | 6708.2322
gnupg: 2.7GB Sample File Encryption | 67.390 | 67.175 | 67.124 | 67.151 | 67.102
lzbench: Brotli 2 - Decompression | 711 | 711 | 709 | 708 | 709
lzbench: Libdeflate 1 - Compression | 237 | 238 | 238 | 237 | 237
lzbench: Zstd 1 - Compression | 495 | 495 | 496 | 496 | 494
rav1e: 6 | 1.269 | 1.272 | 1.270 | 1.269 | 1.274
lammps: 20k Atoms | 11.176 | 11.210 | 11.195 | 11.205 | 11.216
lzbench: Brotli 0 - Decompression | 613 | 612 | 612 | 612 | 611
gromacs: water_GMX50_bare | 1.699 | 1.698 | 1.698 | 1.700 | 1.703
lzbench: Zstd 1 - Decompression | 1767 | 1767 | 1772 | 1771 | 1770
lzbench: Crush 0 - Decompression | 483 | 484 | 484 | 483 | 483
rav1e: 5 | 0.975 | 0.976 | 0.975 | 0.976 | 0.977
tnn: CPU - MobileNet v2 | 361.235 | 360.923 | 361.454 | 361.425 | 361.488
paraview: Wavelet Volume - 1920 x 1080 | 72.31 | 72.26 | 72.23 | 72.32 | 72.31
paraview: Wavelet Volume - 1920 x 1080 | 1156.973 | 1156.216 | 1155.674 | 1157.110 | 1156.869
build-godot: Time To Compile | 98.889 | 98.932 | 98.872 | 98.964 | 98.873
paraview: Wavelet Contour - 1920 x 1080 | 82.97 | 83.00 | 82.98 | 83.01 | 82.97
paraview: Wavelet Contour - 1920 x 1080 | 864.682 | 864.937 | 864.804 | 865.027 | 864.654
askap: tConvolve OpenMP - Degridding | 3973.97 | 3973.97 | 3973.97 | 3973.97 | 3973.97
lzbench: XZ 0 - Compression | 42 | 42 | 42 | 42 | 42
onnx: super-resolution-10 - OpenMP CPU | 7531 | 7249 | 7080 | 7457 | 6939
onnx: bertsquad-10 - OpenMP CPU | 630 | 649 | 658 | 659 | 652
mnn: inception-v3 | 44.157 | 48.330 | 45.970 | 48.375 | 48.640
mnn: MobileNetV2_224 | 4.201 | 4.634 | 4.348 | 4.513 | 4.546
lammps: Rhodopsin Protein | 9.483 | 9.084 | 9.408 | 9.099 | 8.901
npb: EP.C | 2081.02 | 2001.11 | 2033.28 | 1997.31 | 1999.52
etcpak: ETC1 + Dithering | 303.570 | 303.514 | 303.105 | 290.726 | 303.890
paraview: Many Spheres - 1920 x 1080 | 481.692 | 481.834 | 481.420 | 468.456 | 481.347
paraview: Many Spheres - 1920 x 1080 | 4.80 | 4.81 | 4.80 | 4.67 | 4.80
ior: 8MB - Default Test Directory | 525.44 | 530.83 | 354.01 | 527.45 | 510.05
ior: 4MB - Default Test Directory | 887.94 | 466.30 | 467.76 | 493.98 | 478.01
ior: 2MB - Default Test Directory | 592.90 | 396.56 | 429.36 | 419.08 | 437.60

Redis 6.0.9
Test: LPOP
Requests Per Second, More Is Better
Run | Result | SE +/- | N
1 | 2698309.25 | 29192.48 | 3
3 | 1797918.46 | 19169.69 | 3
5 | 1770711.21 | 18070.29 | 3
2 | 1769937.88 | 2566.99 | 3
4 | 1761464.25 | 15108.16 | 3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

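Run 1 stands well apart from the other four runs in the Redis LPOP result above. The following is a minimal Python sketch of one way to quantify that, assuming the SE entries in the chart are listed in the same order as the run labels (1 3 5 2 4). The helper names `percent_vs_median` and `gap_in_se` are hypothetical and not part of the Phoronix Test Suite.

```python
from statistics import median

# Redis LPOP results (requests per second) for runs 1-5, copied from the chart.
lpop = {1: 2698309.25, 2: 1769937.88, 3: 1797918.46, 4: 1761464.25, 5: 1770711.21}
# Standard errors, assuming they are listed in the same order as the run labels.
lpop_se = {1: 29192.48, 2: 2566.99, 3: 19169.69, 4: 15108.16, 5: 18070.29}


def percent_vs_median(results):
    """Return each run's deviation from the median result, in percent."""
    mid = median(results.values())
    return {run: 100.0 * (value - mid) / mid for run, value in results.items()}


def gap_in_se(results, errors, a, b):
    """Gap between runs a and b, divided by their combined standard error."""
    combined = (errors[a] ** 2 + errors[b] ** 2) ** 0.5
    return (results[a] - results[b]) / combined


deviation = percent_vs_median(lpop)
for run in sorted(deviation):
    print(f"run {run}: {deviation[run]:+.1f}% vs median")
print(f"run 1 vs run 3: {gap_in_se(lpop, lpop_se, 1, 3):.1f} combined SEs apart")
```

Run 1 comes out about 52% above the median of the five runs, far outside the reported standard errors, so the gap is unlikely to be run-to-run noise. The export does not say what caused it.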
IOR 3.3.0
Block Size: 16MB - Disk Target: Default Test Directory
MB/s, More Is Better
Run | Result | SE +/- | N | MIN | MAX
5 | 506.02 | 7.12 | 3 | 308.33 | 1247.46
4 | 500.39 | 1.65 | 3 | 315.3 | 1210.23
2 | 484.06 | 5.80 | 15 | 224.25 | 1505.62
1 | 481.36 | 6.27 | 3 | 316.02 | 1504.02
3 | 442.47 | 6.54 | 3 | 217.24 | 1379.75
1. (CC) gcc options: -O2 -lm -pthread -lmpi

IOR 3.3.0
Block Size: 32MB - Disk Target: Default Test Directory
MB/s, More Is Better
Run | Result | SE +/- | N | MIN | MAX
5 | 516.94 | 4.67 | 3 | 413.96 | 1183.64
1 | 493.82 | 3.89 | 3 | 245.23 | 1245.29
4 | 484.50 | 4.09 | 12 | 196.66 | 1538.21
2 | 474.62 | 4.72 | 8 | 202.58 | 1355.46
3 | 470.37 | 5.73 | 3 | 197.76 | 1370.54
1. (CC) gcc options: -O2 -lm -pthread -lmpi

NAS Parallel Benchmarks 3.4
Test / Class: EP.D
Total Mop/s, More Is Better
Run | Result | SE +/- | N
2 | 2241.41 | 3.15 | 3
5 | 2214.08 | 28.36 | 4
3 | 2202.89 | 34.24 | 3
1 | 2140.50 | 10.76 | 3
4 | 2112.54 | 35.18 | 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

Redis 6.0.9
Test: GET
Requests Per Second, More Is Better
Run | Result | SE +/- | N
1 | 2539377.42 | 22779.82 | 3
5 | 2443276.75 | 23178.69 | 3
4 | 2442355.92 | 18752.77 | 3
3 | 2431811.58 | 17482.31 | 3
2 | 2415143.75 | 31640.05 | 3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 6.0.9
Test: LPUSH
Requests Per Second, More Is Better
Run | Result | SE +/- | N
1 | 1712504.00 | 10755.06 | 3
4 | 1699000.00 | 10458.55 | 3
3 | 1695317.29 | 9034.55 | 3
5 | 1682254.45 | 10827.27 | 3
2 | 1642807.13 | 13174.46 | 3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

QMCPACK 3.10
Input: simple-H2O
Total Execution Time - Seconds, Fewer Is Better
Run | Result | SE +/- | N
1 | 38.34 | 0.09 | 3
3 | 38.49 | 0.23 | 3
5 | 38.62 | 0.24 | 3
2 | 39.05 | 0.45 | 3
4 | 39.80 | 0.68 | 3
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

Mobile Neural Network 1.1.1
Model: SqueezeNetV1.0
ms, Fewer Is Better
Run | Result | SE +/- | N | MIN | MAX
1 | 7.108 | 0.091 | 5 | 6.52 | 7.68
2 | 7.232 | 0.085 | 3 | 6.81 | 7.85
5 | 7.272 | 0.007 | 3 | 7 | 7.67
4 | 7.281 | 0.070 | 3 | 6.97 | 7.74
3 | 7.355 | 0.110 | 3 | 6.8 | 7.87
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Redis 6.0.9
Test: SET
Requests Per Second, More Is Better
Run | Result | SE +/- | N
2 | 1952727.25 | 23318.24 | 3
5 | 1923099.42 | 21792.45 | 3
1 | 1912809.71 | 15512.42 | 3
4 | 1906086.42 | 30239.19 | 3
3 | 1889099.97 | 24702.46 | 4
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Pennant 1.0.1
Test: leblancbig
Hydro Cycle Time - Seconds, Fewer Is Better
Run | Result | SE +/- | N
4 | 33.40 | 0.17 | 3
3 | 33.56 | 0.13 | 3
2 | 33.69 | 0.09 | 3
5 | 33.94 | 0.29 | 3
1 | 34.30 | 0.03 | 3
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

IOR 3.3.0
Block Size: 64MB - Disk Target: Default Test Directory
MB/s, More Is Better
Run | Result | SE +/- | N | MIN | MAX
5 | 493.41 | 6.10 | 3 | 246.91 | 1088.06
1 | 492.85 | 3.92 | 3 | 402.07 | 1345.65
4 | 490.66 | 4.68 | 3 | 348.43 | 1036.5
3 | 489.25 | 4.79 | 3 | 374.86 | 1033.08
2 | 480.71 | 2.46 | 3 | 301.17 | 1079.47
1. (CC) gcc options: -O2 -lm -pthread -lmpi

Mobile Neural Network 1.1.1
Model: mobilenet-v1-1.0
ms, Fewer Is Better
Run | Result | SE +/- | N | MIN | MAX
2 | 2.618 | 0.019 | 3 | 2.5 | 2.92
5 | 2.645 | 0.011 | 3 | 2.44 | 3.04
4 | 2.662 | 0.010 | 3 | 2.48 | 2.98
1 | 2.676 | 0.029 | 5 | 2.44 | 3.19
3 | 2.686 | 0.045 | 3 | 2.42 | 3.11
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

WebP2 Image Encode 20210126
Encode Settings: Quality 100, Compression Effort 5
Seconds, Fewer Is Better
Run | Result | SE +/- | N
3 | 9.075 | 0.024 | 3
2 | 9.110 | 0.020 | 3
1 | 9.195 | 0.105 | 3
5 | 9.216 | 0.149 | 3
4 | 9.309 | 0.141 | 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Ngspice 34
Circuit: C2670
Seconds, Fewer Is Better
Run | Result | SE +/- | N
2 | 151.98 | 1.75 | 3
3 | 153.06 | 1.44 | 3
5 | 153.10 | 1.61 | 3
4 | 154.21 | 2.23 | 3
1 | 155.60 | 2.33 | 3
1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

JPEG XL 0.3.1
Input: PNG - Encode Speed: 5
MP/s, More Is Better
Run | Result | SE +/- | N
5 | 53.69 | 0.13 | 3
3 | 53.44 | 0.33 | 3
4 | 53.11 | 0.16 | 3
2 | 53.03 | 0.15 | 3
1 | 52.50 | 0.14 | 3
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL Decoding 0.3.1
CPU Threads: All
MP/s, More Is Better
Run | Result | SE +/- | N
4 | 178.61 | 0.45 | 3
5 | 178.47 | 0.19 | 3
2 | 178.23 | 0.25 | 3
3 | 177.88 | 0.10 | 3
1 | 175.22 | 0.25 | 3

JPEG XL 0.3.1
Input: JPEG - Encode Speed: 7
MP/s, More Is Better
Run | Result | SE +/- | N
3 | 49.41 | 0.07 | 3
2 | 49.22 | 0.28 | 3
5 | 49.18 | 0.14 | 3
4 | 48.85 | 0.19 | 3
1 | 48.50 | 0.11 | 3
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

lzbench 1.8
Test: Crush 0 - Process: Compression
MB/s, More Is Better
Run | Result
1 | 111
5 | 110
3 | 110
4 | 109
2 | 109
SE entries as exported (run assignment not recoverable): SE +/- 0.67, N = 3; SE +/- 1.00, N = 3; SE +/- 0.58, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

JPEG XL Decoding 0.3.1
CPU Threads: 1
MP/s, More Is Better
Run | Result | SE +/- | N
2 | 32.39 | 0.04 | 3
5 | 32.30 | 0.08 | 3
3 | 32.26 | 0.06 | 3
4 | 32.17 | 0.04 | 3
1 | 31.81 | 0.03 | 3

WebP2 Image Encode 20210126
Encode Settings: Quality 75, Compression Effort 7
Seconds, Fewer Is Better
Run | Result | SE +/- | N
4 | 154.88 | 0.66 | 3
1 | 155.74 | 1.77 | 3
2 | 156.55 | 0.15 | 3
5 | 156.75 | 0.73 | 3
3 | 157.53 | 0.29 | 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Etcpak 0.7
Configuration: DXT1
Mpx/s, More Is Better
Run | Result | SE +/- | N
2 | 1387.30 | 3.63 | 3
3 | 1385.84 | 4.38 | 3
5 | 1377.89 | 2.52 | 3
4 | 1376.34 | 3.03 | 3
1 | 1365.39 | 13.09 | 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

CloverLeaf
Lagrangian-Eulerian Hydrodynamics
Seconds, Fewer Is Better
Run | Result | SE +/- | N
3 | 84.21 | 0.18 | 3
4 | 84.27 | 0.12 | 3
2 | 84.88 | 0.09 | 3
5 | 85.20 | 0.08 | 3
1 | 85.54 | 0.18 | 3
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

ASKAP 1.0
Test: tConvolve OpenMP - Gridding
Million Grid Points Per Second, More Is Better
Run | Result | SE +/- | N
1 | 2458.07 | 19.92 | 3
5 | 2442.85 | 12.94 | 3
3 | 2428.72 | 32.33 | 3
2 | 2420.91 | 22.21 | 3
4 | 2420.64 | 12.71 | 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Ngspice 34
Circuit: C7552
Seconds, Fewer Is Better
Run | Result | SE +/- | N
2 | 137.58 | 0.10 | 3
1 | 138.66 | 0.78 | 3
5 | 139.02 | 2.04 | 3
4 | 139.03 | 1.21 | 3
3 | 139.57 | 1.23 | 3
1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

JPEG XL 0.3.1
Input: PNG - Encode Speed: 8
MP/s, More Is Better
Run | Result | SE +/- | N
1 | 0.71 | 0.00 | 3
5 | 0.70 | 0.00 | 3
4 | 0.70 | 0.00 | 3
3 | 0.70 | 0.00 | 3
2 | 0.70 | 0.01 | 3
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ONNX Runtime 1.6
Model: fcn-resnet101-11 - Device: OpenMP CPU
Inferences Per Minute, More Is Better
Run | Result | SE +/- | N
5 | 145 | 0.17 | 3
2 | 145 | 0.33 | 3
4 | 144 | 0.44 | 3
3 | 144 | 0.87 | 3
1 | 143 | 0.44 | 3
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

WebP2 Image Encode 20210126
Encode Settings: Quality 95, Compression Effort 7
Seconds, Fewer Is Better
Run | Result | SE +/- | N
4 | 279.57 | 1.22 | 3
2 | 280.64 | 0.54 | 3
1 | 282.05 | 1.07 | 3
3 | 282.58 | 2.02 | 3
5 | 283.33 | 2.02 | 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode 20210126
Encode Settings: Default
Seconds, Fewer Is Better
Run | Result | SE +/- | N
3 | 3.325 | 0.027 | 3
4 | 3.347 | 0.049 | 3
5 | 3.350 | 0.057 | 3
2 | 3.363 | 0.012 | 3
1 | 3.369 | 0.034 | 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Google SynthMark 20201109
Test: VoiceMark_100
Voices, More Is Better
Run | Result | SE +/- | N
2 | 555.30 | 1.36 | 3
1 | 553.64 | 2.84 | 3
3 | 553.55 | 3.76 | 3
5 | 552.79 | 1.67 | 3
4 | 548.56 | 2.23 | 3
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Redis 6.0.9
Test: SADD
Requests Per Second, More Is Better
Run | Result | SE +/- | N
5 | 2179924.50 | 11360.66 | 3
2 | 2168410.90 | 27569.90 | 5
3 | 2159575.21 | 34602.53 | 3
4 | 2158064.17 | 20189.91 | 3
1 | 2153944.42 | 18441.13 | 3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

lzbench 1.8
Test: Zstd 8 - Process: Compression
MB/s, More Is Better
Run | Result
4 | 86
2 | 86
5 | 85
3 | 85
1 | 85
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

FinanceBench 2016-07-25
Benchmark: Repo OpenMP
ms, Fewer Is Better
Run | Result | SE +/- | N
5 | 39275.55 | 20.70 | 3
4 | 39350.22 | 69.89 | 3
2 | 39461.82 | 76.80 | 3
3 | 39681.57 | 151.55 | 3
1 | 39726.16 | 210.55 | 3
1. (CXX) g++ options: -O3 -march=native -fopenmp

Mobile Neural Network 1.1.1
Model: resnet-v2-50
ms, Fewer Is Better
Run | Result | SE +/- | N | MIN | MAX
1 | 37.18 | 0.32 | 5 | 35.67 | 38.06
3 | 37.28 | 0.11 | 3 | 36.85 | 38.11
5 | 37.34 | 0.08 | 3 | 36.95 | 37.85
2 | 37.35 | 0.12 | 3 | 36.97 | 37.99
4 | 37.61 | 0.11 | 3 | 37.16 | 38.09
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ONNX Runtime 1.6
Model: shufflenet-v2-10 - Device: OpenMP CPU
Inferences Per Minute, More Is Better
Run | Result | SE +/- | N
3 | 9541 | 36.96 | 3
4 | 9533 | 46.43 | 3
5 | 9519 | 54.82 | 3
2 | 9471 | 30.21 | 3
1 | 9436 | 35.99 | 3
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

WebP2 Image Encode 20210126
Encode Settings: Quality 100, Lossless Compression
Seconds, Fewer Is Better
Run | Result | SE +/- | N
5 | 590.21 | 0.58 | 3
2 | 590.29 | 0.45 | 3
1 | 593.07 | 1.66 | 3
4 | 593.53 | 1.92 | 3
3 | 596.41 | 0.58 | 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

JPEG XL 0.3.1
Input: JPEG - Encode Speed: 8
MP/s, More Is Better
Run | Result | SE +/- | N
3 | 23.43 | 0.09 | 3
5 | 23.37 | 0.14 | 3
4 | 23.37 | 0.09 | 3
2 | 23.25 | 0.08 | 3
1 | 23.19 | 0.05 | 3
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL 0.3.1
Input: JPEG - Encode Speed: 5
MP/s, More Is Better
Run | Result | SE +/- | N
5 | 49.13 | 0.09 | 3
2 | 49.08 | 0.26 | 3
3 | 48.89 | 0.03 | 3
4 | 48.85 | 0.07 | 3
1 | 48.64 | 0.07 | 3
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

NAS Parallel Benchmarks 3.4
Test / Class: LU.C
Total Mop/s, More Is Better
Run | Result | SE +/- | N
3 | 47778.76 | 139.20 | 3
2 | 47734.40 | 7.96 | 3
4 | 47482.70 | 376.58 | 3
5 | 47360.36 | 44.69 | 3
1 | 47323.88 | 154.44 | 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

TNN 0.2.3
Target: CPU - Model: SqueezeNet v1.1
ms, Fewer Is Better
Run | Result | SE +/- | N | MIN | MAX
1 | 316.36 | 0.46 | 3 | 313.3 | 320.26
3 | 317.14 | 0.62 | 3 | 314.18 | 323.54
4 | 318.21 | 0.59 | 3 | 316.15 | 320.81
2 | 319.05 | 0.57 | 3 | 313 | 340.64
5 | 319.31 | 0.43 | 3 | 316.94 | 338.61
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Pennant 1.0.1
Test: sedovbig
Hydro Cycle Time - Seconds, Fewer Is Better
Run | Result | SE +/- | N
2 | 52.67 | 0.13 | 3
3 | 52.71 | 0.36 | 3
4 | 52.77 | 0.21 | 3
5 | 52.85 | 0.26 | 3
1 | 53.14 | 0.12 | 3
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

ASKAP 1.0
Test: tConvolve MT - Degridding
Million Grid Points Per Second, More Is Better
Run | Result | SE +/- | N
4 | 2633.32 | 4.51 | 3
3 | 2633.11 | 7.85 | 3
2 | 2622.75 | 4.46 | 3
1 | 2618.53 | 10.95 | 3
5 | 2609.75 | 11.98 | 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

JPEG XL 0.3.1
Input: PNG - Encode Speed: 7
MP/s, More Is Better
Run | Result | SE +/- | N
3 | 7.93 | 0.02 | 3
2 | 7.93 | 0.01 | 3
4 | 7.92 | 0.02 | 3
5 | 7.91 | 0.01 | 3
1 | 7.86 | 0.02 | 3
1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

lzbench 1.8
Test: XZ 0 - Process: Decompression
MB/s, More Is Better
Run | Result
4 | 115
2 | 115
1 | 115
5 | 114
3 | 114
SE entries as exported (run assignment not recoverable): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Chaos Group V-RAY 5
Mode: CPU
vsamples, More Is Better
Run | Result | SE +/- | N
2 | 17872 | 70.27 | 3
5 | 17849 | 84.18 | 3
3 | 17842 | 6.44 | 3
1 | 17828 | 47.06 | 3
4 | 17742 | 54.08 | 3

ONNX Runtime 1.6
Model: yolov4 - Device: OpenMP CPU
Inferences Per Minute, More Is Better
Run | Result | SE +/- | N
3 | 557 | 2.25 | 3
5 | 555 | 0.88 | 3
4 | 555 | 1.64 | 3
2 | 555 | 0.29 | 3
1 | 553 | 1.09 | 3
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e 0.4
Speed: 10
Frames Per Second, More Is Better
Run | Result | SE +/- | N
3 | 2.694 | 0.005 | 3
4 | 2.679 | 0.005 | 3
1 | 2.677 | 0.006 | 3
5 | 2.676 | 0.007 | 3
2 | 2.675 | 0.012 | 3

QuantLib 1.21
MFLOPS, More Is Better
Run | Result | SE +/- | N
2 | 2222.6 | 19.83 | 12
1 | 2222.4 | 17.98 | 13
5 | 2216.0 | 28.27 | 12
4 | 2215.6 | 21.07 | 12
3 | 2207.1 | 27.77 | 12
1. (CXX) g++ options: -O3 -march=native -rdynamic

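The QuantLib result above reports a standard error together with a sample count N for each run. The Python sketch below is a rough illustration that assumes "SE" is the standard error of the mean (SE = SD / sqrt(N)) and that the SE entries follow the same order as the run labels (2 1 5 4 3). It backs out the implied sample standard deviation and compares the spread of the five means with the smallest SE. The names `implied_std_dev`, `mean_spread`, and `smallest_se` are hypothetical.

```python
import math

# run -> (mean MFLOPS, standard error, sample count N), copied from the QuantLib chart.
# The SE-to-run pairing assumes the chart lists SE values in run-label order.
quantlib = {
    1: (2222.4, 17.98, 13),
    2: (2222.6, 19.83, 12),
    3: (2207.1, 27.77, 12),
    4: (2215.6, 21.07, 12),
    5: (2216.0, 28.27, 12),
}


def implied_std_dev(se, n):
    """Sample standard deviation implied by SE = SD / sqrt(N)."""
    return se * math.sqrt(n)


means = [mean for mean, _, _ in quantlib.values()]
mean_spread = max(means) - min(means)
smallest_se = min(se for _, se, _ in quantlib.values())

for run, (mean, se, n) in sorted(quantlib.items()):
    print(f"run {run}: mean {mean}, implied SD {implied_std_dev(se, n):.1f}")
print(f"spread of means: {mean_spread:.1f}, smallest SE: {smallest_se}")
```

Under these assumptions the gap between the best and worst mean is smaller than any single run's standard error, so the five QuantLib results are not distinguishable from one another.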
Etcpak 0.7
Configuration: ETC2
Mpx/s, More Is Better
Run | Result | SE +/- | N
1 | 184.00 | 0.08 | 3
3 | 183.42 | 0.16 | 3
5 | 183.12 | 0.23 | 3
2 | 182.98 | 0.28 | 3
4 | 182.72 | 0.12 | 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench 1.8
Test: Brotli 0 - Process: Compression
MB/s, More Is Better
Run | Result | SE +/- | N
3 | 472 | 1.20 | 3
5 | 470 | 0.88 | 3
2 | 470 | 1.67 | 3
4 | 469 | 1.33 | 3
1 | 469 | 0.88 | 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8
Test: Zstd 8 - Process: Decompression
MB/s, More Is Better
Run | Result
4 | 1803
2 | 1801
5 | 1800
3 | 1795
1 | 1792
SE entries as exported (run assignment not recoverable): SE +/- 3.38, N = 3; SE +/- 3.33, N = 3; SE +/- 3.71, N = 3; SE +/- 1.86, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Etcpak 0.7
Configuration: ETC1
Mpx/s, More Is Better
Run | Result | SE +/- | N
1 | 315.62 | 0.30 | 3
3 | 314.66 | 1.01 | 3
4 | 314.27 | 0.49 | 3
2 | 313.70 | 0.85 | 3
5 | 313.69 | 0.49 | 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

rav1e 0.4
Speed: 1
Frames Per Second, More Is Better
Run | Result | SE +/- | N
4 | 0.360 | 0.000 | 3
5 | 0.359 | 0.000 | 3
3 | 0.359 | 0.000 | 3
2 | 0.359 | 0.000 | 3
1 | 0.358 | 0.000 | 3

Gcrypt Library 1.9
Seconds, Fewer Is Better
Run | Result | SE +/- | N
3 | 211.79 | 0.06 | 3
4 | 211.90 | 0.49 | 3
2 | 212.06 | 0.34 | 3
1 | 212.80 | 0.63 | 3
5 | 212.96 | 0.29 | 3
1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

lzbench 1.8
Test: Brotli 2 - Process: Compression
MB/s, More Is Better
Run | Result
5 | 189
3 | 189
2 | 189
1 | 189
4 | 188
SE entries as exported (run assignment not recoverable): SE +/- 0.67, N = 3; SE +/- 0.33, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

ASKAP 1.0
Test: Hogbom Clean OpenMP
Iterations Per Second, More Is Better
Run | Result | SE +/- | N
5 | 378.80 | 1.66 | 3
4 | 378.79 | 0.00 | 3
3 | 377.84 | 0.48 | 3
2 | 377.36 | 0.82 | 3
1 | 376.89 | 0.47 | 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0
Test: tConvolve MT - Gridding
Million Grid Points Per Second, More Is Better
Run | Result | SE +/- | N
4 | 1821.82 | 0.61 | 3
5 | 1821.36 | 0.31 | 3
2 | 1820.90 | 0.35 | 3
1 | 1815.85 | 2.23 | 3
3 | 1813.56 | 2.62 | 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

FinanceBench 2016-07-25
Benchmark: Bonds OpenMP
ms, Fewer Is Better
Run | Result | SE +/- | N
2 | 55622.97 | 37.92 | 3
4 | 55656.12 | 86.41 | 3
3 | 55666.72 | 83.09 | 3
1 | 55767.16 | 86.69 | 3
5 | 55876.18 | 53.09 | 3
1. (CXX) g++ options: -O3 -march=native -fopenmp

LULESH 2.0.3
z/s, More Is Better
Run | Result | SE +/- | N
5 | 6708.23 | 30.71 | 3
2 | 6707.86 | 16.05 | 3
4 | 6697.32 | 39.16 | 3
1 | 6695.04 | 21.98 | 3
3 | 6678.49 | 43.78 | 3
1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

GnuPG 2.2.27
2.7GB Sample File Encryption
Seconds, Fewer Is Better
Run | Result | SE +/- | N
5 | 67.10 | 0.15 | 3
3 | 67.12 | 0.12 | 3
4 | 67.15 | 0.20 | 3
2 | 67.18 | 0.12 | 3
1 | 67.39 | 0.36 | 3
1. (CC) gcc options: -O2

lzbench 1.8
Test: Brotli 2 - Process: Decompression
MB/s, More Is Better
Run | Result
2 | 711
1 | 711
5 | 709
3 | 709
4 | 708
SE entries as exported (run assignment not recoverable): SE +/- 0.88, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8
Test: Libdeflate 1 - Process: Compression
MB/s, More Is Better
Run | Result
3 | 238
2 | 238
5 | 237
4 | 237
1 | 237
SE entries as exported (run assignment not recoverable): SE +/- 0.58, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 1.8
Test: Zstd 1 - Process: Compression
MB/s, More Is Better
Run | Result | SE +/- | N
4 | 496 | 3.67 | 3
3 | 496 | 2.67 | 3
2 | 495 | 2.00 | 3
1 | 495 | 2.19 | 3
5 | 494 | 3.67 | 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

rav1e 0.4
Speed: 6
Frames Per Second, More Is Better
Run | Result | SE +/- | N
5 | 1.274 | 0.001 | 3
2 | 1.272 | 0.001 | 3
3 | 1.270 | 0.001 | 3
4 | 1.269 | 0.002 | 3
1 | 1.269 | 0.001 | 3

LAMMPS Molecular Dynamics Simulator 29Oct2020
Model: 20k Atoms
ns/day, More Is Better
Run | Result | SE +/- | N
5 | 11.22 | 0.03 | 3
2 | 11.21 | 0.01 | 3
4 | 11.21 | 0.02 | 3
3 | 11.20 | 0.05 | 3
1 | 11.18 | 0.03 | 3
1. (CXX) g++ options: -O3 -pthread -lm

lzbench 1.8
Test: Brotli 0 - Process: Decompression
MB/s, More Is Better
Run | Result
1 | 613
4 | 612
3 | 612
2 | 612
5 | 611
SE entries as exported (run assignment not recoverable): SE +/- 1.00, N = 3; SE +/- 2.73, N = 3; SE +/- 1.20, N = 3; SE +/- 0.67, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

GROMACS 2021 - Input: water_GMX50_bare (Ns Per Day, More Is Better). Runs: 5, 4, 1, 3, 2. SE +/- 0.002, N = 3; SE +/- 0.003, N = 3; SE +/- 0.003, N = 3; SE +/- 0.002, N = 3; SE +/- 0.002, N = 3. Results: 1.703, 1.700, 1.699, 1.698, 1.698. 1. (CXX) g++ options: -O3 -pthread
lzbench 1.8 - Test: Zstd 1 - Process: Decompression (MB/s, More Is Better). Runs: 3, 4, 5, 2, 1. SE +/- 1.76, N = 3; SE +/- 1.73, N = 3; SE +/- 3.06, N = 3; SE +/- 6.00, N = 3. Results: 1772, 1771, 1770, 1767, 1767. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Crush 0 - Process: Decompression (MB/s, More Is Better). Runs: 3, 2, 5, 4, 1. SE +/- 0.88, N = 3; SE +/- 1.53, N = 3; SE +/- 1.86, N = 3; SE +/- 1.15, N = 3; SE +/- 0.67, N = 3. Results: 484, 484, 483, 483, 483. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
rav1e 0.4 - Speed: 5 (Frames Per Second, More Is Better). Runs: 5, 4, 2, 3, 1. SE +/- 0.001, N = 3; SE +/- 0.001, N = 3; SE +/- 0.003, N = 3; SE +/- 0.002, N = 3; SE +/- 0.002, N = 3. Results: 0.977, 0.976, 0.976, 0.975, 0.975.
TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better). Runs: 2, 1, 4, 3, 5. SE +/- 0.27, N = 3; SE +/- 0.37, N = 3; SE +/- 0.38, N = 3; SE +/- 0.39, N = 3; SE +/- 0.31, N = 3. Results: 360.92, 361.24, 361.43, 361.45, 361.49. MIN: 356.72 / MAX: 372.66; MIN: 357.28 / MAX: 391.58; MIN: 356.24 / MAX: 380.38; MIN: 356.32 / MAX: 386.75; MIN: 357.11 / MAX: 384.74. 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (Frames / Sec, More Is Better). Runs: 4, 5, 1, 2, 3. SE +/- 0.04, N = 3; SE +/- 0.06, N = 3; SE +/- 0.03, N = 3; SE +/- 0.02, N = 3; SE +/- 0.02, N = 3. Results: 72.32, 72.31, 72.31, 72.26, 72.23.
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (MiVoxels / Sec, More Is Better). Runs: 4, 1, 5, 2, 3. SE +/- 0.60, N = 3; SE +/- 0.52, N = 3; SE +/- 0.98, N = 3; SE +/- 0.36, N = 3; SE +/- 0.26, N = 3. Results: 1157.11, 1156.97, 1156.87, 1156.22, 1155.67.
Timed Godot Game Engine Compilation 3.2.3 - Time To Compile (Seconds, Fewer Is Better). Runs: 3, 5, 1, 2, 4. SE +/- 0.11, N = 3; SE +/- 0.11, N = 3; SE +/- 0.06, N = 3; SE +/- 0.29, N = 3; SE +/- 0.11, N = 3. Results: 98.87, 98.87, 98.89, 98.93, 98.96.
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (Frames / Sec, More Is Better). Runs: 4, 2, 3, 5, 1. SE +/- 0.04, N = 3; SE +/- 0.02, N = 3; SE +/- 0.01, N = 3; SE +/- 0.03, N = 3; SE +/- 0.05, N = 3. Results: 83.01, 83.00, 82.98, 82.97, 82.97.
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better). Runs: 4, 2, 3, 1, 5. SE +/- 0.39, N = 3; SE +/- 0.16, N = 3; SE +/- 0.14, N = 3; SE +/- 0.51, N = 3; SE +/- 0.27, N = 3. Results: 865.03, 864.94, 864.80, 864.68, 864.65.
ASKAP 1.0 - Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better). Runs: 5, 4, 3, 2, 1. SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3. Results: 3973.97, 3973.97, 3973.97, 3973.97, 3973.97. 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench 1.8 - Test: XZ 0 - Process: Compression (MB/s, More Is Better). Runs: 5, 4, 3, 2, 1. SE +/- 0.33, N = 3. Results: 42, 42, 42, 42, 42. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better). Runs: 1, 4, 2, 3, 5. SE +/- 57.46, N = 3; SE +/- 109.17, N = 3; SE +/- 133.82, N = 12; SE +/- 173.63, N = 9; SE +/- 212.04, N = 12. Results: 7531, 7457, 7249, 7080, 6939. 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6 - Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better). Runs: 4, 3, 5, 2, 1. SE +/- 13.34, N = 12; SE +/- 13.29, N = 12; SE +/- 11.09, N = 12; SE +/- 11.30, N = 12; SE +/- 5.46, N = 3. Results: 659, 658, 652, 649, 630. 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Mobile Neural Network 1.1.1 - Model: inception-v3 (ms, Fewer Is Better). Runs: 1, 3, 2, 4, 5. SE +/- 1.80, N = 5; SE +/- 2.44, N = 3; SE +/- 0.10, N = 3; SE +/- 0.11, N = 3; SE +/- 0.04, N = 3. Results: 44.16, 45.97, 48.33, 48.38, 48.64. MIN: 40.29 / MAX: 51.28; MIN: 40.58 / MAX: 48.74; MIN: 47.9 / MAX: 50.26; MIN: 47.91 / MAX: 48.97; MIN: 48.25 / MAX: 49.28. 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1 - Model: MobileNetV2_224 (ms, Fewer Is Better). Runs: 1, 3, 4, 5, 2. SE +/- 0.129, N = 5; SE +/- 0.205, N = 3; SE +/- 0.041, N = 3; SE +/- 0.008, N = 3; SE +/- 0.086, N = 3. Results: 4.201, 4.348, 4.513, 4.546, 4.634. MIN: 3.68 / MAX: 4.8; MIN: 3.62 / MAX: 4.88; MIN: 4.22 / MAX: 4.96; MIN: 4.28 / MAX: 4.91; MIN: 4.2 / MAX: 4.99. 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better). Runs: 1, 3, 4, 2, 5. SE +/- 0.370, N = 12; SE +/- 0.361, N = 12; SE +/- 0.334, N = 15; SE +/- 0.433, N = 15; SE +/- 0.306, N = 15. Results: 9.483, 9.408, 9.099, 9.084, 8.901. 1. (CXX) g++ options: -O3 -pthread -lm
NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better). Runs: 1, 3, 2, 5, 4. SE +/- 23.92, N = 15; SE +/- 51.69, N = 12; SE +/- 51.51, N = 15; SE +/- 46.79, N = 15; SE +/- 32.10, N = 15. Results: 2081.02, 2033.28, 2001.11, 1999.52, 1997.31. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Etcpak 0.7 - Configuration: ETC1 + Dithering (Mpx/s, More Is Better). Runs: 5, 1, 2, 3, 4. SE +/- 0.09, N = 3; SE +/- 0.86, N = 3; SE +/- 0.32, N = 3; SE +/- 0.60, N = 3; SE +/- 7.07, N = 15. Results: 303.89, 303.57, 303.51, 303.11, 290.73. 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better). Runs: 2, 1, 3, 5, 4. SE +/- 0.39, N = 3; SE +/- 0.37, N = 3; SE +/- 0.27, N = 3; SE +/- 0.34, N = 3; SE +/- 12.92, N = 11. Results: 481.83, 481.69, 481.42, 481.35, 468.46.
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080 (Frames / Sec, More Is Better). Runs: 2, 5, 3, 1, 4. SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.13, N = 11. Results: 4.81, 4.80, 4.80, 4.80, 4.67.
IOR 3.3.0 - Block Size: 8MB - Disk Target: Default Test Directory (MB/s, More Is Better). Runs: 2, 4, 1, 5, 3. SE +/- 5.42, N = 15; SE +/- 4.21, N = 14; SE +/- 25.39, N = 15; SE +/- 6.12, N = 5; SE +/- 4.96, N = 15. Results: 530.83, 527.45, 525.44, 510.05, 354.01. MIN: 248.03 / MAX: 1386.96; MIN: 251.24 / MAX: 1378.27; MIN: 290.23 / MAX: 1447.59; MIN: 222.93 / MAX: 1266.18; MIN: 189.98 / MAX: 1385.03. 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR 3.3.0 - Block Size: 4MB - Disk Target: Default Test Directory (MB/s, More Is Better). Runs: 1, 4, 5, 3, 2. SE +/- 1.87, N = 3; SE +/- 6.63, N = 15; SE +/- 5.81, N = 6; SE +/- 45.69, N = 12; SE +/- 5.42, N = 15. Results: 887.94, 493.98, 478.01, 467.76, 466.30. MIN: 629.82 / MAX: 1334.23; MIN: 240.44 / MAX: 1344.87; MIN: 236.85 / MAX: 1266.85; MIN: 188.74 / MAX: 1435.44; MIN: 212.95 / MAX: 1329.58. 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR 3.3.0 - Block Size: 2MB - Disk Target: Default Test Directory (MB/s, More Is Better). Runs: 1, 5, 3, 4, 2. SE +/- 5.79, N = 3; SE +/- 7.25, N = 15; SE +/- 7.93, N = 15; SE +/- 10.65, N = 12; SE +/- 4.65, N = 3. Results: 592.90, 437.60, 429.36, 419.08, 396.56. MIN: 431.01 / MAX: 1028.65; MIN: 162.6 / MAX: 1036.24; MIN: 220.22 / MAX: 1149.54; MIN: 152.75 / MAX: 1057.83; MIN: 214.64 / MAX: 1028.95. 1. (CC) gcc options: -O2 -lm -pthread -lmpi
Phoronix Test Suite v10.8.5