7980XE Feb 2921
Intel Core i9-7980XE testing with an ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102162-HA-7980XEFEB73&gru&rdt .
7980XE Feb 2921
Runs: 1, 2, 3, 4, 5

Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)
Motherboard: ASUS PRIME X299-A (2002 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 16GB
Disk: Samsung SSD 970 EVO 500GB
Graphics: Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz)
Audio: Realtek ALC1220
Monitor: G237HL
Network: Intel I219-V
OS: Ubuntu 20.10
Kernel: 5.8.0-36-generic (x86_64)
Desktop: GNOME Shell 3.38.1
Display Server: X Server 1.20.9
OpenGL: 4.6 Mesa 20.2.6 (LLVM 11.0.0)
Vulkan: 1.2.131
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08
Graphics Details - GLAMOR
Python Details - Python 3.8.6
Security Details:
  itlb_multihit: KVM: Mitigation of VMX disabled
  l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
  mds: Mitigation of Clear buffers; SMT vulnerable
  meltdown: Mitigation of PTI
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
  srbds: Not affected
  tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
7980XE Feb 2921

Test | 1 | 2 | 3 | 4 | 5
paraview: Many Spheres - 1920 x 1080 | 4.80 | 4.81 | 4.80 | 4.67 | 4.80
paraview: Wavelet Volume - 1920 x 1080 | 72.31 | 72.26 | 72.23 | 72.32 | 72.31
paraview: Wavelet Contour - 1920 x 1080 | 82.97 | 83.00 | 82.98 | 83.01 | 82.97
rav1e: 1 | 0.358 | 0.359 | 0.359 | 0.360 | 0.359
rav1e: 5 | 0.975 | 0.976 | 0.975 | 0.976 | 0.977
rav1e: 6 | 1.269 | 1.272 | 1.270 | 1.269 | 1.274
rav1e: 10 | 2.677 | 2.675 | 2.694 | 2.679 | 2.676
onnx: yolov4 - OpenMP CPU | 553 | 555 | 557 | 555 | 555
onnx: bertsquad-10 - OpenMP CPU | 630 | 649 | 658 | 659 | 652
onnx: fcn-resnet101-11 - OpenMP CPU | 143 | 145 | 144 | 144 | 145
onnx: shufflenet-v2-10 - OpenMP CPU | 9436 | 9471 | 9541 | 9533 | 9519
onnx: super-resolution-10 - OpenMP CPU | 7531 | 7249 | 7080 | 7457 | 6939
askap: Hogbom Clean OpenMP | 376.885 | 377.362 | 377.835 | 378.788 | 378.802
ior: 2MB - Default Test Directory | 592.90 | 396.56 | 429.36 | 419.08 | 437.60
ior: 4MB - Default Test Directory | 887.94 | 466.30 | 467.76 | 493.98 | 478.01
ior: 8MB - Default Test Directory | 525.44 | 530.83 | 354.01 | 527.45 | 510.05
ior: 16MB - Default Test Directory | 481.36 | 484.06 | 442.47 | 500.39 | 506.02
ior: 32MB - Default Test Directory | 493.82 | 474.62 | 470.37 | 484.50 | 516.94
ior: 64MB - Default Test Directory | 492.85 | 480.71 | 489.25 | 490.66 | 493.41
lzbench: XZ 0 - Compression | 42 | 42 | 42 | 42 | 42
lzbench: XZ 0 - Decompression | 115 | 115 | 114 | 115 | 114
lzbench: Zstd 1 - Compression | 495 | 495 | 496 | 496 | 494
lzbench: Zstd 1 - Decompression | 1767 | 1767 | 1772 | 1771 | 1770
lzbench: Zstd 8 - Compression | 85 | 86 | 85 | 86 | 85
lzbench: Zstd 8 - Decompression | 1792 | 1801 | 1795 | 1803 | 1800
lzbench: Crush 0 - Compression | 111 | 109 | 110 | 109 | 110
lzbench: Crush 0 - Decompression | 483 | 484 | 484 | 483 | 483
lzbench: Brotli 0 - Compression | 469 | 470 | 472 | 469 | 470
lzbench: Brotli 0 - Decompression | 613 | 612 | 612 | 612 | 611
lzbench: Brotli 2 - Compression | 189 | 189 | 189 | 188 | 189
lzbench: Brotli 2 - Decompression | 711 | 711 | 709 | 708 | 709
lzbench: Libdeflate 1 - Compression | 237 | 238 | 238 | 237 | 237
quantlib: | 2222.4 | 2222.6 | 2207.1 | 2215.6 | 2216.0
askap: tConvolve MT - Gridding | 1815.85 | 1820.90 | 1813.56 | 1821.82 | 1821.36
askap: tConvolve MT - Degridding | 2618.53 | 2622.75 | 2633.11 | 2633.32 | 2609.75
askap: tConvolve OpenMP - Gridding | 2458.07 | 2420.91 | 2428.72 | 2420.64 | 2442.85
askap: tConvolve OpenMP - Degridding | 3973.97 | 3973.97 | 3973.97 | 3973.97 | 3973.97
paraview: Many Spheres - 1920 x 1080 | 481.692 | 481.834 | 481.420 | 468.456 | 481.347
paraview: Wavelet Contour - 1920 x 1080 | 864.682 | 864.937 | 864.804 | 865.027 | 864.654
paraview: Wavelet Volume - 1920 x 1080 | 1156.973 | 1156.216 | 1155.674 | 1157.110 | 1156.869
jpegxl: PNG - 5 | 52.50 | 53.03 | 53.44 | 53.11 | 53.69
jpegxl: PNG - 7 | 7.86 | 7.93 | 7.93 | 7.92 | 7.91
jpegxl: PNG - 8 | 0.71 | 0.70 | 0.70 | 0.70 | 0.70
jpegxl: JPEG - 5 | 48.64 | 49.08 | 48.89 | 48.85 | 49.13
jpegxl: JPEG - 7 | 48.50 | 49.22 | 49.41 | 48.85 | 49.18
jpegxl: JPEG - 8 | 23.19 | 23.25 | 23.43 | 23.37 | 23.37
jpegxl-decode: 1 | 31.81 | 32.39 | 32.26 | 32.17 | 32.30
jpegxl-decode: All | 175.22 | 178.23 | 177.88 | 178.61 | 178.47
etcpak: DXT1 | 1365.386 | 1387.302 | 1385.842 | 1376.344 | 1377.894
etcpak: ETC1 | 315.616 | 313.703 | 314.662 | 314.271 | 313.693
etcpak: ETC2 | 183.995 | 182.977 | 183.424 | 182.722 | 183.118
etcpak: ETC1 + Dithering | 303.570 | 303.514 | 303.105 | 290.726 | 303.890
gromacs: water_GMX50_bare | 1.699 | 1.698 | 1.698 | 1.700 | 1.703
lammps: 20k Atoms | 11.176 | 11.210 | 11.195 | 11.205 | 11.216
lammps: Rhodopsin Protein | 9.483 | 9.084 | 9.408 | 9.099 | 8.901
redis: LPOP | 2698309.25 | 1769937.88 | 1797918.46 | 1761464.25 | 1770711.21
redis: SADD | 2153944.42 | 2168410.90 | 2159575.21 | 2158064.17 | 2179924.50
redis: LPUSH | 1712504.00 | 1642807.13 | 1695317.29 | 1699000.00 | 1682254.45
redis: GET | 2539377.42 | 2415143.75 | 2431811.58 | 2442355.92 | 2443276.75
redis: SET | 1912809.71 | 1952727.25 | 1889099.97 | 1906086.42 | 1923099.42
npb: EP.C | 2081.02 | 2001.11 | 2033.28 | 1997.31 | 1999.52
npb: EP.D | 2140.50 | 2241.41 | 2202.89 | 2112.54 | 2214.08
npb: LU.C | 47323.88 | 47734.40 | 47778.76 | 47482.70 | 47360.36
synthmark: VoiceMark_100 | 553.640 | 555.295 | 553.553 | 548.556 | 552.792
v-ray: CPU | 17828 | 17872 | 17842 | 17742 | 17849
lulesh: | 6695.0413 | 6707.8596 | 6678.4880 | 6697.3184 | 6708.2322
pennant: sedovbig | 53.14447 | 52.66769 | 52.70973 | 52.76675 | 52.84683
pennant: leblancbig | 34.29865 | 33.68917 | 33.55896 | 33.40175 | 33.93533
financebench: Repo OpenMP | 39726.164063 | 39461.821614 | 39681.566406 | 39350.223958 | 39275.548177
financebench: Bonds OpenMP | 55767.164062 | 55622.968750 | 55666.716146 | 55656.115885 | 55876.180990
mnn: SqueezeNetV1.0 | 7.108 | 7.232 | 7.355 | 7.281 | 7.272
mnn: resnet-v2-50 | 37.184 | 37.345 | 37.283 | 37.607 | 37.338
mnn: MobileNetV2_224 | 4.201 | 4.634 | 4.348 | 4.513 | 4.546
mnn: mobilenet-v1-1.0 | 2.676 | 2.618 | 2.686 | 2.662 | 2.645
mnn: inception-v3 | 44.157 | 48.330 | 45.970 | 48.375 | 48.640
tnn: CPU - MobileNet v2 | 361.235 | 360.923 | 361.454 | 361.425 | 361.488
tnn: CPU - SqueezeNet v1.1 | 316.358 | 319.049 | 317.136 | 318.208 | 319.314
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 85.54 | 84.88 | 84.21 | 84.27 | 85.20
build-godot: Time To Compile | 98.889 | 98.932 | 98.872 | 98.964 | 98.873
gcrypt: | 212.796 | 212.057 | 211.787 | 211.895 | 212.963
ngspice: C2670 | 155.596 | 151.978 | 153.061 | 154.214 | 153.099
ngspice: C7552 | 138.661 | 137.582 | 139.573 | 139.032 | 139.017
webp2: Default | 3.369 | 3.363 | 3.325 | 3.347 | 3.350
webp2: Quality 75, Compression Effort 7 | 155.740 | 156.550 | 157.528 | 154.880 | 156.753
webp2: Quality 95, Compression Effort 7 | 282.047 | 280.636 | 282.575 | 279.573 | 283.331
webp2: Quality 100, Compression Effort 5 | 9.195 | 9.110 | 9.075 | 9.309 | 9.216
webp2: Quality 100, Lossless Compression | 593.066 | 590.293 | 596.408 | 593.530 | 590.211
gnupg: 2.7GB Sample File Encryption | 67.390 | 67.175 | 67.124 | 67.151 | 67.102
qmcpack: simple-H2O | 38.340 | 39.048 | 38.489 | 39.795 | 38.621
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  Runs 1-5: 4.80, 4.81, 4.80, 4.67, 4.80
  SE +/- (runs 1-5): 0.00 (N = 3), 0.00 (N = 3), 0.00 (N = 3), 0.13 (N = 11), 0.00 (N = 3)
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  Runs 1-5: 72.31, 72.26, 72.23, 72.32, 72.31
  SE +/- (runs 1-5): 0.03, 0.02, 0.02, 0.04, 0.06 (N = 3 each)
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  Runs 1-5: 82.97, 83.00, 82.98, 83.01, 82.97
  SE +/- (runs 1-5): 0.05, 0.02, 0.01, 0.04, 0.03 (N = 3 each)
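The "SE +/- x, N = y" figures attached to each result can be turned back into an approximate run-to-run standard deviation. The sketch below does this for the ParaView Many Spheres result of run 4 (4.67 Frames / Sec, SE 0.13, N = 11). It assumes SE is the standard error of the mean, that is SD / sqrt(N); the export itself does not define the term, and `std_dev_from_se` and `relative_se` are hypothetical helper names, not Phoronix Test Suite functions.

```python
import math


def std_dev_from_se(se, n):
    """Approximate sample standard deviation, assuming se = sd / sqrt(n)."""
    return se * math.sqrt(n)


def relative_se(se, mean):
    """Standard error as a fraction of the reported mean."""
    return se / mean


# ParaView Many Spheres (Frames / Sec), run 4: mean 4.67, SE 0.13, N = 11
run4_sd = std_dev_from_se(0.13, 11)
run4_rel_se = relative_se(0.13, 4.67)

print(f"approx. SD: {run4_sd:.3f} frames/sec")
print(f"relative SE: {run4_rel_se:.2%}")
```

Because the SE in the export is rounded to two decimals, the recovered standard deviation is only a rough figure.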
rav1e 0.4 - Speed: 1 (Frames Per Second, More Is Better)
  Runs 1-5: 0.358, 0.359, 0.359, 0.360, 0.359
  SE +/- (runs 1-5): 0.000, 0.000, 0.000, 0.000, 0.000 (N = 3 each)
rav1e 0.4 - Speed: 5 (Frames Per Second, More Is Better)
  Runs 1-5: 0.975, 0.976, 0.975, 0.976, 0.977
  SE +/- (runs 1-5): 0.002, 0.003, 0.002, 0.001, 0.001 (N = 3 each)
rav1e 0.4 - Speed: 6 (Frames Per Second, More Is Better)
  Runs 1-5: 1.269, 1.272, 1.270, 1.269, 1.274
  SE +/- (runs 1-5): 0.001, 0.001, 0.001, 0.002, 0.001 (N = 3 each)
rav1e 0.4 - Speed: 10 (Frames Per Second, More Is Better)
  Runs 1-5: 2.677, 2.675, 2.694, 2.679, 2.676
  SE +/- (runs 1-5): 0.006, 0.012, 0.005, 0.005, 0.007 (N = 3 each)
ONNX Runtime 1.6 - Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Runs 1-5: 553, 555, 557, 555, 555
  SE +/- (runs 1-5): 1.09, 0.29, 2.25, 1.64, 0.88 (N = 3 each)
ONNX Runtime 1.6 - Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Runs 1-5: 630, 649, 658, 659, 652
  SE +/- (runs 1-5): 5.46 (N = 3), 11.30 (N = 12), 13.29 (N = 12), 13.34 (N = 12), 11.09 (N = 12)
ONNX Runtime 1.6 - Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Runs 1-5: 143, 145, 144, 144, 145
  SE +/- (runs 1-5): 0.44, 0.33, 0.87, 0.44, 0.17 (N = 3 each)
ONNX Runtime 1.6 - Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Runs 1-5: 9436, 9471, 9541, 9533, 9519
  SE +/- (runs 1-5): 35.99, 30.21, 36.96, 46.43, 54.82 (N = 3 each)
ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Runs 1-5: 7531, 7249, 7080, 7457, 6939
  SE +/- (runs 1-5): 57.46 (N = 3), 133.82 (N = 12), 173.63 (N = 9), 109.17 (N = 3), 212.04 (N = 12)
All ONNX Runtime 1.6 results: (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ASKAP 1.0 - Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
  Runs 1-5: 376.89, 377.36, 377.84, 378.79, 378.80
  SE +/- (runs 1-5): 0.47, 0.82, 0.48, 0.00, 1.66 (N = 3 each)
  (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
IOR 3.3.0 - Block Size: 2MB - Disk Target: Default Test Directory (MB/s, More Is Better)
  Runs 1-5: 592.90, 396.56, 429.36, 419.08, 437.60
  SE +/- (runs 1-5): 5.79 (N = 3), 4.65 (N = 3), 7.93 (N = 15), 10.65 (N = 12), 7.25 (N = 15)
  MIN / MAX (runs 1-5): 431.01 / 1028.65, 214.64 / 1028.95, 220.22 / 1149.54, 152.75 / 1057.83, 162.6 / 1036.24
IOR 3.3.0 - Block Size: 4MB - Disk Target: Default Test Directory (MB/s, More Is Better)
  Runs 1-5: 887.94, 466.30, 467.76, 493.98, 478.01
  SE +/- (runs 1-5): 1.87 (N = 3), 5.42 (N = 15), 45.69 (N = 12), 6.63 (N = 15), 5.81 (N = 6)
  MIN / MAX (runs 1-5): 629.82 / 1334.23, 212.95 / 1329.58, 188.74 / 1435.44, 240.44 / 1344.87, 236.85 / 1266.85
IOR 3.3.0 - Block Size: 8MB - Disk Target: Default Test Directory (MB/s, More Is Better)
  Runs 1-5: 525.44, 530.83, 354.01, 527.45, 510.05
  SE +/- (runs 1-5): 25.39 (N = 15), 5.42 (N = 15), 4.96 (N = 15), 4.21 (N = 14), 6.12 (N = 5)
  MIN / MAX (runs 1-5): 290.23 / 1447.59, 248.03 / 1386.96, 189.98 / 1385.03, 251.24 / 1378.27, 222.93 / 1266.18
IOR 3.3.0 - Block Size: 16MB - Disk Target: Default Test Directory (MB/s, More Is Better)
  Runs 1-5: 481.36, 484.06, 442.47, 500.39, 506.02
  SE +/- (runs 1-5): 6.27 (N = 3), 5.80 (N = 15), 6.54 (N = 3), 1.65 (N = 3), 7.12 (N = 3)
  MIN / MAX (runs 1-5): 316.02 / 1504.02, 224.25 / 1505.62, 217.24 / 1379.75, 315.3 / 1210.23, 308.33 / 1247.46
IOR 3.3.0 - Block Size: 32MB - Disk Target: Default Test Directory (MB/s, More Is Better)
  Runs 1-5: 493.82, 474.62, 470.37, 484.50, 516.94
  SE +/- (runs 1-5): 3.89 (N = 3), 4.72 (N = 8), 5.73 (N = 3), 4.09 (N = 12), 4.67 (N = 3)
  MIN / MAX (runs 1-5): 245.23 / 1245.29, 202.58 / 1355.46, 197.76 / 1370.54, 196.66 / 1538.21, 413.96 / 1183.64
IOR 3.3.0 - Block Size: 64MB - Disk Target: Default Test Directory (MB/s, More Is Better)
  Runs 1-5: 492.85, 480.71, 489.25, 490.66, 493.41
  SE +/- (runs 1-5): 3.92, 2.46, 4.79, 4.68, 6.10 (N = 3 each)
  MIN / MAX (runs 1-5): 402.07 / 1345.65, 301.17 / 1079.47, 374.86 / 1033.08, 348.43 / 1036.5, 246.91 / 1088.06
All IOR 3.3.0 results: (CC) gcc options: -O2 -lm -pthread -lmpi
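One rough way to judge whether two IOR runs really differ is to express the gap between their means in units of the combined standard error. The sketch below applies that to the 2MB result (run 1 vs run 2) and the 64MB result (run 1 vs run 5), using the values reported above. `se_gap` is a hypothetical helper, not part of IOR or the Phoronix Test Suite, and it assumes the runs are independent and that SE is the standard error of the mean.

```python
import math


def se_gap(mean_a, se_a, mean_b, se_b):
    """Difference between two means, in units of their combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / combined_se


# IOR 2MB block size (MB/s): run 1 = 592.90 +/- 5.79, run 2 = 396.56 +/- 4.65
gap_2mb = se_gap(592.90, 5.79, 396.56, 4.65)

# IOR 64MB block size (MB/s): run 1 = 492.85 +/- 3.92, run 5 = 493.41 +/- 6.10
gap_64mb = se_gap(492.85, 3.92, 493.41, 6.10)

print(f"2MB, run 1 vs run 2: {gap_2mb:.1f} combined SEs apart")
print(f"64MB, run 1 vs run 5: {gap_64mb:.2f} combined SEs apart")
```

Under these assumptions a gap of many combined SEs (the 2MB case) points to a real difference between runs, while a gap well below one (the 64MB case) is indistinguishable from noise. The wide MIN / MAX ranges suggest the individual samples vary a lot, so this is a coarse check at best.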
lzbench 1.8 - Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  Runs 1-5: 42, 42, 42, 42, 42
  SE +/- (as listed, not shown for every run): 0.33 (N = 3)
lzbench 1.8 - Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  Runs 1-5: 115, 115, 114, 115, 114
  SE +/- (as listed, not shown for every run): 0.33 (N = 3), 0.33 (N = 3)
lzbench 1.8 - Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
  Runs 1-5: 495, 495, 496, 496, 494
  SE +/- (runs 1-5): 2.19, 2.00, 2.67, 3.67, 3.67 (N = 3 each)
lzbench 1.8 - Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
  Runs 1-5: 1767, 1767, 1772, 1771, 1770
  SE +/- (as listed, not shown for every run): 6.00 (N = 3), 3.06 (N = 3), 1.76 (N = 3), 1.73 (N = 3)
lzbench 1.8 - Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
  Runs 1-5: 85, 86, 85, 86, 85
lzbench 1.8 - Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
  Runs 1-5: 1792, 1801, 1795, 1803, 1800
  SE +/- (as listed, not shown for every run): 3.33 (N = 3), 1.86 (N = 3), 3.38 (N = 3), 3.71 (N = 3)
lzbench 1.8 - Test: Crush 0 - Process: Compression (MB/s, More Is Better)
  Runs 1-5: 111, 109, 110, 109, 110
  SE +/- (as listed, not shown for every run): 0.58 (N = 3), 1.00 (N = 3), 0.67 (N = 3)
lzbench 1.8 - Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
  Runs 1-5: 483, 484, 484, 483, 483
  SE +/- (runs 1-5): 0.67, 1.53, 0.88, 1.15, 1.86 (N = 3 each)
lzbench 1.8 - Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
  Runs 1-5: 469, 470, 472, 469, 470
  SE +/- (runs 1-5): 0.88, 1.67, 1.20, 1.33, 0.88 (N = 3 each)
lzbench 1.8 - Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
  Runs 1-5: 613, 612, 612, 612, 611
  SE +/- (as listed, not shown for every run): 1.20 (N = 3), 2.73 (N = 3), 1.00 (N = 3), 0.67 (N = 3)
lzbench 1.8 - Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
  Runs 1-5: 189, 189, 189, 188, 189
  SE +/- (as listed, not shown for every run): 0.33 (N = 3), 0.67 (N = 3)
lzbench 1.8 - Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
  Runs 1-5: 711, 711, 709, 708, 709
  SE +/- (as listed, not shown for every run): 0.88 (N = 3)
lzbench 1.8 - Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
  Runs 1-5: 237, 238, 238, 237, 237
  SE +/- (as listed, not shown for every run): 0.58 (N = 3)
All lzbench 1.8 results: (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
QuantLib 1.21 (MFLOPS, More Is Better)
  Runs 1-5: 2222.4, 2222.6, 2207.1, 2215.6, 2216.0
  SE +/- (runs 1-5): 17.98 (N = 13), 19.83 (N = 12), 27.77 (N = 12), 21.07 (N = 12), 28.27 (N = 12)
  (CXX) g++ options: -O3 -march=native -rdynamic
ASKAP 1.0 - Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
  Runs 1-5: 1815.85, 1820.90, 1813.56, 1821.82, 1821.36
  SE +/- (runs 1-5): 2.23, 0.35, 2.62, 0.61, 0.31 (N = 3 each)
ASKAP 1.0 - Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
  Runs 1-5: 2618.53, 2622.75, 2633.11, 2633.32, 2609.75
  SE +/- (runs 1-5): 10.95, 4.46, 7.85, 4.51, 11.98 (N = 3 each)
ASKAP 1.0 - Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
  Runs 1-5: 2458.07, 2420.91, 2428.72, 2420.64, 2442.85
  SE +/- (runs 1-5): 19.92, 22.21, 32.33, 12.71, 12.94 (N = 3 each)
ASKAP 1.0 - Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
  Runs 1-5: 3973.97, 3973.97, 3973.97, 3973.97, 3973.97
  SE +/- (runs 1-5): 0.00, 0.00, 0.00, 0.00, 0.00 (N = 3 each)
All ASKAP 1.0 tConvolve results: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better)
  Runs 1-5: 481.69, 481.83, 481.42, 468.46, 481.35
  SE +/- (runs 1-5): 0.37 (N = 3), 0.39 (N = 3), 0.27 (N = 3), 12.92 (N = 11), 0.34 (N = 3)
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better)
  Runs 1-5: 864.68, 864.94, 864.80, 865.03, 864.65
  SE +/- (runs 1-5): 0.51, 0.16, 0.14, 0.39, 0.27 (N = 3 each)
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (MiVoxels / Sec, More Is Better)
  Runs 1-5: 1156.97, 1156.22, 1155.67, 1157.11, 1156.87
  SE +/- (runs 1-5): 0.52, 0.36, 0.26, 0.60, 0.98 (N = 3 each)
JPEG XL 0.3.1 - Input: PNG - Encode Speed: 5 (MP/s, More Is Better)
  Runs 1-5: 52.50, 53.03, 53.44, 53.11, 53.69
  SE +/- (runs 1-5): 0.14, 0.15, 0.33, 0.16, 0.13 (N = 3 each)
JPEG XL 0.3.1 - Input: PNG - Encode Speed: 7 (MP/s, More Is Better)
  Runs 1-5: 7.86, 7.93, 7.93, 7.92, 7.91
  SE +/- (runs 1-5): 0.02, 0.01, 0.02, 0.02, 0.01 (N = 3 each)
JPEG XL 0.3.1 - Input: PNG - Encode Speed: 8 (MP/s, More Is Better)
  Runs 1-5: 0.71, 0.70, 0.70, 0.70, 0.70
  SE +/- (runs 1-5): 0.00, 0.01, 0.00, 0.00, 0.00 (N = 3 each)
JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 5 (MP/s, More Is Better)
  Runs 1-5: 48.64, 49.08, 48.89, 48.85, 49.13
  SE +/- (runs 1-5): 0.07, 0.26, 0.03, 0.07, 0.09 (N = 3 each)
JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 7 (MP/s, More Is Better)
  Runs 1-5: 48.50, 49.22, 49.41, 48.85, 49.18
  SE +/- (runs 1-5): 0.11, 0.28, 0.07, 0.19, 0.14 (N = 3 each)
JPEG XL 0.3.1 - Input: JPEG - Encode Speed: 8 (MP/s, More Is Better)
  Runs 1-5: 23.19, 23.25, 23.43, 23.37, 23.37
  SE +/- (runs 1-5): 0.05, 0.08, 0.09, 0.09, 0.14 (N = 3 each)
All JPEG XL 0.3.1 encode results: (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl
JPEG XL Decoding 0.3.1 - CPU Threads: 1 (MP/s, More Is Better)
  Runs 1-5: 31.81, 32.39, 32.26, 32.17, 32.30
  SE +/- (runs 1-5): 0.03, 0.04, 0.06, 0.04, 0.08 (N = 3 each)
JPEG XL Decoding 0.3.1 - CPU Threads: All (MP/s, More Is Better)
  Runs 1-5: 175.22, 178.23, 177.88, 178.61, 178.47
  SE +/- (runs 1-5): 0.25, 0.25, 0.10, 0.45, 0.19 (N = 3 each)
Etcpak 0.7 - Configuration: DXT1 (Mpx/s, More Is Better)
  Runs 1-5: 1365.39, 1387.30, 1385.84, 1376.34, 1377.89
  SE +/- (runs 1-5): 13.09, 3.63, 4.38, 3.03, 2.52 (N = 3 each)
Etcpak 0.7 - Configuration: ETC1 (Mpx/s, More Is Better)
  Runs 1-5: 315.62, 313.70, 314.66, 314.27, 313.69
  SE +/- (runs 1-5): 0.30, 0.85, 1.01, 0.49, 0.49 (N = 3 each)
Etcpak 0.7 - Configuration: ETC2 (Mpx/s, More Is Better)
  Runs 1-5: 184.00, 182.98, 183.42, 182.72, 183.12
  SE +/- (runs 1-5): 0.08, 0.28, 0.16, 0.12, 0.23 (N = 3 each)
Etcpak 0.7 - Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
  Runs 1-5: 303.57, 303.51, 303.11, 290.73, 303.89
  SE +/- (runs 1-5): 0.86 (N = 3), 0.32 (N = 3), 0.60 (N = 3), 7.07 (N = 15), 0.09 (N = 3)
All Etcpak 0.7 results: (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
GROMACS 2021 - Input: water_GMX50_bare (Ns Per Day, More Is Better)
  Runs 1-5: 1.699, 1.698, 1.698, 1.700, 1.703
  SE +/- (runs 1-5): 0.003, 0.002, 0.002, 0.003, 0.002 (N = 3 each)
  (CXX) g++ options: -O3 -pthread
LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: 20k Atoms (ns/day, More Is Better)
  Runs 1-5: 11.18, 11.21, 11.20, 11.21, 11.22
  SE +/- (runs 1-5): 0.03, 0.01, 0.05, 0.02, 0.03 (N = 3 each)
LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
  Runs 1-5: 9.483, 9.084, 9.408, 9.099, 8.901
  SE +/- (runs 1-5): 0.370 (N = 12), 0.433 (N = 15), 0.361 (N = 12), 0.334 (N = 15), 0.306 (N = 15)
All LAMMPS results: (CXX) g++ options: -O3 -pthread -lm
Redis 6.0.9 - Test: LPOP (Requests Per Second, More Is Better)
  Runs 1-5: 2698309.25, 1769937.88, 1797918.46, 1761464.25, 1770711.21
  SE +/- (runs 1-5): 29192.48, 2566.99, 19169.69, 15108.16, 18070.29 (N = 3 each)
Redis 6.0.9 - Test: SADD (Requests Per Second, More Is Better)
  Runs 1-5: 2153944.42, 2168410.90, 2159575.21, 2158064.17, 2179924.50
  SE +/- (runs 1-5): 18441.13 (N = 3), 27569.90 (N = 5), 34602.53 (N = 3), 20189.91 (N = 3), 11360.66 (N = 3)
Redis 6.0.9 - Test: LPUSH (Requests Per Second, More Is Better)
  Runs 1-5: 1712504.00, 1642807.13, 1695317.29, 1699000.00, 1682254.45
  SE +/- (runs 1-5): 10755.06, 13174.46, 9034.55, 10458.55, 10827.27 (N = 3 each)
Redis 6.0.9 - Test: GET (Requests Per Second, More Is Better)
  Runs 1-5: 2539377.42, 2415143.75, 2431811.58, 2442355.92, 2443276.75
  SE +/- (runs 1-5): 22779.82, 31640.05, 17482.31, 18752.77, 23178.69 (N = 3 each)
Redis 6.0.9 - Test: SET (Requests Per Second, More Is Better)
  Runs 1-5: 1912809.71, 1952727.25, 1889099.97, 1906086.42, 1923099.42
  SE +/- (runs 1-5): 15512.42 (N = 3), 23318.24 (N = 3), 24702.46 (N = 4), 30239.19 (N = 3), 21792.45 (N = 3)
All Redis 6.0.9 results: (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
  Runs 1-5: 2081.02, 2001.11, 2033.28, 1997.31, 1999.52
  SE +/- (runs 1-5): 23.92 (N = 15), 51.51 (N = 15), 51.69 (N = 12), 32.10 (N = 15), 46.79 (N = 15)
NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
  Runs 1-5: 2140.50, 2241.41, 2202.89, 2112.54, 2214.08
  SE +/- (runs 1-5): 10.76 (N = 3), 3.15 (N = 3), 34.24 (N = 3), 35.18 (N = 3), 28.36 (N = 4)
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
  Runs 1-5: 47323.88, 47734.40, 47778.76, 47482.70, 47360.36
  SE +/- (runs 1-5): 154.44, 7.96, 139.20, 376.58, 44.69 (N = 3 each)
All NAS Parallel Benchmarks 3.4 results: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, More Is Better)
  Runs 1-5: 553.64, 555.30, 553.55, 548.56, 552.79
  SE +/- (runs 1-5): 2.84, 1.36, 3.76, 2.23, 1.67 (N = 3 each)
  (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Chaos Group V-RAY 5 - Mode: CPU (vsamples, More Is Better)
  Runs 1-5: 17828, 17872, 17842, 17742, 17849
  SE +/- (runs 1-5): 47.06, 70.27, 6.44, 54.08, 84.18 (N = 3 each)
LULESH 2.0.3 (z/s, More Is Better)
  Runs 1-5: 6695.04, 6707.86, 6678.49, 6697.32, 6708.23
  SE +/- (runs 1-5): 21.98, 16.05, 43.78, 39.16, 30.71 (N = 3 each)
  (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
Pennant 1.0.1 - Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  Runs 1-5: 53.14, 52.67, 52.71, 52.77, 52.85
  SE +/- (runs 1-5): 0.12, 0.13, 0.36, 0.21, 0.26 (N = 3 each)
Pennant 1.0.1 - Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  Runs 1-5: 34.30, 33.69, 33.56, 33.40, 33.94
  SE +/- (runs 1-5): 0.03, 0.09, 0.13, 0.17, 0.29 (N = 3 each)
All Pennant 1.0.1 results: (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
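From Pennant onward the results are timings marked "Fewer Is Better", so a percentage comparison has to be inverted relative to the throughput results earlier in the file. The sketch below shows one way to keep the direction straight, using the Pennant timings above. `relative_gain` is a hypothetical helper introduced only for illustration.

```python
def relative_gain(baseline, candidate, higher_is_better):
    """Fractional improvement of candidate over baseline.

    A positive result means the candidate run is better, whichever
    direction the metric runs in.
    """
    if higher_is_better:
        return candidate / baseline - 1.0
    # For timings, a smaller candidate value is an improvement.
    return baseline / candidate - 1.0


# Pennant sedovbig, seconds (Fewer Is Better): run 1 = 53.14, run 2 = 52.67
sedov_gain = relative_gain(53.14, 52.67, higher_is_better=False)

# Pennant leblancbig, seconds (Fewer Is Better): run 1 = 34.30, run 4 = 33.40
leblanc_gain = relative_gain(34.30, 33.40, higher_is_better=False)

print(f"sedovbig, run 2 vs run 1: {sedov_gain:.2%}")
print(f"leblancbig, run 4 vs run 1: {leblanc_gain:.2%}")
```

The same helper with `higher_is_better=True` applies to the MB/s, MP/s and Requests Per Second results.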
FinanceBench 2016-07-25, Benchmark: Repo OpenMP [ms, Fewer Is Better]. Run 1: 39726.16 (SE +/- 210.55, N = 3); Run 2: 39461.82 (SE +/- 76.80, N = 3); Run 3: 39681.57 (SE +/- 151.55, N = 3); Run 4: 39350.22 (SE +/- 69.89, N = 3); Run 5: 39275.55 (SE +/- 20.70, N = 3). Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench 2016-07-25, Benchmark: Bonds OpenMP [ms, Fewer Is Better]. Run 1: 55767.16 (SE +/- 86.69, N = 3); Run 2: 55622.97 (SE +/- 37.92, N = 3); Run 3: 55666.72 (SE +/- 83.09, N = 3); Run 4: 55656.12 (SE +/- 86.41, N = 3); Run 5: 55876.18 (SE +/- 53.09, N = 3). Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp
Mobile Neural Network 1.1.1, Model: SqueezeNetV1.0 [ms, Fewer Is Better]. Run 1: 7.108 (SE +/- 0.091, N = 5; MIN: 6.52 / MAX: 7.68); Run 2: 7.232 (SE +/- 0.085, N = 3; MIN: 6.81 / MAX: 7.85); Run 3: 7.355 (SE +/- 0.110, N = 3; MIN: 6.8 / MAX: 7.87); Run 4: 7.281 (SE +/- 0.070, N = 3; MIN: 6.97 / MAX: 7.74); Run 5: 7.272 (SE +/- 0.007, N = 3; MIN: 7 / MAX: 7.67). Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1, Model: resnet-v2-50 [ms, Fewer Is Better]. Run 1: 37.18 (SE +/- 0.32, N = 5; MIN: 35.67 / MAX: 38.06); Run 2: 37.35 (SE +/- 0.12, N = 3; MIN: 36.97 / MAX: 37.99); Run 3: 37.28 (SE +/- 0.11, N = 3; MIN: 36.85 / MAX: 38.11); Run 4: 37.61 (SE +/- 0.11, N = 3; MIN: 37.16 / MAX: 38.09); Run 5: 37.34 (SE +/- 0.08, N = 3; MIN: 36.95 / MAX: 37.85). Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1, Model: MobileNetV2_224 [ms, Fewer Is Better]. Run 1: 4.201 (SE +/- 0.129, N = 5; MIN: 3.68 / MAX: 4.8); Run 2: 4.634 (SE +/- 0.086, N = 3; MIN: 4.2 / MAX: 4.99); Run 3: 4.348 (SE +/- 0.205, N = 3; MIN: 3.62 / MAX: 4.88); Run 4: 4.513 (SE +/- 0.041, N = 3; MIN: 4.22 / MAX: 4.96); Run 5: 4.546 (SE +/- 0.008, N = 3; MIN: 4.28 / MAX: 4.91). Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1, Model: mobilenet-v1-1.0 [ms, Fewer Is Better]. Run 1: 2.676 (SE +/- 0.029, N = 5; MIN: 2.44 / MAX: 3.19); Run 2: 2.618 (SE +/- 0.019, N = 3; MIN: 2.5 / MAX: 2.92); Run 3: 2.686 (SE +/- 0.045, N = 3; MIN: 2.42 / MAX: 3.11); Run 4: 2.662 (SE +/- 0.010, N = 3; MIN: 2.48 / MAX: 2.98); Run 5: 2.645 (SE +/- 0.011, N = 3; MIN: 2.44 / MAX: 3.04). Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.1.1, Model: inception-v3 [ms, Fewer Is Better]. Run 1: 44.16 (SE +/- 1.80, N = 5; MIN: 40.29 / MAX: 51.28); Run 2: 48.33 (SE +/- 0.10, N = 3; MIN: 47.9 / MAX: 50.26); Run 3: 45.97 (SE +/- 2.44, N = 3; MIN: 40.58 / MAX: 48.74); Run 4: 48.38 (SE +/- 0.11, N = 3; MIN: 47.91 / MAX: 48.97); Run 5: 48.64 (SE +/- 0.04, N = 3; MIN: 48.25 / MAX: 49.28). Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN 0.2.3, Target: CPU - Model: MobileNet v2 [ms, Fewer Is Better]. Run 1: 361.24 (SE +/- 0.37, N = 3; MIN: 357.28 / MAX: 391.58); Run 2: 360.92 (SE +/- 0.27, N = 3; MIN: 356.72 / MAX: 372.66); Run 3: 361.45 (SE +/- 0.39, N = 3; MIN: 356.32 / MAX: 386.75); Run 4: 361.43 (SE +/- 0.38, N = 3; MIN: 356.24 / MAX: 380.38); Run 5: 361.49 (SE +/- 0.31, N = 3; MIN: 357.11 / MAX: 384.74). Notes: 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN 0.2.3, Target: CPU - Model: SqueezeNet v1.1 [ms, Fewer Is Better]. Run 1: 316.36 (SE +/- 0.46, N = 3; MIN: 313.3 / MAX: 320.26); Run 2: 319.05 (SE +/- 0.57, N = 3; MIN: 313 / MAX: 340.64); Run 3: 317.14 (SE +/- 0.62, N = 3; MIN: 314.18 / MAX: 323.54); Run 4: 318.21 (SE +/- 0.59, N = 3; MIN: 316.15 / MAX: 320.81); Run 5: 319.31 (SE +/- 0.43, N = 3; MIN: 316.94 / MAX: 338.61). Notes: 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
CloverLeaf, Lagrangian-Eulerian Hydrodynamics [Seconds, Fewer Is Better]. Run 1: 85.54 (SE +/- 0.18, N = 3); Run 2: 84.88 (SE +/- 0.09, N = 3); Run 3: 84.21 (SE +/- 0.18, N = 3); Run 4: 84.27 (SE +/- 0.12, N = 3); Run 5: 85.20 (SE +/- 0.08, N = 3). Notes: 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Timed Godot Game Engine Compilation 3.2.3, Time To Compile [Seconds, Fewer Is Better]. Run 1: 98.89 (SE +/- 0.06, N = 3); Run 2: 98.93 (SE +/- 0.29, N = 3); Run 3: 98.87 (SE +/- 0.11, N = 3); Run 4: 98.96 (SE +/- 0.11, N = 3); Run 5: 98.87 (SE +/- 0.11, N = 3).
Gcrypt Library 1.9 [Seconds, Fewer Is Better]. Run 1: 212.80 (SE +/- 0.63, N = 3); Run 2: 212.06 (SE +/- 0.34, N = 3); Run 3: 211.79 (SE +/- 0.06, N = 3); Run 4: 211.90 (SE +/- 0.49, N = 3); Run 5: 212.96 (SE +/- 0.29, N = 3). Notes: 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
Ngspice 34, Circuit: C2670 [Seconds, Fewer Is Better]. Run 1: 155.60 (SE +/- 2.33, N = 3); Run 2: 151.98 (SE +/- 1.75, N = 3); Run 3: 153.06 (SE +/- 1.44, N = 3); Run 4: 154.21 (SE +/- 2.23, N = 3); Run 5: 153.10 (SE +/- 1.61, N = 3). Notes: 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Ngspice 34, Circuit: C7552 [Seconds, Fewer Is Better]. Run 1: 138.66 (SE +/- 0.78, N = 3); Run 2: 137.58 (SE +/- 0.10, N = 3); Run 3: 139.57 (SE +/- 1.23, N = 3); Run 4: 139.03 (SE +/- 1.21, N = 3); Run 5: 139.02 (SE +/- 2.04, N = 3). Notes: 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
WebP2 Image Encode 20210126, Encode Settings: Default [Seconds, Fewer Is Better]. Run 1: 3.369 (SE +/- 0.034, N = 3); Run 2: 3.363 (SE +/- 0.012, N = 3); Run 3: 3.325 (SE +/- 0.027, N = 3); Run 4: 3.347 (SE +/- 0.049, N = 3); Run 5: 3.350 (SE +/- 0.057, N = 3). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode 20210126, Encode Settings: Quality 75, Compression Effort 7 [Seconds, Fewer Is Better]. Run 1: 155.74 (SE +/- 1.77, N = 3); Run 2: 156.55 (SE +/- 0.15, N = 3); Run 3: 157.53 (SE +/- 0.29, N = 3); Run 4: 154.88 (SE +/- 0.66, N = 3); Run 5: 156.75 (SE +/- 0.73, N = 3). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode 20210126, Encode Settings: Quality 95, Compression Effort 7 [Seconds, Fewer Is Better]. Run 1: 282.05 (SE +/- 1.07, N = 3); Run 2: 280.64 (SE +/- 0.54, N = 3); Run 3: 282.58 (SE +/- 2.02, N = 3); Run 4: 279.57 (SE +/- 1.22, N = 3); Run 5: 283.33 (SE +/- 2.02, N = 3). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode 20210126, Encode Settings: Quality 100, Compression Effort 5 [Seconds, Fewer Is Better]. Run 1: 9.195 (SE +/- 0.105, N = 3); Run 2: 9.110 (SE +/- 0.020, N = 3); Run 3: 9.075 (SE +/- 0.024, N = 3); Run 4: 9.309 (SE +/- 0.141, N = 3); Run 5: 9.216 (SE +/- 0.149, N = 3). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode 20210126, Encode Settings: Quality 100, Lossless Compression [Seconds, Fewer Is Better]. Run 1: 593.07 (SE +/- 1.66, N = 3); Run 2: 590.29 (SE +/- 0.45, N = 3); Run 3: 596.41 (SE +/- 0.58, N = 3); Run 4: 593.53 (SE +/- 1.92, N = 3); Run 5: 590.21 (SE +/- 0.58, N = 3). Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
GnuPG 2.2.27, 2.7GB Sample File Encryption [Seconds, Fewer Is Better]. Run 1: 67.39 (SE +/- 0.36, N = 3); Run 2: 67.18 (SE +/- 0.12, N = 3); Run 3: 67.12 (SE +/- 0.12, N = 3); Run 4: 67.15 (SE +/- 0.20, N = 3); Run 5: 67.10 (SE +/- 0.15, N = 3). Notes: 1. (CC) gcc options: -O2
QMCPACK 3.10, Input: simple-H2O [Total Execution Time - Seconds, Fewer Is Better]. Run 1: 38.34 (SE +/- 0.09, N = 3); Run 2: 39.05 (SE +/- 0.45, N = 3); Run 3: 38.49 (SE +/- 0.23, N = 3); Run 4: 39.80 (SE +/- 0.68, N = 3); Run 5: 38.62 (SE +/- 0.24, N = 3). Notes: 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Phoronix Test Suite v10.8.5