Xeon E3 1280 v5 Feb
Intel Xeon E3-1280 v5 testing with an MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS) and ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102081-HA-XEONE312889&grw&sor .
Xeon E3 1280 v5 Feb: system configuration
Result identifiers: 1, 2, 3, 4
Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (4 Cores / 8 Threads)
Motherboard: MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS)
Chipset: Intel Xeon E3-1200 v5/E3-1500
Memory: 32GB
Disk: 256GB TOSHIBA RD400
Graphics: ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP
Audio: Realtek ALC1150
Monitor: VA2431
Network: Intel I219-V
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc2daily20200826-generic (x86_64) 20200825
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
OpenGL: 4.5 Mesa 20.0.8 (LLVM 10.0.0)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080
Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: intel_pstate powersave; CPU Microcode: 0xe2; Thermald 1.9.1
Graphics Details: GLAMOR
Python Details: Python 3.8.5
Security Details:
  itlb_multihit: KVM: Mitigation of VMX disabled
  l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
  mds: Mitigation of Clear buffers; SMT vulnerable
  meltdown: Mitigation of PTI
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
  srbds: Mitigation of Microcode
  tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Xeon E3 1280 v5 Feb: result overview
Test | 1 | 2 | 3 | 4
lzbench: XZ 0 - Compression | 39 | 39 | 39 |
lzbench: XZ 0 - Decompression | 105 | 105 | 104 |
lzbench: Zstd 1 - Compression | 463 | 460 | 464 |
lzbench: Zstd 1 - Decompression | 1600 | 1606 | 1600 |
lzbench: Zstd 8 - Compression | 86 | 84 | 83 |
lzbench: Zstd 8 - Decompression | 1762 | 1770 | 1751 |
lzbench: Crush 0 - Compression | 90 | 92 | 90 |
lzbench: Crush 0 - Decompression | 463 | 463 | 463 |
lzbench: Brotli 0 - Compression | 414 | 412 | 412 |
lzbench: Brotli 0 - Decompression | 566 | 564 | 563 |
lzbench: Brotli 2 - Compression | 176 | 175 | 175 |
lzbench: Brotli 2 - Decompression | 653 | 654 | 651 |
lzbench: Libdeflate 1 - Compression | 206 | 204 | 205 |
lzbench: Libdeflate 1 - Decompression | 1020 | 1023 | 1020 |
ngspice: C2670 | 156.733 | 156.678 | 158.497 |
ngspice: C7552 | 163.555 | 162.071 | 162.599 |
etcpak: DXT1 | 1146.890 | 1147.662 | 1151.182 | 1151.055
etcpak: ETC1 | 294.895 | 294.153 | 294.895 | 293.489
etcpak: ETC2 | 167.383 | 167.999 | 168.004 | 167.773
etcpak: ETC1 + Dithering | 281.176 | 280.998 | 281.127 | 279.591
webp2: Default | 9.539 | 9.580 | 9.561 |
webp2: Quality 75, Compression Effort 7 | 565.943 | 567.330 | 566.124 |
webp2: Quality 95, Compression Effort 7 | 1042.443 | 1044.997 | 1044.924 |
webp2: Quality 100, Compression Effort 5 | 32.282 | 32.331 | 32.383 |
webp2: Quality 100, Lossless Compression | 2061.207 | 2062.846 | 2061.110 |
gcrypt: | 228.004 | 227.918 | 228.019 |
quantlib: | 2277.8 | 2279.6 | 2254.0 | 2277.8
onnx: yolov4 - OpenMP CPU | 233 | 234 | 234 |
onnx: bertsquad-10 - OpenMP CPU | 359 | 359 | 358 |
onnx: fcn-resnet101-11 - OpenMP CPU | 39 | 39 | 39 |
onnx: shufflenet-v2-10 - OpenMP CPU | 13436 | 13451 | 13426 |
onnx: super-resolution-10 - OpenMP CPU | 2818 | 2825 | 2788 |
npb: BT.C | 14085.02 | 14093.72 | 14055.30 | 14060.83
npb: CG.C | 3279.96 | 3268.09 | 3251.12 | 3258.39
npb: EP.C | 534.87 | 541.21 | 541.24 | 540.15
npb: EP.D | 538.74 | 541.67 | 542.43 |
npb: FT.C | 7406.46 | 7409.62 | 7358.83 |
npb: IS.D | 522.75 | 522.40 | 521.96 |
npb: LU.C | 15044.95 | 15039.27 | 15037.87 |
npb: MG.C | 7319.68 | 7321.00 | 7316.58 |
npb: SP.B | 4474.21 | 4466.84 | 4470.48 |
askap: tConvolve MT - Gridding | 948.093 | 948.233 | 947.811 |
askap: tConvolve MT - Degridding | 1200.93 | 1200.7 | 1200.03 |
askap: tConvolve MPI - Degridding | 1033.09 | 1033.05 | 1035.78 |
askap: tConvolve MPI - Gridding | 1174.93 | 1171.41 | 1178.57 |
askap: tConvolve OpenMP - Gridding | 956.611 | 952.050 | 953.219 |
askap: tConvolve OpenMP - Degridding | 1315.94 | 1315.94 | 1313.77 |
askap: Hogbom Clean OpenMP | 134.772 | 134.530 | 134.108 |
pennant: sedovbig | 163.7929 | 164.0432 | 163.7290 |
pennant: leblancbig | 99.69725 | 99.64131 | 99.74793 |
qmcpack: simple-H2O | 30.553 | 30.614 | 30.694 |
v-ray: CPU | 4114 | 4082 | 4149 |
financebench: Repo OpenMP | 76866.679687 | 77080.151042 | 77117.473958 |
financebench: Bonds OpenMP | 135570.807292 | 134942.13542 | 135109.276042 |
redis: LPOP | 2398581.58 | 1508820.33 | 1497147.63 |
redis: SADD | 1893846.79 | 1890949.04 | 1866817.63 |
redis: LPUSH | 1442717.14 | 1467441.79 | 1462812.54 |
redis: GET | 2246166.42 | 2106185.08 | 2100303.25 |
redis: SET | 1723299.04 | 1701062.25 | 1704419.13 |
gnupg: 2.7GB Sample File Encryption | 77.067 | 76.341 | 76.239 |
paraview: Many Spheres - 1920 x 1080 (Frames / Sec) | 43.91 | 31.60 | 31.91 | 31.73
paraview: Many Spheres - 1920 x 1080 (MiPolys / Sec) | 4402.292 | 3167.389 | 3199.197 | 3181.527
paraview: Wavelet Volume - 1920 x 1080 (Frames / Sec) | 100.48 | 71.69 | 70.77 | 71.54
paraview: Wavelet Volume - 1920 x 1080 (MiVoxels / Sec) | 1607.673 | 1147.023 | 1132.244 | 1144.586
paraview: Wavelet Contour - 1920 x 1080 (Frames / Sec) | 82.79 | 82.93 | 82.93 | 82.90
paraview: Wavelet Contour - 1920 x 1080 (MiPolys / Sec) | 862.842 | 864.255 | 864.182 | 863.954
lzbench 1.8 (MB/s, More Is Better)
Build: (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Test: XZ 0 - Process: Compression. run 3: 39; run 2: 39; run 1: 39. SE +/- 0.33, N = 3.
Test: XZ 0 - Process: Decompression. run 2: 105; run 1: 105; run 3: 104. SE +/- 0.33, N = 3.
Test: Zstd 1 - Process: Compression. run 3: 464; run 1: 463; run 2: 460. SE +/- 0.33, N = 3; SE +/- 2.33, N = 3.
Test: Zstd 1 - Process: Decompression. run 2: 1606; run 3: 1600; run 1: 1600. SE +/- 0.33, N = 3; SE +/- 6.24, N = 3; SE +/- 7.62, N = 3.
Test: Zstd 8 - Process: Compression. run 1: 86; run 2: 84; run 3: 83. SE +/- 0.67, N = 3; SE +/- 1.85, N = 12.
Test: Zstd 8 - Process: Decompression. run 2: 1770; run 1: 1762; run 3: 1751. SE +/- 2.33, N = 3; SE +/- 8.24, N = 12.
Test: Crush 0 - Process: Compression. run 2: 92; run 3: 90; run 1: 90. SE +/- 1.53, N = 3.
Test: Crush 0 - Process: Decompression. run 3: 463; run 2: 463; run 1: 463. SE +/- 0.67, N = 3; SE +/- 0.67, N = 3; SE +/- 0.67, N = 3.
Test: Brotli 0 - Process: Compression. run 1: 414; run 3: 412; run 2: 412. SE +/- 1.15, N = 3; SE +/- 2.19, N = 3; SE +/- 1.15, N = 3.
Test: Brotli 0 - Process: Decompression. run 1: 566; run 2: 564; run 3: 563. SE +/- 0.58, N = 3.
Test: Brotli 2 - Process: Compression. run 1: 176; run 3: 175; run 2: 175. SE +/- 0.58, N = 3.
Test: Brotli 2 - Process: Decompression. run 2: 654; run 1: 653; run 3: 651. SE +/- 0.67, N = 3; SE +/- 1.20, N = 3; SE +/- 1.20, N = 3.
Test: Libdeflate 1 - Process: Compression. run 1: 206; run 3: 205; run 2: 204. SE +/- 0.58, N = 3.
Test: Libdeflate 1 - Process: Decompression. run 2: 1023; run 3: 1020; run 1: 1020. SE +/- 0.58, N = 3; SE +/- 1.76, N = 3; SE +/- 1.67, N = 3.
Ngspice 34 (Seconds, Fewer Is Better)
Build: (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Circuit: C2670. run 2: 156.68; run 1: 156.73; run 3: 158.50. SE +/- 0.06, N = 3; SE +/- 0.27, N = 3; SE +/- 1.68, N = 3.
Circuit: C7552. run 2: 162.07; run 3: 162.60; run 1: 163.56. SE +/- 0.37, N = 3; SE +/- 0.86, N = 3; SE +/- 0.25, N = 3.
Etcpak 0.7 (Mpx/s, More Is Better)
Build: (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Configuration: DXT1. run 3: 1151.18; run 4: 1151.06; run 2: 1147.66; run 1: 1146.89. SE +/- 4.09, N = 3; SE +/- 2.68, N = 3; SE +/- 2.80, N = 3; SE +/- 2.32, N = 3.
Configuration: ETC1. run 3: 294.90; run 1: 294.90; run 2: 294.15; run 4: 293.49. SE +/- 0.45, N = 3; SE +/- 1.22, N = 3; SE +/- 0.21, N = 3; SE +/- 1.38, N = 3.
Configuration: ETC2. run 3: 168.00; run 2: 168.00; run 4: 167.77; run 1: 167.38. SE +/- 0.16, N = 3; SE +/- 0.18, N = 3; SE +/- 0.31, N = 3; SE +/- 0.27, N = 3.
Configuration: ETC1 + Dithering. run 1: 281.18; run 3: 281.13; run 2: 281.00; run 4: 279.59. SE +/- 0.76, N = 3; SE +/- 0.09, N = 3; SE +/- 0.25, N = 3; SE +/- 1.35, N = 3.
WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
Build: (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
Encode Settings: Default. run 1: 9.539; run 3: 9.561; run 2: 9.580. SE +/- 0.016, N = 3; SE +/- 0.018, N = 3; SE +/- 0.023, N = 3.
Encode Settings: Quality 75, Compression Effort 7. run 1: 565.94; run 3: 566.12; run 2: 567.33. SE +/- 0.29, N = 3; SE +/- 0.26, N = 3; SE +/- 0.86, N = 3.
Encode Settings: Quality 95, Compression Effort 7. run 1: 1042.44; run 3: 1044.92; run 2: 1045.00. SE +/- 0.82, N = 3; SE +/- 1.16, N = 3; SE +/- 1.34, N = 3.
Encode Settings: Quality 100, Compression Effort 5. run 1: 32.28; run 2: 32.33; run 3: 32.38. SE +/- 0.01, N = 3; SE +/- 0.03, N = 3; SE +/- 0.10, N = 3.
Encode Settings: Quality 100, Lossless Compression. run 3: 2061.11; run 1: 2061.21; run 2: 2062.85. SE +/- 0.26, N = 3; SE +/- 0.77, N = 3; SE +/- 0.95, N = 3.
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
Build: (CC) gcc options: -O2 -fvisibility=hidden
run 2: 227.92; run 1: 228.00; run 3: 228.02. SE +/- 0.44, N = 3; SE +/- 0.20, N = 3; SE +/- 0.58, N = 3.
QuantLib 1.21 (MFLOPS, More Is Better)
Build: (CXX) g++ options: -O3 -march=native -rdynamic
run 2: 2279.6; run 4: 2277.8; run 1: 2277.8; run 3: 2254.0. SE +/- 5.20, N = 3; SE +/- 2.18, N = 3; SE +/- 3.78, N = 3; SE +/- 18.72, N = 3.
ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Build: (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Model: yolov4 - Device: OpenMP CPU. run 3: 234; run 2: 234; run 1: 233. SE +/- 0.29, N = 3; SE +/- 0.29, N = 3; SE +/- 0.44, N = 3.
Model: bertsquad-10 - Device: OpenMP CPU. run 2: 359; run 1: 359; run 3: 358. SE +/- 0.29, N = 3; SE +/- 0.50, N = 3; SE +/- 0.17, N = 3.
Model: fcn-resnet101-11 - Device: OpenMP CPU. run 3: 39; run 2: 39; run 1: 39.
Model: shufflenet-v2-10 - Device: OpenMP CPU. run 2: 13451; run 1: 13436; run 3: 13426. SE +/- 9.37, N = 3; SE +/- 19.44, N = 3; SE +/- 20.55, N = 3.
Model: super-resolution-10 - Device: OpenMP CPU. run 2: 2825; run 1: 2818; run 3: 2788. SE +/- 0.76, N = 3; SE +/- 3.11, N = 3; SE +/- 11.02, N = 3.
NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 4.0.3
Test / Class: BT.C. run 2: 14093.72; run 1: 14085.02; run 4: 14060.83; run 3: 14055.30. SE +/- 4.82, N = 3; SE +/- 14.63, N = 3; SE +/- 9.97, N = 3; SE +/- 19.62, N = 3.
Test / Class: CG.C. run 1: 3279.96; run 2: 3268.09; run 4: 3258.39; run 3: 3251.12. SE +/- 5.94, N = 3; SE +/- 1.85, N = 3; SE +/- 2.09, N = 3; SE +/- 1.89, N = 3.
Test / Class: EP.C. run 3: 541.24; run 2: 541.21; run 4: 540.15; run 1: 534.87. SE +/- 1.14, N = 3; SE +/- 1.18, N = 3; SE +/- 1.52, N = 3; SE +/- 7.88, N = 4.
Test / Class: EP.D. run 3: 542.43; run 2: 541.67; run 1: 538.74. SE +/- 0.86, N = 3; SE +/- 0.91, N = 3; SE +/- 3.99, N = 3.
Test / Class: FT.C. run 2: 7409.62; run 1: 7406.46; run 3: 7358.83. SE +/- 65.59, N = 3; SE +/- 20.58, N = 3; SE +/- 68.80, N = 3.
Test / Class: IS.D. run 1: 522.75; run 2: 522.40; run 3: 521.96. SE +/- 1.35, N = 3; SE +/- 0.44, N = 3; SE +/- 0.15, N = 3.
Test / Class: LU.C. run 1: 15044.95; run 2: 15039.27; run 3: 15037.87. SE +/- 6.51, N = 3; SE +/- 13.61, N = 3; SE +/- 17.13, N = 3.
Test / Class: MG.C. run 2: 7321.00; run 1: 7319.68; run 3: 7316.58. SE +/- 1.62, N = 3; SE +/- 1.93, N = 3; SE +/- 0.87, N = 3.
Test / Class: SP.B. run 1: 4474.21; run 3: 4470.48; run 2: 4466.84. SE +/- 1.45, N = 3; SE +/- 0.41, N = 3; SE +/- 0.68, N = 3.
ASKAP 1.0 (More Is Better; unit given per test)
Build: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Test: tConvolve MT - Gridding (Million Grid Points Per Second). run 2: 948.23; run 1: 948.09; run 3: 947.81. SE +/- 0.14, N = 3; SE +/- 0.28, N = 3; SE +/- 0.28, N = 3.
Test: tConvolve MT - Degridding (Million Grid Points Per Second). run 1: 1200.93; run 2: 1200.70; run 3: 1200.03. SE +/- 0.45, N = 3; SE +/- 0.00, N = 3; SE +/- 0.39, N = 3.
Test: tConvolve MPI - Degridding (Mpix/sec). run 3: 1035.78; run 1: 1033.09; run 2: 1033.05. SE +/- 2.73, N = 3; SE +/- 4.70, N = 3; SE +/- 0.00, N = 3.
Test: tConvolve MPI - Gridding (Mpix/sec). run 3: 1178.57; run 1: 1174.93; run 2: 1171.41. SE +/- 9.30, N = 3; SE +/- 3.52, N = 3; SE +/- 0.00, N = 3.
Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second). run 1: 956.61; run 3: 953.22; run 2: 952.05. SE +/- 1.14, N = 3; SE +/- 4.11, N = 3; SE +/- 1.14, N = 3.
Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second). run 2: 1315.94; run 1: 1315.94; run 3: 1313.77. SE +/- 2.16, N = 3; SE +/- 2.16, N = 3; SE +/- 2.16, N = 3.
Test: Hogbom Clean OpenMP (Iterations Per Second). run 1: 134.77; run 2: 134.53; run 3: 134.11. SE +/- 0.18, N = 3; SE +/- 0.26, N = 3; SE +/- 0.06, N = 3.
Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
Build: (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Test: sedovbig. run 3: 163.73; run 1: 163.79; run 2: 164.04. SE +/- 0.07, N = 3; SE +/- 0.19, N = 3; SE +/- 0.44, N = 3.
Test: leblancbig. run 2: 99.64; run 1: 99.70; run 3: 99.75. SE +/- 0.03, N = 3; SE +/- 0.05, N = 3; SE +/- 0.08, N = 3.
QMCPACK 3.10 (Total Execution Time - Seconds, Fewer Is Better)
Build: (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Input: simple-H2O. run 1: 30.55; run 2: 30.61; run 3: 30.69. SE +/- 0.26, N = 3; SE +/- 0.10, N = 3; SE +/- 0.31, N = 3.
Chaos Group V-RAY 5 (vsamples, More Is Better)
Mode: CPU. run 3: 4149; run 1: 4114; run 2: 4082. SE +/- 8.39, N = 3; SE +/- 14.71, N = 3; SE +/- 21.70, N = 3.
FinanceBench 2016-07-25 (ms, Fewer Is Better)
Build: (CXX) g++ options: -O3 -march=native -fopenmp
Benchmark: Repo OpenMP. run 1: 76866.68; run 2: 77080.15; run 3: 77117.47. SE +/- 18.98, N = 3; SE +/- 140.84, N = 3; SE +/- 146.10, N = 3.
Benchmark: Bonds OpenMP. run 2: 134942.14; run 3: 135109.28; run 1: 135570.81. SE +/- 54.89, N = 3; SE +/- 189.25, N = 3; SE +/- 536.30, N = 3.
Redis 6.0.9 (Requests Per Second, More Is Better)
Build: (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Test: LPOP. run 1: 2398581.58; run 2: 1508820.33; run 3: 1497147.63. SE +/- 9331.80, N = 3; SE +/- 5323.27, N = 3; SE +/- 9905.91, N = 3.
Test: SADD. run 1: 1893846.79; run 2: 1890949.04; run 3: 1866817.63. SE +/- 9940.28, N = 3; SE +/- 13660.64, N = 3; SE +/- 21892.90, N = 3.
Test: LPUSH. run 2: 1467441.79; run 3: 1462812.54; run 1: 1442717.14. SE +/- 12162.45, N = 3; SE +/- 3143.22, N = 3; SE +/- 19180.57, N = 15.
Test: GET. run 1: 2246166.42; run 2: 2106185.08; run 3: 2100303.25. SE +/- 18154.71, N = 3; SE +/- 5466.07, N = 3; SE +/- 12639.41, N = 3.
Test: SET. run 1: 1723299.04; run 3: 1704419.13; run 2: 1701062.25. SE +/- 3451.36, N = 3; SE +/- 18896.53, N = 3; SE +/- 15090.20, N = 3.
GnuPG 2.2.27 (Seconds, Fewer Is Better)
Build: (CC) gcc options: -O2
2.7GB Sample File Encryption. run 3: 76.24; run 2: 76.34; run 1: 77.07. SE +/- 0.09, N = 3; SE +/- 0.05, N = 3; SE +/- 0.76, N = 3.
ParaView 5.9 (More Is Better; unit given per test)
Test: Many Spheres - Resolution: 1920 x 1080 (Frames / Sec). run 1: 43.91; run 3: 31.91; run 4: 31.73; run 2: 31.60. SE +/- 0.67, N = 3; SE +/- 0.26, N = 15; SE +/- 0.35, N = 3; SE +/- 0.31, N = 3.
Test: Many Spheres - Resolution: 1920 x 1080 (MiPolys / Sec). run 1: 4402.29; run 3: 3199.20; run 4: 3181.53; run 2: 3167.39. SE +/- 67.37, N = 3; SE +/- 25.97, N = 15; SE +/- 34.93, N = 3; SE +/- 31.04, N = 3.
Test: Wavelet Volume - Resolution: 1920 x 1080 (Frames / Sec). run 1: 100.48; run 2: 71.69; run 4: 71.54; run 3: 70.77. SE +/- 0.31, N = 3; SE +/- 0.28, N = 3; SE +/- 0.12, N = 3; SE +/- 0.65, N = 3.
Test: Wavelet Volume - Resolution: 1920 x 1080 (MiVoxels / Sec). run 1: 1607.67; run 2: 1147.02; run 4: 1144.59; run 3: 1132.24. SE +/- 4.98, N = 3; SE +/- 4.43, N = 3; SE +/- 1.89, N = 3; SE +/- 10.41, N = 3.
Test: Wavelet Contour - Resolution: 1920 x 1080 (Frames / Sec). run 3: 82.93; run 2: 82.93; run 4: 82.90; run 1: 82.79. SE +/- 0.03, N = 3; SE +/- 0.03, N = 3; SE +/- 0.05, N = 3; SE +/- 0.05, N = 3.
Test: Wavelet Contour - Resolution: 1920 x 1080 (MiPolys / Sec). run 2: 864.26; run 3: 864.18; run 4: 863.95; run 1: 862.84. SE +/- 0.27, N = 3; SE +/- 0.25, N = 3; SE +/- 0.55, N = 3; SE +/- 0.50, N = 3.
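Most tests agree across runs to within about a percent, but a few (for example Redis LPOP and ParaView Many Spheres) show run 1 well ahead of the others. A minimal sketch of how such runs could be flagged is given below. It is not part of the Phoronix Test Suite; the helper names and the 5% cutoff are hypothetical choices for illustration, and the numbers are copied from the results in this export.

```python
from statistics import median

# Values copied from this result export, keyed by run identifier.
results = {
    "redis LPOP (requests/s)": {1: 2398581.58, 2: 1508820.33, 3: 1497147.63},
    "paraview Many Spheres (frames/s)": {1: 43.91, 2: 31.60, 3: 31.91, 4: 31.73},
    "lzbench Zstd 1 compression (MB/s)": {1: 463, 2: 460, 3: 464},
}


def deviations_from_median(runs):
    """Return each run's percentage deviation from the median of all runs."""
    mid = median(runs.values())
    return {run: 100.0 * (value - mid) / mid for run, value in runs.items()}


def outlier_runs(runs, threshold_pct=5.0):
    """List runs deviating from the median by more than threshold_pct.

    The 5% default is an arbitrary, hypothetical cutoff.
    """
    deviations = deviations_from_median(runs)
    return [run for run, pct in deviations.items() if abs(pct) > threshold_pct]


if __name__ == "__main__":
    for name, runs in results.items():
        print(name, outlier_runs(runs))
```

With these inputs only run 1 is flagged for the Redis LPOP and ParaView Many Spheres tests, and no run is flagged for the lzbench test. The median is used as the reference because a single unusually fast run shifts it far less than it would shift the mean.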
Phoronix Test Suite v10.8.4