Xeon E3 1280 v5 Feb Intel Xeon E3-1280 v5 testing with a MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS) and ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102081-HA-XEONE312889&grr .
Xeon E3 1280 v5 Feb Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution 1 2 3 4 Intel Xeon E3-1280 v5 @ 4.00GHz (4 Cores / 8 Threads) MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS) Intel Xeon E3-1200 v5/E3-1500 32GB 256GB TOSHIBA RD400 ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP Realtek ALC1150 VA2431 Intel I219-V Ubuntu 20.04 5.9.0-050900rc2daily20200826-generic (x86_64) 20200825 GNOME Shell 3.36.4 X Server 1.20.8 4.5 Mesa 20.0.8 (LLVM 10.0.0) GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 - Thermald 1.9.1 Graphics Details - GLAMOR Python Details - Python 3.8.5 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Xeon E3 1280 v5 Feb webp2: Quality 100, Lossless Compression webp2: Quality 95, Compression Effort 7 webp2: Quality 75, Compression Effort 7 npb: EP.D gcrypt: npb: BT.C ngspice: C7552 pennant: sedovbig ngspice: C2670 financebench: Bonds OpenMP npb: LU.C onnx: fcn-resnet101-11 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: yolov4 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU npb: IS.D pennant: leblancbig financebench: Repo OpenMP npb: SP.B paraview: Many Spheres - 1920 x 1080 paraview: Many Spheres - 1920 x 1080 v-ray: CPU gnupg: 2.7GB Sample File Encryption npb: FT.C askap: tConvolve MT - Degridding askap: tConvolve MT - Gridding lzbench: Zstd 8 - Decompression lzbench: Zstd 8 - Compression npb: CG.C lzbench: XZ 0 - Decompression lzbench: XZ 0 - Compression redis: LPUSH webp2: Quality 100, Compression Effort 5 quantlib: askap: tConvolve MPI - Gridding askap: tConvolve MPI - Degridding qmcpack: simple-H2O etcpak: ETC2 askap: Hogbom Clean OpenMP lzbench: Libdeflate 1 - Decompression lzbench: Libdeflate 1 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 0 - Compression npb: MG.C lzbench: Brotli 2 - Decompression lzbench: Brotli 2 - Compression lzbench: Crush 0 - Decompression lzbench: Crush 0 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 1 - Compression paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Contour - 1920 x 1080 npb: EP.C etcpak: ETC1 + Dithering askap: tConvolve OpenMP - Degridding askap: tConvolve OpenMP - Gridding etcpak: ETC1 redis: SET redis: LPOP redis: SADD redis: GET paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Volume - 1920 x 1080 webp2: Default etcpak: DXT1 1 2 3 4 2061.207 1042.443 565.943 538.74 228.004 14085.02 163.555 163.7929 156.733 135570.807292 15044.95 39 359 233 13436 2818 522.75 99.69725 76866.679687 4474.21 4402.292 43.91 4114 77.067 7406.46 1200.93 948.093 1762 86 3279.96 105 39 1442717.14 32.282 2277.8 1174.93 1033.09 30.553 167.383 134.772 1020 206 566 414 7319.68 653 176 463 90 1600 463 862.842 82.79 534.87 281.176 1315.94 956.611 294.895 1723299.04 2398581.58 1893846.79 2246166.42 1607.673 100.48 9.539 1146.890 2062.846 1044.997 567.330 541.67 227.918 14093.72 162.071 164.0432 156.678 134942.13542 15039.27 39 359 234 13451 2825 522.40 99.64131 77080.151042 4466.84 3167.389 31.60 4082 76.341 7409.62 1200.7 948.233 1770 84 3268.09 105 39 1467441.79 32.331 2279.6 1171.41 1033.05 30.614 167.999 134.530 1023 204 564 412 7321.00 654 175 463 92 1606 460 864.255 82.93 541.21 280.998 1315.94 952.050 294.153 1701062.25 1508820.33 1890949.04 2106185.08 1147.023 71.69 9.580 1147.662 2061.110 1044.924 566.124 542.43 228.019 14055.30 162.599 163.7290 158.497 135109.276042 15037.87 39 358 234 13426 2788 521.96 99.74793 77117.473958 4470.48 3199.197 31.91 4149 76.239 7358.83 1200.03 947.811 1751 83 3251.12 104 39 1462812.54 32.383 2254.0 1178.57 1035.78 30.694 168.004 134.108 1020 205 563 412 7316.58 651 175 463 90 1600 464 864.182 82.93 541.24 281.127 1313.77 953.219 294.895 1704419.13 1497147.63 1866817.63 2100303.25 1132.244 70.77 9.561 1151.182 14060.83 3181.527 31.73 3258.39 2277.8 167.773 863.954 82.90 540.15 279.591 293.489 1144.586 71.54 1151.055 OpenBenchmarking.org
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 1 2 3 400 800 1200 1600 2000 SE +/- 0.77, N = 3 SE +/- 0.95, N = 3 SE +/- 0.26, N = 3 2061.21 2062.85 2061.11 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 1 2 3 200 400 600 800 1000 SE +/- 0.82, N = 3 SE +/- 1.34, N = 3 SE +/- 1.16, N = 3 1042.44 1045.00 1044.92 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 1 2 3 120 240 360 480 600 SE +/- 0.29, N = 3 SE +/- 0.86, N = 3 SE +/- 0.26, N = 3 565.94 567.33 566.12 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 120 240 360 480 600 SE +/- 3.99, N = 3 SE +/- 0.91, N = 3 SE +/- 0.86, N = 3 538.74 541.67 542.43 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 50 100 150 200 250 SE +/- 0.20, N = 3 SE +/- 0.44, N = 3 SE +/- 0.58, N = 3 228.00 227.92 228.02 1. (CC) gcc options: -O2 -fvisibility=hidden
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 1 2 3 4 3K 6K 9K 12K 15K SE +/- 14.63, N = 3 SE +/- 4.82, N = 3 SE +/- 19.62, N = 3 SE +/- 9.97, N = 3 14085.02 14093.72 14055.30 14060.83 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 1 2 3 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.37, N = 3 SE +/- 0.86, N = 3 163.56 162.07 162.60 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 1 2 3 40 80 120 160 200 SE +/- 0.19, N = 3 SE +/- 0.44, N = 3 SE +/- 0.07, N = 3 163.79 164.04 163.73 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 1 2 3 40 80 120 160 200 SE +/- 0.27, N = 3 SE +/- 0.06, N = 3 SE +/- 1.68, N = 3 156.73 156.68 158.50 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 1 2 3 30K 60K 90K 120K 150K SE +/- 536.30, N = 3 SE +/- 54.89, N = 3 SE +/- 189.25, N = 3 135570.81 134942.14 135109.28 1. (CXX) g++ options: -O3 -march=native -fopenmp
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 3K 6K 9K 12K 15K SE +/- 6.51, N = 3 SE +/- 13.61, N = 3 SE +/- 17.13, N = 3 15044.95 15039.27 15037.87 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 9 18 27 36 45 39 39 39 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 80 160 240 320 400 SE +/- 0.50, N = 3 SE +/- 0.29, N = 3 SE +/- 0.17, N = 3 359 359 358 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 3 50 100 150 200 250 SE +/- 0.44, N = 3 SE +/- 0.29, N = 3 SE +/- 0.29, N = 3 233 234 234 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 3K 6K 9K 12K 15K SE +/- 19.44, N = 3 SE +/- 9.37, N = 3 SE +/- 20.55, N = 3 13436 13451 13426 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 600 1200 1800 2400 3000 SE +/- 3.11, N = 3 SE +/- 0.76, N = 3 SE +/- 11.02, N = 3 2818 2825 2788 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 1 2 3 110 220 330 440 550 SE +/- 1.35, N = 3 SE +/- 0.44, N = 3 SE +/- 0.15, N = 3 522.75 522.40 521.96 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 1 2 3 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 99.70 99.64 99.75 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 1 2 3 17K 34K 51K 68K 85K SE +/- 18.98, N = 3 SE +/- 140.84, N = 3 SE +/- 146.10, N = 3 76866.68 77080.15 77117.47 1. (CXX) g++ options: -O3 -march=native -fopenmp
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 1 2 3 1000 2000 3000 4000 5000 SE +/- 1.45, N = 3 SE +/- 0.68, N = 3 SE +/- 0.41, N = 3 4474.21 4466.84 4470.48 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 1 2 3 4 900 1800 2700 3600 4500 SE +/- 67.37, N = 3 SE +/- 31.04, N = 3 SE +/- 25.97, N = 15 SE +/- 34.93, N = 3 4402.29 3167.39 3199.20 3181.53
ParaView Test: Many Spheres - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Many Spheres - Resolution: 1920 x 1080 1 2 3 4 10 20 30 40 50 SE +/- 0.67, N = 3 SE +/- 0.31, N = 3 SE +/- 0.26, N = 15 SE +/- 0.35, N = 3 43.91 31.60 31.91 31.73
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU 1 2 3 900 1800 2700 3600 4500 SE +/- 14.71, N = 3 SE +/- 21.70, N = 3 SE +/- 8.39, N = 3 4114 4082 4149
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 1 2 3 20 40 60 80 100 SE +/- 0.76, N = 3 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 77.07 76.34 76.24 1. (CC) gcc options: -O2
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 1 2 3 1600 3200 4800 6400 8000 SE +/- 20.58, N = 3 SE +/- 65.59, N = 3 SE +/- 68.80, N = 3 7406.46 7409.62 7358.83 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 300 600 900 1200 1500 SE +/- 0.45, N = 3 SE +/- 0.00, N = 3 SE +/- 0.39, N = 3 1200.93 1200.70 1200.03 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 200 400 600 800 1000 SE +/- 0.28, N = 3 SE +/- 0.14, N = 3 SE +/- 0.28, N = 3 948.09 948.23 947.81 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 400 800 1200 1600 2000 SE +/- 2.33, N = 3 SE +/- 8.24, N = 12 1762 1770 1751 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 20 40 60 80 100 SE +/- 0.67, N = 3 SE +/- 1.85, N = 12 86 84 83 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 1 2 3 4 700 1400 2100 2800 3500 SE +/- 5.94, N = 3 SE +/- 1.85, N = 3 SE +/- 1.89, N = 3 SE +/- 2.09, N = 3 3279.96 3268.09 3251.12 3258.39 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 20 40 60 80 100 SE +/- 0.33, N = 3 105 105 104 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1 2 3 9 18 27 36 45 SE +/- 0.33, N = 3 39 39 39 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 300K 600K 900K 1200K 1500K SE +/- 19180.57, N = 15 SE +/- 12162.45, N = 3 SE +/- 3143.22, N = 3 1442717.14 1467441.79 1462812.54 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 1 2 3 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 32.28 32.33 32.38 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 4 500 1000 1500 2000 2500 SE +/- 3.78, N = 3 SE +/- 5.20, N = 3 SE +/- 18.72, N = 3 SE +/- 2.18, N = 3 2277.8 2279.6 2254.0 2277.8 1. (CXX) g++ options: -O3 -march=native -rdynamic
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 1 2 3 300 600 900 1200 1500 SE +/- 3.52, N = 3 SE +/- 0.00, N = 3 SE +/- 9.30, N = 3 1174.93 1171.41 1178.57 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 1 2 3 200 400 600 800 1000 SE +/- 4.70, N = 3 SE +/- 0.00, N = 3 SE +/- 2.73, N = 3 1033.09 1033.05 1035.78 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 7 14 21 28 35 SE +/- 0.26, N = 3 SE +/- 0.10, N = 3 SE +/- 0.31, N = 3 30.55 30.61 30.69 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 4 40 80 120 160 200 SE +/- 0.27, N = 3 SE +/- 0.18, N = 3 SE +/- 0.16, N = 3 SE +/- 0.31, N = 3 167.38 168.00 168.00 167.77 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 1 2 3 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 0.26, N = 3 SE +/- 0.06, N = 3 134.77 134.53 134.11 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench Test: Libdeflate 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Decompression 1 2 3 200 400 600 800 1000 SE +/- 1.67, N = 3 SE +/- 0.58, N = 3 SE +/- 1.76, N = 3 1020 1023 1020 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 50 100 150 200 250 SE +/- 0.58, N = 3 206 204 205 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 120 240 360 480 600 SE +/- 0.58, N = 3 566 564 563 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 90 180 270 360 450 SE +/- 1.15, N = 3 SE +/- 1.15, N = 3 SE +/- 2.19, N = 3 414 412 412 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 1 2 3 1600 3200 4800 6400 8000 SE +/- 1.93, N = 3 SE +/- 1.62, N = 3 SE +/- 0.87, N = 3 7319.68 7321.00 7316.58 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 140 280 420 560 700 SE +/- 1.20, N = 3 SE +/- 0.67, N = 3 SE +/- 1.20, N = 3 653 654 651 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 40 80 120 160 200 SE +/- 0.58, N = 3 176 175 175 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 100 200 300 400 500 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 463 463 463 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 20 40 60 80 100 SE +/- 1.53, N = 3 90 92 90 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 300 600 900 1200 1500 SE +/- 7.62, N = 3 SE +/- 0.33, N = 3 SE +/- 6.24, N = 3 1600 1606 1600 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 100 200 300 400 500 SE +/- 2.33, N = 3 SE +/- 0.33, N = 3 463 460 464 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 200 400 600 800 1000 SE +/- 0.50, N = 3 SE +/- 0.27, N = 3 SE +/- 0.25, N = 3 SE +/- 0.55, N = 3 862.84 864.26 864.18 863.95
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 82.79 82.93 82.93 82.90
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 4 120 240 360 480 600 SE +/- 7.88, N = 4 SE +/- 1.18, N = 3 SE +/- 1.14, N = 3 SE +/- 1.52, N = 3 534.87 541.21 541.24 540.15 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 4 60 120 180 240 300 SE +/- 0.76, N = 3 SE +/- 0.25, N = 3 SE +/- 0.09, N = 3 SE +/- 1.35, N = 3 281.18 281.00 281.13 279.59 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 1 2 3 300 600 900 1200 1500 SE +/- 2.16, N = 3 SE +/- 2.16, N = 3 SE +/- 2.16, N = 3 1315.94 1315.94 1313.77 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 1 2 3 200 400 600 800 1000 SE +/- 1.14, N = 3 SE +/- 1.14, N = 3 SE +/- 4.11, N = 3 956.61 952.05 953.22 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 4 60 120 180 240 300 SE +/- 1.22, N = 3 SE +/- 0.21, N = 3 SE +/- 0.45, N = 3 SE +/- 1.38, N = 3 294.90 294.15 294.90 293.49 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 3451.36, N = 3 SE +/- 15090.20, N = 3 SE +/- 18896.53, N = 3 1723299.04 1701062.25 1704419.13 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 500K 1000K 1500K 2000K 2500K SE +/- 9331.80, N = 3 SE +/- 5323.27, N = 3 SE +/- 9905.91, N = 3 2398581.58 1508820.33 1497147.63 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 9940.28, N = 3 SE +/- 13660.64, N = 3 SE +/- 21892.90, N = 3 1893846.79 1890949.04 1866817.63 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 500K 1000K 1500K 2000K 2500K SE +/- 18154.71, N = 3 SE +/- 5466.07, N = 3 SE +/- 12639.41, N = 3 2246166.42 2106185.08 2100303.25 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 300 600 900 1200 1500 SE +/- 4.98, N = 3 SE +/- 4.43, N = 3 SE +/- 10.41, N = 3 SE +/- 1.89, N = 3 1607.67 1147.02 1132.24 1144.59
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 20 40 60 80 100 SE +/- 0.31, N = 3 SE +/- 0.28, N = 3 SE +/- 0.65, N = 3 SE +/- 0.12, N = 3 100.48 71.69 70.77 71.54
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 1 2 3 3 6 9 12 15 SE +/- 0.016, N = 3 SE +/- 0.023, N = 3 SE +/- 0.018, N = 3 9.539 9.580 9.561 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 4 200 400 600 800 1000 SE +/- 2.32, N = 3 SE +/- 2.80, N = 3 SE +/- 4.09, N = 3 SE +/- 2.68, N = 3 1146.89 1147.66 1151.18 1151.06 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Phoronix Test Suite v10.8.4