Xeon E3 1280 v5 Feb
Intel Xeon E3-1280 v5 testing with an MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS) and ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102081-HA-XEONE312889&sor&grt .
Xeon E3 1280 v5 Feb: System Details (result identifiers 1, 2, 3, 4)
  Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (4 Cores / 8 Threads)
  Motherboard: MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS)
  Chipset: Intel Xeon E3-1200 v5/E3-1500
  Memory: 32GB
  Disk: 256GB TOSHIBA RD400
  Graphics: ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP
  Audio: Realtek ALC1150
  Monitor: VA2431
  Network: Intel I219-V
  OS: Ubuntu 20.04
  Kernel: 5.9.0-050900rc2daily20200826-generic (x86_64) 20200825
  Desktop: GNOME Shell 3.36.4
  Display Server: X Server 1.20.8
  OpenGL: 4.5 Mesa 20.0.8 (LLVM 10.0.0)
  Compiler: GCC 9.3.0
  File-System: ext4
  Screen Resolution: 1920x1080
Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 - Thermald 1.9.1
Graphics Details - GLAMOR
Python Details - Python 3.8.5
Security Details -
  itlb_multihit: KVM: Mitigation of VMX disabled
  l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
  mds: Mitigation of Clear buffers; SMT vulnerable
  meltdown: Mitigation of PTI
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
  srbds: Mitigation of Microcode
  tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Xeon E3 1280 v5 Feb: Results Overview (a dash marks a test with no result for that identifier)
Test                                                     1              2              3              4
askap: tConvolve MT - Gridding                           948.093        948.233        947.811        -
askap: tConvolve MT - Degridding                         1200.93        1200.7         1200.03        -
askap: tConvolve MPI - Degridding                        1033.09        1033.05        1035.78        -
askap: tConvolve MPI - Gridding                          1174.93        1171.41        1178.57        -
askap: tConvolve OpenMP - Gridding                       956.611        952.050        953.219        -
askap: tConvolve OpenMP - Degridding                     1315.94        1315.94        1313.77        -
askap: Hogbom Clean OpenMP                               134.772        134.530        134.108        -
v-ray: CPU                                               4114           4082           4149           -
etcpak: DXT1                                             1146.890       1147.662       1151.182       1151.055
etcpak: ETC1                                             294.895        294.153        294.895        293.489
etcpak: ETC2                                             167.383        167.999        168.004        167.773
etcpak: ETC1 + Dithering                                 281.176        280.998        281.127        279.591
financebench: Repo OpenMP                                76866.679687   77080.151042   77117.473958   -
financebench: Bonds OpenMP                               135570.807292  134942.13542   135109.276042  -
gcrypt:                                                  228.004        227.918        228.019        -
gnupg: 2.7GB Sample File Encryption                      77.067         76.341         76.239         -
lzbench: XZ 0 - Compression                              39             39             39             -
lzbench: XZ 0 - Decompression                            105            105            104            -
lzbench: Zstd 1 - Compression                            463            460            464            -
lzbench: Zstd 1 - Decompression                          1600           1606           1600           -
lzbench: Zstd 8 - Compression                            86             84             83             -
lzbench: Zstd 8 - Decompression                          1762           1770           1751           -
lzbench: Crush 0 - Compression                           90             92             90             -
lzbench: Crush 0 - Decompression                         463            463            463            -
lzbench: Brotli 0 - Compression                          414            412            412            -
lzbench: Brotli 0 - Decompression                        566            564            563            -
lzbench: Brotli 2 - Compression                          176            175            175            -
lzbench: Brotli 2 - Decompression                        653            654            651            -
lzbench: Libdeflate 1 - Compression                      206            204            205            -
lzbench: Libdeflate 1 - Decompression                    1020           1023           1020           -
npb: BT.C                                                14085.02       14093.72       14055.30       14060.83
npb: CG.C                                                3279.96        3268.09        3251.12        3258.39
npb: EP.C                                                534.87         541.21         541.24         540.15
npb: EP.D                                                538.74         541.67         542.43         -
npb: FT.C                                                7406.46        7409.62        7358.83        -
npb: IS.D                                                522.75         522.40         521.96         -
npb: LU.C                                                15044.95       15039.27       15037.87       -
npb: MG.C                                                7319.68        7321.00        7316.58        -
npb: SP.B                                                4474.21        4466.84        4470.48        -
ngspice: C2670                                           156.733        156.678        158.497        -
ngspice: C7552                                           163.555        162.071        162.599        -
onnx: yolov4 - OpenMP CPU                                233            234            234            -
onnx: bertsquad-10 - OpenMP CPU                          359            359            358            -
onnx: fcn-resnet101-11 - OpenMP CPU                      39             39             39             -
onnx: shufflenet-v2-10 - OpenMP CPU                      13436          13451          13426          -
onnx: super-resolution-10 - OpenMP CPU                   2818           2825           2788           -
paraview: Many Spheres - 1920 x 1080 (Frames / Sec)      43.91          31.60          31.91          31.73
paraview: Many Spheres - 1920 x 1080 (MiPolys / Sec)     4402.292       3167.389       3199.197       3181.527
paraview: Wavelet Volume - 1920 x 1080 (Frames / Sec)    100.48         71.69          70.77          71.54
paraview: Wavelet Volume - 1920 x 1080 (MiVoxels / Sec)  1607.673       1147.023       1132.244       1144.586
paraview: Wavelet Contour - 1920 x 1080 (Frames / Sec)   82.79          82.93          82.93          82.90
paraview: Wavelet Contour - 1920 x 1080 (MiPolys / Sec)  862.842        864.255        864.182        863.954
pennant: sedovbig                                        163.7929       164.0432       163.7290       -
pennant: leblancbig                                      99.69725       99.64131       99.74793       -
qmcpack: simple-H2O                                      30.553         30.614         30.694         -
quantlib:                                                2277.8         2279.6         2254.0         2277.8
redis: LPOP                                              2398581.58     1508820.33     1497147.63     -
redis: SADD                                              1893846.79     1890949.04     1866817.63     -
redis: LPUSH                                             1442717.14     1467441.79     1462812.54     -
redis: GET                                               2246166.42     2106185.08     2100303.25     -
redis: SET                                               1723299.04     1701062.25     1704419.13     -
webp2: Default                                           9.539          9.580          9.561          -
webp2: Quality 75, Compression Effort 7                  565.943        567.330        566.124        -
webp2: Quality 95, Compression Effort 7                  1042.443       1044.997       1044.924       -
webp2: Quality 100, Compression Effort 5                 32.282         32.331         32.383         -
webp2: Quality 100, Lossless Compression                 2061.207       2062.846       2061.110       -
ASKAP 1.0, Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
  2: 948.23 (SE +/- 0.14, N = 3)
  1: 948.09 (SE +/- 0.28, N = 3)
  3: 947.81 (SE +/- 0.28, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0, Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
  1: 1200.93 (SE +/- 0.45, N = 3)
  2: 1200.70 (SE +/- 0.00, N = 3)
  3: 1200.03 (SE +/- 0.39, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0, Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
  3: 1035.78 (SE +/- 2.73, N = 3)
  1: 1033.09 (SE +/- 4.70, N = 3)
  2: 1033.05 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0, Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
  3: 1178.57 (SE +/- 9.30, N = 3)
  1: 1174.93 (SE +/- 3.52, N = 3)
  2: 1171.41 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0, Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
  1: 956.61 (SE +/- 1.14, N = 3)
  3: 953.22 (SE +/- 4.11, N = 3)
  2: 952.05 (SE +/- 1.14, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0, Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
  2: 1315.94 (SE +/- 2.16, N = 3)
  1: 1315.94 (SE +/- 2.16, N = 3)
  3: 1313.77 (SE +/- 2.16, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0, Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
  1: 134.77 (SE +/- 0.18, N = 3)
  2: 134.53 (SE +/- 0.26, N = 3)
  3: 134.11 (SE +/- 0.06, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Chaos Group V-RAY 5, Mode: CPU (vsamples, More Is Better)
  3: 4149 (SE +/- 8.39, N = 3)
  1: 4114 (SE +/- 14.71, N = 3)
  2: 4082 (SE +/- 21.70, N = 3)
Etcpak 0.7, Configuration: DXT1 (Mpx/s, More Is Better)
  3: 1151.18 (SE +/- 4.09, N = 3)
  4: 1151.06 (SE +/- 2.68, N = 3)
  2: 1147.66 (SE +/- 2.80, N = 3)
  1: 1146.89 (SE +/- 2.32, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC1 (Mpx/s, More Is Better)
  3: 294.90 (SE +/- 0.45, N = 3)
  1: 294.90 (SE +/- 1.22, N = 3)
  2: 294.15 (SE +/- 0.21, N = 3)
  4: 293.49 (SE +/- 1.38, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC2 (Mpx/s, More Is Better)
  3: 168.00 (SE +/- 0.16, N = 3)
  2: 168.00 (SE +/- 0.18, N = 3)
  4: 167.77 (SE +/- 0.31, N = 3)
  1: 167.38 (SE +/- 0.27, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
  1: 281.18 (SE +/- 0.76, N = 3)
  3: 281.13 (SE +/- 0.09, N = 3)
  2: 281.00 (SE +/- 0.25, N = 3)
  4: 279.59 (SE +/- 1.35, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
FinanceBench 2016-07-25, Benchmark: Repo OpenMP (ms, Fewer Is Better)
  1: 76866.68 (SE +/- 18.98, N = 3)
  2: 77080.15 (SE +/- 140.84, N = 3)
  3: 77117.47 (SE +/- 146.10, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench 2016-07-25, Benchmark: Bonds OpenMP (ms, Fewer Is Better)
  2: 134942.14 (SE +/- 54.89, N = 3)
  3: 135109.28 (SE +/- 189.25, N = 3)
  1: 135570.81 (SE +/- 536.30, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
  2: 227.92 (SE +/- 0.44, N = 3)
  1: 228.00 (SE +/- 0.20, N = 3)
  3: 228.02 (SE +/- 0.58, N = 3)
  1. (CC) gcc options: -O2 -fvisibility=hidden
GnuPG 2.2.27, 2.7GB Sample File Encryption (Seconds, Fewer Is Better)
  3: 76.24 (SE +/- 0.09, N = 3)
  2: 76.34 (SE +/- 0.05, N = 3)
  1: 77.07 (SE +/- 0.76, N = 3)
  1. (CC) gcc options: -O2
lzbench 1.8, Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  3: 39
  2: 39
  1: 39
  SE as listed: SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  2: 105
  1: 105
  3: 104
  SE as listed: SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
  3: 464
  1: 463
  2: 460
  SE as listed: SE +/- 0.33, N = 3; SE +/- 2.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
  2: 1606 (SE +/- 0.33, N = 3)
  3: 1600 (SE +/- 6.24, N = 3)
  1: 1600 (SE +/- 7.62, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
  1: 86
  2: 84
  3: 83
  SE as listed: SE +/- 0.67, N = 3; SE +/- 1.85, N = 12
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
  2: 1770
  1: 1762
  3: 1751
  SE as listed: SE +/- 2.33, N = 3; SE +/- 8.24, N = 12
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Crush 0 - Process: Compression (MB/s, More Is Better)
  2: 92
  3: 90
  1: 90
  SE as listed: SE +/- 1.53, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
  3: 463 (SE +/- 0.67, N = 3)
  2: 463 (SE +/- 0.67, N = 3)
  1: 463 (SE +/- 0.67, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
  1: 414 (SE +/- 1.15, N = 3)
  3: 412 (SE +/- 2.19, N = 3)
  2: 412 (SE +/- 1.15, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
  1: 566
  2: 564
  3: 563
  SE as listed: SE +/- 0.58, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
  1: 176
  3: 175
  2: 175
  SE as listed: SE +/- 0.58, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
  2: 654 (SE +/- 0.67, N = 3)
  1: 653 (SE +/- 1.20, N = 3)
  3: 651 (SE +/- 1.20, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
  1: 206
  3: 205
  2: 204
  SE as listed: SE +/- 0.58, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
  2: 1023 (SE +/- 0.58, N = 3)
  3: 1020 (SE +/- 1.76, N = 3)
  1: 1020 (SE +/- 1.67, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
  2: 14093.72 (SE +/- 4.82, N = 3)
  1: 14085.02 (SE +/- 14.63, N = 3)
  4: 14060.83 (SE +/- 9.97, N = 3)
  3: 14055.30 (SE +/- 19.62, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
  1: 3279.96 (SE +/- 5.94, N = 3)
  2: 3268.09 (SE +/- 1.85, N = 3)
  4: 3258.39 (SE +/- 2.09, N = 3)
  3: 3251.12 (SE +/- 1.89, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
  3: 541.24 (SE +/- 1.14, N = 3)
  2: 541.21 (SE +/- 1.18, N = 3)
  4: 540.15 (SE +/- 1.52, N = 3)
  1: 534.87 (SE +/- 7.88, N = 4)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
  3: 542.43 (SE +/- 0.86, N = 3)
  2: 541.67 (SE +/- 0.91, N = 3)
  1: 538.74 (SE +/- 3.99, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
  2: 7409.62 (SE +/- 65.59, N = 3)
  1: 7406.46 (SE +/- 20.58, N = 3)
  3: 7358.83 (SE +/- 68.80, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
  1: 522.75 (SE +/- 1.35, N = 3)
  2: 522.40 (SE +/- 0.44, N = 3)
  3: 521.96 (SE +/- 0.15, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
  1: 15044.95 (SE +/- 6.51, N = 3)
  2: 15039.27 (SE +/- 13.61, N = 3)
  3: 15037.87 (SE +/- 17.13, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
  2: 7321.00 (SE +/- 1.62, N = 3)
  1: 7319.68 (SE +/- 1.93, N = 3)
  3: 7316.58 (SE +/- 0.87, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better)
  1: 4474.21 (SE +/- 1.45, N = 3)
  3: 4470.48 (SE +/- 0.41, N = 3)
  2: 4466.84 (SE +/- 0.68, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
Ngspice 34, Circuit: C2670 (Seconds, Fewer Is Better)
  2: 156.68 (SE +/- 0.06, N = 3)
  1: 156.73 (SE +/- 0.27, N = 3)
  3: 158.50 (SE +/- 1.68, N = 3)
  1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Ngspice 34, Circuit: C7552 (Seconds, Fewer Is Better)
  2: 162.07 (SE +/- 0.37, N = 3)
  3: 162.60 (SE +/- 0.86, N = 3)
  1: 163.56 (SE +/- 0.25, N = 3)
  1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
ONNX Runtime 1.6, Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  3: 234 (SE +/- 0.29, N = 3)
  2: 234 (SE +/- 0.29, N = 3)
  1: 233 (SE +/- 0.44, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  2: 359 (SE +/- 0.29, N = 3)
  1: 359 (SE +/- 0.50, N = 3)
  3: 358 (SE +/- 0.17, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  3: 39
  2: 39
  1: 39
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  2: 13451 (SE +/- 9.37, N = 3)
  1: 13436 (SE +/- 19.44, N = 3)
  3: 13426 (SE +/- 20.55, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  2: 2825 (SE +/- 0.76, N = 3)
  1: 2818 (SE +/- 3.11, N = 3)
  3: 2788 (SE +/- 11.02, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ParaView 5.9, Test: Many Spheres - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  1: 43.91 (SE +/- 0.67, N = 3)
  3: 31.91 (SE +/- 0.26, N = 15)
  4: 31.73 (SE +/- 0.35, N = 3)
  2: 31.60 (SE +/- 0.31, N = 3)
ParaView 5.9, Test: Many Spheres - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better)
  1: 4402.29 (SE +/- 67.37, N = 3)
  3: 3199.20 (SE +/- 25.97, N = 15)
  4: 3181.53 (SE +/- 34.93, N = 3)
  2: 3167.39 (SE +/- 31.04, N = 3)
ParaView 5.9, Test: Wavelet Volume - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  1: 100.48 (SE +/- 0.31, N = 3)
  2: 71.69 (SE +/- 0.28, N = 3)
  4: 71.54 (SE +/- 0.12, N = 3)
  3: 70.77 (SE +/- 0.65, N = 3)
ParaView 5.9, Test: Wavelet Volume - Resolution: 1920 x 1080 (MiVoxels / Sec, More Is Better)
  1: 1607.67 (SE +/- 4.98, N = 3)
  2: 1147.02 (SE +/- 4.43, N = 3)
  4: 1144.59 (SE +/- 1.89, N = 3)
  3: 1132.24 (SE +/- 10.41, N = 3)
ParaView 5.9, Test: Wavelet Contour - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  3: 82.93 (SE +/- 0.03, N = 3)
  2: 82.93 (SE +/- 0.03, N = 3)
  4: 82.90 (SE +/- 0.05, N = 3)
  1: 82.79 (SE +/- 0.05, N = 3)
ParaView 5.9, Test: Wavelet Contour - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better)
  2: 864.26 (SE +/- 0.27, N = 3)
  3: 864.18 (SE +/- 0.25, N = 3)
  4: 863.95 (SE +/- 0.55, N = 3)
  1: 862.84 (SE +/- 0.50, N = 3)
Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  3: 163.73 (SE +/- 0.07, N = 3)
  1: 163.79 (SE +/- 0.19, N = 3)
  2: 164.04 (SE +/- 0.44, N = 3)
  1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  2: 99.64 (SE +/- 0.03, N = 3)
  1: 99.70 (SE +/- 0.05, N = 3)
  3: 99.75 (SE +/- 0.08, N = 3)
  1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
QMCPACK 3.10, Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
  1: 30.55 (SE +/- 0.26, N = 3)
  2: 30.61 (SE +/- 0.10, N = 3)
  3: 30.69 (SE +/- 0.31, N = 3)
  1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
QuantLib 1.21 (MFLOPS, More Is Better)
  2: 2279.6 (SE +/- 5.20, N = 3)
  4: 2277.8 (SE +/- 2.18, N = 3)
  1: 2277.8 (SE +/- 3.78, N = 3)
  3: 2254.0 (SE +/- 18.72, N = 3)
  1. (CXX) g++ options: -O3 -march=native -rdynamic
Redis 6.0.9, Test: LPOP (Requests Per Second, More Is Better)
  1: 2398581.58 (SE +/- 9331.80, N = 3)
  2: 1508820.33 (SE +/- 5323.27, N = 3)
  3: 1497147.63 (SE +/- 9905.91, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9, Test: SADD (Requests Per Second, More Is Better)
  1: 1893846.79 (SE +/- 9940.28, N = 3)
  2: 1890949.04 (SE +/- 13660.64, N = 3)
  3: 1866817.63 (SE +/- 21892.90, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9, Test: LPUSH (Requests Per Second, More Is Better)
  2: 1467441.79 (SE +/- 12162.45, N = 3)
  3: 1462812.54 (SE +/- 3143.22, N = 3)
  1: 1442717.14 (SE +/- 19180.57, N = 15)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9, Test: GET (Requests Per Second, More Is Better)
  1: 2246166.42 (SE +/- 18154.71, N = 3)
  2: 2106185.08 (SE +/- 5466.07, N = 3)
  3: 2100303.25 (SE +/- 12639.41, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9, Test: SET (Requests Per Second, More Is Better)
  1: 1723299.04 (SE +/- 3451.36, N = 3)
  3: 1704419.13 (SE +/- 18896.53, N = 3)
  2: 1701062.25 (SE +/- 15090.20, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
WebP2 Image Encode 20210126, Encode Settings: Default (Seconds, Fewer Is Better)
  1: 9.539 (SE +/- 0.016, N = 3)
  3: 9.561 (SE +/- 0.018, N = 3)
  2: 9.580 (SE +/- 0.023, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode 20210126, Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better)
  1: 565.94 (SE +/- 0.29, N = 3)
  3: 566.12 (SE +/- 0.26, N = 3)
  2: 567.33 (SE +/- 0.86, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode 20210126, Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better)
  1: 1042.44 (SE +/- 0.82, N = 3)
  3: 1044.92 (SE +/- 1.16, N = 3)
  2: 1045.00 (SE +/- 1.34, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode 20210126, Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better)
  1: 32.28 (SE +/- 0.01, N = 3)
  2: 32.33 (SE +/- 0.03, N = 3)
  3: 32.38 (SE +/- 0.10, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode 20210126, Encode Settings: Quality 100, Lossless Compression (Seconds, Fewer Is Better)
  3: 2061.11 (SE +/- 0.26, N = 3)
  1: 2061.21 (SE +/- 0.77, N = 3)
  2: 2062.85 (SE +/- 0.95, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
Phoronix Test Suite v10.8.4