Xeon E3 1280 v5 Feb
Intel Xeon E3-1280 v5 testing with an MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS) and ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102081-HA-XEONE312889&sor&grs .
Xeon E3 1280 v5 Feb
System details (runs 1, 2, 3, 4):
Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (4 Cores / 8 Threads)
Motherboard: MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS)
Chipset: Intel Xeon E3-1200 v5/E3-1500
Memory: 32GB
Disk: 256GB TOSHIBA RD400
Graphics: ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP
Audio: Realtek ALC1150
Monitor: VA2431
Network: Intel I219-V
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc2daily20200826-generic (x86_64) 20200825
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
OpenGL: 4.5 Mesa 20.0.8 (LLVM 10.0.0)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080
Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 - Thermald 1.9.1
Graphics Details - GLAMOR
Python Details - Python 3.8.5
Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Xeon E3 1280 v5 Feb
Results overview (columns are runs 1 to 4; the run 4 column is empty where the source lists no value for it):
Test | 1 | 2 | 3 | 4
redis: LPOP | 2398581.58 | 1508820.33 | 1497147.63 |
paraview: Wavelet Volume - 1920 x 1080 (MiVoxels / Sec) | 1607.673 | 1147.023 | 1132.244 | 1144.586
paraview: Wavelet Volume - 1920 x 1080 (Frames / Sec) | 100.48 | 71.69 | 70.77 | 71.54
paraview: Many Spheres - 1920 x 1080 (MiPolys / Sec) | 4402.292 | 3167.389 | 3199.197 | 3181.527
paraview: Many Spheres - 1920 x 1080 (Frames / Sec) | 43.91 | 31.60 | 31.91 | 31.73
redis: GET | 2246166.42 | 2106185.08 | 2100303.25 |
lzbench: Crush 0 - Compression | 90 | 92 | 90 |
redis: LPUSH | 1442717.14 | 1467441.79 | 1462812.54 |
v-ray: CPU | 4114 | 4082 | 4149 |
redis: SADD | 1893846.79 | 1890949.04 | 1866817.63 |
onnx: super-resolution-10 - OpenMP CPU | 2818 | 2825 | 2788 |
redis: SET | 1723299.04 | 1701062.25 | 1704419.13 |
npb: EP.C | 534.87 | 541.21 | 541.24 | 540.15
ngspice: C2670 | 156.733 | 156.678 | 158.497 |
quantlib: | 2277.8 | 2279.6 | 2254.0 | 2277.8
gnupg: 2.7GB Sample File Encryption | 77.067 | 76.341 | 76.239 |
lzbench: Zstd 8 - Decompression | 1762 | 1770 | 1751 |
lzbench: Libdeflate 1 - Compression | 206 | 204 | 205 |
lzbench: XZ 0 - Decompression | 105 | 105 | 104 |
ngspice: C7552 | 163.555 | 162.071 | 162.599 |
npb: CG.C | 3279.96 | 3268.09 | 3251.12 | 3258.39
lzbench: Zstd 1 - Compression | 463 | 460 | 464 |
npb: FT.C | 7406.46 | 7409.62 | 7358.83 |
npb: EP.D | 538.74 | 541.67 | 542.43 |
askap: tConvolve MPI - Gridding | 1174.93 | 1171.41 | 1178.57 |
lzbench: Brotli 2 - Compression | 176 | 175 | 175 |
etcpak: ETC1 + Dithering | 281.176 | 280.998 | 281.127 | 279.591
lzbench: Brotli 0 - Decompression | 566 | 564 | 563 |
askap: Hogbom Clean OpenMP | 134.772 | 134.530 | 134.108 |
lzbench: Brotli 0 - Compression | 414 | 412 | 412 |
askap: tConvolve OpenMP - Gridding | 956.611 | 952.050 | 953.219 |
etcpak: ETC1 | 294.895 | 294.153 | 294.895 | 293.489
financebench: Bonds OpenMP | 135570.807292 | 134942.13542 | 135109.276042 |
qmcpack: simple-H2O | 30.553 | 30.614 | 30.694 |
lzbench: Brotli 2 - Decompression | 653 | 654 | 651 |
webp2: Default | 9.539 | 9.580 | 9.561 |
onnx: yolov4 - OpenMP CPU | 233 | 234 | 234 |
lzbench: Zstd 1 - Decompression | 1600 | 1606 | 1600 |
etcpak: DXT1 | 1146.890 | 1147.662 | 1151.182 | 1151.055
etcpak: ETC2 | 167.383 | 167.999 | 168.004 | 167.773
financebench: Repo OpenMP | 76866.679687 | 77080.151042 | 77117.473958 |
webp2: Quality 100, Compression Effort 5 | 32.282 | 32.331 | 32.383 |
lzbench: Libdeflate 1 - Decompression | 1020 | 1023 | 1020 |
onnx: bertsquad-10 - OpenMP CPU | 359 | 359 | 358 |
npb: BT.C | 14085.02 | 14093.72 | 14055.30 | 14060.83
askap: tConvolve MPI - Degridding | 1033.09 | 1033.05 | 1035.78 |
webp2: Quality 75, Compression Effort 7 | 565.943 | 567.330 | 566.124 |
webp2: Quality 95, Compression Effort 7 | 1042.443 | 1044.997 | 1044.924 |
pennant: sedovbig | 163.7929 | 164.0432 | 163.7290 |
onnx: shufflenet-v2-10 - OpenMP CPU | 13436 | 13451 | 13426 |
paraview: Wavelet Contour - 1920 x 1080 (Frames / Sec) | 82.79 | 82.93 | 82.93 | 82.90
askap: tConvolve OpenMP - Degridding | 1315.94 | 1315.94 | 1313.77 |
npb: SP.B | 4474.21 | 4466.84 | 4470.48 |
paraview: Wavelet Contour - 1920 x 1080 (MiPolys / Sec) | 862.842 | 864.255 | 864.182 | 863.954
npb: IS.D | 522.75 | 522.40 | 521.96 |
pennant: leblancbig | 99.69725 | 99.64131 | 99.74793 |
webp2: Quality 100, Lossless Compression | 2061.207 | 2062.846 | 2061.110 |
askap: tConvolve MT - Degridding | 1200.93 | 1200.7 | 1200.03 |
npb: MG.C | 7319.68 | 7321.00 | 7316.58 |
npb: LU.C | 15044.95 | 15039.27 | 15037.87 |
askap: tConvolve MT - Gridding | 948.093 | 948.233 | 947.811 |
gcrypt: | 228.004 | 227.918 | 228.019 |
onnx: fcn-resnet101-11 - OpenMP CPU | 39 | 39 | 39 |
lzbench: Crush 0 - Decompression | 463 | 463 | 463 |
lzbench: XZ 0 - Compression | 39 | 39 | 39 |
lzbench: Zstd 8 - Compression | 86 | 84 | 83 |
Redis 6.0.9 - Test: LPOP. Requests Per Second, More Is Better. Results by run (chart order): 1 = 2398581.58; 2 = 1508820.33; 3 = 1497147.63. SE as listed: +/- 9331.80, N = 3; +/- 5323.27, N = 3; +/- 9905.91, N = 3. 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
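The gap between run 1 and runs 2 and 3 in the LPOP result can be expressed as a relative change against run 1. The following is a minimal sketch in Python that assumes only the three mean values reported above; the helper name `percent_change` and the dictionary layout are hypothetical and not part of the Phoronix Test Suite.

```python
def percent_change(baseline: float, value: float) -> float:
    """Relative change of value versus baseline, in percent."""
    return (value - baseline) / baseline * 100.0


# Mean requests per second for the Redis LPOP test, copied from the result above.
lpop_means = {"1": 2398581.58, "2": 1508820.33, "3": 1497147.63}

# Use run 1 as the reference point (an arbitrary choice for this sketch).
baseline = lpop_means["1"]
changes = {run: percent_change(baseline, mean) for run, mean in lpop_means.items()}

for run, change in changes.items():
    print(f"run {run}: {change:+.2f}% vs run 1")
```

Because the test is "More Is Better", a negative change means the run was slower than run 1. The same helper applies to "Fewer Is Better" tests, where the sign is read the other way round.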
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080. MiVoxels / Sec, More Is Better. Results by run (chart order): 1 = 1607.67; 2 = 1147.02; 4 = 1144.59; 3 = 1132.24. SE as listed: +/- 4.98, N = 3; +/- 4.43, N = 3; +/- 1.89, N = 3; +/- 10.41, N = 3.
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080. Frames / Sec, More Is Better. Results by run (chart order): 1 = 100.48; 2 = 71.69; 4 = 71.54; 3 = 70.77. SE as listed: +/- 0.31, N = 3; +/- 0.28, N = 3; +/- 0.12, N = 3; +/- 0.65, N = 3.
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080. MiPolys / Sec, More Is Better. Results by run (chart order): 1 = 4402.29; 3 = 3199.20; 4 = 3181.53; 2 = 3167.39. SE as listed: +/- 67.37, N = 3; +/- 25.97, N = 15; +/- 34.93, N = 3; +/- 31.04, N = 3.
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080. Frames / Sec, More Is Better. Results by run (chart order): 1 = 43.91; 3 = 31.91; 4 = 31.73; 2 = 31.60. SE as listed: +/- 0.67, N = 3; +/- 0.26, N = 15; +/- 0.35, N = 3; +/- 0.31, N = 3.
Redis 6.0.9 - Test: GET. Requests Per Second, More Is Better. Results by run (chart order): 1 = 2246166.42; 2 = 2106185.08; 3 = 2100303.25. SE as listed: +/- 18154.71, N = 3; +/- 5466.07, N = 3; +/- 12639.41, N = 3. 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
lzbench 1.8 - Test: Crush 0 - Process: Compression. MB/s, More Is Better. Results by run (chart order): 2 = 92; 3 = 90; 1 = 90. SE as listed: +/- 1.53, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis 6.0.9 - Test: LPUSH. Requests Per Second, More Is Better. Results by run (chart order): 2 = 1467441.79; 3 = 1462812.54; 1 = 1442717.14. SE as listed: +/- 12162.45, N = 3; +/- 3143.22, N = 3; +/- 19180.57, N = 15. 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Chaos Group V-RAY 5 - Mode: CPU. vsamples, More Is Better. Results by run (chart order): 3 = 4149; 1 = 4114; 2 = 4082. SE as listed: +/- 8.39, N = 3; +/- 14.71, N = 3; +/- 21.70, N = 3.
Redis 6.0.9 - Test: SADD. Requests Per Second, More Is Better. Results by run (chart order): 1 = 1893846.79; 2 = 1890949.04; 3 = 1866817.63. SE as listed: +/- 9940.28, N = 3; +/- 13660.64, N = 3; +/- 21892.90, N = 3. 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU. Inferences Per Minute, More Is Better. Results by run (chart order): 2 = 2825; 1 = 2818; 3 = 2788. SE as listed: +/- 0.76, N = 3; +/- 3.11, N = 3; +/- 11.02, N = 3. 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Redis 6.0.9 - Test: SET. Requests Per Second, More Is Better. Results by run (chart order): 1 = 1723299.04; 3 = 1704419.13; 2 = 1701062.25. SE as listed: +/- 3451.36, N = 3; +/- 18896.53, N = 3; +/- 15090.20, N = 3. 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
NAS Parallel Benchmarks 3.4 - Test / Class: EP.C. Total Mop/s, More Is Better. Results by run (chart order): 3 = 541.24; 2 = 541.21; 4 = 540.15; 1 = 534.87. SE as listed: +/- 1.14, N = 3; +/- 1.18, N = 3; +/- 1.52, N = 3; +/- 7.88, N = 4. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Ngspice 34 - Circuit: C2670. Seconds, Fewer Is Better. Results by run (chart order): 2 = 156.68; 1 = 156.73; 3 = 158.50. SE as listed: +/- 0.06, N = 3; +/- 0.27, N = 3; +/- 1.68, N = 3. 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
QuantLib 1.21. MFLOPS, More Is Better. Results by run (chart order): 2 = 2279.6; 4 = 2277.8; 1 = 2277.8; 3 = 2254.0. SE as listed: +/- 5.20, N = 3; +/- 2.18, N = 3; +/- 3.78, N = 3; +/- 18.72, N = 3. 1. (CXX) g++ options: -O3 -march=native -rdynamic
GnuPG 2.2.27 - 2.7GB Sample File Encryption. Seconds, Fewer Is Better. Results by run (chart order): 3 = 76.24; 2 = 76.34; 1 = 77.07. SE as listed: +/- 0.09, N = 3; +/- 0.05, N = 3; +/- 0.76, N = 3. 1. (CC) gcc options: -O2
lzbench 1.8 - Test: Zstd 8 - Process: Decompression. MB/s, More Is Better. Results by run (chart order): 2 = 1770; 1 = 1762; 3 = 1751. SE as listed: +/- 2.33, N = 3; +/- 8.24, N = 12. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Libdeflate 1 - Process: Compression. MB/s, More Is Better. Results by run (chart order): 1 = 206; 3 = 205; 2 = 204. SE as listed: +/- 0.58, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: XZ 0 - Process: Decompression. MB/s, More Is Better. Results by run (chart order): 2 = 105; 1 = 105; 3 = 104. SE as listed: +/- 0.33, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Ngspice 34 - Circuit: C7552. Seconds, Fewer Is Better. Results by run (chart order): 2 = 162.07; 3 = 162.60; 1 = 163.56. SE as listed: +/- 0.37, N = 3; +/- 0.86, N = 3; +/- 0.25, N = 3. 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
NAS Parallel Benchmarks 3.4 - Test / Class: CG.C. Total Mop/s, More Is Better. Results by run (chart order): 1 = 3279.96; 2 = 3268.09; 4 = 3258.39; 3 = 3251.12. SE as listed: +/- 5.94, N = 3; +/- 1.85, N = 3; +/- 2.09, N = 3; +/- 1.89, N = 3. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
lzbench 1.8 - Test: Zstd 1 - Process: Compression. MB/s, More Is Better. Results by run (chart order): 3 = 464; 1 = 463; 2 = 460. SE as listed: +/- 0.33, N = 3; +/- 2.33, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks 3.4 - Test / Class: FT.C. Total Mop/s, More Is Better. Results by run (chart order): 2 = 7409.62; 1 = 7406.46; 3 = 7358.83. SE as listed: +/- 65.59, N = 3; +/- 20.58, N = 3; +/- 68.80, N = 3. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4 - Test / Class: EP.D. Total Mop/s, More Is Better. Results by run (chart order): 3 = 542.43; 2 = 541.67; 1 = 538.74. SE as listed: +/- 0.86, N = 3; +/- 0.91, N = 3; +/- 3.99, N = 3. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ASKAP 1.0 - Test: tConvolve MPI - Gridding. Mpix/sec, More Is Better. Results by run (chart order): 3 = 1178.57; 1 = 1174.93; 2 = 1171.41. SE as listed: +/- 9.30, N = 3; +/- 3.52, N = 3; +/- 0.00, N = 3. 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench 1.8 - Test: Brotli 2 - Process: Compression. MB/s, More Is Better. Results by run (chart order): 1 = 176; 3 = 175; 2 = 175. SE as listed: +/- 0.58, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Etcpak 0.7 - Configuration: ETC1 + Dithering. Mpx/s, More Is Better. Results by run (chart order): 1 = 281.18; 3 = 281.13; 2 = 281.00; 4 = 279.59. SE as listed: +/- 0.76, N = 3; +/- 0.09, N = 3; +/- 0.25, N = 3; +/- 1.35, N = 3. 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
lzbench 1.8 - Test: Brotli 0 - Process: Decompression. MB/s, More Is Better. Results by run (chart order): 1 = 566; 2 = 564; 3 = 563. SE as listed: +/- 0.58, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ASKAP 1.0 - Test: Hogbom Clean OpenMP. Iterations Per Second, More Is Better. Results by run (chart order): 1 = 134.77; 2 = 134.53; 3 = 134.11. SE as listed: +/- 0.18, N = 3; +/- 0.26, N = 3; +/- 0.06, N = 3. 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench 1.8 - Test: Brotli 0 - Process: Compression. MB/s, More Is Better. Results by run (chart order): 1 = 414; 3 = 412; 2 = 412. SE as listed: +/- 1.15, N = 3; +/- 2.19, N = 3; +/- 1.15, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ASKAP 1.0 - Test: tConvolve OpenMP - Gridding. Million Grid Points Per Second, More Is Better. Results by run (chart order): 1 = 956.61; 3 = 953.22; 2 = 952.05. SE as listed: +/- 1.14, N = 3; +/- 4.11, N = 3; +/- 1.14, N = 3. 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Etcpak 0.7 - Configuration: ETC1. Mpx/s, More Is Better. Results by run (chart order): 3 = 294.90; 1 = 294.90; 2 = 294.15; 4 = 293.49. SE as listed: +/- 0.45, N = 3; +/- 1.22, N = 3; +/- 0.21, N = 3; +/- 1.38, N = 3. 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP. ms, Fewer Is Better. Results by run (chart order): 2 = 134942.14; 3 = 135109.28; 1 = 135570.81. SE as listed: +/- 54.89, N = 3; +/- 189.25, N = 3; +/- 536.30, N = 3. 1. (CXX) g++ options: -O3 -march=native -fopenmp
QMCPACK 3.10 - Input: simple-H2O. Total Execution Time - Seconds, Fewer Is Better. Results by run (chart order): 1 = 30.55; 2 = 30.61; 3 = 30.69. SE as listed: +/- 0.26, N = 3; +/- 0.10, N = 3; +/- 0.31, N = 3. 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
lzbench 1.8 - Test: Brotli 2 - Process: Decompression. MB/s, More Is Better. Results by run (chart order): 2 = 654; 1 = 653; 3 = 651. SE as listed: +/- 0.67, N = 3; +/- 1.20, N = 3; +/- 1.20, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
WebP2 Image Encode 20210126 - Encode Settings: Default. Seconds, Fewer Is Better. Results by run (chart order): 1 = 9.539; 3 = 9.561; 2 = 9.580. SE as listed: +/- 0.016, N = 3; +/- 0.018, N = 3; +/- 0.023, N = 3. 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
ONNX Runtime 1.6 - Model: yolov4 - Device: OpenMP CPU. Inferences Per Minute, More Is Better. Results by run (chart order): 3 = 234; 2 = 234; 1 = 233. SE as listed: +/- 0.29, N = 3; +/- 0.29, N = 3; +/- 0.44, N = 3. 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
lzbench 1.8 - Test: Zstd 1 - Process: Decompression. MB/s, More Is Better. Results by run (chart order): 2 = 1606; 3 = 1600; 1 = 1600. SE as listed: +/- 0.33, N = 3; +/- 6.24, N = 3; +/- 7.62, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Etcpak 0.7 - Configuration: DXT1. Mpx/s, More Is Better. Results by run (chart order): 3 = 1151.18; 4 = 1151.06; 2 = 1147.66; 1 = 1146.89. SE as listed: +/- 4.09, N = 3; +/- 2.68, N = 3; +/- 2.80, N = 3; +/- 2.32, N = 3. 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7 - Configuration: ETC2. Mpx/s, More Is Better. Results by run (chart order): 3 = 168.00; 2 = 168.00; 4 = 167.77; 1 = 167.38. SE as listed: +/- 0.16, N = 3; +/- 0.18, N = 3; +/- 0.31, N = 3; +/- 0.27, N = 3. 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
FinanceBench 2016-07-25 - Benchmark: Repo OpenMP. ms, Fewer Is Better. Results by run (chart order): 1 = 76866.68; 2 = 77080.15; 3 = 77117.47. SE as listed: +/- 18.98, N = 3; +/- 140.84, N = 3; +/- 146.10, N = 3. 1. (CXX) g++ options: -O3 -march=native -fopenmp
WebP2 Image Encode 20210126 - Encode Settings: Quality 100, Compression Effort 5. Seconds, Fewer Is Better. Results by run (chart order): 1 = 32.28; 2 = 32.33; 3 = 32.38. SE as listed: +/- 0.01, N = 3; +/- 0.03, N = 3; +/- 0.10, N = 3. 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
lzbench 1.8 - Test: Libdeflate 1 - Process: Decompression. MB/s, More Is Better. Results by run (chart order): 2 = 1023; 3 = 1020; 1 = 1020. SE as listed: +/- 0.58, N = 3; +/- 1.76, N = 3; +/- 1.67, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ONNX Runtime 1.6 - Model: bertsquad-10 - Device: OpenMP CPU. Inferences Per Minute, More Is Better. Results by run (chart order): 2 = 359; 1 = 359; 3 = 358. SE as listed: +/- 0.29, N = 3; +/- 0.50, N = 3; +/- 0.17, N = 3. 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
NAS Parallel Benchmarks 3.4 - Test / Class: BT.C. Total Mop/s, More Is Better. Results by run (chart order): 2 = 14093.72; 1 = 14085.02; 4 = 14060.83; 3 = 14055.30. SE as listed: +/- 4.82, N = 3; +/- 14.63, N = 3; +/- 9.97, N = 3; +/- 19.62, N = 3. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ASKAP 1.0 - Test: tConvolve MPI - Degridding. Mpix/sec, More Is Better. Results by run (chart order): 3 = 1035.78; 1 = 1033.09; 2 = 1033.05. SE as listed: +/- 2.73, N = 3; +/- 4.70, N = 3; +/- 0.00, N = 3. 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
WebP2 Image Encode 20210126 - Encode Settings: Quality 75, Compression Effort 7. Seconds, Fewer Is Better. Results by run (chart order): 1 = 565.94; 3 = 566.12; 2 = 567.33. SE as listed: +/- 0.29, N = 3; +/- 0.26, N = 3; +/- 0.86, N = 3. 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode 20210126 - Encode Settings: Quality 95, Compression Effort 7. Seconds, Fewer Is Better. Results by run (chart order): 1 = 1042.44; 3 = 1044.92; 2 = 1045.00. SE as listed: +/- 0.82, N = 3; +/- 1.16, N = 3; +/- 1.34, N = 3. 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
Pennant 1.0.1 - Test: sedovbig. Hydro Cycle Time - Seconds, Fewer Is Better. Results by run (chart order): 3 = 163.73; 1 = 163.79; 2 = 164.04. SE as listed: +/- 0.07, N = 3; +/- 0.19, N = 3; +/- 0.44, N = 3. 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
ONNX Runtime 1.6 - Model: shufflenet-v2-10 - Device: OpenMP CPU. Inferences Per Minute, More Is Better. Results by run (chart order): 2 = 13451; 1 = 13436; 3 = 13426. SE as listed: +/- 9.37, N = 3; +/- 19.44, N = 3; +/- 20.55, N = 3. 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080. Frames / Sec, More Is Better. Results by run (chart order): 3 = 82.93; 2 = 82.93; 4 = 82.90; 1 = 82.79. SE as listed: +/- 0.03, N = 3; +/- 0.03, N = 3; +/- 0.05, N = 3; +/- 0.05, N = 3.
ASKAP 1.0 - Test: tConvolve OpenMP - Degridding. Million Grid Points Per Second, More Is Better. Results by run (chart order): 2 = 1315.94; 1 = 1315.94; 3 = 1313.77. SE as listed: +/- 2.16, N = 3; +/- 2.16, N = 3; +/- 2.16, N = 3. 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
NAS Parallel Benchmarks 3.4 - Test / Class: SP.B. Total Mop/s, More Is Better. Results by run (chart order): 1 = 4474.21; 3 = 4470.48; 2 = 4466.84. SE as listed: +/- 1.45, N = 3; +/- 0.41, N = 3; +/- 0.68, N = 3. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080. MiPolys / Sec, More Is Better. Results by run (chart order): 2 = 864.26; 3 = 864.18; 4 = 863.95; 1 = 862.84. SE as listed: +/- 0.27, N = 3; +/- 0.25, N = 3; +/- 0.55, N = 3; +/- 0.50, N = 3.
NAS Parallel Benchmarks 3.4 - Test / Class: IS.D. Total Mop/s, More Is Better. Results by run (chart order): 1 = 522.75; 2 = 522.40; 3 = 521.96. SE as listed: +/- 1.35, N = 3; +/- 0.44, N = 3; +/- 0.15, N = 3. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Pennant 1.0.1 - Test: leblancbig. Hydro Cycle Time - Seconds, Fewer Is Better. Results by run (chart order): 2 = 99.64; 1 = 99.70; 3 = 99.75. SE as listed: +/- 0.03, N = 3; +/- 0.05, N = 3; +/- 0.08, N = 3. 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
WebP2 Image Encode 20210126 - Encode Settings: Quality 100, Lossless Compression. Seconds, Fewer Is Better. Results by run (chart order): 3 = 2061.11; 1 = 2061.21; 2 = 2062.85. SE as listed: +/- 0.26, N = 3; +/- 0.77, N = 3; +/- 0.95, N = 3. 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
ASKAP 1.0 - Test: tConvolve MT - Degridding. Million Grid Points Per Second, More Is Better. Results by run (chart order): 1 = 1200.93; 2 = 1200.70; 3 = 1200.03. SE as listed: +/- 0.45, N = 3; +/- 0.00, N = 3; +/- 0.39, N = 3. 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
NAS Parallel Benchmarks 3.4 - Test / Class: MG.C. Total Mop/s, More Is Better. Results by run (chart order): 2 = 7321.00; 1 = 7319.68; 3 = 7316.58. SE as listed: +/- 1.62, N = 3; +/- 1.93, N = 3; +/- 0.87, N = 3. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C. Total Mop/s, More Is Better. Results by run (chart order): 1 = 15044.95; 2 = 15039.27; 3 = 15037.87. SE as listed: +/- 6.51, N = 3; +/- 13.61, N = 3; +/- 17.13, N = 3. 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ASKAP 1.0 - Test: tConvolve MT - Gridding. Million Grid Points Per Second, More Is Better. Results by run (chart order): 2 = 948.23; 1 = 948.09; 3 = 947.81. SE as listed: +/- 0.14, N = 3; +/- 0.28, N = 3; +/- 0.28, N = 3. 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Gcrypt Library 1.9. Seconds, Fewer Is Better. Results by run (chart order): 2 = 227.92; 1 = 228.00; 3 = 228.02. SE as listed: +/- 0.44, N = 3; +/- 0.20, N = 3; +/- 0.58, N = 3. 1. (CC) gcc options: -O2 -fvisibility=hidden
ONNX Runtime 1.6 - Model: fcn-resnet101-11 - Device: OpenMP CPU. Inferences Per Minute, More Is Better. Results by run (chart order): 3 = 39; 2 = 39; 1 = 39. 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
lzbench 1.8 - Test: Crush 0 - Process: Decompression. MB/s, More Is Better. Results by run (chart order): 3 = 463; 2 = 463; 1 = 463. SE as listed: +/- 0.67, N = 3; +/- 0.67, N = 3; +/- 0.67, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: XZ 0 - Process: Compression. MB/s, More Is Better. Results by run (chart order): 3 = 39; 2 = 39; 1 = 39. SE as listed: +/- 0.33, N = 3. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Zstd 8 - Process: Compression. MB/s, More Is Better. Results by run (chart order): 1 = 86; 2 = 84; 3 = 83. SE as listed: +/- 0.67, N = 3; +/- 1.85, N = 12. 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Phoronix Test Suite v10.8.4