8100 Sunday Intel Core i3-8100 testing with a ASRock Z370M-ITX/ac (P4.10 BIOS) and Intel 8th Gen Core Gaussian Mixture Model 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101274-HA-8100SUNDA63&gru .
8100 Sunday Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Core i3-8100 @ 3.60GHz (4 Cores) ASRock Z370M-ITX/ac (P4.10 BIOS) Intel 8th Gen Core 4-core Desktop 8GB 60GB DREVO X1 SSD Intel 8th Gen Core Gaussian Mixture Model 3GB (1100MHz) Realtek ALC892 VA2431 Intel I219-V + Intel I211 + Intel Dual Band-AC 3168NGW Ubuntu 20.04 5.9.0-050900rc1daily20200819-generic (x86_64) 20200818 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.8 1.2.131 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 1.9.1 Python Details - Python 3.8.5 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
8100 Sunday onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU cpuminer-opt: Magi cpuminer-opt: x25x cpuminer-opt: Deepcoin cpuminer-opt: Ringcoin cpuminer-opt: Blake-2 S cpuminer-opt: Garlicoin cpuminer-opt: Skeincoin cpuminer-opt: Myriad-Groestl cpuminer-opt: LBC, LBRY Credits cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Triple SHA-256, Onecoin lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression lzbench: Libdeflate 1 - Decompression quantlib: etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 etcpak: ETC1 + Dithering redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET npb: BT.C npb: CG.C npb: EP.C npb: EP.D npb: FT.C npb: LU.C npb: MG.C npb: SP.B gcrypt: gnupg: 2.7GB Sample File Encryption qmcpack: simple-H2O 1 2 3 234 365 39 15441 2610 88.99 101.91 2678.50 704.60 113077 657.79 14183 4525.65 9779.44 21547 28328 36 95 428 1484 77 1636 86 421 383 518 160 597 192 931 2078.4 1078.732 271.424 154.592 259.432 2368756.42 1919397.87 1478285.38 2270967.25 1740309.54 15309.08 3796.83 528.70 522.30 8303.11 16154.78 8486.66 5031.43 249.290 84.728 30.884 234 366 39 15372 2607 89.02 101.79 2677.31 703.55 113383 657.63 13863 4520.53 9781.44 21550 28470 35 95 428 1487 76 1636 86 422 382 519 160 597 193 931 2052.5 1088.386 270.651 154.626 259.543 1508953.79 1913726.08 1467455.84 2117653.38 1710738.66 15355.49 3805.69 527.79 525.32 8538.85 16156.70 8499.77 5045.92 250.092 85.048 30.937 233 366 39 15390 2621 88.88 101.85 2693.01 703.96 113607 657.20 14007 4541.28 9740.16 36 95 427 1488 77 1636 86 422 383 518 160 597 192 932 2045.3 1082.698 271.032 154.599 259.605 1510825.33 1924693.58 1474399.87 2105763.96 1706001.96 15346.20 3809.85 519.70 529.79 8398.79 15885.31 8512.95 5033.86 250.673 85.071 31.179 OpenBenchmarking.org
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 3 50 100 150 200 250 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 234 234 233 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 80 160 240 320 400 SE +/- 0.33, N = 3 SE +/- 0.29, N = 3 SE +/- 0.44, N = 3 365 366 366 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 9 18 27 36 45 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 39 39 39 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 3K 6K 9K 12K 15K SE +/- 15.07, N = 3 SE +/- 27.44, N = 3 SE +/- 22.23, N = 3 15441 15372 15390 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 600 1200 1800 2400 3000 SE +/- 11.85, N = 3 SE +/- 9.29, N = 3 SE +/- 7.42, N = 3 2610 2607 2621 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi 1 2 3 20 40 60 80 100 SE +/- 0.05, N = 3 SE +/- 0.18, N = 3 SE +/- 0.33, N = 3 88.99 89.02 88.88 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x 1 2 3 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 101.91 101.79 101.85 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin 1 2 3 600 1200 1800 2400 3000 SE +/- 6.01, N = 3 SE +/- 0.93, N = 3 SE +/- 6.07, N = 3 2678.50 2677.31 2693.01 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Ringcoin 1 2 3 150 300 450 600 750 SE +/- 1.41, N = 3 SE +/- 1.55, N = 3 SE +/- 1.40, N = 3 704.60 703.55 703.96 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S 1 2 3 20K 40K 60K 80K 100K SE +/- 699.87, N = 3 SE +/- 662.18, N = 3 SE +/- 459.00, N = 3 113077 113383 113607 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin 1 2 3 140 280 420 560 700 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 SE +/- 0.25, N = 3 657.79 657.63 657.20 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin 1 2 3 3K 6K 9K 12K 15K SE +/- 3.33, N = 3 SE +/- 162.88, N = 6 SE +/- 128.63, N = 3 14183 13863 14007 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Myriad-Groestl 1 2 3 1000 2000 3000 4000 5000 SE +/- 10.16, N = 3 SE +/- 16.51, N = 3 SE +/- 1.14, N = 3 4525.65 4520.53 4541.28 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: LBC, LBRY Credits 1 2 3 2K 4K 6K 8K 10K SE +/- 13.95, N = 3 SE +/- 11.01, N = 3 SE +/- 28.38, N = 3 9779.44 9781.44 9740.16 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Quad SHA-256, Pyrite 1 2 5K 10K 15K 20K 25K SE +/- 23.33, N = 3 SE +/- 10.00, N = 3 21547 21550 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Triple SHA-256, Onecoin 1 2 6K 12K 18K 24K 30K SE +/- 389.51, N = 12 SE +/- 85.05, N = 3 28328 28470 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1 2 3 8 16 24 32 40 SE +/- 0.58, N = 3 36 35 36 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 20 40 60 80 100 95 95 95 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 90 180 270 360 450 SE +/- 0.33, N = 3 SE +/- 1.33, N = 3 428 428 427 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 300 600 900 1200 1500 SE +/- 2.08, N = 3 SE +/- 0.67, N = 3 SE +/- 1.00, N = 3 1484 1487 1488 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 20 40 60 80 100 SE +/- 0.33, N = 3 77 76 77 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 400 800 1200 1600 2000 SE +/- 0.88, N = 3 SE +/- 1.20, N = 3 1636 1636 1636 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 20 40 60 80 100 86 86 86 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 90 180 270 360 450 421 422 422 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 80 160 240 320 400 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 383 382 383 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 110 220 330 440 550 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 518 519 518 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 160 160 160 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 130 260 390 520 650 SE +/- 0.33, N = 3 597 597 597 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 40 80 120 160 200 SE +/- 0.67, N = 3 192 193 192 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Decompression 1 2 3 200 400 600 800 1000 931 931 932 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 400 800 1200 1600 2000 SE +/- 2.40, N = 3 SE +/- 29.66, N = 4 SE +/- 25.01, N = 6 2078.4 2052.5 2045.3 1. (CXX) g++ options: -O3 -march=native -rdynamic
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 200 400 600 800 1000 SE +/- 0.20, N = 3 SE +/- 0.42, N = 3 SE +/- 2.00, N = 3 1078.73 1088.39 1082.70 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 60 120 180 240 300 SE +/- 0.41, N = 3 SE +/- 0.05, N = 3 SE +/- 0.41, N = 3 271.42 270.65 271.03 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 154.59 154.63 154.60 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 60 120 180 240 300 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 259.43 259.54 259.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 500K 1000K 1500K 2000K 2500K SE +/- 25595.08, N = 3 SE +/- 4107.34, N = 3 SE +/- 6482.15, N = 3 2368756.42 1508953.79 1510825.33 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 8758.39, N = 3 SE +/- 13317.33, N = 3 SE +/- 10909.16, N = 3 1919397.87 1913726.08 1924693.58 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 300K 600K 900K 1200K 1500K SE +/- 6852.92, N = 3 SE +/- 9896.92, N = 3 SE +/- 3881.16, N = 3 1478285.38 1467455.84 1474399.87 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 500K 1000K 1500K 2000K 2500K SE +/- 23894.94, N = 7 SE +/- 17838.66, N = 3 SE +/- 25309.76, N = 3 2270967.25 2117653.38 2105763.96 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 5600.63, N = 3 SE +/- 29400.90, N = 3 SE +/- 17741.35, N = 3 1740309.54 1710738.66 1706001.96 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 1 2 3 3K 6K 9K 12K 15K SE +/- 12.12, N = 3 SE +/- 12.24, N = 3 SE +/- 13.75, N = 3 15309.08 15355.49 15346.20 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 1 2 3 800 1600 2400 3200 4000 SE +/- 2.25, N = 3 SE +/- 5.00, N = 3 SE +/- 1.90, N = 3 3796.83 3805.69 3809.85 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 110 220 330 440 550 SE +/- 1.06, N = 3 SE +/- 0.67, N = 3 SE +/- 8.92, N = 3 528.70 527.79 519.70 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 110 220 330 440 550 SE +/- 6.38, N = 6 SE +/- 3.97, N = 3 SE +/- 0.30, N = 3 522.30 525.32 529.79 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 1 2 3 2K 4K 6K 8K 10K SE +/- 261.31, N = 12 SE +/- 32.90, N = 3 SE +/- 51.99, N = 3 8303.11 8538.85 8398.79 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 3K 6K 9K 12K 15K SE +/- 4.68, N = 3 SE +/- 9.60, N = 3 SE +/- 275.60, N = 11 16154.78 16156.70 15885.31 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 1 2 3 2K 4K 6K 8K 10K SE +/- 9.36, N = 3 SE +/- 7.54, N = 3 SE +/- 3.69, N = 3 8486.66 8499.77 8512.95 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 1 2 3 1100 2200 3300 4400 5500 SE +/- 1.69, N = 3 SE +/- 1.44, N = 3 SE +/- 1.40, N = 3 5031.43 5045.92 5033.86 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 50 100 150 200 250 SE +/- 0.36, N = 3 SE +/- 0.56, N = 3 SE +/- 0.71, N = 3 249.29 250.09 250.67 1. (CC) gcc options: -O2 -fvisibility=hidden
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 1 2 3 20 40 60 80 100 SE +/- 1.03, N = 5 SE +/- 0.93, N = 3 SE +/- 1.05, N = 3 84.73 85.05 85.07 1. (CC) gcc options: -O2
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.28, N = 3 SE +/- 0.43, N = 3 30.88 30.94 31.18 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Phoronix Test Suite v10.8.4