Intel Xeon E5-1680 v3 testing with an ASUS X99-A (3902 BIOS) and eVGA NVIDIA NVE7 1GB on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2101305-HA-XEONE5HAS31
HTML result view exported from: https://openbenchmarking.org/result/2101305-HA-XEONE5HAS31&grs&sor .
Xeon E5 Haswell 2021
Result identifiers: 1, 2, 3
  Processor: Intel Xeon E5-1680 v3 @ 3.80GHz (8 Cores / 16 Threads)
  Motherboard: ASUS X99-A (3902 BIOS)
  Chipset: Intel Xeon E7 v3/Xeon
  Memory: 16GB
  Disk: PNY CS900 240GB
  Graphics: eVGA NVIDIA NVE7 1GB
  Audio: Realtek ALC1150
  Monitor: G237HL
  Network: Intel I218-V
  OS: Ubuntu 20.04
  Kernel: 5.4.0-58-generic (x86_64)
  Desktop: GNOME Shell 3.36.4
  Display Server: X Server 1.20.8
  Display Driver: nouveau
  OpenGL: 4.3 Mesa 20.0.8
  Compiler: GCC 9.3.0
  File-System: ext4
  Screen Resolution: 1920x1080
Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x44
Python Details - Python 3.8.5
Security Details - itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Xeon E5 Haswell 2021
Test | 1 | 2 | 3
redis: LPOP | 2184273.00 | 1392683.21 | 1381617.17
redis: GET | 2117431.17 | 1929425.96 | 1945558.42
redis: SET | 1612741.71 | 1564875.57 | 1579700.62
npb: EP.D | 960.23 | 933.49 | 959.77
askap: tConvolve MPI - Degridding | 1711.34 | 1718.82 | 1674.97
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 135.59 | 135.41 | 132.34
askap: tConvolve OpenMP - Gridding | 1560.99 | 1591.53 | 1595.06
askap: tConvolve MPI - Gridding | 2162.83 | 2174.68 | 2133.39
redis: SADD | 1735755.75 | 1767770.42 | 1749575.63
dav1d: Summer Nature 1080p | 362.38 | 361.53 | 356.18
rav1e: 10 | 2.484 | 2.522 | 2.483
npb: EP.C | 953.43 | 938.84 | 940.83
npb: FT.C | 13022.66 | 13203.92 | 13190.13
npb: LU.C | 25421.02 | 25115.18 | 25428.46
dav1d: Chimera 1080p | 446.57 | 445.73 | 441.18
redis: LPUSH | 1331267.79 | 1347316.54 | 1340317.13
qmcpack: simple-H2O | 37.697 | 37.693 | 38.065
lzbench: Zstd 8 - Decompression | 1572 | 1587 | 1578
lzbench: Zstd 1 - Decompression | 1511 | 1518 | 1525
build-godot: Time To Compile | 197.715 | 199.482 | 197.972
mnn: resnet-v2-50 | 40.644 | 40.996 | 40.774
rav1e: 5 | 0.859 | 0.855 | 0.852
mnn: mobilenet-v1-1.0 | 2.957 | 2.979 | 2.970
gcrypt: | 248.828 | 250.660 | 249.966
rav1e: 1 | 0.294 | 0.293 | 0.292
webp2: Quality 95, Compression Effort 7 | 613.025 | 610.956 | 614.856
etcpak: DXT1 | 1148.783 | 1153.826 | 1155.934
kripke: | 42778687 | 42986933 | 42728223
dav1d: Summer Nature 4K | 129.05 | 128.51 | 129.26
dav1d: Chimera 1080p 10-bit | 66.67 | 66.49 | 66.29
gnupg: 2.7GB Sample File Encryption | 74.607 | 74.192 | 74.316
lzbench: Brotli 0 - Decompression | 553 | 550 | 553
rav1e: 6 | 1.130 | 1.124 | 1.124
cp2k: Fayalite-FIST Data | 1366.236 | 1361.662 | 1359.184
cryptsetup: AES-XTS 256b Encryption | 1873.5 | 1882.9 | 1881.7
tnn: CPU - MobileNet v2 | 309.695 | 309.842 | 311.222
cryptsetup: Twofish-XTS 512b Decryption | 378.9 | 379.1 | 377.3
financebench: Repo OpenMP | 62310.579427 | 62205.911458 | 62019.235677
lzbench: Zstd 1 - Compression | 443 | 441 | 443
financebench: Bonds OpenMP | 106201.333984 | 105922.046875 | 105740.312500
onnx: super-resolution-10 - OpenMP CPU | 4352 | 4334 | 4347
cryptsetup: Twofish-XTS 256b Encryption | 377.3 | 378.7 | 378.6
cryptsetup: Serpent-XTS 512b Encryption | 603.4 | 602.6 | 601.2
qe: AUSURF112 | 2235.80 | 2229.12 | 2237.21
webp2: Default | 6.167 | 6.145 | 6.146
lammps: Rhodopsin Protein | 5.045 | 5.063 | 5.063
cryptsetup: Serpent-XTS 256b Encryption | 601.4 | 603.5 | 602.5
webp2: Quality 75, Compression Effort 7 | 336.017 | 337.121 | 336.047
etcpak: ETC1 | 250.283 | 250.319 | 249.513
lzbench: Brotli 2 - Decompression | 642 | 642 | 644
mnn: inception-v3 | 44.266 | 44.279 | 44.145
openfoam: Motorbike 30M | 198.64 | 198.14 | 198.05
askap: tConvolve OpenMP - Degridding | 2295.31 | 2288.77 | 2295.31
quantlib: | 1808.3 | 1805.7 | 1803.3
mnn: MobileNetV2_224 | 3.723 | 3.713 | 3.716
lzbench: Brotli 0 - Compression | 397 | 396 | 396
lulesh: | 5187.4226 | 5192.2705 | 5200.2474
cryptsetup: Serpent-XTS 256b Decryption | 584.7 | 584.6 | 583.3
cryptsetup: Serpent-XTS 512b Decryption | 584.4 | 584.8 | 583.4
tnn: CPU - SqueezeNet v1.1 | 296.831 | 296.388 | 296.155
onnx: shufflenet-v2-10 - OpenMP CPU | 10700 | 10696 | 10676
npb: CG.C | 4720.22 | 4729.76 | 4719.69
lzbench: Crush 0 - Decompression | 470 | 471 | 470
cryptsetup: Twofish-XTS 256b Decryption | 378.8 | 379.5 | 378.7
cryptsetup: AES-XTS 512b Decryption | 1545.6 | 1547.7 | 1544.5
mnn: SqueezeNetV1.0 | 7.258 | 7.259 | 7.273
lzbench: Libdeflate 1 - Decompression | 1081 | 1082 | 1080
etcpak: ETC2 | 149.072 | 148.840 | 149.114
cryptsetup: PBKDF2-whirlpool | 578472 | 578049 | 579110
etcpak: ETC1 + Dithering | 240.451 | 240.467 | 240.112
cryptsetup: AES-XTS 256b Decryption | 1893.1 | 1892.6 | 1890.4
cryptsetup: AES-XTS 512b Encryption | 1556.0 | 1557.8 | 1555.6
cryptsetup: PBKDF2-sha512 | 1442334 | 1440358 | 1441674
cryptsetup: Twofish-XTS 512b Encryption | 378.3 | 378.7 | 378.2
webp2: Quality 100, Compression Effort 5 | 16.286 | 16.283 | 16.303
amg: | 333264500 | 333615967 | 333464900
askap: Hogbom Clean OpenMP | 249.378 | 249.170 | 249.170
askap: tConvolve MT - Gridding | 1414.22 | 1415.15 | 1414.53
askap: tConvolve MT - Degridding | 1880.56 | 1881.67 | 1881.12
synthmark: VoiceMark_100 | 582.138 | 582.006 | 582.306
onnx: fcn-resnet101-11 - OpenMP CPU | 60 | 60 | 60
onnx: bertsquad-10 - OpenMP CPU | 582 | 582 | 582
onnx: yolov4 - OpenMP CPU | 328 | 328 | 328
lzbench: Libdeflate 1 - Compression | 201 | 201 | 201
lzbench: Brotli 2 - Compression | 164 | 164 | 164
lzbench: Crush 0 - Compression | 87 | 87 | 87
lzbench: Zstd 8 - Compression | 73 | 73 | 73
lzbench: XZ 0 - Decompression | 105 | 105 | 105
lzbench: XZ 0 - Compression | 37 | 37 | 37
warsow: 1920 x 1080 | 21.3 | 21.3 | 21.3
npb: MG.C | 16156.63 | 15664.08 | 16170.45
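The overview above mixes more-is-better and fewer-is-better tests, so raw values cannot be averaged directly. The following is a minimal Python sketch, not part of the Phoronix Test Suite, that normalizes a few rows against result 1 so that values above 1.0 always mean "better than result 1" and then takes a geometric mean. The choice of rows is arbitrary, the function names are illustrative, and the better/worse direction of each row is taken from the per-test headings in this file.

```python
import math

# (values for results 1, 2, 3; True when higher is better).
# Numbers are copied from the overview; the subset is arbitrary.
rows = {
    "redis: GET": ((2117431.17, 1929425.96, 1945558.42), True),
    "npb: EP.D": ((960.23, 933.49, 959.77), True),
    "cloverleaf: Lagrangian-Eulerian Hydrodynamics": ((135.59, 135.41, 132.34), False),
    "qmcpack: simple-H2O": ((37.697, 37.693, 38.065), False),
}

def relative_to_first(values, higher_is_better):
    """Scale values so result 1 is 1.0 and larger always means better."""
    base = values[0]
    if higher_is_better:
        return [v / base for v in values]
    return [base / v for v in values]

def geometric_mean(xs):
    """Geometric mean via the mean of logarithms."""
    return math.exp(sum(math.log(x) for x in xs) / len(xs))

normalized = [relative_to_first(vals, hib) for vals, hib in rows.values()]
for i, label in enumerate(("1", "2", "3")):
    print(label, round(geometric_mean([r[i] for r in normalized]), 4))
```

A geometric mean is used here because it treats a 10% gain on one test and a 10% loss on another symmetrically, whatever the units of each test.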
Redis 6.0.9 Test: LPOP (Requests Per Second, More Is Better)
  1: 2184273.00 (SE +/- 23519.20, N = 3)
  2: 1392683.21 (SE +/- 10036.91, N = 3)
  3: 1381617.17 (SE +/- 6117.34, N = 3)
  Notes: 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
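The LPOP figure for result 1 stands well apart from results 2 and 3. A minimal Python sketch, using only the three means and standard errors listed above, expresses that gap as a percentage and as a multiple of the combined standard error. The function names are illustrative, and treating the results as independent samples is an assumption.

```python
import math

# (mean, standard error) copied from the Redis LPOP result above.
lpop = {
    "1": (2184273.00, 23519.20),
    "2": (1392683.21, 10036.91),
    "3": (1381617.17, 6117.34),
}

def percent_gap(a, b):
    """Percentage by which mean a exceeds mean b."""
    return (a - b) / b * 100.0

def gap_in_standard_errors(a, se_a, b, se_b):
    """Difference of means divided by the combined standard error,
    assuming the two results are independent."""
    return (a - b) / math.sqrt(se_a ** 2 + se_b ** 2)

mean1, se1 = lpop["1"]
for label in ("2", "3"):
    mean, se = lpop[label]
    print(label,
          round(percent_gap(mean1, mean), 1),
          round(gap_in_standard_errors(mean1, se1, mean, se), 1))
```

A gap of many combined standard errors suggests the difference is not explained by trial-to-trial noise alone; the export itself gives no cause for it.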
Redis 6.0.9 Test: GET (Requests Per Second, More Is Better)
  1: 2117431.17 (SE +/- 15428.09, N = 3)
  3: 1945558.42 (SE +/- 9787.91, N = 3)
  2: 1929425.96 (SE +/- 11787.58, N = 3)
  Notes: 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 Test: SET (Requests Per Second, More Is Better)
  1: 1612741.71 (SE +/- 10891.82, N = 3)
  3: 1579700.62 (SE +/- 17997.98, N = 15)
  2: 1564875.57 (SE +/- 17648.94, N = 7)
  Notes: 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
NAS Parallel Benchmarks 3.4 Test / Class: EP.D (Total Mop/s, More Is Better)
  1: 960.23 (SE +/- 1.78, N = 3)
  3: 959.77 (SE +/- 1.18, N = 3)
  2: 933.49 (SE +/- 12.44, N = 5)
  Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ASKAP 1.0 Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
  2: 1718.82 (SE +/- 7.47, N = 3)
  1: 1711.34 (SE +/- 7.47, N = 3)
  3: 1674.97 (SE +/- 9.46, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
CloverLeaf Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
  3: 132.34 (SE +/- 0.13, N = 3)
  2: 135.41 (SE +/- 0.24, N = 3)
  1: 135.59 (SE +/- 0.18, N = 3)
  Notes: 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
ASKAP 1.0 Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
  3: 1595.06 (SE +/- 23.70, N = 3)
  2: 1591.53 (SE +/- 16.87, N = 3)
  1: 1560.99 (SE +/- 26.44, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
  2: 2174.68 (SE +/- 11.95, N = 3)
  1: 2162.83 (SE +/- 15.66, N = 3)
  3: 2133.39 (SE +/- 10.01, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Redis 6.0.9 Test: SADD (Requests Per Second, More Is Better)
  2: 1767770.42 (SE +/- 1283.01, N = 3)
  3: 1749575.63 (SE +/- 7435.40, N = 3)
  1: 1735755.75 (SE +/- 16689.59, N = 3)
  Notes: 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
dav1d 0.8.1 Video Input: Summer Nature 1080p (FPS, More Is Better)
  1: 362.38 (SE +/- 0.33, N = 3; MIN: 314.39 / MAX: 395.69)
  2: 361.53 (SE +/- 0.63, N = 3; MIN: 316.58 / MAX: 394.68)
  3: 356.18 (SE +/- 4.40, N = 3; MIN: 234.5 / MAX: 393.88)
  Notes: 1. (CC) gcc options: -pthread
rav1e 0.4 Speed: 10 (Frames Per Second, More Is Better)
  2: 2.522 (SE +/- 0.031, N = 3)
  1: 2.484 (SE +/- 0.003, N = 3)
  3: 2.483 (SE +/- 0.010, N = 3)
NAS Parallel Benchmarks 3.4 Test / Class: EP.C (Total Mop/s, More Is Better)
  1: 953.43 (SE +/- 1.25, N = 3)
  3: 940.83 (SE +/- 12.50, N = 3)
  2: 938.84 (SE +/- 8.16, N = 11)
  Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4 Test / Class: FT.C (Total Mop/s, More Is Better)
  2: 13203.92 (SE +/- 3.79, N = 3)
  3: 13190.13 (SE +/- 11.15, N = 3)
  1: 13022.66 (SE +/- 108.74, N = 3)
  Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4 Test / Class: LU.C (Total Mop/s, More Is Better)
  3: 25428.46 (SE +/- 20.70, N = 3)
  1: 25421.02 (SE +/- 18.45, N = 3)
  2: 25115.18 (SE +/- 291.88, N = 3)
  Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
dav1d 0.8.1 Video Input: Chimera 1080p (FPS, More Is Better)
  1: 446.57 (SE +/- 1.21, N = 3; MIN: 334.94 / MAX: 621.82)
  2: 445.73 (SE +/- 1.39, N = 3; MIN: 335.4 / MAX: 614.57)
  3: 441.18 (SE +/- 3.79, N = 3; MIN: 334.23 / MAX: 614.69)
  Notes: 1. (CC) gcc options: -pthread
Redis 6.0.9 Test: LPUSH (Requests Per Second, More Is Better)
  2: 1347316.54 (SE +/- 11814.24, N = 3)
  3: 1340317.13 (SE +/- 15361.49, N = 3)
  1: 1331267.79 (SE +/- 21562.31, N = 12)
  Notes: 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
QMCPACK 3.10 Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
  2: 37.69 (SE +/- 0.18, N = 3)
  1: 37.70 (SE +/- 0.21, N = 3)
  3: 38.07 (SE +/- 0.41, N = 14)
  Notes: 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
lzbench 1.8 Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
  2: 1587 (SE +/- 3.38, N = 3)
  3: 1578 (SE +/- 2.08, N = 3)
  1: 1572 (SE +/- 11.17, N = 3)
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
  3: 1525 (SE +/- 0.67, N = 3)
  2: 1518 (SE +/- 3.33, N = 3)
  1: 1511 (SE +/- 13.35, N = 3)
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Timed Godot Game Engine Compilation 3.2.3 Time To Compile (Seconds, Fewer Is Better)
  1: 197.72 (SE +/- 0.05, N = 3)
  3: 197.97 (SE +/- 0.24, N = 3)
  2: 199.48 (SE +/- 2.16, N = 3)
Mobile Neural Network 1.1.1 Model: resnet-v2-50 (ms, Fewer Is Better)
  1: 40.64 (SE +/- 0.02, N = 3; MIN: 40.04 / MAX: 63.91)
  3: 40.77 (SE +/- 0.05, N = 3; MIN: 40.51 / MAX: 65)
  2: 41.00 (SE +/- 0.16, N = 3; MIN: 40.1 / MAX: 85.1)
  Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
rav1e 0.4 Speed: 5 (Frames Per Second, More Is Better)
  1: 0.859 (SE +/- 0.002, N = 3)
  2: 0.855 (SE +/- 0.002, N = 3)
  3: 0.852 (SE +/- 0.001, N = 3)
Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
  1: 2.957 (SE +/- 0.010, N = 3; MIN: 2.91 / MAX: 41.71)
  3: 2.970 (SE +/- 0.019, N = 3; MIN: 2.91 / MAX: 3.09)
  2: 2.979 (SE +/- 0.015, N = 3; MIN: 2.92 / MAX: 27.52)
  Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
  1: 248.83 (SE +/- 0.33, N = 3)
  3: 249.97 (SE +/- 0.29, N = 3)
  2: 250.66 (SE +/- 0.79, N = 3)
  Notes: 1. (CC) gcc options: -O2 -fvisibility=hidden
rav1e 0.4 Speed: 1 (Frames Per Second, More Is Better)
  1: 0.294 (SE +/- 0.001, N = 3)
  2: 0.293 (SE +/- 0.000, N = 3)
  3: 0.292 (SE +/- 0.001, N = 3)
WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better)
  2: 610.96 (SE +/- 1.98, N = 3)
  1: 613.03 (SE +/- 0.63, N = 3)
  3: 614.86 (SE +/- 1.00, N = 3)
  Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
Etcpak 0.7 Configuration: DXT1 (Mpx/s, More Is Better)
  3: 1155.93 (SE +/- 2.73, N = 3)
  2: 1153.83 (SE +/- 3.10, N = 3)
  1: 1148.78 (SE +/- 0.08, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Kripke 1.2.4 (Throughput FoM, More Is Better)
  2: 42986933 (SE +/- 124129.23, N = 3)
  1: 42778687 (SE +/- 25008.61, N = 3)
  3: 42728223 (SE +/- 191237.56, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fopenmp
dav1d 0.8.1 Video Input: Summer Nature 4K (FPS, More Is Better)
  3: 129.26 (SE +/- 0.42, N = 3; MIN: 114.89 / MAX: 147.6)
  1: 129.05 (SE +/- 0.87, N = 3; MIN: 120.69 / MAX: 149.13)
  2: 128.51 (SE +/- 1.06, N = 3; MIN: 120.24 / MAX: 147.95)
  Notes: 1. (CC) gcc options: -pthread
dav1d 0.8.1 Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
  1: 66.67 (SE +/- 0.04, N = 3; MIN: 42.38 / MAX: 172.79)
  2: 66.49 (SE +/- 0.10, N = 3; MIN: 42.26 / MAX: 173.63)
  3: 66.29 (SE +/- 0.03, N = 3; MIN: 42.31 / MAX: 167.63)
  Notes: 1. (CC) gcc options: -pthread
GnuPG 2.2.27 2.7GB Sample File Encryption (Seconds, Fewer Is Better)
  2: 74.19 (SE +/- 0.58, N = 3)
  3: 74.32 (SE +/- 0.72, N = 3)
  1: 74.61 (SE +/- 0.94, N = 3)
  Notes: 1. (CC) gcc options: -O2
lzbench 1.8 Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
  3: 553
  1: 553
  2: 550
  SE (not attributed to a result in this export): +/- 2.40, N = 3
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
rav1e 0.4 Speed: 6 (Frames Per Second, More Is Better)
  1: 1.130 (SE +/- 0.005, N = 3)
  3: 1.124 (SE +/- 0.000, N = 3)
  2: 1.124 (SE +/- 0.005, N = 3)
CP2K Molecular Dynamics 8.1 Fayalite-FIST Data (Seconds, Fewer Is Better)
  3: 1359.18
  2: 1361.66
  1: 1366.24
Cryptsetup AES-XTS 256b Encryption (MiB/s, More Is Better)
  2: 1882.9 (SE +/- 2.33, N = 3)
  3: 1881.7 (SE +/- 2.11, N = 3)
  1: 1873.5 (SE +/- 4.16, N = 3)
TNN 0.2.3 Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
  1: 309.70 (SE +/- 0.32, N = 3; MIN: 308.15 / MAX: 315.76)
  2: 309.84 (SE +/- 0.22, N = 3; MIN: 308.21 / MAX: 347.09)
  3: 311.22 (SE +/- 0.99, N = 3; MIN: 308.18 / MAX: 362.37)
  Notes: 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Cryptsetup Twofish-XTS 512b Decryption (MiB/s, More Is Better)
  2: 379.1 (SE +/- 0.00, N = 3)
  1: 378.9 (SE +/- 0.30, N = 2)
  3: 377.3 (SE +/- 0.26, N = 3)
FinanceBench 2016-07-25 Benchmark: Repo OpenMP (ms, Fewer Is Better)
  3: 62019.24 (SE +/- 36.55, N = 3)
  2: 62205.91 (SE +/- 181.93, N = 3)
  1: 62310.58 (SE +/- 273.76, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp
lzbench 1.8 Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
  3: 443
  1: 443
  2: 441
  SE (not attributed to a result in this export): +/- 1.20, N = 3
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
FinanceBench 2016-07-25 Benchmark: Bonds OpenMP (ms, Fewer Is Better)
  3: 105740.31 (SE +/- 1460.77, N = 3)
  2: 105922.05 (SE +/- 1547.01, N = 3)
  1: 106201.33 (SE +/- 1091.80, N = 8)
  Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp
ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1: 4352 (SE +/- 9.85, N = 3)
  3: 4347 (SE +/- 5.36, N = 3)
  2: 4334 (SE +/- 11.44, N = 3)
  Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Cryptsetup Twofish-XTS 256b Encryption (MiB/s, More Is Better)
  2: 378.7 (SE +/- 0.20, N = 3)
  3: 378.6 (SE +/- 0.12, N = 3)
  1: 377.3 (SE +/- 1.44, N = 3)
Cryptsetup Serpent-XTS 512b Encryption (MiB/s, More Is Better)
  1: 603.4 (SE +/- 0.64, N = 3)
  2: 602.6 (SE +/- 0.53, N = 3)
  3: 601.2 (SE +/- 1.25, N = 2)
Quantum ESPRESSO 6.7 Input: AUSURF112 (Seconds, Fewer Is Better)
  2: 2229.12 (SE +/- 3.87, N = 3)
  1: 2235.80 (SE +/- 4.81, N = 3)
  3: 2237.21 (SE +/- 6.57, N = 3)
  Notes: 1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
WebP2 Image Encode 20210126 Encode Settings: Default (Seconds, Fewer Is Better)
  2: 6.145 (SE +/- 0.013, N = 3)
  3: 6.146 (SE +/- 0.027, N = 3)
  1: 6.167 (SE +/- 0.040, N = 3)
  Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein (ns/day, More Is Better)
  3: 5.063 (SE +/- 0.020, N = 3)
  2: 5.063 (SE +/- 0.024, N = 3)
  1: 5.045 (SE +/- 0.023, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -pthread -lm
Cryptsetup Serpent-XTS 256b Encryption (MiB/s, More Is Better)
  2: 603.5 (SE +/- 0.18, N = 3)
  3: 602.5 (SE +/- 0.15, N = 3)
  1: 601.4 (SE +/- 1.45, N = 3)
WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better)
  1: 336.02 (SE +/- 0.26, N = 3)
  3: 336.05 (SE +/- 0.11, N = 3)
  2: 337.12 (SE +/- 0.20, N = 3)
  Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
Etcpak 0.7 Configuration: ETC1 (Mpx/s, More Is Better)
  2: 250.32 (SE +/- 0.05, N = 3)
  1: 250.28 (SE +/- 0.05, N = 3)
  3: 249.51 (SE +/- 0.88, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
lzbench 1.8 Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
  3: 644
  2: 642
  1: 642
  SE (not attributed to a result in this export): +/- 0.58, N = 3; +/- 0.67, N = 3
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Mobile Neural Network 1.1.1 Model: inception-v3 (ms, Fewer Is Better)
  3: 44.15 (SE +/- 0.07, N = 3; MIN: 43.81 / MAX: 67.23)
  1: 44.27 (SE +/- 0.08, N = 3; MIN: 43.53 / MAX: 95.59)
  2: 44.28 (SE +/- 0.11, N = 3; MIN: 43.76 / MAX: 67.86)
  Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenFOAM 8 Input: Motorbike 30M (Seconds, Fewer Is Better)
  3: 198.05 (SE +/- 0.32, N = 3)
  2: 198.14 (SE +/- 0.02, N = 3)
  1: 198.64 (SE +/- 0.07, N = 3)
  Notes: 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
ASKAP 1.0 Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
  3: 2295.31 (SE +/- 0.00, N = 3)
  1: 2295.31 (SE +/- 0.00, N = 3)
  2: 2288.77 (SE +/- 6.54, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
QuantLib 1.21 (MFLOPS, More Is Better)
  1: 1808.3 (SE +/- 4.59, N = 3)
  2: 1805.7 (SE +/- 12.13, N = 3)
  3: 1803.3 (SE +/- 14.29, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -march=native -rdynamic
Mobile Neural Network 1.1.1 Model: MobileNetV2_224 (ms, Fewer Is Better)
  2: 3.713 (SE +/- 0.015, N = 3; MIN: 3.57 / MAX: 27.94)
  3: 3.716 (SE +/- 0.009, N = 3; MIN: 3.59 / MAX: 17.46)
  1: 3.723 (SE +/- 0.017, N = 3; MIN: 3.59 / MAX: 28.29)
  Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
lzbench 1.8 Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
  1: 397
  3: 396
  2: 396
  SE (not attributed to a result in this export): +/- 0.33, N = 3
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
LULESH 2.0.3 (z/s, More Is Better)
  3: 5200.25 (SE +/- 2.62, N = 3)
  2: 5192.27 (SE +/- 8.45, N = 3)
  1: 5187.42 (SE +/- 11.89, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
Cryptsetup Serpent-XTS 256b Decryption (MiB/s, More Is Better)
  1: 584.7 (SE +/- 0.58, N = 3)
  2: 584.6 (SE +/- 0.09, N = 3)
  3: 583.3 (SE +/- 0.32, N = 3)
Cryptsetup Serpent-XTS 512b Decryption (MiB/s, More Is Better)
  2: 584.8 (SE +/- 0.06, N = 3)
  1: 584.4 (SE +/- 1.40, N = 2)
  3: 583.4 (SE +/- 0.87, N = 3)
TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
  3: 296.16 (SE +/- 0.36, N = 3; MIN: 295.4 / MAX: 297.66)
  2: 296.39 (SE +/- 0.23, N = 3; MIN: 295.86 / MAX: 297.88)
  1: 296.83 (SE +/- 0.25, N = 3; MIN: 296.31 / MAX: 298.07)
  Notes: 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1: 10700 (SE +/- 22.50, N = 3)
  2: 10696 (SE +/- 8.57, N = 3)
  3: 10676 (SE +/- 31.15, N = 3)
  Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
NAS Parallel Benchmarks 3.4 Test / Class: CG.C (Total Mop/s, More Is Better)
  2: 4729.76 (SE +/- 4.55, N = 3)
  1: 4720.22 (SE +/- 4.73, N = 3)
  3: 4719.69 (SE +/- 6.17, N = 3)
  Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
lzbench 1.8 Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
  2: 471
  3: 470
  1: 470
  SE (not attributed to a result in this export): +/- 0.33, N = 3
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Cryptsetup Twofish-XTS 256b Decryption (MiB/s, More Is Better)
  2: 379.5 (SE +/- 0.07, N = 3)
  1: 378.8 (SE +/- 0.10, N = 3)
  3: 378.7 (SE +/- 0.32, N = 3)
Cryptsetup AES-XTS 512b Decryption (MiB/s, More Is Better)
  2: 1547.7 (SE +/- 1.21, N = 3)
  1: 1545.6 (SE +/- 0.88, N = 3)
  3: 1544.5 (SE +/- 1.37, N = 3)
Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 (ms, Fewer Is Better)
  1: 7.258 (SE +/- 0.026, N = 3; MIN: 7.09 / MAX: 19.82)
  2: 7.259 (SE +/- 0.017, N = 3; MIN: 7.06 / MAX: 31.25)
  3: 7.273 (SE +/- 0.030, N = 3; MIN: 7.12 / MAX: 24.4)
  Notes: 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
lzbench 1.8 Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
  2: 1082 (SE +/- 0.67, N = 3)
  1: 1081 (SE +/- 0.67, N = 3)
  3: 1080 (SE +/- 0.67, N = 3)
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Etcpak 0.7 Configuration: ETC2 (Mpx/s, More Is Better)
  3: 149.11 (SE +/- 0.01, N = 3)
  1: 149.07 (SE +/- 0.02, N = 3)
  2: 148.84 (SE +/- 0.20, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Cryptsetup PBKDF2-whirlpool (Iterations Per Second, More Is Better)
  3: 579110 (SE +/- 213.00, N = 3)
  1: 578472 (SE +/- 562.60, N = 3)
  2: 578049 (SE +/- 972.16, N = 3)
Etcpak 0.7 Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
  2: 240.47 (SE +/- 0.04, N = 3)
  1: 240.45 (SE +/- 0.04, N = 3)
  3: 240.11 (SE +/- 0.41, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Cryptsetup AES-XTS 256b Decryption (MiB/s, More Is Better)
  1: 1893.1 (SE +/- 2.41, N = 3)
  2: 1892.6 (SE +/- 1.92, N = 3)
  3: 1890.4 (SE +/- 1.86, N = 3)
Cryptsetup AES-XTS 512b Encryption (MiB/s, More Is Better)
  2: 1557.8 (SE +/- 1.30, N = 3)
  1: 1556.0 (SE +/- 1.55, N = 3)
  3: 1555.6 (SE +/- 1.99, N = 3)
Cryptsetup PBKDF2-sha512 (Iterations Per Second, More Is Better)
  1: 1442334 (SE +/- 1145.46, N = 3)
  3: 1441674 (SE +/- 1322.67, N = 3)
  2: 1440358 (SE +/- 2284.58, N = 3)
Cryptsetup Twofish-XTS 512b Encryption (MiB/s, More Is Better)
  2: 378.7 (SE +/- 0.07, N = 3)
  1: 378.3 (SE +/- 0.12, N = 3)
  3: 378.2 (SE +/- 0.15, N = 3)
WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better)
  2: 16.28 (SE +/- 0.01, N = 3)
  1: 16.29 (SE +/- 0.01, N = 3)
  3: 16.30 (SE +/- 0.02, N = 3)
  Notes: 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
  2: 333615967 (SE +/- 689497.85, N = 3)
  3: 333464900 (SE +/- 530500.53, N = 3)
  1: 333264500 (SE +/- 454532.38, N = 3)
  Notes: 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
ASKAP 1.0 Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
  1: 249.38 (SE +/- 0.36, N = 3)
  3: 249.17 (SE +/- 0.21, N = 3)
  2: 249.17 (SE +/- 0.21, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
  2: 1415.15 (SE +/- 0.41, N = 3)
  3: 1414.53 (SE +/- 0.16, N = 3)
  1: 1414.22 (SE +/- 0.41, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
  2: 1881.67 (SE +/- 0.83, N = 3)
  3: 1881.12 (SE +/- 0.55, N = 3)
  1: 1880.56 (SE +/- 0.28, N = 3)
  Notes: 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Google SynthMark 20201109 Test: VoiceMark_100 (Voices, More Is Better)
  3: 582.31 (SE +/- 0.08, N = 3)
  1: 582.14 (SE +/- 0.16, N = 3)
  2: 582.01 (SE +/- 0.19, N = 3)
  Notes: 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  3: 60
  2: 60
  1: 60
  SE (not attributed to a result in this export): +/- 0.17, N = 3; +/- 0.17, N = 3
  Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  3: 582 (SE +/- 0.33, N = 3)
  2: 582 (SE +/- 0.29, N = 3)
  1: 582 (SE +/- 0.50, N = 3)
  Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  3: 328 (SE +/- 0.17, N = 3)
  2: 328 (SE +/- 0.00, N = 3)
  1: 328 (SE +/- 0.29, N = 3)
  Notes: 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
lzbench 1.8 Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
  3: 201
  2: 201
  1: 201
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
  3: 164
  2: 164
  1: 164
  SE (not attributed to a result in this export): +/- 0.33, N = 3; +/- 0.33, N = 3
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 Test: Crush 0 - Process: Compression (MB/s, More Is Better)
  3: 87
  2: 87
  1: 87
  SE (not attributed to a result in this export): +/- 1.00, N = 3
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
  3: 73
  2: 73
  1: 73
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  3: 105
  2: 105
  1: 105
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  3: 37
  2: 37
  1: 37
  Notes: 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Warsow 2.5 Beta Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
  3: 21.3 (SE +/- 0.00, N = 3)
  2: 21.3 (SE +/- 0.03, N = 3)
  1: 21.3 (SE +/- 0.00, N = 3)
NAS Parallel Benchmarks 3.4 Test / Class: MG.C (Total Mop/s, More Is Better)
  3: 16170.45 (SE +/- 36.14, N = 3)
  1: 16156.63 (SE +/- 35.08, N = 3)
  2: 15664.08 (SE +/- 249.22, N = 15)
  Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
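Each result in this file is reported as a mean with "SE +/-" and a trial count N. Reading SE as the standard error of the mean, the spread of the individual trials can be estimated as SE times the square root of N. The Python sketch below applies that to the MG.C figures above; the function names are illustrative and the interpretation of SE is an assumption about how the export computes it.

```python
import math

def std_dev_from_se(se, n):
    """Sample standard deviation implied by a standard error of the mean."""
    return se * math.sqrt(n)

def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return se / mean * 100.0

# (mean, SE, N) copied from the NAS Parallel Benchmarks MG.C result above.
mg_c = {
    "3": (16170.45, 36.14, 3),
    "1": (16156.63, 35.08, 3),
    "2": (15664.08, 249.22, 15),
}

for label, (mean, se, n) in mg_c.items():
    print(label,
          round(std_dev_from_se(se, n), 2),
          round(relative_se_percent(mean, se), 2))
```

Comparing the implied standard deviations rather than the raw SE values avoids understating the noise of a result that was averaged over more trials.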
Phoronix Test Suite v10.8.4