Intel Xeon Gold 6226R testing with a Supermicro X11SPL-F v1.02 (3.1 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2101245-HA-XEONGOLDW20
HTML result view exported from: https://openbenchmarking.org/result/2101245-HA-XEONGOLDW20&rdt&grt .
Xeon Gold Weekend: system details (one configuration listed for results 1, 2 and 3)
  Processor: Intel Xeon Gold 6226R @ 3.90GHz (16 Cores / 32 Threads)
  Motherboard: Supermicro X11SPL-F v1.02 (3.1 BIOS)
  Chipset: Intel Sky Lake-E DMI3 Registers
  Memory: 188GB
  Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
  Graphics: llvmpipe
  Monitor: VE228
  Network: 2 x Intel I210
  OS: Ubuntu 20.04
  Kernel: 5.9.0-050900rc6daily20200921-generic (x86_64) 20200920
  Desktop: GNOME Shell 3.36.4
  Display Server: X Server 1.20.8
  Display Driver: modesetting 1.20.8
  OpenGL: 3.3 Mesa 20.0.8 (LLVM 10.0.0 256 bits)
  Compiler: GCC 9.3.0
  File-System: ext4
  Screen Resolution: 1920x1080
Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x5002f01
Python Details - Python 3.8.2
Security Details
  itlb_multihit: KVM: Mitigation of VMX disabled
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
  srbds: Not affected
  tsx_async_abort: Mitigation of TSX disabled
Xeon Gold Weekend: results overview
Test                                            1               2               3
cloverleaf: Lagrangian-Eulerian Hydrodynamics   45.83           45.73           45.73
cpuminer-opt: Magi                              376.17          381.78          379.08
cpuminer-opt: x25x                              484.56          471.95          482.66
cpuminer-opt: Deepcoin                          13557           13517           13000
cpuminer-opt: Ringcoin                          2826.13         2624.28         2756.73
cpuminer-opt: Blake-2 S                         682701          648007          674335
cpuminer-opt: Garlicoin                         6345.86         6289.09         6287.71
cpuminer-opt: Skeincoin                         154083          152283          142930
cpuminer-opt: Myriad-Groestl                    13840           13707           13955
cpuminer-opt: LBC, LBRY Credits                 94847           94405           95738
cpuminer-opt: Quad SHA-256, Pyrite              109393          110870          102307
cpuminer-opt: Triple SHA-256, Onecoin           205558          189637          195802
etcpak: DXT1                                    1256.172        1256.480        1255.518
etcpak: ETC1                                    306.884         303.144         308.194
etcpak: ETC2                                    169.816         170.216         170.510
etcpak: ETC1 + Dithering                        283.452         285.520         287.354
financebench: Repo OpenMP                       52319.098958    52456.917969    52960.246094
financebench: Bonds OpenMP                      92868.065104    94292.097656    92908.937500
gcrypt:                                         230.512         231.098         230.600
gnupg: 2.7GB Sample File Encryption             77.724          77.442          77.430
synthmark: VoiceMark_100                        562.513         563.381         561.593
lzbench: XZ 0 - Compression                     39              39              39
lzbench: XZ 0 - Decompression                   104             104             104
lzbench: Zstd 1 - Compression                   461             460             460
lzbench: Zstd 1 - Decompression                 1608            1607            1611
lzbench: Zstd 8 - Compression                   79              78              79
lzbench: Zstd 8 - Decompression                 1603            1604            1604
lzbench: Crush 0 - Compression                  99              98              100
lzbench: Crush 0 - Decompression                431             431             431
lzbench: Brotli 0 - Compression                 409             407             410
lzbench: Brotli 0 - Decompression               566             566             566
lzbench: Brotli 2 - Compression                 176             176             176
lzbench: Brotli 2 - Decompression               652             652             652
lzbench: Libdeflate 1 - Compression             199             199             199
lzbench: Libdeflate 1 - Decompression           1083            1083            1083
npb: BT.C                                       46844.41        47154.78        47008.86
npb: CG.C                                       11659.45        11687.51        12426.57
npb: EP.C                                       1508.72         1505.76         1566.00
npb: EP.D                                       1850.52         1788.49         1938.05
npb: FT.C                                       26314.36        25803.94        25971.90
npb: IS.D                                       1418.74         1442.06         1459.23
npb: LU.C                                       48539.47        48085.57        48836.07
npb: MG.C                                       26730.05        26986.50        27670.58
npb: SP.B                                       20127.16        20173.98        20498.64
onnx: yolov4 - OpenMP CPU                       509             511             506
onnx: bertsquad-10 - OpenMP CPU                 635             636             620
onnx: fcn-resnet101-11 - OpenMP CPU             130             131             131
onnx: shufflenet-v2-10 - OpenMP CPU             7697            7688            7668
onnx: super-resolution-10 - OpenMP CPU          5788            5947            6176
qmcpack: simple-H2O                             55.043          55.994          56.268
quantlib:                                       2209.4          2200.3          2202.0
redis: LPOP                                     2313006.91      1599875.00      1600325.79
redis: SADD                                     1919267.04      1906942.77      1915277.54
redis: LPUSH                                    1498024.71      1528246.50      1500356.00
redis: GET                                      2238423.92      2050163.46      2148268.17
redis: SET                                      1701384.46      1765064.67      1741171.25
relion: Basic - CPU                             1249.329        1245.316        1238.754
build-godot: Time To Compile                    104.804         105.051         105.454
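To see which tests differ most between the three results, the overview values can be reduced to a relative spread per test. The following is a minimal sketch, not part of the Phoronix Test Suite: the function names `relative_spread` and `rank_by_spread` are hypothetical, and only four rows copied from the overview are included for illustration.

```python
from statistics import mean

# Values copied from the results overview (results 1, 2, 3).
# Only a handful of rows are included here for illustration.
results = {
    "cloverleaf: Lagrangian-Eulerian Hydrodynamics": (45.83, 45.73, 45.73),
    "redis: LPOP": (2313006.91, 1599875.00, 1600325.79),
    "redis: GET": (2238423.92, 2050163.46, 2148268.17),
    "npb: CG.C": (11659.45, 11687.51, 12426.57),
}


def relative_spread(values):
    """Return (max - min) / mean of the values, as a percentage."""
    return 100.0 * (max(values) - min(values)) / mean(values)


def rank_by_spread(table):
    """Return (test name, spread %) pairs, largest spread first."""
    pairs = ((name, relative_spread(vals)) for name, vals in table.items())
    return sorted(pairs, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for name, spread in rank_by_spread(results):
        print(f"{name}: {spread:.1f}%")
```

The spread is unit-free, so it works the same way for "More Is Better" and "Fewer Is Better" tests; it says nothing about which result is the better one.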
CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
  Run 1: 45.83 (SE +/- 0.03, N = 3)
  Run 2: 45.73 (SE +/- 0.07, N = 3)
  Run 3: 45.73 (SE +/- 0.07, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
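Each graph in this file reports a figure of the form "SE +/- x, N = n". Under the conventional definition, the standard error of the mean is the sample standard deviation divided by the square root of the number of trials. The sketch below assumes that definition; whether the Phoronix Test Suite uses exactly this estimator is an assumption, and the trial times are hypothetical because the export does not include per-trial values.

```python
import math
from statistics import stdev


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("at least two samples are needed")
    return stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical trial times in seconds; not taken from this result file.
    trial_seconds = [45.80, 45.82, 45.88]
    print(f"SE +/- {standard_error(trial_seconds):.2f}, N = {len(trial_seconds)}")
```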
Cpuminer-Opt 3.15.5, Algorithm: Magi (kH/s, More Is Better)
  Run 1: 376.17 (SE +/- 0.31, N = 3)
  Run 2: 381.78 (SE +/- 5.41, N = 13)
  Run 3: 379.08 (SE +/- 2.96, N = 13)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: x25x (kH/s, More Is Better)
  Run 1: 484.56 (SE +/- 2.00, N = 3)
  Run 2: 471.95 (SE +/- 3.14, N = 3)
  Run 3: 482.66 (SE +/- 5.44, N = 6)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: Deepcoin (kH/s, More Is Better)
  Run 1: 13557 (SE +/- 234.12, N = 3)
  Run 2: 13517 (SE +/- 119.51, N = 15)
  Run 3: 13000 (SE +/- 145.72, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: Ringcoin (kH/s, More Is Better)
  Run 1: 2826.13 (SE +/- 21.53, N = 3)
  Run 2: 2624.28 (SE +/- 142.01, N = 12)
  Run 3: 2756.73 (SE +/- 37.05, N = 12)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: Blake-2 S (kH/s, More Is Better)
  Run 1: 682701 (SE +/- 7963.13, N = 15)
  Run 2: 648007 (SE +/- 3967.71, N = 3)
  Run 3: 674335 (SE +/- 10068.01, N = 4)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: Garlicoin (kH/s, More Is Better)
  Run 1: 6345.86 (SE +/- 72.91, N = 3)
  Run 2: 6289.09 (SE +/- 10.94, N = 3)
  Run 3: 6287.71 (SE +/- 18.75, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: Skeincoin (kH/s, More Is Better)
  Run 1: 154083 (SE +/- 136.91, N = 3)
  Run 2: 152283 (SE +/- 1577.52, N = 3)
  Run 3: 142930 (SE +/- 6365.04, N = 12)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: Myriad-Groestl (kH/s, More Is Better)
  Run 1: 13840 (SE +/- 85.44, N = 3)
  Run 2: 13707 (SE +/- 102.03, N = 3)
  Run 3: 13955 (SE +/- 114.28, N = 15)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: LBC, LBRY Credits (kH/s, More Is Better)
  Run 1: 94847 (SE +/- 567.72, N = 3)
  Run 2: 94405 (SE +/- 2187.49, N = 13)
  Run 3: 95738 (SE +/- 1243.89, N = 5)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: Quad SHA-256, Pyrite (kH/s, More Is Better)
  Run 1: 109393 (SE +/- 7217.76, N = 15)
  Run 2: 110870 (SE +/- 6523.24, N = 15)
  Run 3: 102307 (SE +/- 5833.32, N = 15)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5, Algorithm: Triple SHA-256, Onecoin (kH/s, More Is Better)
  Run 1: 205558 (SE +/- 6250.18, N = 12)
  Run 2: 189637 (SE +/- 6846.95, N = 15)
  Run 3: 195802 (SE +/- 5198.06, N = 15)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Etcpak 0.7, Configuration: DXT1 (Mpx/s, More Is Better)
  Run 1: 1256.17 (SE +/- 0.61, N = 3)
  Run 2: 1256.48 (SE +/- 0.88, N = 3)
  Run 3: 1255.52 (SE +/- 1.46, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC1 (Mpx/s, More Is Better)
  Run 1: 306.88 (SE +/- 1.36, N = 3)
  Run 2: 303.14 (SE +/- 0.27, N = 3)
  Run 3: 308.19 (SE +/- 0.16, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC2 (Mpx/s, More Is Better)
  Run 1: 169.82 (SE +/- 0.23, N = 3)
  Run 2: 170.22 (SE +/- 0.27, N = 3)
  Run 3: 170.51 (SE +/- 0.23, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7, Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
  Run 1: 283.45 (SE +/- 2.02, N = 3)
  Run 2: 285.52 (SE +/- 1.94, N = 3)
  Run 3: 287.35 (SE +/- 0.07, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
FinanceBench 2016-07-25, Benchmark: Repo OpenMP (ms, Fewer Is Better)
  Run 1: 52319.10 (SE +/- 22.19, N = 3)
  Run 2: 52456.92 (SE +/- 147.14, N = 3)
  Run 3: 52960.25 (SE +/- 632.12, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench 2016-07-25, Benchmark: Bonds OpenMP (ms, Fewer Is Better)
  Run 1: 92868.07 (SE +/- 2.09, N = 3)
  Run 2: 94292.10 (SE +/- 1357.54, N = 4)
  Run 3: 92908.94 (SE +/- 14.55, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
  Run 1: 230.51 (SE +/- 0.27, N = 3)
  Run 2: 231.10 (SE +/- 0.26, N = 3)
  Run 3: 230.60 (SE +/- 0.39, N = 3)
  1. (CC) gcc options: -O2 -fvisibility=hidden
GnuPG 2.2.27, 2.7GB Sample File Encryption (Seconds, Fewer Is Better)
  Run 1: 77.72 (SE +/- 0.27, N = 3)
  Run 2: 77.44 (SE +/- 0.03, N = 3)
  Run 3: 77.43 (SE +/- 0.01, N = 3)
  1. (CC) gcc options: -O2
Google SynthMark 20201109, Test: VoiceMark_100 (Voices, More Is Better)
  Run 1: 562.51 (SE +/- 1.79, N = 3)
  Run 2: 563.38 (SE +/- 0.94, N = 3)
  Run 3: 561.59 (SE +/- 1.34, N = 3)
  1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
lzbench 1.8, Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  Run 1: 39
  Run 2: 39
  Run 3: 39
  SE values in the export (fewer than three are listed, so they are not assigned to runs): SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  Run 1: 104
  Run 2: 104
  Run 3: 104
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
  Run 1: 461
  Run 2: 460
  Run 3: 460
  SE values in the export (fewer than three are listed, so they are not assigned to runs): SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
  Run 1: 1608
  Run 2: 1607
  Run 3: 1611
  SE values in the export (fewer than three are listed, so they are not assigned to runs): SE +/- 0.67, N = 3; SE +/- 1.45, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
  Run 1: 79
  Run 2: 78
  Run 3: 79
  SE values in the export (fewer than three are listed, so they are not assigned to runs): SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
  Run 1: 1603 (SE +/- 1.20, N = 3)
  Run 2: 1604 (SE +/- 1.20, N = 3)
  Run 3: 1604 (SE +/- 3.21, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Crush 0 - Process: Compression (MB/s, More Is Better)
  Run 1: 99
  Run 2: 98
  Run 3: 100
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
  Run 1: 431
  Run 2: 431
  Run 3: 431
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
  Run 1: 409
  Run 2: 407
  Run 3: 410
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
  Run 1: 566 (SE +/- 0.67, N = 3)
  Run 2: 566 (SE +/- 0.33, N = 3)
  Run 3: 566 (SE +/- 0.67, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
  Run 1: 176
  Run 2: 176
  Run 3: 176
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
  Run 1: 652
  Run 2: 652
  Run 3: 652
  SE values in the export (fewer than three are listed, so they are not assigned to runs): SE +/- 0.67, N = 3; SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
  Run 1: 199
  Run 2: 199
  Run 3: 199
  SE values in the export (fewer than three are listed, so they are not assigned to runs): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8, Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
  Run 1: 1083
  Run 2: 1083
  Run 3: 1083
  SE values in the export (fewer than three are listed, so they are not assigned to runs): SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
  Run 1: 46844.41 (SE +/- 637.48, N = 3)
  Run 2: 47154.78 (SE +/- 632.12, N = 3)
  Run 3: 47008.86 (SE +/- 281.18, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
  Run 1: 11659.45 (SE +/- 117.83, N = 3)
  Run 2: 11687.51 (SE +/- 158.32, N = 15)
  Run 3: 12426.57 (SE +/- 103.84, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
  Run 1: 1508.72 (SE +/- 34.67, N = 15)
  Run 2: 1505.76 (SE +/- 35.93, N = 15)
  Run 3: 1566.00 (SE +/- 40.58, N = 15)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
  Run 1: 1850.52 (SE +/- 21.33, N = 3)
  Run 2: 1788.49 (SE +/- 61.75, N = 12)
  Run 3: 1938.05 (SE +/- 18.96, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
  Run 1: 26314.36 (SE +/- 321.22, N = 3)
  Run 2: 25803.94 (SE +/- 406.97, N = 3)
  Run 3: 25971.90 (SE +/- 168.03, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
  Run 1: 1418.74 (SE +/- 12.19, N = 12)
  Run 2: 1442.06 (SE +/- 15.29, N = 15)
  Run 3: 1459.23 (SE +/- 18.36, N = 5)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
  Run 1: 48539.47 (SE +/- 494.88, N = 3)
  Run 2: 48085.57 (SE +/- 260.86, N = 3)
  Run 3: 48836.07 (SE +/- 520.25, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
  Run 1: 26730.05 (SE +/- 333.51, N = 5)
  Run 2: 26986.50 (SE +/- 304.73, N = 3)
  Run 3: 27670.58 (SE +/- 464.53, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better)
  Run 1: 20127.16 (SE +/- 60.56, N = 3)
  Run 2: 20173.98 (SE +/- 346.26, N = 3)
  Run 3: 20498.64 (SE +/- 163.12, N = 14)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
ONNX Runtime 1.6, Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 509 (SE +/- 0.83, N = 3)
  Run 2: 511 (SE +/- 0.44, N = 3)
  Run 3: 506 (SE +/- 0.29, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 635 (SE +/- 5.81, N = 10)
  Run 2: 636 (SE +/- 6.91, N = 12)
  Run 3: 620 (SE +/- 7.18, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 130 (SE +/- 0.44, N = 3)
  Run 2: 131 (SE +/- 0.29, N = 3)
  Run 3: 131 (SE +/- 0.60, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 7697 (SE +/- 17.81, N = 3)
  Run 2: 7688 (SE +/- 10.41, N = 3)
  Run 3: 7668 (SE +/- 19.72, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6, Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 5788 (SE +/- 140.78, N = 12)
  Run 2: 5947 (SE +/- 116.26, N = 12)
  Run 3: 6176 (SE +/- 132.64, N = 12)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
QMCPACK 3.10, Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
  Run 1: 55.04 (SE +/- 0.59, N = 15)
  Run 2: 55.99 (SE +/- 0.60, N = 15)
  Run 3: 56.27 (SE +/- 0.62, N = 3)
  1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
QuantLib 1.21 (MFLOPS, More Is Better)
  Run 1: 2209.4 (SE +/- 2.69, N = 3)
  Run 2: 2200.3 (SE +/- 10.40, N = 3)
  Run 3: 2202.0 (SE +/- 2.99, N = 3)
  1. (CXX) g++ options: -O3 -march=native -rdynamic
Redis 6.0.9, Test: LPOP (Requests Per Second, More Is Better)
  Run 1: 2313006.91 (SE +/- 23670.35, N = 8)
  Run 2: 1599875.00 (SE +/- 4634.81, N = 3)
  Run 3: 1600325.79 (SE +/- 5950.39, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
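The Redis LPOP figures stand out because the first result is far from the other two. One rough way to judge whether a gap is larger than the reported noise is to express it in units of the combined standard error. The sketch below is an illustration only, not a formal significance test: it assumes the three SE figures are listed in the same order as the three results, and the helper name `difference_in_se_units` is hypothetical.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined SE.

    The combined SE is sqrt(se_a**2 + se_b**2), which treats the two
    results as independent measurements.
    """
    combined_se = math.hypot(se_a, se_b)
    return abs(mean_a - mean_b) / combined_se


# (requests per second, reported SE) for Redis LPOP, results 1 to 3.
# Assumption: SE figures appear in the same order as the results.
run1 = (2313006.91, 23670.35)
run2 = (1599875.00, 4634.81)
run3 = (1600325.79, 5950.39)

if __name__ == "__main__":
    print(f"1 vs 2: {difference_in_se_units(*run1, *run2):.1f} combined SEs")
    print(f"2 vs 3: {difference_in_se_units(*run2, *run3):.2f} combined SEs")
```

A gap of many combined standard errors suggests something other than trial-to-trial noise, while a gap well below one is indistinguishable from it. The trial counts differ between results (N = 8 versus N = 3), so the ratio is only a coarse indicator.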
Redis 6.0.9, Test: SADD (Requests Per Second, More Is Better)
  Run 1: 1919267.04 (SE +/- 15053.52, N = 3)
  Run 2: 1906942.77 (SE +/- 17090.43, N = 11)
  Run 3: 1915277.54 (SE +/- 22589.25, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9, Test: LPUSH (Requests Per Second, More Is Better)
  Run 1: 1498024.71 (SE +/- 24049.36, N = 3)
  Run 2: 1528246.50 (SE +/- 4224.49, N = 3)
  Run 3: 1500356.00 (SE +/- 22357.86, N = 4)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9, Test: GET (Requests Per Second, More Is Better)
  Run 1: 2238423.92 (SE +/- 16679.19, N = 3)
  Run 2: 2050163.46 (SE +/- 16498.29, N = 3)
  Run 3: 2148268.17 (SE +/- 14006.17, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9, Test: SET (Requests Per Second, More Is Better)
  Run 1: 1701384.46 (SE +/- 4654.40, N = 3)
  Run 2: 1765064.67 (SE +/- 14348.21, N = 3)
  Run 3: 1741171.25 (SE +/- 13659.73, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
RELION 3.1.1, Test: Basic - Device: CPU (Seconds, Fewer Is Better)
  Run 1: 1249.33 (SE +/- 4.80, N = 3)
  Run 2: 1245.32 (SE +/- 3.28, N = 3)
  Run 3: 1238.75 (SE +/- 0.87, N = 3)
  1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
Timed Godot Game Engine Compilation 3.2.3, Time To Compile (Seconds, Fewer Is Better)
  Run 1: 104.80 (SE +/- 0.26, N = 3)
  Run 2: 105.05 (SE +/- 0.13, N = 3)
  Run 3: 105.45 (SE +/- 0.25, N = 3)
Phoronix Test Suite v10.8.4