390)X Sunday

AMD Ryzen 9 3900X 12-Core testing with an ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) and MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101259-HA-390XSUNDA01&grt&sro&export=txt .
390)X Sunday (runs 1, 2, 3)

  Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)
  Motherboard: ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 16GB
  Disk: Samsung SSD 970 EVO Plus 250GB
  Graphics: MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1366/2000MHz)
  Audio: AMD Ellesmere HDMI Audio
  Monitor: G237HL
  Network: Realtek RTL8111/8168/8411 + Intel-AC 9260
  OS: Ubuntu 20.04
  Kernel: 5.11.0-rc1-phx (x86_64) 20201228
  Desktop: GNOME Shell 3.36.4
  Display Server: X Server 1.20.8
  Display Driver: modesetting 1.20.8
  OpenGL: 4.6 Mesa 21.0.0-devel (git-28a202f 2020-12-31 focal-oibaf-ppa) (LLVM 11.0.0)
  Vulkan: 1.2.145
  Compiler: GCC 9.3.0
  File-System: ext4
  Screen Resolution: 1920x1080

Kernel Details
  - Transparent Huge Pages: madvise

Compiler Details
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
  - CPU Microcode: 0x8701021

Python Details
  - Python 3.8.5

Security Details
  - itlb_multihit: Not affected
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  - spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  - spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling
  - srbds: Not affected
  - tsx_async_abort: Not affected
390)X Sunday

Test                                    1              2              3
askap: tConvolve MT - Gridding          768.200        767.742        765.379
askap: tConvolve MT - Degridding        1329.07        1323.66        1324.94
askap: tConvolve MPI - Degridding       7051.52        6987.35        6999.98
askap: tConvolve MPI - Gridding         7428.05        7428.05        7359.66
askap: tConvolve OpenMP - Gridding      1482.32        1465.64        2002.23
askap: tConvolve OpenMP - Degridding    1603.99        1597.55        1870.67
askap: Hogbom Clean OpenMP              168.445        169.015        182.815
cpuminer-opt: Magi                      621.07         613.68         616.75
cpuminer-opt: x25x                      402.85         394.71         397.81
cpuminer-opt: Deepcoin                  11695          12036          11727
cpuminer-opt: Ringcoin                  2631.70        2640.27        2636.56
cpuminer-opt: Blake-2 S                 412783         411815         418870
cpuminer-opt: Garlicoin                 2660.94        2664.52        2666.98
cpuminer-opt: Skeincoin                 83088          86862          90202
cpuminer-opt: Myriad-Groestl            18499          18100          18590
cpuminer-opt: LBC, LBRY Credits         35394          35025          34650
cpuminer-opt: Quad SHA-256, Pyrite      84847          84267          87686
cpuminer-opt: Triple SHA-256, Onecoin   102915         83551          104747
etcpak: DXT1                            1375.767       1380.743       1373.152
etcpak: ETC1                            309.286        303.039        310.861
etcpak: ETC2                            180.410        179.385        181.474
etcpak: ETC1 + Dithering                294.984        294.797        296.253
financebench: Repo OpenMP               42031.884115   42185.287760   42124.738281
financebench: Bonds OpenMP              74792.953125   74056.320313   74105.049479
gcrypt:                                 203.540        199.437        196.189
gnupg: 2.7GB Sample File Encryption     64.996         63.430         65.009
synthmark: VoiceMark_100                670.381        683.000        679.186
lzbench: XZ 0 - Compression             41             41             43
lzbench: XZ 0 - Decompression           128            129            132
lzbench: Zstd 1 - Compression           550            567            583
lzbench: Zstd 1 - Decompression         1684           1732           1775
lzbench: Zstd 8 - Compression           106            108            111
lzbench: Zstd 8 - Decompression         1877           1941           1992
lzbench: Crush 0 - Compression          119            117            115
lzbench: Crush 0 - Decompression        496            491            522
lzbench: Brotli 0 - Compression         533            545            563
lzbench: Brotli 0 - Decompression       621            627            652
lzbench: Brotli 2 - Compression         216            215            226
lzbench: Brotli 2 - Decompression       739            735            770
lzbench: Libdeflate 1 - Compression     265            262            266
lzbench: Libdeflate 1 - Decompression   1243           1222           1246
npb: EP.C                               1094.42        1088.18        1050.83
npb: EP.D                               1076.16        1075.65        1074.95
npb: LU.C                               24023.03       24293.80       24339.47
onnx: yolov4 - OpenMP CPU               312            312            309
onnx: bertsquad-10 - OpenMP CPU         479            493            492
onnx: fcn-resnet101-11 - OpenMP CPU     73             73             73
onnx: shufflenet-v2-10 - OpenMP CPU     13173          13065          13067
onnx: super-resolution-10 - OpenMP CPU  5036           5068           5120
quantlib:                               2632.0         2534.8         2647.4
redis: LPOP                             2326110.40     1469768.92     1481787.54
redis: SADD                             1908271.40     1863996.21     1944071.17
redis: LPUSH                            1449115.08     1450086.71     1431668.00
redis: GET                              2227494.92     2041514.48     2089924.33
redis: SET                              1663979.09     1682444.25     1650762.75
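Run-to-run differences in the summary are easier to judge as percentages relative to run 1. The following is a minimal Python sketch and not part of the Phoronix Test Suite export: the helper names `percent_change` and `summarize` are illustrative assumptions, and the three rows are simply copied from the summary above.

```python
def percent_change(baseline, value):
    """Return the change of value relative to baseline, in percent."""
    return (value - baseline) / baseline * 100.0


# Values copied from the summary above, as (run 1, run 2, run 3).
results = {
    "askap: tConvolve OpenMP - Gridding": (1482.32, 1465.64, 2002.23),
    "cpuminer-opt: Triple SHA-256, Onecoin": (102915, 83551, 104747),
    "redis: LPOP": (2326110.40, 1469768.92, 1481787.54),
}


def summarize(rows):
    """Build one text line per test with runs 2 and 3 relative to run 1."""
    lines = []
    for name, (run1, run2, run3) in rows.items():
        lines.append(
            f"{name}: run 2 {percent_change(run1, run2):+.1f}%, "
            f"run 3 {percent_change(run1, run3):+.1f}%"
        )
    return lines


for line in summarize(results):
    print(line)
```

All three rows are "More Is Better" tests, so a negative percentage means the run was slower than run 1. For "Fewer Is Better" tests such as FinanceBench or GnuPG, the sign reads the other way.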
ASKAP 1.0
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
  1: 768.20 (SE +/- 0.25, N = 3)
  2: 767.74 (SE +/- 1.15, N = 3)
  3: 765.38 (SE +/- 0.54, N = 3)

Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
  1: 1329.07 (SE +/- 0.70, N = 3)
  2: 1323.66 (SE +/- 2.24, N = 3)
  3: 1324.94 (SE +/- 1.04, N = 3)

Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
  1: 7051.52 (SE +/- 55.12, N = 15)
  2: 6987.35 (SE +/- 41.10, N = 3)
  3: 6999.98 (SE +/- 49.12, N = 15)

Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
  1: 7428.05 (SE +/- 34.51, N = 12)
  2: 7428.05 (SE +/- 80.93, N = 3)
  3: 7359.66 (SE +/- 61.67, N = 15)

Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
  1: 1482.32 (SE +/- 16.78, N = 3)
  2: 1465.64 (SE +/- 2.69, N = 3)
  3: 2002.23 (SE +/- 17.38, N = 3)

Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
  1: 1603.99 (SE +/- 5.58, N = 3)
  2: 1597.55 (SE +/- 3.20, N = 3)
  3: 1870.67 (SE +/- 4.37, N = 3)

Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
  1: 168.45 (SE +/- 0.19, N = 3)
  2: 169.02 (SE +/- 0.25, N = 3)
  3: 182.82 (SE +/- 0.00, N = 3)
Cpuminer-Opt 3.15.5
(CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Algorithm: Magi (kH/s, More Is Better)
  1: 621.07 (SE +/- 0.56, N = 3)
  2: 613.68 (SE +/- 6.06, N = 3)
  3: 616.75 (SE +/- 0.42, N = 3)

Algorithm: x25x (kH/s, More Is Better)
  1: 402.85 (SE +/- 5.12, N = 15)
  2: 394.71 (SE +/- 0.27, N = 3)
  3: 397.81 (SE +/- 0.71, N = 3)

Algorithm: Deepcoin (kH/s, More Is Better)
  1: 11695 (SE +/- 178.11, N = 15)
  2: 12036 (SE +/- 330.28, N = 12)
  3: 11727 (SE +/- 213.76, N = 15)

Algorithm: Ringcoin (kH/s, More Is Better)
  1: 2631.70 (SE +/- 3.47, N = 3)
  2: 2640.27 (SE +/- 2.28, N = 3)
  3: 2636.56 (SE +/- 3.89, N = 3)

Algorithm: Blake-2 S (kH/s, More Is Better)
  1: 412783 (SE +/- 3005.78, N = 3)
  2: 411815 (SE +/- 4661.80, N = 15)
  3: 418870 (SE +/- 965.97, N = 3)

Algorithm: Garlicoin (kH/s, More Is Better)
  1: 2660.94 (SE +/- 3.55, N = 3)
  2: 2664.52 (SE +/- 4.99, N = 3)
  3: 2666.98 (SE +/- 1.57, N = 3)

Algorithm: Skeincoin (kH/s, More Is Better)
  1: 83088 (SE +/- 2557.61, N = 12)
  2: 86862 (SE +/- 1773.25, N = 12)
  3: 90202 (SE +/- 622.69, N = 13)

Algorithm: Myriad-Groestl (kH/s, More Is Better)
  1: 18499 (SE +/- 209.11, N = 15)
  2: 18100 (SE +/- 178.96, N = 6)
  3: 18590 (SE +/- 183.39, N = 3)

Algorithm: LBC, LBRY Credits (kH/s, More Is Better)
  1: 35394 (SE +/- 550.22, N = 12)
  2: 35025 (SE +/- 255.12, N = 11)
  3: 34650 (SE +/- 66.58, N = 3)

Algorithm: Quad SHA-256, Pyrite (kH/s, More Is Better)
  1: 84847 (SE +/- 3663.30, N = 14)
  2: 84267 (SE +/- 723.54, N = 3)
  3: 87686 (SE +/- 925.13, N = 5)

Algorithm: Triple SHA-256, Onecoin (kH/s, More Is Better)
  1: 102915 (SE +/- 1232.09, N = 4)
  2: 83551 (SE +/- 7425.46, N = 15)
  3: 104747 (SE +/- 626.11, N = 3)
Etcpak 0.7
(CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Configuration: DXT1 (Mpx/s, More Is Better)
  1: 1375.77 (SE +/- 3.14, N = 3)
  2: 1380.74 (SE +/- 1.64, N = 3)
  3: 1373.15 (SE +/- 3.99, N = 3)

Configuration: ETC1 (Mpx/s, More Is Better)
  1: 309.29 (SE +/- 1.30, N = 3)
  2: 303.04 (SE +/- 1.79, N = 3)
  3: 310.86 (SE +/- 2.84, N = 3)

Configuration: ETC2 (Mpx/s, More Is Better)
  1: 180.41 (SE +/- 1.86, N = 3)
  2: 179.39 (SE +/- 0.99, N = 3)
  3: 181.47 (SE +/- 1.56, N = 8)

Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
  1: 294.98 (SE +/- 0.64, N = 3)
  2: 294.80 (SE +/- 0.08, N = 3)
  3: 296.25 (SE +/- 1.22, N = 3)
FinanceBench 2016-07-25
(CXX) g++ options: -O3 -march=native -fopenmp

Benchmark: Repo OpenMP (ms, Fewer Is Better)
  1: 42031.88 (SE +/- 112.48, N = 3)
  2: 42185.29 (SE +/- 59.24, N = 3)
  3: 42124.74 (SE +/- 94.06, N = 3)

Benchmark: Bonds OpenMP (ms, Fewer Is Better)
  1: 74792.95 (SE +/- 745.68, N = 6)
  2: 74056.32 (SE +/- 238.59, N = 3)
  3: 74105.05 (SE +/- 291.41, N = 3)
Gcrypt Library 1.9
(CC) gcc options: -O2 -fvisibility=hidden

Seconds, Fewer Is Better
  1: 203.54 (SE +/- 1.41, N = 3)
  2: 199.44 (SE +/- 1.94, N = 3)
  3: 196.19 (SE +/- 1.93, N = 3)
GnuPG 2.2.27
(CC) gcc options: -O2

2.7GB Sample File Encryption (Seconds, Fewer Is Better)
  1: 65.00 (SE +/- 0.49, N = 15)
  2: 63.43 (SE +/- 0.76, N = 4)
  3: 65.01 (SE +/- 0.68, N = 5)
Google SynthMark 20201109
(CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Test: VoiceMark_100 (Voices, More Is Better)
  1: 670.38 (SE +/- 2.59, N = 3)
  2: 683.00 (SE +/- 3.35, N = 3)
  3: 679.19 (SE +/- 1.04, N = 3)
lzbench 1.8
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Where the export lists only two SE values for three runs, they are given on a separate line in their original order, because the export does not say which runs they belong to.

Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  1: 41
  2: 41
  3: 43
  SE as exported: +/- 0.33, N = 3; +/- 0.58, N = 3

Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  1: 128
  2: 129
  3: 132
  SE as exported: +/- 1.00, N = 3; +/- 1.00, N = 3

Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
  1: 550
  2: 567
  3: 583
  SE as exported: +/- 5.13, N = 7; +/- 3.18, N = 3

Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
  1: 1684
  2: 1732
  3: 1775
  SE as exported: +/- 15.85, N = 7; +/- 6.69, N = 3

Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
  1: 106
  2: 108
  3: 111
  SE as exported: +/- 0.67, N = 3; +/- 0.67, N = 3

Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
  1: 1877 (SE +/- 4.16, N = 3)
  2: 1941 (SE +/- 23.18, N = 4)
  3: 1992 (SE +/- 12.77, N = 3)

Test: Crush 0 - Process: Compression (MB/s, More Is Better)
  1: 119
  2: 117
  3: 115
  SE as exported: +/- 0.58, N = 3; +/- 0.88, N = 3

Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
  1: 496 (SE +/- 4.84, N = 3)
  2: 491 (SE +/- 0.33, N = 3)
  3: 522 (SE +/- 2.08, N = 3)

Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
  1: 533 (SE +/- 4.18, N = 3)
  2: 545 (SE +/- 5.19, N = 6)
  3: 563 (SE +/- 1.33, N = 3)

Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
  1: 621 (SE +/- 3.18, N = 3)
  2: 627 (SE +/- 5.75, N = 6)
  3: 652 (SE +/- 2.85, N = 3)

Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
  1: 216
  2: 215
  3: 226
  SE as exported: +/- 2.37, N = 5; +/- 2.00, N = 3

Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
  1: 739 (SE +/- 9.12, N = 5)
  2: 735 (SE +/- 7.42, N = 3)
  3: 770 (SE +/- 4.10, N = 3)

Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
  1: 265
  2: 262
  3: 266
  SE as exported: +/- 2.87, N = 5; +/- 0.67, N = 3

Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
  1: 1243
  2: 1222
  3: 1246
  SE as exported: +/- 13.33, N = 5; +/- 1.86, N = 3
NAS Parallel Benchmarks 3.4
(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
Open MPI 4.0.3

Test / Class: EP.C (Total Mop/s, More Is Better)
  1: 1094.42 (SE +/- 1.59, N = 3)
  2: 1088.18 (SE +/- 3.37, N = 3)
  3: 1050.83 (SE +/- 12.46, N = 4)

Test / Class: EP.D (Total Mop/s, More Is Better)
  1: 1076.16 (SE +/- 2.16, N = 3)
  2: 1075.65 (SE +/- 1.95, N = 3)
  3: 1074.95 (SE +/- 0.35, N = 3)

Test / Class: LU.C (Total Mop/s, More Is Better)
  1: 24023.03 (SE +/- 23.65, N = 3)
  2: 24293.80 (SE +/- 15.89, N = 3)
  3: 24339.47 (SE +/- 22.21, N = 3)
ONNX Runtime 1.6
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1: 312 (SE +/- 0.33, N = 3)
  2: 312 (SE +/- 0.17, N = 3)
  3: 309 (SE +/- 2.02, N = 3)

Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1: 479 (SE +/- 4.54, N = 12)
  2: 493 (SE +/- 5.01, N = 5)
  3: 492 (SE +/- 5.43, N = 4)

Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1: 73 (SE +/- 0.17, N = 3)
  2: 73 (SE +/- 0.33, N = 3)
  3: 73 (SE +/- 0.29, N = 3)

Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1: 13173 (SE +/- 5.86, N = 3)
  2: 13065 (SE +/- 52.71, N = 3)
  3: 13067 (SE +/- 38.80, N = 3)

Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  1: 5036 (SE +/- 9.86, N = 3)
  2: 5068 (SE +/- 38.11, N = 12)
  3: 5120 (SE +/- 47.60, N = 3)
QuantLib 1.21
(CXX) g++ options: -O3 -march=native -rdynamic

MFLOPS, More Is Better
  1: 2632.0 (SE +/- 26.20, N = 3)
  2: 2534.8 (SE +/- 20.43, N = 9)
  3: 2647.4 (SE +/- 19.32, N = 11)
Redis 6.0.9
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Test: LPOP (Requests Per Second, More Is Better)
  1: 2326110.40 (SE +/- 25949.62, N = 5)
  2: 1469768.92 (SE +/- 19710.98, N = 3)
  3: 1481787.54 (SE +/- 6818.95, N = 3)

Test: SADD (Requests Per Second, More Is Better)
  1: 1908271.40 (SE +/- 22085.72, N = 4)
  2: 1863996.21 (SE +/- 15789.30, N = 3)
  3: 1944071.17 (SE +/- 2946.64, N = 3)

Test: LPUSH (Requests Per Second, More Is Better)
  1: 1449115.08 (SE +/- 8316.35, N = 3)
  2: 1450086.71 (SE +/- 10231.57, N = 3)
  3: 1431668.00 (SE +/- 15812.81, N = 4)

Test: GET (Requests Per Second, More Is Better)
  1: 2227494.92 (SE +/- 22607.64, N = 3)
  2: 2041514.48 (SE +/- 19485.15, N = 6)
  3: 2089924.33 (SE +/- 22138.91, N = 3)

Test: SET (Requests Per Second, More Is Better)
  1: 1663979.09 (SE +/- 16867.94, N = 3)
  2: 1682444.25 (SE +/- 19351.58, N = 3)
  3: 1650762.75 (SE +/- 12482.70, N = 15)
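The "SE +/- x, N = n" figures give a rough way to judge whether two runs really differ. The sketch below is in Python and is not something the Phoronix Test Suite produces. It assumes the SE values are standard errors of the mean, that they are listed in the same order as runs 1, 2, 3, and that the runs are independent. The function name `difference_in_se_units` is hypothetical, and the heuristic is not a formal significance test.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Gap between two means, divided by their combined standard error."""
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined == 0.0:
        # Both SEs are zero: identical means are 0 apart, anything else is unbounded.
        return 0.0 if mean_a == mean_b else math.inf
    return abs(mean_a - mean_b) / combined


# (mean, SE) pairs copied from the Redis results: run 1 versus run 2.
lpop = difference_in_se_units(2326110.40, 25949.62, 1469768.92, 19710.98)
set_ = difference_in_se_units(1663979.09, 16867.94, 1682444.25, 19351.58)

print(f"LPOP: {lpop:.1f} combined SEs apart")
print(f"SET:  {set_:.1f} combined SEs apart")
```

A gap of many combined SEs, as with LPOP, points to a real difference between the runs. A gap below about one, as with SET, is within what repeated measurement alone could produce.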
Phoronix Test Suite v10.8.5