3900XT Mon AMD Ryzen 9 3900XT 12-Core testing with a MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.B3 BIOS) and AMD Radeon RX 56/64 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101262-PTS-3900XTMO34&sro&grr .
3900XT Mon Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 4 AMD Ryzen 9 3900XT 12-Core @ 3.80GHz (12 Cores / 24 Threads) MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.B3 BIOS) AMD Starship/Matisse 16GB 500GB Seagate FireCuda 520 SSD ZP500GM30002 AMD Radeon RX 56/64 8GB (1630/945MHz) AMD Vega 10 HDMI Audio ASUS MG28U Realtek Device 2600 + Realtek Device 3000 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.11.0-rc1-phx (x86_64) 20201228 GNOME Shell 3.38.1 X Server 1.20.9 amdgpu 19.1.0 4.6 Mesa 20.2.1 (LLVM 11.0.0) 1.2.131 GCC 10.2.0 ext4 3840x2160 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8701021 Python Details - Python 2.7.18 + Python 3.8.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3900XT Mon ior: 32MB - Default Test Directory ior: 16MB - Default Test Directory gcrypt: askap: tConvolve MT - Degridding askap: tConvolve MT - Gridding cpuminer-opt: Skeincoin cpuminer-opt: Blake-2 S npb: EP.D ior: 8MB - Default Test Directory cpuminer-opt: Myriad-Groestl cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: LBC, LBRY Credits cpuminer-opt: x25x gnupg: 2.7GB Sample File Encryption npb: LU.C cpuminer-opt: Triple SHA-256, Onecoin lzbench: Zstd 8 - Decompression lzbench: Zstd 8 - Compression lzbench: Crush 0 - Decompression lzbench: Crush 0 - Compression cpuminer-opt: Garlicoin quantlib: cpuminer-opt: Magi financebench: Bonds OpenMP ior: 4MB - Default Test Directory cpuminer-opt: Ringcoin cpuminer-opt: Deepcoin redis: SET askap: tConvolve MPI - Gridding askap: tConvolve MPI - Degridding lzbench: Brotli 0 - Decompression lzbench: Brotli 0 - Compression financebench: Repo OpenMP lzbench: XZ 0 - Decompression lzbench: XZ 0 - Compression redis: LPUSH lzbench: Brotli 2 - Decompression lzbench: Brotli 2 - Compression redis: SADD redis: GET redis: LPOP cython-bench: N-Queens etcpak: ETC2 etcpak: ETC1 + Dithering askap: Hogbom Clean OpenMP lzbench: Zstd 1 - Decompression lzbench: Zstd 1 - Compression etcpak: ETC1 ior: 2MB - Default Test Directory askap: tConvolve OpenMP - Degridding askap: tConvolve OpenMP - Gridding lzbench: Libdeflate 1 - Compression etcpak: DXT1 npb: EP.C 1 2 3 4 495.89 502.00 189.379 1304.40 768.326 79968 424753 1113.76 572.49 15704 66104 34423 437.02 64.036 25440.71 88850 2000 113 555 124 2740.94 2656.9 656.06 48900.851562 642.36 2708.29 11670 1848258.80 7717.52 7380.45 661 568 34171.065104 137 45 1639285.39 773 226 2198504.75 2495306.0 2646764.17 21.841 191.526 299.985 165.837 1821 586 323.102 1114.14 1594.35 1457.62 282 1444.481 1125.63 460.44 436.13 194.921 1305.90 765.929 75588 431572 1116.27 627.88 14706 57327 35490 434.50 63.864 25746.75 93133 1973 111 540 113 2738.61 2628.0 658.45 49063.674479 651.65 2737.92 11760 1864366.86 7717.52 7428.05 655 562 33975.346354 130 43 1611296.08 768 226 2144897.75 2268725.50 2053102.01 22.567 191.707 299.986 165.847 1748 567 324.961 1010.73 1603.95 1460.31 270 1431.376 1122.34 449.38 453.34 190.739 1303.41 766.603 76781 431999 1117.72 475.40 14520 63937 33917 436.52 64.675 25770.17 86578 1988 112 535 113 2743.31 2557.1 650.47 48799.832031 655.02 2724.16 11643 1845440.12 7620.44 7380.45 650 556 34077.460937 131 43 1605401.62 781 229 2065240.05 2249303.25 1592607.04 21.657 184.876 292.669 163.845 1750 567 319.761 1004.43 1600.75 1460.31 281 1402.734 1124.14 450.46 452.64 192.549 1302.87 767.094 77967 436577 1115.75 459.80 14474 66815 35479 443.08 63.853 25620.07 87550 1971 111 533 110 2786.75 2546.6 660.96 48943.471354 679.56 2554.69 11496.20 1829013.50 7523.37 7475.66 644 551 34164.032552 133 44 1587779.08 773 226 2095075.75 2315346.74 1667545.69 22.258 184.174 288.250 164.114 1749 566 315.411 1041.95 1591.26 1449.68 272 1379.740 1120.95 OpenBenchmarking.org
IOR Block Size: 32MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 32MB - Disk Target: Default Test Directory 1 2 3 4 110 220 330 440 550 SE +/- 6.54, N = 15 SE +/- 5.25, N = 15 SE +/- 8.32, N = 12 SE +/- 5.31, N = 4 495.89 460.44 449.38 450.46 MIN: 381.11 / MAX: 1625.18 MIN: 37.51 / MAX: 1722.27 MIN: 37.39 / MAX: 1646.71 MIN: 38.18 / MAX: 1768.91 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 16MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 16MB - Disk Target: Default Test Directory 1 2 3 4 110 220 330 440 550 SE +/- 4.78, N = 15 SE +/- 17.10, N = 15 SE +/- 5.04, N = 15 SE +/- 6.08, N = 12 502.00 436.13 453.34 452.64 MIN: 159.58 / MAX: 1798.86 MIN: 272.33 / MAX: 1746.13 MIN: 26.61 / MAX: 1793.9 MIN: 34 / MAX: 1575.24 1. (CC) gcc options: -O2 -lm -pthread -lmpi
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 4 40 80 120 160 200 SE +/- 0.20, N = 3 SE +/- 2.37, N = 4 SE +/- 2.37, N = 3 SE +/- 1.74, N = 3 189.38 194.92 190.74 192.55 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 4 300 600 900 1200 1500 SE +/- 3.78, N = 3 SE +/- 3.17, N = 3 SE +/- 1.92, N = 3 SE +/- 1.53, N = 3 1304.40 1305.90 1303.41 1302.87 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 4 170 340 510 680 850 SE +/- 1.12, N = 3 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 SE +/- 0.39, N = 3 768.33 765.93 766.60 767.09 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin 1 2 3 4 20K 40K 60K 80K 100K SE +/- 1179.26, N = 15 SE +/- 1432.79, N = 15 SE +/- 1469.38, N = 15 SE +/- 958.46, N = 15 79968 75588 76781 77967 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S 1 2 3 4 90K 180K 270K 360K 450K SE +/- 3444.51, N = 9 SE +/- 4302.88, N = 15 SE +/- 4016.95, N = 15 SE +/- 4045.19, N = 15 424753 431572 431999 436577 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 4 200 400 600 800 1000 SE +/- 1.21, N = 3 SE +/- 2.12, N = 3 SE +/- 1.25, N = 3 SE +/- 1.17, N = 3 1113.76 1116.27 1117.72 1115.75 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
IOR Block Size: 8MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 8MB - Disk Target: Default Test Directory 1 2 3 4 140 280 420 560 700 SE +/- 10.94, N = 15 SE +/- 14.65, N = 15 SE +/- 21.49, N = 15 SE +/- 18.04, N = 15 572.49 627.88 475.40 459.80 MIN: 313.1 / MAX: 1803.4 MIN: 245.9 / MAX: 1963.18 MIN: 27.73 / MAX: 1918.99 MIN: 30.99 / MAX: 1803.79 1. (CC) gcc options: -O2 -lm -pthread -lmpi
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Myriad-Groestl 1 2 3 4 3K 6K 9K 12K 15K SE +/- 1185.42, N = 15 SE +/- 153.52, N = 15 SE +/- 76.38, N = 3 SE +/- 236.87, N = 15 15704 14706 14520 14474 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Quad SHA-256, Pyrite 1 2 3 4 14K 28K 42K 56K 70K SE +/- 539.72, N = 9 SE +/- 3808.52, N = 12 SE +/- 699.10, N = 3 SE +/- 3975.46, N = 15 66104 57327 63937 66815 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: LBC, LBRY Credits 1 2 3 4 8K 16K 24K 32K 40K SE +/- 488.34, N = 3 SE +/- 298.72, N = 3 SE +/- 747.95, N = 15 SE +/- 381.67, N = 15 34423 35490 33917 35479 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x 1 2 3 4 100 200 300 400 500 SE +/- 4.23, N = 3 SE +/- 8.08, N = 15 SE +/- 5.91, N = 3 SE +/- 6.02, N = 14 437.02 434.50 436.52 443.08 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 1 2 3 4 14 28 42 56 70 SE +/- 0.72, N = 4 SE +/- 0.61, N = 6 SE +/- 0.29, N = 3 SE +/- 0.84, N = 3 64.04 63.86 64.68 63.85 1. (CC) gcc options: -O2
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 4 6K 12K 18K 24K 30K SE +/- 18.43, N = 3 SE +/- 15.97, N = 3 SE +/- 6.52, N = 3 SE +/- 85.43, N = 3 25440.71 25746.75 25770.17 25620.07 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Triple SHA-256, Onecoin 1 2 3 4 20K 40K 60K 80K 100K SE +/- 475.01, N = 3 SE +/- 661.12, N = 3 SE +/- 2029.96, N = 12 SE +/- 2419.87, N = 12 88850 93133 86578 87550 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 4 400 800 1200 1600 2000 SE +/- 16.50, N = 3 SE +/- 12.78, N = 9 SE +/- 11.35, N = 15 SE +/- 23.36, N = 3 2000 1973 1988 1971 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 4 30 60 90 120 150 SE +/- 0.92, N = 9 SE +/- 0.98, N = 15 113 111 112 111 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 4 120 240 360 480 600 SE +/- 6.09, N = 5 SE +/- 1.57, N = 15 SE +/- 1.20, N = 3 555 540 535 533 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 4 30 60 90 120 150 SE +/- 1.21, N = 5 SE +/- 0.75, N = 15 124 113 113 110 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin 1 2 3 4 600 1200 1800 2400 3000 SE +/- 7.98, N = 3 SE +/- 2.33, N = 3 SE +/- 27.32, N = 15 SE +/- 31.97, N = 3 2740.94 2738.61 2743.31 2786.75 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 4 600 1200 1800 2400 3000 SE +/- 2.19, N = 3 SE +/- 27.50, N = 5 SE +/- 20.85, N = 9 SE +/- 21.61, N = 8 2656.9 2628.0 2557.1 2546.6 1. (CXX) g++ options: -O3 -march=native -rdynamic
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi 1 2 3 4 140 280 420 560 700 SE +/- 7.57, N = 3 SE +/- 9.61, N = 14 SE +/- 1.27, N = 3 SE +/- 7.36, N = 3 656.06 658.45 650.47 660.96 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 1 2 3 4 11K 22K 33K 44K 55K SE +/- 106.82, N = 3 SE +/- 115.39, N = 3 SE +/- 92.99, N = 3 SE +/- 121.62, N = 3 48900.85 49063.67 48799.83 48943.47 1. (CXX) g++ options: -O3 -march=native -fopenmp
IOR Block Size: 4MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 4MB - Disk Target: Default Test Directory 1 2 3 4 150 300 450 600 750 SE +/- 26.70, N = 15 SE +/- 13.55, N = 15 SE +/- 16.65, N = 15 SE +/- 18.28, N = 15 642.36 651.65 655.02 679.56 MIN: 180.14 / MAX: 1833.27 MIN: 282.94 / MAX: 1862.29 MIN: 279.91 / MAX: 1864.38 MIN: 277.01 / MAX: 1836.9 1. (CC) gcc options: -O2 -lm -pthread -lmpi
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Ringcoin 1 2 3 4 600 1200 1800 2400 3000 SE +/- 1.25, N = 3 SE +/- 19.58, N = 3 SE +/- 25.71, N = 3 SE +/- 175.23, N = 12 2708.29 2737.92 2724.16 2554.69 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin 1 2 3 4 3K 6K 9K 12K 15K SE +/- 97.13, N = 3 SE +/- 124.23, N = 3 SE +/- 14.53, N = 3 SE +/- 176.41, N = 12 11670.00 11760.00 11643.00 11496.20 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 4 400K 800K 1200K 1600K 2000K SE +/- 12069.89, N = 14 SE +/- 16495.42, N = 15 SE +/- 19390.22, N = 3 SE +/- 21610.11, N = 3 1848258.80 1864366.86 1845440.12 1829013.50 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 1 2 3 4 1700 3400 5100 6800 8500 SE +/- 97.08, N = 3 SE +/- 97.08, N = 3 7717.52 7717.52 7620.44 7523.37 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 1 2 3 4 1600 3200 4800 6400 8000 SE +/- 45.84, N = 3 SE +/- 80.93, N = 3 SE +/- 45.84, N = 3 SE +/- 93.44, N = 3 7380.45 7428.05 7380.45 7475.66 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 4 140 280 420 560 700 SE +/- 2.67, N = 3 SE +/- 6.22, N = 5 SE +/- 6.89, N = 3 SE +/- 4.31, N = 11 661 655 650 644 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 4 120 240 360 480 600 SE +/- 4.00, N = 3 SE +/- 5.90, N = 5 SE +/- 6.00, N = 3 SE +/- 4.02, N = 11 568 562 556 551 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 1 2 3 4 7K 14K 21K 28K 35K SE +/- 199.03, N = 3 SE +/- 69.51, N = 3 SE +/- 127.60, N = 3 SE +/- 118.98, N = 3 34171.07 33975.35 34077.46 34164.03 1. (CXX) g++ options: -O3 -march=native -fopenmp
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 4 30 60 90 120 150 SE +/- 1.49, N = 4 137 130 131 133 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1 2 3 4 10 20 30 40 50 SE +/- 0.50, N = 4 45 43 43 44 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 4 400K 800K 1200K 1600K 2000K SE +/- 14329.46, N = 7 SE +/- 10913.83, N = 15 SE +/- 7188.41, N = 3 SE +/- 19053.53, N = 3 1639285.39 1611296.08 1605401.62 1587779.08 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 4 200 400 600 800 1000 SE +/- 10.59, N = 3 SE +/- 10.17, N = 3 SE +/- 1.73, N = 3 SE +/- 6.98, N = 8 773 768 781 773 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 4 50 100 150 200 250 SE +/- 2.65, N = 3 SE +/- 3.06, N = 3 SE +/- 1.97, N = 8 226 226 229 226 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 4 500K 1000K 1500K 2000K 2500K SE +/- 23924.03, N = 3 SE +/- 18447.44, N = 3 SE +/- 16109.79, N = 15 SE +/- 19871.31, N = 3 2198504.75 2144897.75 2065240.05 2095075.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 4 500K 1000K 1500K 2000K 2500K SE +/- 33785.28, N = 3 SE +/- 9017.13, N = 3 SE +/- 25514.94, N = 3 SE +/- 21404.07, N = 15 2495306.00 2268725.50 2249303.25 2315346.74 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 4 600K 1200K 1800K 2400K 3000K SE +/- 31483.61, N = 3 SE +/- 138661.15, N = 12 SE +/- 11133.45, N = 3 SE +/- 20108.17, N = 4 2646764.17 2053102.01 1592607.04 1667545.69 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Cython Benchmark Test: N-Queens OpenBenchmarking.org Seconds, Fewer Is Better Cython Benchmark 0.29.21 Test: N-Queens 1 2 3 4 5 10 15 20 25 SE +/- 0.07, N = 3 SE +/- 0.25, N = 4 SE +/- 0.07, N = 3 SE +/- 0.25, N = 5 21.84 22.57 21.66 22.26
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 4 40 80 120 160 200 SE +/- 0.10, N = 3 SE +/- 0.27, N = 3 SE +/- 0.93, N = 3 SE +/- 0.86, N = 3 191.53 191.71 184.88 184.17 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 4 70 140 210 280 350 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 2.33, N = 9 SE +/- 0.06, N = 3 299.99 299.99 292.67 288.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 1 2 3 4 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.88, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 165.84 165.85 163.85 164.11 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 4 400 800 1200 1600 2000 SE +/- 9.35, N = 3 SE +/- 5.36, N = 3 SE +/- 3.46, N = 3 SE +/- 1.45, N = 3 1821 1748 1750 1749 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 4 130 260 390 520 650 SE +/- 0.67, N = 3 SE +/- 2.65, N = 3 586 567 567 566 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 4 70 140 210 280 350 SE +/- 4.32, N = 3 SE +/- 2.93, N = 7 SE +/- 3.79, N = 3 SE +/- 0.84, N = 3 323.10 324.96 319.76 315.41 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
IOR Block Size: 2MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 2MB - Disk Target: Default Test Directory 1 2 3 4 200 400 600 800 1000 SE +/- 30.57, N = 15 SE +/- 29.45, N = 12 SE +/- 27.86, N = 15 SE +/- 29.72, N = 15 1114.14 1010.73 1004.43 1041.95 MIN: 240.36 / MAX: 1651.95 MIN: 229.32 / MAX: 1764.83 MIN: 243.37 / MAX: 1844.92 MIN: 228.82 / MAX: 1726.96 1. (CC) gcc options: -O2 -lm -pthread -lmpi
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 1 2 3 4 300 600 900 1200 1500 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 3.20, N = 3 SE +/- 8.36, N = 3 1594.35 1603.95 1600.75 1591.26 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 1 2 3 4 300 600 900 1200 1500 SE +/- 2.67, N = 3 SE +/- 5.36, N = 3 SE +/- 5.36, N = 3 SE +/- 2.64, N = 3 1457.62 1460.31 1460.31 1449.68 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 4 60 120 180 240 300 SE +/- 0.33, N = 3 282 270 281 272 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 4 300 600 900 1200 1500 SE +/- 4.84, N = 3 SE +/- 13.95, N = 6 SE +/- 10.54, N = 15 SE +/- 4.78, N = 3 1444.48 1431.38 1402.73 1379.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 4 200 400 600 800 1000 SE +/- 1.82, N = 3 SE +/- 2.21, N = 3 SE +/- 1.04, N = 3 SE +/- 1.41, N = 3 1125.63 1122.34 1124.14 1120.95 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Phoronix Test Suite v10.8.4