390)X Sunday
AMD Ryzen 9 3900X 12-Core testing with an ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) and MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101259-HA-390XSUNDA01&grr&sor .
390)X Sunday: system configuration (result identifiers 1, 2, 3)
Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)
Motherboard: ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16GB
Disk: Samsung SSD 970 EVO Plus 250GB
Graphics: MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1366/2000MHz)
Audio: AMD Ellesmere HDMI Audio
Monitor: G237HL
Network: Realtek RTL8111/8168/8411 + Intel-AC 9260
OS: Ubuntu 20.04
Kernel: 5.11.0-rc1-phx (x86_64) 20201228
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
OpenGL: 4.6 Mesa 21.0.0-devel (git-28a202f 2020-12-31 focal-oibaf-ppa) (LLVM 11.0.0)
Vulkan: 1.2.145
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080
Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021
Python Details - Python 3.8.5
Security Details:
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling
  srbds: Not affected
  tsx_async_abort: Not affected
390)X Sunday: results overview
Test                                    1             2             3
onnx: bertsquad-10 - OpenMP CPU         479           493           492
onnx: super-resolution-10 - OpenMP CPU  5036          5068          5120
gcrypt:                                 203.540       199.437       196.189
gnupg: 2.7GB Sample File Encryption     64.996        63.430        65.009
askap: tConvolve MPI - Gridding         7428.05       7428.05       7359.66
askap: tConvolve MPI - Degridding       7051.52       6987.35       6999.98
askap: tConvolve MT - Degridding        1329.07       1323.66       1324.94
askap: tConvolve MT - Gridding          768.200       767.742       765.379
cpuminer-opt: Deepcoin                  11695         12036         11727
npb: EP.D                               1076.16       1075.65       1074.95
cpuminer-opt: Skeincoin                 83088         86862         90202
onnx: fcn-resnet101-11 - OpenMP CPU     73            73            73
onnx: yolov4 - OpenMP CPU               312           312           309
onnx: shufflenet-v2-10 - OpenMP CPU     13173         13065         13067
financebench: Bonds OpenMP              74792.953125  74056.320313  74105.049479
cpuminer-opt: LBC, LBRY Credits         35394         35025         34650
npb: LU.C                               24023.03      24293.80      24339.47
cpuminer-opt: Myriad-Groestl            18499         18100         18590
quantlib:                               2632.0        2534.8        2647.4
cpuminer-opt: Quad SHA-256, Pyrite      84847         84267         87686
cpuminer-opt: Triple SHA-256, Onecoin   102915        83551         104747
cpuminer-opt: Blake-2 S                 412783        411815        418870
cpuminer-opt: x25x                      402.85        394.71        397.81
financebench: Repo OpenMP               42031.884115  42185.287760  42124.738281
etcpak: ETC2                            180.410       179.385       181.474
lzbench: XZ 0 - Decompression           128           129           132
lzbench: XZ 0 - Compression             41            41            43
redis: SET                              1663979.09    1682444.25    1650762.75
lzbench: Zstd 1 - Decompression         1684          1732          1775
lzbench: Zstd 1 - Compression           550           567           583
lzbench: Zstd 8 - Decompression         1877          1941          1992
lzbench: Zstd 8 - Compression           106           108           111
lzbench: Brotli 2 - Decompression       739           735           770
lzbench: Brotli 2 - Compression         216           215           226
lzbench: Brotli 0 - Decompression       621           627           652
lzbench: Brotli 0 - Compression         533           545           563
cpuminer-opt: Garlicoin                 2660.94       2664.52       2666.98
cpuminer-opt: Magi                      621.07        613.68        616.75
cpuminer-opt: Ringcoin                  2631.70       2640.27       2636.56
synthmark: VoiceMark_100                670.381       683.000       679.186
lzbench: Crush 0 - Decompression        496           491           522
lzbench: Crush 0 - Compression          119           117           115
lzbench: Libdeflate 1 - Decompression   1243          1222          1246
lzbench: Libdeflate 1 - Compression     265           262           266
redis: GET                              2227494.92    2041514.48    2089924.33
redis: LPOP                             2326110.40    1469768.92    1481787.54
redis: LPUSH                            1449115.08    1450086.71    1431668.00
askap: Hogbom Clean OpenMP              168.445       169.015       182.815
redis: SADD                             1908271.40    1863996.21    1944071.17
etcpak: ETC1 + Dithering                294.984       294.797       296.253
etcpak: ETC1                            309.286       303.039       310.861
askap: tConvolve OpenMP - Degridding    1603.99       1597.55       1870.67
askap: tConvolve OpenMP - Gridding      1482.32       1465.64       2002.23
npb: EP.C                               1094.42       1088.18       1050.83
etcpak: DXT1                            1375.767      1380.743      1373.152
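Most tests in the overview agree closely across the three results, while a few differ widely. The sketch below is one way to surface those cases, using three rows copied from the overview. The function names and the 5 percent cutoff are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# Illustrative sketch: values copied from the overview for result identifiers 1, 2, 3.
# All three tests below are "More Is Better" metrics.
RESULTS = {
    "redis: LPOP": (2326110.40, 1469768.92, 1481787.54),
    "askap: tConvolve OpenMP - Gridding": (1482.32, 1465.64, 2002.23),
    "npb: EP.D": (1076.16, 1075.65, 1074.95),
}


def spread_percent(values):
    """Gap between the highest and lowest result, as a percentage of the lowest."""
    lowest, highest = min(values), max(values)
    return (highest - lowest) / lowest * 100.0


def flag_unstable(table, threshold=5.0):
    """Return tests whose spread exceeds `threshold` percent (hypothetical cutoff)."""
    flagged = {}
    for name, values in table.items():
        spread = spread_percent(values)
        if spread > threshold:
            flagged[name] = round(spread, 1)
    return flagged


if __name__ == "__main__":
    for name, spread in flag_unstable(RESULTS).items():
        print(f"{name}: {spread}% spread across results 1-3")
```

With these inputs the Redis LPOP and ASKAP tConvolve OpenMP Gridding rows are flagged, while NPB EP.D is not.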
ONNX Runtime 1.6 - Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 2: 493 (SE +/- 5.01, N = 5)
  Run 3: 492 (SE +/- 5.43, N = 4)
  Run 1: 479 (SE +/- 4.54, N = 12)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
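Each result is reported with "SE +/-" and "N". The sketch below assumes SE is the standard error of the mean over N trial runs, so the sample standard deviation can be recovered as SE * sqrt(N). The two-sigma comparison is only a rough rule of thumb for judging whether two results differ by more than run-to-run noise; the function names are hypothetical.

```python
import math


def sample_stddev_from_se(se, n):
    """Recover the sample standard deviation, assuming SE = s / sqrt(N)."""
    return se * math.sqrt(n)


def differ_beyond_noise(mean_a, se_a, mean_b, se_b, k=2.0):
    """True if the gap between two means exceeds k combined standard errors."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) > k * combined_se


if __name__ == "__main__":
    # bertsquad-10 figures from above: run 2 = 493 (SE 5.01), run 3 = 492 (SE 5.43)
    print(differ_beyond_noise(493, 5.01, 492, 5.43))
    # Spread of the individual run 1 trials (SE 4.54 over N = 12)
    print(round(sample_stddev_from_se(4.54, 12), 2))
```

By this rule, a one-point gap between runs 2 and 3 on bertsquad-10 is well inside the noise.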
ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 3: 5120 (SE +/- 47.60, N = 3)
  Run 2: 5068 (SE +/- 38.11, N = 12)
  Run 1: 5036 (SE +/- 9.86, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
  Run 3: 196.19 (SE +/- 1.93, N = 3)
  Run 2: 199.44 (SE +/- 1.94, N = 3)
  Run 1: 203.54 (SE +/- 1.41, N = 3)
  1. (CC) gcc options: -O2 -fvisibility=hidden
GnuPG 2.2.27 - 2.7GB Sample File Encryption (Seconds, Fewer Is Better)
  Run 2: 63.43 (SE +/- 0.76, N = 4)
  Run 1: 65.00 (SE +/- 0.49, N = 15)
  Run 3: 65.01 (SE +/- 0.68, N = 5)
  1. (CC) gcc options: -O2
ASKAP 1.0 - Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
  Run 2: 7428.05 (SE +/- 80.93, N = 3)
  Run 1: 7428.05 (SE +/- 34.51, N = 12)
  Run 3: 7359.66 (SE +/- 61.67, N = 15)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
  Run 1: 7051.52 (SE +/- 55.12, N = 15)
  Run 3: 6999.98 (SE +/- 49.12, N = 15)
  Run 2: 6987.35 (SE +/- 41.10, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
  Run 1: 1329.07 (SE +/- 0.70, N = 3)
  Run 3: 1324.94 (SE +/- 1.04, N = 3)
  Run 2: 1323.66 (SE +/- 2.24, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
  Run 1: 768.20 (SE +/- 0.25, N = 3)
  Run 2: 767.74 (SE +/- 1.15, N = 3)
  Run 3: 765.38 (SE +/- 0.54, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Cpuminer-Opt 3.15.5 - Algorithm: Deepcoin (kH/s, More Is Better)
  Run 2: 12036 (SE +/- 330.28, N = 12)
  Run 3: 11727 (SE +/- 213.76, N = 15)
  Run 1: 11695 (SE +/- 178.11, N = 15)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
  Run 1: 1076.16 (SE +/- 2.16, N = 3)
  Run 2: 1075.65 (SE +/- 1.95, N = 3)
  Run 3: 1074.95 (SE +/- 0.35, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
Cpuminer-Opt 3.15.5 - Algorithm: Skeincoin (kH/s, More Is Better)
  Run 3: 90202 (SE +/- 622.69, N = 13)
  Run 2: 86862 (SE +/- 1773.25, N = 12)
  Run 1: 83088 (SE +/- 2557.61, N = 12)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
ONNX Runtime 1.6 - Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 3: 73 (SE +/- 0.29, N = 3)
  Run 2: 73 (SE +/- 0.33, N = 3)
  Run 1: 73 (SE +/- 0.17, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6 - Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 2: 312 (SE +/- 0.17, N = 3)
  Run 1: 312 (SE +/- 0.33, N = 3)
  Run 3: 309 (SE +/- 2.02, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime 1.6 - Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
  Run 1: 13173 (SE +/- 5.86, N = 3)
  Run 3: 13067 (SE +/- 38.80, N = 3)
  Run 2: 13065 (SE +/- 52.71, N = 3)
  1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP (ms, Fewer Is Better)
  Run 2: 74056.32 (SE +/- 238.59, N = 3)
  Run 3: 74105.05 (SE +/- 291.41, N = 3)
  Run 1: 74792.95 (SE +/- 745.68, N = 6)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
Cpuminer-Opt 3.15.5 - Algorithm: LBC, LBRY Credits (kH/s, More Is Better)
  Run 1: 35394 (SE +/- 550.22, N = 12)
  Run 2: 35025 (SE +/- 255.12, N = 11)
  Run 3: 34650 (SE +/- 66.58, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
  Run 3: 24339.47 (SE +/- 22.21, N = 3)
  Run 2: 24293.80 (SE +/- 15.89, N = 3)
  Run 1: 24023.03 (SE +/- 23.65, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
Cpuminer-Opt 3.15.5 - Algorithm: Myriad-Groestl (kH/s, More Is Better)
  Run 3: 18590 (SE +/- 183.39, N = 3)
  Run 1: 18499 (SE +/- 209.11, N = 15)
  Run 2: 18100 (SE +/- 178.96, N = 6)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
QuantLib 1.21 (MFLOPS, More Is Better)
  Run 3: 2647.4 (SE +/- 19.32, N = 11)
  Run 1: 2632.0 (SE +/- 26.20, N = 3)
  Run 2: 2534.8 (SE +/- 20.43, N = 9)
  1. (CXX) g++ options: -O3 -march=native -rdynamic
Cpuminer-Opt 3.15.5 - Algorithm: Quad SHA-256, Pyrite (kH/s, More Is Better)
  Run 3: 87686 (SE +/- 925.13, N = 5)
  Run 1: 84847 (SE +/- 3663.30, N = 14)
  Run 2: 84267 (SE +/- 723.54, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Triple SHA-256, Onecoin (kH/s, More Is Better)
  Run 3: 104747 (SE +/- 626.11, N = 3)
  Run 1: 102915 (SE +/- 1232.09, N = 4)
  Run 2: 83551 (SE +/- 7425.46, N = 15)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Blake-2 S (kH/s, More Is Better)
  Run 3: 418870 (SE +/- 965.97, N = 3)
  Run 1: 412783 (SE +/- 3005.78, N = 3)
  Run 2: 411815 (SE +/- 4661.80, N = 15)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: x25x (kH/s, More Is Better)
  Run 1: 402.85 (SE +/- 5.12, N = 15)
  Run 3: 397.81 (SE +/- 0.71, N = 3)
  Run 2: 394.71 (SE +/- 0.27, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
FinanceBench 2016-07-25 - Benchmark: Repo OpenMP (ms, Fewer Is Better)
  Run 1: 42031.88 (SE +/- 112.48, N = 3)
  Run 3: 42124.74 (SE +/- 94.06, N = 3)
  Run 2: 42185.29 (SE +/- 59.24, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
Etcpak 0.7 - Configuration: ETC2 (Mpx/s, More Is Better)
  Run 3: 181.47 (SE +/- 1.56, N = 8)
  Run 1: 180.41 (SE +/- 1.86, N = 3)
  Run 2: 179.39 (SE +/- 0.99, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
lzbench 1.8 - Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  Run 3: 132
  Run 2: 129
  Run 1: 128
  SE +/- 1.00, N = 3; SE +/- 1.00, N = 3 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  Run 3: 43
  Run 2: 41
  Run 1: 41
  SE +/- 0.58, N = 3; SE +/- 0.33, N = 3 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis 6.0.9 - Test: SET (Requests Per Second, More Is Better)
  Run 2: 1682444.25 (SE +/- 19351.58, N = 3)
  Run 1: 1663979.09 (SE +/- 16867.94, N = 3)
  Run 3: 1650762.75 (SE +/- 12482.70, N = 15)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
lzbench 1.8 - Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
  Run 3: 1775
  Run 2: 1732
  Run 1: 1684
  SE +/- 6.69, N = 3; SE +/- 15.85, N = 7 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
  Run 3: 583
  Run 2: 567
  Run 1: 550
  SE +/- 3.18, N = 3; SE +/- 5.13, N = 7 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
  Run 3: 1992 (SE +/- 12.77, N = 3)
  Run 2: 1941 (SE +/- 23.18, N = 4)
  Run 1: 1877 (SE +/- 4.16, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
  Run 3: 111
  Run 2: 108
  Run 1: 106
  SE +/- 0.67, N = 3; SE +/- 0.67, N = 3 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
  Run 3: 770 (SE +/- 4.10, N = 3)
  Run 1: 739 (SE +/- 9.12, N = 5)
  Run 2: 735 (SE +/- 7.42, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
  Run 3: 226
  Run 1: 216
  Run 2: 215
  SE +/- 2.37, N = 5; SE +/- 2.00, N = 3 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
  Run 3: 652 (SE +/- 2.85, N = 3)
  Run 2: 627 (SE +/- 5.75, N = 6)
  Run 1: 621 (SE +/- 3.18, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
  Run 3: 563 (SE +/- 1.33, N = 3)
  Run 2: 545 (SE +/- 5.19, N = 6)
  Run 1: 533 (SE +/- 4.18, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Cpuminer-Opt 3.15.5 - Algorithm: Garlicoin (kH/s, More Is Better)
  Run 3: 2666.98 (SE +/- 1.57, N = 3)
  Run 2: 2664.52 (SE +/- 4.99, N = 3)
  Run 1: 2660.94 (SE +/- 3.55, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Magi (kH/s, More Is Better)
  Run 1: 621.07 (SE +/- 0.56, N = 3)
  Run 3: 616.75 (SE +/- 0.42, N = 3)
  Run 2: 613.68 (SE +/- 6.06, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Ringcoin (kH/s, More Is Better)
  Run 2: 2640.27 (SE +/- 2.28, N = 3)
  Run 3: 2636.56 (SE +/- 3.89, N = 3)
  Run 1: 2631.70 (SE +/- 3.47, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, More Is Better)
  Run 2: 683.00 (SE +/- 3.35, N = 3)
  Run 3: 679.19 (SE +/- 1.04, N = 3)
  Run 1: 670.38 (SE +/- 2.59, N = 3)
  1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
lzbench 1.8 - Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
  Run 3: 522 (SE +/- 2.08, N = 3)
  Run 1: 496 (SE +/- 4.84, N = 3)
  Run 2: 491 (SE +/- 0.33, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Crush 0 - Process: Compression (MB/s, More Is Better)
  Run 1: 119
  Run 2: 117
  Run 3: 115
  SE +/- 0.58, N = 3; SE +/- 0.88, N = 3 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
  Run 3: 1246
  Run 1: 1243
  Run 2: 1222
  SE +/- 1.86, N = 3; SE +/- 13.33, N = 5 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
  Run 3: 266
  Run 1: 265
  Run 2: 262
  SE +/- 0.67, N = 3; SE +/- 2.87, N = 5 (reported for only two of the three runs)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis 6.0.9 - Test: GET (Requests Per Second, More Is Better)
  Run 1: 2227494.92 (SE +/- 22607.64, N = 3)
  Run 3: 2089924.33 (SE +/- 22138.91, N = 3)
  Run 2: 2041514.48 (SE +/- 19485.15, N = 6)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: LPOP (Requests Per Second, More Is Better)
  Run 1: 2326110.40 (SE +/- 25949.62, N = 5)
  Run 3: 1481787.54 (SE +/- 6818.95, N = 3)
  Run 2: 1469768.92 (SE +/- 19710.98, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: LPUSH (Requests Per Second, More Is Better)
  Run 2: 1450086.71 (SE +/- 10231.57, N = 3)
  Run 1: 1449115.08 (SE +/- 8316.35, N = 3)
  Run 3: 1431668.00 (SE +/- 15812.81, N = 4)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ASKAP 1.0 - Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
  Run 3: 182.82 (SE +/- 0.00, N = 3)
  Run 2: 169.02 (SE +/- 0.25, N = 3)
  Run 1: 168.45 (SE +/- 0.19, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Redis 6.0.9 - Test: SADD (Requests Per Second, More Is Better)
  Run 3: 1944071.17 (SE +/- 2946.64, N = 3)
  Run 1: 1908271.40 (SE +/- 22085.72, N = 4)
  Run 2: 1863996.21 (SE +/- 15789.30, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Etcpak 0.7 - Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
  Run 3: 296.25 (SE +/- 1.22, N = 3)
  Run 1: 294.98 (SE +/- 0.64, N = 3)
  Run 2: 294.80 (SE +/- 0.08, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7 - Configuration: ETC1 (Mpx/s, More Is Better)
  Run 3: 310.86 (SE +/- 2.84, N = 3)
  Run 1: 309.29 (SE +/- 1.30, N = 3)
  Run 2: 303.04 (SE +/- 1.79, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
ASKAP 1.0 - Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
  Run 3: 1870.67 (SE +/- 4.37, N = 3)
  Run 1: 1603.99 (SE +/- 5.58, N = 3)
  Run 2: 1597.55 (SE +/- 3.20, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
  Run 3: 2002.23 (SE +/- 17.38, N = 3)
  Run 1: 1482.32 (SE +/- 16.78, N = 3)
  Run 2: 1465.64 (SE +/- 2.69, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
  Run 1: 1094.42 (SE +/- 1.59, N = 3)
  Run 2: 1088.18 (SE +/- 3.37, N = 3)
  Run 3: 1050.83 (SE +/- 12.46, N = 4)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
Etcpak 0.7 - Configuration: DXT1 (Mpx/s, More Is Better)
  Run 2: 1380.74 (SE +/- 1.64, N = 3)
  Run 1: 1375.77 (SE +/- 3.14, N = 3)
  Run 3: 1373.15 (SE +/- 3.99, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Phoronix Test Suite v10.8.5