390)X Sunday AMD Ryzen 9 3900X 12-Core testing with a ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) and MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101259-HA-390XSUNDA01&sor&grs .
390)X Sunday Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads) ASUS TUF GAMING X570-PLUS (WI-FI) (2203 BIOS) AMD Starship/Matisse 16GB Samsung SSD 970 EVO Plus 250GB MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1366/2000MHz) AMD Ellesmere HDMI Audio G237HL Realtek RTL8111/8168/8411 + Intel-AC 9260 Ubuntu 20.04 5.11.0-rc1-phx (x86_64) 20201228 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 21.0.0-devel (git-28a202f 2020-12-31 focal-oibaf-ppa) (LLVM 11.0.0) 1.2.145 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Python Details - Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
390)X Sunday redis: LPOP askap: tConvolve OpenMP - Gridding askap: tConvolve OpenMP - Degridding redis: GET askap: Hogbom Clean OpenMP lzbench: Crush 0 - Decompression lzbench: Zstd 8 - Decompression lzbench: Zstd 1 - Compression lzbench: Brotli 0 - Compression lzbench: Zstd 1 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 0 - Decompression lzbench: XZ 0 - Compression lzbench: Brotli 2 - Decompression lzbench: Zstd 8 - Compression quantlib: redis: SADD npb: EP.C gcrypt: lzbench: Crush 0 - Compression lzbench: XZ 0 - Decompression onnx: bertsquad-10 - OpenMP CPU cpuminer-opt: Myriad-Groestl etcpak: ETC1 gnupg: 2.7GB Sample File Encryption cpuminer-opt: LBC, LBRY Credits cpuminer-opt: x25x lzbench: Libdeflate 1 - Decompression redis: SET synthmark: VoiceMark_100 cpuminer-opt: Blake-2 S onnx: super-resolution-10 - OpenMP CPU lzbench: Libdeflate 1 - Compression npb: LU.C redis: LPUSH cpuminer-opt: Magi etcpak: ETC2 financebench: Bonds OpenMP onnx: yolov4 - OpenMP CPU askap: tConvolve MPI - Gridding askap: tConvolve MPI - Degridding onnx: shufflenet-v2-10 - OpenMP CPU etcpak: DXT1 etcpak: ETC1 + Dithering askap: tConvolve MT - Degridding askap: tConvolve MT - Gridding financebench: Repo OpenMP cpuminer-opt: Ringcoin cpuminer-opt: Garlicoin npb: EP.D onnx: fcn-resnet101-11 - OpenMP CPU cpuminer-opt: Triple SHA-256, Onecoin cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Skeincoin cpuminer-opt: Deepcoin 1 2 3 2326110.40 1482.32 1603.99 2227494.92 168.445 496 1877 550 533 1684 216 621 41 739 106 2632.0 1908271.40 1094.42 203.540 119 128 479 18499 309.286 64.996 35394 402.85 1243 1663979.09 670.381 412783 5036 265 24023.03 1449115.08 621.07 180.410 74792.953125 312 7428.05 7051.52 13173 1375.767 294.984 1329.07 768.200 42031.884115 2631.70 2660.94 1076.16 73 102915 84847 83088 11695 1469768.92 1465.64 1597.55 2041514.48 169.015 491 1941 567 545 1732 215 627 41 735 108 2534.8 1863996.21 1088.18 199.437 117 129 493 18100 303.039 63.430 35025 394.71 1222 1682444.25 683.000 411815 5068 262 24293.80 1450086.71 613.68 179.385 74056.320313 312 7428.05 6987.35 13065 1380.743 294.797 1323.66 767.742 42185.287760 2640.27 2664.52 1075.65 73 83551 84267 86862 12036 1481787.54 2002.23 1870.67 2089924.33 182.815 522 1992 583 563 1775 226 652 43 770 111 2647.4 1944071.17 1050.83 196.189 115 132 492 18590 310.861 65.009 34650 397.81 1246 1650762.75 679.186 418870 5120 266 24339.47 1431668.00 616.75 181.474 74105.049479 309 7359.66 6999.98 13067 1373.152 296.253 1324.94 765.379 42124.738281 2636.56 2666.98 1074.95 73 104747 87686 90202 11727 OpenBenchmarking.org
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 3 2 500K 1000K 1500K 2000K 2500K SE +/- 25949.62, N = 5 SE +/- 6818.95, N = 3 SE +/- 19710.98, N = 3 2326110.40 1481787.54 1469768.92 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 3 1 2 400 800 1200 1600 2000 SE +/- 17.38, N = 3 SE +/- 16.78, N = 3 SE +/- 2.69, N = 3 2002.23 1482.32 1465.64 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 3 1 2 400 800 1200 1600 2000 SE +/- 4.37, N = 3 SE +/- 5.58, N = 3 SE +/- 3.20, N = 3 1870.67 1603.99 1597.55 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 3 2 500K 1000K 1500K 2000K 2500K SE +/- 22607.64, N = 3 SE +/- 22138.91, N = 3 SE +/- 19485.15, N = 6 2227494.92 2089924.33 2041514.48 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 3 2 1 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.25, N = 3 SE +/- 0.19, N = 3 182.82 169.02 168.45 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 3 1 2 110 220 330 440 550 SE +/- 2.08, N = 3 SE +/- 4.84, N = 3 SE +/- 0.33, N = 3 522 496 491 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 3 2 1 400 800 1200 1600 2000 SE +/- 12.77, N = 3 SE +/- 23.18, N = 4 SE +/- 4.16, N = 3 1992 1941 1877 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 3 2 1 130 260 390 520 650 SE +/- 3.18, N = 3 SE +/- 5.13, N = 7 583 567 550 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 3 2 1 120 240 360 480 600 SE +/- 1.33, N = 3 SE +/- 5.19, N = 6 SE +/- 4.18, N = 3 563 545 533 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 3 2 1 400 800 1200 1600 2000 SE +/- 6.69, N = 3 SE +/- 15.85, N = 7 1775 1732 1684 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 3 1 2 50 100 150 200 250 SE +/- 2.37, N = 5 SE +/- 2.00, N = 3 226 216 215 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 3 2 1 140 280 420 560 700 SE +/- 2.85, N = 3 SE +/- 5.75, N = 6 SE +/- 3.18, N = 3 652 627 621 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 3 2 1 10 20 30 40 50 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 43 41 41 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 3 1 2 170 340 510 680 850 SE +/- 4.10, N = 3 SE +/- 9.12, N = 5 SE +/- 7.42, N = 3 770 739 735 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 3 2 1 20 40 60 80 100 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 111 108 106 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 3 1 2 600 1200 1800 2400 3000 SE +/- 19.32, N = 11 SE +/- 26.20, N = 3 SE +/- 20.43, N = 9 2647.4 2632.0 2534.8 1. (CXX) g++ options: -O3 -march=native -rdynamic
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 3 1 2 400K 800K 1200K 1600K 2000K SE +/- 2946.64, N = 3 SE +/- 22085.72, N = 4 SE +/- 15789.30, N = 3 1944071.17 1908271.40 1863996.21 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 200 400 600 800 1000 SE +/- 1.59, N = 3 SE +/- 3.37, N = 3 SE +/- 12.46, N = 4 1094.42 1088.18 1050.83 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 3 2 1 40 80 120 160 200 SE +/- 1.93, N = 3 SE +/- 1.94, N = 3 SE +/- 1.41, N = 3 196.19 199.44 203.54 1. (CC) gcc options: -O2 -fvisibility=hidden
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 30 60 90 120 150 SE +/- 0.58, N = 3 SE +/- 0.88, N = 3 119 117 115 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 3 2 1 30 60 90 120 150 SE +/- 1.00, N = 3 SE +/- 1.00, N = 3 132 129 128 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 2 3 1 110 220 330 440 550 SE +/- 5.01, N = 5 SE +/- 5.43, N = 4 SE +/- 4.54, N = 12 493 492 479 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Myriad-Groestl 3 1 2 4K 8K 12K 16K 20K SE +/- 183.39, N = 3 SE +/- 209.11, N = 15 SE +/- 178.96, N = 6 18590 18499 18100 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 3 1 2 70 140 210 280 350 SE +/- 2.84, N = 3 SE +/- 1.30, N = 3 SE +/- 1.79, N = 3 310.86 309.29 303.04 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 2 1 3 15 30 45 60 75 SE +/- 0.76, N = 4 SE +/- 0.49, N = 15 SE +/- 0.68, N = 5 63.43 65.00 65.01 1. (CC) gcc options: -O2
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: LBC, LBRY Credits 1 2 3 8K 16K 24K 32K 40K SE +/- 550.22, N = 12 SE +/- 255.12, N = 11 SE +/- 66.58, N = 3 35394 35025 34650 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x 1 3 2 90 180 270 360 450 SE +/- 5.12, N = 15 SE +/- 0.71, N = 3 SE +/- 0.27, N = 3 402.85 397.81 394.71 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
lzbench Test: Libdeflate 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Decompression 3 1 2 300 600 900 1200 1500 SE +/- 1.86, N = 3 SE +/- 13.33, N = 5 1246 1243 1222 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 2 1 3 400K 800K 1200K 1600K 2000K SE +/- 19351.58, N = 3 SE +/- 16867.94, N = 3 SE +/- 12482.70, N = 15 1682444.25 1663979.09 1650762.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 2 3 1 150 300 450 600 750 SE +/- 3.35, N = 3 SE +/- 1.04, N = 3 SE +/- 2.59, N = 3 683.00 679.19 670.38 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S 3 1 2 90K 180K 270K 360K 450K SE +/- 965.97, N = 3 SE +/- 3005.78, N = 3 SE +/- 4661.80, N = 15 418870 412783 411815 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 3 2 1 1100 2200 3300 4400 5500 SE +/- 47.60, N = 3 SE +/- 38.11, N = 12 SE +/- 9.86, N = 3 5120 5068 5036 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 3 1 2 60 120 180 240 300 SE +/- 0.67, N = 3 SE +/- 2.87, N = 5 266 265 262 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 3 2 1 5K 10K 15K 20K 25K SE +/- 22.21, N = 3 SE +/- 15.89, N = 3 SE +/- 23.65, N = 3 24339.47 24293.80 24023.03 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 2 1 3 300K 600K 900K 1200K 1500K SE +/- 10231.57, N = 3 SE +/- 8316.35, N = 3 SE +/- 15812.81, N = 4 1450086.71 1449115.08 1431668.00 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi 1 3 2 130 260 390 520 650 SE +/- 0.56, N = 3 SE +/- 0.42, N = 3 SE +/- 6.06, N = 3 621.07 616.75 613.68 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 3 1 2 40 80 120 160 200 SE +/- 1.56, N = 8 SE +/- 1.86, N = 3 SE +/- 0.99, N = 3 181.47 180.41 179.39 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 2 3 1 16K 32K 48K 64K 80K SE +/- 238.59, N = 3 SE +/- 291.41, N = 3 SE +/- 745.68, N = 6 74056.32 74105.05 74792.95 1. (CXX) g++ options: -O3 -march=native -fopenmp
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 2 1 3 70 140 210 280 350 SE +/- 0.17, N = 3 SE +/- 0.33, N = 3 SE +/- 2.02, N = 3 312 312 309 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 2 1 3 1600 3200 4800 6400 8000 SE +/- 80.93, N = 3 SE +/- 34.51, N = 12 SE +/- 61.67, N = 15 7428.05 7428.05 7359.66 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 1 3 2 1500 3000 4500 6000 7500 SE +/- 55.12, N = 15 SE +/- 49.12, N = 15 SE +/- 41.10, N = 3 7051.52 6999.98 6987.35 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 3 2 3K 6K 9K 12K 15K SE +/- 5.86, N = 3 SE +/- 38.80, N = 3 SE +/- 52.71, N = 3 13173 13067 13065 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 2 1 3 300 600 900 1200 1500 SE +/- 1.64, N = 3 SE +/- 3.14, N = 3 SE +/- 3.99, N = 3 1380.74 1375.77 1373.15 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 3 1 2 60 120 180 240 300 SE +/- 1.22, N = 3 SE +/- 0.64, N = 3 SE +/- 0.08, N = 3 296.25 294.98 294.80 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 3 2 300 600 900 1200 1500 SE +/- 0.70, N = 3 SE +/- 1.04, N = 3 SE +/- 2.24, N = 3 1329.07 1324.94 1323.66 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 170 340 510 680 850 SE +/- 0.25, N = 3 SE +/- 1.15, N = 3 SE +/- 0.54, N = 3 768.20 767.74 765.38 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 1 3 2 9K 18K 27K 36K 45K SE +/- 112.48, N = 3 SE +/- 94.06, N = 3 SE +/- 59.24, N = 3 42031.88 42124.74 42185.29 1. (CXX) g++ options: -O3 -march=native -fopenmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Ringcoin 2 3 1 600 1200 1800 2400 3000 SE +/- 2.28, N = 3 SE +/- 3.89, N = 3 SE +/- 3.47, N = 3 2640.27 2636.56 2631.70 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin 3 2 1 600 1200 1800 2400 3000 SE +/- 1.57, N = 3 SE +/- 4.99, N = 3 SE +/- 3.55, N = 3 2666.98 2664.52 2660.94 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 200 400 600 800 1000 SE +/- 2.16, N = 3 SE +/- 1.95, N = 3 SE +/- 0.35, N = 3 1076.16 1075.65 1074.95 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 3 2 1 16 32 48 64 80 SE +/- 0.29, N = 3 SE +/- 0.33, N = 3 SE +/- 0.17, N = 3 73 73 73 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Triple SHA-256, Onecoin 3 1 2 20K 40K 60K 80K 100K SE +/- 626.11, N = 3 SE +/- 1232.09, N = 4 SE +/- 7425.46, N = 15 104747 102915 83551 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Quad SHA-256, Pyrite 3 1 2 20K 40K 60K 80K 100K SE +/- 925.13, N = 5 SE +/- 3663.30, N = 14 SE +/- 723.54, N = 3 87686 84847 84267 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin 3 2 1 20K 40K 60K 80K 100K SE +/- 622.69, N = 13 SE +/- 1773.25, N = 12 SE +/- 2557.61, N = 12 90202 86862 83088 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin 2 3 1 3K 6K 9K 12K 15K SE +/- 330.28, N = 12 SE +/- 213.76, N = 15 SE +/- 178.11, N = 15 12036 11727 11695 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Phoronix Test Suite v10.8.5