TR 2970WX New 2021
AMD Ryzen Threadripper 2970WX 24-Core testing with a Gigabyte X399 AORUS Gaming 7 (F12h BIOS) and Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102066-HA-TR2970WXN34&grr&sro .
Result identifiers: 1, 2, 3, 4
  Processor: AMD Ryzen Threadripper 2970WX 24-Core @ 3.00GHz (24 Cores / 48 Threads)
  Motherboard: Gigabyte X399 AORUS Gaming 7 (F12h BIOS)
  Chipset: AMD 17h
  Memory: 16GB
  Disk: 120GB Corsair Force MP500
  Graphics: Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)
  Audio: Realtek ALC1220
  Monitor: VA2431
  Network: Qualcomm Atheros Killer E2500 + 2 x QLogic cLOM8214 1/10GbE + Intel 8265 / 8275
  OS: Ubuntu 20.04
  Kernel: 5.9.0-050900rc6daily20200926-generic (x86_64) 20200925
  Desktop: GNOME Shell 3.36.4
  Display Server: X Server 1.20.8
  OpenGL: 4.6 Mesa 20.2.6 (LLVM 11.0.0)
  Compiler: GCC 9.3.0
  File-System: ext4
  Screen Resolution: 1920x1080
Kernel Details
  - Transparent Huge Pages: madvise
Compiler Details
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details
  - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
  - CPU Microcode: 0x800820d
Graphics Details
  - GLAMOR
Python Details
  - Python 3.8.5
Security Details
  - itlb_multihit: Not affected
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  - spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  - spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
  - srbds: Not affected
  - tsx_async_abort: Not affected
Result overview (a "-" means no result was recorded for that identifier)
Test | 1 | 2 | 3 | 4
npb: LU.C | 38463.17 | 39462.12 | 38566.53 | -
webp2: Quality 95, Compression Effort 7 | 267.927 | 272.012 | 269.984 | -
financebench: Bonds OpenMP | 106038.831380 | 121601.700521 | 117912.921875 | -
pennant: sedovbig | 55.99153 | 53.91016 | 53.41428 | -
financebench: Repo OpenMP | 55137.901042 | 59107.768229 | 60899.583333 | -
gcrypt | 221.493 | 221.256 | 218.824 | -
pennant: leblancbig | 43.45899 | 45.21094 | 40.90000 | -
webp2: Quality 75, Compression Effort 7 | 144.796 | 146.643 | 145.507 | -
qmcpack: simple-H2O | 37.105 | 39.869 | 39.435 | -
npb: EP.D | 1342.01 | 1335.21 | 1329.41 | -
cpuminer-opt: Quad SHA-256, Pyrite | 121896 | 121733 | 121660 | -
cpuminer-opt: Skeincoin | 95743 | 98875 | 97722 | -
gnupg: 2.7GB Sample File Encryption | 71.876 | 72.186 | 71.872 | -
cpuminer-opt: Magi | 905.22 | 911.36 | 910.53 | -
cpuminer-opt: LBC, LBRY Credits | 32230 | 32828 | 32900 | -
paraview: Many Spheres - 1920 x 1080 (MiPolys / Sec) | 1568.810 | 1568.967 | 1568.704 | 1568.826
paraview: Many Spheres - 1920 x 1080 (Frames / Sec) | 15.65 | 15.65 | 15.65 | 15.65
lzbench: XZ 0 - Decompression | 112 | 109 | 108 | -
lzbench: XZ 0 - Compression | 37 | 35 | 36 | -
redis: SADD | 1646933.50 | 1583032.46 | 1633246.40 | -
askap: Hogbom Clean OpenMP | 159.844 | 164.308 | 163.703 | -
etcpak: ETC2 | 151.445 | 151.740 | 151.507 | -
cpuminer-opt: Triple SHA-256, Onecoin | 162513 | 162033 | 161737 | -
cpuminer-opt: Blake-2 S | 388923 | 388933 | 380417 | -
cpuminer-opt: Garlicoin | 3542.80 | 3536.47 | 3543.51 | -
cpuminer-opt: Ringcoin | 3061.18 | 3063.62 | 3042.20 | -
cpuminer-opt: Deepcoin | 13210 | 13337 | 13353 | -
cpuminer-opt: x25x | 624.09 | 617.44 | 624.44 | -
cpuminer-opt: Myriad-Groestl | 9387.68 | 9481.70 | 9493.20 | -
synthmark: VoiceMark_100 | 582.029 | 584.471 | 580.986 | -
lzbench: Zstd 8 - Decompression | 1747 | 1750 | 1780 | -
lzbench: Zstd 8 - Compression | 94 | 96 | 97 | -
lzbench: Brotli 2 - Decompression | 652 | 657 | 671 | -
lzbench: Brotli 2 - Compression | 196 | 197 | 200 | -
lzbench: Brotli 0 - Decompression | 557 | 563 | 564 | -
lzbench: Brotli 0 - Compression | 485 | 490 | 489 | -
lzbench: Libdeflate 1 - Decompression | 1163 | 1170 | 1197 | -
lzbench: Libdeflate 1 - Compression | 238 | 240 | 242 | -
lzbench: Crush 0 - Decompression | 453 | 455 | 454 | -
lzbench: Crush 0 - Compression | 91 | 90 | 89 | -
lzbench: Zstd 1 - Decompression | 1556 | 1564 | 1563 | -
lzbench: Zstd 1 - Compression | 530 | 532 | 532 | -
etcpak: ETC1 + Dithering | 225.465 | 223.532 | 223.444 | -
etcpak: ETC1 | 240.462 | 236.882 | 236.771 | -
redis: SET | 1425303.66 | 1380477.45 | 1372523.54 | -
redis: LPUSH | 1173888.5 | 1154272.04 | 1168206.63 | -
askap: tConvolve OpenMP - Degridding | 1525.08 | 2017.43 | 1667.59 | -
askap: tConvolve OpenMP - Gridding | 946.679 | 956.531 | 944.202 | -
redis: LPOP | 1968512.59 | 1218050.21 | 1251792.29 | -
paraview: Wavelet Contour - 1920 x 1080 (MiPolys / Sec) | 1227.463 | 1228.111 | 1228.796 | 1228.242
paraview: Wavelet Contour - 1920 x 1080 (Frames / Sec) | 117.78 | 117.85 | 117.91 | 117.86
redis: GET | 1906599.63 | 1771191.63 | 1776002.79 | -
paraview: Wavelet Volume - 1920 x 1080 (MiVoxels / Sec) | 2130.423 | 2119.037 | 2125.865 | 2127.447
paraview: Wavelet Volume - 1920 x 1080 (Frames / Sec) | 133.15 | 132.44 | 132.87 | 132.96
npb: EP.C | 1362.44 | 1361.29 | 1346.34 | -
webp2: Quality 100, Compression Effort 5 | 7.512 | 7.532 | 7.474 | -
webp2: Default | 3.146 | 3.128 | 3.116 | -
etcpak: DXT1 | 1675.472 | 1730.760 | 1702.611 | 1738.460
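Run-to-run variation is hard to judge from the raw overview values. The following is a minimal Python sketch that ranks a few tests by their spread across runs 1 to 3. The four rows are copied from the overview; the helper name `spread_percent` and the choice of rows are illustrative assumptions, not part of the Phoronix Test Suite output.

```python
# Values for runs 1, 2 and 3, copied from the result overview.
results = {
    "npb: LU.C": [38463.17, 39462.12, 38566.53],
    "redis: LPOP": [1968512.59, 1218050.21, 1251792.29],
    "askap: tConvolve OpenMP - Degridding": [1525.08, 2017.43, 1667.59],
    "gnupg: 2.7GB Sample File Encryption": [71.876, 72.186, 71.872],
}


def spread_percent(values):
    """Return (max - min) / min as a percentage (hypothetical helper)."""
    lo, hi = min(values), max(values)
    return (hi - lo) / lo * 100.0


# Print the tests with the largest spread first.
ranked = sorted(results.items(), key=lambda kv: spread_percent(kv[1]), reverse=True)
for name, values in ranked:
    print(f"{name:40s} {spread_percent(values):6.2f}%")
```

A large spread only flags a test for a closer look; whether it is meaningful depends on the reported standard error for each run.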
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
  Run 1: 38463.17 (SE +/- 1094.85, N = 15)
  Run 2: 39462.12 (SE +/- 787.33, N = 15)
  Run 3: 38566.53 (SE +/- 829.30, N = 15)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
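Each result is reported as a mean with "SE +/-" and a sample count N. Assuming SE denotes the standard error of the mean (SE = s / sqrt(N)), the per-sample standard deviation can be recovered as s = SE * sqrt(N). The sketch below applies this to the LU.C figures; `stddev_from_se` is a hypothetical helper name, and the interpretation of SE is an assumption rather than something stated in this export.

```python
import math


def stddev_from_se(se, n):
    """Recover the sample standard deviation, assuming se = s / sqrt(n)."""
    return se * math.sqrt(n)


# (mean, SE, N) for NAS Parallel Benchmarks LU.C, runs 1 to 3.
lu_c = {
    1: (38463.17, 1094.85, 15),
    2: (39462.12, 787.33, 15),
    3: (38566.53, 829.30, 15),
}

for run, (mean, se, n) in lu_c.items():
    sd = stddev_from_se(se, n)
    # Coefficient of variation: standard deviation relative to the mean.
    print(f"run {run}: s ~ {sd:.0f} Mop/s, CV ~ {sd / mean * 100:.1f}%")
```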
WebP2 Image Encode 20210126 - Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better)
  Run 1: 267.93 (SE +/- 0.17, N = 3)
  Run 2: 272.01 (SE +/- 0.52, N = 3)
  Run 3: 269.98 (SE +/- 0.47, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP (ms, Fewer Is Better)
  Run 1: 106038.83 (SE +/- 2912.04, N = 12)
  Run 2: 121601.70 (SE +/- 1261.07, N = 3)
  Run 3: 117912.92 (SE +/- 1674.14, N = 4)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
Pennant 1.0.1 - Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  Run 1: 55.99 (SE +/- 2.28, N = 12)
  Run 2: 53.91 (SE +/- 2.08, N = 15)
  Run 3: 53.41 (SE +/- 1.13, N = 15)
  1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
FinanceBench 2016-07-25 - Benchmark: Repo OpenMP (ms, Fewer Is Better)
  Run 1: 55137.90 (SE +/- 139.58, N = 3)
  Run 2: 59107.77 (SE +/- 1524.18, N = 15)
  Run 3: 60899.58 (SE +/- 1427.59, N = 15)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
  Run 1: 221.49 (SE +/- 0.33, N = 3)
  Run 2: 221.26 (SE +/- 0.65, N = 3)
  Run 3: 218.82 (SE +/- 0.17, N = 3)
  1. (CC) gcc options: -O2 -fvisibility=hidden
Pennant 1.0.1 - Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  Run 1: 43.46 (SE +/- 2.41, N = 12)
  Run 2: 45.21 (SE +/- 2.13, N = 12)
  Run 3: 40.90 (SE +/- 1.80, N = 15)
  1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
WebP2 Image Encode 20210126 - Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better)
  Run 1: 144.80 (SE +/- 2.00, N = 4)
  Run 2: 146.64 (SE +/- 2.09, N = 4)
  Run 3: 145.51 (SE +/- 2.31, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
QMCPACK 3.10 - Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
  Run 1: 37.11 (SE +/- 0.20, N = 3)
  Run 2: 39.87 (SE +/- 1.33, N = 15)
  Run 3: 39.44 (SE +/- 0.91, N = 12)
  1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
  Run 1: 1342.01 (SE +/- 7.49, N = 3)
  Run 2: 1335.21 (SE +/- 14.68, N = 3)
  Run 3: 1329.41 (SE +/- 14.93, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
Cpuminer-Opt 3.15.5 - Algorithm: Quad SHA-256, Pyrite (kH/s, More Is Better)
  Run 1: 121896 (SE +/- 1114.49, N = 12)
  Run 2: 121733 (SE +/- 948.48, N = 15)
  Run 3: 121660 (SE +/- 854.24, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Skeincoin (kH/s, More Is Better)
  Run 1: 95743 (SE +/- 1012.86, N = 3)
  Run 2: 98875 (SE +/- 1330.10, N = 4)
  Run 3: 97722 (SE +/- 906.53, N = 15)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
GnuPG 2.2.27 - 2.7GB Sample File Encryption (Seconds, Fewer Is Better)
  Run 1: 71.88 (SE +/- 0.47, N = 3)
  Run 2: 72.19 (SE +/- 0.61, N = 3)
  Run 3: 71.87 (SE +/- 0.37, N = 3)
  1. (CC) gcc options: -O2
Cpuminer-Opt 3.15.5 - Algorithm: Magi (kH/s, More Is Better)
  Run 1: 905.22 (SE +/- 5.53, N = 3)
  Run 2: 911.36 (SE +/- 3.20, N = 3)
  Run 3: 910.53 (SE +/- 6.44, N = 14)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: LBC, LBRY Credits (kH/s, More Is Better)
  Run 1: 32230 (SE +/- 52.92, N = 3)
  Run 2: 32828 (SE +/- 391.06, N = 13)
  Run 3: 32900 (SE +/- 480.14, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better)
  Run 1: 1568.81 (SE +/- 0.31, N = 3)
  Run 2: 1568.97 (SE +/- 0.49, N = 3)
  Run 3: 1568.70 (SE +/- 0.14, N = 3)
  Run 4: 1568.83 (SE +/- 0.43, N = 3)
ParaView 5.9 - Test: Many Spheres - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  Run 1: 15.65 (SE +/- 0.00, N = 3)
  Run 2: 15.65 (SE +/- 0.01, N = 3)
  Run 3: 15.65 (SE +/- 0.00, N = 3)
  Run 4: 15.65 (SE +/- 0.00, N = 3)
lzbench 1.8 - Test: XZ 0 - Process: Decompression (MB/s, More Is Better)
  Run 1: 112
  Run 2: 109
  Run 3: 108
  SE as exported (two entries for three runs): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  Run 1: 37
  Run 2: 35
  Run 3: 36
  SE as exported (one entry for three runs): SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis 6.0.9 - Test: SADD (Requests Per Second, More Is Better)
  Run 1: 1646933.50 (SE +/- 27726.78, N = 3)
  Run 2: 1583032.46 (SE +/- 6337.30, N = 3)
  Run 3: 1633246.40 (SE +/- 12738.10, N = 15)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ASKAP 1.0 - Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
  Run 1: 159.84 (SE +/- 1.08, N = 3)
  Run 2: 164.31 (SE +/- 1.63, N = 9)
  Run 3: 163.70 (SE +/- 1.73, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Etcpak 0.7 - Configuration: ETC2 (Mpx/s, More Is Better)
  Run 1: 151.45 (SE +/- 0.02, N = 3)
  Run 2: 151.74 (SE +/- 0.14, N = 3)
  Run 3: 151.51 (SE +/- 0.03, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Cpuminer-Opt 3.15.5 - Algorithm: Triple SHA-256, Onecoin (kH/s, More Is Better)
  Run 1: 162513 (SE +/- 271.44, N = 3)
  Run 2: 162033 (SE +/- 1274.97, N = 3)
  Run 3: 161737 (SE +/- 934.78, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Blake-2 S (kH/s, More Is Better)
  Run 1: 388923 (SE +/- 2127.83, N = 3)
  Run 2: 388933 (SE +/- 5128.22, N = 3)
  Run 3: 380417 (SE +/- 2926.41, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Garlicoin (kH/s, More Is Better)
  Run 1: 3542.80 (SE +/- 10.97, N = 3)
  Run 2: 3536.47 (SE +/- 4.58, N = 3)
  Run 3: 3543.51 (SE +/- 5.45, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Ringcoin (kH/s, More Is Better)
  Run 1: 3061.18 (SE +/- 8.20, N = 3)
  Run 2: 3063.62 (SE +/- 3.59, N = 3)
  Run 3: 3042.20 (SE +/- 5.15, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Deepcoin (kH/s, More Is Better)
  Run 1: 13210 (SE +/- 15.28, N = 3)
  Run 2: 13337 (SE +/- 116.81, N = 3)
  Run 3: 13353 (SE +/- 128.37, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: x25x (kH/s, More Is Better)
  Run 1: 624.09 (SE +/- 1.98, N = 3)
  Run 2: 617.44 (SE +/- 3.09, N = 3)
  Run 3: 624.44 (SE +/- 7.45, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 3.15.5 - Algorithm: Myriad-Groestl (kH/s, More Is Better)
  Run 1: 9387.68 (SE +/- 14.77, N = 3)
  Run 2: 9481.70 (SE +/- 105.26, N = 3)
  Run 3: 9493.20 (SE +/- 104.95, N = 3)
  1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, More Is Better)
  Run 1: 582.03 (SE +/- 0.16, N = 3)
  Run 2: 584.47 (SE +/- 0.64, N = 3)
  Run 3: 580.99 (SE +/- 2.76, N = 3)
  1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
lzbench 1.8 - Test: Zstd 8 - Process: Decompression (MB/s, More Is Better)
  Run 1: 1747 (SE +/- 3.21, N = 3)
  Run 2: 1750 (SE +/- 11.85, N = 3)
  Run 3: 1780 (SE +/- 5.21, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Zstd 8 - Process: Compression (MB/s, More Is Better)
  Run 1: 94
  Run 2: 96
  Run 3: 97
  SE as exported (one entry for three runs): SE +/- 0.58, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Brotli 2 - Process: Decompression (MB/s, More Is Better)
  Run 1: 652 (SE +/- 2.08, N = 3)
  Run 2: 657 (SE +/- 0.67, N = 3)
  Run 3: 671 (SE +/- 7.42, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Brotli 2 - Process: Compression (MB/s, More Is Better)
  Run 1: 196 (SE +/- 0.58, N = 3)
  Run 2: 197 (SE +/- 1.00, N = 3)
  Run 3: 200 (SE +/- 2.00, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Brotli 0 - Process: Decompression (MB/s, More Is Better)
  Run 1: 557
  Run 2: 563
  Run 3: 564
  SE as exported (two entries for three runs): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
  Run 1: 485 (SE +/- 1.20, N = 3)
  Run 2: 490 (SE +/- 2.19, N = 3)
  Run 3: 489 (SE +/- 1.53, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
  Run 1: 1163
  Run 2: 1170
  Run 3: 1197
  SE as exported (one entry for three runs): SE +/- 9.33, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
  Run 1: 238 (SE +/- 1.53, N = 3)
  Run 2: 240 (SE +/- 1.45, N = 3)
  Run 3: 242 (SE +/- 0.58, N = 3)
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Crush 0 - Process: Decompression (MB/s, More Is Better)
  Run 1: 453
  Run 2: 455
  Run 3: 454
  SE as exported (one entry for three runs): SE +/- 1.76, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Crush 0 - Process: Compression (MB/s, More Is Better)
  Run 1: 91
  Run 2: 90
  Run 3: 89
  SE as exported (two entries for three runs): SE +/- 1.20, N = 3; SE +/- 0.58, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
  Run 1: 1556
  Run 2: 1564
  Run 3: 1563
  SE as exported (two entries for three runs): SE +/- 0.58, N = 3; SE +/- 0.67, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 1.8 - Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
  Run 1: 530
  Run 2: 532
  Run 3: 532
  SE as exported (one entry for three runs): SE +/- 1.00, N = 3
  1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Etcpak 0.7 - Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
  Run 1: 225.47 (SE +/- 1.60, N = 3)
  Run 2: 223.53 (SE +/- 0.10, N = 3)
  Run 3: 223.44 (SE +/- 0.06, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak 0.7 - Configuration: ETC1 (Mpx/s, More Is Better)
  Run 1: 240.46 (SE +/- 1.55, N = 3)
  Run 2: 236.88 (SE +/- 0.09, N = 3)
  Run 3: 236.77 (SE +/- 0.07, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Redis 6.0.9 - Test: SET (Requests Per Second, More Is Better)
  Run 1: 1425303.66 (SE +/- 21239.06, N = 4)
  Run 2: 1380477.45 (SE +/- 21164.69, N = 3)
  Run 3: 1372523.54 (SE +/- 18661.63, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9 - Test: LPUSH (Requests Per Second, More Is Better)
  Run 1: 1173888.50 (SE +/- 17665.22, N = 3)
  Run 2: 1154272.04 (SE +/- 2287.81, N = 3)
  Run 3: 1168206.63 (SE +/- 12928.35, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ASKAP 1.0 - Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
  Run 1: 1525.08 (SE +/- 51.76, N = 3)
  Run 2: 2017.43 (SE +/- 167.10, N = 4)
  Run 3: 1667.59 (SE +/- 3.49, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
  Run 1: 946.68 (SE +/- 11.27, N = 3)
  Run 2: 956.53 (SE +/- 12.54, N = 4)
  Run 3: 944.20 (SE +/- 3.87, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Redis 6.0.9 - Test: LPOP (Requests Per Second, More Is Better)
  Run 1: 1968512.59 (SE +/- 29446.00, N = 3)
  Run 2: 1218050.21 (SE +/- 20158.66, N = 3)
  Run 3: 1251792.29 (SE +/- 14422.34, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
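Run 1 of the LPOP test stands well apart from runs 2 and 3. One rough way to check whether such a gap exceeds the reported noise is to express the difference of two means in units of their combined standard error. The sketch below does this under the assumptions that SE is the standard error of the mean and that the runs are independent; `diff_in_se_units` is a hypothetical helper, and this is a screening heuristic, not a formal significance test.

```python
import math


def diff_in_se_units(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined SE."""
    combined_se = math.hypot(se_a, se_b)  # sqrt(se_a**2 + se_b**2)
    return abs(mean_a - mean_b) / combined_se


# (mean, SE) for Redis LPOP, runs 1 to 3, in requests per second.
lpop = {
    1: (1968512.59, 29446.00),
    2: (1218050.21, 20158.66),
    3: (1251792.29, 14422.34),
}

for a, b in [(1, 2), (1, 3), (2, 3)]:
    ratio = diff_in_se_units(*lpop[a], *lpop[b])
    print(f"run {a} vs run {b}: difference is {ratio:.1f} combined SEs")
```

With only N = 3 samples per run, the SE estimates themselves are uncertain, so a large ratio is a prompt to re-run the test rather than proof of a real difference.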
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better)
  Run 1: 1227.46 (SE +/- 0.37, N = 3)
  Run 2: 1228.11 (SE +/- 0.46, N = 3)
  Run 3: 1228.80 (SE +/- 0.39, N = 3)
  Run 4: 1228.24 (SE +/- 0.26, N = 3)
ParaView 5.9 - Test: Wavelet Contour - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  Run 1: 117.78 (SE +/- 0.04, N = 3)
  Run 2: 117.85 (SE +/- 0.04, N = 3)
  Run 3: 117.91 (SE +/- 0.04, N = 3)
  Run 4: 117.86 (SE +/- 0.03, N = 3)
Redis 6.0.9 - Test: GET (Requests Per Second, More Is Better)
  Run 1: 1906599.63 (SE +/- 18718.72, N = 3)
  Run 2: 1771191.63 (SE +/- 26950.46, N = 3)
  Run 3: 1776002.79 (SE +/- 13229.54, N = 3)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (MiVoxels / Sec, More Is Better)
  Run 1: 2130.42 (SE +/- 8.70, N = 3)
  Run 2: 2119.04 (SE +/- 11.17, N = 3)
  Run 3: 2125.87 (SE +/- 5.25, N = 3)
  Run 4: 2127.45 (SE +/- 7.90, N = 3)
ParaView 5.9 - Test: Wavelet Volume - Resolution: 1920 x 1080 (Frames / Sec, More Is Better)
  Run 1: 133.15 (SE +/- 0.55, N = 3)
  Run 2: 132.44 (SE +/- 0.70, N = 3)
  Run 3: 132.87 (SE +/- 0.33, N = 3)
  Run 4: 132.96 (SE +/- 0.49, N = 3)
NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
  Run 1: 1362.44 (SE +/- 1.53, N = 3)
  Run 2: 1361.29 (SE +/- 2.95, N = 3)
  Run 3: 1346.34 (SE +/- 14.98, N = 6)
  1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
  2. Open MPI 4.0.3
WebP2 Image Encode 20210126 - Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better)
  Run 1: 7.512 (SE +/- 0.018, N = 3)
  Run 2: 7.532 (SE +/- 0.055, N = 3)
  Run 3: 7.474 (SE +/- 0.022, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
WebP2 Image Encode 20210126 - Encode Settings: Default (Seconds, Fewer Is Better)
  Run 1: 3.146 (SE +/- 0.023, N = 3)
  Run 2: 3.128 (SE +/- 0.023, N = 3)
  Run 3: 3.116 (SE +/- 0.020, N = 3)
  1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
Etcpak 0.7 - Configuration: DXT1 (Mpx/s, More Is Better)
  Run 1: 1675.47 (SE +/- 0.92, N = 3)
  Run 2: 1730.76 (SE +/- 5.28, N = 3)
  Run 3: 1702.61 (SE +/- 0.46, N = 3)
  Run 4: 1738.46 (SE +/- 1.24, N = 3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
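The results mix "More Is Better" and "Fewer Is Better" metrics, so raw values cannot be averaged across tests. A common approach is to convert each test to a ratio where values above 1.0 mean faster, then take the geometric mean. The sketch below compares run 2 against run 1 on five tests; the subset is hand-picked for illustration, `relative` is a hypothetical helper, and the output is not a summary of the full result file.

```python
from statistics import geometric_mean

# (test, higher_is_better, run 1, run 2), copied from the result sections.
rows = [
    ("npb: LU.C (Mop/s)", True, 38463.17, 39462.12),
    ("webp2: Quality 95, Effort 7 (s)", False, 267.93, 272.01),
    ("gcrypt (s)", False, 221.49, 221.26),
    ("redis: GET (req/s)", True, 1906599.63, 1771191.63),
    ("etcpak: DXT1 (Mpx/s)", True, 1675.47, 1730.76),
]


def relative(baseline, candidate, higher_is_better):
    """Return candidate performance relative to baseline; > 1.0 means faster."""
    if higher_is_better:
        return candidate / baseline
    # For time-like metrics a smaller value is faster, so invert the ratio.
    return baseline / candidate


ratios = [relative(r1, r2, hib) for _, hib, r1, r2 in rows]
for (name, _, _, _), ratio in zip(rows, ratios):
    print(f"{name:34s} {ratio:.4f}")
print(f"geometric mean over {len(ratios)} tests: {geometric_mean(ratios):.4f}")
```

The geometric mean is used instead of the arithmetic mean so that a 2x gain on one test and a 2x loss on another cancel out.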
Phoronix Test Suite v10.8.4