GCC 10 AMD Threadripper 3960X PGO Optimization AMD Ryzen Threadripper 3960X 24-Core testing with a MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 19.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1912220-PTS-GCC10AMD97&grs&sor .
GCC 10 AMD Threadripper 3960X PGO Optimization Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution GCC 10 Sabrent Rocket 4.0 1TB AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads) MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) AMD Starship/Matisse 32768MB 1000GB Sabrent Rocket 4.0 1TB Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz) AMD Baffin HDMI/DP ASUS VP28U Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Device 2723 Ubuntu 19.10 5.4.0-nvme-hwmon (x86_64) GNOME Shell 3.34.1 X Server 1.20.5 modesetting 1.20.5 4.5 Mesa 19.2.1 (LLVM 9.0.0) GCC 10.0.0 20191208 ext4 3840x2160 OpenBenchmarking.org Compiler Details - --disable-multilib --enable-checking=release Disk Details - NONE / errors=remount-ro,relatime,rw Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected
GCC 10 AMD Threadripper 3960X PGO Optimization fftw: Float + SSE - 2D FFT Size 4096 compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 fftw: Float + SSE - 1D FFT Size 32 stockfish: Total Time qmcpack: fftw: Stock - 1D FFT Size 32 build-imagemagick: Time To Compile fftw: Stock - 2D FFT Size 4096 fftw: Stock - 2D FFT Size 32 pgbench: Buffer Test - Heavy Contention - Read Only pgbench: Buffer Test - Normal Load - Read Only sqlite-speedtest: Timed Time - Size 1,000 byte: Dhrystone 2 ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 fftw: Float + SSE - 2D FFT Size 32 encode-mp3: WAV To MP3 gromacs: Water Benchmark mrbayes: Primate Phylogeny Analysis tscp: AI Chess Performance encode-flac: WAV To FLAC mkl-dnn: Convolution Batch conv_alexnet - f32 himeno: Poisson Pressure Solver sqlite: 1 openssl: RSA 4096-bit Performance rocksdb: Seq Fill rocksdb: Rand Read radiance: SMP Parallel rocksdb: Read While Writing mkl-dnn: Deconvolution Batch deconv_1d - f32 crafty: Elapsed Time rocksdb: Rand Fill rocksdb: Rand Fill Sync askap: tConvolve MT - Gridding radiance: Serial minife: Small askap: tConvolve MT - Degridding mkl-dnn: Recurrent Neural Network Training - f32 mkl-dnn: Convolution Batch conv_googlenet_v3 - f32 askap: tConvolve OpenMP - Degridding askap: tConvolve OpenMP - Gridding byte: Floating-Point Arithmetic byte: Register Arithmetic byte: Integer Arithmetic hpcc: Max Ping Pong Bandwidth hpcc: Rand Ring Bandwidth hpcc: Rand Ring Latency hpcc: G-Rand Access hpcc: EP-STREAM Triad hpcc: G-Ptrans hpcc: EP-DGEMM hpcc: G-Ffte hpcc: G-Ffte hpcc: G-HPL mt-dgemm: Sustained Floating-Point Rate GCC 10 Sabrent Rocket 4.0 1TB 22667 19.865 15396 79359613 1878 10443 16.469 6687.3 10512 676349.711031 669039.844251 57.263 48055276.3 938.471 10.260 45404 7.297 2.505 70.008 1346651 7.719 123.990 4684.299494 14.184 7180.6 1019862 145207827 171.295 4889956 2.32419 9234824 938039 24588 1947.24 555.936 7740.10 3359.12 194.248 52.3291 4096.25 5433.8 1 1 1 22976.998 3.40678 0.45863 0.14278 1.79750 5.47737 32.92793 10.49127 10.49127 63.62933 8.567282 3767.3 799.112 3309.7 2442535 7137.2 3436.6 5.799 2407 3862.4 260493.711863 264129.487095 130.22 21645648.2 5.01883 1595.096 3708.4 9.243 1.995 83.272 1189585 8.016 120.154 4820.052519 14.595 7060.1 1035308 143335505 169.594 4937048 2.30704 9301788 932120 24460 1937.58 554.955 7737.78 3359.7 194.225 52.3319 4096.25 5433.8 1 1 1 0.078489 OpenBenchmarking.org
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 GCC 10 Sabrent Rocket 4.0 1TB 5K 10K 15K 20K 25K SE +/- 285.77, N = 3 22667.0 3767.3 -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math 1. (CC) gcc options: -pthread -lm
XZ Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 GCC 10 Sabrent Rocket 4.0 1TB 200 400 600 800 1000 SE +/- 0.02, N = 3 19.87 799.11 1. (CC) gcc options: -pthread -fvisibility=hidden
FFTW Build: Float + SSE - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 GCC 10 Sabrent Rocket 4.0 1TB 3K 6K 9K 12K 15K SE +/- 15.37, N = 3 15396.0 3309.7 -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math 1. (CC) gcc options: -pthread -lm
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time GCC 10 Sabrent Rocket 4.0 1TB 20M 40M 60M 80M 100M SE +/- 526550.53, N = 3 79359613 2442535 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto
QMCPACK OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.8 GCC 10 Sabrent Rocket 4.0 1TB 1500 3000 4500 6000 7500 1878.0 7137.2 1. (CXX) g++ options: -fopenmp -fomit-frame-pointer -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 GCC 10 Sabrent Rocket 4.0 1TB 2K 4K 6K 8K 10K SE +/- 16.77, N = 3 10443.0 3436.6 -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math 1. (CC) gcc options: -pthread -lm
Timed ImageMagick Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile Sabrent Rocket 4.0 1TB GCC 10 4 8 12 16 20 SE +/- 0.078, N = 3 5.799 16.469
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 GCC 10 Sabrent Rocket 4.0 1TB 1400 2800 4200 5600 7000 SE +/- 7.80, N = 3 6687.3 2407.0 -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math 1. (CC) gcc options: -pthread -lm
FFTW Build: Stock - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 GCC 10 Sabrent Rocket 4.0 1TB 2K 4K 6K 8K 10K SE +/- 11.02, N = 3 10512.0 3862.4 -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math 1. (CC) gcc options: -pthread -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 12.0 Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Only GCC 10 Sabrent Rocket 4.0 1TB 140K 280K 420K 560K 700K SE +/- 4887.19, N = 3 676349.71 260493.71 -O2 -lpq 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 12.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only GCC 10 Sabrent Rocket 4.0 1TB 140K 280K 420K 560K 700K SE +/- 622.09, N = 3 669039.84 264129.49 -O2 -lpq 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 10 Sabrent Rocket 4.0 1TB 30 60 90 120 150 SE +/- 0.13, N = 3 57.26 130.22 -O2 1. (CC) gcc options: -ldl -lz -lpthread
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 GCC 10 Sabrent Rocket 4.0 1TB 10M 20M 30M 40M 50M SE +/- 550382.53, N = 3 48055276.3 21645648.2
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping GCC 10 Sabrent Rocket 4.0 1TB 200 400 600 800 1000 SE +/- 1.20634, N = 3 938.47100 5.01883 -O3 -fopenmp -fwhole-program 1. (CXX) g++ options: -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
Zstd Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 GCC 10 Sabrent Rocket 4.0 1TB 300 600 900 1200 1500 SE +/- 0.03, N = 3 10.26 1595.10 1. (CC) gcc options: -pthread -lz -llzma
FFTW Build: Float + SSE - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 GCC 10 Sabrent Rocket 4.0 1TB 10K 20K 30K 40K 50K SE +/- 56.20, N = 3 45404.0 3708.4 -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math 1. (CC) gcc options: -pthread -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 10 Sabrent Rocket 4.0 1TB 3 6 9 12 15 SE +/- 0.002, N = 3 7.297 9.243 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2019.4 Water Benchmark GCC 10 Sabrent Rocket 4.0 1TB 0.5636 1.1272 1.6908 2.2544 2.818 SE +/- 0.002, N = 3 2.505 1.995 1. (CXX) g++ options: -mavx2 -mfma -std=c++11 -O3 -funroll-all-loops -pthread -lrt -lpthread -lm
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 10 Sabrent Rocket 4.0 1TB 20 40 60 80 100 SE +/- 0.25, N = 3 70.01 83.27 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 10 Sabrent Rocket 4.0 1TB 300K 600K 900K 1200K 1500K SE +/- 1472.68, N = 5 1346651 1189585 1. (CC) gcc options: -O3 -march=native
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC GCC 10 Sabrent Rocket 4.0 1TB 2 4 6 8 10 SE +/- 0.009, N = 5 7.719 8.016 -O2 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
MKL-DNN DNNL Harness: Convolution Batch conv_alexnet - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Convolution Batch conv_alexnet - Data Type: f32 Sabrent Rocket 4.0 1TB GCC 10 30 60 90 120 150 SE +/- 1.44, N = 3 120.15 123.99 -lm - MIN: 119.49 MIN: 121.92 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver Sabrent Rocket 4.0 1TB GCC 10 1000 2000 3000 4000 5000 SE +/- 55.99, N = 5 4820.05 4684.30 1. (CC) gcc options: -O3 -mavx2
SQLite Threads / Copies: 1 OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.30.1 Threads / Copies: 1 GCC 10 Sabrent Rocket 4.0 1TB 4 8 12 16 20 SE +/- 0.01, N = 3 14.18 14.60 -O2 1. (CC) gcc options: -lz -lm -ldl -lpthread
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance GCC 10 Sabrent Rocket 4.0 1TB 1500 3000 4500 6000 7500 SE +/- 21.70, N = 3 7180.6 7060.1 -O3 -lssl 1. (CC) gcc options: -pthread -m64 -lcrypto -ldl
Facebook RocksDB Test: Sequential Fill OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Sequential Fill Sabrent Rocket 4.0 1TB GCC 10 200K 400K 600K 800K 1000K SE +/- 3135.06, N = 3 1035308 1019862 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Read GCC 10 Sabrent Rocket 4.0 1TB 30M 60M 90M 120M 150M SE +/- 1800355.11, N = 3 145207827 143335505 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
Radiance Benchmark Test: SMP Parallel OpenBenchmarking.org Seconds, Fewer Is Better Radiance Benchmark 5.0 Test: SMP Parallel Sabrent Rocket 4.0 1TB GCC 10 40 80 120 160 200 169.59 171.30
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Read While Writing Sabrent Rocket 4.0 1TB GCC 10 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 20082.88, N = 3 4937048 4889956 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
MKL-DNN DNNL Harness: Deconvolution Batch deconv_1d - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Deconvolution Batch deconv_1d - Data Type: f32 Sabrent Rocket 4.0 1TB GCC 10 0.5229 1.0458 1.5687 2.0916 2.6145 SE +/- 0.00388, N = 3 2.30704 2.32419 -lm - MIN: 2.25 MIN: 2.26 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time Sabrent Rocket 4.0 1TB GCC 10 2M 4M 6M 8M 10M SE +/- 7954.66, N = 3 9301788 9234824 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
Facebook RocksDB Test: Random Fill OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Fill GCC 10 Sabrent Rocket 4.0 1TB 200K 400K 600K 800K 1000K SE +/- 16043.44, N = 3 938039 932120 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
Facebook RocksDB Test: Random Fill Sync OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Fill Sync GCC 10 Sabrent Rocket 4.0 1TB 5K 10K 15K 20K 25K SE +/- 19.92, N = 3 24588 24460 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve MT - Gridding GCC 10 Sabrent Rocket 4.0 1TB 400 800 1200 1600 2000 SE +/- 3.33, N = 3 1947.24 1937.58 1. (CXX) g++ options: -lpthread
Radiance Benchmark Test: Serial OpenBenchmarking.org Seconds, Fewer Is Better Radiance Benchmark 5.0 Test: Serial Sabrent Rocket 4.0 1TB GCC 10 120 240 360 480 600 554.96 555.94
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small GCC 10 Sabrent Rocket 4.0 1TB 1700 3400 5100 6800 8500 SE +/- 11.29, N = 3 7740.10 7737.78 1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve MT - Degridding Sabrent Rocket 4.0 1TB GCC 10 700 1400 2100 2800 3500 SE +/- 3.58, N = 3 3359.70 3359.12 1. (CXX) g++ options: -lpthread
MKL-DNN DNNL Harness: Recurrent Neural Network Training - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Recurrent Neural Network Training - Data Type: f32 Sabrent Rocket 4.0 1TB GCC 10 40 80 120 160 200 SE +/- 0.35, N = 3 194.23 194.25 -lm - MIN: 193.18 MIN: 192.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
MKL-DNN DNNL Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN DNNL 1.1 Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 GCC 10 Sabrent Rocket 4.0 1TB 12 24 36 48 60 SE +/- 0.14, N = 3 52.33 52.33 MIN: 51.48 -lm - MIN: 51.67 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve OpenMP - Degridding Sabrent Rocket 4.0 1TB GCC 10 900 1800 2700 3600 4500 SE +/- 0.00, N = 3 4096.25 4096.25 1. (CXX) g++ options: -lpthread
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve OpenMP - Gridding Sabrent Rocket 4.0 1TB GCC 10 1200 2400 3600 4800 6000 SE +/- 0.00, N = 3 5433.8 5433.8 1. (CXX) g++ options: -lpthread
BYTE Unix Benchmark Computational Test: Floating-Point Arithmetic OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Floating-Point Arithmetic Sabrent Rocket 4.0 1TB GCC 10 0.225 0.45 0.675 0.9 1.125 1 1
BYTE Unix Benchmark Computational Test: Register Arithmetic OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Register Arithmetic Sabrent Rocket 4.0 1TB GCC 10 0.225 0.45 0.675 0.9 1.125 1 1
BYTE Unix Benchmark Computational Test: Integer Arithmetic OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Integer Arithmetic Sabrent Rocket 4.0 1TB GCC 10 0.225 0.45 0.675 0.9 1.125 1 1
HPC Challenge Test / Class: Max Ping Pong Bandwidth OpenBenchmarking.org MB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Max Ping Pong Bandwidth GCC 10 5K 10K 15K 20K 25K SE +/- 313.29, N = 3 22977.00 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: Random Ring Bandwidth OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Bandwidth GCC 10 0.7665 1.533 2.2995 3.066 3.8325 SE +/- 0.01038, N = 3 3.40678 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: Random Ring Latency OpenBenchmarking.org usecs, Fewer Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Latency GCC 10 0.1032 0.2064 0.3096 0.4128 0.516 SE +/- 0.00067, N = 3 0.45863 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-Random Access OpenBenchmarking.org GUP/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Random Access GCC 10 0.0321 0.0642 0.0963 0.1284 0.1605 SE +/- 0.00039, N = 3 0.14278 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: EP-STREAM Triad OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: EP-STREAM Triad GCC 10 0.4044 0.8088 1.2132 1.6176 2.022 SE +/- 0.00127, N = 3 1.79750 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-Ptrans OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ptrans GCC 10 1.2324 2.4648 3.6972 4.9296 6.162 SE +/- 0.00581, N = 3 5.47737 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: EP-DGEMM OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: EP-DGEMM GCC 10 8 16 24 32 40 SE +/- 0.38, N = 3 32.93 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-Ffte OpenBenchmarking.org GFLOP/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ffte GCC 10 3 6 9 12 15 SE +/- 0.05, N = 3 10.49 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-Ffte OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ffte GCC 10 3 6 9 12 15 SE +/- 0.05, N = 3 10.49 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-HPL OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-HPL GCC 10 14 28 42 56 70 SE +/- 0.23, N = 3 63.63 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate GCC 10 Sabrent Rocket 4.0 1TB 2 4 6 8 10 SE +/- 0.158518, N = 12 8.567282 0.078489 1. (CC) gcc options: -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.5