MPI And More Threadripper AMD Ryzen Threadripper 3970X 32-Core testing with a ASUS ROG ZENITH II EXTREME (0601 BIOS) and MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 19.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1912057-HU-MPIANDMOR01 .
MPI And More Threadripper Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution AMD Ryzen Threadripper 3970X 32-Core AMD Ryzen Threadripper 3970X 32-Core @ 3.70GHz (32 Cores / 64 Threads) ASUS ROG ZENITH II EXTREME (0601 BIOS) AMD Starship/Matisse 64512MB 3841GB Micron_9300_MTFDHAL3T8TDP MSI AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1366/2000MHz) AMD Ellesmere HDMI Audio ASUS VP28U Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Device 2723 Ubuntu 19.10 5.4.0-nvme-hwmon (x86_64) GNOME Shell 3.34.1 X Server 1.20.5 modesetting 1.20.5 4.5 Mesa 19.2.1 (LLVM 9.0.0) GCC 9.2.1 20191008 ext4 3840x2160 OpenBenchmarking.org - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025 - Python 2.7.17rc1 + Python 3.7.5 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected
MPI And More Threadripper npb: BT.C npb: CG.C npb: EP.C npb: EP.D npb: FT.C npb: IS.D npb: LU.C npb: MG.C npb: SP.B hpcc: G-HPL hpcc: G-Ffte hpcc: G-Ffte hpcc: EP-DGEMM hpcc: G-Ptrans hpcc: EP-STREAM Triad hpcc: G-Rand Access hpcc: Rand Ring Latency hpcc: Rand Ring Bandwidth hpcc: Max Ping Pong Bandwidth parboil: OpenMP LBM parboil: OpenMP CUTCP parboil: OpenMP Stencil parboil: OpenMP MRI Gridding blender: Barbershop - CPU-Only AMD Ryzen Threadripper 3970X 32-Core 74081.46 10934.02 2927.47 2922.09 35052.16 1482.46 67266.98 30806.33 34903.89 80.42790 16.14707 16.14707 28.94217 7.51040 1.93275 0.17374 0.45390 2.25419 22146.625 42.449944 0.946535 4.804557 58.367946 187.60 OpenBenchmarking.org
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C AMD Ryzen Threadripper 3970X 32-Core 16K 32K 48K 64K 80K SE +/- 1149.54, N = 3 74081.46 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C AMD Ryzen Threadripper 3970X 32-Core 2K 4K 6K 8K 10K SE +/- 107.97, N = 3 10934.02 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C AMD Ryzen Threadripper 3970X 32-Core 600 1200 1800 2400 3000 SE +/- 11.81, N = 3 2927.47 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D AMD Ryzen Threadripper 3970X 32-Core 600 1200 1800 2400 3000 SE +/- 5.36, N = 3 2922.09 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C AMD Ryzen Threadripper 3970X 32-Core 8K 16K 24K 32K 40K SE +/- 52.28, N = 3 35052.16 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D AMD Ryzen Threadripper 3970X 32-Core 300 600 900 1200 1500 SE +/- 5.47, N = 3 1482.46 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C AMD Ryzen Threadripper 3970X 32-Core 14K 28K 42K 56K 70K SE +/- 62.67, N = 3 67266.98 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C AMD Ryzen Threadripper 3970X 32-Core 7K 14K 21K 28K 35K SE +/- 97.21, N = 3 30806.33 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B AMD Ryzen Threadripper 3970X 32-Core 7K 14K 21K 28K 35K SE +/- 222.04, N = 3 34903.89 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
HPC Challenge Test / Class: G-HPL OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-HPL AMD Ryzen Threadripper 3970X 32-Core 20 40 60 80 100 SE +/- 0.18, N = 3 80.43 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-Ffte OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ffte AMD Ryzen Threadripper 3970X 32-Core 4 8 12 16 20 SE +/- 0.08, N = 3 16.15 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-Ffte OpenBenchmarking.org GFLOP/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ffte AMD Ryzen Threadripper 3970X 32-Core 4 8 12 16 20 SE +/- 0.08, N = 3 16.15 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: EP-DGEMM OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: EP-DGEMM AMD Ryzen Threadripper 3970X 32-Core 7 14 21 28 35 SE +/- 0.10, N = 3 28.94 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-Ptrans OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ptrans AMD Ryzen Threadripper 3970X 32-Core 2 4 6 8 10 SE +/- 0.02457, N = 3 7.51040 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: EP-STREAM Triad OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: EP-STREAM Triad AMD Ryzen Threadripper 3970X 32-Core 0.4349 0.8698 1.3047 1.7396 2.1745 SE +/- 0.00007, N = 3 1.93275 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: G-Random Access OpenBenchmarking.org GUP/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Random Access AMD Ryzen Threadripper 3970X 32-Core 0.0391 0.0782 0.1173 0.1564 0.1955 SE +/- 0.00329, N = 3 0.17374 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: Random Ring Latency OpenBenchmarking.org usecs, Fewer Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Latency AMD Ryzen Threadripper 3970X 32-Core 0.1021 0.2042 0.3063 0.4084 0.5105 SE +/- 0.00063, N = 3 0.45390 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: Random Ring Bandwidth OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Bandwidth AMD Ryzen Threadripper 3970X 32-Core 0.5072 1.0144 1.5216 2.0288 2.536 SE +/- 0.02349, N = 3 2.25419 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
HPC Challenge Test / Class: Max Ping Pong Bandwidth OpenBenchmarking.org MB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Max Ping Pong Bandwidth AMD Ryzen Threadripper 3970X 32-Core 5K 10K 15K 20K 25K SE +/- 355.34, N = 3 22146.63 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 3.1.3
Parboil Test: OpenMP LBM OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM AMD Ryzen Threadripper 3970X 32-Core 10 20 30 40 50 SE +/- 0.02, N = 3 42.45 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP CUTCP OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP AMD Ryzen Threadripper 3970X 32-Core 0.213 0.426 0.639 0.852 1.065 SE +/- 0.001869, N = 3 0.946535 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil AMD Ryzen Threadripper 3970X 32-Core 1.081 2.162 3.243 4.324 5.405 SE +/- 0.005658, N = 3 4.804557 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP MRI Gridding OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding AMD Ryzen Threadripper 3970X 32-Core 13 26 39 52 65 SE +/- 0.07, N = 3 58.37 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.81 Blend File: Barbershop - Compute: CPU-Only AMD Ryzen Threadripper 3970X 32-Core 40 80 120 160 200 SE +/- 0.49, N = 3 187.60
Phoronix Test Suite v10.8.4