999 AMD Ryzen 9 3900X 12-Core testing with a ASUS ROG CROSSHAIR VIII HERO (2103 BIOS) and Gigabyte NVIDIA GeForce GTX 980 Ti 6GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012080-FI-99922012470&grt&sro .
999 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 999 989 AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads) ASUS ROG CROSSHAIR VIII HERO (2103 BIOS) AMD Starship/Matisse 64GB 2 x Samsung SSD 970 EVO 500GB + 480GB KINGSTON SA400S3 + 750GB SAMSUNG HD753LJ + 1000GB Western Digital WD1001FALS-0 + 1000GB TOSHIBA MQ01ABD1 + 1000GB Seagate ST1000DM003-1ER1 Gigabyte NVIDIA GeForce GTX 980 Ti 6GB (1062/3505MHz) NVIDIA GM200 HD Audio Cinema HD Realtek RTL8125 2.5GbE + Intel I211 Ubuntu 20.04 5.4.0-56-generic (x86_64) Xfce 4.14 X Server 1.20.8 NVIDIA 450.80.02 4.6.0 OpenCL 1.2 CUDA 11.0.228 1.2.133 GCC 9.3.0 ext4 5120x1600 Gigabyte NVIDIA GeForce GTX 980 Ti 6GB (1088/3505MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - 999: NONE / errors=remount-ro,relatime,rw Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701021 OpenCL Details - 999: GPU Compute Cores: 2816 Java Details - 999: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.04) Python Details - 999: Python 3.7.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
999 blake2: blogbench: Read blogbench: Write cachebench: Read cachebench: Write cachebench: Read / Modify / Write cloverleaf: Lagrangian-Eulerian Hydrodynamics compilebench: Compile compilebench: Initial Create compilebench: Read Compiled Tree dacapobench: H2 dacapobench: Jython dacapobench: Tradesoap dacapobench: Tradebeans dolfyn: Computational Fluid Dynamics fftw: Stock - 1D FFT Size 4096 fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 1D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 4096 go-benchmark: http go-benchmark: json go-benchmark: build go-benchmark: garbage hpcg: java-scimark2: Composite java-scimark2: Monte Carlo java-scimark2: Fast Fourier Transform java-scimark2: Sparse Matrix Multiply java-scimark2: Dense LU Matrix Factorization java-scimark2: Jacobi Successive Over-Relaxation john-the-ripper: Blowfish john-the-ripper: MD5 lammps: Rhodopsin Protein lczero: BLAS lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Libdeflate 1 - Compression lzbench: Libdeflate 1 - Decompression mbw: Memory Copy - 128 MiB mbw: Memory Copy - 1024 MiB mbw: Memory Copy - 4096 MiB mbw: Memory Copy - 8192 MiB mbw: Memory Copy, Fixed Block Size - 128 MiB mbw: Memory Copy, Fixed Block Size - 1024 MiB mbw: Memory Copy, Fixed Block Size - 4096 MiB mbw: Memory Copy, Fixed Block Size - 8192 MiB namd: ATPase Simulation - 327,506 Atoms npb: BT.A npb: EP.C npb: FT.A npb: FT.B npb: LU.A npb: LU.C npb: SP.A parboil: OpenMP LBM parboil: OpenMP CUTCP parboil: OpenMP Stencil parboil: OpenMP MRI Gridding ramspeed: Add - Integer ramspeed: Copy - Integer ramspeed: Scale - Integer ramspeed: Triad - Integer ramspeed: Average - Integer ramspeed: Add - Floating Point ramspeed: Copy - Floating Point ramspeed: Scale - Floating Point ramspeed: Triad - Floating Point ramspeed: Average - Floating Point renaissance: Scala Dotty renaissance: Twitter Finagle renaissance: Apache Spark ALS renaissance: Apache Spark Bayes renaissance: Savina Reactors.IO renaissance: Apache Spark PageRank renaissance: In-Memory Database Shootout renaissance: Akka Unbalanced Cobwebbed Tree rodinia: OpenMP LavaMD rodinia: OpenMP Leukocyte rodinia: OpenMP CFD Solver rodinia: OpenMP Streamcluster sockperf: Throughput sockperf: Latency Ping Pong sockperf: Latency Under Load stream: Copy stream: Scale stream: Triad stream: Add t-test1: 1 t-test1: 2 hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA mrbayes: Primate Phylogeny Analysis tinymembench: Standard Memcpy tinymembench: Standard Memset 999 989 8.63 1294782 26327 3047.65 32129.044687429 62207.549768603 3.81 2230.54 698.33 3430.23 3340 3976 3548 4037 15.477 8786.4 6339.7 57032 17847 5590 2933748 20276414910 940103 3.84180 3185.72 1770.88 2168.25 2809.61 7141.32 2038.53 9.379 557 44 134 563 1542 538 642 271 1269 13732.199 13913.765 14020.904 14005.471 7847.206 8161.791 8108.473 8101.923 1.50714 35956.21 1136.75 7259.30 8126.68 66530.53 26223.79 19617.12 138.081716 2.198867 13.395642 28.978306 21949.59 19353.38 19554.65 21536.29 20658.76 22068.01 19259.60 19383.33 21788.59 20579.72 1372.913 3432.919 1867.310 371.026 12559.191 3475.177 3908.873 10971.890 156.590 84.988 14.639 27.705 562923 2.942 69.926 17947.0 17897.9 20279.4 20207.1 23.093 7.953 93.482 9.383 68.504 8179.0 12806.3 20167 1278500 OpenBenchmarking.org
BLAKE2 OpenBenchmarking.org Cycles Per Byte, Fewer Is Better BLAKE2 20170307 999 2 4 6 8 10 SE +/- 0.05, N = 3 8.63 1. (CC) gcc options: -O3 -march=native -lcrypto -lz
BlogBench Test: Read OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Read 999 300K 600K 900K 1200K 1500K SE +/- 13712.62, N = 3 1294782 1. (CC) gcc options: -O2 -pthread
BlogBench Test: Write OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Write 999 6K 12K 18K 24K 30K SE +/- 218.91, N = 3 26327 1. (CC) gcc options: -O2 -pthread
CacheBench Test: Read OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read 999 700 1400 2100 2800 3500 SE +/- 10.34, N = 3 3047.65 MIN: 3026.03 / MAX: 3059.59 1. (CC) gcc options: -lrt
CacheBench Test: Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write 999 7K 14K 21K 28K 35K SE +/- 61.84, N = 3 32129.04 MIN: 26451.47 / MAX: 33504.93 1. (CC) gcc options: -lrt
CacheBench Test: Read / Modify / Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write 999 13K 26K 39K 52K 65K SE +/- 41.66, N = 3 62207.55 MIN: 56160.16 / MAX: 66587.82 1. (CC) gcc options: -lrt
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 999 0.8573 1.7146 2.5719 3.4292 4.2865 SE +/- 0.01, N = 3 3.81 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Compile Bench Test: Compile OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Compile 999 500 1000 1500 2000 2500 SE +/- 20.05, N = 3 2230.54
Compile Bench Test: Initial Create OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Initial Create 999 150 300 450 600 750 SE +/- 4.24, N = 3 698.33
Compile Bench Test: Read Compiled Tree OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Read Compiled Tree 999 700 1400 2100 2800 3500 SE +/- 0.00, N = 3 3430.23
DaCapo Benchmark Java Test: H2 OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: H2 999 700 1400 2100 2800 3500 SE +/- 25.20, N = 4 3340
DaCapo Benchmark Java Test: Jython OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Jython 999 900 1800 2700 3600 4500 SE +/- 32.61, N = 4 3976
DaCapo Benchmark Java Test: Tradesoap OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradesoap 999 800 1600 2400 3200 4000 SE +/- 37.58, N = 4 3548
DaCapo Benchmark Java Test: Tradebeans OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans 999 900 1800 2700 3600 4500 SE +/- 14.72, N = 4 4037
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 999 4 8 12 16 20 SE +/- 0.14, N = 3 15.48
FFTW Build: Stock - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 999 2K 4K 6K 8K 10K SE +/- 14.00, N = 3 8786.4 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 999 1400 2800 4200 5600 7000 SE +/- 44.47, N = 3 6339.7 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 999 12K 24K 36K 48K 60K SE +/- 837.60, N = 3 57032 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 999 4K 8K 12K 16K 20K SE +/- 6.33, N = 3 17847 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
Go Benchmarks Test: http OpenBenchmarking.org Nanoseconds/Operation, Fewer Is Better Go Benchmarks Test: http 999 1200 2400 3600 4800 6000 SE +/- 74.88, N = 3 5590
Go Benchmarks Test: json OpenBenchmarking.org Nanoseconds/Operation, Fewer Is Better Go Benchmarks Test: json 999 600K 1200K 1800K 2400K 3000K SE +/- 728.80, N = 3 2933748
Go Benchmarks Test: build OpenBenchmarking.org Nanoseconds/Operation, Fewer Is Better Go Benchmarks Test: build 999 4000M 8000M 12000M 16000M 20000M SE +/- 223066887.28, N = 3 20276414910
Go Benchmarks Test: garbage OpenBenchmarking.org Nanoseconds/Operation, Fewer Is Better Go Benchmarks Test: garbage 999 200K 400K 600K 800K 1000K SE +/- 4092.17, N = 3 940103
High Performance Conjugate Gradient OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 999 0.8644 1.7288 2.5932 3.4576 4.322 SE +/- 0.05593, N = 4 3.84180 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
Java SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.0 Computational Test: Composite 999 700 1400 2100 2800 3500 SE +/- 29.87, N = 4 3185.72
Java SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.0 Computational Test: Monte Carlo 999 400 800 1200 1600 2000 SE +/- 17.06, N = 4 1770.88
Java SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.0 Computational Test: Fast Fourier Transform 999 500 1000 1500 2000 2500 SE +/- 18.64, N = 4 2168.25
Java SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.0 Computational Test: Sparse Matrix Multiply 999 600 1200 1800 2400 3000 SE +/- 25.24, N = 4 2809.61
Java SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.0 Computational Test: Dense LU Matrix Factorization 999 1500 3000 4500 6000 7500 SE +/- 64.50, N = 4 7141.32
Java SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better Java SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation 999 400 800 1200 1600 2000 SE +/- 24.81, N = 4 2038.53
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish 989 4K 8K 12K 16K 20K SE +/- 252.07, N = 3 20167 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 989 300K 600K 900K 1200K 1500K SE +/- 15792.93, N = 4 1278500 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
LAMMPS Molecular Dynamics Simulator Test: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Test: Rhodopsin Protein 999 3 6 9 12 15 SE +/- 0.048, N = 3 9.379 1. (CXX) g++ options: -O3 -pthread -lm
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS 999 120 240 360 480 600 SE +/- 5.95, N = 7 557 1. (CXX) g++ options: -flto -pthread
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 999 10 20 30 40 50 SE +/- 0.67, N = 3 44 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 999 30 60 90 120 150 SE +/- 1.67, N = 3 134 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 999 120 240 360 480 600 SE +/- 8.67, N = 3 563 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 999 300 600 900 1200 1500 SE +/- 21.84, N = 3 1542 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 999 120 240 360 480 600 SE +/- 2.91, N = 3 538 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 999 140 280 420 560 700 SE +/- 1.20, N = 3 642 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 999 60 120 180 240 300 271 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Decompression 999 300 600 900 1200 1500 1269 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
MBW Test: Memory Copy - Array Size: 128 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 128 MiB 999 3K 6K 9K 12K 15K SE +/- 7.26, N = 3 13732.20 1. (CC) gcc options: -O3 -march=native
MBW Test: Memory Copy - Array Size: 1024 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 1024 MiB 999 3K 6K 9K 12K 15K SE +/- 3.53, N = 3 13913.77 1. (CC) gcc options: -O3 -march=native
MBW Test: Memory Copy - Array Size: 4096 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 4096 MiB 999 3K 6K 9K 12K 15K SE +/- 24.88, N = 3 14020.90 1. (CC) gcc options: -O3 -march=native
MBW Test: Memory Copy - Array Size: 8192 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 8192 MiB 999 3K 6K 9K 12K 15K SE +/- 6.08, N = 3 14005.47 1. (CC) gcc options: -O3 -march=native
MBW Test: Memory Copy, Fixed Block Size - Array Size: 128 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 128 MiB 999 2K 4K 6K 8K 10K SE +/- 9.85, N = 3 7847.21 1. (CC) gcc options: -O3 -march=native
MBW Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB 999 2K 4K 6K 8K 10K SE +/- 4.71, N = 3 8161.79 1. (CC) gcc options: -O3 -march=native
MBW Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB 999 2K 4K 6K 8K 10K SE +/- 0.74, N = 3 8108.47 1. (CC) gcc options: -O3 -march=native
MBW Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB 999 2K 4K 6K 8K 10K SE +/- 23.07, N = 3 8101.92 1. (CC) gcc options: -O3 -march=native
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms 999 0.3391 0.6782 1.0173 1.3564 1.6955 SE +/- 0.02396, N = 3 1.50714
NAS Parallel Benchmarks Test / Class: BT.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.A 999 8K 16K 24K 32K 40K SE +/- 593.09, N = 3 35956.21 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 999 200 400 600 800 1000 SE +/- 1.07, N = 3 1136.75 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: FT.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.A 999 1600 3200 4800 6400 8000 SE +/- 20.56, N = 3 7259.30 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: FT.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.B 999 2K 4K 6K 8K 10K SE +/- 2.62, N = 3 8126.68 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.A 999 14K 28K 42K 56K 70K SE +/- 764.70, N = 15 66530.53 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 999 6K 12K 18K 24K 30K SE +/- 12.38, N = 3 26223.79 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: SP.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.A 999 4K 8K 12K 16K 20K SE +/- 190.91, N = 15 19617.12 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Parboil Test: OpenMP LBM OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM 999 30 60 90 120 150 SE +/- 0.03, N = 3 138.08 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP CUTCP OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP 999 0.4947 0.9894 1.4841 1.9788 2.4735 SE +/- 0.001259, N = 3 2.198867 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil 999 3 6 9 12 15 SE +/- 0.03, N = 3 13.40 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP MRI Gridding OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding 999 7 14 21 28 35 SE +/- 0.12, N = 3 28.98 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
RAMspeed SMP Type: Add - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Integer 999 5K 10K 15K 20K 25K SE +/- 113.34, N = 3 21949.59 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Copy - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer 999 4K 8K 12K 16K 20K SE +/- 155.24, N = 3 19353.38 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Scale - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer 999 4K 8K 12K 16K 20K SE +/- 5.52, N = 3 19554.65 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Triad - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer 999 5K 10K 15K 20K 25K SE +/- 300.16, N = 3 21536.29 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Average - Benchmark: Integer OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer 999 4K 8K 12K 16K 20K SE +/- 120.46, N = 3 20658.76 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Add - Benchmark: Floating Point OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Floating Point 999 5K 10K 15K 20K 25K SE +/- 2.05, N = 3 22068.01 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Copy - Benchmark: Floating Point OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Floating Point 999 4K 8K 12K 16K 20K SE +/- 147.37, N = 3 19259.60 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Scale - Benchmark: Floating Point OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Floating Point 999 4K 8K 12K 16K 20K SE +/- 143.63, N = 3 19383.33 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Triad - Benchmark: Floating Point OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Floating Point 999 5K 10K 15K 20K 25K SE +/- 254.07, N = 3 21788.59 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP Type: Average - Benchmark: Floating Point OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Floating Point 999 4K 8K 12K 16K 20K SE +/- 222.58, N = 3 20579.72 1. (CC) gcc options: -O3 -march=native
Renaissance Test: Scala Dotty OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.10.0 Test: Scala Dotty 999 300 600 900 1200 1500 SE +/- 11.32, N = 5 1372.91
Renaissance Test: Twitter Finagle OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.10.0 Test: Twitter Finagle 999 700 1400 2100 2800 3500 SE +/- 30.24, N = 11 3432.92
Renaissance Test: Apache Spark ALS OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.10.0 Test: Apache Spark ALS 999 400 800 1200 1600 2000 SE +/- 16.23, N = 5 1867.31
Renaissance Test: Apache Spark Bayes OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.10.0 Test: Apache Spark Bayes 999 80 160 240 320 400 SE +/- 53.17, N = 25 371.03
Renaissance Test: Savina Reactors.IO OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.10.0 Test: Savina Reactors.IO 999 3K 6K 9K 12K 15K SE +/- 315.33, N = 15 12559.19
Renaissance Test: Apache Spark PageRank OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.10.0 Test: Apache Spark PageRank 999 700 1400 2100 2800 3500 SE +/- 37.76, N = 25 3475.18
Renaissance Test: In-Memory Database Shootout OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.10.0 Test: In-Memory Database Shootout 999 800 1600 2400 3200 4000 SE +/- 47.46, N = 25 3908.87
Renaissance Test: Akka Unbalanced Cobwebbed Tree OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.10.0 Test: Akka Unbalanced Cobwebbed Tree 999 2K 4K 6K 8K 10K SE +/- 81.64, N = 5 10971.89
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD 999 30 60 90 120 150 SE +/- 0.21, N = 3 156.59 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte 999 20 40 60 80 100 SE +/- 0.51, N = 3 84.99 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver 999 4 8 12 16 20 SE +/- 0.05, N = 3 14.64 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster 999 7 14 21 28 35 SE +/- 0.09, N = 3 27.71 1. (CXX) g++ options: -O2 -lOpenCL
Sockperf Test: Throughput OpenBenchmarking.org Messages Per Second, More Is Better Sockperf 3.4 Test: Throughput 999 120K 240K 360K 480K 600K SE +/- 3951.74, N = 5 562923 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf Test: Latency Ping Pong OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Ping Pong 999 0.662 1.324 1.986 2.648 3.31 SE +/- 0.025, N = 25 2.942 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf Test: Latency Under Load OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Under Load 999 16 32 48 64 80 SE +/- 10.99, N = 25 69.93 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Stream Type: Copy OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Copy 999 4K 8K 12K 16K 20K SE +/- 3.22, N = 5 17947.0 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Scale OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale 999 4K 8K 12K 16K 20K SE +/- 3.76, N = 5 17897.9 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Triad OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad 999 4K 8K 12K 16K 20K SE +/- 5.04, N = 5 20279.4 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Add OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add 999 4K 8K 12K 16K 20K SE +/- 1.51, N = 5 20207.1 1. (CC) gcc options: -O3 -march=native -fopenmp
t-test1 Threads: 1 OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 1 999 6 12 18 24 30 SE +/- 0.05, N = 3 23.09 1. (CC) gcc options: -pthread
t-test1 Threads: 2 OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 2 999 2 4 6 8 10 SE +/- 0.037, N = 3 7.953 1. (CC) gcc options: -pthread
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 999 20 40 60 80 100 SE +/- 0.12, N = 3 93.48 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 999 3 6 9 12 15 SE +/- 0.056, N = 3 9.383 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis 999 15 30 45 60 75 SE +/- 0.11, N = 3 68.50 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm -lreadline
Tinymembench Standard Memcpy OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memcpy 999 2K 4K 6K 8K 10K SE +/- 7.54, N = 3 8179.0 1. (CC) gcc options: -O2 -lm
Tinymembench Standard Memset OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memset 999 3K 6K 9K 12K 15K SE +/- 18.63, N = 3 12806.3 1. (CC) gcc options: -O2 -lm
Phoronix Test Suite v10.8.5