2 x AMD EPYC 7702 64-Core testing with a Supermicro H12DST-B and ASPEED ASPEED Family on Ubuntu 18.04 via the Phoronix Test Suite.
AMD EPYC 7702 64-Core Processor: 2 x AMD EPYC 7702 64-Core @ 2.00GHz (128 Cores), Motherboard: Supermicro H12DST-B, Chipset: AMD Device 1480, Memory: 258048MB, Disk: 220GB, Graphics: ASPEED ASPEED Family, Audio: AMD Device 1487, Network: Intel 82575GB Gigabit Connection
OS: Ubuntu 18.04, Kernel: 4.15.0-62-generic (x86_64), File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq ondemand
2019-09-23-2130 OpenBenchmarking.org Phoronix Test Suite 2 x AMD EPYC 7702 64-Core @ 2.00GHz (128 Cores) Supermicro H12DST-B AMD Device 1480 258048MB 220GB ASPEED ASPEED Family AMD Device 1487 Intel 82575GB Gigabit Connection Ubuntu 18.04 4.15.0-62-generic (x86_64) ext4 1920x1080 Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel File-System Screen Resolution 2019-09-23-2130 Benchmarks System Logs - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand
2019-09-23-2130 ctx-clock: Context Switch Time stress-ng: System V Message Passing stress-ng: Socket Activity stress-ng: Memory Copying stress-ng: Semaphores stress-ng: CPU Stress stress-ng: Crypto openssl: RSA 4096-bit Performance radiance: SMP Parallel radiance: Serial build-gcc: Time To Compile asmfish: 1024 Hash Memory, 26 Depth stockfish: Total Time compress-7zip: Compress Speed Test crafty: Elapsed Time scimark2: Composite scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Monte Carlo scimark2: Composite cachebench: Write cachebench: Read cachebench: Read / Modify / Write cachebench: Read / Modify / Write cachebench: Write Cache cachebench: Write cachebench: Read cachebench: Read Cache namd: ATPase Simulation - 327,506 Atoms rodinia: OpenMP Streamcluster rodinia: OpenMP CFD Solver rodinia: OpenMP LavaMD mbw: Memory Copy, Fixed Block Size - 8192 MiB mbw: Memory Copy, Fixed Block Size - 4096 MiB mbw: Memory Copy, Fixed Block Size - 1024 MiB mbw: Memory Copy, Fixed Block Size - 128 MiB mbw: Memory Copy - 8192 MiB mbw: Memory Copy - 4096 MiB mbw: Memory Copy - 1024 MiB mbw: Memory Copy - 128 MiB tinymembench: Phoronix Test Suite v5.2.1 stream: Triad ramspeed: Average - Floating Point ramspeed: Triad - Floating Point ramspeed: Scale - Floating Point ramspeed: Copy - Floating Point ramspeed: Add - Floating Point ramspeed: Average - Integer ramspeed: Triad - Integer ramspeed: Scale - Integer ramspeed: Copy - Integer ramspeed: Add - Integer ramspeed: Floating-Point Add ramspeed: Integer Scale ramspeed: Integer Copy ramspeed: Integer Add stress-ng: Context Switching stress-ng: Forking scimark2: Fast Fourier Transform scimark2: Computational Fast Fourier Transform stream: Add stream: Scale stream: Copy stream: Add stream: Scale stream: Copy AMD EPYC 7702 64-Core 120 8071402.95 35581.73 3540.32 1275318.88 26861.44 21785.81 22440.03 236.71 762.34 708.28 168156760 154022695 269642 6633915 533.54 989.20 741.92 613.24 118.66 533.54 22085.06 2321.19 24032.79 24026.50 22074.74 22072.45 2321.05 2322.06 0.44685 96.27 196.60 11.27 8392.09 8587.64 8541.36 8231.04 15283.37 15035.18 14974.32 14794.09 15138.37 191645.74 40721.45 44136.68 37337.83 37806.30 44122.88 39435.07 41442.20 35546.40 38553.01 43776.38 44443.26 35819.60 37935.21 43655.51 21627262.55 41843.83 204.68 204.68 183665.28 179630.73 269405.18 183665.28 179630.73 279445.38 OpenBenchmarking.org
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Socket Activity AMD EPYC 7702 64-Core 8K 16K 24K 32K 40K SE +/- 606.21, N = 3 35581.73 1. (CC) gcc options: -O2 -std=gnu99 -lm -lz -lcrypt -lrt -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Memory Copying AMD EPYC 7702 64-Core 800 1600 2400 3200 4000 SE +/- 51.42, N = 3 3540.32 1. (CC) gcc options: -O2 -std=gnu99 -lm -lz -lcrypt -lrt -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Semaphores AMD EPYC 7702 64-Core 300K 600K 900K 1200K 1500K SE +/- 19345.97, N = 6 1275318.88 1. (CC) gcc options: -O2 -std=gnu99 -lm -lz -lcrypt -lrt -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: CPU Stress AMD EPYC 7702 64-Core 6K 12K 18K 24K 30K SE +/- 43.17, N = 3 26861.44 1. (CC) gcc options: -O2 -std=gnu99 -lm -lz -lcrypt -lrt -lpthread -lc
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Crypto AMD EPYC 7702 64-Core 5K 10K 15K 20K 25K SE +/- 42.17, N = 3 21785.81 1. (CC) gcc options: -O2 -std=gnu99 -lm -lz -lcrypt -lrt -lpthread -lc
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance AMD EPYC 7702 64-Core 5K 10K 15K 20K 25K SE +/- 128.73, N = 3 22440.03 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Stockfish This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time AMD EPYC 7702 64-Core 30M 60M 90M 120M 150M SE +/- 161855.54, N = 3 154022695 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Composite AMD EPYC 7702 64-Core 120 240 360 480 600 SE +/- 4.25, N = 3 533.54 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation AMD EPYC 7702 64-Core 200 400 600 800 1000 SE +/- 0.04, N = 3 989.20 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization AMD EPYC 7702 64-Core 160 320 480 640 800 SE +/- 14.92, N = 3 741.92 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply AMD EPYC 7702 64-Core 130 260 390 520 650 SE +/- 1.55, N = 3 613.24 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo AMD EPYC 7702 64-Core 30 60 90 120 150 SE +/- 0.09, N = 3 118.66 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite AMD EPYC 7702 64-Core 120 240 360 480 600 SE +/- 4.25, N = 3 533.54 1. (CC) gcc options: -lm
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write AMD EPYC 7702 64-Core 5K 10K 15K 20K 25K SE +/- 0.86, N = 3 24032.79 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Read / Modify / Write AMD EPYC 7702 64-Core 5K 10K 15K 20K 25K SE +/- 2.03, N = 3 24026.50 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read AMD EPYC 7702 64-Core 500 1000 1500 2000 2500 SE +/- 0.27, N = 3 2321.05 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Read Cache AMD EPYC 7702 64-Core 500 1000 1500 2000 2500 SE +/- 0.51, N = 3 2322.06 1. (CC) gcc options: -lrt
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.13b1 ATPase Simulation - 327,506 Atoms AMD EPYC 7702 64-Core 0.1005 0.201 0.3015 0.402 0.5025 SE +/- 0.00845, N = 6 0.44685
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster AMD EPYC 7702 64-Core 20 40 60 80 100 SE +/- 0.64, N = 3 96.27 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver AMD EPYC 7702 64-Core 40 80 120 160 200 SE +/- 4.36, N = 6 196.60 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP LavaMD AMD EPYC 7702 64-Core 3 6 9 12 15 SE +/- 0.21, N = 3 11.27 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB AMD EPYC 7702 64-Core 2K 4K 6K 8K 10K SE +/- 144.32, N = 3 8587.64 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB AMD EPYC 7702 64-Core 2K 4K 6K 8K 10K SE +/- 91.62, N = 3 8541.36 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 128 MiB AMD EPYC 7702 64-Core 2K 4K 6K 8K 10K SE +/- 108.44, N = 3 8231.04 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 8192 MiB AMD EPYC 7702 64-Core 3K 6K 9K 12K 15K SE +/- 70.22, N = 3 15283.37 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 4096 MiB AMD EPYC 7702 64-Core 3K 6K 9K 12K 15K SE +/- 192.83, N = 3 15035.18 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 1024 MiB AMD EPYC 7702 64-Core 3K 6K 9K 12K 15K SE +/- 191.67, N = 3 14974.32 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 128 MiB AMD EPYC 7702 64-Core 3K 6K 9K 12K 15K SE +/- 199.81, N = 3 14794.09 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.07.26 Test: Forking AMD EPYC 7702 64-Core 9K 18K 27K 36K 45K SE +/- 1471.74, N = 6 41843.83 1. (CC) gcc options: -O2 -std=gnu99 -lm -lz -lcrypt -lrt -lpthread -lc
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform AMD EPYC 7702 64-Core 40 80 120 160 200 SE +/- 10.18, N = 3 204.68 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Fast Fourier Transform AMD EPYC 7702 64-Core 40 80 120 160 200 SE +/- 10.18, N = 3 204.68 1. (CC) gcc options: -lm
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Copy AMD EPYC 7702 64-Core 60K 120K 180K 240K 300K SE +/- 11083.75, N = 10 279445.38 1. (CC) gcc options: -O3 -march=native -fopenmp
AMD EPYC 7702 64-Core Processor: 2 x AMD EPYC 7702 64-Core @ 2.00GHz (128 Cores), Motherboard: Supermicro H12DST-B, Chipset: AMD Device 1480, Memory: 258048MB, Disk: 220GB, Graphics: ASPEED ASPEED Family, Audio: AMD Device 1487, Network: Intel 82575GB Gigabit Connection
OS: Ubuntu 18.04, Kernel: 4.15.0-62-generic (x86_64), File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq ondemand
Testing initiated at 23 September 2019 21:30 by user martinslabber.