AMD EPYC vs. Intel Xeon vs. Amazon EC2 Cloud

Some initial AMD EPYC 7601 tests on Ubuntu 17.04 with Linux 4.13, run for a future article on Phoronix.com. Benchmarks by Michael Larabel.
HTML result view exported from: https://openbenchmarking.org/result/1709189-TY-EPYCCLOUD78&grr&sor .
System Details

Only the values the export gives for each system are listed.

AMD EPYC 7601
  Processor: AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores)
  Motherboard: TYAN B8026T70AE24HR
  Chipset: AMD Device 1450
  Memory: 129024MB
  Disk: 234GB
  Graphics: ASPEED ASPEED Family
  Monitor: Acer P243W
  Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe
  OS: Ubuntu 17.04
  Kernel: 4.13.0-041300-generic (x86_64)
  Display Driver: modesetting 1.19.3
  Compiler: GCC 6.3.0 20170406
  File-System: ext4
  Screen Resolution: 1920x1200
  Desktop: Unity 7.5.0

AMD EPYC 7601 (NUMA Interleave All)
  (no separate values given)

2 x Intel Xeon Gold 6138
  Processor: 2 x Intel Xeon Gold 6138 @ 3.70GHz (80 Cores)
  Motherboard: TYAN S7106
  Chipset: Intel Device 2020
  Memory: 96256MB
  Disk: 256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150
  Network: Intel I210 Gigabit Connection

m4.10xlarge
  Processor: 2 x Intel Xeon E5-2676 v3 @ 3.00GHz (40 Cores)
  Motherboard: Xen HVM domU
  Chipset: Intel 440FX- 82441FX PMC
  Memory: 161792MB
  Disk: 8GB
  Graphics: Cirrus Logic GD 5446
  Network: Intel 82599 Virtual Function
  OS: Ubuntu 16.04
  Kernel: 4.4.0-1022-aws (x86_64)
  Compiler: GCC 5.4.0 20160609
  System Layer: Xen HVM domU 4.2.amazon

m4.16xlarge
  Processor: 2 x Intel Xeon E5-2686 v4 @ 3.00GHz (64 Cores)
  Memory: 258048MB
  Network: Device 1d0f:ec20

c4.4xlarge
  Processor: Intel Xeon E5-2666 v3 @ 2.90GHz (16 Cores)
  Memory: 30720MB
  Network: Intel 82599 Virtual Function

c4.8xlarge
  Processor: 2 x Intel Xeon E5-2666 v3 @ 3.50GHz (36 Cores)
  Memory: 60416MB

c3.8xlarge
  Processor: 2 x Intel Xeon E5-2680 v2 @ 2.79GHz (32 Cores)

r4.16xlarge
  Processor: 2 x Intel Xeon E5-2686 v4 @ 3.00GHz (64 Cores)
  Memory: 492544MB
  Network: Device 1d0f:ec20

Compiler Details

AMD EPYC 7601, AMD EPYC 7601 (NUMA Interleave All), 2 x Intel Xeon Gold 6138 (identical configure options):
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v

m4.10xlarge, m4.16xlarge, c4.4xlarge, c4.8xlarge, c3.8xlarge, r4.16xlarge (identical configure options):
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
Processor Details

AMD EPYC 7601: Scaling Governor: acpi-cpufreq ondemand
AMD EPYC 7601 (NUMA Interleave All): Scaling Governor: acpi-cpufreq ondemand
2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate powersave
m4.10xlarge: Scaling Governor: intel_pstate powersave
m4.16xlarge: Scaling Governor: intel_pstate powersave
c4.8xlarge: Scaling Governor: intel_pstate powersave
r4.16xlarge: Scaling Governor: intel_pstate powersave
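
The driver and governor names above are the kind of values Linux exposes through the cpufreq sysfs files `scaling_driver` and `scaling_governor`. As a minimal sketch (not necessarily how the Phoronix Test Suite gathers them), the following reads both for one CPU. The helper name `read_cpufreq` and its `root` parameter are hypothetical, and the files may simply be absent, for example inside some virtual machines.

```python
from pathlib import Path
from typing import Optional, Tuple


def read_cpufreq(cpu: int = 0,
                 root: str = "/sys/devices/system/cpu") -> Tuple[Optional[str], Optional[str]]:
    """Return (scaling_driver, scaling_governor) for one CPU, or None where unavailable."""
    base = Path(root) / f"cpu{cpu}" / "cpufreq"

    def read(name: str) -> Optional[str]:
        try:
            return (base / name).read_text().strip()
        except OSError:
            # Missing file or no permission: cpufreq is not exposed here.
            return None

    return read("scaling_driver"), read("scaling_governor")


if __name__ == "__main__":
    driver, governor = read_cpufreq()
    print(f"Scaling Governor: {driver} {governor}")
```

The `root` parameter exists only so the function can be pointed at a test directory instead of the live sysfs tree.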
Result Summary

Column key: EPYC 7601 = AMD EPYC 7601; EPYC NUMA = AMD EPYC 7601 (NUMA Interleave All); 2x Gold 6138 = 2 x Intel Xeon Gold 6138. Units are those stated in the individual test results. A "-" marks a result absent from the export.

Test                                      Unit           EPYC 7601     EPYC NUMA  2x Gold 6138   m4.10xlarge   m4.16xlarge    c4.4xlarge    c4.8xlarge    c3.8xlarge   r4.16xlarge
openssl: RSA 4096-bit Performance         signs/s          3294.53       3306.37       4826.70       2229.10       3835.70       1070.07       2382.30       1836.53       3861.60
primesieve: 1e12 Prime Number Generation  s                  14.08         14.06         11.81         22.18         14.39         45.96         20.75         24.76         14.63
c-ray: Total Time                         s                   2.84          2.85          2.84          6.45          4.21         13.22          6.08          8.54          4.16
build-llvm: Time To Compile               s                 175.11        192.26        134.64        213.94        147.24        435.36        209.76        290.03        147.57
build-linux-kernel: Time To Compile       s                  37.40         39.28         30.54         41.84         30.62         73.54         39.89         51.38         30.80
x264: H.264 Video Encoding                fps               292.63        288.28        310.79        264.84        304.36        257.83        281.33        303.73        317.57
john-the-ripper: Blowfish                 Real C/S           29553         34335         30373         23520         39039         11382         25152         19532         38527
rodinia: OpenMP Streamcluster             s                  23.15         14.66         22.52         23.66         20.21         25.67         23.45         15.78         16.23
rodinia: OpenMP LavaMD                    s                  30.96         30.14         31.44         56.47         34.19        115.91         52.77        124.85         34.86
parboil: OpenMP Stencil                   s                  13.82          7.75          7.75         11.47          7.63         12.86         11.86         11.20          7.66
parboil: OpenMP LBM                       s                  50.95         38.37         50.04         74.43         47.45        114.31         65.38         75.51         51.78
npb: LU.C                                 Mop/s           46983.34      50046.90      50072.25      36509.86      43175.90      18060.63             -      33661.53      45966.13
npb: LU.A                                 Mop/s           63042.80      62479.08      53974.49      37949.85      66710.16      20546.79             -      34478.44      67748.67
npb: EP.C                                 Mop/s            1610.17       1607.68       1815.89        629.38       1036.50        302.17        672.88        565.48       1045.49
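
Because the summary mixes "more is better" and "fewer is better" metrics, comparing two systems needs a direction-aware ratio. The sketch below shows one way to do that with a few figures copied from the summary. The function name `speedup` and its `higher_is_better` flag are hypothetical, and the direction of each metric is taken from the per-test results.

```python
def speedup(candidate: float, baseline: float, higher_is_better: bool) -> float:
    """Ratio > 1.0 means the candidate outperformed the baseline."""
    if candidate <= 0 or baseline <= 0:
        raise ValueError("results must be positive")
    return candidate / baseline if higher_is_better else baseline / candidate


# (test, higher_is_better, AMD EPYC 7601, 2 x Intel Xeon Gold 6138)
ROWS = [
    ("openssl: RSA 4096-bit (signs/s)", True, 3294.53, 4826.70),
    ("primesieve: 1e12 (s)", False, 14.08, 11.81),
    ("build-linux-kernel (s)", False, 37.40, 30.54),
    ("npb: LU.A (Mop/s)", True, 63042.80, 53974.49),
]

if __name__ == "__main__":
    for name, higher, epyc, xeon in ROWS:
        print(f"{name}: Xeon Gold vs. EPYC = {speedup(xeon, epyc, higher):.2f}x")
```

Inverting the ratio for time-based tests keeps every row readable the same way: above 1.0 the candidate is ahead, below 1.0 it is behind.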
OpenSSL 1.0.1g, RSA 4096-bit Performance
Signs Per Second, More Is Better

System                                  Result   SE +/-   N
2 x Intel Xeon Gold 6138               4826.70    23.22   3
r4.16xlarge                            3861.60     3.38   3
m4.16xlarge                            3835.70     3.53   3
AMD EPYC 7601 (NUMA Interleave All)    3306.37    13.22   3
AMD EPYC 7601                          3294.53    14.45   3
c4.8xlarge                             2382.30     4.10   3
m4.10xlarge                            2229.10     1.32   3
c3.8xlarge                             1836.53     4.67   3
c4.4xlarge                             1070.07     0.20   3

(CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
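
Each result is reported with a standard error (SE) and a run count N. Under the usual definition SE = s / sqrt(N), the sample standard deviation s and the relative error can be recovered from those two numbers. The sketch below applies this to the first OpenSSL entry (4826.70 signs/s, SE +/- 23.22, N = 3), assuming the first SE listed belongs to the first-listed result. The function names are hypothetical, and whether the export uses exactly this definition is an assumption.

```python
import math


def stddev_from_se(se: float, n: int) -> float:
    """Invert SE = s / sqrt(n) to recover the sample standard deviation s."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return se * math.sqrt(n)


def relative_se_percent(mean: float, se: float) -> float:
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


if __name__ == "__main__":
    mean, se, n = 4826.70, 23.22, 3
    print(f"s ~= {stddev_from_se(se, n):.2f} signs/s")
    print(f"relative SE ~= {relative_se_percent(mean, se):.2f}%")
```

A relative SE makes run-to-run noise comparable across tests whose units and magnitudes differ.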
Primesieve 5.4.2, 1e12 Prime Number Generation
Seconds, Fewer Is Better

System                                  Result   SE +/-   N
2 x Intel Xeon Gold 6138                 11.81     0.03   3
AMD EPYC 7601 (NUMA Interleave All)      14.06     0.04   3
AMD EPYC 7601                            14.08     0.05   3
m4.16xlarge                              14.39     0.01   3
r4.16xlarge                              14.63     0.04   3
c4.8xlarge                               20.75     0.03   3
m4.10xlarge                              22.18     0.05   3
c3.8xlarge                               24.76     0.01   3
c4.4xlarge                               45.96     0.05   3

(CXX) g++ options: -O2 -fopenmp
C-Ray 1.1, Total Time
Seconds, Fewer Is Better

System                                  Result   SE +/-   N
AMD EPYC 7601                             2.84     0.02   3
2 x Intel Xeon Gold 6138                  2.84     0.01   3
AMD EPYC 7601 (NUMA Interleave All)       2.85     0.03   3
r4.16xlarge                               4.16     0.03   3
m4.16xlarge                               4.21     0.04   3
c4.8xlarge                                6.08     0.01   3
m4.10xlarge                               6.45     0.01   3
c3.8xlarge                                8.54     0.01   3
c4.4xlarge                               13.22     0.01   3

(CC) gcc options: -lm -lpthread -O3
Timed LLVM Compilation 4.0.1, Time To Compile
Seconds, Fewer Is Better

System                                  Result   SE +/-   N
2 x Intel Xeon Gold 6138                134.64     0.81   3
m4.16xlarge                             147.24     1.14   3
r4.16xlarge                             147.57     1.87   3
AMD EPYC 7601                           175.11     3.02   4
AMD EPYC 7601 (NUMA Interleave All)     192.26     2.34   3
c4.8xlarge                              209.76     1.88   3
m4.10xlarge                             213.94     1.94   3
c3.8xlarge                              290.03     0.11   3
c4.4xlarge                              435.36     1.06   3
Timed Linux Kernel Compilation 4.9, Time To Compile
Seconds, Fewer Is Better

System                                  Result   SE +/-   N
2 x Intel Xeon Gold 6138                 30.54     0.89   6
m4.16xlarge                              30.62     0.52   6
r4.16xlarge                              30.80     0.51   6
AMD EPYC 7601                            37.40     0.51   6
AMD EPYC 7601 (NUMA Interleave All)      39.28     0.58   5
c4.8xlarge                               39.89     0.67   4
m4.10xlarge                              41.84     0.63   5
c3.8xlarge                               51.38     0.78   3
c4.4xlarge                               73.54     0.82   3
x264 2017-09-08, H.264 Video Encoding
Frames Per Second, More Is Better

System                                  Result   SE +/-   N
r4.16xlarge                             317.57     5.32   4
2 x Intel Xeon Gold 6138                310.79     3.63   3
m4.16xlarge                             304.36     3.42   3
c3.8xlarge                              303.73     3.63   3
AMD EPYC 7601                           292.63     1.27   3
AMD EPYC 7601 (NUMA Interleave All)     288.28     0.45   3
c4.8xlarge                              281.33     5.81   6
m4.10xlarge                             264.84     5.12   3
c4.4xlarge                              257.83     0.88   3

Per-result annotation in the export (listed twice, without system names): -lavformat -lavcodec -lavutil -lswscale
(CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
John The Ripper 1.8.0, Test: Blowfish
Real C/S, More Is Better

System                                  Result
m4.16xlarge                              39039
r4.16xlarge                              38527
AMD EPYC 7601 (NUMA Interleave All)      34335
2 x Intel Xeon Gold 6138                 30373
AMD EPYC 7601                            29553
c4.8xlarge                               25152
m4.10xlarge                              23520
c3.8xlarge                               19532
c4.4xlarge                               11382

Standard errors as listed in the export (eight entries for nine results, so they are not matched to systems here): SE +/- 25.17, N = 3; SE +/- 51.33, N = 3; SE +/- 764.17, N = 6; SE +/- 2076.33, N = 6; SE +/- 2253.07, N = 6; SE +/- 181.43, N = 3; SE +/- 73.10, N = 3; SE +/- 13.00, N = 3

(CC) gcc options: -fopenmp -lcrypt
Rodinia 2.4, Test: OpenMP Streamcluster
Seconds, Fewer Is Better

System                                  Result   SE +/-   N
AMD EPYC 7601 (NUMA Interleave All)      14.66     0.27   6
c3.8xlarge                               15.78     0.84   6
r4.16xlarge                              16.23     0.67   6
m4.16xlarge                              20.21     1.20   6
2 x Intel Xeon Gold 6138                 22.52     0.42   3
AMD EPYC 7601                            23.15     1.43   6
c4.8xlarge                               23.45     1.06   6
m4.10xlarge                              23.66     0.36   3
c4.4xlarge                               25.67     0.53   6

(CXX) g++ options: -O2 -lOpenCL
Rodinia 2.4, Test: OpenMP LavaMD
Seconds, Fewer Is Better

System                                  Result   SE +/-   N
AMD EPYC 7601 (NUMA Interleave All)      30.14     0.15   3
AMD EPYC 7601                            30.96     0.01   3
2 x Intel Xeon Gold 6138                 31.44     0.10   3
m4.16xlarge                              34.19     0.15   3
r4.16xlarge                              34.86     0.06   3
c4.8xlarge                               52.77     0.15   3
m4.10xlarge                              56.47     0.17   3
c4.4xlarge                              115.91     0.12   3
c3.8xlarge                              124.85     0.10   3

(CXX) g++ options: -O2 -lOpenCL
Parboil 2.5, Test: OpenMP Stencil
Seconds, Fewer Is Better

System                                  Result   SE +/-   N
m4.16xlarge                               7.63     0.15   3
r4.16xlarge                               7.66     0.09   3
AMD EPYC 7601 (NUMA Interleave All)       7.75     0.00   3
2 x Intel Xeon Gold 6138                  7.75     0.17   6
c3.8xlarge                               11.20     0.27   6
m4.10xlarge                              11.47     0.08   3
c4.8xlarge                               11.86     0.31   6
c4.4xlarge                               12.86     0.11   3
AMD EPYC 7601                            13.82     0.68   6

(CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP LBM
Seconds, Fewer Is Better

System                                  Result   SE +/-   N
AMD EPYC 7601 (NUMA Interleave All)      38.37     0.26   3
m4.16xlarge                              47.45     3.52   6
2 x Intel Xeon Gold 6138                 50.04     1.14   6
AMD EPYC 7601                            50.95     0.66   3
r4.16xlarge                              51.78     3.00   6
c4.8xlarge                               65.38     1.81   6
m4.10xlarge                              74.43     3.88   6
c3.8xlarge                               75.51     1.54   6
c4.4xlarge                              114.31     0.37   3

(CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
NAS Parallel Benchmarks 3.3, Test / Class: LU.C
Total Mop/s, More Is Better

System                                  Result   SE +/-   N  Open MPI
2 x Intel Xeon Gold 6138              50072.25   637.78   3     2.0.2
AMD EPYC 7601 (NUMA Interleave All)   50046.90    25.06   3     2.0.2
AMD EPYC 7601                         46983.34   747.98   3     2.0.2
r4.16xlarge                           45966.13   398.25   3    1.10.2
m4.16xlarge                           43175.90   341.99   3    1.10.2
m4.10xlarge                           36509.86   143.51   3    1.10.2
c3.8xlarge                            33661.53   598.90   3    1.10.2
c4.4xlarge                            18060.63    20.94   3    1.10.2

(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
NAS Parallel Benchmarks 3.3, Test / Class: LU.A
Total Mop/s, More Is Better

System                                  Result   SE +/-   N  Open MPI
r4.16xlarge                           67748.67  1660.00   6    1.10.2
m4.16xlarge                           66710.16   637.44   3    1.10.2
AMD EPYC 7601                         63042.80  1051.01   4     2.0.2
AMD EPYC 7601 (NUMA Interleave All)   62479.08   437.89   3     2.0.2
2 x Intel Xeon Gold 6138              53974.49  3783.72   6     2.0.2
m4.10xlarge                           37949.85   902.21   6    1.10.2
c3.8xlarge                            34478.44   104.91   3    1.10.2
c4.4xlarge                            20546.79    22.31   3    1.10.2

(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
NAS Parallel Benchmarks 3.3, Test / Class: EP.C
Total Mop/s, More Is Better

System                                  Result   SE +/-   N  Open MPI
2 x Intel Xeon Gold 6138               1815.89    34.76   6     2.0.2
AMD EPYC 7601                          1610.17     0.32   3     2.0.2
AMD EPYC 7601 (NUMA Interleave All)    1607.68     0.90   3     2.0.2
r4.16xlarge                            1045.49     1.67   3    1.10.2
m4.16xlarge                            1036.50     4.41   3    1.10.2
c4.8xlarge                              672.88     1.49   3    1.10.2
m4.10xlarge                             629.38     1.36   3    1.10.2
c3.8xlarge                              565.48     1.50   3    1.10.2
c4.4xlarge                              302.17     0.85   3    1.10.2

(F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
Phoronix Test Suite v10.8.5