AMD EPYC vs. Threadripper vs. Dual Xeon Gold

Some initial AMD EPYC 7601 tests on Ubuntu 17.04 with Linux 4.13. Tests for a future article on Phoronix.com. Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1709106-TY-AMDEPYC2455&grs&rdt.

AMD EPYC vs. Threadripper vs. Dual Xeon GoldProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkAudioOSKernelDisplay DriverCompilerFile-SystemScreen ResolutionDesktopDisplay ServerOpenGLAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 6138AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores)TYAN B8026T70AE24HRAMD Device 1450129024MB234GBASPEED ASPEED FamilyAcer P243WBroadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 17.044.13.0-041300-generic (x86_64)modesetting 1.19.3GCC 6.3.0 20170406ext41920x1200Unity 7.5.0AMD Ryzen Threadripper 1950X 16-Core @ 3.40GHz (32 Cores)Gigabyte X399 AORUS Gaming 732768MB120GB Force MP500XFX AMD Radeon R9 290/390 4096MBRealtek ALC1220Acer B286HKQualcomm Atheros Device e0b1 + Intel Wireless 8265 / 8275X Server 1.19.34.5 Mesa 17.0.3 Gallium 0.4 (LLVM 4.0.0)3840x21602 x Intel Xeon Gold 6138 @ 3.70GHz (80 Cores)TYAN S7106Intel Device 202096256MB256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150ASPEED ASPEED FamilyAcer P243WIntel I210 Gigabit Connection1920x1200OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v Processor Details- AMD EPYC 7601: Scaling Governor: acpi-cpufreq ondemand- AMD EPYC 7601 - NUMA Interleave All: Scaling Governor: acpi-cpufreq ondemand- AMD Threadripper 1950X: Scaling Governor: acpi-cpufreq ondemand- 2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate powersave

AMD EPYC vs. Threadripper vs. Dual Xeon Goldnpb: LU.Cparboil: OpenMP LBMopenssl: RSA 4096-bit Performancehmmer: Pfam Database Searchbuild-llvm: Time To Compiledarktable: Masskrug - CPU-onlyprimesieve: 1e12 Prime Number Generationnpb: EP.Cramspeed: Add - Integerc-ray: Total Timerodinia: OpenMP LavaMDapache: Static Web Page Servingblender: BMW27 - CPU-Onlyx264: H.264 Video Encodingdarktable: Server Room - CPU-onlydarktable: Boat - CPU-onlybuild-linux-kernel: Time To Compilettsiod-renderer: Phong Rendering With Soft-Shadow Mappingjohn-the-ripper: Blowfishrodinia: OpenMP Streamclusterparboil: OpenMP Stencilhpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: G-Fftehpcc: G-HPLnpb: LU.Anpb: FT.BAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613846983.3450.953294.538.31175.119.2414.081610.1732948.542.8430.9622609.58621.29292.634.627.9437.40412.712955323.1513.822.654130.783620.796504.6439163042.801660.0550046.9038.373306.378.03192.267.5714.061607.6833856.412.8530.1422529.83625.93288.283.727.0339.28430.153433514.667.753.587030.723800.535734.1913062479.082866.6620949.0287.102189.707.77249.0810.8220.261084.6633915.394.2243.7127960.06525.41324.496.2211.6045.96424.771402437.6313.141.995880.540900.380016.0132227592.433398.8750072.2550.044826.7016.31134.6413.1411.811815.8920407.102.8431.4428366.96540.98310.795.2914.3030.54236.463037322.527.754.353150.370350.925921.7626753974.492916.99OpenBenchmarking.org

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: LU.CAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613811K22K33K44K55KSE +/- 747.98, N = 3SE +/- 25.06, N = 3SE +/- 91.81, N = 3SE +/- 637.78, N = 346983.3450046.9020949.0250072.251. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613820406080100SE +/- 0.66, N = 3SE +/- 0.26, N = 3SE +/- 0.05, N = 3SE +/- 1.14, N = 650.9538.3787.1050.041. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit PerformanceAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613810002000300040005000SE +/- 14.45, N = 3SE +/- 13.22, N = 3SE +/- 1.96, N = 3SE +/- 23.22, N = 33294.533306.372189.704826.701. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613848121620SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 38.318.037.7716.311. (CC) gcc options: -O2 -pthread -lhmmer -lsquid -lm

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 4.0.1Time To CompileAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613850100150200250SE +/- 3.02, N = 4SE +/- 2.34, N = 3SE +/- 3.05, N = 3SE +/- 0.81, N = 3175.11192.26249.08134.64

Darktable

Test: Masskrug - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: CPU-onlyAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61383691215SE +/- 0.09, N = 3SE +/- 0.13, N = 4SE +/- 0.07, N = 3SE +/- 0.27, N = 69.247.5710.8213.14

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 5.4.21e12 Prime Number GenerationAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 6138510152025SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 314.0814.0620.2611.811. (CXX) g++ options: -O2 -fopenmp

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: EP.CAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 6138400800120016002000SE +/- 0.32, N = 3SE +/- 0.90, N = 3SE +/- 0.58, N = 3SE +/- 34.76, N = 61610.171607.681084.661815.891. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: IntegerAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61387K14K21K28K35K32948.5433856.4133915.3920407.101. (CC) gcc options: -O3 -march=native

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61380.94951.8992.84853.7984.7475SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 32.842.854.222.841. (CC) gcc options: -lm -lpthread -O3

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61381020304050SE +/- 0.01, N = 3SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 330.9630.1443.7131.441. (CXX) g++ options: -O2 -lOpenCL

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page ServingAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61386K12K18K24K30KSE +/- 50.73, N = 3SE +/- 52.07, N = 3SE +/- 168.94, N = 3SE +/- 105.14, N = 322609.5822529.8327960.0628366.961. (CC) gcc options: -shared -fPIC -O2 -pthread

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.78cBlend File: BMW27 - Compute: CPU-OnlyAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 6138140280420560700621.29625.93525.41540.98

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2017-09-08H.264 Video EncodingAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613870140210280350SE +/- 1.27, N = 3SE +/- 0.45, N = 3SE +/- 1.53, N = 3SE +/- 3.63, N = 3292.63288.28324.49310.79-lavformat -lavcodec -lavutil -lswscale-lavformat -lavcodec -lavutil -lswscale1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Darktable

Test: Server Room - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: CPU-onlyAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 6138246810SE +/- 0.07, N = 6SE +/- 0.08, N = 6SE +/- 0.09, N = 3SE +/- 0.35, N = 64.623.726.225.29

Darktable

Test: Boat - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: CPU-onlyAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613848121620SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.59, N = 67.947.0311.6014.30

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.9Time To CompileAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61381020304050SE +/- 0.51, N = 6SE +/- 0.58, N = 5SE +/- 0.62, N = 3SE +/- 0.89, N = 637.4039.2845.9630.54

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow MappingAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613890180270360450SE +/- 4.72, N = 3SE +/- 3.57, N = 3SE +/- 3.24, N = 3SE +/- 17.30, N = 6412.71430.15424.77236.461. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0Test: BlowfishAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61387K14K21K28K35KSE +/- 2253.07, N = 6SE +/- 764.17, N = 6SE +/- 2257.07, N = 6SE +/- 2076.33, N = 6295533433514024303731. (CC) gcc options: -fopenmp -lcrypt

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 6138918273645SE +/- 1.43, N = 6SE +/- 0.27, N = 6SE +/- 0.74, N = 3SE +/- 0.42, N = 323.1514.6637.6322.521. (CXX) g++ options: -O2 -lOpenCL

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613848121620SE +/- 0.68, N = 6SE +/- 0.00, N = 3SE +/- 0.08, N = 3SE +/- 0.17, N = 613.827.7513.147.751. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61380.97951.9592.93853.9184.8975SE +/- 0.11259, N = 3SE +/- 1.07962, N = 3SE +/- 0.04876, N = 3SE +/- 0.84575, N = 32.654133.587031.995884.353151. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61380.17630.35260.52890.70520.8815SE +/- 0.02928, N = 3SE +/- 0.06341, N = 3SE +/- 0.05016, N = 3SE +/- 0.04259, N = 30.783620.723800.540900.370351. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61380.20830.41660.62490.83321.0415SE +/- 0.00574, N = 3SE +/- 0.08554, N = 3SE +/- 0.02518, N = 3SE +/- 0.10042, N = 30.796500.535730.380010.925921. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61380.20830.41660.62490.83321.0415SE +/- 0.00574, N = 3SE +/- 0.08554, N = 3SE +/- 0.02518, N = 3SE +/- 0.10042, N = 30.796500.535730.380010.925921. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 6138246810SE +/- 0.06846, N = 3SE +/- 0.05229, N = 3SE +/- 0.17770, N = 6SE +/- 0.02653, N = 34.643914.191306.013221.762671. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: LU.AAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 613814K28K42K56K70KSE +/- 1051.01, N = 4SE +/- 437.89, N = 3SE +/- 11.08, N = 3SE +/- 3783.72, N = 663042.8062479.0827592.4353974.491. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: FT.BAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61387001400210028003500SE +/- 44.23, N = 6SE +/- 2.64, N = 3SE +/- 2.42, N = 3SE +/- 42.85, N = 41660.052866.663398.872916.991. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2


Phoronix Test Suite v10.8.4