AMD EPYC vs. Threadripper vs. Dual Xeon Gold

Some initial AMD EPYC 7601 tests on Ubuntu 17.04 with Linux 4.13. Tests for a future article on Phoronix.com. Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1709106-TY-AMDEPYC2455.

AMD EPYC vs. Threadripper vs. Dual Xeon GoldProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkAudioOSKernelDisplay DriverCompilerFile-SystemScreen ResolutionDesktopDisplay ServerOpenGLAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950XAMD EPYC 7601 32-Core @ 2.20GHz (64 Cores)TYAN B8026T70AE24HRAMD Device 1450129024MB234GBASPEED ASPEED FamilyAcer P243WBroadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 17.044.13.0-041300-generic (x86_64)modesetting 1.19.3GCC 6.3.0 20170406ext41920x1200Unity 7.5.02 x Intel Xeon Gold 6138 @ 3.70GHz (80 Cores)TYAN S7106Intel Device 202096256MB256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150Intel I210 Gigabit ConnectionAMD Ryzen Threadripper 1950X 16-Core @ 3.40GHz (32 Cores)Gigabyte X399 AORUS Gaming 7AMD Device 145032768MB120GB Force MP500XFX AMD Radeon R9 290/390 4096MBRealtek ALC1220Acer B286HKQualcomm Atheros Device e0b1 + Intel Wireless 8265 / 8275X Server 1.19.34.5 Mesa 17.0.3 Gallium 0.4 (LLVM 4.0.0)3840x2160OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v Processor Details- AMD EPYC 7601: Scaling Governor: acpi-cpufreq ondemand- AMD EPYC 7601 - NUMA Interleave All: Scaling Governor: acpi-cpufreq ondemand- 2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate powersave- AMD Threadripper 1950X: Scaling Governor: acpi-cpufreq ondemand

AMD EPYC vs. Threadripper vs. Dual Xeon Goldnpb: EP.Cnpb: FT.Bnpb: LU.Anpb: LU.Chpcc: G-HPLhpcc: G-Fftehpcc: G-Ptranshpcc: EP-STREAM Triadparboil: OpenMP LBMparboil: OpenMP Stencilrodinia: OpenMP LavaMDrodinia: OpenMP Streamclusterjohn-the-ripper: Blowfishttsiod-renderer: Phong Rendering With Soft-Shadow Mappingx264: H.264 Video Encodingbuild-linux-kernel: Time To Compilebuild-llvm: Time To Compilec-ray: Total Timeprimesieve: 1e12 Prime Number Generationopenssl: RSA 4096-bit Performancedarktable: Boat - CPU-onlydarktable: Masskrug - CPU-onlydarktable: Server Room - CPU-onlyblender: BMW27 - CPU-Onlyapache: Static Web Page Servingramspeed: Add - Integerhmmer: Pfam Database SearchAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X1610.171660.0563042.8046983.344.643910.796500.783622.6541350.9513.8230.9623.1529553412.71292.6337.40175.112.8414.083294.537.949.244.62621.2922609.5832948.548.311607.682866.6662479.0850046.904.191300.535730.723803.5870338.377.7530.1414.6634335430.15288.2839.28192.262.8514.063306.377.037.573.72625.9322529.8333856.418.031815.892916.9953974.4950072.251.762670.925920.370354.3531550.047.7531.4422.5230373236.46310.7930.54134.642.8411.814826.7014.3013.145.29540.9828366.9620407.1016.311084.663398.8727592.4320949.026.013220.380010.540901.9958887.1013.1443.7137.6314024424.77324.4945.96249.084.2220.262189.7011.6010.826.22525.4127960.0633915.397.77OpenBenchmarking.org

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: EP.CAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X400800120016002000SE +/- 0.32, N = 3SE +/- 0.90, N = 3SE +/- 34.76, N = 6SE +/- 0.58, N = 31610.171607.681815.891084.661. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: FT.BAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X7001400210028003500SE +/- 44.23, N = 6SE +/- 2.64, N = 3SE +/- 42.85, N = 4SE +/- 2.42, N = 31660.052866.662916.993398.871. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: LU.AAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X14K28K42K56K70KSE +/- 1051.01, N = 4SE +/- 437.89, N = 3SE +/- 3783.72, N = 6SE +/- 11.08, N = 363042.8062479.0853974.4927592.431. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: LU.CAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X11K22K33K44K55KSE +/- 747.98, N = 3SE +/- 25.06, N = 3SE +/- 637.78, N = 3SE +/- 91.81, N = 346983.3450046.9050072.2520949.021. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X246810SE +/- 0.06846, N = 3SE +/- 0.05229, N = 3SE +/- 0.02653, N = 3SE +/- 0.17770, N = 64.643914.191301.762676.013221. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X0.20830.41660.62490.83321.0415SE +/- 0.00574, N = 3SE +/- 0.08554, N = 3SE +/- 0.10042, N = 3SE +/- 0.02518, N = 30.796500.535730.925920.380011. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X0.20830.41660.62490.83321.0415SE +/- 0.00574, N = 3SE +/- 0.08554, N = 3SE +/- 0.10042, N = 3SE +/- 0.02518, N = 30.796500.535730.925920.380011. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X0.17630.35260.52890.70520.8815SE +/- 0.02928, N = 3SE +/- 0.06341, N = 3SE +/- 0.04259, N = 3SE +/- 0.05016, N = 30.783620.723800.370350.540901. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X0.97951.9592.93853.9184.8975SE +/- 0.11259, N = 3SE +/- 1.07962, N = 3SE +/- 0.84575, N = 3SE +/- 0.04876, N = 32.654133.587034.353151.995881. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X20406080100SE +/- 0.66, N = 3SE +/- 0.26, N = 3SE +/- 1.14, N = 6SE +/- 0.05, N = 350.9538.3750.0487.101. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X48121620SE +/- 0.68, N = 6SE +/- 0.00, N = 3SE +/- 0.17, N = 6SE +/- 0.08, N = 313.827.757.7513.141. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X1020304050SE +/- 0.01, N = 3SE +/- 0.15, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 330.9630.1431.4443.711. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X918273645SE +/- 1.43, N = 6SE +/- 0.27, N = 6SE +/- 0.42, N = 3SE +/- 0.74, N = 323.1514.6622.5237.631. (CXX) g++ options: -O2 -lOpenCL

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0Test: BlowfishAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X7K14K21K28K35KSE +/- 2253.07, N = 6SE +/- 764.17, N = 6SE +/- 2076.33, N = 6SE +/- 2257.07, N = 6295533433530373140241. (CC) gcc options: -fopenmp -lcrypt

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow MappingAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X90180270360450SE +/- 4.72, N = 3SE +/- 3.57, N = 3SE +/- 17.30, N = 6SE +/- 3.24, N = 3412.71430.15236.46424.771. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2017-09-08H.264 Video EncodingAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X70140210280350SE +/- 1.27, N = 3SE +/- 0.45, N = 3SE +/- 3.63, N = 3SE +/- 1.53, N = 3292.63288.28310.79324.49-lavformat -lavcodec -lavutil -lswscale-lavformat -lavcodec -lavutil -lswscale1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.9Time To CompileAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X1020304050SE +/- 0.51, N = 6SE +/- 0.58, N = 5SE +/- 0.89, N = 6SE +/- 0.62, N = 337.4039.2830.5445.96

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 4.0.1Time To CompileAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X50100150200250SE +/- 3.02, N = 4SE +/- 2.34, N = 3SE +/- 0.81, N = 3SE +/- 3.05, N = 3175.11192.26134.64249.08

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X0.94951.8992.84853.7984.7475SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 32.842.852.844.221. (CC) gcc options: -lm -lpthread -O3

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 5.4.21e12 Prime Number GenerationAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X510152025SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 314.0814.0611.8120.261. (CXX) g++ options: -O2 -fopenmp

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit PerformanceAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X10002000300040005000SE +/- 14.45, N = 3SE +/- 13.22, N = 3SE +/- 23.22, N = 3SE +/- 1.96, N = 33294.533306.374826.702189.701. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

Darktable

Test: Boat - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: CPU-onlyAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X48121620SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.59, N = 6SE +/- 0.10, N = 37.947.0314.3011.60

Darktable

Test: Masskrug - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: CPU-onlyAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X3691215SE +/- 0.09, N = 3SE +/- 0.13, N = 4SE +/- 0.27, N = 6SE +/- 0.07, N = 39.247.5713.1410.82

Darktable

Test: Server Room - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: CPU-onlyAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X246810SE +/- 0.07, N = 6SE +/- 0.08, N = 6SE +/- 0.35, N = 6SE +/- 0.09, N = 34.623.725.296.22

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.78cBlend File: BMW27 - Compute: CPU-OnlyAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X140280420560700621.29625.93540.98525.41

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page ServingAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X6K12K18K24K30KSE +/- 50.73, N = 3SE +/- 52.07, N = 3SE +/- 105.14, N = 3SE +/- 168.94, N = 322609.5822529.8328366.9627960.061. (CC) gcc options: -shared -fPIC -O2 -pthread

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: IntegerAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X7K14K21K28K35K32948.5433856.4120407.1033915.391. (CC) gcc options: -O3 -march=native

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X48121620SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 38.318.0316.317.771. (CC) gcc options: -O2 -pthread -lhmmer -lsquid -lm


Phoronix Test Suite v10.8.4