AMD EPYC vs. Threadripper vs. Dual Xeon Gold

Some initial AMD EPYC 7601 tests on Ubuntu 17.04 with Linux 4.13. Tests for a future article on Phoronix.com. Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1709106-TY-AMDEPYC2455&sor&grt.

AMD EPYC vs. Threadripper vs. Dual Xeon GoldProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkAudioOSKernelDisplay DriverCompilerFile-SystemScreen ResolutionDesktopDisplay ServerOpenGLAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950XAMD EPYC 7601 32-Core @ 2.20GHz (64 Cores)TYAN B8026T70AE24HRAMD Device 1450129024MB234GBASPEED ASPEED FamilyAcer P243WBroadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 17.044.13.0-041300-generic (x86_64)modesetting 1.19.3GCC 6.3.0 20170406ext41920x1200Unity 7.5.02 x Intel Xeon Gold 6138 @ 3.70GHz (80 Cores)TYAN S7106Intel Device 202096256MB256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150Intel I210 Gigabit ConnectionAMD Ryzen Threadripper 1950X 16-Core @ 3.40GHz (32 Cores)Gigabyte X399 AORUS Gaming 7AMD Device 145032768MB120GB Force MP500XFX AMD Radeon R9 290/390 4096MBRealtek ALC1220Acer B286HKQualcomm Atheros Device e0b1 + Intel Wireless 8265 / 8275X Server 1.19.34.5 Mesa 17.0.3 Gallium 0.4 (LLVM 4.0.0)3840x2160OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v Processor Details- AMD EPYC 7601: Scaling Governor: acpi-cpufreq ondemand- AMD EPYC 7601 - NUMA Interleave All: Scaling Governor: acpi-cpufreq ondemand- 2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate powersave- AMD Threadripper 1950X: Scaling Governor: acpi-cpufreq ondemand

AMD EPYC vs. Threadripper vs. Dual Xeon Goldapache: Static Web Page Servingblender: BMW27 - CPU-Onlyc-ray: Total Timedarktable: Boat - CPU-onlydarktable: Masskrug - CPU-onlydarktable: Server Room - CPU-onlyhpcc: G-HPLhpcc: G-Fftehpcc: G-Ptranshpcc: EP-STREAM Triadjohn-the-ripper: Blowfishnpb: EP.Cnpb: FT.Bnpb: LU.Anpb: LU.Copenssl: RSA 4096-bit Performanceparboil: OpenMP LBMparboil: OpenMP Stencilprimesieve: 1e12 Prime Number Generationramspeed: Add - Integerrodinia: OpenMP LavaMDrodinia: OpenMP Streamclusterhmmer: Pfam Database Searchbuild-linux-kernel: Time To Compilebuild-llvm: Time To Compilettsiod-renderer: Phong Rendering With Soft-Shadow Mappingx264: H.264 Video EncodingAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X22609.58621.292.847.949.244.624.643910.796500.783622.65413295531610.171660.0563042.8046983.343294.5350.9513.8214.0832948.5430.9623.158.3137.40175.11412.71292.6322529.83625.932.857.037.573.724.191300.535730.723803.58703343351607.682866.6662479.0850046.903306.3738.377.7514.0633856.4130.1414.668.0339.28192.26430.15288.2828366.96540.982.8414.3013.145.291.762670.925920.370354.35315303731815.892916.9953974.4950072.254826.7050.047.7511.8120407.1031.4422.5216.3130.54134.64236.46310.7927960.06525.414.2211.6010.826.226.013220.380010.540901.99588140241084.663398.8727592.4320949.022189.7087.1013.1420.2633915.3943.7137.637.7745.96249.08424.77324.49OpenBenchmarking.org

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page Serving2 x Intel Xeon Gold 6138AMD Threadripper 1950XAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All6K12K18K24K30KSE +/- 105.14, N = 3SE +/- 168.94, N = 3SE +/- 50.73, N = 3SE +/- 52.07, N = 328366.9627960.0622609.5822529.831. (CC) gcc options: -shared -fPIC -O2 -pthread

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.78cBlend File: BMW27 - Compute: CPU-OnlyAMD Threadripper 1950X2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All140280420560700525.41540.98621.29625.93

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeAMD EPYC 76012 x Intel Xeon Gold 6138AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X0.94951.8992.84853.7984.7475SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 32.842.842.854.221. (CC) gcc options: -lm -lpthread -O3

Darktable

Test: Boat - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: CPU-onlyAMD EPYC 7601 - NUMA Interleave AllAMD EPYC 7601AMD Threadripper 1950X2 x Intel Xeon Gold 613848121620SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.59, N = 67.037.9411.6014.30

Darktable

Test: Masskrug - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: CPU-onlyAMD EPYC 7601 - NUMA Interleave AllAMD EPYC 7601AMD Threadripper 1950X2 x Intel Xeon Gold 61383691215SE +/- 0.13, N = 4SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.27, N = 67.579.2410.8213.14

Darktable

Test: Server Room - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: CPU-onlyAMD EPYC 7601 - NUMA Interleave AllAMD EPYC 76012 x Intel Xeon Gold 6138AMD Threadripper 1950X246810SE +/- 0.08, N = 6SE +/- 0.07, N = 6SE +/- 0.35, N = 6SE +/- 0.09, N = 33.724.625.296.22

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLAMD Threadripper 1950XAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138246810SE +/- 0.17770, N = 6SE +/- 0.06846, N = 3SE +/- 0.05229, N = 3SE +/- 0.02653, N = 36.013224.643914.191301.762671. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X0.20830.41660.62490.83321.0415SE +/- 0.10042, N = 3SE +/- 0.00574, N = 3SE +/- 0.08554, N = 3SE +/- 0.02518, N = 30.925920.796500.535730.380011. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X0.20830.41660.62490.83321.0415SE +/- 0.10042, N = 3SE +/- 0.00574, N = 3SE +/- 0.08554, N = 3SE +/- 0.02518, N = 30.925920.796500.535730.380011. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X2 x Intel Xeon Gold 61380.17630.35260.52890.70520.8815SE +/- 0.02928, N = 3SE +/- 0.06341, N = 3SE +/- 0.05016, N = 3SE +/- 0.04259, N = 30.783620.723800.540900.370351. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad2 x Intel Xeon Gold 6138AMD EPYC 7601 - NUMA Interleave AllAMD EPYC 7601AMD Threadripper 1950X0.97951.9592.93853.9184.8975SE +/- 0.84575, N = 3SE +/- 1.07962, N = 3SE +/- 0.11259, N = 3SE +/- 0.04876, N = 34.353153.587032.654131.995881. (CC) gcc options: -lblas -lm -lmpich2. BLAS + mpicc for MPICH version 3.2

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0Test: BlowfishAMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD EPYC 7601AMD Threadripper 1950X7K14K21K28K35KSE +/- 764.17, N = 6SE +/- 2076.33, N = 6SE +/- 2253.07, N = 6SE +/- 2257.07, N = 6343353037329553140241. (CC) gcc options: -fopenmp -lcrypt

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: EP.C2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X400800120016002000SE +/- 34.76, N = 6SE +/- 0.32, N = 3SE +/- 0.90, N = 3SE +/- 0.58, N = 31815.891610.171607.681084.661. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: FT.BAMD Threadripper 1950X2 x Intel Xeon Gold 6138AMD EPYC 7601 - NUMA Interleave AllAMD EPYC 76017001400210028003500SE +/- 2.42, N = 3SE +/- 42.85, N = 4SE +/- 2.64, N = 3SE +/- 44.23, N = 63398.872916.992866.661660.051. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: LU.AAMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950X14K28K42K56K70KSE +/- 1051.01, N = 4SE +/- 437.89, N = 3SE +/- 3783.72, N = 6SE +/- 11.08, N = 363042.8062479.0853974.4927592.431. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: LU.C2 x Intel Xeon Gold 6138AMD EPYC 7601 - NUMA Interleave AllAMD EPYC 7601AMD Threadripper 1950X11K22K33K44K55KSE +/- 637.78, N = 3SE +/- 25.06, N = 3SE +/- 747.98, N = 3SE +/- 91.81, N = 350072.2550046.9046983.3420949.021. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.0.2

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit Performance2 x Intel Xeon Gold 6138AMD EPYC 7601 - NUMA Interleave AllAMD EPYC 7601AMD Threadripper 1950X10002000300040005000SE +/- 23.22, N = 3SE +/- 13.22, N = 3SE +/- 14.45, N = 3SE +/- 1.96, N = 34826.703306.373294.532189.701. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMAMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD EPYC 7601AMD Threadripper 1950X20406080100SE +/- 0.26, N = 3SE +/- 1.14, N = 6SE +/- 0.66, N = 3SE +/- 0.05, N = 338.3750.0450.9587.101. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilAMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD Threadripper 1950XAMD EPYC 760148121620SE +/- 0.00, N = 3SE +/- 0.17, N = 6SE +/- 0.08, N = 3SE +/- 0.68, N = 67.757.7513.1413.821. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 5.4.21e12 Prime Number Generation2 x Intel Xeon Gold 6138AMD EPYC 7601 - NUMA Interleave AllAMD EPYC 7601AMD Threadripper 1950X510152025SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 311.8114.0614.0820.261. (CXX) g++ options: -O2 -fopenmp

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: IntegerAMD Threadripper 1950XAMD EPYC 7601 - NUMA Interleave AllAMD EPYC 76012 x Intel Xeon Gold 61387K14K21K28K35K33915.3933856.4132948.5420407.101. (CC) gcc options: -O3 -march=native

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDAMD EPYC 7601 - NUMA Interleave AllAMD EPYC 76012 x Intel Xeon Gold 6138AMD Threadripper 1950X1020304050SE +/- 0.15, N = 3SE +/- 0.01, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 330.1430.9631.4443.711. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterAMD EPYC 7601 - NUMA Interleave All2 x Intel Xeon Gold 6138AMD EPYC 7601AMD Threadripper 1950X918273645SE +/- 0.27, N = 6SE +/- 0.42, N = 3SE +/- 1.43, N = 6SE +/- 0.74, N = 314.6622.5223.1537.631. (CXX) g++ options: -O2 -lOpenCL

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchAMD Threadripper 1950XAMD EPYC 7601 - NUMA Interleave AllAMD EPYC 76012 x Intel Xeon Gold 613848121620SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 37.778.038.3116.311. (CC) gcc options: -O2 -pthread -lhmmer -lsquid -lm

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.9Time To Compile2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X1020304050SE +/- 0.89, N = 6SE +/- 0.51, N = 6SE +/- 0.58, N = 5SE +/- 0.62, N = 330.5437.4039.2845.96

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 4.0.1Time To Compile2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950X50100150200250SE +/- 0.81, N = 3SE +/- 3.02, N = 4SE +/- 2.34, N = 3SE +/- 3.05, N = 3134.64175.11192.26249.08

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow MappingAMD EPYC 7601 - NUMA Interleave AllAMD Threadripper 1950XAMD EPYC 76012 x Intel Xeon Gold 613890180270360450SE +/- 3.57, N = 3SE +/- 3.24, N = 3SE +/- 4.72, N = 3SE +/- 17.30, N = 6430.15424.77412.71236.461. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2017-09-08H.264 Video EncodingAMD Threadripper 1950X2 x Intel Xeon Gold 6138AMD EPYC 7601AMD EPYC 7601 - NUMA Interleave All70140210280350SE +/- 1.53, N = 3SE +/- 3.63, N = 3SE +/- 1.27, N = 3SE +/- 0.45, N = 3324.49310.79292.63288.28-lavformat -lavcodec -lavutil -lswscale-lavformat -lavcodec -lavutil -lswscale1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize


Phoronix Test Suite v10.8.4