EPYC vs. Xeon POCL OpenCL

2 x Intel Xeon Gold 6138 testing with a TYAN S7106 and ASPEED ASPEED Family on Ubuntu 18.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1712079-AL-POCLING5673

AMD EPYC 7601: run on December 06 2017
2 x Intel Xeon Gold 6138: run on December 07 2017

EPYC vs. Xeon POCL OpenCL: System Details

AMD EPYC 7601
Processor: AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)
Motherboard: TYAN B8026T70AE24HR
Chipset: AMD Device 1450
Memory: 129024MB
Disk: 280GB INTEL SSDPE21D280GA
Graphics: ASPEED ASPEED Family
Monitor: VE228
Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe
OS: Ubuntu 17.10
Kernel: 4.15.0-999-generic (x86_64) 20171205
Desktop: GNOME Shell 3.26.1
Display Driver: modesetting 1.19.5
OpenGL: 3.3 Mesa 17.2.2 (LLVM 5.0 128 bits)
OpenCL: OpenCL 1.2 pocl 1.0 LLVM 5.0.0
Compiler: GCC 7.2.0 + Clang 5.0.0-3 + LLVM 5.0.0
File-System: ext4
Screen Resolution: 1920x1080

2 x Intel Xeon Gold 6138
Processor: 2 x Intel Xeon Gold 6138 @ 3.70GHz (40 Cores / 80 Threads)
Motherboard: TYAN S7106
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 96256MB
Disk: 256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150
Network: Intel I210 Gigabit Connection
OS: Ubuntu 18.04
Desktop: GNOME Shell 3.26.2
OpenGL: 3.3 Mesa 17.2.2 (LLVM 5.0 256 bits)
OpenCL: OpenCL 1.2 pocl 1.1-pre LLVM 5.0.0
Compiler: GCC 7.2.1 20171205 + Clang 5.0.0-4 + LLVM 5.0.0

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v

Processor Details
- AMD EPYC 7601: Scaling Governor: acpi-cpufreq ondemand
- 2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate powersave
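
The two systems ran with different scaling governors, so it may be worth checking your own system's setting before comparing against these results. The following is a minimal sketch that reads the driver and governor from the standard Linux cpufreq sysfs files. The function name `cpufreq_info` and its `sysfs_root` parameter are illustrative choices, not part of the Phoronix Test Suite, and how the suite itself gathers this value is not shown in the source.

```python
from pathlib import Path


def cpufreq_info(cpu=0, sysfs_root="/sys/devices/system/cpu"):
    """Return (scaling_driver, scaling_governor) for one CPU.

    Each entry is None when the corresponding sysfs file is unavailable,
    for example on systems without cpufreq support.
    """
    base = Path(sysfs_root) / f"cpu{cpu}" / "cpufreq"
    values = []
    for name in ("scaling_driver", "scaling_governor"):
        try:
            values.append((base / name).read_text().strip())
        except OSError:
            values.append(None)
    return tuple(values)


if __name__ == "__main__":
    driver, governor = cpufreq_info()
    print(f"Scaling Governor: {driver} {governor}")
```

On a system configured like the Xeon run above, this would be expected to report the `intel_pstate` driver with the `powersave` governor.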

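AMD EPYC 7601 vs. 2 x Intel Xeon Gold 6138 Comparison (Phoronix Test Suite)

Lead of the faster system over the slower one in each test:
SHOC Scalable HeterOgeneous Computing, OpenCL - Bus Speed Readback: AMD EPYC 7601 ahead by 118.6%
SHOC Scalable HeterOgeneous Computing, OpenCL - Triad: AMD EPYC 7601 ahead by 113.2%
SHOC Scalable HeterOgeneous Computing, OpenCL - Bus Speed Download: AMD EPYC 7601 ahead by 92.7%
SHOC Scalable HeterOgeneous Computing, OpenCL - Max SP Flops: 2 x Intel Xeon Gold 6138 ahead by 46.7%
ViennaCL, OpenCL LU Factorization: 2 x Intel Xeon Gold 6138 ahead by 43.1%
cl-mem, Write: AMD EPYC 7601 ahead by 42.1%
SHOC Scalable HeterOgeneous Computing, OpenCL - FFT SP: AMD EPYC 7601 ahead by 25.1%
SHOC Scalable HeterOgeneous Computing, OpenCL - MD5 Hash: 2 x Intel Xeon Gold 6138 ahead by 20%
cl-mem, Read: 2 x Intel Xeon Gold 6138 ahead by 17.5%
cl-mem, Copy: 2 x Intel Xeon Gold 6138 ahead by 10.3%
JuliaGPU, CPU: 2 x Intel Xeon Gold 6138 ahead by 8.6%
MandelGPU, CPU: AMD EPYC 7601 ahead by 5.3%
SHOC Scalable HeterOgeneous Computing, OpenCL - Texture Read Bandwidth: AMD EPYC 7601 ahead by 3.7%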

EPYC vs. Xeon POCL OpenCL: Result Overview

Test                                     Unit          AMD EPYC 7601   2 x Intel Xeon Gold 6138
shoc: OpenCL - Triad                     GB/s                   6.95                       3.26
shoc: OpenCL - FFT SP                    GFLOPS                13.49                      10.78
shoc: OpenCL - MD5 Hash                  GHash/s                0.40                       0.48
shoc: OpenCL - Max SP Flops              GFLOPS               703.34                    1031.48
shoc: OpenCL - Bus Speed Download        GB/s                  13.28                       6.89
shoc: OpenCL - Bus Speed Readback        GB/s                  13.29                       6.08
shoc: OpenCL - Texture Read Bandwidth    GB/s                  41.84                      40.35
viennacl: OpenCL LU Factorization        GFLOPS                14.20                      20.32
cl-mem: Copy                             GB/s                   6.10                       6.73
cl-mem: Read                             GB/s                  10.27                      12.07
cl-mem: Write                            GB/s                   4.93                       3.47
juliagpu: CPU                            Samples/sec     28355131.50                30799704.97
mandelgpu: CPU                           Samples/sec     13360267.10                12693064.77
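
The percentages in the comparison chart appear to be the lead of the faster system over the slower one, computed from the averages in the result overview. The sketch below reproduces them under that assumption; all tests here are "More Is Better", and the helper name `lead` is illustrative rather than anything the Phoronix Test Suite exposes.

```python
# (AMD EPYC 7601, 2 x Intel Xeon Gold 6138) averages from the result overview.
# Every test in this file is "More Is Better".
results = {
    "shoc: OpenCL - Triad": (6.95, 3.26),
    "shoc: OpenCL - FFT SP": (13.49, 10.78),
    "shoc: OpenCL - MD5 Hash": (0.40, 0.48),
    "shoc: OpenCL - Max SP Flops": (703.34, 1031.48),
    "shoc: OpenCL - Bus Speed Download": (13.28, 6.89),
    "shoc: OpenCL - Bus Speed Readback": (13.29, 6.08),
    "shoc: OpenCL - Texture Read Bandwidth": (41.84, 40.35),
    "viennacl: OpenCL LU Factorization": (14.20, 20.32),
    "cl-mem: Copy": (6.10, 6.73),
    "cl-mem: Read": (10.27, 12.07),
    "cl-mem: Write": (4.93, 3.47),
    "juliagpu: CPU": (28355131.50, 30799704.97),
    "mandelgpu: CPU": (13360267.10, 12693064.77),
}


def lead(epyc, xeon):
    """Return (leading system, percent lead of the faster over the slower)."""
    if epyc >= xeon:
        return "AMD EPYC 7601", (epyc / xeon - 1.0) * 100.0
    return "2 x Intel Xeon Gold 6138", (xeon / epyc - 1.0) * 100.0


leads = {test: lead(*values) for test, values in results.items()}

# Print the largest gaps first, as in the comparison chart.
for test, (system, pct) in sorted(leads.items(), key=lambda kv: -kv[1][1]):
    print(f"{test}: {system} ahead by {pct:.1f}%")
```

Counting the leaders in `leads` gives seven tests for the AMD EPYC 7601 and six for the 2 x Intel Xeon Gold 6138.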

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)
AMD EPYC 7601: 6.95 (SE +/- 0.27, N = 6; Min: 5.99 / Avg: 6.95 / Max: 7.64)
2 x Intel Xeon Gold 6138: 3.26 (SE +/- 0.13, N = 6; Min: 2.9 / Avg: 3.26 / Max: 3.71)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)
AMD EPYC 7601: 13.49 (SE +/- 0.16, N = 3; Min: 13.31 / Avg: 13.49 / Max: 13.8)
2 x Intel Xeon Gold 6138: 10.78 (SE +/- 0.08, N = 3; Min: 10.68 / Avg: 10.78 / Max: 10.94)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)
AMD EPYC 7601: 0.40 (SE +/- 0.00, N = 3; Min: 0.4 / Avg: 0.4 / Max: 0.4)
2 x Intel Xeon Gold 6138: 0.48 (SE +/- 0.01, N = 6; Min: 0.44 / Avg: 0.48 / Max: 0.48)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, More Is Better)
AMD EPYC 7601: 703.34 (SE +/- 37.88, N = 6; Min: 532.84 / Avg: 703.34 / Max: 810.8)
2 x Intel Xeon Gold 6138: 1031.48 (SE +/- 100.62, N = 6; Min: 539.4 / Avg: 1031.48 / Max: 1190.24)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Bus Speed Download (GB/s, More Is Better)
AMD EPYC 7601: 13.28 (SE +/- 0.02, N = 3; Min: 13.24 / Avg: 13.28 / Max: 13.32)
2 x Intel Xeon Gold 6138: 6.89 (SE +/- 0.57, N = 6; Min: 5.22 / Avg: 6.89 / Max: 9.14)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Bus Speed Readback (GB/s, More Is Better)
AMD EPYC 7601: 13.29 (SE +/- 0.04, N = 3; Min: 13.25 / Avg: 13.29 / Max: 13.37)
2 x Intel Xeon Gold 6138: 6.08 (SE +/- 0.31, N = 6; Min: 5.17 / Avg: 6.08 / Max: 7.29)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)
AMD EPYC 7601: 41.84 (SE +/- 0.07, N = 3; Min: 41.7 / Avg: 41.84 / Max: 41.94)
2 x Intel Xeon Gold 6138: 40.35 (SE +/- 0.32, N = 3; Min: 39.72 / Avg: 40.35 / Max: 40.74)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
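
Some of the SHOC results above vary much more between runs than others. One rough way to see this is to express each reported SE as a percentage of its average. The sketch below does that, assuming "SE" is the standard error of the mean; the 5% cut-off is an arbitrary illustrative threshold, not one the Phoronix Test Suite is known to use.

```python
# (average, SE, N) per system, copied from the SHOC results above.
EPYC = "AMD EPYC 7601"
XEON = "2 x Intel Xeon Gold 6138"

shoc = {
    "Triad": {EPYC: (6.95, 0.27, 6), XEON: (3.26, 0.13, 6)},
    "FFT SP": {EPYC: (13.49, 0.16, 3), XEON: (10.78, 0.08, 3)},
    "MD5 Hash": {EPYC: (0.40, 0.00, 3), XEON: (0.48, 0.01, 6)},
    "Max SP Flops": {EPYC: (703.34, 37.88, 6), XEON: (1031.48, 100.62, 6)},
    "Bus Speed Download": {EPYC: (13.28, 0.02, 3), XEON: (6.89, 0.57, 6)},
    "Bus Speed Readback": {EPYC: (13.29, 0.04, 3), XEON: (6.08, 0.31, 6)},
    "Texture Read Bandwidth": {EPYC: (41.84, 0.07, 3), XEON: (40.35, 0.32, 3)},
}

NOISY_THRESHOLD_PCT = 5.0  # hypothetical cut-off, chosen only for illustration


def relative_se(avg, se):
    """Standard error expressed as a percentage of the average."""
    return 100.0 * se / avg


noisy = set()
for bench, systems in shoc.items():
    for system, (avg, se, n) in systems.items():
        pct = relative_se(avg, se)
        if pct > NOISY_THRESHOLD_PCT:
            noisy.add((bench, system))
        print(f"{bench} / {system}: {pct:.2f}% of average (N = {n})")
```

With this threshold, the flagged results are Max SP Flops on both systems and the two Bus Speed tests on the Xeon system. Those are also the results where the test ran six times rather than three.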

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ with support for OpenCL and OpenMP. This test profile uses ViennaCL's OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.4.2, OpenCL LU Factorization (GFLOPS, More Is Better)
AMD EPYC 7601: 14.20 (SE +/- 0.20, N = 3; Min: 13.83 / Avg: 14.2 / Max: 14.5)
2 x Intel Xeon Gold 6138: 20.32 (SE +/- 0.18, N = 3; Min: 19.97 / Avg: 20.32 / Max: 20.52)
1. (CXX) g++ options: -rdynamic -lOpenCL

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13, Benchmark: Copy (GB/s, More Is Better)
AMD EPYC 7601: 6.10 (SE +/- 0.24, N = 6; Min: 5 / Avg: 6.1 / Max: 6.6)
2 x Intel Xeon Gold 6138: 6.73 (SE +/- 0.07, N = 3; Min: 6.6 / Avg: 6.73 / Max: 6.8)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13, Benchmark: Read (GB/s, More Is Better)
AMD EPYC 7601: 10.27 (SE +/- 0.24, N = 6; Min: 9.7 / Avg: 10.27 / Max: 11.3)
2 x Intel Xeon Gold 6138: 12.07 (SE +/- 0.09, N = 3; Min: 11.9 / Avg: 12.07 / Max: 12.2)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13, Benchmark: Write (GB/s, More Is Better)
AMD EPYC 7601: 4.93 (SE +/- 0.08, N = 6; Min: 4.8 / Avg: 4.93 / Max: 5.3)
2 x Intel Xeon Gold 6138: 3.47 (SE +/- 0.10, N = 6; Min: 3.1 / Avg: 3.47 / Max: 3.8)
1. (CC) gcc options: -O2 -flto -lOpenCL

JuliaGPU

JuliaGPU 1.2pts1, OpenCL Device: CPU (Samples/sec, More Is Better)
AMD EPYC 7601: 28355131.50 (SE +/- 128279.94, N = 3; Min: 28198136 / Avg: 28355131.5 / Max: 28609360.7)
2 x Intel Xeon Gold 6138: 30799704.97 (SE +/- 142014.93, N = 3; Min: 30597340.9 / Avg: 30799704.97 / Max: 31073488.6)
1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelGPU

MandelGPU is an OpenCL benchmark. This test runs the OpenCL float4 rendering kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

MandelGPU 1.3pts1, OpenCL Device: CPU (Samples/sec, More Is Better)
AMD EPYC 7601: 13360267.10 (SE +/- 18317.78, N = 3; Min: 13341775.3 / Avg: 13360267.1 / Max: 13396902.1)
2 x Intel Xeon Gold 6138: 12693064.77 (SE +/- 13616.78, N = 3; Min: 12669616.1 / Avg: 12693064.77 / Max: 12716783.7)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL