EPYC vs. Xeon vs. NVIDIA AMD GPUs POCL OpenCL

Tests for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1712072-PTS-POCLGPU384.

EPYC vs. Xeon vs. NVIDIA AMD GPUs POCL OpenCLProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkAudioOSKernelDesktopDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionDisplay ServerVulkanAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiAMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)TYAN B8026T70AE24HRAMD Device 1450129024MB280GB INTEL SSDPE21D280GAASPEED ASPEED FamilyVE228Broadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 17.104.15.0-999-generic (x86_64) 20171205GNOME Shell 3.26.1modesetting 1.19.53.3 Mesa 17.2.2 (LLVM 5.0 128 bits)OpenCL 1.2 pocl 1.0 LLVM 5.0.0GCC 7.2.0 + Clang 5.0.0-3 + LLVM 5.0.0ext41920x10802 x Intel Xeon Gold 6138 @ 3.70GHz (40 Cores / 80 Threads)TYAN S7106Intel Sky Lake-E DMI3 Registers96256MB256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150Intel I210 Gigabit ConnectionUbuntu 18.04GNOME Shell 3.26.23.3 Mesa 17.2.2 (LLVM 5.0 256 bits)OpenCL 1.2 pocl 1.1-pre LLVM 5.0.0GCC 7.2.1 20171205 + Clang 5.0.0-4 + LLVM 5.0.0Intel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)ASUS PRIME Z370-AIntel Device 3ec216384MB525GB Crucial_CT525MX3 + Samsung SSD 950 PRO 256GBMSI AMD Radeon RX 580 8192MBRealtek GenericDELL P2415QIntel ConnectionUbuntu 17.044.10.0-38-generic (x86_64)Unity 7.5.0X Server 1.19.3modesetting 1.19.34.5.13496OpenCL 2.0 AMD-APP (2482.3)1.0.42GCC 6.3.0 201704063840x2160Sapphire AMD Radeon 4096MBAMD Radeon RX Vega 8176MBamdgpu 1.3.99NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA 384.984.5.0Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz)eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1354/3504MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1480/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)OpenBenchmarking.orgCompiler Details- AMD EPYC 7601: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v- 2 x Intel Xeon Gold 6138: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v- Radeon RX 580: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon R9 Fury: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX Vega 56: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX Vega 64: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1060: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -vProcessor Details- AMD EPYC 7601: Scaling Governor: acpi-cpufreq ondemand- 2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate powersave- Radeon RX 580: Scaling Governor: intel_pstate performance- Radeon R9 Fury: Scaling Governor: intel_pstate performance- Radeon RX Vega 56: Scaling Governor: intel_pstate performance- Radeon RX Vega 64: Scaling Governor: intel_pstate performance- GeForce GTX 980 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1050: Scaling Governor: intel_pstate performance- GeForce GTX 1050 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1060: Scaling Governor: intel_pstate performance- GeForce GTX 1070: Scaling Governor: intel_pstate performance- GeForce GTX 1070 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1080: Scaling Governor: intel_pstate performance- GeForce GTX 1080 Ti: Scaling Governor: intel_pstate performanceKernel Details- Radeon RX 580, Radeon R9 Fury, Radeon RX Vega 56, Radeon RX Vega 64: amdgpu.vm_fragment_size=9Graphics Details- Radeon RX 580, Radeon R9 Fury, Radeon RX Vega 56, Radeon RX Vega 64: GLAMOROpenCL Details- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 1050: GPU Compute Cores: 640- GeForce GTX 1050 Ti: GPU Compute Cores: 768- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1070 Ti: GPU Compute Cores: 2432- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584System Details- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX 1050: GPU Compute Cores: 640.- GeForce GTX 1050 Ti: GPU Compute Cores: 768.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1070 Ti: GPU Compute Cores: 2432.- GeForce GTX 1080: GPU Compute Cores: 2560.- GeForce GTX 1080 Ti: GPU Compute Cores: 3584.

EPYC vs. Xeon vs. NVIDIA AMD GPUs POCL OpenCLshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Texture Read Bandwidthviennacl: OpenCL LU Factorizationcl-mem: Copycl-mem: Readcl-mem: Writejuliagpu: OpenCLAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti6.9513.490.40703.3441.8414.206.1010.274.9328355131.503.2610.780.481031.4840.3520.326.7312.073.4730799704.9712.02521.557.736261.97216.5614.13174.90208.40177.30248225442.3011.39793.708.847141.33249.3424.55330.70448.20381.60269983077.7012.47917.7314.2310702.80377.5241.47304.70339.10321.90331818340.6011.941072.7816.9912923.60431.2942.58363.90397.20378.20348619117.2012.26652.049.286250.35351.9859.24216.50266.20238.80138987453.5011.44197.203.242123.34267.7143.4187.1095.1086.306715780811.46197.934.112702.95303.8847.5586.8094.2085.2081114034.5011.94333.307.314854377.9556.09139.10153.40140121054859.2012.18481.0910.607256.76451.3859.66186.70205.40191.80149885098.2012.18514.9713.809127.64503.4163.24186.50205.50190.6017146526612.28582.5614.309593.38524.6063.58208.90229215.60176402393.6012.46990.4719.9013411.90598.5865.24317.10338.10336.30202544036.40OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti3691215SE +/- 0.27, N = 6SE +/- 0.13, N = 66.953.2612.0211.3912.4711.9412.2611.4411.4611.9412.1812.1812.2812.46-lSHOCCommonMPI -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -pthread -lmpi_cxx -lmpi1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti2004006008001000SE +/- 0.16, N = 3SE +/- 0.08, N = 313.4910.78521.55793.70917.731072.78652.04197.20197.93333.30481.09514.97582.56990.471. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 60.400.487.738.8414.2316.999.283.244.117.3110.6013.8014.3019.901. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti3K6K9K12K15KSE +/- 37.88, N = 6SE +/- 100.62, N = 6703.341031.486261.977141.3310702.8012923.606250.352123.342702.954854.007256.769127.649593.3813411.901. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti130260390520650SE +/- 0.07, N = 3SE +/- 0.32, N = 341.8440.35216.56249.34377.52431.29351.98267.71303.88377.95451.38503.41524.60598.581. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

ViennaCL

OpenCL LU Factorization

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti1530456075SE +/- 0.20, N = 3SE +/- 0.18, N = 314.2020.3214.1324.5541.4742.5859.2443.4147.5556.0959.6663.2463.5865.241. (CXX) g++ options: -rdynamic -lOpenCL

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti80160240320400SE +/- 0.24, N = 6SE +/- 0.07, N = 36.106.73174.90330.70304.70363.90216.5087.1086.80139.10186.70186.50208.90317.101. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti100200300400500SE +/- 0.24, N = 6SE +/- 0.09, N = 310.2712.07208.40448.20339.10397.20266.2095.1094.20153.40205.40205.50229.00338.101. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti80160240320400SE +/- 0.08, N = 6SE +/- 0.10, N = 64.933.47177.30381.60321.90378.20238.8086.3085.20140.00191.80190.60215.60336.301. (CC) gcc options: -O2 -flto -lOpenCL

JuliaGPU

OpenCL

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCLAMD EPYC 76012 x Intel Xeon Gold 6138Radeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti70M140M210M280M350MSE +/- 128279.94, N = 3SE +/- 142014.93, N = 328355131.5030799704.97248225442.30269983077.70331818340.60348619117.20138987453.5067157808.0081114034.50121054859.20149885098.20171465266.00176402393.60202544036.401. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm


Phoronix Test Suite v10.8.4