EPYC vs. Xeon vs. NVIDIA AMD GPUs POCL OpenCL

Tests for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1712072-PTS-POCLGPU384&grs&rdt.

EPYC vs. Xeon vs. NVIDIA AMD GPUs POCL OpenCLProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionOpenCLGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 6138Intel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)ASUS PRIME Z370-AIntel Device 3ec216384MB525GB Crucial_CT525MX3 + Samsung SSD 950 PRO 256GBNVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)Realtek GenericDELL P2415QIntel ConnectionUbuntu 17.044.10.0-38-generic (x86_64)Unity 7.5.0X Server 1.19.3NVIDIA 384.984.5.01.0.42GCC 6.3.0 20170406ext43840x2160eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1354/3504MHz)NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1480/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)AMD Radeon RX Vega 8176MBamdgpu 1.3.994.5.13496OpenCL 2.0 AMD-APP (2482.3)MSI AMD Radeon RX 580 8192MBmodesetting 1.19.3Sapphire AMD Radeon 4096MBAMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads)TYAN B8026T70AE24HRAMD Device 1450129024MB280GB INTEL SSDPE21D280GAASPEED ASPEED FamilyVE228Broadcom Limited NetXtreme BCM5720 Gigabit PCIeUbuntu 17.104.15.0-999-generic (x86_64) 20171205GNOME Shell 3.26.1modesetting 1.19.53.3 Mesa 17.2.2 (LLVM 5.0 128 bits)OpenCL 1.2 pocl 1.0 LLVM 5.0.0GCC 7.2.0 + Clang 5.0.0-3 + LLVM 5.0.01920x10802 x Intel Xeon Gold 6138 @ 3.70GHz (40 Cores / 80 Threads)TYAN S7106Intel Sky Lake-E DMI3 Registers96256MB256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150Intel I210 Gigabit ConnectionUbuntu 18.04GNOME Shell 3.26.23.3 Mesa 17.2.2 (LLVM 5.0 256 bits)OpenCL 1.2 pocl 1.1-pre LLVM 5.0.0GCC 7.2.1 20171205 + Clang 5.0.0-4 + LLVM 5.0.0OpenBenchmarking.orgCompiler Details- GeForce GTX 1060: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX Vega 56: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX Vega 64: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX 580: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon R9 Fury: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- AMD EPYC 7601: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v- 2 x Intel Xeon Gold 6138: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Details- GeForce GTX 1060: Scaling Governor: intel_pstate performance- GeForce GTX 1050 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1070: Scaling Governor: intel_pstate performance- GeForce GTX 1070 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1080: Scaling Governor: intel_pstate performance- GeForce GTX 1080 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1050: Scaling Governor: intel_pstate performance- GeForce GTX 980 Ti: Scaling Governor: intel_pstate performance- Radeon RX Vega 56: Scaling Governor: intel_pstate performance- Radeon RX Vega 64: Scaling Governor: intel_pstate performance- Radeon RX 580: Scaling Governor: intel_pstate performance- Radeon R9 Fury: Scaling Governor: intel_pstate performance- AMD EPYC 7601: Scaling Governor: acpi-cpufreq ondemand- 2 x Intel Xeon Gold 6138: Scaling Governor: intel_pstate powersaveOpenCL Details- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1050 Ti: GPU Compute Cores: 768- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1070 Ti: GPU Compute Cores: 2432- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584- GeForce GTX 1050: GPU Compute Cores: 640- GeForce GTX 980 Ti: GPU Compute Cores: 2816System Details- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1050 Ti: GPU Compute Cores: 768.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1070 Ti: GPU Compute Cores: 2432.- GeForce GTX 1080: GPU Compute Cores: 2560.- GeForce GTX 1080 Ti: GPU Compute Cores: 3584.- GeForce GTX 1050: GPU Compute Cores: 640.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.Kernel Details- Radeon RX Vega 56, Radeon RX Vega 64, Radeon RX 580, Radeon R9 Fury: amdgpu.vm_fragment_size=9Graphics Details- Radeon RX Vega 56, Radeon RX Vega 64, Radeon RX 580, Radeon R9 Fury: GLAMOR

EPYC vs. Xeon vs. NVIDIA AMD GPUs POCL OpenCLshoc: OpenCL - FFT SPcl-mem: Copyshoc: OpenCL - MD5 Hashcl-mem: Readviennacl: OpenCL LU Factorizationshoc: OpenCL - Triadshoc: OpenCL - Max SP Flopsshoc: OpenCL - Texture Read Bandwidthjuliagpu: OpenCLcl-mem: WriteGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 6138333.30139.107.31153.4056.0911.944854377.95121054859.20140197.9386.804.1194.2047.5511.462702.95303.8881114034.5085.20481.09186.7010.60205.4059.6612.187256.76451.38149885098.20191.80514.97186.5013.80205.5063.2412.189127.64503.41171465266190.60582.56208.9014.3022963.5812.289593.38524.60176402393.60215.60990.47317.1019.90338.1065.2412.4613411.90598.58202544036.40336.30197.2087.103.2495.1043.4111.442123.34267.716715780886.30652.04216.509.28266.2059.2412.266250.35351.98138987453.50238.80917.73304.7014.23339.1041.4712.4710702.80377.52331818340.60321.901072.78363.9016.99397.2042.5811.9412923.60431.29348619117.20378.20521.55174.907.73208.4014.1312.026261.97216.56248225442.30177.30793.70330.708.84448.2024.5511.397141.33249.34269983077.70381.6013.496.100.4010.2714.206.95703.3441.8428355131.504.9310.786.730.4812.0720.323.261031.4840.3530799704.973.47OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 61382004006008001000SE +/- 0.16, N = 3SE +/- 0.08, N = 3333.30197.93481.09514.97582.56990.47197.20652.04917.731072.78521.55793.7013.4910.781. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 613880160240320400SE +/- 0.24, N = 6SE +/- 0.07, N = 3139.1086.80186.70186.50208.90317.1087.10216.50304.70363.90174.90330.706.106.731. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 6138510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 67.314.1110.6013.8014.3019.903.249.2814.2316.997.738.840.400.481. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 6138100200300400500SE +/- 0.24, N = 6SE +/- 0.09, N = 3153.4094.20205.40205.50229.00338.1095.10266.20339.10397.20208.40448.2010.2712.071. (CC) gcc options: -O2 -flto -lOpenCL

ViennaCL

OpenCL LU Factorization

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 61381530456075SE +/- 0.20, N = 3SE +/- 0.18, N = 356.0947.5559.6663.2463.5865.2443.4159.2441.4742.5814.1324.5514.2020.321. (CXX) g++ options: -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 61383691215SE +/- 0.27, N = 6SE +/- 0.13, N = 611.9411.4612.1812.1812.2812.4611.4412.2612.4711.9412.0211.396.953.26-lSHOCCommonMPI -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -pthread -lmpi_cxx -lmpi1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 61383K6K9K12K15KSE +/- 37.88, N = 6SE +/- 100.62, N = 64854.002702.957256.769127.649593.3813411.902123.346250.3510702.8012923.606261.977141.33703.341031.481. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 6138130260390520650SE +/- 0.07, N = 3SE +/- 0.32, N = 3377.95303.88451.38503.41524.60598.58267.71351.98377.52431.29216.56249.3441.8440.351. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

JuliaGPU

OpenCL

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCLGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 613870M140M210M280M350MSE +/- 128279.94, N = 3SE +/- 142014.93, N = 3121054859.2081114034.50149885098.20171465266.00176402393.60202544036.4067157808.00138987453.50331818340.60348619117.20248225442.30269983077.7028355131.5030799704.971. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1050GeForce GTX 980 TiRadeon RX Vega 56Radeon RX Vega 64Radeon RX 580Radeon R9 FuryAMD EPYC 76012 x Intel Xeon Gold 613880160240320400SE +/- 0.08, N = 6SE +/- 0.10, N = 6140.0085.20191.80190.60215.60336.3086.30238.80321.90378.20177.30381.604.933.471. (CC) gcc options: -O2 -flto -lOpenCL


Phoronix Test Suite v10.8.4