OpenCL benchmarks for a future article on Phoronix.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1611107-TA-AMDGPUPRO05 AMDGPU-PRO OpenCL vs. NVIDIA Linux Comparison - Phoronix Test Suite AMDGPU-PRO OpenCL vs. NVIDIA Linux Comparison OpenCL benchmarks for a future article on Phoronix.
HTML result view exported from: https://openbenchmarking.org/result/1611107-TA-AMDGPUPRO05&rdt&grr .
AMDGPU-PRO OpenCL vs. NVIDIA Linux Comparison Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores) MSI C236A WORKSTATION (MS-7998) v1.0 Intel Sky Lake 16384MB 256GB INTEL SSDPEKKW256G7 Sapphire AMD Radeon R9 Fury 4053.82421875MB Realtek ALC1150 Acer B286HK Intel Connection Ubuntu 16.04 4.8.4-040804-generic (x86_64) Unity 7.4.0 X Server 1.18.4 modesetting 1.18.4 4.5.13453 OpenCL 2.0 AMD-APP (2117.10) 1.0.8 GCC 5.4.0 20160609 + LLVM 3.8.0 + CUDA 8.0 ext4 3840x2160 AMD Radeon RX 460 2009.7109375MB AMD Radeon RX 480 8141.7109375MB eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz) NVIDIA 375.10 4.5.0 OpenCL 1.2 CUDA 8.0.0 NVIDIA GeForce GTX 1080 8192MB (35/5005MHz) NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz) Zotac NVIDIA GeForce GTX 1050 2048MB (1316/3504MHz) NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details - Radeon R9 Fury: Scaling Governor: intel_pstate powersave - Radeon RX 460: Scaling Governor: intel_pstate performance - Radeon RX 480: Scaling Governor: intel_pstate performance - GeForce GTX 1050 Ti: Scaling Governor: intel_pstate performance - GeForce GTX 1080: Scaling Governor: intel_pstate powersave - GeForce GTX 1070: Scaling Governor: intel_pstate performance - GeForce GTX 1050: Scaling Governor: intel_pstate performance - GeForce GTX 1060: Scaling Governor: intel_pstate performance Graphics Details - Radeon R9 Fury, Radeon RX 460, Radeon RX 480: GLAMOR OpenCL Details - GeForce GTX 1050 Ti: GPU Compute Cores: 768 - GeForce GTX 1080: GPU Compute Cores: 2560 - GeForce GTX 1070: GPU Compute Cores: 1920 - GeForce GTX 1050: GPU Compute Cores: 640 - GeForce GTX 1060: GPU Compute Cores: 1280 System Details - GeForce GTX 1050 Ti: GPU Compute Cores: 768. - GeForce GTX 1080: GPU Compute Cores: 2560. - GeForce GTX 1070: GPU Compute Cores: 1920. - GeForce GTX 1050: GPU Compute Cores: 640. - GeForce GTX 1060: GPU Compute Cores: 1280.
AMDGPU-PRO OpenCL vs. NVIDIA Linux Comparison mandelbulbgpu: GPU juliagpu: GPU shoc: OpenCL - Texture Read Bandwidth shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Bus Speed Download shoc: OpenCL - Max SP Flops shoc: OpenCL - MD5 Hash shoc: OpenCL - FFT SP shoc: OpenCL - Triad Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 44542867.47 79965845.45 221.97 13.45 13.07 7133.59 5.88 24.51 4.06 26828537.03 41787915.50 78.96 7.12 6.84 2150.57 2.42 26.83 3.12 38283926.00 60734776.57 162.54 13.38 13.11 5434.47 5.12 35.28 5.59 44503222.40 77022561.27 311.54 13.17 12.53 2658.39 3.01 128.64 11.05 91368843.10 164397650.07 526.96 13.17 12.54 9426.88 11.81 346.47 11.84 79186944.93 143023765.00 450.88 13.17 12.51 7110.46 8.34 302.33 11.69 37264194.30 64392809.47 279.97 13.17 12.52 2104.35 2.49 116.39 11.06 62974313.00 113379989.40 397.51 13.17 12.53 4781.43 5.61 214.12 11.51 OpenBenchmarking.org
MandelbulbGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelbulbGPU 1.0pts1 OpenCL Device: GPU Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 20M 40M 60M 80M 100M SE +/- 691465.86, N = 3 SE +/- 76036.11, N = 3 SE +/- 121550.30, N = 2 SE +/- 74696.74, N = 3 SE +/- 361988.46, N = 3 SE +/- 247192.13, N = 3 SE +/- 38810.35, N = 3 SE +/- 142262.41, N = 3 44542867.47 26828537.03 38283926.00 44503222.40 91368843.10 79186944.93 37264194.30 62974313.00 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
JuliaGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 40M 80M 120M 160M 200M SE +/- 1373156.95, N = 2 SE +/- 629997.60, N = 2 SE +/- 478807.27, N = 3 SE +/- 586830.45, N = 3 SE +/- 545460.59, N = 3 SE +/- 507906.47, N = 3 SE +/- 64687.33, N = 3 SE +/- 406531.89, N = 3 79965845.45 41787915.50 60734776.57 77022561.27 164397650.07 143023765.00 64392809.47 113379989.40 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Texture Read Bandwidth Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 110 220 330 440 550 SE +/- 0.11, N = 3 SE +/- 0.33, N = 3 SE +/- 0.34, N = 3 SE +/- 4.30, N = 3 SE +/- 0.23, N = 3 SE +/- 0.17, N = 3 SE +/- 3.00, N = 3 SE +/- 0.98, N = 3 221.97 78.96 162.54 311.54 526.96 450.88 279.97 397.51 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Readback Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 3 6 9 12 15 SE +/- 0.21, N = 4 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 13.45 7.12 13.38 13.17 13.17 13.17 13.17 13.17 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Download Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 13.07 6.84 13.11 12.53 12.54 12.51 12.52 12.53 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Max SP Flops Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 2K 4K 6K 8K 10K SE +/- 0.46, N = 3 SE +/- 2.88, N = 3 SE +/- 39.66, N = 3 SE +/- 6.27, N = 3 SE +/- 57.81, N = 3 SE +/- 51.57, N = 3 SE +/- 5.38, N = 3 SE +/- 21.87, N = 3 7133.59 2150.57 5434.47 2658.39 9426.88 7110.46 2104.35 4781.43 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: MD5 Hash Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 5.88 2.42 5.12 3.01 11.81 8.34 2.49 5.61 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: FFT SP Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 80 160 240 320 400 SE +/- 0.23, N = 3 SE +/- 4.17, N = 6 SE +/- 1.64, N = 6 SE +/- 0.99, N = 3 SE +/- 0.65, N = 3 SE +/- 0.44, N = 3 SE +/- 0.37, N = 3 SE +/- 2.10, N = 3 24.51 26.83 35.28 128.64 346.47 302.33 116.39 214.12 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Triad Radeon R9 Fury Radeon RX 460 Radeon RX 480 GeForce GTX 1050 Ti GeForce GTX 1080 GeForce GTX 1070 GeForce GTX 1050 GeForce GTX 1060 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.06, N = 6 SE +/- 0.70, N = 6 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 4.06 3.12 5.59 11.05 11.84 11.69 11.06 11.51 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
Phoronix Test Suite v10.8.4