OpenCL benchmarks for a future article on Phoronix.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1611107-TA-AMDGPUPRO05 AMDGPU-PRO OpenCL vs. NVIDIA Linux Comparison - Phoronix Test Suite AMDGPU-PRO OpenCL vs. NVIDIA Linux Comparison OpenCL benchmarks for a future article on Phoronix.
HTML result view exported from: https://openbenchmarking.org/result/1611107-TA-AMDGPUPRO05&grw&sro .
AMDGPU-PRO OpenCL vs. NVIDIA Linux Comparison Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Radeon RX 460 Radeon RX 480 Radeon R9 Fury GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores) MSI C236A WORKSTATION (MS-7998) v1.0 Intel Sky Lake 16384MB 256GB INTEL SSDPEKKW256G7 AMD Radeon RX 460 2009.7109375MB Realtek ALC1150 Acer B286HK Intel Connection Ubuntu 16.04 4.8.4-040804-generic (x86_64) Unity 7.4.0 X Server 1.18.4 modesetting 1.18.4 4.5.13453 OpenCL 2.0 AMD-APP (2117.10) 1.0.8 GCC 5.4.0 20160609 + LLVM 3.8.0 + CUDA 8.0 ext4 3840x2160 AMD Radeon RX 480 8141.7109375MB Sapphire AMD Radeon R9 Fury 4053.82421875MB Zotac NVIDIA GeForce GTX 1050 2048MB (1316/3504MHz) NVIDIA 375.10 4.5.0 OpenCL 1.2 CUDA 8.0.0 eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz) NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz) NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz) NVIDIA GeForce GTX 1080 8192MB (35/5005MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details - Radeon RX 460: Scaling Governor: intel_pstate performance - Radeon RX 480: Scaling Governor: intel_pstate performance - Radeon R9 Fury: Scaling Governor: intel_pstate powersave - GeForce GTX 1050: Scaling Governor: intel_pstate performance - GeForce GTX 1050 Ti: Scaling Governor: intel_pstate performance - GeForce GTX 1060: Scaling Governor: intel_pstate performance - GeForce GTX 1070: Scaling Governor: intel_pstate performance - GeForce GTX 1080: Scaling Governor: intel_pstate powersave Graphics Details - Radeon RX 460, Radeon RX 480, Radeon R9 Fury: GLAMOR OpenCL Details - GeForce GTX 1050: GPU Compute Cores: 640 - GeForce GTX 1050 Ti: GPU Compute Cores: 768 - GeForce GTX 1060: GPU Compute Cores: 1280 - GeForce GTX 1070: GPU Compute Cores: 1920 - GeForce GTX 1080: GPU Compute Cores: 2560 System Details - GeForce GTX 1050: GPU Compute Cores: 640. - GeForce GTX 1050 Ti: GPU Compute Cores: 768. - GeForce GTX 1060: GPU Compute Cores: 1280. - GeForce GTX 1070: GPU Compute Cores: 1920. - GeForce GTX 1080: GPU Compute Cores: 2560.
AMDGPU-PRO OpenCL vs. NVIDIA Linux Comparison shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth juliagpu: GPU mandelbulbgpu: GPU Radeon RX 460 Radeon RX 480 Radeon R9 Fury GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 3.12 26.83 2.42 2150.57 6.84 7.12 78.96 41787915.50 26828537.03 5.59 35.28 5.12 5434.47 13.11 13.38 162.54 60734776.57 38283926.00 4.06 24.51 5.88 7133.59 13.07 13.45 221.97 79965845.45 44542867.47 11.06 116.39 2.49 2104.35 12.52 13.17 279.97 64392809.47 37264194.30 11.05 128.64 3.01 2658.39 12.53 13.17 311.54 77022561.27 44503222.40 11.51 214.12 5.61 4781.43 12.53 13.17 397.51 113379989.40 62974313.00 11.69 302.33 8.34 7110.46 12.51 13.17 450.88 143023765.00 79186944.93 11.84 346.47 11.81 9426.88 12.54 13.17 526.96 164397650.07 91368843.10 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Triad GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 6 SE +/- 0.70, N = 6 11.06 11.05 11.51 11.69 11.84 4.06 3.12 5.59 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: FFT SP GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 80 160 240 320 400 SE +/- 0.37, N = 3 SE +/- 0.99, N = 3 SE +/- 2.10, N = 3 SE +/- 0.44, N = 3 SE +/- 0.65, N = 3 SE +/- 0.23, N = 3 SE +/- 4.17, N = 6 SE +/- 1.64, N = 6 116.39 128.64 214.12 302.33 346.47 24.51 26.83 35.28 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: MD5 Hash GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 2.49 3.01 5.61 8.34 11.81 5.88 2.42 5.12 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Max SP Flops GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 2K 4K 6K 8K 10K SE +/- 5.38, N = 3 SE +/- 6.27, N = 3 SE +/- 21.87, N = 3 SE +/- 51.57, N = 3 SE +/- 57.81, N = 3 SE +/- 0.46, N = 3 SE +/- 2.88, N = 3 SE +/- 39.66, N = 3 2104.35 2658.39 4781.43 7110.46 9426.88 7133.59 2150.57 5434.47 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Download GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 12.52 12.53 12.53 12.51 12.54 13.07 6.84 13.11 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Readback GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.21, N = 4 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 13.17 13.17 13.17 13.17 13.17 13.45 7.12 13.38 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Texture Read Bandwidth GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 110 220 330 440 550 SE +/- 3.00, N = 3 SE +/- 4.30, N = 3 SE +/- 0.98, N = 3 SE +/- 0.17, N = 3 SE +/- 0.23, N = 3 SE +/- 0.11, N = 3 SE +/- 0.33, N = 3 SE +/- 0.34, N = 3 279.97 311.54 397.51 450.88 526.96 221.97 78.96 162.54 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
JuliaGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 40M 80M 120M 160M 200M SE +/- 64687.33, N = 3 SE +/- 586830.45, N = 3 SE +/- 406531.89, N = 3 SE +/- 507906.47, N = 3 SE +/- 545460.59, N = 3 SE +/- 1373156.95, N = 2 SE +/- 629997.60, N = 2 SE +/- 478807.27, N = 3 64392809.47 77022561.27 113379989.40 143023765.00 164397650.07 79965845.45 41787915.50 60734776.57 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
MandelbulbGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelbulbGPU 1.0pts1 OpenCL Device: GPU GeForce GTX 1050 GeForce GTX 1050 Ti GeForce GTX 1060 GeForce GTX 1070 GeForce GTX 1080 Radeon R9 Fury Radeon RX 460 Radeon RX 480 20M 40M 60M 80M 100M SE +/- 38810.35, N = 3 SE +/- 74696.74, N = 3 SE +/- 142262.41, N = 3 SE +/- 247192.13, N = 3 SE +/- 361988.46, N = 3 SE +/- 691465.86, N = 3 SE +/- 76036.11, N = 3 SE +/- 121550.30, N = 2 37264194.30 44503222.40 62974313.00 79186944.93 91368843.10 44542867.47 26828537.03 38283926.00 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Phoronix Test Suite v10.8.4