NVIDIA vs. OpenCL ROCm Linux vs. AMDGPU-PRO Benchmarks

ROCm 1.4 benchmarks on Ubuntu 16.04 compared to AMDGPU-PRO, now with NVIDIA comparison points. OpenCL benchmarks by Michael Larabel for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1701190-KH-1701193RI82&grw&sor .

System Configuration (OpenBenchmarking.org)

Systems tested: GeForce GTX 1050, GeForce GTX 1050 Ti, GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1080, Radeon RX 460 - AMDGPU-PRO, Radeon RX 480 - AMDGPU-PRO, Radeon R9 Fury - AMDGPU-PRO, Radeon RX 460 - ROCm, Radeon RX 480 - ROCm, Radeon R9 Fury - ROCm

Configuration of the first system (GeForce GTX 1050):
Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)
Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0
Chipset: Intel Sky Lake
Memory: 16384MB
Disk: 256GB TOSHIBA-RD400
Graphics: Zotac NVIDIA GeForce GTX 1050 2048MB (1075/3504MHz)
Audio: Realtek ALC1150
Network: Intel Connection
OS: Ubuntu 16.04
Kernel: 4.4.0-59-generic (x86_64)
Desktop: Unity 7.4.0
Display Server: X Server 1.18.3
Display Driver: NVIDIA 375.26
OpenGL: 4.5.0
OpenCL: OpenCL 1.2 CUDA 8.0.0
Vulkan: 1.0.24
Compiler: GCC 5.4.0 20160609
File-System: ext4
Screen Resolution: 3840x2160

Changes listed for the later systems, in export order:
GeForce GTX 1050 Ti - Graphics: eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz)
GeForce GTX 1060 - Graphics: NVIDIA GeForce GTX 1060 6GB 6144MB (418/4006MHz)
GeForce GTX 1070 - Graphics: NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz)
GeForce GTX 1080 - Graphics: NVIDIA GeForce GTX 1080 8192MB (109/5005MHz)
Radeon RX 460 - AMDGPU-PRO - Graphics: AMD Radeon RX 460 2048MB; Monitor: Acer B286HK; Display Driver: amdgpu 1.1.99; OpenGL: 4.5.13462; OpenCL: OpenCL 2.0 AMD-APP (2236.5)
Radeon RX 480 - AMDGPU-PRO - Graphics: AMD Radeon RX 480 8192MB
Radeon R9 Fury - AMDGPU-PRO - Graphics: Sapphire AMD Radeon R9 Fury 4096MB
Radeon RX 460 - ROCm - Graphics: LLVMpipe; Kernel: 4.6.0-kfd-compute-rocm-rel-1.4-16 (x86_64); Display Driver: modesetting 1.18.3; OpenGL: 3.3 Mesa 11.2.0 Gallium 0.4; OpenCL: OpenCL 2.0 AMD-APP (2300.5); Compiler: GCC 5.4.0 20160609 + Clang 4.0 + LLVM 4.0.0
Radeon R9 Fury - ROCm - Graphics: Sapphire AMD Radeon R9 FURY / NANO 3968MB; OpenGL: 4.1 Mesa 11.2.0 Gallium 0.4

Compiler Details - --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details - Scaling Governor: intel_pstate powersave

OpenCL Details (also listed under System Details) - GPU Compute Cores:
GeForce GTX 1050: 640
GeForce GTX 1050 Ti: 768
GeForce GTX 1060: 1280
GeForce GTX 1070: 1920
GeForce GTX 1080: 2560

Graphics Details - Radeon RX 460 - AMDGPU-PRO, Radeon RX 480 - AMDGPU-PRO, Radeon R9 Fury - AMDGPU-PRO, Radeon R9 Fury - ROCm: GLAMOR

Environment Details - Radeon RX 460 - ROCm, Radeon RX 480 - ROCm: LIBGL_ALWAYS_SOFTWARE=1

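Results Overview (OpenBenchmarking.org)

Empty cells have no result in the export.

| Test | GeForce GTX 1050 | GeForce GTX 1050 Ti | GeForce GTX 1060 | GeForce GTX 1070 | GeForce GTX 1080 |
|---|---|---|---|---|---|
| darktable: Boat - OpenCL | 15.45 | 13.97 | 4.67 | 3.87 | 3.72 |
| darktable: Masskrug - OpenCL | 15.16 | 15.44 | 5.90 | 5.74 | 5.73 |
| darktable: Server Room - OpenCL | 11.78 | 11.01 | 1.20 | 0.99 | 0.99 |
| shoc: OpenCL - Triad | 11.25 | 11.38 | 11.85 | 12.08 | 12.20 |
| shoc: OpenCL - FFT SP | 223.30 | 188.16 | 296.88 | 456.72 | 573.71 |
| shoc: OpenCL - Max SP Flops | 2115.38 | 2697.13 | 4780.88 | 7115.54 | 9415.48 |
| shoc: OpenCL - Bus Speed Download | 12.75 | 12.78 | 12.78 | 12.78 | 12.78 |
| shoc: OpenCL - Bus Speed Readback | 13.11 | 13.22 | 13.22 | 13.22 | 13.22 |
| shoc: OpenCL - Texture Read Bandwidth | 282.49 | 316.10 | 393.69 | 446.64 | 520.51 |
| rodinia: OpenCL Heartwall | 5.27 | 3.65 | 3.36 | | |
| mandelgpu: GPU | 51548791.30 | 64272664.57 | 112043183.47 | 159458228.23 | 206148858.53 |
| juliagpu: GPU | 64896787.13 | 78171484.97 | 115523522.73 | 144431468.40 | 165302847.33 |
| luxmark: GPU - Hotel | 1128 | 1334 | 2092 | 3023 | 2993 |
| luxmark: GPU - Microphone | 3300 | 3612 | 5204 | 7302 | 6388 |
| luxmark: GPU - Luxball HDR | 6656 | 7391 | 11768 | 16215 | 12968 |
| mandelbulbgpu: GPU | 37667402.03 | 44889116.70 | 63345982.20 | 79620073.63 | 91109498.40 |

| Test | Radeon RX 460 - AMDGPU-PRO | Radeon RX 480 - AMDGPU-PRO | Radeon R9 Fury - AMDGPU-PRO | Radeon RX 460 - ROCm | Radeon RX 480 - ROCm | Radeon R9 Fury - ROCm |
|---|---|---|---|---|---|---|
| darktable: Boat - OpenCL | 9.51 | 4.37 | 4.22 | 9.57 | 5.72 | 4.98 |
| darktable: Masskrug - OpenCL | 7.20 | 5.76 | 6.30 | 7.05 | 5.93 | 6.09 |
| darktable: Server Room - OpenCL | 2.83 | 0.99 | 1.79 | 2.48 | 0.99 | 1.48 |
| shoc: OpenCL - Triad | 6.25 | 9.40 | 4.12 | 5.21 | 7.94 | 10.59 |
| shoc: OpenCL - FFT SP | 245.13 | 508.20 | 751.86 | 158.21 | 403.22 | 399.71 |
| shoc: OpenCL - Max SP Flops | 2066.69 | 5750.69 | 7131.18 | 2158.12 | 5815.52 | 5330.67 |
| shoc: OpenCL - Bus Speed Download | 6.93 | 13.66 | 13.69 | 5.72 | 8.37 | 11.32 |
| shoc: OpenCL - Bus Speed Readback | 7.14 | 14.20 | 14.21 | 5.27 | 8.38 | 10.86 |
| shoc: OpenCL - Texture Read Bandwidth | 77.35 | 160.57 | 223.25 | 91.14 | 193.49 | 214.53 |
| rodinia: OpenCL Heartwall | 7.97 | 5.35 | 6.38 | 13.51 | 7.28 | 6.45 |
| mandelgpu: GPU | 35552080.15 | 81101281.90 | 107202116.40 | 28295516.33 | 59296261.87 | 82051996.27 |
| juliagpu: GPU | 50807022.25 | 81972594.40 | 75992404.70 | 46101692.27 | 70675082.10 | 73072755.80 |
| luxmark: GPU - Hotel | 897 | 2399 | 2402 | 381 | 987 | 1201 |
| luxmark: GPU - Microphone | 2623 | 6924 | 7681 | | | 5695 |
| luxmark: GPU - Luxball HDR | 5547 | 14066 | 19394 | 3664 | 9196 | 11995 |
| mandelbulbgpu: GPU | 32208376.98 | 48517365.80 | 43447360.40 | 29562658.90 | 49050438.67 | 44388927.12 |
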
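One way to read the overview numbers is as a ROCm-to-AMDGPU-PRO ratio per card. The sketch below does that for two tests, using values copied from the overview (SHOC Max SP Flops and Darktable Boat). The function and variable names are illustrative assumptions, not part of the export or of Phoronix Test Suite; the only subtlety is that "fewer is better" results need the ratio inverted.

```python
# Values copied from the results overview; names are illustrative only.

def relative_performance(rocm, amdgpu_pro, higher_is_better):
    """Return ROCm performance relative to AMDGPU-PRO (1.0 means parity)."""
    if higher_is_better:
        return rocm / amdgpu_pro
    # For timings a smaller value is faster, so invert the ratio.
    return amdgpu_pro / rocm

# SHOC Max SP Flops in GFLOPS (more is better): (ROCm, AMDGPU-PRO)
max_sp_flops = {
    "Radeon RX 460": (2158.12, 2066.69),
    "Radeon RX 480": (5815.52, 5750.69),
    "Radeon R9 Fury": (5330.67, 7131.18),
}

# Darktable Boat in seconds (fewer is better): (ROCm, AMDGPU-PRO)
darktable_boat = {
    "Radeon RX 460": (9.57, 9.51),
    "Radeon RX 480": (5.72, 4.37),
    "Radeon R9 Fury": (4.98, 4.22),
}

flops_ratio = {
    gpu: relative_performance(rocm, pro, higher_is_better=True)
    for gpu, (rocm, pro) in max_sp_flops.items()
}
boat_ratio = {
    gpu: relative_performance(rocm, pro, higher_is_better=False)
    for gpu, (rocm, pro) in darktable_boat.items()
}

for gpu in max_sp_flops:
    print(f"{gpu}: Max SP Flops {flops_ratio[gpu]:.2f}x, "
          f"Darktable Boat {boat_ratio[gpu]:.2f}x (ROCm vs. AMDGPU-PRO)")
```

A ratio above 1.0 means the ROCm result is ahead of AMDGPU-PRO on that card, below 1.0 means it trails.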
Darktable 2.2.1 - Test: Boat - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)
GeForce GTX 1080: 3.72 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 3.87 (SE +/- 0.00, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 4.22 (SE +/- 0.07, N = 3)
Radeon RX 480 - AMDGPU-PRO: 4.37 (SE +/- 0.01, N = 3)
GeForce GTX 1060: 4.67 (SE +/- 0.01, N = 3)
Radeon R9 Fury - ROCm: 4.98 (SE +/- 0.03, N = 3)
Radeon RX 480 - ROCm: 5.72 (SE +/- 0.02, N = 3)
Radeon RX 460 - AMDGPU-PRO: 9.51 (SE +/- 0.77, N = 6)
Radeon RX 460 - ROCm: 9.57 (SE +/- 0.03, N = 3)
GeForce GTX 1050 Ti: 13.97 (SE +/- 0.01, N = 3)
GeForce GTX 1050: 15.45 (SE +/- 0.01, N = 3)

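Each result is reported with a standard error ("SE +/-") and a run count ("N"). The export does not state which formula Phoronix Test Suite uses, so the following is only a sketch under the conventional definition: sample standard deviation divided by the square root of N. The run times below are hypothetical and are not taken from this result file.

```python
import math
import statistics

def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))

# Hypothetical per-run times in seconds (NOT from the export).
runs = [4.20, 4.22, 4.24]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} s, SE +/- {se:.2f}, N = {len(runs)}")
```

Under this definition a small SE relative to the mean indicates that the individual runs agreed closely; results with a larger N in the export are ones where more runs were recorded.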
Darktable 2.2.1 - Test: Masskrug - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)
GeForce GTX 1080: 5.73 (SE +/- 0.01, N = 3)
GeForce GTX 1070: 5.74 (SE +/- 0.00, N = 3)
Radeon RX 480 - AMDGPU-PRO: 5.76 (SE +/- 0.04, N = 3)
GeForce GTX 1060: 5.90 (SE +/- 0.01, N = 3)
Radeon RX 480 - ROCm: 5.93 (SE +/- 0.08, N = 3)
Radeon R9 Fury - ROCm: 6.09 (SE +/- 0.00, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 6.30 (SE +/- 0.02, N = 3)
Radeon RX 460 - ROCm: 7.05 (SE +/- 0.09, N = 3)
Radeon RX 460 - AMDGPU-PRO: 7.20 (SE +/- 0.03, N = 3)
GeForce GTX 1050: 15.16 (SE +/- 0.00, N = 3)
GeForce GTX 1050 Ti: 15.44 (SE +/- 0.00, N = 3)

Darktable 2.2.1 - Test: Server Room - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)
GeForce GTX 1070: 0.99 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 0.99 (SE +/- 0.00, N = 3)
Radeon RX 480 - AMDGPU-PRO: 0.99 (SE +/- 0.01, N = 3)
Radeon RX 480 - ROCm: 0.99 (SE +/- 0.02, N = 4)
GeForce GTX 1060: 1.20 (SE +/- 0.01, N = 3)
Radeon R9 Fury - ROCm: 1.48 (SE +/- 0.02, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 1.79 (SE +/- 0.04, N = 6)
Radeon RX 460 - ROCm: 2.48 (SE +/- 0.03, N = 3)
Radeon RX 460 - AMDGPU-PRO: 2.83 (SE +/- 0.02, N = 3)
GeForce GTX 1050 Ti: 11.01 (SE +/- 0.01, N = 3)
GeForce GTX 1050: 11.78 (SE +/- 0.01, N = 3)

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Triad
GB/s, More Is Better (OpenBenchmarking.org)
GeForce GTX 1080: 12.20 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 12.08 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 11.85 (SE +/- 0.00, N = 3)
GeForce GTX 1050 Ti: 11.38 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 11.25 (SE +/- 0.01, N = 3)
Radeon R9 Fury - ROCm: 10.59 (SE +/- 0.04, N = 3)
Radeon RX 480 - AMDGPU-PRO: 9.40 (SE +/- 0.14, N = 4)
Radeon RX 480 - ROCm: 7.94 (SE +/- 0.01, N = 3)
Radeon RX 460 - AMDGPU-PRO: 6.25 (SE +/- 0.10, N = 3)
Radeon RX 460 - ROCm: 5.21 (SE +/- 0.00, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 4.12 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt; additional options listed for eight results: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: FFT SP
GFLOPS, More Is Better (OpenBenchmarking.org)
Radeon R9 Fury - AMDGPU-PRO: 751.86 (SE +/- 14.35, N = 3)
GeForce GTX 1080: 573.71 (SE +/- 6.31, N = 3)
Radeon RX 480 - AMDGPU-PRO: 508.20 (SE +/- 2.19, N = 3)
GeForce GTX 1070: 456.72 (SE +/- 6.56, N = 6)
Radeon RX 480 - ROCm: 403.22 (SE +/- 6.05, N = 3)
Radeon R9 Fury - ROCm: 399.71 (SE +/- 0.44, N = 3)
GeForce GTX 1060: 296.88 (SE +/- 4.87, N = 3)
Radeon RX 460 - AMDGPU-PRO: 245.13 (SE +/- 1.23, N = 3)
GeForce GTX 1050: 223.30 (SE +/- 2.58, N = 3)
GeForce GTX 1050 Ti: 188.16 (SE +/- 2.31, N = 3)
Radeon RX 460 - ROCm: 158.21 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt; additional options listed for eight results: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Max SP Flops
GFLOPS, More Is Better (OpenBenchmarking.org)
GeForce GTX 1080: 9415.48 (SE +/- 70.36, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 7131.18 (SE +/- 0.69, N = 3)
GeForce GTX 1070: 7115.54 (SE +/- 52.21, N = 3)
Radeon RX 480 - ROCm: 5815.52 (SE +/- 4.14, N = 3)
Radeon RX 480 - AMDGPU-PRO: 5750.69 (SE +/- 30.75, N = 3)
Radeon R9 Fury - ROCm: 5330.67 (SE +/- 369.63, N = 6)
GeForce GTX 1060: 4780.88 (SE +/- 22.70, N = 3)
GeForce GTX 1050 Ti: 2697.13 (SE +/- 5.23, N = 3)
Radeon RX 460 - ROCm: 2158.12 (SE +/- 0.07, N = 3)
GeForce GTX 1050: 2115.38 (SE +/- 0.01, N = 3)
Radeon RX 460 - AMDGPU-PRO: 2066.69 (SE +/- 18.41, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt; additional options listed for eight results: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Bus Speed Download
GB/s, More Is Better (OpenBenchmarking.org)
Radeon R9 Fury - AMDGPU-PRO: 13.69 (SE +/- 0.01, N = 3)
Radeon RX 480 - AMDGPU-PRO: 13.66 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 12.78 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 12.78 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 12.78 (SE +/- 0.00, N = 3)
GeForce GTX 1050 Ti: 12.78 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 12.75 (SE +/- 0.00, N = 3)
Radeon R9 Fury - ROCm: 11.32 (SE +/- 0.01, N = 3)
Radeon RX 480 - ROCm: 8.37 (SE +/- 0.00, N = 3)
Radeon RX 460 - AMDGPU-PRO: 6.93 (SE +/- 0.00, N = 3)
Radeon RX 460 - ROCm: 5.72 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt; additional options listed for eight results: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Bus Speed Readback
GB/s, More Is Better (OpenBenchmarking.org)
Radeon R9 Fury - AMDGPU-PRO: 14.21 (SE +/- 0.01, N = 3)
Radeon RX 480 - AMDGPU-PRO: 14.20 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 13.22 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 13.22 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 13.22 (SE +/- 0.00, N = 3)
GeForce GTX 1050 Ti: 13.22 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 13.11 (SE +/- 0.03, N = 3)
Radeon R9 Fury - ROCm: 10.86 (SE +/- 0.03, N = 3)
Radeon RX 480 - ROCm: 8.38 (SE +/- 0.00, N = 3)
Radeon RX 460 - AMDGPU-PRO: 7.14 (SE +/- 0.00, N = 3)
Radeon RX 460 - ROCm: 5.27 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt; additional options listed for eight results: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s, More Is Better (OpenBenchmarking.org)
GeForce GTX 1080: 520.51 (SE +/- 1.14, N = 3)
GeForce GTX 1070: 446.64 (SE +/- 0.12, N = 3)
GeForce GTX 1060: 393.69 (SE +/- 0.96, N = 3)
GeForce GTX 1050 Ti: 316.10 (SE +/- 1.06, N = 3)
GeForce GTX 1050: 282.49 (SE +/- 0.98, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 223.25 (SE +/- 1.03, N = 3)
Radeon R9 Fury - ROCm: 214.53 (SE +/- 4.26, N = 3)
Radeon RX 480 - ROCm: 193.49 (SE +/- 1.30, N = 3)
Radeon RX 480 - AMDGPU-PRO: 160.57 (SE +/- 0.37, N = 3)
Radeon RX 460 - ROCm: 91.14 (SE +/- 0.16, N = 3)
Radeon RX 460 - AMDGPU-PRO: 77.35 (SE +/- 0.69, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt; additional options listed for eight results: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

Rodinia 2.4 - Test: OpenCL Heartwall
Seconds, Fewer Is Better (OpenBenchmarking.org)
GeForce GTX 1060: 3.36 (SE +/- 0.05, N = 5)
GeForce GTX 1050 Ti: 3.65 (SE +/- 0.04, N = 3)
GeForce GTX 1050: 5.27 (SE +/- 0.04, N = 3)
Radeon RX 480 - AMDGPU-PRO: 5.35 (SE +/- 0.02, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 6.38 (SE +/- 0.07, N = 3)
Radeon R9 Fury - ROCm: 6.45 (SE +/- 0.16, N = 6)
Radeon RX 480 - ROCm: 7.28 (SE +/- 0.03, N = 3)
Radeon RX 460 - AMDGPU-PRO: 7.97 (SE +/- 0.01, N = 3)
Radeon RX 460 - ROCm: 13.51 (SE +/- 0.08, N = 3)
(CXX) g++ options: -O2 -lOpenCL

MandelGPU 1.3pts1 - OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)
GeForce GTX 1080: 206148858.53
GeForce GTX 1070: 159458228.23
GeForce GTX 1060: 112043183.47
Radeon R9 Fury - AMDGPU-PRO: 107202116.40
Radeon R9 Fury - ROCm: 82051996.27
Radeon RX 480 - AMDGPU-PRO: 81101281.90
GeForce GTX 1050 Ti: 64272664.57
Radeon RX 480 - ROCm: 59296261.87
GeForce GTX 1050: 51548791.30
Radeon RX 460 - AMDGPU-PRO: 35552080.15
Radeon RX 460 - ROCm: 28295516.33
Standard errors as exported (9 entries for 11 results, in chart order): SE +/- 971382.09, N = 3; SE +/- 248567.98, N = 3; SE +/- 104172.86, N = 3; SE +/- 71744.88, N = 3; SE +/- 75826.86, N = 3; SE +/- 126265.24, N = 3; SE +/- 26110.91, N = 3; SE +/- 165178.15, N = 2; SE +/- 30521.44, N = 3
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

JuliaGPU 1.2pts1 - OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)
GeForce GTX 1080: 165302847.33
GeForce GTX 1070: 144431468.40
GeForce GTX 1060: 115523522.73
Radeon RX 480 - AMDGPU-PRO: 81972594.40
GeForce GTX 1050 Ti: 78171484.97
Radeon R9 Fury - AMDGPU-PRO: 75992404.70
Radeon R9 Fury - ROCm: 73072755.80
Radeon RX 480 - ROCm: 70675082.10
GeForce GTX 1050: 64896787.13
Radeon RX 460 - AMDGPU-PRO: 50807022.25
Radeon RX 460 - ROCm: 46101692.27
Standard errors as exported (10 entries for 11 results, in chart order): SE +/- 694138.93, N = 3; SE +/- 169012.99, N = 3; SE +/- 194570.11, N = 3; SE +/- 500734.10, N = 2; SE +/- 109924.23, N = 3; SE +/- 985012.76, N = 3; SE +/- 94849.94, N = 3; SE +/- 29908.92, N = 3; SE +/- 97714.65, N = 2; SE +/- 160084.77, N = 3
(CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

LuxMark 3.0 - OpenCL Device: GPU - Scene: Hotel
Score, More Is Better (OpenBenchmarking.org)
GeForce GTX 1070: 3023 (SE +/- 4.91, N = 3)
GeForce GTX 1080: 2993 (SE +/- 9.00, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 2402 (SE +/- 11.46, N = 3)
Radeon RX 480 - AMDGPU-PRO: 2399 (SE +/- 6.94, N = 3)
GeForce GTX 1060: 2092 (SE +/- 6.03, N = 3)
GeForce GTX 1050 Ti: 1334 (SE +/- 3.79, N = 3)
Radeon R9 Fury - ROCm: 1201 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 1128 (SE +/- 5.67, N = 3)
Radeon RX 480 - ROCm: 987 (SE +/- 2.40, N = 3)
Radeon RX 460 - AMDGPU-PRO: 897 (SE +/- 1.00, N = 3)
Radeon RX 460 - ROCm: 381 (SE +/- 0.58, N = 3)

LuxMark 3.0 - OpenCL Device: GPU - Scene: Microphone
Score, More Is Better (OpenBenchmarking.org)
Radeon R9 Fury - AMDGPU-PRO: 7681 (SE +/- 17.84, N = 3)
GeForce GTX 1070: 7302 (SE +/- 38.17, N = 3)
Radeon RX 480 - AMDGPU-PRO: 6924 (SE +/- 13.50, N = 3)
GeForce GTX 1080: 6388 (SE +/- 2.03, N = 3)
Radeon R9 Fury - ROCm: 5695 (SE +/- 15.04, N = 3)
GeForce GTX 1060: 5204 (SE +/- 3.51, N = 3)
GeForce GTX 1050 Ti: 3612 (SE +/- 2.52, N = 3)
GeForce GTX 1050: 3300 (SE +/- 3.38, N = 3)
Radeon RX 460 - AMDGPU-PRO: 2623 (SE +/- 6.98, N = 3)

LuxMark 3.0 - OpenCL Device: GPU - Scene: Luxball HDR
Score, More Is Better (OpenBenchmarking.org)
Radeon R9 Fury - AMDGPU-PRO: 19394 (SE +/- 75.47, N = 3)
GeForce GTX 1070: 16215 (SE +/- 2.31, N = 3)
Radeon RX 480 - AMDGPU-PRO: 14066 (SE +/- 68.10, N = 3)
GeForce GTX 1080: 12968 (SE +/- 12.45, N = 3)
Radeon R9 Fury - ROCm: 11995 (SE +/- 17.34, N = 3)
GeForce GTX 1060: 11768 (SE +/- 36.34, N = 3)
Radeon RX 480 - ROCm: 9196 (SE +/- 0.67, N = 3)
GeForce GTX 1050 Ti: 7391 (SE +/- 17.00, N = 3)
GeForce GTX 1050: 6656 (SE +/- 5.20, N = 3)
Radeon RX 460 - AMDGPU-PRO: 5547 (SE +/- 9.82, N = 3)
Radeon RX 460 - ROCm: 3664 (SE +/- 17.00, N = 3)

MandelbulbGPU 1.0pts1 - OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)
GeForce GTX 1080: 91109498.40
GeForce GTX 1070: 79620073.63
GeForce GTX 1060: 63345982.20
Radeon RX 480 - ROCm: 49050438.67
Radeon RX 480 - AMDGPU-PRO: 48517365.80
GeForce GTX 1050 Ti: 44889116.70
Radeon R9 Fury - ROCm: 44388927.12
Radeon R9 Fury - AMDGPU-PRO: 43447360.40
GeForce GTX 1050: 37667402.03
Radeon RX 460 - AMDGPU-PRO: 32208376.98
Radeon RX 460 - ROCm: 29562658.90
Standard errors as exported (9 entries for 11 results, in chart order): SE +/- 423859.17, N = 3; SE +/- 503324.21, N = 3; SE +/- 290297.61, N = 3; SE +/- 81023.55, N = 3; SE +/- 112131.74, N = 3; SE +/- 2304744.64, N = 6; SE +/- 36018.97, N = 3; SE +/- 561923.72, N = 4; SE +/- 117840.12, N = 3
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

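The three fractal tests (MandelGPU, JuliaGPU, MandelbulbGPU) all report samples/sec, so their ROCm-to-AMDGPU-PRO ratios can be combined into a single figure. The sketch below does this for the Radeon RX 480 with a geometric mean, a common choice for averaging ratios. The aggregation is an illustration only and is not something the export itself reports; the helper and variable names are assumptions.

```python
import math

def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))

# Radeon RX 480 results in samples/sec, copied from the result tables:
# test name -> (ROCm, AMDGPU-PRO)
rx480_samples_per_sec = {
    "MandelGPU": (59296261.87, 81101281.90),
    "JuliaGPU": (70675082.10, 81972594.40),
    "MandelbulbGPU": (49050438.67, 48517365.80),
}

# More samples/sec is better, so ROCm / AMDGPU-PRO > 1.0 favors ROCm.
ratios = {
    test: rocm / pro for test, (rocm, pro) in rx480_samples_per_sec.items()
}
overall = geometric_mean(ratios.values())

for test, ratio in ratios.items():
    print(f"{test}: ROCm at {ratio:.2f}x of AMDGPU-PRO")
print(f"Geometric mean over {len(ratios)} tests: {overall:.2f}x")
```

The geometric mean is used rather than the arithmetic mean so that a 2x gain in one test and a 2x loss in another cancel out to 1.0.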
Phoronix Test Suite v10.8.5