NVIDIA vs. OpenCL ROCm Linux vs. AMDGPU-PRO Benchmarks

XFX AMD Radeon R9 390

HTML result view exported from: https://openbenchmarking.org/result/1701214-RI-1701193RI10&sor.

System Configurations

GeForce GTX 1050
  Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)
  Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0
  Chipset: Intel Sky Lake
  Memory: 16384MB
  Disk: 256GB TOSHIBA-RD400
  Graphics: Zotac NVIDIA GeForce GTX 1050 2048MB (1075/3504MHz)
  Audio: Realtek ALC1150
  Network: Intel Connection
  OS: Ubuntu 16.04
  Kernel: 4.4.0-59-generic (x86_64)
  Desktop: Unity 7.4.0
  Display Server: X Server 1.18.3
  Display Driver: NVIDIA 375.26
  OpenGL: 4.5.0
  OpenCL: OpenCL 1.2 CUDA 8.0.0
  Vulkan: 1.0.24
  Compiler: GCC 5.4.0 20160609
  File-System: ext4
  Screen Resolution: 3840x2160

For the remaining configurations the export lists only these fields:

GeForce GTX 1050 Ti
  Graphics: eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz)

GeForce GTX 1060
  Graphics: NVIDIA GeForce GTX 1060 6GB 6144MB (418/4006MHz)

GeForce GTX 1070
  Graphics: NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz)

GeForce GTX 1080
  Graphics: NVIDIA GeForce GTX 1080 8192MB (109/5005MHz)

Radeon RX 460 - AMDGPU-PRO
  Graphics: AMD Radeon RX 460 2048MB
  Monitor: Acer B286HK
  Display Driver: amdgpu 1.1.99
  OpenGL: 4.5.13462
  OpenCL: OpenCL 2.0 AMD-APP (2236.5)

Radeon RX 480 - AMDGPU-PRO
  Graphics: AMD Radeon RX 480 8192MB

Radeon R9 Fury - AMDGPU-PRO
  Graphics: Sapphire AMD Radeon R9 Fury 4096MB

Radeon RX 460 - ROCm
  Graphics: LLVMpipe
  Kernel: 4.6.0-kfd-compute-rocm-rel-1.4-16 (x86_64)
  Display Driver: modesetting 1.18.3
  OpenGL: 3.3 Mesa 11.2.0 Gallium 0.4
  OpenCL: OpenCL 2.0 AMD-APP (2300.5)
  Compiler: GCC 5.4.0 20160609 + Clang 4.0 + LLVM 4.0.0

Radeon RX 480 - ROCm
  (no separate fields listed)

Radeon R9 Fury - ROCm
  Graphics: Sapphire AMD Radeon R9 FURY / NANO 3968MB
  OpenGL: 4.1 Mesa 11.2.0 Gallium 0.4

XFX AMD Radeon R9 390
  Processor: AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)
  Motherboard: ASRock 970DE3/U3S3
  Chipset: AMD RX780/RX790 + SB7x0/SB8x0/SB9x0
  Memory: 32768MB
  Disk: 128GB ADATA SP600 + 128GB OCZ AGILITY4 + 480GB TS480GSSD220S
  Graphics: XFX AMD Radeon R9 390 8192MB
  Audio: Realtek ALC662 rev1
  Monitor: VG2732 + SyncMaster
  Network: Realtek RTL8111/8168/8411
  Display Server: X Server 1.18.4
  Display Driver: modesetting 1.18.4
  OpenGL: 4.5.13462
  OpenCL: OpenCL 2.0 AMD-APP (2236.5)
  Vulkan: 1.0.8
  Compiler: GCC 5.4.1 20160904 + CUDA 8.0
  Screen Resolution: 1920x1200

Compiler Details
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
  Scaling Governor: intel_pstate powersave (all GeForce, AMDGPU-PRO, and ROCm configurations)
  Scaling Governor: acpi-cpufreq ondemand (XFX AMD Radeon R9 390)

OpenCL Details
  GeForce GTX 1050: GPU Compute Cores: 640
  GeForce GTX 1050 Ti: GPU Compute Cores: 768
  GeForce GTX 1060: GPU Compute Cores: 1280
  GeForce GTX 1070: GPU Compute Cores: 1920
  GeForce GTX 1080: GPU Compute Cores: 2560

System Details
  GPU Compute Cores: same values as under OpenCL Details

Graphics Details
  GLAMOR: Radeon RX 460 - AMDGPU-PRO, Radeon RX 480 - AMDGPU-PRO, Radeon R9 Fury - AMDGPU-PRO, Radeon R9 Fury - ROCm, XFX AMD Radeon R9 390

Environment Details
  LIBGL_ALWAYS_SOFTWARE=1: Radeon RX 460 - ROCm, Radeon RX 480 - ROCm

Result Overview

The result file covers the tests listed here. Per-system values for each test are given in the sections that follow.

  SHOC, OpenCL: Triad (GB/s)
  SHOC, OpenCL: FFT SP (GFLOPS)
  SHOC, OpenCL: Max SP Flops (GFLOPS)
  SHOC, OpenCL: Bus Speed Download (GB/s)
  SHOC, OpenCL: Bus Speed Readback (GB/s)
  SHOC, OpenCL: Texture Read Bandwidth (GB/s)
  Rodinia: OpenCL Heartwall (Seconds)
  Darktable 2.2.1, OpenCL: Boat, Masskrug, Server Room (Seconds)
  JuliaGPU: GPU (Samples/sec)
  MandelbulbGPU: GPU (Samples/sec)
  MandelGPU: GPU (Samples/sec)
  LuxMark, GPU: Hotel, Microphone, Luxball HDR (Score)
  Darktable 2.0.3, OpenCL: Boat, Masskrug, Server Room (Seconds; XFX AMD Radeon R9 390 only)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2015-11-10, GB/s, More Is Better

System                        GB/s   SE +/-  N
GeForce GTX 1080              12.20  0.00    3
GeForce GTX 1070              12.08  0.00    3
GeForce GTX 1060              11.85  0.00    3
GeForce GTX 1050 Ti           11.38  0.00    3
GeForce GTX 1050              11.25  0.01    3
Radeon R9 Fury - ROCm         10.59  0.04    3
Radeon RX 480 - AMDGPU-PRO     9.40  0.14    4
Radeon RX 480 - ROCm           7.94  0.01    3
Radeon RX 460 - AMDGPU-PRO     6.25  0.10    3
XFX AMD Radeon R9 390          5.22  0.00    3
Radeon RX 460 - ROCm           5.21  0.00    3
Radeon R9 Fury - AMDGPU-PRO    4.12  0.01    3

(CXX) g++ options: -O2 -lSHOCCommon -lrt
Additional link flags, GeForce and AMDGPU-PRO runs: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional link flags, ROCm runs: -lSHOCCommonOpenCL -lOpenCL
Additional link flags, XFX AMD Radeon R9 390: -lcudadevrt -lcudart_static -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10, GFLOPS, More Is Better

System                        GFLOPS  SE +/-  N
XFX AMD Radeon R9 390         861.39   6.83   3
Radeon R9 Fury - AMDGPU-PRO   751.86  14.35   3
GeForce GTX 1080              573.71   6.31   3
Radeon RX 480 - AMDGPU-PRO    508.20   2.19   3
GeForce GTX 1070              456.72   6.56   6
Radeon RX 480 - ROCm          403.22   6.05   3
Radeon R9 Fury - ROCm         399.71   0.44   3
GeForce GTX 1060              296.88   4.87   3
Radeon RX 460 - AMDGPU-PRO    245.13   1.23   3
GeForce GTX 1050              223.30   2.58   3
GeForce GTX 1050 Ti           188.16   2.31   3
Radeon RX 460 - ROCm          158.21   0.04   3

(CXX) g++ options: -O2 -lSHOCCommon -lrt
Additional link flags, GeForce and AMDGPU-PRO runs: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional link flags, ROCm runs: -lSHOCCommonOpenCL -lOpenCL
Additional link flags, XFX AMD Radeon R9 390: -lcudadevrt -lcudart_static -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2015-11-10, GFLOPS, More Is Better

System                        GFLOPS   SE +/-  N
GeForce GTX 1080              9415.48   70.36  3
Radeon R9 Fury - AMDGPU-PRO   7131.18    0.69  3
GeForce GTX 1070              7115.54   52.21  3
Radeon RX 480 - ROCm          5815.52    4.14  3
Radeon RX 480 - AMDGPU-PRO    5750.69   30.75  3
Radeon R9 Fury - ROCm         5330.67  369.63  6
XFX AMD Radeon R9 390         5173.23    2.31  3
GeForce GTX 1060              4780.88   22.70  3
GeForce GTX 1050 Ti           2697.13    5.23  3
Radeon RX 460 - ROCm          2158.12    0.07  3
GeForce GTX 1050              2115.38    0.01  3
Radeon RX 460 - AMDGPU-PRO    2066.69   18.41  3

(CXX) g++ options: -O2 -lSHOCCommon -lrt
Additional link flags, GeForce and AMDGPU-PRO runs: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional link flags, ROCm runs: -lSHOCCommonOpenCL -lOpenCL
Additional link flags, XFX AMD Radeon R9 390: -lcudadevrt -lcudart_static -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2015-11-10, GB/s, More Is Better

System                        GB/s   SE +/-  N
Radeon R9 Fury - AMDGPU-PRO   13.69  0.01    3
Radeon RX 480 - AMDGPU-PRO    13.66  0.00    3
GeForce GTX 1080              12.78  0.00    3
GeForce GTX 1070              12.78  0.00    3
GeForce GTX 1060              12.78  0.00    3
GeForce GTX 1050 Ti           12.78  0.00    3
GeForce GTX 1050              12.75  0.00    3
Radeon R9 Fury - ROCm         11.32  0.01    3
Radeon RX 480 - ROCm           8.37  0.00    3
Radeon RX 460 - AMDGPU-PRO     6.93  0.00    3
XFX AMD Radeon R9 390          5.73  0.01    3
Radeon RX 460 - ROCm           5.72  0.00    3

(CXX) g++ options: -O2 -lSHOCCommon -lrt
Additional link flags, GeForce and AMDGPU-PRO runs: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional link flags, ROCm runs: -lSHOCCommonOpenCL -lOpenCL
Additional link flags, XFX AMD Radeon R9 390: -lcudadevrt -lcudart_static -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2015-11-10, GB/s, More Is Better

System                        GB/s   SE +/-  N
Radeon R9 Fury - AMDGPU-PRO   14.21  0.01    3
Radeon RX 480 - AMDGPU-PRO    14.20  0.00    3
GeForce GTX 1080              13.22  0.00    3
GeForce GTX 1070              13.22  0.00    3
GeForce GTX 1060              13.22  0.00    3
GeForce GTX 1050 Ti           13.22  0.00    3
GeForce GTX 1050              13.11  0.03    3
Radeon R9 Fury - ROCm         10.86  0.03    3
Radeon RX 480 - ROCm           8.38  0.00    3
Radeon RX 460 - AMDGPU-PRO     7.14  0.00    3
XFX AMD Radeon R9 390          5.66  0.10    6
Radeon RX 460 - ROCm           5.27  0.01    3

(CXX) g++ options: -O2 -lSHOCCommon -lrt
Additional link flags, GeForce and AMDGPU-PRO runs: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional link flags, ROCm runs: -lSHOCCommonOpenCL -lOpenCL
Additional link flags, XFX AMD Radeon R9 390: -lcudadevrt -lcudart_static -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10, GB/s, More Is Better

System                        GB/s    SE +/-  N
GeForce GTX 1080              520.51  1.14    3
GeForce GTX 1070              446.64  0.12    3
GeForce GTX 1060              393.69  0.96    3
GeForce GTX 1050 Ti           316.10  1.06    3
GeForce GTX 1050              282.49  0.98    3
XFX AMD Radeon R9 390         262.53  0.79    3
Radeon R9 Fury - AMDGPU-PRO   223.25  1.03    3
Radeon R9 Fury - ROCm         214.53  4.26    3
Radeon RX 480 - ROCm          193.49  1.30    3
Radeon RX 480 - AMDGPU-PRO    160.57  0.37    3
Radeon RX 460 - ROCm           91.14  0.16    3
Radeon RX 460 - AMDGPU-PRO     77.35  0.69    3

(CXX) g++ options: -O2 -lSHOCCommon -lrt
Additional link flags, GeForce and AMDGPU-PRO runs: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional link flags, ROCm runs: -lSHOCCommonOpenCL -lOpenCL
Additional link flags, XFX AMD Radeon R9 390: -lcudadevrt -lcudart_static -lpthread -ldl -lcufft

Rodinia

Test: OpenCL Heartwall

Rodinia 2.4, Seconds, Fewer Is Better

System                        Seconds  SE +/-  N
GeForce GTX 1060               3.36    0.05    5
GeForce GTX 1050 Ti            3.65    0.04    3
XFX AMD Radeon R9 390          5.14    0.02    3
GeForce GTX 1050               5.27    0.04    3
Radeon RX 480 - AMDGPU-PRO     5.35    0.02    3
Radeon R9 Fury - AMDGPU-PRO    6.38    0.07    3
Radeon R9 Fury - ROCm          6.45    0.16    6
Radeon RX 480 - ROCm           7.28    0.03    3
Radeon RX 460 - AMDGPU-PRO     7.97    0.01    3
Radeon RX 460 - ROCm          13.51    0.08    3

(CXX) g++ options: -O2 -lOpenCL

Darktable

Test: Boat - Acceleration: OpenCL

Darktable 2.2.1, Seconds, Fewer Is Better

System                        Seconds  SE +/-  N
GeForce GTX 1080               3.72    0.00    3
GeForce GTX 1070               3.87    0.00    3
Radeon R9 Fury - AMDGPU-PRO    4.22    0.07    3
Radeon RX 480 - AMDGPU-PRO     4.37    0.01    3
GeForce GTX 1060               4.67    0.01    3
Radeon R9 Fury - ROCm          4.98    0.03    3
Radeon RX 480 - ROCm           5.72    0.02    3
Radeon RX 460 - AMDGPU-PRO     9.51    0.77    6
Radeon RX 460 - ROCm           9.57    0.03    3
GeForce GTX 1050 Ti           13.97    0.01    3
GeForce GTX 1050              15.45    0.01    3

Darktable

Test: Masskrug - Acceleration: OpenCL

Darktable 2.2.1, Seconds, Fewer Is Better

System                        Seconds  SE +/-  N
GeForce GTX 1080               5.73    0.01    3
GeForce GTX 1070               5.74    0.00    3
Radeon RX 480 - AMDGPU-PRO     5.76    0.04    3
GeForce GTX 1060               5.90    0.01    3
Radeon RX 480 - ROCm           5.93    0.08    3
Radeon R9 Fury - ROCm          6.09    0.00    3
Radeon R9 Fury - AMDGPU-PRO    6.30    0.02    3
Radeon RX 460 - ROCm           7.05    0.09    3
Radeon RX 460 - AMDGPU-PRO     7.20    0.03    3
GeForce GTX 1050              15.16    0.00    3
GeForce GTX 1050 Ti           15.44    0.00    3

Darktable

Test: Server Room - Acceleration: OpenCL

Darktable 2.2.1, Seconds, Fewer Is Better

System                        Seconds  SE +/-  N
GeForce GTX 1070               0.99    0.00    3
GeForce GTX 1080               0.99    0.00    3
Radeon RX 480 - AMDGPU-PRO     0.99    0.01    3
Radeon RX 480 - ROCm           0.99    0.02    4
GeForce GTX 1060               1.20    0.01    3
Radeon R9 Fury - ROCm          1.48    0.02    3
Radeon R9 Fury - AMDGPU-PRO    1.79    0.04    6
Radeon RX 460 - ROCm           2.48    0.03    3
Radeon RX 460 - AMDGPU-PRO     2.83    0.02    3
GeForce GTX 1050 Ti           11.01    0.01    3
GeForce GTX 1050              11.78    0.01    3

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1, Samples/sec, More Is Better

System                        Samples/sec
GeForce GTX 1080              165302847.33
GeForce GTX 1070              144431468.40
GeForce GTX 1060              115523522.73
Radeon RX 480 - AMDGPU-PRO     81972594.40
GeForce GTX 1050 Ti            78171484.97
Radeon R9 Fury - AMDGPU-PRO    75992404.70
Radeon R9 Fury - ROCm          73072755.80
Radeon RX 480 - ROCm           70675082.10
GeForce GTX 1050               64896787.13
Radeon RX 460 - AMDGPU-PRO     50807022.25
Radeon RX 460 - ROCm           46101692.27

SE +/- values as exported (10 entries for 11 systems, so they cannot be matched to systems reliably): 694138.93 (N = 3), 169012.99 (N = 3), 194570.11 (N = 3), 500734.10 (N = 2), 109924.23 (N = 3), 985012.76 (N = 3), 94849.94 (N = 3), 29908.92 (N = 3), 97714.65 (N = 2), 160084.77 (N = 3)

(CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

OpenCL Device: GPU

MandelbulbGPU 1.0pts1, Samples/sec, More Is Better

System                        Samples/sec
GeForce GTX 1080              91109498.40
GeForce GTX 1070              79620073.63
GeForce GTX 1060              63345982.20
XFX AMD Radeon R9 390         59949695.30
Radeon RX 480 - ROCm          49050438.67
Radeon RX 480 - AMDGPU-PRO    48517365.80
GeForce GTX 1050 Ti           44889116.70
Radeon R9 Fury - ROCm         44388927.12
Radeon R9 Fury - AMDGPU-PRO   43447360.40
GeForce GTX 1050              37667402.03
Radeon RX 460 - AMDGPU-PRO    32208376.98
Radeon RX 460 - ROCm          29562658.90

SE +/- values as exported (9 entries for 12 systems, so they cannot be matched to systems reliably): 423859.17 (N = 3), 503324.21 (N = 3), 290297.61 (N = 3), 81023.55 (N = 3), 112131.74 (N = 3), 2304744.64 (N = 6), 36018.97 (N = 3), 561923.72 (N = 4), 117840.12 (N = 3)

(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1, Samples/sec, More Is Better

System                        Samples/sec
GeForce GTX 1080              206148858.53
GeForce GTX 1070              159458228.23
GeForce GTX 1060              112043183.47
Radeon R9 Fury - AMDGPU-PRO   107202116.40
Radeon R9 Fury - ROCm          82051996.27
Radeon RX 480 - AMDGPU-PRO     81101281.90
GeForce GTX 1050 Ti            64272664.57
Radeon RX 480 - ROCm           59296261.87
GeForce GTX 1050               51548791.30
Radeon RX 460 - AMDGPU-PRO     35552080.15
Radeon RX 460 - ROCm           28295516.33

SE +/- values as exported (9 entries for 11 systems, so they cannot be matched to systems reliably): 971382.09 (N = 3), 248567.98 (N = 3), 104172.86 (N = 3), 71744.88 (N = 3), 75826.86 (N = 3), 126265.24 (N = 3), 26110.91 (N = 3), 165178.15 (N = 2), 30521.44 (N = 3)

(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.0, Score, More Is Better

System                        Score  SE +/-  N
GeForce GTX 1070              3023    4.91   3
GeForce GTX 1080              2993    9.00   3
Radeon R9 Fury - AMDGPU-PRO   2402   11.46   3
Radeon RX 480 - AMDGPU-PRO    2399    6.94   3
GeForce GTX 1060              2092    6.03   3
XFX AMD Radeon R9 390         2077    2.08   3
GeForce GTX 1050 Ti           1334    3.79   3
Radeon R9 Fury - ROCm         1201    0.00   3
GeForce GTX 1050              1128    5.67   3
Radeon RX 480 - ROCm           987    2.40   3
Radeon RX 460 - AMDGPU-PRO     897    1.00   3
Radeon RX 460 - ROCm           381    0.58   3

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.0, Score, More Is Better

System                        Score  SE +/-  N
Radeon R9 Fury - AMDGPU-PRO   7681   17.84   3
GeForce GTX 1070              7302   38.17   3
Radeon RX 480 - AMDGPU-PRO    6924   13.50   3
XFX AMD Radeon R9 390         6644   14.84   3
GeForce GTX 1080              6388    2.03   3
Radeon R9 Fury - ROCm         5695   15.04   3
GeForce GTX 1060              5204    3.51   3
GeForce GTX 1050 Ti           3612    2.52   3
GeForce GTX 1050              3300    3.38   3
Radeon RX 460 - AMDGPU-PRO    2623    6.98   3

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0, Score, More Is Better

System                        Score  SE +/-  N
Radeon R9 Fury - AMDGPU-PRO   19394  75.47   3
XFX AMD Radeon R9 390         16870  59.62   3
GeForce GTX 1070              16215   2.31   3
Radeon RX 480 - AMDGPU-PRO    14066  68.10   3
GeForce GTX 1080              12968  12.45   3
Radeon R9 Fury - ROCm         11995  17.34   3
GeForce GTX 1060              11768  36.34   3
Radeon RX 480 - ROCm           9196   0.67   3
GeForce GTX 1050 Ti            7391  17.00   3
GeForce GTX 1050               6656   5.20   3
Radeon RX 460 - AMDGPU-PRO     5547   9.82   3
Radeon RX 460 - ROCm           3664  17.00   3

Darktable

Test: Boat - Acceleration: OpenCL

Darktable 2.0.3, Seconds, Fewer Is Better

System                        Seconds  SE +/-  N
XFX AMD Radeon R9 390         4.21     0.05    3

Darktable

Test: Masskrug - Acceleration: OpenCL

Darktable 2.0.3, Seconds, Fewer Is Better

System                        Seconds  SE +/-  N
XFX AMD Radeon R9 390         0.27     0.00    4

Darktable

Test: Server Room - Acceleration: OpenCL

Darktable 2.0.3, Seconds, Fewer Is Better

System                        Seconds  SE +/-  N
XFX AMD Radeon R9 390         0.27     0.01    6


Phoronix Test Suite v10.8.4