NVIDIA vs. OpenCL ROCm Linux vs. AMDGPU-PRO Benchmarks

ROCm 1.4 benchmarks on Ubuntu 16.04 compared to AMDGPU-PRO. Now with NVIDIA comparison points. OpenCL benchmarks by Michael Larabel for a future article on Phoronix.com.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1701190-KH-1701193RI82
Result identifiers and run dates:

GeForce GTX 1050: January 18 2017
GeForce GTX 1050 Ti: January 18 2017
GeForce GTX 1060: January 18 2017
GeForce GTX 1070: January 18 2017
GeForce GTX 1080: January 18 2017
Radeon RX 460 - AMDGPU-PRO: January 16 2017
Radeon RX 480 - AMDGPU-PRO: January 16 2017
Radeon R9 Fury - AMDGPU-PRO: January 16 2017
Radeon RX 460 - ROCm: January 16 2017
Radeon RX 480 - ROCm: January 16 2017
Radeon R9 Fury - ROCm: January 15 2017

System Configuration

Values for the GeForce GTX 1050 run:

Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)
Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0
Chipset: Intel Sky Lake
Memory: 16384MB
Disk: 256GB TOSHIBA-RD400
Graphics: Zotac NVIDIA GeForce GTX 1050 2048MB (1075/3504MHz)
Audio: Realtek ALC1150
Network: Intel Connection
OS: Ubuntu 16.04
Kernel: 4.4.0-59-generic (x86_64)
Desktop: Unity 7.4.0
Display Server: X Server 1.18.3
Display Driver: NVIDIA 375.26
OpenGL: 4.5.0
OpenCL: OpenCL 1.2 CUDA 8.0.0
Vulkan: 1.0.24
Compiler: GCC 5.4.0 20160609
File-System: ext4
Screen Resolution: 3840x2160

Values that differ in the later runs, in the order the source lists them:

GeForce GTX 1050 Ti: Graphics: eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz)
GeForce GTX 1060: Graphics: NVIDIA GeForce GTX 1060 6GB 6144MB (418/4006MHz)
GeForce GTX 1070: Graphics: NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz)
GeForce GTX 1080: Graphics: NVIDIA GeForce GTX 1080 8192MB (109/5005MHz)
Radeon RX 460 - AMDGPU-PRO: Graphics: AMD Radeon RX 460 2048MB; Monitor: Acer B286HK; Display Driver: amdgpu 1.1.99; OpenGL: 4.5.13462; OpenCL: OpenCL 2.0 AMD-APP (2236.5)
Radeon RX 480 - AMDGPU-PRO: Graphics: AMD Radeon RX 480 8192MB
Radeon R9 Fury - AMDGPU-PRO: Graphics: Sapphire AMD Radeon R9 Fury 4096MB
Radeon RX 460 - ROCm: Graphics: LLVMpipe; Kernel: 4.6.0-kfd-compute-rocm-rel-1.4-16 (x86_64); Display Driver: modesetting 1.18.3; OpenGL: 3.3 Mesa 11.2.0 Gallium 0.4; OpenCL: OpenCL 2.0 AMD-APP (2300.5); Compiler: GCC 5.4.0 20160609 + Clang 4.0 + LLVM 4.0.0
Radeon RX 480 - ROCm: no further changes listed
Radeon R9 Fury - ROCm: Graphics: Sapphire AMD Radeon R9 FURY / NANO 3968MB; OpenGL: 4.1 Mesa 11.2.0 Gallium 0.4

Compiler Details: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details: Scaling Governor: intel_pstate powersave

OpenCL Details:
GeForce GTX 1050: GPU Compute Cores: 640
GeForce GTX 1050 Ti: GPU Compute Cores: 768
GeForce GTX 1060: GPU Compute Cores: 1280
GeForce GTX 1070: GPU Compute Cores: 1920
GeForce GTX 1080: GPU Compute Cores: 2560

Graphics Details: Radeon RX 460 - AMDGPU-PRO, Radeon RX 480 - AMDGPU-PRO, Radeon R9 Fury - AMDGPU-PRO, Radeon R9 Fury - ROCm: GLAMOR

Environment Details: Radeon RX 460 - ROCm, Radeon RX 480 - ROCm: LIBGL_ALWAYS_SOFTWARE=1

Result Overview (Phoronix Test Suite): chart of relative performance for all eleven runs across MandelGPU, LuxMark, Darktable, JuliaGPU, SHOC Scalable HeterOgeneous Computing, and MandelbulbGPU.

Result Summary (OpenBenchmarking.org)

Each line lists one run's results in this test order: darktable: Boat - OpenCL, darktable: Masskrug - OpenCL, darktable: Server Room - OpenCL, shoc: OpenCL - Triad, shoc: OpenCL - FFT SP, shoc: OpenCL - Max SP Flops, shoc: OpenCL - Bus Speed Download, shoc: OpenCL - Bus Speed Readback, shoc: OpenCL - Texture Read Bandwidth, rodinia: OpenCL Heartwall, mandelgpu: GPU, juliagpu: GPU, luxmark: GPU - Hotel, luxmark: GPU - Microphone, luxmark: GPU - Luxball HDR, mandelbulbgpu: GPU. "n/a" marks a result that the source does not list.

GeForce GTX 1050: 15.45, 15.16, 11.78, 11.25, 223.30, 2115.38, 12.75, 13.11, 282.49, 5.27, 51548791.30, 64896787.13, 1128, 3300, 6656, 37667402.03
GeForce GTX 1050 Ti: 13.97, 15.44, 11.01, 11.38, 188.16, 2697.13, 12.78, 13.22, 316.10, 3.65, 64272664.57, 78171484.97, 1334, 3612, 7391, 44889116.70
GeForce GTX 1060: 4.67, 5.90, 1.20, 11.85, 296.88, 4780.88, 12.78, 13.22, 393.69, 3.36, 112043183.47, 115523522.73, 2092, 5204, 11768, 63345982.20
GeForce GTX 1070: 3.87, 5.74, 0.99, 12.08, 456.72, 7115.54, 12.78, 13.22, 446.64, n/a, 159458228.23, 144431468.40, 3023, 7302, 16215, 79620073.63
GeForce GTX 1080: 3.72, 5.73, 0.99, 12.20, 573.71, 9415.48, 12.78, 13.22, 520.51, n/a, 206148858.53, 165302847.33, 2993, 6388, 12968, 91109498.40
Radeon RX 460 - AMDGPU-PRO: 9.51, 7.20, 2.83, 6.25, 245.13, 2066.69, 6.93, 7.14, 77.35, 7.97, 35552080.15, 50807022.25, 897, 2623, 5547, 32208376.98
Radeon RX 480 - AMDGPU-PRO: 4.37, 5.76, 0.99, 9.40, 508.20, 5750.69, 13.66, 14.20, 160.57, 5.35, 81101281.90, 81972594.40, 2399, 6924, 14066, 48517365.80
Radeon R9 Fury - AMDGPU-PRO: 4.22, 6.30, 1.79, 4.12, 751.86, 7131.18, 13.69, 14.21, 223.25, 6.38, 107202116.40, 75992404.70, 2402, 7681, 19394, 43447360.40
Radeon RX 460 - ROCm: 9.57, 7.05, 2.48, 5.21, 158.21, 2158.12, 5.72, 5.27, 91.14, 13.51, 28295516.33, 46101692.27, 381, n/a, 3664, 29562658.90
Radeon RX 480 - ROCm: 5.72, 5.93, 0.99, 7.94, 403.22, 5815.52, 8.37, 8.38, 193.49, 7.28, 59296261.87, 70675082.10, 987, n/a, 9196, 49050438.67
Radeon R9 Fury - ROCm: 4.98, 6.09, 1.48, 10.59, 399.71, 5330.67, 11.32, 10.86, 214.53, 6.45, 82051996.27, 73072755.80, 1201, 5695, 11995, 44388927.12
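One way to read the summary is to compare the two AMD driver stacks directly. The sketch below is only an illustration: it copies the six SHOC OpenCL results (all "more is better") for each Radeon card from this result file and computes the ROCm / AMDGPU-PRO ratio per test plus a geometric mean of those ratios. The names `SHOC_RESULTS`, `ratios`, and `geometric_mean` are made up for this example and are not part of the Phoronix Test Suite.

```python
import math

# SHOC OpenCL results copied from this result file (all "more is better").
# Value order: Triad, FFT SP, Max SP Flops, Bus Speed Download,
# Bus Speed Readback, Texture Read Bandwidth.
SHOC_RESULTS = {
    "Radeon RX 460": {
        "AMDGPU-PRO": [6.25, 245.13, 2066.69, 6.93, 7.14, 77.35],
        "ROCm": [5.21, 158.21, 2158.12, 5.72, 5.27, 91.14],
    },
    "Radeon RX 480": {
        "AMDGPU-PRO": [9.40, 508.20, 5750.69, 13.66, 14.20, 160.57],
        "ROCm": [7.94, 403.22, 5815.52, 8.37, 8.38, 193.49],
    },
    "Radeon R9 Fury": {
        "AMDGPU-PRO": [4.12, 751.86, 7131.18, 13.69, 14.21, 223.25],
        "ROCm": [10.59, 399.71, 5330.67, 11.32, 10.86, 214.53],
    },
}


def ratios(card):
    """Per-test ROCm / AMDGPU-PRO ratios; above 1.0 means ROCm was faster."""
    pro = SHOC_RESULTS[card]["AMDGPU-PRO"]
    rocm = SHOC_RESULTS[card]["ROCm"]
    return [r / p for r, p in zip(rocm, pro)]


def geometric_mean(values):
    """Geometric mean, the usual way to average performance ratios."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    for card in SHOC_RESULTS:
        gm = geometric_mean(ratios(card))
        print(f"{card}: ROCm / AMDGPU-PRO geometric mean = {gm:.2f}")
```

A geometric mean is used here because the six tests report different units and magnitudes; averaging ratios this way keeps one large number from dominating.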

Darktable

Darktable 2.2.1 - Test: Boat - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)
GeForce GTX 1050: 15.45 (SE +/- 0.01, N = 3)
GeForce GTX 1050 Ti: 13.97 (SE +/- 0.01, N = 3)
Radeon RX 460 - ROCm: 9.57 (SE +/- 0.03, N = 3)
Radeon RX 460 - AMDGPU-PRO: 9.51 (SE +/- 0.77, N = 6)
Radeon RX 480 - ROCm: 5.72 (SE +/- 0.02, N = 3)
Radeon R9 Fury - ROCm: 4.98 (SE +/- 0.03, N = 3)
GeForce GTX 1060: 4.67 (SE +/- 0.01, N = 3)
Radeon RX 480 - AMDGPU-PRO: 4.37 (SE +/- 0.01, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 4.22 (SE +/- 0.07, N = 3)
GeForce GTX 1070: 3.87 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 3.72 (SE +/- 0.00, N = 3)

Darktable 2.2.1 - Test: Masskrug - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)
GeForce GTX 1050 Ti: 15.44 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 15.16 (SE +/- 0.00, N = 3)
Radeon RX 460 - AMDGPU-PRO: 7.20 (SE +/- 0.03, N = 3)
Radeon RX 460 - ROCm: 7.05 (SE +/- 0.09, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 6.30 (SE +/- 0.02, N = 3)
Radeon R9 Fury - ROCm: 6.09 (SE +/- 0.00, N = 3)
Radeon RX 480 - ROCm: 5.93 (SE +/- 0.08, N = 3)
GeForce GTX 1060: 5.90 (SE +/- 0.01, N = 3)
Radeon RX 480 - AMDGPU-PRO: 5.76 (SE +/- 0.04, N = 3)
GeForce GTX 1070: 5.74 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 5.73 (SE +/- 0.01, N = 3)

Darktable 2.2.1 - Test: Server Room - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)
GeForce GTX 1050: 11.78 (SE +/- 0.01, N = 3)
GeForce GTX 1050 Ti: 11.01 (SE +/- 0.01, N = 3)
Radeon RX 460 - AMDGPU-PRO: 2.83 (SE +/- 0.02, N = 3)
Radeon RX 460 - ROCm: 2.48 (SE +/- 0.03, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 1.79 (SE +/- 0.04, N = 6)
Radeon R9 Fury - ROCm: 1.48 (SE +/- 0.02, N = 3)
GeForce GTX 1060: 1.20 (SE +/- 0.01, N = 3)
Radeon RX 480 - ROCm: 0.99 (SE +/- 0.02, N = 4)
Radeon RX 480 - AMDGPU-PRO: 0.99 (SE +/- 0.01, N = 3)
GeForce GTX 1080: 0.99 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 0.99 (SE +/- 0.00, N = 3)
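Each result is reported with "SE +/- x, N = n". Assuming SE means the standard error of the mean over N runs (the usual definition; the exact formula the Phoronix Test Suite applies is not stated on this page), it can be sketched as below. The run times in the usage part are hypothetical and are not taken from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumption: sample (N - 1) standard deviation is used; the benchmark
    tool may compute its SE differently.
    """
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate spread")
    return statistics.stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical timings in seconds for three runs of one test.
    runs = [9.40, 9.55, 9.58]
    mean = statistics.mean(runs)
    print(f"{mean:.2f} (SE +/- {standard_error(runs):.2f}, N = {len(runs)})")
```

A larger SE next to a larger N, as seen for a few results here, is consistent with a benchmark harness repeating a test more often when the first runs disagree, although this page does not say so explicitly.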

SHOC Scalable HeterOgeneous Computing

This test runs the CUDA and OpenCL versions of Vetter's Scalable HeterOgeneous Computing (SHOC) benchmark suite. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Triad
GB/s, More Is Better (OpenBenchmarking.org)
Radeon R9 Fury - AMDGPU-PRO: 4.12 (SE +/- 0.01, N = 3)
Radeon RX 460 - ROCm: 5.21 (SE +/- 0.00, N = 3)
Radeon RX 460 - AMDGPU-PRO: 6.25 (SE +/- 0.10, N = 3)
Radeon RX 480 - ROCm: 7.94 (SE +/- 0.01, N = 3)
Radeon RX 480 - AMDGPU-PRO: 9.40 (SE +/- 0.14, N = 4)
Radeon R9 Fury - ROCm: 10.59 (SE +/- 0.04, N = 3)
GeForce GTX 1050: 11.25 (SE +/- 0.01, N = 3)
GeForce GTX 1050 Ti: 11.38 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 11.85 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 12.08 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 12.20 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
Additional options listed for eight of the runs: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: FFT SP
GFLOPS, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 158.21 (SE +/- 0.04, N = 3)
GeForce GTX 1050 Ti: 188.16 (SE +/- 2.31, N = 3)
GeForce GTX 1050: 223.30 (SE +/- 2.58, N = 3)
Radeon RX 460 - AMDGPU-PRO: 245.13 (SE +/- 1.23, N = 3)
GeForce GTX 1060: 296.88 (SE +/- 4.87, N = 3)
Radeon R9 Fury - ROCm: 399.71 (SE +/- 0.44, N = 3)
Radeon RX 480 - ROCm: 403.22 (SE +/- 6.05, N = 3)
GeForce GTX 1070: 456.72 (SE +/- 6.56, N = 6)
Radeon RX 480 - AMDGPU-PRO: 508.20 (SE +/- 2.19, N = 3)
GeForce GTX 1080: 573.71 (SE +/- 6.31, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 751.86 (SE +/- 14.35, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
Additional options listed for eight of the runs: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Max SP Flops
GFLOPS, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - AMDGPU-PRO: 2066.69 (SE +/- 18.41, N = 3)
GeForce GTX 1050: 2115.38 (SE +/- 0.01, N = 3)
Radeon RX 460 - ROCm: 2158.12 (SE +/- 0.07, N = 3)
GeForce GTX 1050 Ti: 2697.13 (SE +/- 5.23, N = 3)
GeForce GTX 1060: 4780.88 (SE +/- 22.70, N = 3)
Radeon R9 Fury - ROCm: 5330.67 (SE +/- 369.63, N = 6)
Radeon RX 480 - AMDGPU-PRO: 5750.69 (SE +/- 30.75, N = 3)
Radeon RX 480 - ROCm: 5815.52 (SE +/- 4.14, N = 3)
GeForce GTX 1070: 7115.54 (SE +/- 52.21, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 7131.18 (SE +/- 0.69, N = 3)
GeForce GTX 1080: 9415.48 (SE +/- 70.36, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
Additional options listed for eight of the runs: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Bus Speed Download
GB/s, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 5.72 (SE +/- 0.00, N = 3)
Radeon RX 460 - AMDGPU-PRO: 6.93 (SE +/- 0.00, N = 3)
Radeon RX 480 - ROCm: 8.37 (SE +/- 0.00, N = 3)
Radeon R9 Fury - ROCm: 11.32 (SE +/- 0.01, N = 3)
GeForce GTX 1050: 12.75 (SE +/- 0.00, N = 3)
GeForce GTX 1050 Ti: 12.78 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 12.78 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 12.78 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 12.78 (SE +/- 0.00, N = 3)
Radeon RX 480 - AMDGPU-PRO: 13.66 (SE +/- 0.00, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 13.69 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
Additional options listed for eight of the runs: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Bus Speed Readback
GB/s, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 5.27 (SE +/- 0.01, N = 3)
Radeon RX 460 - AMDGPU-PRO: 7.14 (SE +/- 0.00, N = 3)
Radeon RX 480 - ROCm: 8.38 (SE +/- 0.00, N = 3)
Radeon R9 Fury - ROCm: 10.86 (SE +/- 0.03, N = 3)
GeForce GTX 1050: 13.11 (SE +/- 0.03, N = 3)
GeForce GTX 1050 Ti: 13.22 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 13.22 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 13.22 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 13.22 (SE +/- 0.00, N = 3)
Radeon RX 480 - AMDGPU-PRO: 14.20 (SE +/- 0.00, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 14.21 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
Additional options listed for eight of the runs: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - AMDGPU-PRO: 77.35 (SE +/- 0.69, N = 3)
Radeon RX 460 - ROCm: 91.14 (SE +/- 0.16, N = 3)
Radeon RX 480 - AMDGPU-PRO: 160.57 (SE +/- 0.37, N = 3)
Radeon RX 480 - ROCm: 193.49 (SE +/- 1.30, N = 3)
Radeon R9 Fury - ROCm: 214.53 (SE +/- 4.26, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 223.25 (SE +/- 1.03, N = 3)
GeForce GTX 1050: 282.49 (SE +/- 0.98, N = 3)
GeForce GTX 1050 Ti: 316.10 (SE +/- 1.06, N = 3)
GeForce GTX 1060: 393.69 (SE +/- 0.96, N = 3)
GeForce GTX 1070: 446.64 (SE +/- 0.12, N = 3)
GeForce GTX 1080: 520.51 (SE +/- 1.14, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
Additional options listed for eight of the runs: -lSHOCCommonMPI -pthread -lmpi_cxx -lmpi

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 2.4 - Test: OpenCL Heartwall
Seconds, Fewer Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 13.51 (SE +/- 0.08, N = 3)
Radeon RX 460 - AMDGPU-PRO: 7.97 (SE +/- 0.01, N = 3)
Radeon RX 480 - ROCm: 7.28 (SE +/- 0.03, N = 3)
Radeon R9 Fury - ROCm: 6.45 (SE +/- 0.16, N = 6)
Radeon R9 Fury - AMDGPU-PRO: 6.38 (SE +/- 0.07, N = 3)
Radeon RX 480 - AMDGPU-PRO: 5.35 (SE +/- 0.02, N = 3)
GeForce GTX 1050: 5.27 (SE +/- 0.04, N = 3)
GeForce GTX 1050 Ti: 3.65 (SE +/- 0.04, N = 3)
GeForce GTX 1060: 3.36 (SE +/- 0.05, N = 5)
1. (CXX) g++ options: -O2 -lOpenCL

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

MandelGPU 1.3pts1 - OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 28295516.33
Radeon RX 460 - AMDGPU-PRO: 35552080.15
GeForce GTX 1050: 51548791.30
Radeon RX 480 - ROCm: 59296261.87
GeForce GTX 1050 Ti: 64272664.57
Radeon RX 480 - AMDGPU-PRO: 81101281.90
Radeon R9 Fury - ROCm: 82051996.27
Radeon R9 Fury - AMDGPU-PRO: 107202116.40
GeForce GTX 1060: 112043183.47
GeForce GTX 1070: 159458228.23
GeForce GTX 1080: 206148858.53
Standard errors as listed in the source (9 entries for 11 results, so they are not matched to systems here): SE +/- 30521.44, N = 3; SE +/- 165178.15, N = 2; SE +/- 26110.91, N = 3; SE +/- 126265.24, N = 3; SE +/- 75826.86, N = 3; SE +/- 71744.88, N = 3; SE +/- 104172.86, N = 3; SE +/- 248567.98, N = 3; SE +/- 971382.09, N = 3
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
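The MandelGPU description mentions a maximum of 4096 iterations. As a rough illustration of what an iteration cap means for a Mandelbrot sample, here is a plain-Python escape-time loop. It is a sketch of the general technique only, not MandelGPU's OpenCL kernel, and it does not reproduce the float4 vectorization; `escape_iterations` and `MAX_ITERATIONS` are names chosen for this example.

```python
MAX_ITERATIONS = 4096  # iteration cap quoted in the test description


def escape_iterations(cx, cy, max_iter=MAX_ITERATIONS):
    """Count iterations of z = z*z + c until |z| > 2, up to max_iter.

    Points that never escape within max_iter iterations are treated as
    belonging to the Mandelbrot set and cost the full iteration budget.
    """
    x = y = 0.0
    for i in range(max_iter):
        if x * x + y * y > 4.0:  # |z|^2 > 4 means the orbit has escaped
            return i
        x, y = x * x - y * y + cx, 2.0 * x * y + cy
    return max_iter


if __name__ == "__main__":
    # The origin never escapes; a far-away point escapes almost at once.
    print(escape_iterations(0.0, 0.0))
    print(escape_iterations(2.0, 2.0))
```

Samples inside the set run the full loop, which is why a higher iteration cap makes such a benchmark more compute-bound.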

JuliaGPU

JuliaGPU 1.2pts1 - OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 46101692.27
Radeon RX 460 - AMDGPU-PRO: 50807022.25
GeForce GTX 1050: 64896787.13
Radeon RX 480 - ROCm: 70675082.10
Radeon R9 Fury - ROCm: 73072755.80
Radeon R9 Fury - AMDGPU-PRO: 75992404.70
GeForce GTX 1050 Ti: 78171484.97
Radeon RX 480 - AMDGPU-PRO: 81972594.40
GeForce GTX 1060: 115523522.73
GeForce GTX 1070: 144431468.40
GeForce GTX 1080: 165302847.33
Standard errors as listed in the source (10 entries for 11 results, so they are not matched to systems here): SE +/- 160084.77, N = 3; SE +/- 97714.65, N = 2; SE +/- 29908.92, N = 3; SE +/- 94849.94, N = 3; SE +/- 985012.76, N = 3; SE +/- 109924.23, N = 3; SE +/- 500734.10, N = 2; SE +/- 194570.11, N = 3; SE +/- 169012.99, N = 3; SE +/- 694138.93, N = 3
1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.0 - OpenCL Device: GPU - Scene: Hotel
Score, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 381 (SE +/- 0.58, N = 3)
Radeon RX 460 - AMDGPU-PRO: 897 (SE +/- 1.00, N = 3)
Radeon RX 480 - ROCm: 987 (SE +/- 2.40, N = 3)
GeForce GTX 1050: 1128 (SE +/- 5.67, N = 3)
Radeon R9 Fury - ROCm: 1201 (SE +/- 0.00, N = 3)
GeForce GTX 1050 Ti: 1334 (SE +/- 3.79, N = 3)
GeForce GTX 1060: 2092 (SE +/- 6.03, N = 3)
Radeon RX 480 - AMDGPU-PRO: 2399 (SE +/- 6.94, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 2402 (SE +/- 11.46, N = 3)
GeForce GTX 1080: 2993 (SE +/- 9.00, N = 3)
GeForce GTX 1070: 3023 (SE +/- 4.91, N = 3)

LuxMark 3.0 - OpenCL Device: GPU - Scene: Microphone
Score, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - AMDGPU-PRO: 2623 (SE +/- 6.98, N = 3)
GeForce GTX 1050: 3300 (SE +/- 3.38, N = 3)
GeForce GTX 1050 Ti: 3612 (SE +/- 2.52, N = 3)
GeForce GTX 1060: 5204 (SE +/- 3.51, N = 3)
Radeon R9 Fury - ROCm: 5695 (SE +/- 15.04, N = 3)
GeForce GTX 1080: 6388 (SE +/- 2.03, N = 3)
Radeon RX 480 - AMDGPU-PRO: 6924 (SE +/- 13.50, N = 3)
GeForce GTX 1070: 7302 (SE +/- 38.17, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 7681 (SE +/- 17.84, N = 3)

LuxMark 3.0 - OpenCL Device: GPU - Scene: Luxball HDR
Score, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 3664 (SE +/- 17.00, N = 3)
Radeon RX 460 - AMDGPU-PRO: 5547 (SE +/- 9.82, N = 3)
GeForce GTX 1050: 6656 (SE +/- 5.20, N = 3)
GeForce GTX 1050 Ti: 7391 (SE +/- 17.00, N = 3)
Radeon RX 480 - ROCm: 9196 (SE +/- 0.67, N = 3)
GeForce GTX 1060: 11768 (SE +/- 36.34, N = 3)
Radeon R9 Fury - ROCm: 11995 (SE +/- 17.34, N = 3)
GeForce GTX 1080: 12968 (SE +/- 12.45, N = 3)
Radeon RX 480 - AMDGPU-PRO: 14066 (SE +/- 68.10, N = 3)
GeForce GTX 1070: 16215 (SE +/- 2.31, N = 3)
Radeon R9 Fury - AMDGPU-PRO: 19394 (SE +/- 75.47, N = 3)

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

MandelbulbGPU 1.0pts1 - OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)
Radeon RX 460 - ROCm: 29562658.90
Radeon RX 460 - AMDGPU-PRO: 32208376.98
GeForce GTX 1050: 37667402.03
Radeon R9 Fury - AMDGPU-PRO: 43447360.40
Radeon R9 Fury - ROCm: 44388927.12
GeForce GTX 1050 Ti: 44889116.70
Radeon RX 480 - AMDGPU-PRO: 48517365.80
Radeon RX 480 - ROCm: 49050438.67
GeForce GTX 1060: 63345982.20
GeForce GTX 1070: 79620073.63
GeForce GTX 1080: 91109498.40
Standard errors as listed in the source (9 entries for 11 results, so they are not matched to systems here): SE +/- 117840.12, N = 3; SE +/- 561923.72, N = 4; SE +/- 36018.97, N = 3; SE +/- 2304744.64, N = 6; SE +/- 112131.74, N = 3; SE +/- 81023.55, N = 3; SE +/- 290297.61, N = 3; SE +/- 503324.21, N = 3; SE +/- 423859.17, N = 3
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL