NVIDIA vs. OpenCL ROCm Linux vs. AMDGPU-PRO Benchmarks

XFX AMD Radeon R9 390

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1701214-RI-1701193RI10
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

CPU Massive 2 Tests
HPC - High Performance Computing 2 Tests
NVIDIA GPU Compute 3 Tests
OpenCL 7 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX 1050
January 18 2017
 
GeForce GTX 1050 Ti
January 18 2017
 
GeForce GTX 1060
January 18 2017
 
GeForce GTX 1070
January 18 2017
 
GeForce GTX 1080
January 18 2017
 
Radeon RX 460 - AMDGPU-PRO
January 16 2017
 
Radeon RX 480 - AMDGPU-PRO
January 16 2017
 
Radeon R9 Fury - AMDGPU-PRO
January 16 2017
 
Radeon RX 460 - ROCm
January 16 2017
 
Radeon RX 480 - ROCm
January 16 2017
 
Radeon R9 Fury - ROCm
January 15 2017
 
XFX AMD Radeon R9 390
January 20 2017
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA vs. OpenCL ROCm Linux vs. AMDGPU-PRO Benchmarks - Phoronix Test Suite

NVIDIA vs. OpenCL ROCm Linux vs. AMDGPU-PRO Benchmarks

XFX AMD Radeon R9 390

HTML result view exported from: https://openbenchmarking.org/result/1701214-RI-1701193RI10&grw&sro.

NVIDIA vs. OpenCL ROCm Linux vs. AMDGPU-PRO BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon RX 460 - AMDGPU-PRORadeon RX 480 - AMDGPU-PRORadeon R9 Fury - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - ROCmRadeon R9 Fury - ROCmXFX AMD Radeon R9 390Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.0Intel Sky Lake16384MB256GB TOSHIBA-RD400Zotac NVIDIA GeForce GTX 1050 2048MB (1075/3504MHz)Realtek ALC1150Intel ConnectionUbuntu 16.044.4.0-59-generic (x86_64)Unity 7.4.0X Server 1.18.3NVIDIA 375.264.5.0OpenCL 1.2 CUDA 8.0.01.0.24GCC 5.4.0 20160609ext43840x2160eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (418/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz)NVIDIA GeForce GTX 1080 8192MB (109/5005MHz)AMD Radeon RX 460 2048MBAcer B286HKamdgpu 1.1.994.5.13462OpenCL 2.0 AMD-APP (2236.5)AMD Radeon RX 480 8192MBSapphire AMD Radeon R9 Fury 4096MBLLVMpipe4.6.0-kfd-compute-rocm-rel-1.4-16 (x86_64)modesetting 1.18.33.3 Mesa 11.2.0 Gallium 0.4OpenCL 2.0 AMD-APP (2300.5)GCC 5.4.0 20160609 + Clang 4.0 + LLVM 4.0.0Sapphire AMD Radeon R9 FURY / NANO 3968MB4.1 Mesa 11.2.0 Gallium 0.4AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)ASRock 970DE3/U3S3AMD RX780/RX790 + SB7x0/SB8x0/SB9x032768MB128GB ADATA SP600 + 128GB OCZ AGILITY4 + 480GB TS480GSSD220SXFX AMD Radeon R9 390 8192MBRealtek ALC662 rev1VG2732 + SyncMasterRealtek RTL8111/8168/8411X Server 1.18.4modesetting 1.18.44.5.13462OpenCL 2.0 AMD-APP (2236.5)1.0.8GCC 5.4.1 20160904 + CUDA 8.01920x1200OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -vProcessor Details- GeForce GTX 1050: Scaling Governor: intel_pstate powersave- GeForce GTX 1050 Ti: Scaling Governor: intel_pstate powersave- GeForce GTX 1060: Scaling Governor: intel_pstate powersave- GeForce GTX 1070: Scaling Governor: intel_pstate powersave- GeForce GTX 1080: Scaling Governor: intel_pstate powersave- Radeon RX 460 - AMDGPU-PRO: Scaling Governor: intel_pstate powersave- Radeon RX 480 - AMDGPU-PRO: Scaling Governor: intel_pstate powersave- Radeon R9 Fury - AMDGPU-PRO: Scaling Governor: intel_pstate powersave- Radeon RX 460 - ROCm: Scaling Governor: intel_pstate powersave- Radeon RX 480 - ROCm: Scaling Governor: intel_pstate powersave- Radeon R9 Fury - ROCm: Scaling Governor: intel_pstate powersave- XFX AMD Radeon R9 390: Scaling Governor: acpi-cpufreq ondemandOpenCL Details- GeForce GTX 1050: GPU Compute Cores: 640- GeForce GTX 1050 Ti: GPU Compute Cores: 768- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560System Details- GeForce GTX 1050: GPU Compute Cores: 640.- GeForce GTX 1050 Ti: GPU Compute Cores: 768.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.Graphics Details- Radeon RX 460 - AMDGPU-PRO, Radeon RX 480 - AMDGPU-PRO, Radeon R9 Fury - AMDGPU-PRO, Radeon R9 Fury - ROCm, XFX AMD Radeon R9 390: GLAMOREnvironment Details- Radeon RX 460 - ROCm, Radeon RX 480 - ROCm: LIBGL_ALWAYS_SOFTWARE=1

NVIDIA vs. OpenCL ROCm Linux vs. AMDGPU-PRO Benchmarksdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthrodinia: OpenCL Heartwallmandelgpu: GPUjuliagpu: GPUluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRmandelbulbgpu: GPUGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon RX 460 - AMDGPU-PRORadeon RX 480 - AMDGPU-PRORadeon R9 Fury - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - ROCmRadeon R9 Fury - ROCmXFX AMD Radeon R9 39015.4515.1611.7811.25223.302115.3812.7513.11282.495.2751548791.3064896787.1311283300665637667402.0313.9715.4411.0111.38188.162697.1312.7813.22316.103.6564272664.5778171484.9713343612739144889116.704.675.901.2011.85296.884780.8812.7813.22393.693.36112043183.47115523522.73209252041176863345982.203.875.740.9912.08456.727115.5412.7813.22446.64159458228.23144431468.40302373021621579620073.633.725.730.9912.20573.719415.4812.7813.22520.51206148858.53165302847.33299363881296891109498.409.517.202.836.25245.132066.696.937.1477.357.9735552080.1550807022.258972623554732208376.984.375.760.999.40508.205750.6913.6614.20160.575.3581101281.9081972594.40239969241406648517365.804.226.301.794.12751.867131.1813.6914.21223.256.38107202116.4075992404.70240276811939443447360.409.577.052.485.21158.212158.125.725.2791.1413.5128295516.3346101692.27381366429562658.905.725.930.997.94403.225815.528.378.38193.497.2859296261.8770675082.10987919649050438.674.986.091.4810.59399.715330.6711.3210.86214.536.4582051996.2773072755.80120156951199544388927.124.210.270.275.22861.395173.235.735.66262.535.14207766441687059949695.30OpenBenchmarking.org

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Boat - Acceleration: OpenCLXFX AMD Radeon R9 3900.94731.89462.84193.78924.7365SE +/- 0.05, N = 34.21

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Masskrug - Acceleration: OpenCLXFX AMD Radeon R9 3900.06080.12160.18240.24320.304SE +/- 0.00, N = 40.27

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Server Room - Acceleration: OpenCLXFX AMD Radeon R9 3900.06080.12160.18240.24320.304SE +/- 0.01, N = 60.27

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: OpenCLGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCm48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.77, N = 6SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 315.4513.974.673.873.724.224.989.519.574.375.72

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: OpenCLGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCm48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 315.1615.445.905.745.736.306.097.207.055.765.93

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: OpenCLGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCm3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 6SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 411.7811.011.200.990.991.791.482.832.480.990.99

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 3903691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.00, N = 3SE +/- 0.14, N = 4SE +/- 0.01, N = 3SE +/- 0.00, N = 311.2511.3811.8512.0812.204.1210.596.255.219.407.945.22-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 3902004006008001000SE +/- 2.58, N = 3SE +/- 2.31, N = 3SE +/- 4.87, N = 3SE +/- 6.56, N = 6SE +/- 6.31, N = 3SE +/- 14.35, N = 3SE +/- 0.44, N = 3SE +/- 1.23, N = 3SE +/- 0.04, N = 3SE +/- 2.19, N = 3SE +/- 6.05, N = 3SE +/- 6.83, N = 3223.30188.16296.88456.72573.71751.86399.71245.13158.21508.20403.22861.39-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 3902K4K6K8K10KSE +/- 0.01, N = 3SE +/- 5.23, N = 3SE +/- 22.70, N = 3SE +/- 52.21, N = 3SE +/- 70.36, N = 3SE +/- 0.69, N = 3SE +/- 369.63, N = 6SE +/- 18.41, N = 3SE +/- 0.07, N = 3SE +/- 30.75, N = 3SE +/- 4.14, N = 3SE +/- 2.31, N = 32115.382697.134780.887115.549415.487131.185330.672066.692158.125750.695815.525173.23-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 39048121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 312.7512.7812.7812.7812.7813.6911.326.935.7213.668.375.73-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 39048121620SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.10, N = 613.1113.2213.2213.2213.2214.2110.867.145.2714.208.385.66-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 390110220330440550SE +/- 0.98, N = 3SE +/- 1.06, N = 3SE +/- 0.96, N = 3SE +/- 0.12, N = 3SE +/- 1.14, N = 3SE +/- 1.03, N = 3SE +/- 4.26, N = 3SE +/- 0.69, N = 3SE +/- 0.16, N = 3SE +/- 0.37, N = 3SE +/- 1.30, N = 3SE +/- 0.79, N = 3282.49316.10393.69446.64520.51223.25214.5377.3591.14160.57193.49262.53-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

Rodinia

Test: OpenCL Heartwall

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL HeartwallGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 3903691215SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 5SE +/- 0.07, N = 3SE +/- 0.16, N = 6SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 35.273.653.366.386.457.9713.515.357.285.141. (CXX) g++ options: -O2 -lOpenCL

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCm40M80M120M160M200MSE +/- 26110.91, N = 3SE +/- 75826.86, N = 3SE +/- 104172.86, N = 3SE +/- 248567.98, N = 3SE +/- 971382.09, N = 3SE +/- 71744.88, N = 3SE +/- 165178.15, N = 2SE +/- 30521.44, N = 3SE +/- 126265.24, N = 351548791.3064272664.57112043183.47159458228.23206148858.53107202116.4082051996.2735552080.1528295516.3381101281.9059296261.871. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCm40M80M120M160M200MSE +/- 29908.92, N = 3SE +/- 109924.23, N = 3SE +/- 194570.11, N = 3SE +/- 169012.99, N = 3SE +/- 694138.93, N = 3SE +/- 985012.76, N = 3SE +/- 97714.65, N = 2SE +/- 160084.77, N = 3SE +/- 500734.10, N = 2SE +/- 94849.94, N = 364896787.1378171484.97115523522.73144431468.40165302847.3375992404.7073072755.8050807022.2546101692.2781972594.4070675082.101. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 3906001200180024003000SE +/- 5.67, N = 3SE +/- 3.79, N = 3SE +/- 6.03, N = 3SE +/- 4.91, N = 3SE +/- 9.00, N = 3SE +/- 11.46, N = 3SE +/- 0.00, N = 3SE +/- 1.00, N = 3SE +/- 0.58, N = 3SE +/- 6.94, N = 3SE +/- 2.40, N = 3SE +/- 2.08, N = 3112813342092302329932402120189738123999872077

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 480 - AMDGPU-PROXFX AMD Radeon R9 39016003200480064008000SE +/- 3.38, N = 3SE +/- 2.52, N = 3SE +/- 3.51, N = 3SE +/- 38.17, N = 3SE +/- 2.03, N = 3SE +/- 17.84, N = 3SE +/- 15.04, N = 3SE +/- 6.98, N = 3SE +/- 13.50, N = 3SE +/- 14.84, N = 33300361252047302638876815695262369246644

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 3904K8K12K16K20KSE +/- 5.20, N = 3SE +/- 17.00, N = 3SE +/- 36.34, N = 3SE +/- 2.31, N = 3SE +/- 12.45, N = 3SE +/- 75.47, N = 3SE +/- 17.34, N = 3SE +/- 9.82, N = 3SE +/- 17.00, N = 3SE +/- 68.10, N = 3SE +/- 0.67, N = 3SE +/- 59.62, N = 36656739111768162151296819394119955547366414066919616870

MandelbulbGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 Fury - AMDGPU-PRORadeon R9 Fury - ROCmRadeon RX 460 - AMDGPU-PRORadeon RX 460 - ROCmRadeon RX 480 - AMDGPU-PRORadeon RX 480 - ROCmXFX AMD Radeon R9 39020M40M60M80M100MSE +/- 36018.97, N = 3SE +/- 112131.74, N = 3SE +/- 290297.61, N = 3SE +/- 503324.21, N = 3SE +/- 423859.17, N = 3SE +/- 2304744.64, N = 6SE +/- 561923.72, N = 4SE +/- 117840.12, N = 3SE +/- 81023.55, N = 337667402.0344889116.7063345982.2079620073.6391109498.4043447360.4044388927.1232208376.9829562658.9048517365.8049050438.6759949695.301. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL


Phoronix Test Suite v10.8.4