NVIDIA vs. Radeon OpenCL compute testing on ubuntu Linux. Tests by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1707132-TR-1707100PT16
OpenCL ROCm 1.6 Radeon Compute vs. NVIDIA Linux
NVIDIA vs. Radeon OpenCL compute testing on ubuntu Linux. Tests by Michael Larabel for a future article.
,,"Radeon RX 560","Radeon RX 480","Radeon R9 Fury","Radeon RX 460","GeForce GTX 1060","GeForce GTX 1080","GeForce GTX 1080 Ti","GeForce GTX 1050","GeForce GTX 1070","GeForce GTX 980","GeForce GTX 980 Ti","GeForce GTX 970","GeForce GTX 960","GeForce GTX 780 Ti","Radeon RX 580","Gigabyte NVIDIA GeForce GTX 1050"
Processor,,Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),Intel Core i7-7740K @ 4.50GHz (8 Cores),AMD Phenom II X3 720 @ 2.80GHz (3 Cores)
Motherboard,,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS PRIME X299-A,ASUS M4A89GTD-PRO/USB3
Chipset,,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,Intel Device 591f,AMD RS880
Memory,,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,8192MB
Disk,,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,Samsung SSD 950 PRO 256GB,120GB OCZ AGILITY3 + 3001GB Seagate ST3000DM001-1ER1 + 2000GB Seagate ST2000DM001-9YN1 + 1500GB SAMSUNG HD154UI + 300GB Western Digital WD3000HLFS-0 + 4001GB Seagate ST4000DM005-2DP1
Graphics,,AMD POLARIS11 3968MB,AMD POLARIS10 8064MB,Sapphire AMD Radeon R9 FURY / NANO 3968MB,AMD POLARIS11 1920MB,NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz),NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz),NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz),Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz),NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz),NVIDIA GeForce GTX 980 4096MB (1126/3505MHz),NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz),eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz),eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz),NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz),MSI AMD POLARIS10 8064MB,Gigabyte NVIDIA GeForce GTX 1050 2048MB (120/405MHz)
Audio,,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek Generic,Realtek ALC892
Network,,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Realtek RTL8111/8168/8411 + Qualcomm Atheros AR93xx Wireless
OS,,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04,Ubuntu 16.04
Kernel,,4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.9.0-kfd-compute-rocm-rel-1.6-77 (x86_64),4.4.0-83-generic (x86_64)
Desktop,,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,Unity 7.4.0,KDE Frameworks 5
Display Server,,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,X Server 1.18.4,
Display Driver,,amdgpu 1.1.2,amdgpu 1.1.2,modesetting 1.18.4,amdgpu 1.1.2,NVIDIA 384.47,NVIDIA 384.47,NVIDIA 384.47,NVIDIA 384.47,NVIDIA 384.47,NVIDIA 384.47,NVIDIA 384.47,NVIDIA 384.47,NVIDIA 384.47,NVIDIA 384.47,modesetting 1.18.4,NVIDIA 375.66
OpenGL,,4.1 Mesa 12.0.6 Gallium 0.4,4.1 Mesa 12.0.6 Gallium 0.4,4.1 Mesa 12.0.6 Gallium 0.4,4.1 Mesa 12.0.6 Gallium 0.4,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.1 Mesa 12.0.6 Gallium 0.4,4.5.0
OpenCL,,OpenCL 2.0 AMD-APP (2442.0),OpenCL 2.0 AMD-APP (2442.0),OpenCL 2.0 AMD-APP (2442.0),OpenCL 2.0 AMD-APP (2442.0),OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 1.2 CUDA 9.0.101,OpenCL 2.0 AMD-APP (2442.0),
Compiler,,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609,GCC 5.4.0 20160609
File-System,,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4
Screen Resolution,,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,3840x2160,1680x1050
Vulkan,,,,,,,,,,,,,,,,,1.0.24
,,"Radeon RX 560","Radeon RX 480","Radeon R9 Fury","Radeon RX 460","GeForce GTX 1060","GeForce GTX 1080","GeForce GTX 1080 Ti","GeForce GTX 1050","GeForce GTX 1070","GeForce GTX 980","GeForce GTX 980 Ti","GeForce GTX 970","GeForce GTX 960","GeForce GTX 780 Ti","Radeon RX 580","Gigabyte NVIDIA GeForce GTX 1050"
"Darktable - Test: Boat - Acceleration: OpenCL (sec)",LIB,6.96,4.25,3.50,7.98,4.51,3.55,3.03,17.76,3.71,15.02,4.05,15.46,19.13,15.11,4.22,
"Darktable - Test: Masskrug - Acceleration: OpenCL (sec)",LIB,0.29,0.17,0.13,0.29,0.14,0.13,0.13,0.17,0.13,0.16,0.17,0.17,0.20,0.22,0.17,
"Darktable - Test: Server Room - Acceleration: OpenCL (sec)",LIB,0.31,0.17,0.15,0.28,0.14,0.13,0.13,0.18,0.14,0.17,0.17,0.17,0.20,0.21,0.16,
"Darktable - Test: Boat - Acceleration: OpenCL (sec)",LIB,,,,,,,,,,,,,,,,37.49
"Darktable - Test: Masskrug - Acceleration: OpenCL (sec)",LIB,,,,,,,,,,,,,,,,47.03
"Darktable - Test: Server Room - Acceleration: OpenCL (sec)",LIB,,,,,,,,,,,,,,,,28.19
"SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Bus Speed Download (GB/s)",HIB,5.72,11.47,11.38,5.71,12.99,13.00,13.00,13.00,13.00,13.00,13.00,13.00,13.00,13.00,11.49,
"SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Bus Speed Readback (GB/s)",HIB,5.24,10.42,10.61,5.20,13.20,13.20,13.20,13.20,13.20,13.20,13.20,13.20,13.19,13.20,10.21,
"clpeak - OpenCL Test: Kernel Latency (us)",LIB,41.06,39.87,41.62,42.49,3.60,3.59,3.60,3.68,3.60,4.12,4.32,4.08,3.94,5.67,39.54,
"clpeak - OpenCL Test: Integer Compute INT (GIOPS)",HIB,525.62,1161.64,1429.19,414.21,1227.52,2372.63,3276.83,562.46,1657.74,1293.68,1605.34,1134.30,781.83,958.80,1252.48,
"clpeak - OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS)",HIB,30.04,30.13,30.52,30.03,12.61,12.63,12.62,12.53,12.60,12.46,12.46,12.47,12.44,12.36,30.47,
"clpeak - OpenCL Test: Single-Precision Float (GFLOPS)",HIB,2599.84,5755.59,7064.24,2135.23,4264.43,8398.22,11828.88,1941.07,6332.82,4286.78,5303.61,3725.85,2498.99,3661.68,6195.09,
"clpeak - OpenCL Test: Double-Precision Double (GFLOPS)",HIB,164.57,364.11,447.25,135.17,151.32,298.47,417.87,66.72,225.42,159.90,197.36,137.42,92.88,246.92,391.92,
"SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s)",HIB,105.19,176.79,239.03,83.67,381.92,528.02,600.45,276.45,456.70,333.05,350.63,288.61,279.37,286.87,190.13,
"clpeak - OpenCL Test: Global Memory Bandwidth (GBPS)",HIB,88.69,203.08,388.73,89.25,146.64,222.43,329.40,92.42,196.24,164.05,262.13,143.41,81.13,252.46,201.45,
"SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: FFT SP (GFLOPS)",HIB,208.75,460.84,550.95,173.12,404.59,652.64,980.92,253.32,553.04,447.24,694.57,384.81,207.09,429.63,496.76,
"SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: MD5 Hash (GHash/s)",HIB,,,,,7.36,14.36,19.99,3.23,10.69,7.56,9.29,6.54,4.49,4.68,,
"Rodinia - Test: OpenCL Particle Filter (sec)",LIB,,,,,12.15,6.56,5.03,26.09,8.27,11.87,10.14,13.11,18.47,15.30,6.62,
"FAHBench - Phoronix Test Suite v7.2.1 (Ns/Day)",HIB,,,,,97.88,146.29,186.33,49.72,132.83,97.56,109.12,86.07,58.59,72.76,,
"Mixbench - Benchmark: Single Precision (GFLOPS)",HIB,2303.86,5442.89,6501.19,1851.75,4403.62,8489.90,,2037.30,6374.80,4692.97,5816.30,4121.47,2726.15,4260.60,5854.94,
"Mixbench - Benchmark: Double Precision (GFLOPS)",HIB,149.51,362.18,440.79,134.11,152.40,293.93,,66.66,222.10,159.89,197.04,137.54,92.73,245.04,389.88,
"Mixbench - Benchmark: Integer (GIOPS)",HIB,487.82,1139.87,1385.89,422.38,1370.35,2656.67,,615.44,2027.70,1404.21,1717.69,1220.95,835.02,967.53,1226.84,
"clpeak - OpenCL Test: Transfer Bandwidth enqueueReadBuffer (GBPS)",HIB,12.25,12.25,12.25,12.24,11.32,11.32,11.36,11.32,11.36,11.25,11.37,11.33,11.32,11.24,12.26,
"cl-mem - Benchmark: Read (GB/s)",HIB,89.47,151.13,121.03,91.50,153.40,228.73,337.90,94.90,205.37,164.50,266.07,143.67,,271.70,156.93,
"cl-mem - Benchmark: Write (GB/s)",HIB,74.50,146.30,300.30,78.57,139.37,214.97,334.80,85.70,191.20,151.80,238.07,129.20,,250.80,158.07,
"cl-mem - Benchmark: Copy (GB/s)",HIB,77.20,173.13,198.43,77.87,138.97,208.83,316.60,86.70,186.57,142.50,216.40,125.30,,237.00,172.20,
"ViennaCL - OpenCL LU Factorization (GFLOPS)",HIB,9.54,12.36,20.60,7.33,54.25,61.06,63.36,42.12,58.69,54.76,56.98,52.87,47.42,56.89,13.01,
"Lulesh OpenCL - Phoronix Test Suite v7.2.1 (z/s)",HIB,852.97,878.41,845.27,840.73,,,,,,,,,,,866.99,
"LuxMark - OpenCL Device: GPU - Scene: Luxball HDR (Score)",HIB,4481,9731,12894,3872,11627,12923,19842,6576,16184,11955,14961,10731,6114,9516,10160,6457
"LuxMark - OpenCL Device: GPU - Scene: Hotel (Score)",HIB,507,1063,1369,430,1782,2643,3739,1023,2506,1863,2034,1756,1125,1200,1206,1127
"Ethereum Ethminer - Device: GPU OpenCL (H/s)",HIB,7203134,12543590,16587889,7176920,18653001,20721026,31569419,11398894,25311459,19740899,18018030,17974339,10341580,15689318,12811559,
"CoMD OpenCL - Average Atom Update Rate (us/atom/task)",HIB,4.84,4.84,4.85,4.85,4.85,4.84,4.84,4.84,4.84,4.84,4.85,4.84,4.84,4.84,4.85,
"Xsbench OpenCL - Phoronix Test Suite v7.2.1 (Lookups/s)",HIB,38606004,74635621,85206540,36982576,,,,,,,,,,,74435707,
"SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s/Watt)",HIB,1.21,1.33,1.51,0.96,3.16,3.64,3.15,3.12,2.95,2.25,1.82,1.84,2.47,1.12,1.45,
"SHOC Scalable HeterOgeneous Computing - System Power Consumption Monitor (Watts)",LIB,86.87,133.18,157.83,87.53,120.86,145.13,190.56,88.68,154.6,147.85,192.16,156.96,113.15,255.96,131.39,
"SHOC Scalable HeterOgeneous Computing - System Power Consumption Monitor (Watts)",LIB,56.9,98.95,98.9,75.45,71,49.0,118.8,67.65,74.45,55.9,142.7,84,72.85,139.5,68.8,
"LuxMark - OpenCL Device: GPU - Scene: Luxball HDR (Score/Watt)",HIB,43.14,51.41,54.91,36.39,78.53,72.32,78.40,60.76,89.05,59.86,60.65,57.14,43.35,32.63,51.48,
"LuxMark - System Power Consumption Monitor (Watts)",LIB,103.88,189.27,234.81,106.41,148.06,178.68,253.08,108.23,181.74,199.7,246.7,187.81,141.04,291.66,197.35,
"LuxMark - System Power Consumption Monitor (Watts)",LIB,77.41,103.14,116.66,81.4,86.49,97.96,,,,,,,,,,
"LuxMark - OpenCL Device: GPU - Scene: Hotel (Score/Watt)",HIB,5.15,6.29,6.34,4.28,12.40,13.70,13.96,9.61,13.70,9.22,8.73,9.24,7.56,4.38,6.83,
"LuxMark - System Power Consumption Monitor (Watts)",LIB,98.46,169.01,215.97,100.49,143.77,192.91,267.82,106.45,182.92,201.97,233.07,190.14,148.91,274.22,176.61,
"Mixbench - System Power Consumption Monitor (Watts)",LIB,84.7,95.82,84.5,79.72,77.7,49.6,,90.8,50.9,121.0,151.8,104.8,53.1,195.9,113.58,
"cl-mem - Benchmark: Read (GB/s/Watt)",HIB,0.95,1.00,0.73,0.92,1.33,1.76,1.82,1.05,1.57,1.12,1.53,1.05,,1.04,1.08,
"FAHBench - Phoronix Test Suite v7.2.1 (Ns/Day/Watt)",HIB,,,,,0.85,1.01,1.02,0.54,0.96,0.67,0.63,0.62,0.50,0.36,,
"FAHBench - System Power Consumption Monitor (Watts)",LIB,,,,,114.63,145.55,183.12,92.89,138.5,145.25,172.14,139.24,118.26,204.01,63.5,
"clpeak - OpenCL Test: Integer Compute INT (GIOPS/Watt)",HIB,5.33,8.59,8.60,4.79,,,,,,7.08,,,5.35,,9.62,
"clpeak - System Power Consumption Monitor (Watts)",LIB,98.6,135.3,166.19,86.44,185.3,93.75,158.7,87.6,117.05,182.83,195.05,75.15,146.17,148.7,130.17,
"Ethereum Ethminer - Device: GPU OpenCL (H/s/Watt)",HIB,69113.46,72609.92,80363.35,67367.83,124485.61,115090.84,126492.71,102539.74,135952.92,96016.05,76784.79,94763.88,70561.75,60749.50,72012.98,
"Ethereum Ethminer - System Power Consumption Monitor (Watts)",LIB,104.22,172.75,206.41,106.53,149.84,180.04,249.58,111.17,186.18,205.6,234.66,189.68,146.56,258.26,177.91,
"System Power Consumption Monitor - Phoronix Test Suite System Monitoring (Watts)",,82.97,129.75,150.69,84.6,109.18,140.7,184.89,90.71,139.11,145.65,185.09,141.74,116.21,201.83,135.51,