NVIDIA AMD OpenCL Linux GPGPU Tests Benchmarks by Michael Larabel for a future article on Phoronix. Just looking at a range of OpenBenchmarking.org OpenCL benchmarks on a range of AMD Radeon and NVIDIA GeForce graphics cards with the proprietary drivers under Ubuntu Linux.
HTML result view exported from: https://openbenchmarking.org/result/1406282-PL-1311092SO17&gru&sor .
NVIDIA AMD OpenCL Linux GPGPU Tests Processor Motherboard Chipset Memory Disk Graphics Audio Network Monitor OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution GeForce GT 240 GeForce GTX 460 GeForce GTX 550 Ti GeForce GT 610 GeForce GTX 650 GeForce GTX 680 Radeon HD 6870 Radeon HD 7850 Radeon HD 7950 Radeon R9 270X AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores) ASUS Crosshair V Formula AMD RD890 bridge 8192MB 64GB OCZ AGILITY ECS NVIDIA GeForce GT 240 512MB (550/1700MHz) Realtek ALC889 Intel 82583V Gigabit Connection Ubuntu 13.10 3.11.0-12-generic (x86_64) Unity 7.1.2 X Server 1.14.3 NVIDIA 331.20 3.3.0 GCC 4.8 ext4 2560x1600 ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz) 4.3.0 eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz) NVIDIA GeForce GT 610 1024MB (810/533MHz) MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz) NVIDIA GeForce GTX 680 2048MB (705/3004MHz) Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz) SyncMaster + SyncMaster fglrx 13.25.5 4.3.12614 ASUS AMD Radeon HD 7800 1024MB (860/1200MHz) XFX AMD Radeon HD 7900 3072MB (900/1375MHz) Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details - Scaling Governor: acpi-cpufreq ondemand OpenCL Details - GeForce GT 240: GPU Compute Cores: 96 - GeForce GTX 460: GPU Compute Cores: 336 - GeForce GTX 550 Ti: GPU Compute Cores: 192 - GeForce GT 610: GPU Compute Cores: 48 - GeForce GTX 650: GPU Compute Cores: 384 - GeForce GTX 680: GPU Compute Cores: 1536 System Details - GeForce GT 240: GPU Compute Cores: 96. - GeForce GTX 460: GPU Compute Cores: 336. - GeForce GTX 550 Ti: GPU Compute Cores: 192. - GeForce GT 610: GPU Compute Cores: 48. - GeForce GTX 650: GPU Compute Cores: 384. - GeForce GTX 680: GPU Compute Cores: 1536. Environment Details - Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri
NVIDIA AMD OpenCL Linux GPGPU Tests juliagpu: GPU mandelbulbgpu: GPU mandelgpu: GPU luxmark: GPU - Room luxmark: GPU - Sala luxmark: GPU - Luxball HDR opendwarfs: LU Decomposition opendwarfs: Compressed Sparse Row rodinia: OpenCL LavaMD rodinia: OpenCL Myocyte rodinia: OpenCL Heartwall rodinia: OpenCL Leukocyte rodinia: OpenCL Particle Filter GeForce GT 240 GeForce GTX 460 GeForce GTX 550 Ti GeForce GT 610 GeForce GTX 650 GeForce GTX 680 Radeon HD 6870 Radeon HD 7850 Radeon HD 7950 Radeon R9 270X 13543191.90 7095143.77 5291547.00 1051 220.27 6.88 137.90 36.99 40.49 21199859.50 9382926.17 21375259.40 382 2591 97.69 6.40 62.81 4.80 24.66 35.96 17540483.97 7729296.07 17362190.07 150 319 2162 110.81 6.55 5.49 27.70 44.35 4060660.07 1776400.70 3715320.13 27 42 315 458.26 9.00 95.88 22.72 100.31 15.28 10841150.33 4992671.83 12405397.70 82 177 1183 152.70 6.76 20.14 30.39 66.48 36123507.37 17083392.33 47966936.67 296 651 4216 58.94 6.10 9.26 60.78 5.42 14.02 18.19 107 631 6149 161.42 24.50 493.51 19.67 89.87 624 1134 8361 135.79 18.09 7.66 508.49 6.79 89.15 18.11 1002 1896 13087 123.96 17.83 5.79 481.62 5.88 83.83 10.78 867 1588 11031 126.60 16.25 6.17 485.32 5.40 84.66 13.01 OpenBenchmarking.org
JuliaGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU GeForce GTX 680 GeForce GTX 460 GeForce GTX 550 Ti GeForce GT 240 GeForce GTX 650 GeForce GT 610 8M 16M 24M 32M 40M SE +/- 72150.98, N = 3 SE +/- 4955.24, N = 3 SE +/- 13067.53, N = 3 SE +/- 17817.43, N = 3 SE +/- 2871.63, N = 3 SE +/- 2505.78, N = 3 36123507.37 21199859.50 17540483.97 13543191.90 10841150.33 4060660.07 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
MandelbulbGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelbulbGPU 1.0pts1 OpenCL Device: GPU GeForce GTX 680 GeForce GTX 460 GeForce GTX 550 Ti GeForce GT 240 GeForce GTX 650 GeForce GT 610 4M 8M 12M 16M 20M SE +/- 72849.89, N = 3 SE +/- 10640.98, N = 3 SE +/- 4259.44, N = 3 SE +/- 3953.96, N = 3 SE +/- 4281.68, N = 3 SE +/- 614.61, N = 3 17083392.33 9382926.17 7729296.07 7095143.77 4992671.83 1776400.70 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
MandelGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: GPU GeForce GTX 680 GeForce GTX 460 GeForce GTX 550 Ti GeForce GTX 650 GeForce GT 240 GeForce GT 610 10M 20M 30M 40M 50M SE +/- 5781.86, N = 3 SE +/- 2448.11, N = 3 SE +/- 6289.55, N = 3 SE +/- 857.65, N = 3 SE +/- 217.94, N = 3 SE +/- 230.98, N = 3 47966936.67 21375259.40 17362190.07 12405397.70 5291547.00 3715320.13 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
LuxMark OpenCL Device: GPU - Scene: Room OpenBenchmarking.org Score, More Is Better LuxMark 2.1beta1 OpenCL Device: GPU - Scene: Room Radeon HD 7950 Radeon R9 270X Radeon HD 7850 GeForce GTX 680 GeForce GTX 550 Ti Radeon HD 6870 GeForce GTX 650 GeForce GT 610 200 400 600 800 1000 SE +/- 0.58, N = 3 SE +/- 0.67, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 1002 867 624 296 150 107 82 27
LuxMark OpenCL Device: GPU - Scene: Sala OpenBenchmarking.org Score, More Is Better LuxMark 2.1beta1 OpenCL Device: GPU - Scene: Sala Radeon HD 7950 Radeon R9 270X Radeon HD 7850 GeForce GTX 680 Radeon HD 6870 GeForce GTX 460 GeForce GTX 550 Ti GeForce GTX 650 GeForce GT 610 400 800 1200 1600 2000 SE +/- 0.67, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1896 1588 1134 651 631 382 319 177 42
LuxMark OpenCL Device: GPU - Scene: Luxball HDR OpenBenchmarking.org Score, More Is Better LuxMark 2.1beta1 OpenCL Device: GPU - Scene: Luxball HDR Radeon HD 7950 Radeon R9 270X Radeon HD 7850 Radeon HD 6870 GeForce GTX 680 GeForce GTX 460 GeForce GTX 550 Ti GeForce GTX 650 GeForce GT 240 GeForce GT 610 3K 6K 9K 12K 15K SE +/- 4.84, N = 3 SE +/- 3.61, N = 3 SE +/- 1.45, N = 3 SE +/- 0.88, N = 3 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 13087 11031 8361 6149 4216 2591 2162 1183 1051 315
OpenDwarfs Test: LU Decomposition OpenBenchmarking.org ms, Fewer Is Better OpenDwarfs 2013-11-06 Test: LU Decomposition GeForce GTX 680 GeForce GTX 460 GeForce GTX 550 Ti Radeon HD 7950 Radeon R9 270X Radeon HD 7850 GeForce GTX 650 Radeon HD 6870 GeForce GT 240 GeForce GT 610 100 200 300 400 500 SE +/- 0.88, N = 5 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 SE +/- 4.65, N = 6 SE +/- 4.83, N = 6 SE +/- 5.24, N = 6 SE +/- 0.21, N = 3 SE +/- 4.12, N = 6 SE +/- 1.06, N = 3 SE +/- 0.06, N = 3 58.94 97.69 110.81 123.96 126.60 135.79 152.70 161.42 220.27 458.26 1. (CC) gcc options: -lm -lOpenCL
OpenDwarfs Test: Compressed Sparse Row OpenBenchmarking.org ms, Fewer Is Better OpenDwarfs 2013-11-06 Test: Compressed Sparse Row GeForce GTX 680 GeForce GTX 460 GeForce GTX 550 Ti GeForce GTX 650 GeForce GT 240 GeForce GT 610 Radeon R9 270X Radeon HD 7950 Radeon HD 7850 Radeon HD 6870 6 12 18 24 30 SE +/- 0.58, N = 6 SE +/- 0.76, N = 6 SE +/- 0.80, N = 6 SE +/- 0.57, N = 6 SE +/- 0.11, N = 3 SE +/- 0.56, N = 6 SE +/- 0.89, N = 6 SE +/- 0.86, N = 6 SE +/- 0.50, N = 6 SE +/- 0.81, N = 6 6.10 6.40 6.55 6.76 6.88 9.00 16.25 17.83 18.09 24.50 1. (CC) gcc options: -lm -lOpenCL
Rodinia Test: OpenCL LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL LavaMD Radeon HD 7950 Radeon R9 270X Radeon HD 7850 GeForce GTX 680 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.14, N = 6 5.79 6.17 7.66 9.26 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Myocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Myocyte GeForce GTX 680 GeForce GTX 460 GeForce GT 610 GeForce GT 240 Radeon HD 7950 Radeon R9 270X Radeon HD 6870 Radeon HD 7850 110 220 330 440 550 SE +/- 1.14, N = 3 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 SE +/- 0.12, N = 3 SE +/- 0.84, N = 3 SE +/- 0.67, N = 3 SE +/- 1.88, N = 3 SE +/- 0.96, N = 3 60.78 62.81 95.88 137.90 481.62 485.32 493.51 508.49 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Heartwall OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Heartwall GeForce GTX 460 Radeon R9 270X GeForce GTX 680 GeForce GTX 550 Ti Radeon HD 7950 Radeon HD 7850 Radeon HD 6870 GeForce GTX 650 GeForce GT 610 GeForce GT 240 9 18 27 36 45 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 6 SE +/- 0.13, N = 6 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 SE +/- 0.13, N = 3 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 SE +/- 0.28, N = 3 4.80 5.40 5.42 5.49 5.88 6.79 19.67 20.14 22.72 36.99 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Leukocyte GeForce GTX 680 GeForce GTX 460 GeForce GTX 550 Ti GeForce GTX 650 GeForce GT 240 Radeon HD 7950 Radeon R9 270X Radeon HD 7850 Radeon HD 6870 GeForce GT 610 20 40 60 80 100 SE +/- 0.24, N = 4 SE +/- 0.30, N = 3 SE +/- 0.48, N = 4 SE +/- 0.26, N = 3 SE +/- 0.75, N = 3 SE +/- 4.92, N = 6 SE +/- 6.06, N = 6 SE +/- 5.05, N = 6 SE +/- 3.37, N = 6 SE +/- 1.23, N = 3 14.02 24.66 27.70 30.39 40.49 83.83 84.66 89.15 89.87 100.31 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Particle Filter Radeon HD 7950 Radeon R9 270X GeForce GT 610 Radeon HD 7850 GeForce GTX 680 GeForce GTX 460 GeForce GTX 550 Ti GeForce GTX 650 15 30 45 60 75 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.57, N = 6 SE +/- 0.02, N = 3 SE +/- 0.17, N = 3 SE +/- 0.01, N = 3 SE +/- 0.24, N = 3 SE +/- 0.07, N = 3 10.78 13.01 15.28 18.11 18.19 35.96 44.35 66.48 1. (CXX) g++ options: -O2 -lOpenCL
Phoronix Test Suite v10.8.5