NVIDIA AMD OpenCL Linux GPGPU Tests Benchmarks by Michael Larabel for a future article on Phoronix. Just looking at a range of OpenBenchmarking.org OpenCL benchmarks on a range of AMD Radeon and NVIDIA GeForce graphics cards with the proprietary drivers under Ubuntu Linux.
HTML result view exported from: https://openbenchmarking.org/result/1406282-PL-1311092SO17&grs&sor&rro .
NVIDIA AMD OpenCL Linux GPGPU Tests Processor Motherboard Chipset Memory Disk Graphics Audio Network Monitor OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution GeForce GT 240 GeForce GTX 460 GeForce GTX 550 Ti GeForce GT 610 GeForce GTX 650 GeForce GTX 680 Radeon HD 6870 Radeon HD 7850 Radeon HD 7950 Radeon R9 270X AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores) ASUS Crosshair V Formula AMD RD890 bridge 8192MB 64GB OCZ AGILITY ECS NVIDIA GeForce GT 240 512MB (550/1700MHz) Realtek ALC889 Intel 82583V Gigabit Connection Ubuntu 13.10 3.11.0-12-generic (x86_64) Unity 7.1.2 X Server 1.14.3 NVIDIA 331.20 3.3.0 GCC 4.8 ext4 2560x1600 ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz) 4.3.0 eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz) NVIDIA GeForce GT 610 1024MB (810/533MHz) MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz) NVIDIA GeForce GTX 680 2048MB (705/3004MHz) Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz) SyncMaster + SyncMaster fglrx 13.25.5 4.3.12614 ASUS AMD Radeon HD 7800 1024MB (860/1200MHz) XFX AMD Radeon HD 7900 3072MB (900/1375MHz) Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details - Scaling Governor: acpi-cpufreq ondemand OpenCL Details - GeForce GT 240: GPU Compute Cores: 96 - GeForce GTX 460: GPU Compute Cores: 336 - GeForce GTX 550 Ti: GPU Compute Cores: 192 - GeForce GT 610: GPU Compute Cores: 48 - GeForce GTX 650: GPU Compute Cores: 384 - GeForce GTX 680: GPU Compute Cores: 1536 System Details - GeForce GT 240: GPU Compute Cores: 96. - GeForce GTX 460: GPU Compute Cores: 336. - GeForce GTX 550 Ti: GPU Compute Cores: 192. - GeForce GT 610: GPU Compute Cores: 48. - GeForce GTX 650: GPU Compute Cores: 384. - GeForce GTX 680: GPU Compute Cores: 1536. Environment Details - Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri
NVIDIA AMD OpenCL Linux GPGPU Tests mandelbulbgpu: GPU juliagpu: GPU rodinia: OpenCL Myocyte rodinia: OpenCL Heartwall rodinia: OpenCL Particle Filter luxmark: GPU - Sala luxmark: GPU - Luxball HDR luxmark: GPU - Room mandelgpu: GPU rodinia: OpenCL LavaMD opendwarfs: Compressed Sparse Row opendwarfs: LU Decomposition rodinia: OpenCL Leukocyte GeForce GT 240 GeForce GTX 460 GeForce GTX 550 Ti GeForce GT 610 GeForce GTX 650 GeForce GTX 680 Radeon HD 6870 Radeon HD 7850 Radeon HD 7950 Radeon R9 270X 7095143.77 13543191.90 137.90 36.99 1051 5291547.00 6.88 220.27 40.49 9382926.17 21199859.50 62.81 4.80 35.96 382 2591 21375259.40 6.40 97.69 24.66 7729296.07 17540483.97 5.49 44.35 319 2162 150 17362190.07 6.55 110.81 27.70 1776400.70 4060660.07 95.88 22.72 15.28 42 315 27 3715320.13 9.00 458.26 100.31 4992671.83 10841150.33 20.14 66.48 177 1183 82 12405397.70 6.76 152.70 30.39 17083392.33 36123507.37 60.78 5.42 18.19 651 4216 296 47966936.67 9.26 6.10 58.94 14.02 493.51 19.67 631 6149 107 24.50 161.42 89.87 508.49 6.79 18.11 1134 8361 624 7.66 18.09 135.79 89.15 481.62 5.88 10.78 1896 13087 1002 5.79 17.83 123.96 83.83 485.32 5.40 13.01 1588 11031 867 6.17 16.25 126.60 84.66 OpenBenchmarking.org
MandelbulbGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelbulbGPU 1.0pts1 OpenCL Device: GPU GeForce GT 610 GeForce GTX 650 GeForce GT 240 GeForce GTX 550 Ti GeForce GTX 460 GeForce GTX 680 4M 8M 12M 16M 20M SE +/- 614.61, N = 3 SE +/- 4281.68, N = 3 SE +/- 3953.96, N = 3 SE +/- 4259.44, N = 3 SE +/- 10640.98, N = 3 SE +/- 72849.89, N = 3 1776400.70 4992671.83 7095143.77 7729296.07 9382926.17 17083392.33 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
JuliaGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU GeForce GT 610 GeForce GTX 650 GeForce GT 240 GeForce GTX 550 Ti GeForce GTX 460 GeForce GTX 680 8M 16M 24M 32M 40M SE +/- 2505.78, N = 3 SE +/- 2871.63, N = 3 SE +/- 17817.43, N = 3 SE +/- 13067.53, N = 3 SE +/- 4955.24, N = 3 SE +/- 72150.98, N = 3 4060660.07 10841150.33 13543191.90 17540483.97 21199859.50 36123507.37 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
Rodinia Test: OpenCL Myocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Myocyte Radeon HD 7850 Radeon HD 6870 Radeon R9 270X Radeon HD 7950 GeForce GT 240 GeForce GT 610 GeForce GTX 460 GeForce GTX 680 110 220 330 440 550 SE +/- 0.96, N = 3 SE +/- 1.88, N = 3 SE +/- 0.67, N = 3 SE +/- 0.84, N = 3 SE +/- 0.12, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 1.14, N = 3 508.49 493.51 485.32 481.62 137.90 95.88 62.81 60.78 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Heartwall OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Heartwall GeForce GT 240 GeForce GT 610 GeForce GTX 650 Radeon HD 6870 Radeon HD 7850 Radeon HD 7950 GeForce GTX 550 Ti GeForce GTX 680 Radeon R9 270X GeForce GTX 460 9 18 27 36 45 SE +/- 0.28, N = 3 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 SE +/- 0.13, N = 6 SE +/- 0.09, N = 6 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 36.99 22.72 20.14 19.67 6.79 5.88 5.49 5.42 5.40 4.80 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Particle Filter GeForce GTX 650 GeForce GTX 550 Ti GeForce GTX 460 GeForce GTX 680 Radeon HD 7850 GeForce GT 610 Radeon R9 270X Radeon HD 7950 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.24, N = 3 SE +/- 0.01, N = 3 SE +/- 0.17, N = 3 SE +/- 0.02, N = 3 SE +/- 0.57, N = 6 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 66.48 44.35 35.96 18.19 18.11 15.28 13.01 10.78 1. (CXX) g++ options: -O2 -lOpenCL
LuxMark OpenCL Device: GPU - Scene: Sala OpenBenchmarking.org Score, More Is Better LuxMark 2.1beta1 OpenCL Device: GPU - Scene: Sala GeForce GT 610 GeForce GTX 650 GeForce GTX 550 Ti GeForce GTX 460 Radeon HD 6870 GeForce GTX 680 Radeon HD 7850 Radeon R9 270X Radeon HD 7950 400 800 1200 1600 2000 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.67, N = 3 42 177 319 382 631 651 1134 1588 1896
LuxMark OpenCL Device: GPU - Scene: Luxball HDR OpenBenchmarking.org Score, More Is Better LuxMark 2.1beta1 OpenCL Device: GPU - Scene: Luxball HDR GeForce GT 610 GeForce GT 240 GeForce GTX 650 GeForce GTX 550 Ti GeForce GTX 460 GeForce GTX 680 Radeon HD 6870 Radeon HD 7850 Radeon R9 270X Radeon HD 7950 3K 6K 9K 12K 15K SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.58, N = 3 SE +/- 0.67, N = 3 SE +/- 0.88, N = 3 SE +/- 1.45, N = 3 SE +/- 3.61, N = 3 SE +/- 4.84, N = 3 315 1051 1183 2162 2591 4216 6149 8361 11031 13087
LuxMark OpenCL Device: GPU - Scene: Room OpenBenchmarking.org Score, More Is Better LuxMark 2.1beta1 OpenCL Device: GPU - Scene: Room GeForce GT 610 GeForce GTX 650 Radeon HD 6870 GeForce GTX 550 Ti GeForce GTX 680 Radeon HD 7850 Radeon R9 270X Radeon HD 7950 200 400 600 800 1000 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 27 82 107 150 296 624 867 1002
MandelGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: GPU GeForce GT 610 GeForce GT 240 GeForce GTX 650 GeForce GTX 550 Ti GeForce GTX 460 GeForce GTX 680 10M 20M 30M 40M 50M SE +/- 230.98, N = 3 SE +/- 217.94, N = 3 SE +/- 857.65, N = 3 SE +/- 6289.55, N = 3 SE +/- 2448.11, N = 3 SE +/- 5781.86, N = 3 3715320.13 5291547.00 12405397.70 17362190.07 21375259.40 47966936.67 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Rodinia Test: OpenCL LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL LavaMD GeForce GTX 680 Radeon HD 7850 Radeon R9 270X Radeon HD 7950 3 6 9 12 15 SE +/- 0.14, N = 6 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 9.26 7.66 6.17 5.79 1. (CXX) g++ options: -O2 -lOpenCL
OpenDwarfs Test: Compressed Sparse Row OpenBenchmarking.org ms, Fewer Is Better OpenDwarfs 2013-11-06 Test: Compressed Sparse Row Radeon HD 6870 Radeon HD 7850 Radeon HD 7950 Radeon R9 270X GeForce GT 610 GeForce GT 240 GeForce GTX 650 GeForce GTX 550 Ti GeForce GTX 460 GeForce GTX 680 6 12 18 24 30 SE +/- 0.81, N = 6 SE +/- 0.50, N = 6 SE +/- 0.86, N = 6 SE +/- 0.89, N = 6 SE +/- 0.56, N = 6 SE +/- 0.11, N = 3 SE +/- 0.57, N = 6 SE +/- 0.80, N = 6 SE +/- 0.76, N = 6 SE +/- 0.58, N = 6 24.50 18.09 17.83 16.25 9.00 6.88 6.76 6.55 6.40 6.10 1. (CC) gcc options: -lm -lOpenCL
OpenDwarfs Test: LU Decomposition OpenBenchmarking.org ms, Fewer Is Better OpenDwarfs 2013-11-06 Test: LU Decomposition GeForce GT 610 GeForce GT 240 Radeon HD 6870 GeForce GTX 650 Radeon HD 7850 Radeon R9 270X Radeon HD 7950 GeForce GTX 550 Ti GeForce GTX 460 GeForce GTX 680 100 200 300 400 500 SE +/- 0.06, N = 3 SE +/- 1.06, N = 3 SE +/- 4.12, N = 6 SE +/- 0.21, N = 3 SE +/- 5.24, N = 6 SE +/- 4.83, N = 6 SE +/- 4.65, N = 6 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 SE +/- 0.88, N = 5 458.26 220.27 161.42 152.70 135.79 126.60 123.96 110.81 97.69 58.94 1. (CC) gcc options: -lm -lOpenCL
Rodinia Test: OpenCL Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Leukocyte GeForce GT 610 Radeon HD 6870 Radeon HD 7850 Radeon R9 270X Radeon HD 7950 GeForce GT 240 GeForce GTX 650 GeForce GTX 550 Ti GeForce GTX 460 GeForce GTX 680 20 40 60 80 100 SE +/- 1.23, N = 3 SE +/- 3.37, N = 6 SE +/- 5.05, N = 6 SE +/- 6.06, N = 6 SE +/- 4.92, N = 6 SE +/- 0.75, N = 3 SE +/- 0.26, N = 3 SE +/- 0.48, N = 4 SE +/- 0.30, N = 3 SE +/- 0.24, N = 4 100.31 89.87 89.15 84.66 83.83 40.49 30.39 27.70 24.66 14.02 1. (CXX) g++ options: -O2 -lOpenCL
Phoronix Test Suite v10.8.5