NVIDIA AMD OpenCL Linux GPGPU Tests
Benchmarks by Michael Larabel for a future article on Phoronix. A selection of OpenBenchmarking.org OpenCL benchmarks run across a range of AMD Radeon and NVIDIA GeForce graphics cards using the proprietary drivers under Ubuntu Linux.
HTML result view exported from: https://openbenchmarking.org/result/1401213-PL-1311092SO60&grw&rdt&rro .
System configurations

GeForce GTX 680
  Processor: AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)
  Motherboard: ASUS Crosshair V Formula
  Chipset: AMD RD890 bridge
  Memory: 8192MB
  Disk: 64GB OCZ AGILITY
  Graphics: NVIDIA GeForce GTX 680 2048MB (705/3004MHz)
  Audio: Realtek ALC889
  Network: Intel 82583V Gigabit Connection
  OS: Ubuntu 13.10
  Kernel: 3.11.0-12-generic (x86_64)
  Desktop: Unity 7.1.2
  Display Server: X Server 1.14.3
  Display Driver: NVIDIA 331.20
  OpenGL: 4.3.0
  Compiler: GCC 4.8
  File-System: ext4
  Screen Resolution: 2560x1600

The remaining entries show only the fields that the export lists for them.

GeForce GTX 550 Ti
  Graphics: eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)

GeForce GTX 650
  Graphics: MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)

GeForce GTX 460
  Graphics: ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)

GeForce GT 610
  Graphics: NVIDIA GeForce GT 610 1024MB (810/533MHz)

GeForce GT 240
  Graphics: ECS NVIDIA GeForce GT 240 512MB (550/1700MHz)
  OpenGL: 3.3.0

Radeon HD 6870
  Graphics: Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)
  Monitor: SyncMaster + SyncMaster
  Display Driver: fglrx 13.25.5
  OpenGL: 4.3.12614

Radeon HD 7850
  Graphics: ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)

Radeon HD 7950
  Graphics: XFX AMD Radeon HD 7900 3072MB (900/1375MHz)

Radeon R9 270X
  Graphics: Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)

GeForce GTX 470
  Processor: AMD Phenom II X4 955 @ 3.20GHz (4 Cores)
  Motherboard: Gigabyte GA-MA790X-UD3P
  Memory: 12288MB
  Disk: 2 x 1000GB SAMSUNG HD103UJ + 2000GB Hitachi HDS5C302 + 500GB SAMSUNG HD501LJ + 2000GB HDS5C3020ALA632
  Graphics: NVIDIA GeForce GTX 470 /3DNOW! 1280MB (656/1701MHz)
  Monitor: Philips 230W
  OS: Gentoo Base 2.2
  Kernel: 3.7.6-gentoo-k8-31 (i686)
  Desktop: KDE 4.10.5
  Display Driver: NVIDIA 331.20
  OpenGL: 4.4.0 NVIDIA 331.20
  Compiler: GCC 4.6.3 + LLVM 3.1
  Screen Resolution: 1920x1200

Compiler Details - GeForce GTX 680, GeForce GTX 550 Ti, GeForce GTX 650, GeForce GTX 460, GeForce GT 610, GeForce GT 240, Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details - Scaling Governor: acpi-cpufreq ondemand

OpenCL Details
  GeForce GTX 680: GPU Compute Cores: 1536
  GeForce GTX 550 Ti: GPU Compute Cores: 192
  GeForce GTX 650: GPU Compute Cores: 384
  GeForce GTX 460: GPU Compute Cores: 336
  GeForce GT 610: GPU Compute Cores: 48
  GeForce GT 240: GPU Compute Cores: 96
  GeForce GTX 470: GPU Compute Cores: 448
Environment Details - Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri
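Results overview

GeForce GTX 680
  rodinia: OpenCL LavaMD = 9.26
  rodinia: OpenCL Myocyte = 60.78
  rodinia: OpenCL Heartwall = 5.42
  rodinia: OpenCL Leukocyte = 14.02
  rodinia: OpenCL Particle Filter = 18.19
  mandelgpu: GPU = 47966936.67
  juliagpu: GPU = 36123507.37
  luxmark: GPU - Room = 296
  luxmark: GPU - Sala = 651
  luxmark: GPU - Luxball HDR = 4216
  mandelbulbgpu: GPU = 17083392.33
  opendwarfs: LU Decomposition = 58.94
  opendwarfs: Compressed Sparse Row = 6.10

GeForce GTX 550 Ti
  rodinia: OpenCL Heartwall = 5.49
  rodinia: OpenCL Leukocyte = 27.70
  rodinia: OpenCL Particle Filter = 44.35
  mandelgpu: GPU = 17362190.07
  juliagpu: GPU = 17540483.97
  luxmark: GPU - Room = 150
  luxmark: GPU - Sala = 319
  luxmark: GPU - Luxball HDR = 2162
  mandelbulbgpu: GPU = 7729296.07
  opendwarfs: LU Decomposition = 110.81
  opendwarfs: Compressed Sparse Row = 6.55

GeForce GTX 650
  rodinia: OpenCL Heartwall = 20.14
  rodinia: OpenCL Leukocyte = 30.39
  rodinia: OpenCL Particle Filter = 66.48
  mandelgpu: GPU = 12405397.70
  juliagpu: GPU = 10841150.33
  luxmark: GPU - Room = 82
  luxmark: GPU - Sala = 177
  luxmark: GPU - Luxball HDR = 1183
  mandelbulbgpu: GPU = 4992671.83
  opendwarfs: LU Decomposition = 152.70
  opendwarfs: Compressed Sparse Row = 6.76

GeForce GTX 460
  rodinia: OpenCL Myocyte = 62.81
  rodinia: OpenCL Heartwall = 4.80
  rodinia: OpenCL Leukocyte = 24.66
  rodinia: OpenCL Particle Filter = 35.96
  mandelgpu: GPU = 21375259.40
  juliagpu: GPU = 21199859.50
  luxmark: GPU - Sala = 382
  luxmark: GPU - Luxball HDR = 2591
  mandelbulbgpu: GPU = 9382926.17
  opendwarfs: LU Decomposition = 97.69
  opendwarfs: Compressed Sparse Row = 6.40

GeForce GT 610
  rodinia: OpenCL Myocyte = 95.88
  rodinia: OpenCL Heartwall = 22.72
  rodinia: OpenCL Leukocyte = 100.31
  rodinia: OpenCL Particle Filter = 15.28
  mandelgpu: GPU = 3715320.13
  juliagpu: GPU = 4060660.07
  luxmark: GPU - Room = 27
  luxmark: GPU - Sala = 42
  luxmark: GPU - Luxball HDR = 315
  mandelbulbgpu: GPU = 1776400.70
  opendwarfs: LU Decomposition = 458.26
  opendwarfs: Compressed Sparse Row = 9.00

GeForce GT 240
  rodinia: OpenCL Myocyte = 137.90
  rodinia: OpenCL Heartwall = 36.99
  rodinia: OpenCL Leukocyte = 40.49
  mandelgpu: GPU = 5291547.00
  juliagpu: GPU = 13543191.90
  luxmark: GPU - Luxball HDR = 1051
  mandelbulbgpu: GPU = 7095143.77
  opendwarfs: LU Decomposition = 220.27
  opendwarfs: Compressed Sparse Row = 6.88

Radeon HD 6870
  rodinia: OpenCL Myocyte = 493.51
  rodinia: OpenCL Heartwall = 19.67
  rodinia: OpenCL Leukocyte = 89.87
  luxmark: GPU - Room = 107
  luxmark: GPU - Sala = 631
  luxmark: GPU - Luxball HDR = 6149
  opendwarfs: LU Decomposition = 161.42
  opendwarfs: Compressed Sparse Row = 24.50

Radeon HD 7850
  rodinia: OpenCL LavaMD = 7.66
  rodinia: OpenCL Myocyte = 508.49
  rodinia: OpenCL Heartwall = 6.79
  rodinia: OpenCL Leukocyte = 89.15
  rodinia: OpenCL Particle Filter = 18.11
  luxmark: GPU - Room = 624
  luxmark: GPU - Sala = 1134
  luxmark: GPU - Luxball HDR = 8361
  opendwarfs: LU Decomposition = 135.79
  opendwarfs: Compressed Sparse Row = 18.09

Radeon HD 7950
  rodinia: OpenCL LavaMD = 5.79
  rodinia: OpenCL Myocyte = 481.62
  rodinia: OpenCL Heartwall = 5.88
  rodinia: OpenCL Leukocyte = 83.83
  rodinia: OpenCL Particle Filter = 10.78
  luxmark: GPU - Room = 1002
  luxmark: GPU - Sala = 1896
  luxmark: GPU - Luxball HDR = 13087
  opendwarfs: LU Decomposition = 123.96
  opendwarfs: Compressed Sparse Row = 17.83

Radeon R9 270X
  rodinia: OpenCL LavaMD = 6.17
  rodinia: OpenCL Myocyte = 485.32
  rodinia: OpenCL Heartwall = 5.40
  rodinia: OpenCL Leukocyte = 84.66
  rodinia: OpenCL Particle Filter = 13.01
  luxmark: GPU - Room = 867
  luxmark: GPU - Sala = 1588
  luxmark: GPU - Luxball HDR = 11031
  opendwarfs: LU Decomposition = 126.60
  opendwarfs: Compressed Sparse Row = 16.25

GeForce GTX 470
  rodinia: OpenCL Myocyte = 80.58
  rodinia: OpenCL Heartwall = 3.75
  rodinia: OpenCL Particle Filter = 17.46
  mandelgpu: GPU = 28658784.40
  juliagpu: GPU = 29347833.20
  mandelbulbgpu: GPU = 14828161.27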
Rodinia 2.4 - Test: OpenCL LavaMD (Seconds, Fewer Is Better)
  Radeon R9 270X: 6.17 (SE +/- 0.01, N = 3)
  Radeon HD 7950: 5.79 (SE +/- 0.01, N = 3)
  Radeon HD 7850: 7.66 (SE +/- 0.02, N = 3)
  GeForce GTX 680: 9.26 (SE +/- 0.14, N = 6)
  1. (CXX) g++ options: -O2 -lOpenCL
Rodinia 2.4 - Test: OpenCL Myocyte (Seconds, Fewer Is Better)
  GeForce GTX 470: 80.58 (SE +/- 1.37, N = 6)
  Radeon R9 270X: 485.32 (SE +/- 0.67, N = 3)
  Radeon HD 7950: 481.62 (SE +/- 0.84, N = 3)
  Radeon HD 7850: 508.49 (SE +/- 0.96, N = 3)
  Radeon HD 6870: 493.51 (SE +/- 1.88, N = 3)
  GeForce GT 240: 137.90 (SE +/- 0.12, N = 3)
  GeForce GT 610: 95.88 (SE +/- 0.08, N = 3)
  GeForce GTX 460: 62.81 (SE +/- 0.07, N = 3)
  GeForce GTX 680: 60.78 (SE +/- 1.14, N = 3)
  1. (CXX) g++ options: -O2 -lOpenCL
Rodinia 2.4 - Test: OpenCL Heartwall (Seconds, Fewer Is Better)
  GeForce GTX 470: 3.75 (SE +/- 0.26, N = 6)
  Radeon R9 270X: 5.40 (SE +/- 0.08, N = 3)
  Radeon HD 7950: 5.88 (SE +/- 0.11, N = 3)
  Radeon HD 7850: 6.79 (SE +/- 0.05, N = 3)
  Radeon HD 6870: 19.67 (SE +/- 0.13, N = 3)
  GeForce GT 240: 36.99 (SE +/- 0.28, N = 3)
  GeForce GT 610: 22.72 (SE +/- 0.10, N = 3)
  GeForce GTX 460: 4.80 (SE +/- 0.07, N = 3)
  GeForce GTX 650: 20.14 (SE +/- 0.10, N = 3)
  GeForce GTX 550 Ti: 5.49 (SE +/- 0.13, N = 6)
  GeForce GTX 680: 5.42 (SE +/- 0.09, N = 6)
  1. (CXX) g++ options: -O2 -lOpenCL
Rodinia 2.4 - Test: OpenCL Leukocyte (Seconds, Fewer Is Better)
  Radeon R9 270X: 84.66 (SE +/- 6.06, N = 6)
  Radeon HD 7950: 83.83 (SE +/- 4.92, N = 6)
  Radeon HD 7850: 89.15 (SE +/- 5.05, N = 6)
  Radeon HD 6870: 89.87 (SE +/- 3.37, N = 6)
  GeForce GT 240: 40.49 (SE +/- 0.75, N = 3)
  GeForce GT 610: 100.31 (SE +/- 1.23, N = 3)
  GeForce GTX 460: 24.66 (SE +/- 0.30, N = 3)
  GeForce GTX 650: 30.39 (SE +/- 0.26, N = 3)
  GeForce GTX 550 Ti: 27.70 (SE +/- 0.48, N = 4)
  GeForce GTX 680: 14.02 (SE +/- 0.24, N = 4)
  1. (CXX) g++ options: -O2 -lOpenCL
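Several Leukocyte results above carry large standard errors, so small gaps between cards may not be meaningful. The sketch below is a rough reading aid and not part of the original result file: it assumes the reported "SE" is the standard error of the mean and that runs on different cards are independent, and it expresses the gap between two means in units of their combined standard error. The helper name `gap_in_standard_errors` is hypothetical; the numbers are copied from the Leukocyte results above.

```python
import math

# Leukocyte results copied from the results above: (mean seconds, reported SE).
LEUKOCYTE = {
    "Radeon R9 270X": (84.66, 6.06),
    "Radeon HD 7950": (83.83, 4.92),
    "GeForce GTX 680": (14.02, 0.24),
}


def gap_in_standard_errors(a, b):
    """Hypothetical helper: |mean_a - mean_b| divided by the combined SE.

    Assumes both SE values are standard errors of independent means.
    """
    (mean_a, se_a), (mean_b, se_b) = a, b
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


close_pair = gap_in_standard_errors(
    LEUKOCYTE["Radeon R9 270X"], LEUKOCYTE["Radeon HD 7950"]
)
wide_pair = gap_in_standard_errors(
    LEUKOCYTE["Radeon R9 270X"], LEUKOCYTE["GeForce GTX 680"]
)

print(f"R9 270X vs HD 7950: {close_pair:.2f} combined standard errors apart")
print(f"R9 270X vs GTX 680: {wide_pair:.2f} combined standard errors apart")
```

Under these assumptions, a gap well below one combined standard error (as for the two Radeon cards) suggests the ordering could flip on a rerun, while a gap of many standard errors (as against the GeForce GTX 680) is unlikely to be noise.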
Rodinia 2.4 - Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
  GeForce GTX 470: 17.46 (SE +/- 0.23, N = 6)
  Radeon R9 270X: 13.01 (SE +/- 0.02, N = 3)
  Radeon HD 7950: 10.78 (SE +/- 0.00, N = 3)
  Radeon HD 7850: 18.11 (SE +/- 0.02, N = 3)
  GeForce GT 610: 15.28 (SE +/- 0.57, N = 6)
  GeForce GTX 460: 35.96 (SE +/- 0.01, N = 3)
  GeForce GTX 650: 66.48 (SE +/- 0.07, N = 3)
  GeForce GTX 550 Ti: 44.35 (SE +/- 0.24, N = 3)
  GeForce GTX 680: 18.19 (SE +/- 0.17, N = 3)
  1. (CXX) g++ options: -O2 -lOpenCL
MandelGPU 1.3pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
  GeForce GTX 470: 28658784.40 (SE +/- 4541.93, N = 3)
  GeForce GT 240: 5291547.00 (SE +/- 217.94, N = 3)
  GeForce GT 610: 3715320.13 (SE +/- 230.98, N = 3)
  GeForce GTX 460: 21375259.40 (SE +/- 2448.11, N = 3)
  GeForce GTX 650: 12405397.70 (SE +/- 857.65, N = 3)
  GeForce GTX 550 Ti: 17362190.07 (SE +/- 6289.55, N = 3)
  GeForce GTX 680: 47966936.67 (SE +/- 5781.86, N = 3)
  1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
JuliaGPU 1.2pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
  GeForce GTX 470: 29347833.20 (SE +/- 120737.03, N = 3)
  GeForce GT 240: 13543191.90 (SE +/- 17817.43, N = 3)
  GeForce GT 610: 4060660.07 (SE +/- 2505.78, N = 3)
  GeForce GTX 460: 21199859.50 (SE +/- 4955.24, N = 3)
  GeForce GTX 650: 10841150.33 (SE +/- 2871.63, N = 3)
  GeForce GTX 550 Ti: 17540483.97 (SE +/- 13067.53, N = 3)
  GeForce GTX 680: 36123507.37 (SE +/- 72150.98, N = 3)
  1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Room (Score, More Is Better)
  Radeon R9 270X: 867 (SE +/- 0.67, N = 3)
  Radeon HD 7950: 1002 (SE +/- 0.58, N = 3)
  Radeon HD 7850: 624 (SE +/- 0.00, N = 3)
  Radeon HD 6870: 107 (SE +/- 0.00, N = 3)
  GeForce GT 610: 27 (SE +/- 0.33, N = 3)
  GeForce GTX 650: 82 (SE +/- 0.00, N = 3)
  GeForce GTX 550 Ti: 150 (SE +/- 0.33, N = 3)
  GeForce GTX 680: 296 (SE +/- 0.33, N = 3)
LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Sala (Score, More Is Better)
  Radeon R9 270X: 1588 (SE +/- 0.00, N = 3)
  Radeon HD 7950: 1896 (SE +/- 0.67, N = 3)
  Radeon HD 7850: 1134 (SE +/- 0.33, N = 3)
  Radeon HD 6870: 631 (SE +/- 0.00, N = 3)
  GeForce GT 610: 42 (SE +/- 0.00, N = 3)
  GeForce GTX 460: 382 (SE +/- 0.00, N = 3)
  GeForce GTX 650: 177 (SE +/- 0.00, N = 3)
  GeForce GTX 550 Ti: 319 (SE +/- 0.00, N = 3)
  GeForce GTX 680: 651 (SE +/- 0.00, N = 3)
LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)
  Radeon R9 270X: 11031 (SE +/- 3.61, N = 3)
  Radeon HD 7950: 13087 (SE +/- 4.84, N = 3)
  Radeon HD 7850: 8361 (SE +/- 1.45, N = 3)
  Radeon HD 6870: 6149 (SE +/- 0.88, N = 3)
  GeForce GT 240: 1051 (SE +/- 0.00, N = 3)
  GeForce GT 610: 315 (SE +/- 0.00, N = 3)
  GeForce GTX 460: 2591 (SE +/- 0.58, N = 3)
  GeForce GTX 650: 1183 (SE +/- 0.00, N = 3)
  GeForce GTX 550 Ti: 2162 (SE +/- 0.00, N = 3)
  GeForce GTX 680: 4216 (SE +/- 0.67, N = 3)
MandelbulbGPU 1.0pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
  GeForce GTX 470: 14828161.27 (SE +/- 58117.66, N = 3)
  GeForce GT 240: 7095143.77 (SE +/- 3953.96, N = 3)
  GeForce GT 610: 1776400.70 (SE +/- 614.61, N = 3)
  GeForce GTX 460: 9382926.17 (SE +/- 10640.98, N = 3)
  GeForce GTX 650: 4992671.83 (SE +/- 4281.68, N = 3)
  GeForce GTX 550 Ti: 7729296.07 (SE +/- 4259.44, N = 3)
  GeForce GTX 680: 17083392.33 (SE +/- 72849.89, N = 3)
  1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
OpenDwarfs 2013-11-06 - Test: LU Decomposition (ms, Fewer Is Better)
  Radeon R9 270X: 126.60 (SE +/- 4.83, N = 6)
  Radeon HD 7950: 123.96 (SE +/- 4.65, N = 6)
  Radeon HD 7850: 135.79 (SE +/- 5.24, N = 6)
  Radeon HD 6870: 161.42 (SE +/- 4.12, N = 6)
  GeForce GT 240: 220.27 (SE +/- 1.06, N = 3)
  GeForce GT 610: 458.26 (SE +/- 0.06, N = 3)
  GeForce GTX 460: 97.69 (SE +/- 0.03, N = 3)
  GeForce GTX 650: 152.70 (SE +/- 0.21, N = 3)
  GeForce GTX 550 Ti: 110.81 (SE +/- 0.14, N = 3)
  GeForce GTX 680: 58.94 (SE +/- 0.88, N = 5)
  1. (CC) gcc options: -lm -lOpenCL
OpenDwarfs 2013-11-06 - Test: Compressed Sparse Row (ms, Fewer Is Better)
  Radeon R9 270X: 16.25 (SE +/- 0.89, N = 6)
  Radeon HD 7950: 17.83 (SE +/- 0.86, N = 6)
  Radeon HD 7850: 18.09 (SE +/- 0.50, N = 6)
  Radeon HD 6870: 24.50 (SE +/- 0.81, N = 6)
  GeForce GT 240: 6.88 (SE +/- 0.11, N = 3)
  GeForce GT 610: 9.00 (SE +/- 0.56, N = 6)
  GeForce GTX 460: 6.40 (SE +/- 0.76, N = 6)
  GeForce GTX 650: 6.76 (SE +/- 0.57, N = 6)
  GeForce GTX 550 Ti: 6.55 (SE +/- 0.80, N = 6)
  GeForce GTX 680: 6.10 (SE +/- 0.58, N = 6)
  1. (CC) gcc options: -lm -lOpenCL
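Because the results mix "More Is Better" scores with "Fewer Is Better" timings, comparing cards across tests is easier after normalizing to a baseline. The following is a minimal sketch, not part of the original result file: it takes the LuxMark Luxball HDR scores and OpenDwarfs LU Decomposition times reported above for four cards and expresses each relative to the GeForce GTX 680, inverting the ratio for timings so that values above 1.0 always mean "faster than the baseline". The helper name `relative_performance` and the choice of baseline are illustrative assumptions.

```python
# Values copied from the results above.
LUXBALL_HDR_SCORE = {  # LuxMark 2.1beta1, more is better
    "GeForce GTX 680": 4216,
    "Radeon HD 7850": 8361,
    "Radeon HD 7950": 13087,
    "Radeon R9 270X": 11031,
}
LU_DECOMPOSITION_MS = {  # OpenDwarfs 2013-11-06, fewer is better
    "GeForce GTX 680": 58.94,
    "Radeon HD 7850": 135.79,
    "Radeon HD 7950": 123.96,
    "Radeon R9 270X": 126.60,
}


def relative_performance(results, baseline, higher_is_better):
    """Hypothetical helper: scale results so the baseline card equals 1.0.

    For timings (higher_is_better=False) the ratio is inverted, so a value
    above 1.0 always means the card outperformed the baseline.
    """
    base = results[baseline]
    if higher_is_better:
        return {gpu: value / base for gpu, value in results.items()}
    return {gpu: base / value for gpu, value in results.items()}


lux = relative_performance(LUXBALL_HDR_SCORE, "GeForce GTX 680", True)
lu = relative_performance(LU_DECOMPOSITION_MS, "GeForce GTX 680", False)

for gpu in LUXBALL_HDR_SCORE:
    print(f"{gpu}: Luxball HDR {lux[gpu]:.2f}x, LU Decomposition {lu[gpu]:.2f}x")
```

On these numbers the Radeon cards land above 1.0 for Luxball HDR and below 1.0 for LU Decomposition, which illustrates why a single test should not be read as an overall ranking.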
Phoronix Test Suite v10.8.5