NVIDIA AMD OpenCL Linux GPGPU Tests
Benchmarks by Michael Larabel for a future article on Phoronix: a range of OpenBenchmarking.org OpenCL benchmarks run on a range of AMD Radeon and NVIDIA GeForce graphics cards with the proprietary drivers under Ubuntu Linux.
HTML result view exported from: https://openbenchmarking.org/result/1401213-PL-1311092SO60&grr&sro .
System table (fields that the export does not repeat for a card are omitted):

GeForce GT 240
  Processor: AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)
  Motherboard: ASUS Crosshair V Formula
  Chipset: AMD RD890 bridge
  Memory: 8192MB
  Disk: 64GB OCZ AGILITY
  Graphics: ECS NVIDIA GeForce GT 240 512MB (550/1700MHz)
  Audio: Realtek ALC889
  Network: Intel 82583V Gigabit Connection
  OS: Ubuntu 13.10
  Kernel: 3.11.0-12-generic (x86_64)
  Desktop: Unity 7.1.2
  Display Server: X Server 1.14.3
  Display Driver: NVIDIA 331.20
  OpenGL: 3.3.0
  Compiler: GCC 4.8
  File-System: ext4
  Screen Resolution: 2560x1600
GeForce GTX 460
  Graphics: ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)
  OpenGL: 4.3.0
GeForce GTX 550 Ti
  Graphics: eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)
GeForce GT 610
  Graphics: NVIDIA GeForce GT 610 1024MB (810/533MHz)
GeForce GTX 650
  Graphics: MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)
GeForce GTX 680
  Graphics: NVIDIA GeForce GTX 680 2048MB (705/3004MHz)
Radeon HD 6870
  Graphics: Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)
  Monitor: SyncMaster + SyncMaster
  Display Driver: fglrx 13.25.5
  OpenGL: 4.3.12614
Radeon HD 7850
  Graphics: ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)
Radeon HD 7950
  Graphics: XFX AMD Radeon HD 7900 3072MB (900/1375MHz)
Radeon R9 270X
  Graphics: Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)
GeForce GTX 470
  Processor: AMD Phenom II X4 955 @ 3.20GHz (4 Cores)
  Motherboard: Gigabyte GA-MA790X-UD3P
  Memory: 12288MB
  Disk: 2 x 1000GB SAMSUNG HD103UJ + 2000GB Hitachi HDS5C302 + 500GB SAMSUNG HD501LJ + 2000GB HDS5C3020ALA632
  Graphics: NVIDIA GeForce GTX 470 /3DNOW! 1280MB (656/1701MHz)
  Monitor: Philips 230W
  OS: Gentoo Base 2.2
  Kernel: 3.7.6-gentoo-k8-31 (i686)
  Desktop: KDE 4.10.5
  Display Driver / OpenGL (as exported): NVIDIA 331.20 4.4.0 NVIDIA 331.20
  Compiler: GCC 4.6.3 + LLVM 3.1
  Screen Resolution: 1920x1200

Compiler Details - GeForce GT 240, GeForce GTX 460, GeForce GTX 550 Ti, GeForce GT 610, GeForce GTX 650, GeForce GTX 680, Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
Processor Details - Scaling Governor: acpi-cpufreq ondemand
OpenCL Details - GPU Compute Cores:
  GeForce GT 240: 96
  GeForce GTX 460: 336
  GeForce GTX 550 Ti: 192
  GeForce GT 610: 48
  GeForce GTX 650: 384
  GeForce GTX 680: 1536
  GeForce GTX 470: 448
System Details - repeats the same GPU Compute Cores figures for the same seven GeForce cards.
Environment Details - Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri
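NVIDIA AMD OpenCL Linux GPGPU Tests - result overview (one line per card; tests with no result for a card are omitted)
GeForce GT 240: opendwarfs Compressed Sparse Row 6.88; opendwarfs LU Decomposition 220.27; luxmark GPU - Luxball HDR 1051; mandelgpu GPU 5291547.00; mandelbulbgpu GPU 7095143.77; juliagpu GPU 13543191.90; rodinia OpenCL Leukocyte 40.49; rodinia OpenCL Heartwall 36.99; rodinia OpenCL Myocyte 137.90
GeForce GTX 460: opendwarfs Compressed Sparse Row 6.40; opendwarfs LU Decomposition 97.69; luxmark GPU - Luxball HDR 2591; luxmark GPU - Sala 382; mandelgpu GPU 21375259.40; mandelbulbgpu GPU 9382926.17; juliagpu GPU 21199859.50; rodinia OpenCL Particle Filter 35.96; rodinia OpenCL Leukocyte 24.66; rodinia OpenCL Heartwall 4.80; rodinia OpenCL Myocyte 62.81
GeForce GTX 550 Ti: opendwarfs Compressed Sparse Row 6.55; opendwarfs LU Decomposition 110.81; luxmark GPU - Luxball HDR 2162; luxmark GPU - Sala 319; luxmark GPU - Room 150; mandelgpu GPU 17362190.07; mandelbulbgpu GPU 7729296.07; juliagpu GPU 17540483.97; rodinia OpenCL Particle Filter 44.35; rodinia OpenCL Leukocyte 27.70; rodinia OpenCL Heartwall 5.49
GeForce GT 610: opendwarfs Compressed Sparse Row 9.00; opendwarfs LU Decomposition 458.26; luxmark GPU - Luxball HDR 315; luxmark GPU - Sala 42; luxmark GPU - Room 27; mandelgpu GPU 3715320.13; mandelbulbgpu GPU 1776400.70; juliagpu GPU 4060660.07; rodinia OpenCL Particle Filter 15.28; rodinia OpenCL Leukocyte 100.31; rodinia OpenCL Heartwall 22.72; rodinia OpenCL Myocyte 95.88
GeForce GTX 650: opendwarfs Compressed Sparse Row 6.76; opendwarfs LU Decomposition 152.70; luxmark GPU - Luxball HDR 1183; luxmark GPU - Sala 177; luxmark GPU - Room 82; mandelgpu GPU 12405397.70; mandelbulbgpu GPU 4992671.83; juliagpu GPU 10841150.33; rodinia OpenCL Particle Filter 66.48; rodinia OpenCL Leukocyte 30.39; rodinia OpenCL Heartwall 20.14
GeForce GTX 680: opendwarfs Compressed Sparse Row 6.10; opendwarfs LU Decomposition 58.94; luxmark GPU - Luxball HDR 4216; luxmark GPU - Sala 651; luxmark GPU - Room 296; mandelgpu GPU 47966936.67; mandelbulbgpu GPU 17083392.33; juliagpu GPU 36123507.37; rodinia OpenCL Particle Filter 18.19; rodinia OpenCL Leukocyte 14.02; rodinia OpenCL Heartwall 5.42; rodinia OpenCL Myocyte 60.78; rodinia OpenCL LavaMD 9.26
Radeon HD 6870: opendwarfs Compressed Sparse Row 24.50; opendwarfs LU Decomposition 161.42; luxmark GPU - Luxball HDR 6149; luxmark GPU - Sala 631; luxmark GPU - Room 107; rodinia OpenCL Leukocyte 89.87; rodinia OpenCL Heartwall 19.67; rodinia OpenCL Myocyte 493.51
Radeon HD 7850: opendwarfs Compressed Sparse Row 18.09; opendwarfs LU Decomposition 135.79; luxmark GPU - Luxball HDR 8361; luxmark GPU - Sala 1134; luxmark GPU - Room 624; rodinia OpenCL Particle Filter 18.11; rodinia OpenCL Leukocyte 89.15; rodinia OpenCL Heartwall 6.79; rodinia OpenCL Myocyte 508.49; rodinia OpenCL LavaMD 7.66
Radeon HD 7950: opendwarfs Compressed Sparse Row 17.83; opendwarfs LU Decomposition 123.96; luxmark GPU - Luxball HDR 13087; luxmark GPU - Sala 1896; luxmark GPU - Room 1002; rodinia OpenCL Particle Filter 10.78; rodinia OpenCL Leukocyte 83.83; rodinia OpenCL Heartwall 5.88; rodinia OpenCL Myocyte 481.62; rodinia OpenCL LavaMD 5.79
Radeon R9 270X: opendwarfs Compressed Sparse Row 16.25; opendwarfs LU Decomposition 126.60; luxmark GPU - Luxball HDR 11031; luxmark GPU - Sala 1588; luxmark GPU - Room 867; rodinia OpenCL Particle Filter 13.01; rodinia OpenCL Leukocyte 84.66; rodinia OpenCL Heartwall 5.40; rodinia OpenCL Myocyte 485.32; rodinia OpenCL LavaMD 6.17
GeForce GTX 470: mandelgpu GPU 28658784.40; mandelbulbgpu GPU 14828161.27; juliagpu GPU 29347833.20; rodinia OpenCL Particle Filter 17.46; rodinia OpenCL Heartwall 3.75; rodinia OpenCL Myocyte 80.58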
OpenDwarfs 2013-11-06 - Test: Compressed Sparse Row
OpenBenchmarking.org - ms, Fewer Is Better
  GeForce GT 240: 6.88 (SE +/- 0.11, N = 3)
  GeForce GT 610: 9.00 (SE +/- 0.56, N = 6)
  GeForce GTX 460: 6.40 (SE +/- 0.76, N = 6)
  GeForce GTX 550 Ti: 6.55 (SE +/- 0.80, N = 6)
  GeForce GTX 650: 6.76 (SE +/- 0.57, N = 6)
  GeForce GTX 680: 6.10 (SE +/- 0.58, N = 6)
  Radeon HD 6870: 24.50 (SE +/- 0.81, N = 6)
  Radeon HD 7850: 18.09 (SE +/- 0.50, N = 6)
  Radeon HD 7950: 17.83 (SE +/- 0.86, N = 6)
  Radeon R9 270X: 16.25 (SE +/- 0.89, N = 6)
1. (CC) gcc options: -lm -lOpenCL
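Each result in this export is reported as a mean with a standard error (SE) over N runs. As a rough reading aid, the sketch below converts a few of the Compressed Sparse Row figures into a relative SE and an implied run-to-run standard deviation. It assumes SE = s / sqrt(N), the usual definition; the export does not state which formula the Phoronix Test Suite uses, and the names csr_results and spread are illustrative only.

```python
import math

# (mean in ms, standard error, number of runs), copied from the
# Compressed Sparse Row results in this export.
csr_results = {
    "GeForce GT 240": (6.88, 0.11, 3),
    "GeForce GTX 680": (6.10, 0.58, 6),
    "Radeon HD 6870": (24.50, 0.81, 6),
}


def spread(mean, se, n):
    """Return (SE as a percentage of the mean, implied sample std dev).

    Assumption: SE = s / sqrt(N). The export does not document its formula.
    """
    return 100.0 * se / mean, se * math.sqrt(n)


summary = {card: spread(*row) for card, row in csr_results.items()}

for card, (rel_se, std_dev) in summary.items():
    print(f"{card}: SE is {rel_se:.1f}% of the mean, "
          f"implied std dev {std_dev:.2f} ms")
```

Under that assumption, most of the GeForce results in this test differ from one another by about as much as their reported standard errors, so small gaps between those cards should be read with caution.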
OpenDwarfs 2013-11-06 - Test: LU Decomposition
OpenBenchmarking.org - ms, Fewer Is Better
  GeForce GT 240: 220.27 (SE +/- 1.06, N = 3)
  GeForce GT 610: 458.26 (SE +/- 0.06, N = 3)
  GeForce GTX 460: 97.69 (SE +/- 0.03, N = 3)
  GeForce GTX 550 Ti: 110.81 (SE +/- 0.14, N = 3)
  GeForce GTX 650: 152.70 (SE +/- 0.21, N = 3)
  GeForce GTX 680: 58.94 (SE +/- 0.88, N = 5)
  Radeon HD 6870: 161.42 (SE +/- 4.12, N = 6)
  Radeon HD 7850: 135.79 (SE +/- 5.24, N = 6)
  Radeon HD 7950: 123.96 (SE +/- 4.65, N = 6)
  Radeon R9 270X: 126.60 (SE +/- 4.83, N = 6)
1. (CC) gcc options: -lm -lOpenCL
LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Luxball HDR
OpenBenchmarking.org - Score, More Is Better
  GeForce GT 240: 1051 (SE +/- 0.00, N = 3)
  GeForce GT 610: 315 (SE +/- 0.00, N = 3)
  GeForce GTX 460: 2591 (SE +/- 0.58, N = 3)
  GeForce GTX 550 Ti: 2162 (SE +/- 0.00, N = 3)
  GeForce GTX 650: 1183 (SE +/- 0.00, N = 3)
  GeForce GTX 680: 4216 (SE +/- 0.67, N = 3)
  Radeon HD 6870: 6149 (SE +/- 0.88, N = 3)
  Radeon HD 7850: 8361 (SE +/- 1.45, N = 3)
  Radeon HD 7950: 13087 (SE +/- 4.84, N = 3)
  Radeon R9 270X: 11031 (SE +/- 3.61, N = 3)
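The results in this export mix "Fewer Is Better" timings with "More Is Better" scores, which makes cards hard to compare across tests by eye. The sketch below is one possible way to put both kinds on a common scale, where the best card in a test gets 1.0. The function name relative_to_best and the choice of cards are illustrative assumptions and not part of the Phoronix Test Suite; the numbers are copied from the LU Decomposition and Luxball HDR results.

```python
def relative_to_best(results, higher_is_better):
    """Scale results so the best card is 1.0 and every other card is below 1.0."""
    if higher_is_better:
        best = max(results.values())
        return {card: value / best for card, value in results.items()}
    # For timings the smallest value is best, so invert the ratio.
    best = min(results.values())
    return {card: best / value for card, value in results.items()}


# OpenDwarfs LU Decomposition, ms, fewer is better.
lu_ms = {
    "GeForce GT 610": 458.26,
    "GeForce GTX 460": 97.69,
    "GeForce GTX 680": 58.94,
    "Radeon HD 7950": 123.96,
}

# LuxMark Luxball HDR, score, more is better.
luxball_score = {
    "GeForce GT 610": 315,
    "GeForce GTX 460": 2591,
    "GeForce GTX 680": 4216,
    "Radeon HD 7950": 13087,
}

lu_rel = relative_to_best(lu_ms, higher_is_better=False)
lux_rel = relative_to_best(luxball_score, higher_is_better=True)

for card in lu_ms:
    print(f"{card}: LU {lu_rel[card]:.2f}, Luxball HDR {lux_rel[card]:.2f}")
```

For these four cards the GeForce GTX 680 is the reference in LU Decomposition and the Radeon HD 7950 is the reference in Luxball HDR, which illustrates how strongly the ranking depends on the workload.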
LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Sala
OpenBenchmarking.org - Score, More Is Better
  GeForce GT 610: 42 (SE +/- 0.00, N = 3)
  GeForce GTX 460: 382 (SE +/- 0.00, N = 3)
  GeForce GTX 550 Ti: 319 (SE +/- 0.00, N = 3)
  GeForce GTX 650: 177 (SE +/- 0.00, N = 3)
  GeForce GTX 680: 651 (SE +/- 0.00, N = 3)
  Radeon HD 6870: 631 (SE +/- 0.00, N = 3)
  Radeon HD 7850: 1134 (SE +/- 0.33, N = 3)
  Radeon HD 7950: 1896 (SE +/- 0.67, N = 3)
  Radeon R9 270X: 1588 (SE +/- 0.00, N = 3)
LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Room
OpenBenchmarking.org - Score, More Is Better
  GeForce GT 610: 27 (SE +/- 0.33, N = 3)
  GeForce GTX 550 Ti: 150 (SE +/- 0.33, N = 3)
  GeForce GTX 650: 82 (SE +/- 0.00, N = 3)
  GeForce GTX 680: 296 (SE +/- 0.33, N = 3)
  Radeon HD 6870: 107 (SE +/- 0.00, N = 3)
  Radeon HD 7850: 624 (SE +/- 0.00, N = 3)
  Radeon HD 7950: 1002 (SE +/- 0.58, N = 3)
  Radeon R9 270X: 867 (SE +/- 0.67, N = 3)
MandelGPU 1.3pts1 - OpenCL Device: GPU
OpenBenchmarking.org - Samples/sec, More Is Better
  GeForce GT 240: 5291547.00 (SE +/- 217.94, N = 3)
  GeForce GT 610: 3715320.13 (SE +/- 230.98, N = 3)
  GeForce GTX 460: 21375259.40 (SE +/- 2448.11, N = 3)
  GeForce GTX 470: 28658784.40 (SE +/- 4541.93, N = 3)
  GeForce GTX 550 Ti: 17362190.07 (SE +/- 6289.55, N = 3)
  GeForce GTX 650: 12405397.70 (SE +/- 857.65, N = 3)
  GeForce GTX 680: 47966936.67 (SE +/- 5781.86, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
MandelbulbGPU 1.0pts1 - OpenCL Device: GPU
OpenBenchmarking.org - Samples/sec, More Is Better
  GeForce GT 240: 7095143.77 (SE +/- 3953.96, N = 3)
  GeForce GT 610: 1776400.70 (SE +/- 614.61, N = 3)
  GeForce GTX 460: 9382926.17 (SE +/- 10640.98, N = 3)
  GeForce GTX 470: 14828161.27 (SE +/- 58117.66, N = 3)
  GeForce GTX 550 Ti: 7729296.07 (SE +/- 4259.44, N = 3)
  GeForce GTX 650: 4992671.83 (SE +/- 4281.68, N = 3)
  GeForce GTX 680: 17083392.33 (SE +/- 72849.89, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
JuliaGPU 1.2pts1 - OpenCL Device: GPU
OpenBenchmarking.org - Samples/sec, More Is Better
  GeForce GT 240: 13543191.90 (SE +/- 17817.43, N = 3)
  GeForce GT 610: 4060660.07 (SE +/- 2505.78, N = 3)
  GeForce GTX 460: 21199859.50 (SE +/- 4955.24, N = 3)
  GeForce GTX 470: 29347833.20 (SE +/- 120737.03, N = 3)
  GeForce GTX 550 Ti: 17540483.97 (SE +/- 13067.53, N = 3)
  GeForce GTX 650: 10841150.33 (SE +/- 2871.63, N = 3)
  GeForce GTX 680: 36123507.37 (SE +/- 72150.98, N = 3)
1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
Rodinia 2.4 - Test: OpenCL Particle Filter
OpenBenchmarking.org - Seconds, Fewer Is Better
  GeForce GT 610: 15.28 (SE +/- 0.57, N = 6)
  GeForce GTX 460: 35.96 (SE +/- 0.01, N = 3)
  GeForce GTX 470: 17.46 (SE +/- 0.23, N = 6)
  GeForce GTX 550 Ti: 44.35 (SE +/- 0.24, N = 3)
  GeForce GTX 650: 66.48 (SE +/- 0.07, N = 3)
  GeForce GTX 680: 18.19 (SE +/- 0.17, N = 3)
  Radeon HD 7850: 18.11 (SE +/- 0.02, N = 3)
  Radeon HD 7950: 10.78 (SE +/- 0.00, N = 3)
  Radeon R9 270X: 13.01 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
Rodinia 2.4 - Test: OpenCL Leukocyte
OpenBenchmarking.org - Seconds, Fewer Is Better
  GeForce GT 240: 40.49 (SE +/- 0.75, N = 3)
  GeForce GT 610: 100.31 (SE +/- 1.23, N = 3)
  GeForce GTX 460: 24.66 (SE +/- 0.30, N = 3)
  GeForce GTX 550 Ti: 27.70 (SE +/- 0.48, N = 4)
  GeForce GTX 650: 30.39 (SE +/- 0.26, N = 3)
  GeForce GTX 680: 14.02 (SE +/- 0.24, N = 4)
  Radeon HD 6870: 89.87 (SE +/- 3.37, N = 6)
  Radeon HD 7850: 89.15 (SE +/- 5.05, N = 6)
  Radeon HD 7950: 83.83 (SE +/- 4.92, N = 6)
  Radeon R9 270X: 84.66 (SE +/- 6.06, N = 6)
1. (CXX) g++ options: -O2 -lOpenCL
Rodinia 2.4 - Test: OpenCL Heartwall
OpenBenchmarking.org - Seconds, Fewer Is Better
  GeForce GT 240: 36.99 (SE +/- 0.28, N = 3)
  GeForce GT 610: 22.72 (SE +/- 0.10, N = 3)
  GeForce GTX 460: 4.80 (SE +/- 0.07, N = 3)
  GeForce GTX 470: 3.75 (SE +/- 0.26, N = 6)
  GeForce GTX 550 Ti: 5.49 (SE +/- 0.13, N = 6)
  GeForce GTX 650: 20.14 (SE +/- 0.10, N = 3)
  GeForce GTX 680: 5.42 (SE +/- 0.09, N = 6)
  Radeon HD 6870: 19.67 (SE +/- 0.13, N = 3)
  Radeon HD 7850: 6.79 (SE +/- 0.05, N = 3)
  Radeon HD 7950: 5.88 (SE +/- 0.11, N = 3)
  Radeon R9 270X: 5.40 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
Rodinia 2.4 - Test: OpenCL Myocyte
OpenBenchmarking.org - Seconds, Fewer Is Better
  GeForce GT 240: 137.90 (SE +/- 0.12, N = 3)
  GeForce GT 610: 95.88 (SE +/- 0.08, N = 3)
  GeForce GTX 460: 62.81 (SE +/- 0.07, N = 3)
  GeForce GTX 470: 80.58 (SE +/- 1.37, N = 6)
  GeForce GTX 680: 60.78 (SE +/- 1.14, N = 3)
  Radeon HD 6870: 493.51 (SE +/- 1.88, N = 3)
  Radeon HD 7850: 508.49 (SE +/- 0.96, N = 3)
  Radeon HD 7950: 481.62 (SE +/- 0.84, N = 3)
  Radeon R9 270X: 485.32 (SE +/- 0.67, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
Rodinia 2.4 - Test: OpenCL LavaMD
OpenBenchmarking.org - Seconds, Fewer Is Better
  GeForce GTX 680: 9.26 (SE +/- 0.14, N = 6)
  Radeon HD 7850: 7.66 (SE +/- 0.02, N = 3)
  Radeon HD 7950: 5.79 (SE +/- 0.01, N = 3)
  Radeon R9 270X: 6.17 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
Phoronix Test Suite v10.8.5