NVIDIA AMD OpenCL Linux GPGPU Tests

Benchmarks by Michael Larabel for a future article on Phoronix. These tests cover a range of OpenBenchmarking.org OpenCL benchmarks on a range of AMD Radeon and NVIDIA GeForce graphics cards with the proprietary drivers under Ubuntu Linux.

HTML result view exported from: https://openbenchmarking.org/result/1401213-PL-1311092SO60&rdt.

Test system shared by GeForce GTX 680, GeForce GTX 550 Ti, GeForce GTX 650, GeForce GTX 460, GeForce GT 610, GeForce GT 240, Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X:
- Processor: AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)
- Motherboard: ASUS Crosshair V Formula
- Chipset: AMD RD890 bridge
- Memory: 8192MB
- Disk: 64GB OCZ AGILITY
- Audio: Realtek ALC889
- Network: Intel 82583V Gigabit Connection
- OS: Ubuntu 13.10
- Kernel: 3.11.0-12-generic (x86_64)
- Desktop: Unity 7.1.2
- Display Server: X Server 1.14.3
- Compiler: GCC 4.8
- File-System: ext4
- Screen Resolution: 2560x1600

Graphics, per configuration:
- GeForce GTX 680: NVIDIA GeForce GTX 680 2048MB (705/3004MHz)
- GeForce GTX 550 Ti: eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)
- GeForce GTX 650: MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)
- GeForce GTX 460: ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)
- GeForce GT 610: NVIDIA GeForce GT 610 1024MB (810/533MHz)
- GeForce GT 240: ECS NVIDIA GeForce GT 240 512MB (550/1700MHz)
- Radeon HD 6870: Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)
- Radeon HD 7850: ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)
- Radeon HD 7950: XFX AMD Radeon HD 7900 3072MB (900/1375MHz)
- Radeon R9 270X: Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)

Display Driver and OpenGL, per configuration:
- GeForce GTX 680, GeForce GTX 550 Ti, GeForce GTX 650, GeForce GTX 460, GeForce GT 610: NVIDIA 331.20, OpenGL 4.3.0
- GeForce GT 240: NVIDIA 331.20, OpenGL 3.3.0
- Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: fglrx 13.25.5, OpenGL 4.3.12614

Monitor:
- Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: SyncMaster + SyncMaster

Test system for GeForce GTX 470 (fields as listed in the export):
- Processor: AMD Phenom II X4 955 @ 3.20GHz (4 Cores)
- Motherboard: Gigabyte GA-MA790X-UD3P
- Memory: 12288MB
- Disk: 2 x 1000GB SAMSUNG HD103UJ + 2000GB Hitachi HDS5C302 + 500GB SAMSUNG HD501LJ + 2000GB HDS5C3020ALA632
- Graphics: NVIDIA GeForce GTX 470 /3DNOW! 1280MB (656/1701MHz)
- Monitor: Philips 230W
- OS: Gentoo Base 2.2
- Kernel: 3.7.6-gentoo-k8-31 (i686)
- Desktop: KDE 4.10.5
- Display Driver: NVIDIA 331.20
- OpenGL: 4.4.0 NVIDIA 331.20
- Compiler: GCC 4.6.3 + LLVM 3.1
- Screen Resolution: 1920x1200

Compiler Details
- GeForce GTX 680, GeForce GTX 550 Ti, GeForce GTX 650, GeForce GTX 460, GeForce GT 610, GeForce GT 240, Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
- Scaling Governor: acpi-cpufreq ondemand

OpenCL Details
- GeForce GTX 680: GPU Compute Cores: 1536
- GeForce GTX 550 Ti: GPU Compute Cores: 192
- GeForce GTX 650: GPU Compute Cores: 384
- GeForce GTX 460: GPU Compute Cores: 336
- GeForce GT 610: GPU Compute Cores: 48
- GeForce GT 240: GPU Compute Cores: 96
- GeForce GTX 470: GPU Compute Cores: 448

Environment Details
- Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri

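Result overview. Rodinia results are in seconds and OpenDwarfs results in ms (fewer is better); JuliaGPU, MandelbulbGPU and MandelGPU results are in samples/sec and LuxMark results are scores (more is better). Tests without a result for a given card are omitted.

GeForce GTX 680
- Rodinia: LavaMD 9.26, Myocyte 60.78, Heartwall 5.42, Leukocyte 14.02, Particle Filter 18.19
- JuliaGPU 36123507.37, MandelbulbGPU 17083392.33, MandelGPU 47966936.67
- LuxMark: Room 296, Sala 651, Luxball HDR 4216
- OpenDwarfs: LU Decomposition 58.94, Compressed Sparse Row 6.10

GeForce GTX 550 Ti
- Rodinia: Heartwall 5.49, Leukocyte 27.70, Particle Filter 44.35
- JuliaGPU 17540483.97, MandelbulbGPU 7729296.07, MandelGPU 17362190.07
- LuxMark: Room 150, Sala 319, Luxball HDR 2162
- OpenDwarfs: LU Decomposition 110.81, Compressed Sparse Row 6.55

GeForce GTX 650
- Rodinia: Heartwall 20.14, Leukocyte 30.39, Particle Filter 66.48
- JuliaGPU 10841150.33, MandelbulbGPU 4992671.83, MandelGPU 12405397.70
- LuxMark: Room 82, Sala 177, Luxball HDR 1183
- OpenDwarfs: LU Decomposition 152.70, Compressed Sparse Row 6.76

GeForce GTX 460
- Rodinia: Myocyte 62.81, Heartwall 4.80, Leukocyte 24.66, Particle Filter 35.96
- JuliaGPU 21199859.50, MandelbulbGPU 9382926.17, MandelGPU 21375259.40
- LuxMark: Sala 382, Luxball HDR 2591
- OpenDwarfs: LU Decomposition 97.69, Compressed Sparse Row 6.40

GeForce GT 610
- Rodinia: Myocyte 95.88, Heartwall 22.72, Leukocyte 100.31, Particle Filter 15.28
- JuliaGPU 4060660.07, MandelbulbGPU 1776400.70, MandelGPU 3715320.13
- LuxMark: Room 27, Sala 42, Luxball HDR 315
- OpenDwarfs: LU Decomposition 458.26, Compressed Sparse Row 9.00

GeForce GT 240
- Rodinia: Myocyte 137.90, Heartwall 36.99, Leukocyte 40.49
- JuliaGPU 13543191.90, MandelbulbGPU 7095143.77, MandelGPU 5291547.00
- LuxMark: Luxball HDR 1051
- OpenDwarfs: LU Decomposition 220.27, Compressed Sparse Row 6.88

Radeon HD 6870
- Rodinia: Myocyte 493.51, Heartwall 19.67, Leukocyte 89.87
- LuxMark: Room 107, Sala 631, Luxball HDR 6149
- OpenDwarfs: LU Decomposition 161.42, Compressed Sparse Row 24.50

Radeon HD 7850
- Rodinia: LavaMD 7.66, Myocyte 508.49, Heartwall 6.79, Leukocyte 89.15, Particle Filter 18.11
- LuxMark: Room 624, Sala 1134, Luxball HDR 8361
- OpenDwarfs: LU Decomposition 135.79, Compressed Sparse Row 18.09

Radeon HD 7950
- Rodinia: LavaMD 5.79, Myocyte 481.62, Heartwall 5.88, Leukocyte 83.83, Particle Filter 10.78
- LuxMark: Room 1002, Sala 1896, Luxball HDR 13087
- OpenDwarfs: LU Decomposition 123.96, Compressed Sparse Row 17.83

Radeon R9 270X
- Rodinia: LavaMD 6.17, Myocyte 485.32, Heartwall 5.40, Leukocyte 84.66, Particle Filter 13.01
- LuxMark: Room 867, Sala 1588, Luxball HDR 11031
- OpenDwarfs: LU Decomposition 126.60, Compressed Sparse Row 16.25

GeForce GTX 470
- Rodinia: Myocyte 80.58, Heartwall 3.75, Particle Filter 17.46
- JuliaGPU 29347833.20, MandelbulbGPU 14828161.27, MandelGPU 28658784.40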

Rodinia

Test: OpenCL LavaMD

Rodinia 2.4, Test: OpenCL LavaMD (Seconds, Fewer Is Better)
- GeForce GTX 680: 9.26 (SE +/- 0.14, N = 6)
- Radeon HD 7850: 7.66 (SE +/- 0.02, N = 3)
- Radeon HD 7950: 5.79 (SE +/- 0.01, N = 3)
- Radeon R9 270X: 6.17 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O2 -lOpenCL
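
To put the LavaMD times above in relative terms, the following is a minimal Python sketch, not part of the original export, that divides one card's mean runtime by another's for a fewer-is-better test. The function name `relative_speedup` and the dictionary layout are illustrative assumptions; the numbers are the mean LavaMD runtimes reported above.

```python
# Mean LavaMD runtimes in seconds, copied from the results above.
lavamd_seconds = {
    "GeForce GTX 680": 9.26,
    "Radeon HD 7850": 7.66,
    "Radeon HD 7950": 5.79,
    "Radeon R9 270X": 6.17,
}


def relative_speedup(results, baseline, candidate):
    """Return how many times faster `candidate` ran than `baseline`.

    Assumes a fewer-is-better metric (such as seconds), so the ratio is
    baseline time divided by candidate time. Hypothetical helper, not a
    Phoronix Test Suite function.
    """
    return results[baseline] / results[candidate]


if __name__ == "__main__":
    # The card with the lowest mean runtime is the fastest one.
    fastest = min(lavamd_seconds, key=lavamd_seconds.get)
    for card in lavamd_seconds:
        ratio = relative_speedup(lavamd_seconds, "GeForce GTX 680", card)
        print(f"{card}: {ratio:.2f}x relative to GeForce GTX 680")
    print("Fastest:", fastest)
```

These ratios use only the mean values and ignore the reported standard errors, so small gaps between cards should be read with the SE figures in mind.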

Rodinia

Test: OpenCL Myocyte

Rodinia 2.4, Test: OpenCL Myocyte (Seconds, Fewer Is Better)
- GeForce GTX 680: 60.78 (SE +/- 1.14, N = 3)
- GeForce GTX 460: 62.81 (SE +/- 0.07, N = 3)
- GeForce GT 610: 95.88 (SE +/- 0.08, N = 3)
- GeForce GT 240: 137.90 (SE +/- 0.12, N = 3)
- Radeon HD 6870: 493.51 (SE +/- 1.88, N = 3)
- Radeon HD 7850: 508.49 (SE +/- 0.96, N = 3)
- Radeon HD 7950: 481.62 (SE +/- 0.84, N = 3)
- Radeon R9 270X: 485.32 (SE +/- 0.67, N = 3)
- GeForce GTX 470: 80.58 (SE +/- 1.37, N = 6)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Heartwall

Rodinia 2.4, Test: OpenCL Heartwall (Seconds, Fewer Is Better)
- GeForce GTX 680: 5.42 (SE +/- 0.09, N = 6)
- GeForce GTX 550 Ti: 5.49 (SE +/- 0.13, N = 6)
- GeForce GTX 650: 20.14 (SE +/- 0.10, N = 3)
- GeForce GTX 460: 4.80 (SE +/- 0.07, N = 3)
- GeForce GT 610: 22.72 (SE +/- 0.10, N = 3)
- GeForce GT 240: 36.99 (SE +/- 0.28, N = 3)
- Radeon HD 6870: 19.67 (SE +/- 0.13, N = 3)
- Radeon HD 7850: 6.79 (SE +/- 0.05, N = 3)
- Radeon HD 7950: 5.88 (SE +/- 0.11, N = 3)
- Radeon R9 270X: 5.40 (SE +/- 0.08, N = 3)
- GeForce GTX 470: 3.75 (SE +/- 0.26, N = 6)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Leukocyte

Rodinia 2.4, Test: OpenCL Leukocyte (Seconds, Fewer Is Better)
- GeForce GTX 680: 14.02 (SE +/- 0.24, N = 4)
- GeForce GTX 550 Ti: 27.70 (SE +/- 0.48, N = 4)
- GeForce GTX 650: 30.39 (SE +/- 0.26, N = 3)
- GeForce GTX 460: 24.66 (SE +/- 0.30, N = 3)
- GeForce GT 610: 100.31 (SE +/- 1.23, N = 3)
- GeForce GT 240: 40.49 (SE +/- 0.75, N = 3)
- Radeon HD 6870: 89.87 (SE +/- 3.37, N = 6)
- Radeon HD 7850: 89.15 (SE +/- 5.05, N = 6)
- Radeon HD 7950: 83.83 (SE +/- 4.92, N = 6)
- Radeon R9 270X: 84.66 (SE +/- 6.06, N = 6)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Particle Filter

Rodinia 2.4, Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
- GeForce GTX 680: 18.19 (SE +/- 0.17, N = 3)
- GeForce GTX 550 Ti: 44.35 (SE +/- 0.24, N = 3)
- GeForce GTX 650: 66.48 (SE +/- 0.07, N = 3)
- GeForce GTX 460: 35.96 (SE +/- 0.01, N = 3)
- GeForce GT 610: 15.28 (SE +/- 0.57, N = 6)
- Radeon HD 7850: 18.11 (SE +/- 0.02, N = 3)
- Radeon HD 7950: 10.78 (SE +/- 0.00, N = 3)
- Radeon R9 270X: 13.01 (SE +/- 0.02, N = 3)
- GeForce GTX 470: 17.46 (SE +/- 0.23, N = 6)
(CXX) g++ options: -O2 -lOpenCL

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1, OpenCL Device: GPU (Samples/sec, More Is Better)
- GeForce GTX 680: 36123507.37 (SE +/- 72150.98, N = 3)
- GeForce GTX 550 Ti: 17540483.97 (SE +/- 13067.53, N = 3)
- GeForce GTX 650: 10841150.33 (SE +/- 2871.63, N = 3)
- GeForce GTX 460: 21199859.50 (SE +/- 4955.24, N = 3)
- GeForce GT 610: 4060660.07 (SE +/- 2505.78, N = 3)
- GeForce GT 240: 13543191.90 (SE +/- 17817.43, N = 3)
- GeForce GTX 470: 29347833.20 (SE +/- 120737.03, N = 3)
(CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

OpenCL Device: GPU

MandelbulbGPU 1.0pts1, OpenCL Device: GPU (Samples/sec, More Is Better)
- GeForce GTX 680: 17083392.33 (SE +/- 72849.89, N = 3)
- GeForce GTX 550 Ti: 7729296.07 (SE +/- 4259.44, N = 3)
- GeForce GTX 650: 4992671.83 (SE +/- 4281.68, N = 3)
- GeForce GTX 460: 9382926.17 (SE +/- 10640.98, N = 3)
- GeForce GT 610: 1776400.70 (SE +/- 614.61, N = 3)
- GeForce GT 240: 7095143.77 (SE +/- 3953.96, N = 3)
- GeForce GTX 470: 14828161.27 (SE +/- 58117.66, N = 3)
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1, OpenCL Device: GPU (Samples/sec, More Is Better)
- GeForce GTX 680: 47966936.67 (SE +/- 5781.86, N = 3)
- GeForce GTX 550 Ti: 17362190.07 (SE +/- 6289.55, N = 3)
- GeForce GTX 650: 12405397.70 (SE +/- 857.65, N = 3)
- GeForce GTX 460: 21375259.40 (SE +/- 2448.11, N = 3)
- GeForce GT 610: 3715320.13 (SE +/- 230.98, N = 3)
- GeForce GT 240: 5291547.00 (SE +/- 217.94, N = 3)
- GeForce GTX 470: 28658784.40 (SE +/- 4541.93, N = 3)
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

OpenCL Device: GPU - Scene: Room

LuxMark 2.1beta1, OpenCL Device: GPU - Scene: Room (Score, More Is Better)
- GeForce GTX 680: 296 (SE +/- 0.33, N = 3)
- GeForce GTX 550 Ti: 150 (SE +/- 0.33, N = 3)
- GeForce GTX 650: 82 (SE +/- 0.00, N = 3)
- GeForce GT 610: 27 (SE +/- 0.33, N = 3)
- Radeon HD 6870: 107 (SE +/- 0.00, N = 3)
- Radeon HD 7850: 624 (SE +/- 0.00, N = 3)
- Radeon HD 7950: 1002 (SE +/- 0.58, N = 3)
- Radeon R9 270X: 867 (SE +/- 0.67, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Sala

LuxMark 2.1beta1, OpenCL Device: GPU - Scene: Sala (Score, More Is Better)
- GeForce GTX 680: 651 (SE +/- 0.00, N = 3)
- GeForce GTX 550 Ti: 319 (SE +/- 0.00, N = 3)
- GeForce GTX 650: 177 (SE +/- 0.00, N = 3)
- GeForce GTX 460: 382 (SE +/- 0.00, N = 3)
- GeForce GT 610: 42 (SE +/- 0.00, N = 3)
- Radeon HD 6870: 631 (SE +/- 0.00, N = 3)
- Radeon HD 7850: 1134 (SE +/- 0.33, N = 3)
- Radeon HD 7950: 1896 (SE +/- 0.67, N = 3)
- Radeon R9 270X: 1588 (SE +/- 0.00, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 2.1beta1, OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)
- GeForce GTX 680: 4216 (SE +/- 0.67, N = 3)
- GeForce GTX 550 Ti: 2162 (SE +/- 0.00, N = 3)
- GeForce GTX 650: 1183 (SE +/- 0.00, N = 3)
- GeForce GTX 460: 2591 (SE +/- 0.58, N = 3)
- GeForce GT 610: 315 (SE +/- 0.00, N = 3)
- GeForce GT 240: 1051 (SE +/- 0.00, N = 3)
- Radeon HD 6870: 6149 (SE +/- 0.88, N = 3)
- Radeon HD 7850: 8361 (SE +/- 1.45, N = 3)
- Radeon HD 7950: 13087 (SE +/- 4.84, N = 3)
- Radeon R9 270X: 11031 (SE +/- 3.61, N = 3)

OpenDwarfs

Test: LU Decomposition

OpenDwarfs 2013-11-06, Test: LU Decomposition (ms, Fewer Is Better)
- GeForce GTX 680: 58.94 (SE +/- 0.88, N = 5)
- GeForce GTX 550 Ti: 110.81 (SE +/- 0.14, N = 3)
- GeForce GTX 650: 152.70 (SE +/- 0.21, N = 3)
- GeForce GTX 460: 97.69 (SE +/- 0.03, N = 3)
- GeForce GT 610: 458.26 (SE +/- 0.06, N = 3)
- GeForce GT 240: 220.27 (SE +/- 1.06, N = 3)
- Radeon HD 6870: 161.42 (SE +/- 4.12, N = 6)
- Radeon HD 7850: 135.79 (SE +/- 5.24, N = 6)
- Radeon HD 7950: 123.96 (SE +/- 4.65, N = 6)
- Radeon R9 270X: 126.60 (SE +/- 4.83, N = 6)
(CC) gcc options: -lm -lOpenCL

OpenDwarfs

Test: Compressed Sparse Row

OpenDwarfs 2013-11-06, Test: Compressed Sparse Row (ms, Fewer Is Better)
- GeForce GTX 680: 6.10 (SE +/- 0.58, N = 6)
- GeForce GTX 550 Ti: 6.55 (SE +/- 0.80, N = 6)
- GeForce GTX 650: 6.76 (SE +/- 0.57, N = 6)
- GeForce GTX 460: 6.40 (SE +/- 0.76, N = 6)
- GeForce GT 610: 9.00 (SE +/- 0.56, N = 6)
- GeForce GT 240: 6.88 (SE +/- 0.11, N = 3)
- Radeon HD 6870: 24.50 (SE +/- 0.81, N = 6)
- Radeon HD 7850: 18.09 (SE +/- 0.50, N = 6)
- Radeon HD 7950: 17.83 (SE +/- 0.86, N = 6)
- Radeon R9 270X: 16.25 (SE +/- 0.89, N = 6)
(CC) gcc options: -lm -lOpenCL


Phoronix Test Suite v10.8.4