NVIDIA AMD OpenCL Linux GPGPU Tests

Benchmarks by Michael Larabel for a future article on Phoronix, covering a range of OpenBenchmarking.org OpenCL benchmarks on AMD Radeon and NVIDIA GeForce graphics cards with the proprietary drivers under Ubuntu Linux.

HTML result view exported from: https://openbenchmarking.org/result/1406282-PL-1311092SO17&gru&sor&rro.

System Configuration

Processor: AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)
Motherboard: ASUS Crosshair V Formula
Chipset: AMD RD890 bridge
Memory: 8192MB
Disk: 64GB OCZ AGILITY
Audio: Realtek ALC889
Network: Intel 82583V Gigabit Connection
OS: Ubuntu 13.10
Kernel: 3.11.0-12-generic (x86_64)
Desktop: Unity 7.1.2
Display Server: X Server 1.14.3
Compiler: GCC 4.8
File-System: ext4
Screen Resolution: 2560x1600

Graphics
- GeForce GT 240: ECS NVIDIA GeForce GT 240 512MB (550/1700MHz)
- GeForce GTX 460: ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)
- GeForce GTX 550 Ti: eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)
- GeForce GT 610: NVIDIA GeForce GT 610 1024MB (810/533MHz)
- GeForce GTX 650: MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)
- GeForce GTX 680: NVIDIA GeForce GTX 680 2048MB (705/3004MHz)
- Radeon HD 6870: Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)
- Radeon HD 7850: ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)
- Radeon HD 7950: XFX AMD Radeon HD 7900 3072MB (900/1375MHz)
- Radeon R9 270X: Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)

Monitor: SyncMaster + SyncMaster (first listed for Radeon HD 6870)
Display Driver: NVIDIA 331.20 (from GeForce GT 240); fglrx 13.25.5 (from Radeon HD 6870)
OpenGL: 3.3.0 (GeForce GT 240); 4.3.0 (from GeForce GTX 460); 4.3.12614 (from Radeon HD 6870)

Compiler Details
- --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
- Scaling Governor: acpi-cpufreq ondemand

OpenCL Details
- GeForce GT 240: GPU Compute Cores: 96
- GeForce GTX 460: GPU Compute Cores: 336
- GeForce GTX 550 Ti: GPU Compute Cores: 192
- GeForce GT 610: GPU Compute Cores: 48
- GeForce GTX 650: GPU Compute Cores: 384
- GeForce GTX 680: GPU Compute Cores: 1536

Environment Details
- Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri

Result Overview

A dash marks a test with no result for that card.

NVIDIA GeForce cards

Test                                GT 240        GTX 460       GTX 550 Ti    GT 610        GTX 650       GTX 680
juliagpu: GPU                       13543191.90   21199859.50   17540483.97   4060660.07    10841150.33   36123507.37
mandelbulbgpu: GPU                  7095143.77    9382926.17    7729296.07    1776400.70    4992671.83    17083392.33
mandelgpu: GPU                      5291547.00    21375259.40   17362190.07   3715320.13    12405397.70   47966936.67
luxmark: GPU - Room                 -             -             150           27            82            296
luxmark: GPU - Sala                 -             382           319           42            177           651
luxmark: GPU - Luxball HDR          1051          2591          2162          315           1183          4216
opendwarfs: LU Decomposition        220.27        97.69         110.81        458.26        152.70        58.94
opendwarfs: Compressed Sparse Row   6.88          6.40          6.55          9.00          6.76          6.10
rodinia: OpenCL LavaMD              -             -             -             -             -             9.26
rodinia: OpenCL Myocyte             137.90        62.81         -             95.88         -             60.78
rodinia: OpenCL Heartwall           36.99         4.80          5.49          22.72         20.14         5.42
rodinia: OpenCL Leukocyte           40.49         24.66         27.70         100.31        30.39         14.02
rodinia: OpenCL Particle Filter     -             35.96         44.35         15.28         66.48         18.19

AMD Radeon cards

Test                                HD 6870       HD 7850       HD 7950       R9 270X
juliagpu: GPU                       -             -             -             -
mandelbulbgpu: GPU                  -             -             -             -
mandelgpu: GPU                      -             -             -             -
luxmark: GPU - Room                 107           624           1002          867
luxmark: GPU - Sala                 631           1134          1896          1588
luxmark: GPU - Luxball HDR          6149          8361          13087         11031
opendwarfs: LU Decomposition        161.42        135.79        123.96        126.60
opendwarfs: Compressed Sparse Row   24.50         18.09         17.83         16.25
rodinia: OpenCL LavaMD              -             7.66          5.79          6.17
rodinia: OpenCL Myocyte             493.51        508.49        481.62        485.32
rodinia: OpenCL Heartwall           19.67         6.79          5.88          5.40
rodinia: OpenCL Leukocyte           89.87         89.15         83.83         84.66
rodinia: OpenCL Particle Filter     -             18.11         10.78         13.01
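To compare cards in the overview, it can help to express each result as a multiple of a baseline card. The following is a minimal Python sketch, not part of the original result file: the helper name `relative_to` and its `higher_is_better` parameter are hypothetical, and the figures are the JuliaGPU samples/sec values reported in this document.

```python
# JuliaGPU results in samples/sec as reported in this result file
# (more is better).
juliagpu = {
    "GeForce GT 610": 4060660.07,
    "GeForce GTX 650": 10841150.33,
    "GeForce GT 240": 13543191.90,
    "GeForce GTX 550 Ti": 17540483.97,
    "GeForce GTX 460": 21199859.50,
    "GeForce GTX 680": 36123507.37,
}


def relative_to(results, baseline, higher_is_better=True):
    """Return each result as a multiple of the baseline card's performance.

    Hypothetical helper: for "Fewer Is Better" tests (ms, seconds) the
    ratio is inverted so that a larger factor always means a faster card.
    """
    base = results[baseline]
    if higher_is_better:
        return {card: value / base for card, value in results.items()}
    return {card: base / value for card, value in results.items()}


speedups = relative_to(juliagpu, "GeForce GT 610")
for card, factor in sorted(speedups.items(), key=lambda kv: kv[1]):
    print(f"{card}: {factor:.2f}x")
```

For the tests measured in ms or seconds, pass `higher_is_better=False` so the factors stay comparable across tests.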

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1 (Samples/sec, More Is Better)
GeForce GT 610: 4060660.07 (SE +/- 2505.78, N = 3)
GeForce GTX 650: 10841150.33 (SE +/- 2871.63, N = 3)
GeForce GT 240: 13543191.90 (SE +/- 17817.43, N = 3)
GeForce GTX 550 Ti: 17540483.97 (SE +/- 13067.53, N = 3)
GeForce GTX 460: 21199859.50 (SE +/- 4955.24, N = 3)
GeForce GTX 680: 36123507.37 (SE +/- 72150.98, N = 3)
1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
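Each result in this file is given with "SE +/-" and "N", read here as the standard error of the mean over N runs. The sketch below shows that calculation in Python under two assumptions: that the sample (n - 1) standard deviation is used, which this file does not confirm for the Phoronix Test Suite, and that the run values are made-up placeholders, since the per-run samples are not part of this export.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumption: the sample (n - 1) standard deviation is used; the
    result file does not state which estimator produced its SE values.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical run results, NOT taken from this result file.
runs = [100.0, 102.0, 104.0]
mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A small SE relative to the mean suggests the runs were consistent; tests listed with N greater than 3 were run additional times.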

MandelbulbGPU

OpenCL Device: GPU

MandelbulbGPU 1.0pts1 (Samples/sec, More Is Better)
GeForce GT 610: 1776400.70 (SE +/- 614.61, N = 3)
GeForce GTX 650: 4992671.83 (SE +/- 4281.68, N = 3)
GeForce GT 240: 7095143.77 (SE +/- 3953.96, N = 3)
GeForce GTX 550 Ti: 7729296.07 (SE +/- 4259.44, N = 3)
GeForce GTX 460: 9382926.17 (SE +/- 10640.98, N = 3)
GeForce GTX 680: 17083392.33 (SE +/- 72849.89, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1 (Samples/sec, More Is Better)
GeForce GT 610: 3715320.13 (SE +/- 230.98, N = 3)
GeForce GT 240: 5291547.00 (SE +/- 217.94, N = 3)
GeForce GTX 650: 12405397.70 (SE +/- 857.65, N = 3)
GeForce GTX 550 Ti: 17362190.07 (SE +/- 6289.55, N = 3)
GeForce GTX 460: 21375259.40 (SE +/- 2448.11, N = 3)
GeForce GTX 680: 47966936.67 (SE +/- 5781.86, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

OpenCL Device: GPU - Scene: Room

LuxMark 2.1beta1 (Score, More Is Better)
GeForce GT 610: 27 (SE +/- 0.33, N = 3)
GeForce GTX 650: 82 (SE +/- 0.00, N = 3)
Radeon HD 6870: 107 (SE +/- 0.00, N = 3)
GeForce GTX 550 Ti: 150 (SE +/- 0.33, N = 3)
GeForce GTX 680: 296 (SE +/- 0.33, N = 3)
Radeon HD 7850: 624 (SE +/- 0.00, N = 3)
Radeon R9 270X: 867 (SE +/- 0.67, N = 3)
Radeon HD 7950: 1002 (SE +/- 0.58, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Sala

LuxMark 2.1beta1 (Score, More Is Better)
GeForce GT 610: 42 (SE +/- 0.00, N = 3)
GeForce GTX 650: 177 (SE +/- 0.00, N = 3)
GeForce GTX 550 Ti: 319 (SE +/- 0.00, N = 3)
GeForce GTX 460: 382 (SE +/- 0.00, N = 3)
Radeon HD 6870: 631 (SE +/- 0.00, N = 3)
GeForce GTX 680: 651 (SE +/- 0.00, N = 3)
Radeon HD 7850: 1134 (SE +/- 0.33, N = 3)
Radeon R9 270X: 1588 (SE +/- 0.00, N = 3)
Radeon HD 7950: 1896 (SE +/- 0.67, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 2.1beta1 (Score, More Is Better)
GeForce GT 610: 315 (SE +/- 0.00, N = 3)
GeForce GT 240: 1051 (SE +/- 0.00, N = 3)
GeForce GTX 650: 1183 (SE +/- 0.00, N = 3)
GeForce GTX 550 Ti: 2162 (SE +/- 0.00, N = 3)
GeForce GTX 460: 2591 (SE +/- 0.58, N = 3)
GeForce GTX 680: 4216 (SE +/- 0.67, N = 3)
Radeon HD 6870: 6149 (SE +/- 0.88, N = 3)
Radeon HD 7850: 8361 (SE +/- 1.45, N = 3)
Radeon R9 270X: 11031 (SE +/- 3.61, N = 3)
Radeon HD 7950: 13087 (SE +/- 4.84, N = 3)

OpenDwarfs

Test: LU Decomposition

OpenDwarfs 2013-11-06 (ms, Fewer Is Better)
GeForce GT 610: 458.26 (SE +/- 0.06, N = 3)
GeForce GT 240: 220.27 (SE +/- 1.06, N = 3)
Radeon HD 6870: 161.42 (SE +/- 4.12, N = 6)
GeForce GTX 650: 152.70 (SE +/- 0.21, N = 3)
Radeon HD 7850: 135.79 (SE +/- 5.24, N = 6)
Radeon R9 270X: 126.60 (SE +/- 4.83, N = 6)
Radeon HD 7950: 123.96 (SE +/- 4.65, N = 6)
GeForce GTX 550 Ti: 110.81 (SE +/- 0.14, N = 3)
GeForce GTX 460: 97.69 (SE +/- 0.03, N = 3)
GeForce GTX 680: 58.94 (SE +/- 0.88, N = 5)
1. (CC) gcc options: -lm -lOpenCL

OpenDwarfs

Test: Compressed Sparse Row

OpenDwarfs 2013-11-06 (ms, Fewer Is Better)
Radeon HD 6870: 24.50 (SE +/- 0.81, N = 6)
Radeon HD 7850: 18.09 (SE +/- 0.50, N = 6)
Radeon HD 7950: 17.83 (SE +/- 0.86, N = 6)
Radeon R9 270X: 16.25 (SE +/- 0.89, N = 6)
GeForce GT 610: 9.00 (SE +/- 0.56, N = 6)
GeForce GT 240: 6.88 (SE +/- 0.11, N = 3)
GeForce GTX 650: 6.76 (SE +/- 0.57, N = 6)
GeForce GTX 550 Ti: 6.55 (SE +/- 0.80, N = 6)
GeForce GTX 460: 6.40 (SE +/- 0.76, N = 6)
GeForce GTX 680: 6.10 (SE +/- 0.58, N = 6)
1. (CC) gcc options: -lm -lOpenCL

Rodinia

Test: OpenCL LavaMD

Rodinia 2.4 (Seconds, Fewer Is Better)
GeForce GTX 680: 9.26 (SE +/- 0.14, N = 6)
Radeon HD 7850: 7.66 (SE +/- 0.02, N = 3)
Radeon R9 270X: 6.17 (SE +/- 0.01, N = 3)
Radeon HD 7950: 5.79 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Myocyte

Rodinia 2.4 (Seconds, Fewer Is Better)
Radeon HD 7850: 508.49 (SE +/- 0.96, N = 3)
Radeon HD 6870: 493.51 (SE +/- 1.88, N = 3)
Radeon R9 270X: 485.32 (SE +/- 0.67, N = 3)
Radeon HD 7950: 481.62 (SE +/- 0.84, N = 3)
GeForce GT 240: 137.90 (SE +/- 0.12, N = 3)
GeForce GT 610: 95.88 (SE +/- 0.08, N = 3)
GeForce GTX 460: 62.81 (SE +/- 0.07, N = 3)
GeForce GTX 680: 60.78 (SE +/- 1.14, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Heartwall

Rodinia 2.4 (Seconds, Fewer Is Better)
GeForce GT 240: 36.99 (SE +/- 0.28, N = 3)
GeForce GT 610: 22.72 (SE +/- 0.10, N = 3)
GeForce GTX 650: 20.14 (SE +/- 0.10, N = 3)
Radeon HD 6870: 19.67 (SE +/- 0.13, N = 3)
Radeon HD 7850: 6.79 (SE +/- 0.05, N = 3)
Radeon HD 7950: 5.88 (SE +/- 0.11, N = 3)
GeForce GTX 550 Ti: 5.49 (SE +/- 0.13, N = 6)
GeForce GTX 680: 5.42 (SE +/- 0.09, N = 6)
Radeon R9 270X: 5.40 (SE +/- 0.08, N = 3)
GeForce GTX 460: 4.80 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Leukocyte

Rodinia 2.4 (Seconds, Fewer Is Better)
GeForce GT 610: 100.31 (SE +/- 1.23, N = 3)
Radeon HD 6870: 89.87 (SE +/- 3.37, N = 6)
Radeon HD 7850: 89.15 (SE +/- 5.05, N = 6)
Radeon R9 270X: 84.66 (SE +/- 6.06, N = 6)
Radeon HD 7950: 83.83 (SE +/- 4.92, N = 6)
GeForce GT 240: 40.49 (SE +/- 0.75, N = 3)
GeForce GTX 650: 30.39 (SE +/- 0.26, N = 3)
GeForce GTX 550 Ti: 27.70 (SE +/- 0.48, N = 4)
GeForce GTX 460: 24.66 (SE +/- 0.30, N = 3)
GeForce GTX 680: 14.02 (SE +/- 0.24, N = 4)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Particle Filter

Rodinia 2.4 (Seconds, Fewer Is Better)
GeForce GTX 650: 66.48 (SE +/- 0.07, N = 3)
GeForce GTX 550 Ti: 44.35 (SE +/- 0.24, N = 3)
GeForce GTX 460: 35.96 (SE +/- 0.01, N = 3)
GeForce GTX 680: 18.19 (SE +/- 0.17, N = 3)
Radeon HD 7850: 18.11 (SE +/- 0.02, N = 3)
GeForce GT 610: 15.28 (SE +/- 0.57, N = 6)
Radeon R9 270X: 13.01 (SE +/- 0.02, N = 3)
Radeon HD 7950: 10.78 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL


Phoronix Test Suite v10.8.5