NVIDIA AMD OpenCL Linux GPGPU Tests

Benchmarks by Michael Larabel for a future article on Phoronix. Just looking at a range of OpenBenchmarking.org OpenCL benchmarks on a range of AMD Radeon and NVIDIA GeForce graphics cards with the proprietary drivers under Ubuntu Linux.

HTML result view exported from: https://openbenchmarking.org/result/1406282-PL-1311092SO17&gru&sor.

NVIDIA AMD OpenCL Linux GPGPU TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGeForce GT 240GeForce GTX 460GeForce GTX 550 TiGeForce GT 610GeForce GTX 650GeForce GTX 680Radeon HD 6870Radeon HD 7850Radeon HD 7950Radeon R9 270XAMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)ASUS Crosshair V FormulaAMD RD890 bridge8192MB64GB OCZ AGILITYECS NVIDIA GeForce GT 240 512MB (550/1700MHz)Realtek ALC889Intel 82583V Gigabit ConnectionUbuntu 13.103.11.0-12-generic (x86_64)Unity 7.1.2X Server 1.14.3NVIDIA 331.203.3.0GCC 4.8ext42560x1600ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)4.3.0eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)NVIDIA GeForce GT 610 1024MB (810/533MHz)MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)NVIDIA GeForce GTX 680 2048MB (705/3004MHz)Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)SyncMaster + SyncMasterfglrx 13.25.54.3.12614ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)XFX AMD Radeon HD 7900 3072MB (900/1375MHz)Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- Scaling Governor: acpi-cpufreq ondemandOpenCL Details- GeForce GT 240: GPU Compute Cores: 96- GeForce GTX 460: GPU Compute Cores: 336- GeForce GTX 550 Ti: GPU Compute Cores: 192- GeForce GT 610: GPU Compute Cores: 48- GeForce GTX 650: GPU Compute Cores: 384- GeForce GTX 680: GPU Compute Cores: 1536System Details- GeForce GT 240: GPU Compute Cores: 96.- GeForce GTX 460: GPU Compute Cores: 336.- GeForce GTX 550 Ti: GPU Compute Cores: 192.- GeForce GT 610: GPU Compute Cores: 48.- GeForce GTX 650: GPU Compute Cores: 384.- GeForce GTX 680: GPU Compute Cores: 1536.Environment Details- Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri

NVIDIA AMD OpenCL Linux GPGPU Testsjuliagpu: GPUmandelbulbgpu: GPUmandelgpu: GPUluxmark: GPU - Roomluxmark: GPU - Salaluxmark: GPU - Luxball HDRopendwarfs: LU Decompositionopendwarfs: Compressed Sparse Rowrodinia: OpenCL LavaMDrodinia: OpenCL Myocyterodinia: OpenCL Heartwallrodinia: OpenCL Leukocyterodinia: OpenCL Particle FilterGeForce GT 240GeForce GTX 460GeForce GTX 550 TiGeForce GT 610GeForce GTX 650GeForce GTX 680Radeon HD 6870Radeon HD 7850Radeon HD 7950Radeon R9 270X13543191.907095143.775291547.001051220.276.88137.9036.9940.4921199859.509382926.1721375259.40382259197.696.4062.814.8024.6635.9617540483.977729296.0717362190.071503192162110.816.555.4927.7044.354060660.071776400.703715320.132742315458.269.0095.8822.72100.3115.2810841150.334992671.8312405397.70821771183152.706.7620.1430.3966.4836123507.3717083392.3347966936.67296651421658.946.109.2660.785.4214.0218.191076316149161.4224.50493.5119.6789.8762411348361135.7918.097.66508.496.7989.1518.111002189613087123.9617.835.79481.625.8883.8310.78867158811031126.6016.256.17485.325.4084.6613.01OpenBenchmarking.org

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 460GeForce GTX 550 TiGeForce GT 240GeForce GTX 650GeForce GT 6108M16M24M32M40MSE +/- 72150.98, N = 3SE +/- 4955.24, N = 3SE +/- 13067.53, N = 3SE +/- 17817.43, N = 3SE +/- 2871.63, N = 3SE +/- 2505.78, N = 336123507.3721199859.5017540483.9713543191.9010841150.334060660.071. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 460GeForce GTX 550 TiGeForce GT 240GeForce GTX 650GeForce GT 6104M8M12M16M20MSE +/- 72849.89, N = 3SE +/- 10640.98, N = 3SE +/- 4259.44, N = 3SE +/- 3953.96, N = 3SE +/- 4281.68, N = 3SE +/- 614.61, N = 317083392.339382926.177729296.077095143.774992671.831776400.701. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 460GeForce GTX 550 TiGeForce GTX 650GeForce GT 240GeForce GT 61010M20M30M40M50MSE +/- 5781.86, N = 3SE +/- 2448.11, N = 3SE +/- 6289.55, N = 3SE +/- 857.65, N = 3SE +/- 217.94, N = 3SE +/- 230.98, N = 347966936.6721375259.4017362190.0712405397.705291547.003715320.131. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

OpenCL Device: GPU - Scene: Room

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: RoomRadeon HD 7950Radeon R9 270XRadeon HD 7850GeForce GTX 680GeForce GTX 550 TiRadeon HD 6870GeForce GTX 650GeForce GT 6102004006008001000SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 310028676242961501078227

LuxMark

OpenCL Device: GPU - Scene: Sala

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: SalaRadeon HD 7950Radeon R9 270XRadeon HD 7850GeForce GTX 680Radeon HD 6870GeForce GTX 460GeForce GTX 550 TiGeForce GTX 650GeForce GT 610400800120016002000SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 318961588113465163138231917742

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: Luxball HDRRadeon HD 7950Radeon R9 270XRadeon HD 7850Radeon HD 6870GeForce GTX 680GeForce GTX 460GeForce GTX 550 TiGeForce GTX 650GeForce GT 240GeForce GT 6103K6K9K12K15KSE +/- 4.84, N = 3SE +/- 3.61, N = 3SE +/- 1.45, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 313087110318361614942162591216211831051315

OpenDwarfs

Test: LU Decomposition

OpenBenchmarking.orgms, Fewer Is BetterOpenDwarfs 2013-11-06Test: LU DecompositionGeForce GTX 680GeForce GTX 460GeForce GTX 550 TiRadeon HD 7950Radeon R9 270XRadeon HD 7850GeForce GTX 650Radeon HD 6870GeForce GT 240GeForce GT 610100200300400500SE +/- 0.88, N = 5SE +/- 0.03, N = 3SE +/- 0.14, N = 3SE +/- 4.65, N = 6SE +/- 4.83, N = 6SE +/- 5.24, N = 6SE +/- 0.21, N = 3SE +/- 4.12, N = 6SE +/- 1.06, N = 3SE +/- 0.06, N = 358.9497.69110.81123.96126.60135.79152.70161.42220.27458.261. (CC) gcc options: -lm -lOpenCL

OpenDwarfs

Test: Compressed Sparse Row

OpenBenchmarking.orgms, Fewer Is BetterOpenDwarfs 2013-11-06Test: Compressed Sparse RowGeForce GTX 680GeForce GTX 460GeForce GTX 550 TiGeForce GTX 650GeForce GT 240GeForce GT 610Radeon R9 270XRadeon HD 7950Radeon HD 7850Radeon HD 6870612182430SE +/- 0.58, N = 6SE +/- 0.76, N = 6SE +/- 0.80, N = 6SE +/- 0.57, N = 6SE +/- 0.11, N = 3SE +/- 0.56, N = 6SE +/- 0.89, N = 6SE +/- 0.86, N = 6SE +/- 0.50, N = 6SE +/- 0.81, N = 66.106.406.556.766.889.0016.2517.8318.0924.501. (CC) gcc options: -lm -lOpenCL

Rodinia

Test: OpenCL LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL LavaMDRadeon HD 7950Radeon R9 270XRadeon HD 7850GeForce GTX 6803691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 65.796.177.669.261. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteGeForce GTX 680GeForce GTX 460GeForce GT 610GeForce GT 240Radeon HD 7950Radeon R9 270XRadeon HD 6870Radeon HD 7850110220330440550SE +/- 1.14, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.84, N = 3SE +/- 0.67, N = 3SE +/- 1.88, N = 3SE +/- 0.96, N = 360.7862.8195.88137.90481.62485.32493.51508.491. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Heartwall

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL HeartwallGeForce GTX 460Radeon R9 270XGeForce GTX 680GeForce GTX 550 TiRadeon HD 7950Radeon HD 7850Radeon HD 6870GeForce GTX 650GeForce GT 610GeForce GT 240918273645SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 6SE +/- 0.13, N = 6SE +/- 0.11, N = 3SE +/- 0.05, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.28, N = 34.805.405.425.495.886.7919.6720.1422.7236.991. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL LeukocyteGeForce GTX 680GeForce GTX 460GeForce GTX 550 TiGeForce GTX 650GeForce GT 240Radeon HD 7950Radeon R9 270XRadeon HD 7850Radeon HD 6870GeForce GT 61020406080100SE +/- 0.24, N = 4SE +/- 0.30, N = 3SE +/- 0.48, N = 4SE +/- 0.26, N = 3SE +/- 0.75, N = 3SE +/- 4.92, N = 6SE +/- 6.06, N = 6SE +/- 5.05, N = 6SE +/- 3.37, N = 6SE +/- 1.23, N = 314.0224.6627.7030.3940.4983.8384.6689.1589.87100.311. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterRadeon HD 7950Radeon R9 270XGeForce GT 610Radeon HD 7850GeForce GTX 680GeForce GTX 460GeForce GTX 550 TiGeForce GTX 6501530456075SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.57, N = 6SE +/- 0.02, N = 3SE +/- 0.17, N = 3SE +/- 0.01, N = 3SE +/- 0.24, N = 3SE +/- 0.07, N = 310.7813.0115.2818.1118.1935.9644.3566.481. (CXX) g++ options: -O2 -lOpenCL


Phoronix Test Suite v10.8.5