NVIDIA AMD OpenCL Linux GPGPU Tests

Benchmarks by Michael Larabel for a future article on Phoronix. This run covers a range of OpenBenchmarking.org OpenCL benchmarks on a selection of AMD Radeon and NVIDIA GeForce graphics cards, using the proprietary drivers under Ubuntu Linux.

HTML result view exported from: https://openbenchmarking.org/result/1311130-SO-1311092SO95&sro.

NVIDIA AMD OpenCL Linux GPGPU Tests: system table

Each configuration lists only the fields that the exported table states for it.

GeForce GT 240
  Processor: AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)
  Motherboard: ASUS Crosshair V Formula
  Chipset: AMD RD890 bridge
  Memory: 8192MB
  Disk: 64GB OCZ AGILITY
  Graphics: ECS NVIDIA GeForce GT 240 512MB (550/1700MHz)
  Audio: Realtek ALC889
  Network: Intel 82583V Gigabit Connection
  OS: Ubuntu 13.10
  Kernel: 3.11.0-12-generic (x86_64)
  Desktop: Unity 7.1.2
  Display Server: X Server 1.14.3
  Display Driver: NVIDIA 331.20
  OpenGL: 3.3.0
  Compiler: GCC 4.8
  File-System: ext4
  Screen Resolution: 2560x1600

GeForce GTX 460
  Graphics: ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)
  OpenGL: 4.3.0

GeForce GTX 550 Ti
  Graphics: eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)

GeForce GT 610
  Graphics: NVIDIA GeForce GT 610 1024MB (810/533MHz)

GeForce GTX 650
  Graphics: MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)

GeForce GTX 680
  Graphics: NVIDIA GeForce GTX 680 2048MB (705/3004MHz)

Radeon HD 6870
  Graphics: Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)
  Monitor: SyncMaster + SyncMaster
  Display Driver: fglrx 13.25.5
  OpenGL: 4.3.12614

Radeon HD 7850
  Graphics: ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)

Radeon HD 7950
  Graphics: XFX AMD Radeon HD 7900 3072MB (900/1375MHz)

Radeon R9 270X
  Graphics: Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)

GPUTest
  Processor: Intel Core 2 Quad @ 2.40GHz (4 Cores)
  Motherboard: Dell 0CT017
  Chipset: Intel 82P965/G965 + ICH8DH
  Disk: 320GB Western Digital WD3200AAKS-0 + 2000GB Western Digital WD20EARX-008
  Graphics: Gigabyte NVIDIA GeForce GTX 650 Ti 2048MB (1032/2700MHz)
  Audio: Creative Labs SB X-Fi
  Network: Intel 82566DC Gigabit Connection
  Kernel: 3.11.0-13-generic (x86_64)
  Display Driver: NVIDIA 304.88
  OpenGL: 4.2.0 NVIDIA 304.88
  Compiler: GCC 4.8 + CUDA 5.0
  Screen Resolution: 1920x1080

Compiler Details
- --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
- Scaling Governor: acpi-cpufreq ondemand

OpenCL Details
- GeForce GT 240: GPU Compute Cores: 96
- GeForce GTX 460: GPU Compute Cores: 336
- GeForce GTX 550 Ti: GPU Compute Cores: 192
- GeForce GT 610: GPU Compute Cores: 48
- GeForce GTX 650: GPU Compute Cores: 384
- GeForce GTX 680: GPU Compute Cores: 1536
- GPUTest: GPU Compute Cores: 768

Environment Details
- Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri

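NVIDIA AMD OpenCL Linux GPGPU Tests: results overview

Tests without a result for a configuration are omitted from its entry.

GeForce GT 240
  rodinia: OpenCL Myocyte 137.90; OpenCL Heartwall 36.99; OpenCL Leukocyte 40.49
  juliagpu: GPU 13543191.90
  mandelbulbgpu: GPU 7095143.77
  mandelgpu: GPU 5291547.00
  luxmark: GPU - Luxball HDR 1051
  opendwarfs: LU Decomposition 220.27; Compressed Sparse Row 6.88

GeForce GTX 460
  rodinia: OpenCL Myocyte 62.81; OpenCL Heartwall 4.80; OpenCL Leukocyte 24.66; OpenCL Particle Filter 35.96
  juliagpu: GPU 21199859.50
  mandelbulbgpu: GPU 9382926.17
  mandelgpu: GPU 21375259.40
  luxmark: GPU - Sala 382; GPU - Luxball HDR 2591
  opendwarfs: LU Decomposition 97.69; Compressed Sparse Row 6.40

GeForce GTX 550 Ti
  rodinia: OpenCL Heartwall 5.49; OpenCL Leukocyte 27.70; OpenCL Particle Filter 44.35
  juliagpu: GPU 17540483.97
  mandelbulbgpu: GPU 7729296.07
  mandelgpu: GPU 17362190.07
  luxmark: GPU - Room 150; GPU - Sala 319; GPU - Luxball HDR 2162
  opendwarfs: LU Decomposition 110.81; Compressed Sparse Row 6.55

GeForce GT 610
  rodinia: OpenCL Myocyte 95.88; OpenCL Heartwall 22.72; OpenCL Leukocyte 100.31; OpenCL Particle Filter 15.28
  juliagpu: GPU 4060660.07
  mandelbulbgpu: GPU 1776400.70
  mandelgpu: GPU 3715320.13
  luxmark: GPU - Room 27; GPU - Sala 42; GPU - Luxball HDR 315
  opendwarfs: LU Decomposition 458.26; Compressed Sparse Row 9.00

GeForce GTX 650
  rodinia: OpenCL Heartwall 20.14; OpenCL Leukocyte 30.39; OpenCL Particle Filter 66.48
  juliagpu: GPU 10841150.33
  mandelbulbgpu: GPU 4992671.83
  mandelgpu: GPU 12405397.70
  luxmark: GPU - Room 82; GPU - Sala 177; GPU - Luxball HDR 1183
  opendwarfs: LU Decomposition 152.70; Compressed Sparse Row 6.76

GeForce GTX 680
  rodinia: OpenCL LavaMD 9.26; OpenCL Myocyte 60.78; OpenCL Heartwall 5.42; OpenCL Leukocyte 14.02; OpenCL Particle Filter 18.19
  juliagpu: GPU 36123507.37
  mandelbulbgpu: GPU 17083392.33
  mandelgpu: GPU 47966936.67
  luxmark: GPU - Room 296; GPU - Sala 651; GPU - Luxball HDR 4216
  opendwarfs: LU Decomposition 58.94; Compressed Sparse Row 6.10

Radeon HD 6870
  rodinia: OpenCL Myocyte 493.51; OpenCL Heartwall 19.67; OpenCL Leukocyte 89.87
  luxmark: GPU - Room 107; GPU - Sala 631; GPU - Luxball HDR 6149
  opendwarfs: LU Decomposition 161.42; Compressed Sparse Row 24.50

Radeon HD 7850
  rodinia: OpenCL LavaMD 7.66; OpenCL Myocyte 508.49; OpenCL Heartwall 6.79; OpenCL Leukocyte 89.15; OpenCL Particle Filter 18.11
  luxmark: GPU - Room 624; GPU - Sala 1134; GPU - Luxball HDR 8361
  opendwarfs: LU Decomposition 135.79; Compressed Sparse Row 18.09

Radeon HD 7950
  rodinia: OpenCL LavaMD 5.79; OpenCL Myocyte 481.62; OpenCL Heartwall 5.88; OpenCL Leukocyte 83.83; OpenCL Particle Filter 10.78
  luxmark: GPU - Room 1002; GPU - Sala 1896; GPU - Luxball HDR 13087
  opendwarfs: LU Decomposition 123.96; Compressed Sparse Row 17.83

Radeon R9 270X
  rodinia: OpenCL LavaMD 6.17; OpenCL Myocyte 485.32; OpenCL Heartwall 5.40; OpenCL Leukocyte 84.66; OpenCL Particle Filter 13.01
  luxmark: GPU - Room 867; GPU - Sala 1588; GPU - Luxball HDR 11031
  opendwarfs: LU Decomposition 126.60; Compressed Sparse Row 16.25

GPUTest
  rodinia: OpenCL Myocyte 124.44; OpenCL Heartwall 7.26; OpenCL Leukocyte 28.92; OpenCL Particle Filter 26.27
  juliagpu: GPU 16429443.43
  mandelbulbgpu: GPU 7993787.50
  mandelgpu: GPU 22070355.47
  luxmark: GPU - Room 133; GPU - Sala 278; GPU - Luxball HDR 1807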

Rodinia

Test: OpenCL LavaMD

Rodinia 2.4 - Test: OpenCL LavaMD (Seconds, Fewer Is Better)
GeForce GTX 680: 9.26 (SE +/- 0.14, N = 6)
Radeon HD 7850: 7.66 (SE +/- 0.02, N = 3)
Radeon HD 7950: 5.79 (SE +/- 0.01, N = 3)
Radeon R9 270X: 6.17 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
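Each result in this export is given as a mean with a standard error (SE) and a run count (N). As a rough aid to reading those figures, the sketch below converts the LavaMD standard errors back into an approximate sample standard deviation and expresses each SE as a percentage of its mean. It assumes that SE is the standard error of the mean, SE = s / sqrt(N); that definition is an assumption here, not something the export states. The function names are hypothetical.

```python
import math

# (configuration, mean seconds, standard error, run count) from the LavaMD results
lavamd = [
    ("GeForce GTX 680", 9.26, 0.14, 6),
    ("Radeon HD 7850", 7.66, 0.02, 3),
    ("Radeon HD 7950", 5.79, 0.01, 3),
    ("Radeon R9 270X", 6.17, 0.01, 3),
]


def sample_std_dev(se, n):
    """Approximate sample standard deviation, assuming SE = s / sqrt(N)."""
    return se * math.sqrt(n)


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


for name, mean, se, n in lavamd:
    sd = sample_std_dev(se, n)
    rel = relative_se_percent(mean, se)
    print(f"{name}: SD ~ {sd:.3f} s over {n} runs, SE is {rel:.2f}% of the mean")
```

Because the SE values in the export are rounded to two decimals, the recovered standard deviations are only approximate, especially where the SE is 0.01 or 0.02.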

Rodinia

Test: OpenCL Myocyte

Rodinia 2.4 - Test: OpenCL Myocyte (Seconds, Fewer Is Better)
GPUTest: 124.44 (SE +/- 1.78, N = 5)
GeForce GT 240: 137.90 (SE +/- 0.12, N = 3)
GeForce GT 610: 95.88 (SE +/- 0.08, N = 3)
GeForce GTX 460: 62.81 (SE +/- 0.07, N = 3)
GeForce GTX 680: 60.78 (SE +/- 1.14, N = 3)
Radeon HD 6870: 493.51 (SE +/- 1.88, N = 3)
Radeon HD 7850: 508.49 (SE +/- 0.96, N = 3)
Radeon HD 7950: 481.62 (SE +/- 0.84, N = 3)
Radeon R9 270X: 485.32 (SE +/- 0.67, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
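The Myocyte times span nearly an order of magnitude between the NVIDIA and AMD configurations, which is easier to see as a ratio. The following is a minimal sketch, not part of the original export, that normalizes each mean time against the fastest configuration; since fewer seconds is better, a ratio of 2.0 means the run took twice as long. The helper name `slowdown_vs_fastest` is hypothetical.

```python
# Mean Myocyte run times in seconds (fewer is better), copied from the results
myocyte_seconds = {
    "GPUTest": 124.44,
    "GeForce GT 240": 137.90,
    "GeForce GT 610": 95.88,
    "GeForce GTX 460": 62.81,
    "GeForce GTX 680": 60.78,
    "Radeon HD 6870": 493.51,
    "Radeon HD 7850": 508.49,
    "Radeon HD 7950": 481.62,
    "Radeon R9 270X": 485.32,
}


def slowdown_vs_fastest(times):
    """Return each time divided by the smallest time in the mapping."""
    fastest = min(times.values())
    return {name: t / fastest for name, t in times.items()}


ratios = slowdown_vs_fastest(myocyte_seconds)

# Print from fastest to slowest
for name, ratio in sorted(ratios.items(), key=lambda item: item[1]):
    print(f"{name:16s} {myocyte_seconds[name]:8.2f} s  {ratio:5.2f}x")
```

The same helper applies to any "Fewer Is Better" test in this export; for "More Is Better" tests the ratio would need to be inverted.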

Rodinia

Test: OpenCL Heartwall

Rodinia 2.4 - Test: OpenCL Heartwall (Seconds, Fewer Is Better)
GPUTest: 7.26 (SE +/- 0.16, N = 6)
GeForce GT 240: 36.99 (SE +/- 0.28, N = 3)
GeForce GT 610: 22.72 (SE +/- 0.10, N = 3)
GeForce GTX 460: 4.80 (SE +/- 0.07, N = 3)
GeForce GTX 550 Ti: 5.49 (SE +/- 0.13, N = 6)
GeForce GTX 650: 20.14 (SE +/- 0.10, N = 3)
GeForce GTX 680: 5.42 (SE +/- 0.09, N = 6)
Radeon HD 6870: 19.67 (SE +/- 0.13, N = 3)
Radeon HD 7850: 6.79 (SE +/- 0.05, N = 3)
Radeon HD 7950: 5.88 (SE +/- 0.11, N = 3)
Radeon R9 270X: 5.40 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Leukocyte

Rodinia 2.4 - Test: OpenCL Leukocyte (Seconds, Fewer Is Better)
GPUTest: 28.92 (SE +/- 0.46, N = 4)
GeForce GT 240: 40.49 (SE +/- 0.75, N = 3)
GeForce GT 610: 100.31 (SE +/- 1.23, N = 3)
GeForce GTX 460: 24.66 (SE +/- 0.30, N = 3)
GeForce GTX 550 Ti: 27.70 (SE +/- 0.48, N = 4)
GeForce GTX 650: 30.39 (SE +/- 0.26, N = 3)
GeForce GTX 680: 14.02 (SE +/- 0.24, N = 4)
Radeon HD 6870: 89.87 (SE +/- 3.37, N = 6)
Radeon HD 7850: 89.15 (SE +/- 5.05, N = 6)
Radeon HD 7950: 83.83 (SE +/- 4.92, N = 6)
Radeon R9 270X: 84.66 (SE +/- 6.06, N = 6)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Particle Filter

Rodinia 2.4 - Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
GPUTest: 26.27 (SE +/- 0.41, N = 3)
GeForce GT 610: 15.28 (SE +/- 0.57, N = 6)
GeForce GTX 460: 35.96 (SE +/- 0.01, N = 3)
GeForce GTX 550 Ti: 44.35 (SE +/- 0.24, N = 3)
GeForce GTX 650: 66.48 (SE +/- 0.07, N = 3)
GeForce GTX 680: 18.19 (SE +/- 0.17, N = 3)
Radeon HD 7850: 18.11 (SE +/- 0.02, N = 3)
Radeon HD 7950: 10.78 (SE +/- 0.00, N = 3)
Radeon R9 270X: 13.01 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
GPUTest: 16429443.43 (SE +/- 17021.97, N = 3)
GeForce GT 240: 13543191.90 (SE +/- 17817.43, N = 3)
GeForce GT 610: 4060660.07 (SE +/- 2505.78, N = 3)
GeForce GTX 460: 21199859.50 (SE +/- 4955.24, N = 3)
GeForce GTX 550 Ti: 17540483.97 (SE +/- 13067.53, N = 3)
GeForce GTX 650: 10841150.33 (SE +/- 2871.63, N = 3)
GeForce GTX 680: 36123507.37 (SE +/- 72150.98, N = 3)
1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

OpenCL Device: GPU

MandelbulbGPU 1.0pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
GPUTest: 7993787.50 (SE +/- 6624.17, N = 3)
GeForce GT 240: 7095143.77 (SE +/- 3953.96, N = 3)
GeForce GT 610: 1776400.70 (SE +/- 614.61, N = 3)
GeForce GTX 460: 9382926.17 (SE +/- 10640.98, N = 3)
GeForce GTX 550 Ti: 7729296.07 (SE +/- 4259.44, N = 3)
GeForce GTX 650: 4992671.83 (SE +/- 4281.68, N = 3)
GeForce GTX 680: 17083392.33 (SE +/- 72849.89, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
GPUTest: 22070355.47 (SE +/- 4799.23, N = 3)
GeForce GT 240: 5291547.00 (SE +/- 217.94, N = 3)
GeForce GT 610: 3715320.13 (SE +/- 230.98, N = 3)
GeForce GTX 460: 21375259.40 (SE +/- 2448.11, N = 3)
GeForce GTX 550 Ti: 17362190.07 (SE +/- 6289.55, N = 3)
GeForce GTX 650: 12405397.70 (SE +/- 857.65, N = 3)
GeForce GTX 680: 47966936.67 (SE +/- 5781.86, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

OpenCL Device: GPU - Scene: Room

LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Room (Score, More Is Better)
GPUTest: 133 (SE +/- 0.33, N = 3)
GeForce GT 610: 27 (SE +/- 0.33, N = 3)
GeForce GTX 550 Ti: 150 (SE +/- 0.33, N = 3)
GeForce GTX 650: 82 (SE +/- 0.00, N = 3)
GeForce GTX 680: 296 (SE +/- 0.33, N = 3)
Radeon HD 6870: 107 (SE +/- 0.00, N = 3)
Radeon HD 7850: 624 (SE +/- 0.00, N = 3)
Radeon HD 7950: 1002 (SE +/- 0.58, N = 3)
Radeon R9 270X: 867 (SE +/- 0.67, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Sala

LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Sala (Score, More Is Better)
GPUTest: 278 (SE +/- 0.00, N = 3)
GeForce GT 610: 42 (SE +/- 0.00, N = 3)
GeForce GTX 460: 382 (SE +/- 0.00, N = 3)
GeForce GTX 550 Ti: 319 (SE +/- 0.00, N = 3)
GeForce GTX 650: 177 (SE +/- 0.00, N = 3)
GeForce GTX 680: 651 (SE +/- 0.00, N = 3)
Radeon HD 6870: 631 (SE +/- 0.00, N = 3)
Radeon HD 7850: 1134 (SE +/- 0.33, N = 3)
Radeon HD 7950: 1896 (SE +/- 0.67, N = 3)
Radeon R9 270X: 1588 (SE +/- 0.00, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 2.1beta1 - OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)
GPUTest: 1807 (SE +/- 0.33, N = 3)
GeForce GT 240: 1051 (SE +/- 0.00, N = 3)
GeForce GT 610: 315 (SE +/- 0.00, N = 3)
GeForce GTX 460: 2591 (SE +/- 0.58, N = 3)
GeForce GTX 550 Ti: 2162 (SE +/- 0.00, N = 3)
GeForce GTX 650: 1183 (SE +/- 0.00, N = 3)
GeForce GTX 680: 4216 (SE +/- 0.67, N = 3)
Radeon HD 6870: 6149 (SE +/- 0.88, N = 3)
Radeon HD 7850: 8361 (SE +/- 1.45, N = 3)
Radeon HD 7950: 13087 (SE +/- 4.84, N = 3)
Radeon R9 270X: 11031 (SE +/- 3.61, N = 3)
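The three LuxMark scenes produce scores on very different scales (tens to hundreds for Room, thousands for Luxball HDR), so a plain average would be dominated by Luxball HDR. One common way to combine such scores is a geometric mean; the sketch below applies it to the configurations that have a result for all three scenes. This aggregation is an illustration added here and is not a figure reported in the original export.

```python
from statistics import geometric_mean

# LuxMark scores per configuration as (Room, Sala, Luxball HDR); more is better.
# GeForce GT 240 and GeForce GTX 460 are left out because they lack a Room result.
luxmark_scores = {
    "GPUTest": (133, 278, 1807),
    "GeForce GT 610": (27, 42, 315),
    "GeForce GTX 550 Ti": (150, 319, 2162),
    "GeForce GTX 650": (82, 177, 1183),
    "GeForce GTX 680": (296, 651, 4216),
    "Radeon HD 6870": (107, 631, 6149),
    "Radeon HD 7850": (624, 1134, 8361),
    "Radeon HD 7950": (1002, 1896, 13087),
    "Radeon R9 270X": (867, 1588, 11031),
}

# Geometric mean of the three scene scores for each configuration
combined = {name: geometric_mean(scores) for name, scores in luxmark_scores.items()}

# Print from highest to lowest combined score
for name, score in sorted(combined.items(), key=lambda item: item[1], reverse=True):
    print(f"{name:20s} {score:10.1f}")
```

The geometric mean weights a given percentage change in any one scene equally, which is why it is often preferred when component scores differ by orders of magnitude.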

OpenDwarfs

Test: LU Decomposition

OpenDwarfs 2013-11-06 - Test: LU Decomposition (ms, Fewer Is Better)
GeForce GT 240: 220.27 (SE +/- 1.06, N = 3)
GeForce GT 610: 458.26 (SE +/- 0.06, N = 3)
GeForce GTX 460: 97.69 (SE +/- 0.03, N = 3)
GeForce GTX 550 Ti: 110.81 (SE +/- 0.14, N = 3)
GeForce GTX 650: 152.70 (SE +/- 0.21, N = 3)
GeForce GTX 680: 58.94 (SE +/- 0.88, N = 5)
Radeon HD 6870: 161.42 (SE +/- 4.12, N = 6)
Radeon HD 7850: 135.79 (SE +/- 5.24, N = 6)
Radeon HD 7950: 123.96 (SE +/- 4.65, N = 6)
Radeon R9 270X: 126.60 (SE +/- 4.83, N = 6)
1. (CC) gcc options: -lm -lOpenCL

OpenDwarfs

Test: Compressed Sparse Row

OpenDwarfs 2013-11-06 - Test: Compressed Sparse Row (ms, Fewer Is Better)
GeForce GT 240: 6.88 (SE +/- 0.11, N = 3)
GeForce GT 610: 9.00 (SE +/- 0.56, N = 6)
GeForce GTX 460: 6.40 (SE +/- 0.76, N = 6)
GeForce GTX 550 Ti: 6.55 (SE +/- 0.80, N = 6)
GeForce GTX 650: 6.76 (SE +/- 0.57, N = 6)
GeForce GTX 680: 6.10 (SE +/- 0.58, N = 6)
Radeon HD 6870: 24.50 (SE +/- 0.81, N = 6)
Radeon HD 7850: 18.09 (SE +/- 0.50, N = 6)
Radeon HD 7950: 17.83 (SE +/- 0.86, N = 6)
Radeon R9 270X: 16.25 (SE +/- 0.89, N = 6)
1. (CC) gcc options: -lm -lOpenCL


Phoronix Test Suite v10.8.4