NVIDIA AMD OpenCL Linux GPGPU Tests

Benchmarks by Michael Larabel for a future article on Phoronix. The tests run a selection of OpenBenchmarking.org OpenCL benchmarks across a range of AMD Radeon and NVIDIA GeForce graphics cards with the proprietary drivers under Ubuntu Linux.

HTML result view exported from: https://openbenchmarking.org/result/1311130-SO-1311092SO95&grt.

System Configurations

GeForce GT 240 (first system listed):
  Processor: AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)
  Motherboard: ASUS Crosshair V Formula
  Chipset: AMD RD890 bridge
  Memory: 8192MB
  Disk: 64GB OCZ AGILITY
  Graphics: ECS NVIDIA GeForce GT 240 512MB (550/1700MHz)
  Audio: Realtek ALC889
  Network: Intel 82583V Gigabit Connection
  OS: Ubuntu 13.10
  Kernel: 3.11.0-12-generic (x86_64)
  Desktop: Unity 7.1.2
  Display Server: X Server 1.14.3
  Display Driver: NVIDIA 331.20
  OpenGL: 3.3.0
  Compiler: GCC 4.8
  File-System: ext4
  Screen Resolution: 2560x1600

For each later system, the export lists only the following fields:

GeForce GTX 460:
  Graphics: ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)
  OpenGL: 4.3.0

GeForce GTX 550 Ti:
  Graphics: eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)

GeForce GT 610:
  Graphics: NVIDIA GeForce GT 610 1024MB (810/533MHz)

GeForce GTX 650:
  Graphics: MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)

GeForce GTX 680:
  Graphics: NVIDIA GeForce GTX 680 2048MB (705/3004MHz)

Radeon HD 6870:
  Graphics: Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)
  Monitor: SyncMaster + SyncMaster
  Display Driver: fglrx 13.25.5
  OpenGL: 4.3.12614

Radeon HD 7850:
  Graphics: ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)

Radeon HD 7950:
  Graphics: XFX AMD Radeon HD 7900 3072MB (900/1375MHz)

Radeon R9 270X:
  Graphics: Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)

GPUTest:
  Processor: Intel Core 2 Quad @ 2.40GHz (4 Cores)
  Motherboard: Dell 0CT017
  Chipset: Intel 82P965/G965 + ICH8DH
  Disk: 320GB Western Digital WD3200AAKS-0 + 2000GB Western Digital WD20EARX-008
  Graphics: Gigabyte NVIDIA GeForce GTX 650 Ti 2048MB (1032/2700MHz)
  Audio: Creative Labs SB X-Fi
  Network: Intel 82566DC Gigabit Connection
  Kernel: 3.11.0-13-generic (x86_64)
  Display Driver: NVIDIA 304.88
  OpenGL: 4.2.0 NVIDIA 304.88
  Compiler: GCC 4.8 + CUDA 5.0
  Screen Resolution: 1920x1080

Compiler Details

--build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details

Scaling Governor: acpi-cpufreq ondemand

OpenCL Details

GeForce GT 240: GPU Compute Cores: 96
GeForce GTX 460: GPU Compute Cores: 336
GeForce GTX 550 Ti: GPU Compute Cores: 192
GeForce GT 610: GPU Compute Cores: 48
GeForce GTX 650: GPU Compute Cores: 384
GeForce GTX 680: GPU Compute Cores: 1536
GPUTest: GPU Compute Cores: 768

System Details

GeForce GT 240: GPU Compute Cores: 96
GeForce GTX 460: GPU Compute Cores: 336
GeForce GTX 550 Ti: GPU Compute Cores: 192
GeForce GT 610: GPU Compute Cores: 48
GeForce GTX 650: GPU Compute Cores: 384
GeForce GTX 680: GPU Compute Cores: 1536
GPUTest: GPU Compute Cores: 768

Environment Details

Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri

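Results Overview

GeForce GT 240: juliagpu: GPU = 13543191.90; luxmark: GPU - Luxball HDR = 1051; mandelbulbgpu: GPU = 7095143.77; mandelgpu: GPU = 5291547.00; opendwarfs: LU Decomposition = 220.27; opendwarfs: Compressed Sparse Row = 6.88; rodinia: OpenCL Myocyte = 137.90; rodinia: OpenCL Heartwall = 36.99; rodinia: OpenCL Leukocyte = 40.49

GeForce GTX 460: juliagpu: GPU = 21199859.50; luxmark: GPU - Sala = 382; luxmark: GPU - Luxball HDR = 2591; mandelbulbgpu: GPU = 9382926.17; mandelgpu: GPU = 21375259.40; opendwarfs: LU Decomposition = 97.69; opendwarfs: Compressed Sparse Row = 6.40; rodinia: OpenCL Myocyte = 62.81; rodinia: OpenCL Heartwall = 4.80; rodinia: OpenCL Leukocyte = 24.66; rodinia: OpenCL Particle Filter = 35.96

GeForce GTX 550 Ti: juliagpu: GPU = 17540483.97; luxmark: GPU - Room = 150; luxmark: GPU - Sala = 319; luxmark: GPU - Luxball HDR = 2162; mandelbulbgpu: GPU = 7729296.07; mandelgpu: GPU = 17362190.07; opendwarfs: LU Decomposition = 110.81; opendwarfs: Compressed Sparse Row = 6.55; rodinia: OpenCL Heartwall = 5.49; rodinia: OpenCL Leukocyte = 27.70; rodinia: OpenCL Particle Filter = 44.35

GeForce GT 610: juliagpu: GPU = 4060660.07; luxmark: GPU - Room = 27; luxmark: GPU - Sala = 42; luxmark: GPU - Luxball HDR = 315; mandelbulbgpu: GPU = 1776400.70; mandelgpu: GPU = 3715320.13; opendwarfs: LU Decomposition = 458.26; opendwarfs: Compressed Sparse Row = 9.00; rodinia: OpenCL Myocyte = 95.88; rodinia: OpenCL Heartwall = 22.72; rodinia: OpenCL Leukocyte = 100.31; rodinia: OpenCL Particle Filter = 15.28

GeForce GTX 650: juliagpu: GPU = 10841150.33; luxmark: GPU - Room = 82; luxmark: GPU - Sala = 177; luxmark: GPU - Luxball HDR = 1183; mandelbulbgpu: GPU = 4992671.83; mandelgpu: GPU = 12405397.70; opendwarfs: LU Decomposition = 152.70; opendwarfs: Compressed Sparse Row = 6.76; rodinia: OpenCL Heartwall = 20.14; rodinia: OpenCL Leukocyte = 30.39; rodinia: OpenCL Particle Filter = 66.48

GeForce GTX 680: juliagpu: GPU = 36123507.37; luxmark: GPU - Room = 296; luxmark: GPU - Sala = 651; luxmark: GPU - Luxball HDR = 4216; mandelbulbgpu: GPU = 17083392.33; mandelgpu: GPU = 47966936.67; opendwarfs: LU Decomposition = 58.94; opendwarfs: Compressed Sparse Row = 6.10; rodinia: OpenCL LavaMD = 9.26; rodinia: OpenCL Myocyte = 60.78; rodinia: OpenCL Heartwall = 5.42; rodinia: OpenCL Leukocyte = 14.02; rodinia: OpenCL Particle Filter = 18.19

Radeon HD 6870: luxmark: GPU - Room = 107; luxmark: GPU - Sala = 631; luxmark: GPU - Luxball HDR = 6149; opendwarfs: LU Decomposition = 161.42; opendwarfs: Compressed Sparse Row = 24.50; rodinia: OpenCL Myocyte = 493.51; rodinia: OpenCL Heartwall = 19.67; rodinia: OpenCL Leukocyte = 89.87

Radeon HD 7850: luxmark: GPU - Room = 624; luxmark: GPU - Sala = 1134; luxmark: GPU - Luxball HDR = 8361; opendwarfs: LU Decomposition = 135.79; opendwarfs: Compressed Sparse Row = 18.09; rodinia: OpenCL LavaMD = 7.66; rodinia: OpenCL Myocyte = 508.49; rodinia: OpenCL Heartwall = 6.79; rodinia: OpenCL Leukocyte = 89.15; rodinia: OpenCL Particle Filter = 18.11

Radeon HD 7950: luxmark: GPU - Room = 1002; luxmark: GPU - Sala = 1896; luxmark: GPU - Luxball HDR = 13087; opendwarfs: LU Decomposition = 123.96; opendwarfs: Compressed Sparse Row = 17.83; rodinia: OpenCL LavaMD = 5.79; rodinia: OpenCL Myocyte = 481.62; rodinia: OpenCL Heartwall = 5.88; rodinia: OpenCL Leukocyte = 83.83; rodinia: OpenCL Particle Filter = 10.78

Radeon R9 270X: luxmark: GPU - Room = 867; luxmark: GPU - Sala = 1588; luxmark: GPU - Luxball HDR = 11031; opendwarfs: LU Decomposition = 126.60; opendwarfs: Compressed Sparse Row = 16.25; rodinia: OpenCL LavaMD = 6.17; rodinia: OpenCL Myocyte = 485.32; rodinia: OpenCL Heartwall = 5.40; rodinia: OpenCL Leukocyte = 84.66; rodinia: OpenCL Particle Filter = 13.01

GPUTest: juliagpu: GPU = 16429443.43; luxmark: GPU - Room = 133; luxmark: GPU - Sala = 278; luxmark: GPU - Luxball HDR = 1807; mandelbulbgpu: GPU = 7993787.50; mandelgpu: GPU = 22070355.47; rodinia: OpenCL Myocyte = 124.44; rodinia: OpenCL Heartwall = 7.26; rodinia: OpenCL Leukocyte = 28.92; rodinia: OpenCL Particle Filter = 26.27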

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1 (Samples/sec, more is better)
GeForce GT 240: 13543191.90 (SE +/- 17817.43, N = 3)
GeForce GTX 460: 21199859.50 (SE +/- 4955.24, N = 3)
GeForce GTX 550 Ti: 17540483.97 (SE +/- 13067.53, N = 3)
GeForce GT 610: 4060660.07 (SE +/- 2505.78, N = 3)
GeForce GTX 650: 10841150.33 (SE +/- 2871.63, N = 3)
GeForce GTX 680: 36123507.37 (SE +/- 72150.98, N = 3)
GPUTest: 16429443.43 (SE +/- 17021.97, N = 3)
(CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
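To compare the cards at a glance, the JuliaGPU figures above can be normalized so that the fastest system scores 1.0. The following is a minimal sketch in Python; the helper name `relative_to_best` is hypothetical and not part of the Phoronix Test Suite, and the numbers are simply copied from the results above.

```python
# JuliaGPU results in samples/sec, copied from the figures above.
juliagpu = {
    "GeForce GT 240": 13543191.90,
    "GeForce GTX 460": 21199859.50,
    "GeForce GTX 550 Ti": 17540483.97,
    "GeForce GT 610": 4060660.07,
    "GeForce GTX 650": 10841150.33,
    "GeForce GTX 680": 36123507.37,
    "GPUTest": 16429443.43,
}


def relative_to_best(results, more_is_better=True):
    """Scale each result so the best system scores 1.0 (hypothetical helper)."""
    if more_is_better:
        best = max(results.values())
        return {name: value / best for name, value in results.items()}
    # For "fewer is better" tests the smallest value is the best one.
    best = min(results.values())
    return {name: best / value for name, value in results.items()}


if __name__ == "__main__":
    shares = relative_to_best(juliagpu)
    # Print the systems from fastest to slowest.
    for name, share in sorted(shares.items(), key=lambda item: item[1], reverse=True):
        print(f"{name:<20} {share:6.1%}")
```

Passing `more_is_better=False` applies the same normalization to the timing-based tests (OpenDwarfs, Rodinia), where a lower number is the better result.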

LuxMark

OpenCL Device: GPU - Scene: Room

LuxMark 2.1beta1 (Score, more is better)
GeForce GTX 550 Ti: 150 (SE +/- 0.33, N = 3)
GeForce GT 610: 27 (SE +/- 0.33, N = 3)
GeForce GTX 650: 82 (SE +/- 0.00, N = 3)
GeForce GTX 680: 296 (SE +/- 0.33, N = 3)
Radeon HD 6870: 107 (SE +/- 0.00, N = 3)
Radeon HD 7850: 624 (SE +/- 0.00, N = 3)
Radeon HD 7950: 1002 (SE +/- 0.58, N = 3)
Radeon R9 270X: 867 (SE +/- 0.67, N = 3)
GPUTest: 133 (SE +/- 0.33, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Sala

LuxMark 2.1beta1 (Score, more is better)
GeForce GTX 460: 382 (SE +/- 0.00, N = 3)
GeForce GTX 550 Ti: 319 (SE +/- 0.00, N = 3)
GeForce GT 610: 42 (SE +/- 0.00, N = 3)
GeForce GTX 650: 177 (SE +/- 0.00, N = 3)
GeForce GTX 680: 651 (SE +/- 0.00, N = 3)
Radeon HD 6870: 631 (SE +/- 0.00, N = 3)
Radeon HD 7850: 1134 (SE +/- 0.33, N = 3)
Radeon HD 7950: 1896 (SE +/- 0.67, N = 3)
Radeon R9 270X: 1588 (SE +/- 0.00, N = 3)
GPUTest: 278 (SE +/- 0.00, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 2.1beta1 (Score, more is better)
GeForce GT 240: 1051 (SE +/- 0.00, N = 3)
GeForce GTX 460: 2591 (SE +/- 0.58, N = 3)
GeForce GTX 550 Ti: 2162 (SE +/- 0.00, N = 3)
GeForce GT 610: 315 (SE +/- 0.00, N = 3)
GeForce GTX 650: 1183 (SE +/- 0.00, N = 3)
GeForce GTX 680: 4216 (SE +/- 0.67, N = 3)
Radeon HD 6870: 6149 (SE +/- 0.88, N = 3)
Radeon HD 7850: 8361 (SE +/- 1.45, N = 3)
Radeon HD 7950: 13087 (SE +/- 4.84, N = 3)
Radeon R9 270X: 11031 (SE +/- 3.61, N = 3)
GPUTest: 1807 (SE +/- 0.33, N = 3)

MandelbulbGPU

OpenCL Device: GPU

MandelbulbGPU 1.0pts1 (Samples/sec, more is better)
GeForce GT 240: 7095143.77 (SE +/- 3953.96, N = 3)
GeForce GTX 460: 9382926.17 (SE +/- 10640.98, N = 3)
GeForce GTX 550 Ti: 7729296.07 (SE +/- 4259.44, N = 3)
GeForce GT 610: 1776400.70 (SE +/- 614.61, N = 3)
GeForce GTX 650: 4992671.83 (SE +/- 4281.68, N = 3)
GeForce GTX 680: 17083392.33 (SE +/- 72849.89, N = 3)
GPUTest: 7993787.50 (SE +/- 6624.17, N = 3)
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1 (Samples/sec, more is better)
GeForce GT 240: 5291547.00 (SE +/- 217.94, N = 3)
GeForce GTX 460: 21375259.40 (SE +/- 2448.11, N = 3)
GeForce GTX 550 Ti: 17362190.07 (SE +/- 6289.55, N = 3)
GeForce GT 610: 3715320.13 (SE +/- 230.98, N = 3)
GeForce GTX 650: 12405397.70 (SE +/- 857.65, N = 3)
GeForce GTX 680: 47966936.67 (SE +/- 5781.86, N = 3)
GPUTest: 22070355.47 (SE +/- 4799.23, N = 3)
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenDwarfs

Test: LU Decomposition

OpenDwarfs 2013-11-06 (ms, fewer is better)
GeForce GT 240: 220.27 (SE +/- 1.06, N = 3)
GeForce GTX 460: 97.69 (SE +/- 0.03, N = 3)
GeForce GTX 550 Ti: 110.81 (SE +/- 0.14, N = 3)
GeForce GT 610: 458.26 (SE +/- 0.06, N = 3)
GeForce GTX 650: 152.70 (SE +/- 0.21, N = 3)
GeForce GTX 680: 58.94 (SE +/- 0.88, N = 5)
Radeon HD 6870: 161.42 (SE +/- 4.12, N = 6)
Radeon HD 7850: 135.79 (SE +/- 5.24, N = 6)
Radeon HD 7950: 123.96 (SE +/- 4.65, N = 6)
Radeon R9 270X: 126.60 (SE +/- 4.83, N = 6)
(CC) gcc options: -lm -lOpenCL
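The "SE +/-" and "N" figures above indicate how much the runs varied. As a rough sketch, and assuming SE here means the standard error of the mean (SE = s / sqrt(N), which the export itself does not spell out), the relative standard error and the implied sample standard deviation can be recovered as follows. The helper name `spread` is hypothetical.

```python
import math

# (mean in ms, SE, N) for OpenDwarfs LU Decomposition, copied from the figures above.
lu_decomposition = {
    "GeForce GT 240": (220.27, 1.06, 3),
    "GeForce GTX 460": (97.69, 0.03, 3),
    "GeForce GTX 550 Ti": (110.81, 0.14, 3),
    "GeForce GT 610": (458.26, 0.06, 3),
    "GeForce GTX 650": (152.70, 0.21, 3),
    "GeForce GTX 680": (58.94, 0.88, 5),
    "Radeon HD 6870": (161.42, 4.12, 6),
    "Radeon HD 7850": (135.79, 5.24, 6),
    "Radeon HD 7950": (123.96, 4.65, 6),
    "Radeon R9 270X": (126.60, 4.83, 6),
}


def spread(mean, se, n):
    """Return (relative standard error, sample standard deviation).

    Assumption: SE is the standard error of the mean, so s = SE * sqrt(N).
    """
    return se / mean, se * math.sqrt(n)


if __name__ == "__main__":
    for name, (mean, se, n) in lu_decomposition.items():
        rse, std_dev = spread(mean, se, n)
        print(f"{name:<20} RSE {rse:6.2%}  std dev ~ {std_dev:6.2f} ms  (N = {n})")
```

A larger N alongside a larger SE suggests the extra runs were added because the earlier ones disagreed, so those results deserve a wider margin when comparing cards.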

OpenDwarfs

Test: Compressed Sparse Row

OpenDwarfs 2013-11-06 (ms, fewer is better)
GeForce GT 240: 6.88 (SE +/- 0.11, N = 3)
GeForce GTX 460: 6.40 (SE +/- 0.76, N = 6)
GeForce GTX 550 Ti: 6.55 (SE +/- 0.80, N = 6)
GeForce GT 610: 9.00 (SE +/- 0.56, N = 6)
GeForce GTX 650: 6.76 (SE +/- 0.57, N = 6)
GeForce GTX 680: 6.10 (SE +/- 0.58, N = 6)
Radeon HD 6870: 24.50 (SE +/- 0.81, N = 6)
Radeon HD 7850: 18.09 (SE +/- 0.50, N = 6)
Radeon HD 7950: 17.83 (SE +/- 0.86, N = 6)
Radeon R9 270X: 16.25 (SE +/- 0.89, N = 6)
(CC) gcc options: -lm -lOpenCL

Rodinia

Test: OpenCL LavaMD

Rodinia 2.4 (Seconds, fewer is better)
GeForce GTX 680: 9.26 (SE +/- 0.14, N = 6)
Radeon HD 7850: 7.66 (SE +/- 0.02, N = 3)
Radeon HD 7950: 5.79 (SE +/- 0.01, N = 3)
Radeon R9 270X: 6.17 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Myocyte

Rodinia 2.4 (Seconds, fewer is better)
GeForce GT 240: 137.90 (SE +/- 0.12, N = 3)
GeForce GTX 460: 62.81 (SE +/- 0.07, N = 3)
GeForce GT 610: 95.88 (SE +/- 0.08, N = 3)
GeForce GTX 680: 60.78 (SE +/- 1.14, N = 3)
Radeon HD 6870: 493.51 (SE +/- 1.88, N = 3)
Radeon HD 7850: 508.49 (SE +/- 0.96, N = 3)
Radeon HD 7950: 481.62 (SE +/- 0.84, N = 3)
Radeon R9 270X: 485.32 (SE +/- 0.67, N = 3)
GPUTest: 124.44 (SE +/- 1.78, N = 5)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Heartwall

Rodinia 2.4 (Seconds, fewer is better)
GeForce GT 240: 36.99 (SE +/- 0.28, N = 3)
GeForce GTX 460: 4.80 (SE +/- 0.07, N = 3)
GeForce GTX 550 Ti: 5.49 (SE +/- 0.13, N = 6)
GeForce GT 610: 22.72 (SE +/- 0.10, N = 3)
GeForce GTX 650: 20.14 (SE +/- 0.10, N = 3)
GeForce GTX 680: 5.42 (SE +/- 0.09, N = 6)
Radeon HD 6870: 19.67 (SE +/- 0.13, N = 3)
Radeon HD 7850: 6.79 (SE +/- 0.05, N = 3)
Radeon HD 7950: 5.88 (SE +/- 0.11, N = 3)
Radeon R9 270X: 5.40 (SE +/- 0.08, N = 3)
GPUTest: 7.26 (SE +/- 0.16, N = 6)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Leukocyte

Rodinia 2.4 (Seconds, fewer is better)
GeForce GT 240: 40.49 (SE +/- 0.75, N = 3)
GeForce GTX 460: 24.66 (SE +/- 0.30, N = 3)
GeForce GTX 550 Ti: 27.70 (SE +/- 0.48, N = 4)
GeForce GT 610: 100.31 (SE +/- 1.23, N = 3)
GeForce GTX 650: 30.39 (SE +/- 0.26, N = 3)
GeForce GTX 680: 14.02 (SE +/- 0.24, N = 4)
Radeon HD 6870: 89.87 (SE +/- 3.37, N = 6)
Radeon HD 7850: 89.15 (SE +/- 5.05, N = 6)
Radeon HD 7950: 83.83 (SE +/- 4.92, N = 6)
Radeon R9 270X: 84.66 (SE +/- 6.06, N = 6)
GPUTest: 28.92 (SE +/- 0.46, N = 4)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Particle Filter

Rodinia 2.4 (Seconds, fewer is better)
GeForce GTX 460: 35.96 (SE +/- 0.01, N = 3)
GeForce GTX 550 Ti: 44.35 (SE +/- 0.24, N = 3)
GeForce GT 610: 15.28 (SE +/- 0.57, N = 6)
GeForce GTX 650: 66.48 (SE +/- 0.07, N = 3)
GeForce GTX 680: 18.19 (SE +/- 0.17, N = 3)
Radeon HD 7850: 18.11 (SE +/- 0.02, N = 3)
Radeon HD 7950: 10.78 (SE +/- 0.00, N = 3)
Radeon R9 270X: 13.01 (SE +/- 0.02, N = 3)
GPUTest: 26.27 (SE +/- 0.41, N = 3)
(CXX) g++ options: -O2 -lOpenCL


Phoronix Test Suite v10.8.4