NVIDIA AMD OpenCL Linux GPGPU Tests

Benchmarks by Michael Larabel for a future article on Phoronix. Just looking at a range of OpenBenchmarking.org OpenCL benchmarks on a range of AMD Radeon and NVIDIA GeForce graphics cards with the proprietary drivers under Ubuntu Linux.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1406282-PL-1311092SO17
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

NVIDIA GPU Compute 2 Tests
OpenCL 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GT 240
November 08 2013
 
GeForce GTX 460
November 07 2013
 
GeForce GTX 550 Ti
November 07 2013
 
GeForce GT 610
November 08 2013
 
GeForce GTX 650
November 07 2013
 
GeForce GTX 680
November 07 2013
 
Radeon HD 6870
November 09 2013
 
Radeon HD 7850
November 09 2013
 
Radeon HD 7950
November 09 2013
 
Radeon R9 270X
November 09 2013
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA AMD OpenCL Linux GPGPU TestsOpenBenchmarking.orgPhoronix Test SuiteAMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)ASUS Crosshair V FormulaAMD RD890 bridge8192MB64GB OCZ AGILITYECS NVIDIA GeForce GT 240 512MB (550/1700MHz)ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)NVIDIA GeForce GT 610 1024MB (810/533MHz)MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)NVIDIA GeForce GTX 680 2048MB (705/3004MHz)Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)XFX AMD Radeon HD 7900 3072MB (900/1375MHz)Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)Realtek ALC889Intel 82583V Gigabit ConnectionUbuntu 13.103.11.0-12-generic (x86_64)Unity 7.1.2X Server 1.14.3NVIDIA 331.20fglrx 13.25.53.3.04.3.04.3.12614GCC 4.8ext42560x1600ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriversOpenGLsCompilerFile-SystemScreen ResolutionNVIDIA AMD OpenCL Linux GPGPU Tests PerformanceSystem Logs- --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - Scaling Governor: acpi-cpufreq ondemand- GeForce GT 240: GPU Compute Cores: 96- GeForce GTX 460: GPU Compute Cores: 336- GeForce GTX 550 Ti: GPU Compute Cores: 192- GeForce GT 610: GPU Compute Cores: 48- GeForce GTX 650: GPU Compute Cores: 384- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GT 240: GPU Compute Cores: 96.- GeForce GTX 460: GPU Compute Cores: 336.- GeForce GTX 550 Ti: GPU Compute Cores: 192.- GeForce GT 610: GPU Compute Cores: 48.- GeForce GTX 650: GPU Compute Cores: 384.- GeForce GTX 680: GPU Compute Cores: 1536.- Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri

GeForce GT 240GeForce GTX 460GeForce GTX 550 TiGeForce GT 610GeForce GTX 650GeForce GTX 680Radeon HD 6870Radeon HD 7850Radeon HD 7950Radeon R9 270XLogarithmic Result OverviewPhoronix Test SuiteRodiniaLuxMarkOpenDwarfs

NVIDIA AMD OpenCL Linux GPGPU Testsjuliagpu: GPUmandelbulbgpu: GPUmandelgpu: GPUluxmark: GPU - Roomluxmark: GPU - Salaluxmark: GPU - Luxball HDRopendwarfs: LU Decompositionopendwarfs: Compressed Sparse Rowrodinia: OpenCL LavaMDrodinia: OpenCL Myocyterodinia: OpenCL Heartwallrodinia: OpenCL Leukocyterodinia: OpenCL Particle FilterGeForce GT 240GeForce GTX 460GeForce GTX 550 TiGeForce GT 610GeForce GTX 650GeForce GTX 680Radeon HD 6870Radeon HD 7850Radeon HD 7950Radeon R9 270X13543191.907095143.775291547.001051220.276.88137.9036.9940.4921199859.509382926.1721375259.40382259197.696.4062.814.8024.6635.9617540483.977729296.0717362190.071503192162110.816.555.4927.7044.354060660.071776400.703715320.132742315458.269.0095.8822.72100.3115.2810841150.334992671.8312405397.70821771183152.706.7620.1430.3966.4836123507.3717083392.3347966936.67296651421658.946.109.2660.785.4214.0218.191076316149161.4224.50493.5119.6789.8762411348361135.7918.097.66508.496.7989.1518.111002189613087123.9617.835.79481.625.8883.8310.78867158811031126.6016.256.17485.325.4084.6613.01OpenBenchmarking.org

JuliaGPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GT 610GeForce GTX 650GeForce GT 240GeForce GTX 550 TiGeForce GTX 460GeForce GTX 6808M16M24M32M40MSE +/- 2505.78, N = 3SE +/- 2871.63, N = 3SE +/- 17817.43, N = 3SE +/- 13067.53, N = 3SE +/- 4955.24, N = 3SE +/- 72150.98, N = 34060660.0710841150.3313543191.9017540483.9721199859.5036123507.371. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GT 610GeForce GTX 650GeForce GT 240GeForce GTX 550 TiGeForce GTX 460GeForce GTX 6804M8M12M16M20MSE +/- 614.61, N = 3SE +/- 4281.68, N = 3SE +/- 3953.96, N = 3SE +/- 4259.44, N = 3SE +/- 10640.98, N = 3SE +/- 72849.89, N = 31776400.704992671.837095143.777729296.079382926.1717083392.331. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUGeForce GT 610GeForce GT 240GeForce GTX 650GeForce GTX 550 TiGeForce GTX 460GeForce GTX 68010M20M30M40M50MSE +/- 230.98, N = 3SE +/- 217.94, N = 3SE +/- 857.65, N = 3SE +/- 6289.55, N = 3SE +/- 2448.11, N = 3SE +/- 5781.86, N = 33715320.135291547.0012405397.7017362190.0721375259.4047966936.671. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: RoomGeForce GT 610GeForce GTX 650Radeon HD 6870GeForce GTX 550 TiGeForce GTX 680Radeon HD 7850Radeon R9 270XRadeon HD 79502004006008001000SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 327821071502966248671002

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: SalaGeForce GT 610GeForce GTX 650GeForce GTX 550 TiGeForce GTX 460Radeon HD 6870GeForce GTX 680Radeon HD 7850Radeon R9 270XRadeon HD 7950400800120016002000SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.67, N = 342177319382631651113415881896

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: Luxball HDRGeForce GT 610GeForce GT 240GeForce GTX 650GeForce GTX 550 TiGeForce GTX 460GeForce GTX 680Radeon HD 6870Radeon HD 7850Radeon R9 270XRadeon HD 79503K6K9K12K15KSE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 1.45, N = 3SE +/- 3.61, N = 3SE +/- 4.84, N = 331510511183216225914216614983611103113087

OpenDwarfs

OpenDwarfs is a non-commercial OpenCL compute benchmark suite developed at Virginia Tech in cooperation with various organizations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenDwarfs 2013-11-06Test: LU DecompositionGeForce GT 610GeForce GT 240Radeon HD 6870GeForce GTX 650Radeon HD 7850Radeon R9 270XRadeon HD 7950GeForce GTX 550 TiGeForce GTX 460GeForce GTX 680100200300400500SE +/- 0.06, N = 3SE +/- 1.06, N = 3SE +/- 4.12, N = 6SE +/- 0.21, N = 3SE +/- 5.24, N = 6SE +/- 4.83, N = 6SE +/- 4.65, N = 6SE +/- 0.14, N = 3SE +/- 0.03, N = 3SE +/- 0.88, N = 5458.26220.27161.42152.70135.79126.60123.96110.8197.6958.941. (CC) gcc options: -lm -lOpenCL

OpenBenchmarking.orgms, Fewer Is BetterOpenDwarfs 2013-11-06Test: Compressed Sparse RowRadeon HD 6870Radeon HD 7850Radeon HD 7950Radeon R9 270XGeForce GT 610GeForce GT 240GeForce GTX 650GeForce GTX 550 TiGeForce GTX 460GeForce GTX 680612182430SE +/- 0.81, N = 6SE +/- 0.50, N = 6SE +/- 0.86, N = 6SE +/- 0.89, N = 6SE +/- 0.56, N = 6SE +/- 0.11, N = 3SE +/- 0.57, N = 6SE +/- 0.80, N = 6SE +/- 0.76, N = 6SE +/- 0.58, N = 624.5018.0917.8316.259.006.886.766.556.406.101. (CC) gcc options: -lm -lOpenCL

Rodinia

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL LavaMDGeForce GTX 680Radeon HD 7850Radeon R9 270XRadeon HD 79503691215SE +/- 0.14, N = 6SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 39.267.666.175.791. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteRadeon HD 7850Radeon HD 6870Radeon R9 270XRadeon HD 7950GeForce GT 240GeForce GT 610GeForce GTX 460GeForce GTX 680110220330440550SE +/- 0.96, N = 3SE +/- 1.88, N = 3SE +/- 0.67, N = 3SE +/- 0.84, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 1.14, N = 3508.49493.51485.32481.62137.9095.8862.8160.781. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL HeartwallGeForce GT 240GeForce GT 610GeForce GTX 650Radeon HD 6870Radeon HD 7850Radeon HD 7950GeForce GTX 550 TiGeForce GTX 680Radeon R9 270XGeForce GTX 460918273645SE +/- 0.28, N = 3SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.13, N = 6SE +/- 0.09, N = 6SE +/- 0.08, N = 3SE +/- 0.07, N = 336.9922.7220.1419.676.795.885.495.425.404.801. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL LeukocyteGeForce GT 610Radeon HD 6870Radeon HD 7850Radeon R9 270XRadeon HD 7950GeForce GT 240GeForce GTX 650GeForce GTX 550 TiGeForce GTX 460GeForce GTX 68020406080100SE +/- 1.23, N = 3SE +/- 3.37, N = 6SE +/- 5.05, N = 6SE +/- 6.06, N = 6SE +/- 4.92, N = 6SE +/- 0.75, N = 3SE +/- 0.26, N = 3SE +/- 0.48, N = 4SE +/- 0.30, N = 3SE +/- 0.24, N = 4100.3189.8789.1584.6683.8340.4930.3927.7024.6614.021. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterGeForce GTX 650GeForce GTX 550 TiGeForce GTX 460GeForce GTX 680Radeon HD 7850GeForce GT 610Radeon R9 270XRadeon HD 79501530456075SE +/- 0.07, N = 3SE +/- 0.24, N = 3SE +/- 0.01, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.57, N = 6SE +/- 0.02, N = 3SE +/- 0.00, N = 366.4844.3535.9618.1918.1115.2813.0110.781. (CXX) g++ options: -O2 -lOpenCL