NVIDIA AMD OpenCL Linux GPGPU Tests

Benchmarks by Michael Larabel for a future article on Phoronix. These runs cover a range of OpenBenchmarking.org OpenCL benchmarks on AMD Radeon and NVIDIA GeForce graphics cards, using the proprietary drivers under Ubuntu Linux.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1406282-PL-1311092SO17

Result Identifier     Date Run
GeForce GT 240        November 08 2013
GeForce GTX 460       November 07 2013
GeForce GTX 550 Ti    November 07 2013
GeForce GT 610        November 08 2013
GeForce GTX 650       November 07 2013
GeForce GTX 680       November 07 2013
Radeon HD 6870        November 09 2013
Radeon HD 7850        November 09 2013
Radeon HD 7950        November 09 2013
Radeon R9 270X        November 09 2013

System Details

Processor:          AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores)
Motherboard:        ASUS Crosshair V Formula
Chipset:            AMD RD890 bridge
Memory:             8192MB
Disk:               64GB OCZ AGILITY
Graphics:           NVIDIA GeForce GTX 680 2048MB (705/3004MHz)
                    eVGA NVIDIA GeForce GTX 550 Ti 1024MB (951/2178MHz)
                    MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)
                    ASUS NVIDIA GeForce GTX 460 768MB (675/1800MHz)
                    NVIDIA GeForce GT 610 1024MB (810/533MHz)
                    ECS NVIDIA GeForce GT 240 512MB (550/1700MHz)
                    Sapphire AMD Radeon HD 6800 1024MB (900/1050MHz)
                    ASUS AMD Radeon HD 7800 1024MB (860/1200MHz)
                    XFX AMD Radeon HD 7900 3072MB (900/1375MHz)
                    Gigabyte AMD Radeon R9 200 2048MB (1100/1400MHz)
Audio:              Realtek ALC889
Network:            Intel 82583V Gigabit Connection
OS:                 Ubuntu 13.10
Kernel:             3.11.0-12-generic (x86_64)
Desktop:            Unity 7.1.2
Display Server:     X Server 1.14.3
Display Drivers:    NVIDIA 331.20, fglrx 13.25.5
OpenGLs:            4.3.0, 3.3.0, 4.3.12614
Compiler:           GCC 4.8
File-System:        ext4
Screen Resolution:  2560x1600

System Logs

- GCC configure options: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
- Scaling Governor: acpi-cpufreq ondemand
- GeForce GTX 680: GPU Compute Cores: 1536
- GeForce GTX 550 Ti: GPU Compute Cores: 192
- GeForce GTX 650: GPU Compute Cores: 384
- GeForce GTX 460: GPU Compute Cores: 336
- GeForce GT 610: GPU Compute Cores: 48
- GeForce GT 240: GPU Compute Cores: 96
- Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri

[Chart: Logarithmic Result Overview (Rodinia, LuxMark, OpenDwarfs) for GeForce GTX 680, GeForce GTX 550 Ti, GeForce GTX 650, GeForce GTX 460, GeForce GT 610, GeForce GT 240, Radeon HD 6870, Radeon HD 7850, Radeon HD 7950, Radeon R9 270X]

Results Table

A "-" marks a test with no result for that card.

GeForce cards

Test                                  GTX 680       GTX 550 Ti    GTX 650       GTX 460       GT 610        GT 240
juliagpu: GPU                         36123507.37   17540483.97   10841150.33   21199859.50   4060660.07    13543191.90
mandelbulbgpu: GPU                    17083392.33   7729296.07    4992671.83    9382926.17    1776400.70    7095143.77
mandelgpu: GPU                        47966936.67   17362190.07   12405397.70   21375259.40   3715320.13    5291547.00
luxmark: GPU - Room                   296           150           82            -             27            -
luxmark: GPU - Sala                   651           319           177           382           42            -
luxmark: GPU - Luxball HDR            4216          2162          1183          2591          315           1051
opendwarfs: LU Decomposition          58.94         110.81        152.70        97.69         458.26        220.27
opendwarfs: Compressed Sparse Row     6.10          6.55          6.76          6.40          9.00          6.88
rodinia: OpenCL LavaMD                9.26          -             -             -             -             -
rodinia: OpenCL Myocyte               60.78         -             -             62.81         95.88         137.90
rodinia: OpenCL Heartwall             5.42          5.49          20.14         4.80          22.72         36.99
rodinia: OpenCL Leukocyte             14.02         27.70         30.39         24.66         100.31        40.49
rodinia: OpenCL Particle Filter       18.19         44.35         66.48         35.96         15.28         -

Radeon cards

Test                                  HD 6870       HD 7850       HD 7950       R9 270X
juliagpu: GPU                         -             -             -             -
mandelbulbgpu: GPU                    -             -             -             -
mandelgpu: GPU                        -             -             -             -
luxmark: GPU - Room                   107           624           1002          867
luxmark: GPU - Sala                   631           1134          1896          1588
luxmark: GPU - Luxball HDR            6149          8361          13087         11031
opendwarfs: LU Decomposition          161.42        135.79        123.96        126.60
opendwarfs: Compressed Sparse Row     24.50         18.09         17.83         16.25
rodinia: OpenCL LavaMD                -             7.66          5.79          6.17
rodinia: OpenCL Myocyte               493.51        508.49        481.62        485.32
rodinia: OpenCL Heartwall             19.67         6.79          5.88          5.40
rodinia: OpenCL Leukocyte             89.87         89.15         83.83         84.66
rodinia: OpenCL Particle Filter       -             18.11         10.78         13.01
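One way to read the three Samples/sec tests together is to normalize each card against a baseline and take the geometric mean of the ratios. The sketch below does this for the six GeForce cards using the juliagpu, mandelbulbgpu, and mandelgpu figures from the results above. The choice of the GeForce GTX 680 as baseline and the helper names are illustrative assumptions, not something this result file prescribes.

```python
import math

# Samples/sec (more is better) for juliagpu, mandelbulbgpu, mandelgpu,
# copied from the results in this file.
results = {
    "GeForce GTX 680":    (36123507.37, 17083392.33, 47966936.67),
    "GeForce GTX 550 Ti": (17540483.97, 7729296.07, 17362190.07),
    "GeForce GTX 650":    (10841150.33, 4992671.83, 12405397.70),
    "GeForce GTX 460":    (21199859.50, 9382926.17, 21375259.40),
    "GeForce GT 610":     (4060660.07, 1776400.70, 3715320.13),
    "GeForce GT 240":     (13543191.90, 7095143.77, 5291547.00),
}


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def relative_to(baseline, table):
    """Per-card geometric mean of score ratios against the baseline card."""
    base = table[baseline]
    return {
        gpu: geometric_mean([score / ref for score, ref in zip(scores, base)])
        for gpu, scores in table.items()
    }


# Hypothetical choice of baseline: the fastest card in these three tests.
relative = relative_to("GeForce GTX 680", results)
for gpu, value in sorted(relative.items(), key=lambda item: -item[1]):
    print(f"{gpu:20s} {value:.2f}")
```

The geometric mean is used here rather than the arithmetic mean so that no single test with large absolute numbers dominates the summary.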

JuliaGPU

JuliaGPU 1.2pts1, OpenCL Device: GPU (Samples/sec, More Is Better)

GPU                   Result         SE +/-      N
GeForce GT 240        13543191.90    17817.43    3
GeForce GT 610        4060660.07     2505.78     3
GeForce GTX 460       21199859.50    4955.24     3
GeForce GTX 650       10841150.33    2871.63     3
GeForce GTX 550 Ti    17540483.97    13067.53    3
GeForce GTX 680       36123507.37    72150.98    3

1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
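Each result in this file carries an "SE +/- x, N = y" annotation: a standard error over N runs. A minimal sketch of the textbook standard error of the mean follows, assuming the sample standard deviation divided by the square root of N. The per-run values are not part of this result file, so the `runs` list is hypothetical, and whether the Phoronix Test Suite computes its figure in exactly this way is not confirmed here.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run results; the individual runs are not published here.
runs = [100.0, 102.0, 104.0]
mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A small SE relative to the result, as in most rows here, suggests the runs were consistent with each other.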

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

MandelbulbGPU 1.0pts1, OpenCL Device: GPU (Samples/sec, More Is Better)

GPU                   Result         SE +/-      N
GeForce GT 240        7095143.77     3953.96     3
GeForce GT 610        1776400.70     614.61      3
GeForce GTX 460       9382926.17     10640.98    3
GeForce GTX 650       4992671.83     4281.68     3
GeForce GTX 550 Ti    7729296.07     4259.44     3
GeForce GTX 680       17083392.33    72849.89    3

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.
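For readers unfamiliar with what such a kernel computes, here is a plain-Python sketch of the standard Mandelbrot escape-time loop with the same 4096-iteration cap. It is only an illustration of the general technique: the function name is hypothetical, and it does not reproduce MandelGPU's OpenCL float4 kernel or its sampling scheme.

```python
MAX_ITERATIONS = 4096  # iteration cap stated in the test description


def escape_iterations(cx, cy, max_iterations=MAX_ITERATIONS):
    """Count iterations of z = z*z + c before |z| exceeds 2.

    Returns max_iterations when the point never escapes within the cap,
    which is the expensive case for this kind of benchmark.
    """
    x = y = 0.0
    for i in range(max_iterations):
        if x * x + y * y > 4.0:
            return i
        # Complex square plus c, written out on real and imaginary parts.
        x, y = x * x - y * y + cx, 2.0 * x * y + cy
    return max_iterations


print(escape_iterations(0.0, 0.0))  # inside the set: runs to the cap
print(escape_iterations(2.0, 2.0))  # far outside: escapes almost immediately
```

Points inside the set cost the full 4096 iterations each, which is why the iteration cap matters for throughput figures like the ones in this section.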

MandelGPU 1.3pts1, OpenCL Device: GPU (Samples/sec, More Is Better)

GPU                   Result         SE +/-     N
GeForce GT 240        5291547.00     217.94     3
GeForce GT 610        3715320.13     230.98     3
GeForce GTX 460       21375259.40    2448.11    3
GeForce GTX 650       12405397.70    857.65     3
GeForce GTX 550 Ti    17362190.07    6289.55    3
GeForce GTX 680       47966936.67    5781.86    3

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark 2.1beta1, OpenCL Device: GPU - Scene: Room (Score, More Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        867       0.67      3
Radeon HD 7950        1002      0.58      3
Radeon HD 7850        624       0.00      3
Radeon HD 6870        107       0.00      3
GeForce GT 610        27        0.33      3
GeForce GTX 650       82        0.00      3
GeForce GTX 550 Ti    150       0.33      3
GeForce GTX 680       296       0.33      3

LuxMark 2.1beta1, OpenCL Device: GPU - Scene: Sala (Score, More Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        1588      0.00      3
Radeon HD 7950        1896      0.67      3
Radeon HD 7850        1134      0.33      3
Radeon HD 6870        631       0.00      3
GeForce GT 610        42        0.00      3
GeForce GTX 460       382       0.00      3
GeForce GTX 650       177       0.00      3
GeForce GTX 550 Ti    319       0.00      3
GeForce GTX 680       651       0.00      3

LuxMark 2.1beta1, OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        11031     3.61      3
Radeon HD 7950        13087     4.84      3
Radeon HD 7850        8361      1.45      3
Radeon HD 6870        6149      0.88      3
GeForce GT 240        1051      0.00      3
GeForce GT 610        315       0.00      3
GeForce GTX 460       2591      0.58      3
GeForce GTX 650       1183      0.00      3
GeForce GTX 550 Ti    2162      0.00      3
GeForce GTX 680       4216      0.67      3

OpenDwarfs

OpenDwarfs is a non-commercial OpenCL compute benchmark suite developed at Virginia Tech in cooperation with various organizations. Learn more via the OpenBenchmarking.org test page.

OpenDwarfs 2013-11-06, Test: LU Decomposition (ms, Fewer Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        126.60    4.83      6
Radeon HD 7950        123.96    4.65      6
Radeon HD 7850        135.79    5.24      6
Radeon HD 6870        161.42    4.12      6
GeForce GT 240        220.27    1.06      3
GeForce GT 610        458.26    0.06      3
GeForce GTX 460       97.69     0.03      3
GeForce GTX 650       152.70    0.21      3
GeForce GTX 550 Ti    110.81    0.14      3
GeForce GTX 680       58.94     0.88      5

1. (CC) gcc options: -lm -lOpenCL

OpenDwarfs 2013-11-06, Test: Compressed Sparse Row (ms, Fewer Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        16.25     0.89      6
Radeon HD 7950        17.83     0.86      6
Radeon HD 7850        18.09     0.50      6
Radeon HD 6870        24.50     0.81      6
GeForce GT 240        6.88      0.11      3
GeForce GT 610        9.00      0.56      6
GeForce GTX 460       6.40      0.76      6
GeForce GTX 650       6.76      0.57      6
GeForce GTX 550 Ti    6.55      0.80      6
GeForce GTX 680       6.10      0.58      6

1. (CC) gcc options: -lm -lOpenCL

Rodinia

Rodinia 2.4, Test: OpenCL LavaMD (Seconds, Fewer Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        6.17      0.01      3
Radeon HD 7950        5.79      0.01      3
Radeon HD 7850        7.66      0.02      3
GeForce GTX 680       9.26      0.14      6

1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 2.4, Test: OpenCL Myocyte (Seconds, Fewer Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        485.32    0.67      3
Radeon HD 7950        481.62    0.84      3
Radeon HD 7850        508.49    0.96      3
Radeon HD 6870        493.51    1.88      3
GeForce GT 240        137.90    0.12      3
GeForce GT 610        95.88     0.08      3
GeForce GTX 460       62.81     0.07      3
GeForce GTX 680       60.78     1.14      3

1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 2.4, Test: OpenCL Heartwall (Seconds, Fewer Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        5.40      0.08      3
Radeon HD 7950        5.88      0.11      3
Radeon HD 7850        6.79      0.05      3
Radeon HD 6870        19.67     0.13      3
GeForce GT 240        36.99     0.28      3
GeForce GT 610        22.72     0.10      3
GeForce GTX 460       4.80      0.07      3
GeForce GTX 650       20.14     0.10      3
GeForce GTX 550 Ti    5.49      0.13      6
GeForce GTX 680       5.42      0.09      6

1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 2.4, Test: OpenCL Leukocyte (Seconds, Fewer Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        84.66     6.06      6
Radeon HD 7950        83.83     4.92      6
Radeon HD 7850        89.15     5.05      6
Radeon HD 6870        89.87     3.37      6
GeForce GT 240        40.49     0.75      3
GeForce GT 610        100.31    1.23      3
GeForce GTX 460       24.66     0.30      3
GeForce GTX 650       30.39     0.26      3
GeForce GTX 550 Ti    27.70     0.48      4
GeForce GTX 680       14.02     0.24      4

1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 2.4, Test: OpenCL Particle Filter (Seconds, Fewer Is Better)

GPU                   Result    SE +/-    N
Radeon R9 270X        13.01     0.02      3
Radeon HD 7950        10.78     0.00      3
Radeon HD 7850        18.11     0.02      3
GeForce GT 610        15.28     0.57      6
GeForce GTX 460       35.96     0.01      3
GeForce GTX 650       66.48     0.07      3
GeForce GTX 550 Ti    44.35     0.24      3
GeForce GTX 680       18.19     0.17      3

1. (CXX) g++ options: -O2 -lOpenCL