OpenCL High-End GPU Comparison On Linux

High-end graphics card tests of OpenCL workloads under Ubuntu 14.04 LTS Linux by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1405316-SO-1405219KH66&rdt&grr.

OpenCL High-End GPU Comparison On LinuxProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290Radeon HD 6870Intel Core i7-4770K @ 3.50GHz (8 Cores)ECS Z87H3-A2X EXTREME v1.0Intel 4th Gen Core DRAM16384MB120GB Samsung SSD 840NVIDIA GeForce GTX 760 2048MB (540/3004MHz)Realtek ALC1150Samsung SyncMasterRealtek RTL8111/8168/8411Ubuntu 14.043.13.0-24-generic (x86_64)Unity 7.2.0X Server 1.15.1NVIDIA 337.194.3.0GCC 4.8.2ext42560x1600NVIDIA GeForce GTX 680 2048MB (1006/3004MHz)NVIDIA GeForce GTX 770 2048MB (1045/3505MHz)NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)NVIDIA GeForce GTX TITAN 6144MB (836/3004MHz)NVIDIA GeForce GTX 750 Ti 2048MB (1019/2700MHz)XFX AMD Radeon HD 7900 3072MB (900/1375MHz)SyncMasterfglrx 14.10.24.3.12874Supported device 6810 2048MB (1100/1400MHz)Supported device 67B1 4096MB (947/1250MHz)AMD FX-6300 Six-Core @ 3.50GHz (6 Cores)ASUS M5A97 R2.032768MB250GB MAXTOR STM325031 + 160GB SAMSUNG SP1614C + 400GB SAMSUNG HD400LJ + 3001GB Seagate ST3000DM001-1CH1AMD Radeon HD 6800 1024MB (900/1050MHz)E2250 + M227WD + E2250 + M227WDGentoo 2.23.14.4-gentoo-noradeon (x86_64)Xfce 4.10X Server 1.15.0Clang 3.4.1 + LLVM 3.4.11920x1080OpenBenchmarking.orgCompiler Details- GeForce GTX 760: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 680: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 770: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 780 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX TITAN: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 750 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- Radeon HD 7950: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- Radeon R9 270X: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- Radeon R9 290: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- Radeon HD 6870: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/4.7.3 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.7.3 --disable-altivec --disable-fixed-point --disable-libgcj --disable-libssp --disable-ppl-version-check --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-languages=c,c++,fortran --enable-libgomp --enable-libmudflap --enable-libstdcxx-time --enable-lto --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/4.7.3/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.7.3/man --with-cloog --with-cloog-include=/usr/include/cloog-ppl --with-multilib-list=m32,m64 --with-ppl --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/4.7.3/pythonProcessor Details- Scaling Governor: acpi-cpufreq ondemandOpenCL Details- GeForce GTX 760: GPU Compute Cores: 1152- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 770: GPU Compute Cores: 1536- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX TITAN: GPU Compute Cores: 2688- GeForce GTX 750 Ti: GPU Compute Cores: 640System Details- GeForce GTX 760: GPU Compute Cores: 1152.- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 770: GPU Compute Cores: 1536.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX TITAN: GPU Compute Cores: 2688.- GeForce GTX 750 Ti: GPU Compute Cores: 640.Environment Details- Radeon HD 7950: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri- Radeon R9 270X: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri- Radeon R9 290: LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri- Radeon HD 6870: LIBGL_DRIVERS_PATH=/usr/lib64/dri:/usr/lib32/dri

OpenCL High-End GPU Comparison On Linuxopendwarfs: Cyclic Redundancy Checkopendwarfs: Compressed Sparse Rowopendwarfs: LU Decompositionluxmark: GPU - Luxball HDRluxmark: GPU - Salaluxmark: GPU - Roommandelgpu: GPUmandelbulbgpu: GPUjuliagpu: GPUrodinia: OpenCL Particle Filterrodinia: OpenCL Heartwallrodinia: OpenCL MyocyteGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290Radeon HD 68700.075.7769.777667109554733501696.4314878137.4031803078.2322.307.9450.050.065.8256.808931131465442837373.2318694237.5339685064.1017.935.1650.450.055.7254.639343137268545359706.6719542653.2041256658.5717.064.9448.740.045.4344.93180852406121464490219.1331666087.6766967826.7712.953.0953.860.055.7051.59164062211109061797466.5728313279.8357954293.9714.1654.900.145.5464.678105102058827866753.4714849487.7737946110.4337.534.6343.390.0321.7283.9512550181098310.795.59364.230.0416.8868.8410671147783613.015.18337.970.0313.6969.1116762234313128.275.6379.56610962228719.34315.89OpenBenchmarking.org

OpenDwarfs

Test: Cyclic Redundancy Check

OpenBenchmarking.orgms, Fewer Is BetterOpenDwarfs 2013-11-06Test: Cyclic Redundancy CheckGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 2900.03150.0630.09450.1260.1575SE +/- 0.00, N = 6SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 6SE +/- 0.00, N = 6SE +/- 0.00, N = 6SE +/- 0.00, N = 60.070.060.050.040.050.140.030.040.031. (CC) gcc options: -lm -lOpenCL

OpenDwarfs

Test: Compressed Sparse Row

OpenBenchmarking.orgms, Fewer Is BetterOpenDwarfs 2013-11-06Test: Compressed Sparse RowGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290510152025SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 1.21, N = 6SE +/- 0.76, N = 6SE +/- 0.49, N = 65.775.825.725.435.705.5421.7216.8813.691. (CC) gcc options: -lm -lOpenCL

OpenDwarfs

Test: LU Decomposition

OpenBenchmarking.orgms, Fewer Is BetterOpenDwarfs 2013-11-06Test: LU DecompositionGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 29020406080100SE +/- 0.06, N = 3SE +/- 0.23, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 3SE +/- 6.80, N = 6SE +/- 4.20, N = 6SE +/- 0.62, N = 369.7756.8054.6344.9351.5964.6783.9568.8469.111. (CC) gcc options: -lm -lOpenCL

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290Radeon HD 68704K8K12K16K20KSE +/- 1.20, N = 3SE +/- 1.53, N = 3SE +/- 1.86, N = 3SE +/- 9.85, N = 3SE +/- 113.19, N = 3SE +/- 2.52, N = 3SE +/- 20.07, N = 3SE +/- 19.73, N = 3SE +/- 59.37, N = 3SE +/- 25.78, N = 3766789319343180851640681051255010671167626109

LuxMark

OpenCL Device: GPU - Scene: Sala

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: SalaGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290Radeon HD 68705001000150020002500SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 2.60, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 8.54, N = 3SE +/- 0.67, N = 3SE +/- 1.86, N = 3SE +/- 4.36, N = 3109513141372240622111020181014772343622

LuxMark

OpenCL Device: GPU - Scene: Room

OpenBenchmarking.orgScore, More Is BetterLuxMark 2.1beta1OpenCL Device: GPU - Scene: RoomGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290Radeon HD 687030060090012001500SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.00, N = 3SE +/- 0.88, N = 3SE +/- 0.88, N = 3SE +/- 3.21, N = 3SE +/- 1.00, N = 3547654685121410905889838361312287

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 Ti14M28M42M56M70MSE +/- 15675.12, N = 3SE +/- 6352.22, N = 3SE +/- 6781.46, N = 3SE +/- 274468.92, N = 3SE +/- 24850.67, N = 3SE +/- 5193.56, N = 333501696.4342837373.2345359706.6764490219.1361797466.5727866753.471. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelbulbGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 Ti7M14M21M28M35MSE +/- 6057.51, N = 3SE +/- 62529.27, N = 3SE +/- 21521.06, N = 3SE +/- 64326.57, N = 3SE +/- 33588.22, N = 3SE +/- 5126.17, N = 314878137.4018694237.5319542653.2031666087.6728313279.8314849487.771. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 Ti14M28M42M56M70MSE +/- 22044.37, N = 3SE +/- 24001.93, N = 3SE +/- 23516.24, N = 3SE +/- 22581.94, N = 3SE +/- 178115.56, N = 3SE +/- 37139.42, N = 331803078.2339685064.1041256658.5766967826.7757954293.9737946110.431. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290918273645SE +/- 0.34, N = 6SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.22, N = 6SE +/- 0.01, N = 3SE +/- 0.66, N = 6SE +/- 0.01, N = 3SE +/- 0.24, N = 6SE +/- 0.13, N = 622.3017.9317.0612.9514.1637.5310.7913.018.271. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Heartwall

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL HeartwallGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290Radeon HD 6870510152025SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 6SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 37.945.164.943.094.635.595.185.6319.341. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteGeForce GTX 760GeForce GTX 680GeForce GTX 770GeForce GTX 780 TiGeForce GTX TITANGeForce GTX 750 TiRadeon HD 7950Radeon R9 270XRadeon R9 290Radeon HD 687080160240320400SE +/- 0.72, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.91, N = 3SE +/- 0.09, N = 3SE +/- 0.84, N = 3SE +/- 5.51, N = 3SE +/- 2.16, N = 3SE +/- 0.21, N = 3SE +/- 4.34, N = 650.0550.4548.7453.8654.9043.39364.23337.9779.56315.891. (CXX) g++ options: -O2 -lOpenCL


Phoronix Test Suite v10.8.4